BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019063
(346 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 348 bits (893), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 168/312 (53%), Positives = 220/312 (70%), Gaps = 11/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++H + YK EK R +F++NL +I++ N E N +Y LG NEF+DLT+E
Sbjct: 47 LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLTHE 105
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G +P S RQ S + F+Y+++TD+P S+DWR+KGAV +KDQGQCGSCWA
Sbjct: 106 EFKGRYLGLAKPQFSRKRQPS--ANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWA 163
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QIT G L LSEQ+L+DC T N GC+GGLMD AF+YII GL E D
Sbjct: 164 FSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDD 223
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEG C QKE TIS YED+P+ D+++L++A+++QPVSV ++ASGR F FYK
Sbjct: 224 YPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK 283
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV N CG + DHGVA VG+G+++ G+ Y ++KNSWG WGE G+IR+ R+ G
Sbjct: 284 GGVFNGKCGTDLDHGVAAVGYGSSK---GSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEG 340
Query: 333 LCGIATAASYPV 344
LCGI ASYP
Sbjct: 341 LCGINKMASYPT 352
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 331 bits (848), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 168/346 (48%), Positives = 221/346 (63%), Gaps = 16/346 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
+ F+++ L + + G H E S+ E +E+W + H E EKA R N
Sbjct: 1 MKRFIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TF 123
+FK N+++I + NK+ +++YKL N+F D+T+EEFR Y G N + + + + +F
Sbjct: 60 VFKHNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSF 118
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
Y NV +PTS+DWR+ GAVT +K+QGQCGSCWAFS V AVEGI QI KL LSEQ+L
Sbjct: 119 MYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQEL 178
Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
VDC T+ N GC+GGLMD AFE+I E GL +E YPY+ + TCD KE A +I +E
Sbjct: 179 VDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHE 238
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
D+PK E L++AV+NQPVSV +DA G F FY GV CG +HGVAVVG+GT +
Sbjct: 239 DVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTID 298
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
G KYW++KNSWGE WGE GYIR+ R GLCGIA ASYP+
Sbjct: 299 --GTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 329 bits (843), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 164/309 (53%), Positives = 206/309 (66%), Gaps = 10/309 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E+W + H + EK R N+FK N ++ ANK ++ YKL N+F+D+TN EFR
Sbjct: 38 YERWRSHH-TVSRSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFRN 95
Query: 102 LYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
Y+G + R R + TF Y+ V VP S+DWR+KGAVT +KDQGQCGSCWAFS
Sbjct: 96 TYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFST 155
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
+ AVEGI QI KL+ LSEQ+LVDC TD N GC+GGLMD AFE+I + G+ TEA+YPY
Sbjct: 156 IVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPY 215
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
+GTCD KE A A +I +E++P+ DE ALL+AV+NQPVSV +DA G F FY GV
Sbjct: 216 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 275
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCG 335
CG DHGVA+VG+GT + G KYW +KNSWG WGE GYIR+ R GLCG
Sbjct: 276 FTGSCGTELDHGVAIVGYGTTID--GTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCG 333
Query: 336 IATAASYPV 344
IA ASYP+
Sbjct: 334 IAMEASYPI 342
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 327 bits (838), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 157/313 (50%), Positives = 220/313 (70%), Gaps = 12/313 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E W++ + Y+ EK +R +FK NL++I++ NK+G ++Y LG NEF+DL++E
Sbjct: 47 LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFADLSHE 105
Query: 98 EFRALYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
EF+ +Y G + V R R + F Y++V VP S+DWR+KGAV +K+QG CGSCW
Sbjct: 106 EFKKMYLGLKTDI--VRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
AFS VAAVEGI +I G L LSEQ+L+DC T N+GC+GGLMD AFEYI++N GL E
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEE 223
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
DYPY EEGTC+ QK+++ TI+ ++D+P DE++LL+A+++QP+SV +DASGR F FY
Sbjct: 224 DYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFY 283
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---- 331
GV + CG + DHGVA VG+G+++ G+ Y ++KNSWG WGE GYIR+ R+
Sbjct: 284 SGGVFDGRCGVDLDHGVAAVGYGSSK---GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPE 340
Query: 332 GLCGIATAASYPV 344
GLCGI AS+P
Sbjct: 341 GLCGINKMASFPT 353
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 324 bits (830), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 159/317 (50%), Positives = 211/317 (66%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ + +E+W + H T L EK R N+FK NL ++ NK ++ YKL N+F+D
Sbjct: 33 EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANLMHVHNTNKM-DKPYKLKLNKFAD 89
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
+TN EFR+ Y G P + R + + F Y+ V VP S+DWR+KGAVT +KDQGQC
Sbjct: 90 MTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQC 149
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS V AVEGI QI KL+ LSEQ+LVDC + N GC+GGLM+ AFE+I + G+
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TE++YPY+ +EGTCD K +A +I +E++P DE ALL+AV+NQPVSV +DA G
Sbjct: 210 TTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSD 269
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
F FY GV DC + +HGVA+VG+GT + G YW+++NSWG WGE GYIR+ R+
Sbjct: 270 FQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD--GTNYWIVRNSWGPEWGEHGYIRMQRNI 327
Query: 331 ---AGLCGIATAASYPV 344
GLCGIA SYP+
Sbjct: 328 SKKEGLCGIAMLPSYPI 344
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 323 bits (827), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 160/317 (50%), Positives = 211/317 (66%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ + +E+W + H T L EK R N+FK N+ ++ NK ++ YKL N+F+D
Sbjct: 33 EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFAD 89
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
+TN EFR+ Y G + R S S TF Y+ V VP S+DWR+KGAVT +KDQGQC
Sbjct: 90 MTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQC 149
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS + AVEGI QI KL+ LSEQ+LVDC + N GC+GGLM+ AFE+I + G+
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TE++YPY +EGTCD K +A +I +E++P DE ALL+AV+NQPVSV +DA G
Sbjct: 210 TTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSD 269
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
F FY GV DC + +HGVA+VG+GT + G YW+++NSWG WGE GYIR+ R+
Sbjct: 270 FQFYSEGVFTGDCNTDLNHGVAIVGYGTTVD--GTNYWIVRNSWGPEWGEQGYIRMQRNI 327
Query: 331 ---AGLCGIATAASYPV 344
GLCGIA ASYP+
Sbjct: 328 SKKEGLCGIAMMASYPI 344
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 322 bits (826), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 163/361 (45%), Positives = 227/361 (62%), Gaps = 36/361 (9%)
Query: 9 FIIPMFVIIILVITCASQVVS----------------GRSMHEPSIVEKHEQWMAQHGRT 52
F+ P I+ L + S V GRS E ++ +E W+ +HG+
Sbjct: 3 FLKPTMAILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRS--EAEVMSIYEAWLVKHGKA 60
Query: 53 YKDE--LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPV 110
+EK R IFK NL ++++ N E N +Y+LG F+DLTN+E+R+ Y G
Sbjct: 61 QSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLG----- 114
Query: 111 PSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGIT 168
+ ++ R ++ +Y+ V D +P SIDWR+KGAV +KDQG CGSCWAFS + AVEGI
Sbjct: 115 AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGIN 174
Query: 169 QITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD 227
QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II+N G+ T+ DYPY+ +GTCD
Sbjct: 175 QIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCD 234
Query: 228 NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNN 287
++ A TI YED+P E++L +AV++QP+S+ ++A GRAF Y SG+ + CG
Sbjct: 235 QIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQ 294
Query: 288 CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
DHGV VG+GT ENG YW+++NSWG++WGESGY+R+ R+ +G CGIA SYP
Sbjct: 295 LDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYP 351
Query: 344 V 344
+
Sbjct: 352 I 352
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 320 bits (819), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 158/350 (45%), Positives = 221/350 (63%), Gaps = 13/350 (3%)
Query: 5 FEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
+K +I +F ++IL C E + +++W + H + E+ R N
Sbjct: 1 MKKLLLIFLFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHS-VPRSLNEREKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN---RPVPSVSRQSSRPS 121
+F+ N+ ++ NK+ NR+YKL N+F+DLT EF+ YTG N + ++ S+
Sbjct: 60 VFRHNVMHVHNTNKK-NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQF 118
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
+ ++N++ +P+S+DWR+KGAVT IK+QG+CGSCWAFS VAAVEGI +I KL+ LSEQ
Sbjct: 119 MYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQ 178
Query: 182 QLVDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
+LVDC T N GC+GGLM+ AFE+I +N G+ TE YPY +G CD K+ V TI
Sbjct: 179 ELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDG 238
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
+ED+P+ DE ALL+AV+NQPVSV +DA F FY GV CG +HGVA VG+G+
Sbjct: 239 HEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGS- 297
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVAI 346
E G KYW+++NSWG WGE GYI+I R+ G CGIA ASYP+ +
Sbjct: 298 --ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPIKL 345
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 318 bits (816), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 164/321 (51%), Positives = 205/321 (63%), Gaps = 17/321 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ + +E+W + H R + EK R FK N +I NK G+ Y+L N F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 95 TNEEFRALYTG-YNRPVPSVSRQSSRPSTFKYQ--NVTDVPTSIDWREKGAVTHIKDQGQ 151
EFRA + G R P+ + S P F Y NV+D+P S+DWR+KGAVT +KDQG+
Sbjct: 98 DQAEFRATFVGDLRRDTPA--KPPSVPG-FMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKG 210
CGSCWAFS V +VEGI I G L+ LSEQ+L+DC T DN GC GGLMD AFEYI N G
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 211 LATEADYPYRHEEGTCD---NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
L TEA YPYR GTC+ + V I ++D+P E+ L +AV+NQPVSV V+A
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
SG+AF FY GV DCG DHGVAVVG+G AE+ G YW +KNSWG +WGE GYIR+
Sbjct: 275 SGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAED--GKAYWTVKNSWGPSWGEQGYIRV 332
Query: 328 LRDA----GLCGIATAASYPV 344
+D+ GLCGIA ASYPV
Sbjct: 333 EKDSGASGGLCGIAMEASYPV 353
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 318 bits (814), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 164/321 (51%), Positives = 205/321 (63%), Gaps = 17/321 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ + +E+W + H R + EK R FK N +I NK G+ Y+L N F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 95 TNEEFRALYTG-YNRPVPSVSRQSSRPSTFKYQ--NVTDVPTSIDWREKGAVTHIKDQGQ 151
EFRA + G R PS + S P F Y NV+D+P S+DWR+KGAVT +KDQG+
Sbjct: 98 DQAEFRATFVGDLRRDTPS--KPPSVPG-FMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKG 210
CGSCWAFS V +VEGI I G L+ LSEQ+L+DC T DN GC GGLMD AFEYI N G
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 211 LATEADYPYRHEEGTCD---NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
L TEA YPYR GTC+ + V I ++D+P E+ L +AV+NQPVSV V+A
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
SG+AF FY GV +CG DHGVAVVG+G AE+ G YW +KNSWG +WGE GYIR+
Sbjct: 275 SGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAED--GKAYWTVKNSWGPSWGEQGYIRV 332
Query: 328 LRDA----GLCGIATAASYPV 344
+D+ GLCGIA ASYPV
Sbjct: 333 EKDSGASGGLCGIAMEASYPV 353
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 317 bits (812), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 160/349 (45%), Positives = 220/349 (63%), Gaps = 23/349 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEK------HEQWMAQHGRTYKDELEKAMRLNI 65
P F+ + LV + E + + +E+W H +D EK R N+
Sbjct: 4 PKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHH-TVARDLDEKNRRFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRPS 121
FK+N+++I + N++ + YKL N+F D+TN+EFR+ Y G ++R + + +
Sbjct: 63 FKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTG--- 119
Query: 122 TFKYQNVTDVPT-SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+F Y+NV +P SIDWR KGAVT +KDQGQCGSCWAFS +A+VEGI QI G+L+ LSE
Sbjct: 120 SFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSE 179
Query: 181 QQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
Q+LVDC T N GC+GGLMD AFE+I +N G+ TE YPY ++GTC + + +I
Sbjct: 180 QELVDCDTSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDGTCASNLLNSPVVSID 238
Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
++D+P +E AL+QAV+NQP+SV ++ASG F FY GV CG DHGVA+VG+G
Sbjct: 239 GHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGA 298
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
+ G KYW++KNSWGE WGESGYIR+ R G CGIA ASYP+
Sbjct: 299 TRD--GTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI 345
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 311 bits (796), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 157/315 (49%), Positives = 207/315 (65%), Gaps = 19/315 (6%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
+W +HG++ + ++ R NIFK NL +I+ N+ N TYKLG F++LTN+E
Sbjct: 6 RWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDE 65
Query: 99 FRALYTG-YNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGS 154
+R+LY G PV +++ ++ KY NV +VP ++DWR+KGAV IKDQG CGS
Sbjct: 66 YRSLYLGARTEPVRRITK--AKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGS 123
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLAT 213
CWAFS AAVEGI +I G+L+ LSEQ+LVDC N GC+GGLMD AF++I++N GL T
Sbjct: 124 CWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNT 183
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
E DYPY G C++ + + TI YED+P DE AL +AVS QPVSV +DA GRAF
Sbjct: 184 EKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQ 243
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
Y+SG+ CG N DH V VG+G+ ENG YW+++NSWG WGE GYIR+ R+
Sbjct: 244 HYQSGIFTGKCGTNMDHAVVAVGYGS---ENGVDYWIVRNSWGTRWGEDGYIRMERNVAS 300
Query: 331 -AGLCGIATAASYPV 344
+G CGIA ASYPV
Sbjct: 301 KSGKCGIAIEASYPV 315
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 310 bits (795), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 158/327 (48%), Positives = 208/327 (63%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
+VS E + +W A+HG++Y E+ R F+ NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 84 YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R Y G N+P R+ + + +P S+DWR KGA
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
V IKDQG CGSCWAFSA+AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 141 VAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
F++II N G+ TE DYPY+ ++ CD ++ A TI YED+ E +L +AV+NQPV
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPV 260
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV ++A GRAF Y SG+ CG DHGVA VG+GT ENG YW+++NSWG++WGE
Sbjct: 261 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 317
Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
SGY+R+ R+ +G CGIA SYP+
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPL 344
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 309 bits (792), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 158/315 (50%), Positives = 206/315 (65%), Gaps = 18/315 (5%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
QW A+HG+T + ++ R NIFK NL +I+ N++ N TYKLG +F+DLTN+E
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDE 110
Query: 99 FRALYTGYNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
+R LY G R P+ ++ KY N +VP ++DWR+KGAV IKDQG CGSC
Sbjct: 111 YRKLYLGA-RTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSC 169
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
WAFS AAVEGI +I G+LI LSEQ+LVDC N GC+GGLMD AF++I++N GL TE
Sbjct: 170 WAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTE 229
Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
DYPYR G C++ + + +I YED+P DE AL +A+S QPVSV ++A GR F
Sbjct: 230 KDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQH 289
Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---- 330
Y+SG+ CG N DH V VG+G+ ENG YW+++NSWG WGE GYIR+ R+
Sbjct: 290 YQSGIFTGSCGTNLDHAVVAVGYGS---ENGVDYWIVRNSWGPRWGEEGYIRMERNLAAS 346
Query: 331 -AGLCGIATAASYPV 344
+G CGIA ASYPV
Sbjct: 347 KSGKCGIAVEASYPV 361
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 306 bits (783), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 150/339 (44%), Positives = 224/339 (66%), Gaps = 20/339 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+F+ + L + AS S S EPS ++++ E+WMA++GR YKD EK +R IFK N+
Sbjct: 8 VFLFLFLCVMWASP--SAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNV 65
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+IE N +Y LG N+F+D+TN EF A YTG + P+ ++ R+ +F +++
Sbjct: 66 NHIETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPL-NIKREPV--VSFDDVDISS 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
VP SIDWR+ GAVT +K+QG+CGSCWAF+++A VE I +I RG L+ LSEQQ++DC+ +
Sbjct: 123 VPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAV-S 181
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV--AATISKYEDLPKGD 248
+GC GG ++KA+ +II NKG+A+ A YPY+ +GTC K V +A I++Y + + +
Sbjct: 182 YGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTC---KTNGVPNSAYITRYTYVQRNN 238
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E+ ++ AVSNQP++ +DASG F YK GV CG +H + ++G+G ++ +G K+
Sbjct: 239 ERNMMYAVSNQPIAAALDASGN-FQHYKRGVFTGPCGTRLNHAIVIIGYG--QDSSGKKF 295
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
W+++NSWG WGE GYIR+ RD GLCGIA YP
Sbjct: 296 WIVRNSWGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 303 bits (775), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 208/313 (66%), Gaps = 17/313 (5%)
Query: 42 HEQWMAQHGRTYKDEL--EKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNE 97
++ W+A++G + L E R +F NL++++ N + ++LG N F+DLTNE
Sbjct: 52 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EFRA + G R + +++ V ++P S+DWREKGAV +K+QGQCGSCWA
Sbjct: 112 EFRATFLG----AKVAERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEA 215
FSAV+ VE I Q+ G++I LSEQ+LV+CST+ N GC+GGLMD AF++II+N G+ TE
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
DYPY+ +G CD +E A +I +ED+P+ DE++L +AV++QPVSV ++A GR F Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----A 331
SGV + CG + DHGV VG+GT +NG YW+++NSWG WGESGY+R+ R+
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINVTT 344
Query: 332 GLCGIATAASYPV 344
G CGIA ASYP
Sbjct: 345 GKCGIAMMASYPT 357
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 301 bits (771), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 148/318 (46%), Positives = 208/318 (65%), Gaps = 14/318 (4%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
+E + +EQW+ ++ + Y EK R IFK NL+++++ N +RT+++G F+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
LTNEEFRA+Y R ++ S + + Y+ +P +DWR GAV +KDQG CG
Sbjct: 96 LTNEEFRAIYL---RKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCG 152
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGL 211
SCWAFSAV AVEGI QIT G+LI LSEQ+LVDC N GC GG+M+ AFE+I++N G+
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Query: 212 ATEADYPYR-HEEGTCDNQKEKAV-AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
T+ DYPY ++ G C+ K TI YED+P+ DE++L +AV++QPVSV ++AS
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272
Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
+AF YKSGV+ CG + DHGV VVG+G+ +G YW+I+NSWG WG+SGY+++ R
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGST---SGEDYWIIRNSWGLNWGDSGYVKLQR 329
Query: 330 DA----GLCGIATAASYP 343
+ G CGIA SYP
Sbjct: 330 NIDDPFGKCGIAMMPSYP 347
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 299 bits (766), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 152/364 (41%), Positives = 222/364 (60%), Gaps = 27/364 (7%)
Query: 3 LKFEKSFIIPMFVIIILVITCAS----QVVSGRSMH-------------EPSIVEKHEQW 45
+ + KS ++ +F++ +++ +CA+ VVS H + E W
Sbjct: 1 MGYAKSAML-IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESW 59
Query: 46 MAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG 105
M +HG+ Y EK RL IF+ NL +I N E N +Y+LG N F+DL+ E+ + G
Sbjct: 60 MVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHG 118
Query: 106 YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVE 165
+ P + + +K + +P S+DWR +GAVT +KDQG C SCWAFS V AVE
Sbjct: 119 ADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVE 178
Query: 166 GITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGT 225
G+ +I G+L+ LSEQ L++C+ +N+GC GG ++ A+E+I+ N GL T+ DYPY+ G
Sbjct: 179 GLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGV 238
Query: 226 CDNQ-KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADC 284
C+ + KE I YE+LP DE AL++AV++QPV+ VD+S R F Y+SGV + C
Sbjct: 239 CEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTC 298
Query: 285 GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAA 340
G N +HGV VVG+GT ENG YW++KNS G+TWGE+GY+++ R+ GLCGIA A
Sbjct: 299 GTNLNHGVVVVGYGT---ENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRA 355
Query: 341 SYPV 344
SYP+
Sbjct: 356 SYPL 359
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 298 bits (763), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 153/351 (43%), Positives = 217/351 (61%), Gaps = 18/351 (5%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y G+ S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+ ++ + +P+ +DWR GAV IK QG+CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAA 236
Q+L+DC + + GC+GG + F++II N G+ TE +YPY ++G C D Q EK V
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYV-- 234
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
TI YE++P +E AL AV+ QPVSV +DA+G AF Y SG+ CG DH V +VG
Sbjct: 235 TIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVG 294
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
+GT E G YW++KNSW TWGE GY+RILR+ AG CGIAT SYPV
Sbjct: 295 YGT---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 298 bits (762), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 153/351 (43%), Positives = 220/351 (62%), Gaps = 27/351 (7%)
Query: 13 MFVIIILVITCAS----QVVS---GRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKA 60
+ ++ +++ +CA+ VVS +H E S++ E WM +HG+ Y EK
Sbjct: 10 ILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLI--FESWMVKHGKVYGSVAEKE 67
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
RL IF+ NL +I N E N +Y+LG F+DL+ E++ + G + P
Sbjct: 68 RRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPR--NHVFMT 124
Query: 121 STFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
S+ +Y+ D +P S+DWR +GAVT +KDQG C SCWAFS V AVEG+ +I G+L+ L
Sbjct: 125 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 184
Query: 179 SEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ-KEKAVAAT 237
SEQ L++C+ +N+GC GG ++ A+E+I++N GL T+ DYPY+ G CD + KE
Sbjct: 185 SEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVM 244
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I YE+LP DE AL++AV++QPV+ +D+S R F Y+SGV + CG N +HGV VVG+
Sbjct: 245 IDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGY 304
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
GT ENG YWL+KNS G TWGE+GY+++ R+ GLCGIA ASYP+
Sbjct: 305 GT---ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 352
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 296 bits (757), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 138/335 (41%), Positives = 218/335 (65%), Gaps = 12/335 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F+ + L AS + R ++++ E+WMA++GR YKD+ EK R IFK N+++
Sbjct: 8 VFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N +Y LG N+F+D+T EF A YTG + P+ ++ R+ +F N++ VP
Sbjct: 68 IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPL-NIEREPV--VSFDDVNISAVP 124
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG 192
SIDWR+ GAV +K+Q CGSCW+F+A+A VEGI +I G L+ LSEQ+++DC+ ++G
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV-SYG 183
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C GG ++KA+++II N G+ TE +YPY +GTC N +A I+ Y + + DE+++
Sbjct: 184 CKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTC-NANSFPNSAYITGYSYVRRNDERSM 242
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+ AVSNQP++ +DAS F +Y GV + CG + +H + ++G+G ++ +G KYW+++
Sbjct: 243 MYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTKYWIVR 299
Query: 313 NSWGETWGESGYIRILR----DAGLCGIATAASYP 343
NSWG +WGE GY+R+ R +G+CGIA A +P
Sbjct: 300 NSWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 295 bits (756), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 211/349 (60%), Gaps = 17/349 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEP------SIVEKHEQWMAQHGRTYKDELEKAMRLN 64
+ +F I+++ Q G E ++ + +E+W H + + E R N
Sbjct: 1 MKLFFIVLISFLSLLQASKGFDFDEKELETEENVWKLYERWRGHHSVS-RASHEAIKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-F 123
+F+ N+ ++ + NK+ N+ YKL N F+D+T+ EFR+ Y G N + R R S F
Sbjct: 60 VFRHNVLHVHRTNKK-NKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGF 118
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
Y+NVT VP+S+DWREKGAVT +K+Q CGSCWAFS VAAVEGI +I KL+ LSEQ+L
Sbjct: 119 MYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQEL 178
Query: 184 VDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEE-GTCDNQKEKAVAATISKY 241
VDC T +N GC+GGLM+ AFE+I N G+ TE YPY + C TI +
Sbjct: 179 VDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGH 238
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
E +P+ DE+ LL+AV++QPVSV +DA F Y GV +CG +HGV +VG+G E
Sbjct: 239 EHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYG--E 296
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVAI 346
+NG KYW+++NSWG WGE GY+RI R + G CGIA ASYP +
Sbjct: 297 TKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTKL 345
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 294 bits (753), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 152/351 (43%), Positives = 216/351 (61%), Gaps = 18/351 (5%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y + S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+ ++ + +P+ +DWR GAV IK QG+CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAA 236
Q+L+DC + + GC+GG + F++II N G+ TE +YPY ++G C D Q EK V
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYV-- 234
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
TI YE++P +E AL AV+ QPVSV +DA+G AF Y SG+ CG DH V +VG
Sbjct: 235 TIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVG 294
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
+GT E G YW++KNSW TWGE GY+RILR+ AG CGIAT SYPV
Sbjct: 295 YGT---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 284 bits (727), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 142/296 (47%), Positives = 188/296 (63%), Gaps = 14/296 (4%)
Query: 58 EKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR 115
E R +F NL++++ N + ++LG N F+DLTN EFRA Y G R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG----TTPAGR 139
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTH-IKDQGQCGSCWAFSAVAAVEGITQITRGK 174
+++ V +P S+DWR+KGAV +K+QGQCGSCWAFSAVAAVEGI +I G+
Sbjct: 140 GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 175 LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
L+ LSEQ+LV+C+ + N GC+GG+MD AF +I N GL TE DYPY +G C+ K
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 233 AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGV 292
+I +ED+P+ DE +L +AV++QPVSV +DA GR F Y SGV CG N DHGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 293 AVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
VG+GT + GA YW ++NSWG WGE+GYIR+ R+ G CGIA ASYP+
Sbjct: 320 VAVGYGT-DAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 276 bits (706), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 149/319 (46%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+DE E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114
Query: 95 TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
+ EFR L G+N + R +S + TF +P S+DWR KGAVT +KDQG
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST N+GC+GGLMD AF YI +N
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
G+ TE YPY + +C K V AT + D+P+GDE+ + +AV+ PVSV +DAS
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNK-GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 293
Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F FY GV N C N DHGV VVGFGT +E+G YWL+KNSWG TWG+ G+I+
Sbjct: 294 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 351
Query: 327 ILRDA-GLCGIATAASYPV 344
+LR+ CGIA+A+SYP+
Sbjct: 352 MLRNKENQCGIASASSYPL 370
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 271 bits (693), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 146/318 (45%), Positives = 200/318 (62%), Gaps = 14/318 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
I E+ + QH + Y +E+E+ R+ IF +N I K N+ +G +YKLG N+++D+
Sbjct: 24 IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSR--PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
+ EF+ GYN + + R+ + +T+ VP S+DWRE GAVT +KDQG C
Sbjct: 84 LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKG 210
GSCWAFS+ A+EG G L+ LSEQ LVDCST N+GC+GGLMD AF YI +N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASG 269
+ TE YPY + +C K + AT + + D+P+GDE+ + +AV+ PVSV +DAS
Sbjct: 204 IDTEKSYPYEGIDDSCHFNK-ATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASH 262
Query: 270 RAFHFYKSGVLN-ADCG-NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
+F Y GV N +C N DHGV VVG+GT +E+G YWL+KNSWG TWGE GYI++
Sbjct: 263 ESFQLYSEGVYNEPECDEQNLDHGVLVVGYGT--DESGMDYWLVKNSWGTTWGEQGYIKM 320
Query: 328 LRDA-GLCGIATAASYPV 344
R+ CGIATA+SYP
Sbjct: 321 ARNQNNQCGIATASSYPT 338
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 271 bits (692), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 139/345 (40%), Positives = 203/345 (58%), Gaps = 14/345 (4%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMR 62
K + +II + ++ A G S + + +E+ + WM +H + Y+ EK R
Sbjct: 9 KIIFLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYR 68
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
IF+ NL YI++ NK+ N +Y LG N F+DL+N+EF+ Y G+ +
Sbjct: 69 FEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYVGF-VAEDFTGLEHFDNED 126
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
F Y++VT+ P SIDWR KGAVT +K+QG CGSCWAFS +A VEGI +I G L+ELSEQ+
Sbjct: 127 FTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQE 186
Query: 183 LVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
LVDC ++GC GG + +Y + N G+ T YPY+ ++ C + I+ Y+
Sbjct: 187 LVDCDKHSYGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYK 245
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
+P E + L A++NQP+SV V+A G+ F YKSGV + CG DH V VG+GT++
Sbjct: 246 RVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDG 305
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
+N Y +IKNSWG WGE GY+R+ R + G CG+ ++ YP
Sbjct: 306 KN---YIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>sp|P25774|CATS_HUMAN Cathepsin S OS=Homo sapiens GN=CTSS PE=1 SV=3
Length = 331
Score = 267 bits (682), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ + C K AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQ-YDSKYRAATCSKYTELP 231
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
G E L +AV+N+ PVSV VDA +F Y+SGV C N +HGV VVG+G +
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
Length = 330
Score = 263 bits (672), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 147/340 (43%), Positives = 207/340 (60%), Gaps = 21/340 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ ++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKQLVCVLFVCSSAVTQ---LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + P Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP-----NQWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCS
Sbjct: 113 QMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSE 172
Query: 189 D--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GG M +AF+YII+NKG+ +EA YPY+ + C K AAT SKY +LP
Sbjct: 173 KYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKATDQKCQ-YDSKYRAATCSKYTELPY 231
Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
G E L +AV+N+ PV V VDAS +F Y+SGV + C +HGV V+G+G + N
Sbjct: 232 GREDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYG---DLN 288
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
G +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 289 GKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSYP 328
>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
Length = 331
Score = 262 bits (670), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 151/340 (44%), Positives = 207/340 (60%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++ L+ C+ V + +P++ W + + YK+E E+ R I+++NL++
Sbjct: 1 MKWLVGLLPLCSYAVA--QVHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKF 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y LG N D+T EE +L G R VPS Q R T++ +
Sbjct: 59 VMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISL-MGSLR-VPS---QWQRNVTYRSNSNQ 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST+
Sbjct: 114 KLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 190 ---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GG M AF+YII+N G+ +EA YPY+ G C +K AAT SKY +LP
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKR-AATCSKYTELPF 232
Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
G E AL +AV+N+ PVSV +DAS +F Y+SGV C N +HGV VVG+G N
Sbjct: 233 GSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL---N 289
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
G YWL+KNSWG +G+ GYIR+ R++G CGIA+ SYP
Sbjct: 290 GKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 329
>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
Length = 340
Score = 262 bits (669), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 198/319 (62%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 29 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+TNEE P RQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 89 GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
N G+ +A YPY+ + C + K AAT S+Y LP GDE AL +AV+ + PVSV +D
Sbjct: 204 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262
Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
AS +F FYKSGV + C N +HGV VVG+GT + G YWL+KNSWG +G+ GYI
Sbjct: 263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 319
Query: 326 RILR-DAGLCGIATAASYP 343
R+ R + CGIA+ SYP
Sbjct: 320 RMARNNKNHCGIASYCSYP 338
>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
Length = 331
Score = 261 bits (666), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 207/340 (60%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++ ++ C+S + +P++ + W +G+ YK++ E+ R I+++NL+
Sbjct: 1 MNWLVWALLLCSSAMA--HVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKT 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y+LG N D+T+EE +L + P Q R T+K
Sbjct: 59 VTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVP-----SQWPRNVTYKSDPNQ 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
+P S+DWREKG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 114 KLPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTA 173
Query: 189 --DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GG M +AF+YII+N G+ +EA YPY+ +G C K AAT S+Y +LP
Sbjct: 174 KYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKC-QYDVKNRAATCSRYIELPF 232
Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
G E+AL +AV+N+ PVSV +DAS +F YK+GV + C N +HGV VVG+G +
Sbjct: 233 GSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLD--- 289
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
G YWL+KNSWG +G+ GYIR+ R++G CGIA SYP
Sbjct: 290 GKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYP 329
>sp|Q9GKL8|CATL1_CHLAE Cathepsin L1 OS=Chlorocebus aethiops GN=CTSL1 PE=1 SV=1
Length = 333
Score = 260 bits (664), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 207/343 (60%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
P F++ L + AS ++ S+ + +W A H R Y E+ R ++++N++
Sbjct: 3 PTFILAALCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N+E G ++ + N F D+T+EEFR + G+ +R+ + F+
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLF 111
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY E +C E +V A + + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSV-ANDTGFVDIPK 230
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
E+AL++AV+ P+SV +DA +F FYK G+ DC + + DHGV VVG+G + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+ +KYWL+KNSWGE WG GYI++ +D CGIA+AASYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 258 bits (659), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 139/351 (39%), Positives = 198/351 (56%), Gaps = 16/351 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ----WMAQHGRTYKDE 56
M+ K + + + + + ++ + G S + + E+ Q WM H + Y++
Sbjct: 3 MIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENV 62
Query: 57 LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
EK R IFK NL YI++ NK+ N +Y LG NEF+DL+N+EF Y G + + +
Sbjct: 63 DEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFADLSNDEFNEKYVG---SLIDATIE 118
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
S F ++ ++P ++DWR+KGAVT ++ QG CGSCWAFSAVA VEGI +I GKL+
Sbjct: 119 QSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLV 178
Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
ELSEQ+LVDC +HGC GG A EY+ +N G+ + YPY+ ++GTC ++
Sbjct: 179 ELSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAKQVGGPIV 237
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
S + +E LL A++ QPVSV V++ GR F YK G+ CG DH V V
Sbjct: 238 KTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAV- 296
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYP 343
+ G Y LIKNSWG WGE GYIRI R G+CG+ ++ YP
Sbjct: 297 --GYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYP 345
>sp|Q9GL24|CATL1_CANFA Cathepsin L1 OS=Canis familiaris GN=CTSL1 PE=2 SV=1
Length = 333
Score = 256 bits (654), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 140/337 (41%), Positives = 202/337 (59%), Gaps = 18/337 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
+ L C + S + S+ + QW A H R Y E+ R ++++N++ IE
Sbjct: 5 LFLTALCLG-IASAAPKFDQSLNAQWYQWKATHRRLYGMN-EEGWRRAVWEKNMKMIELH 62
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N+E G + + N F D+TNEEFR + G+ +++ + F+ ++P
Sbjct: 63 NREYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQ------NQKHKKGKMFQEPLFAEIPK 116
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNH 191
S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS N
Sbjct: 117 SVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNE 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC+GGLMD AF Y+ +N GL +E YPY + N K + AA + + DLP+ E+A
Sbjct: 177 GCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQ-REKA 235
Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKY 308
L++AV+ P+SV +DA ++F FYKSG+ + DC + + DHGV VVG+G ++ K+
Sbjct: 236 LMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKF 295
Query: 309 WLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
W++KNSWG WG +GY+++ +D CGIATAASYP
Sbjct: 296 WIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 332
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 256 bits (653), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 140/321 (43%), Positives = 198/321 (61%), Gaps = 17/321 (5%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
+E ++ +EQW+ ++G+ Y EK R IFK NL+ IE+ N + NR+Y+ G N+FSD
Sbjct: 33 NEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSD 92
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT-HIKDQGQC 152
LT +EF+A Y G S+S + R ++Y+ +P +DWRE+GAV +K QG+C
Sbjct: 93 LTADEFQASYLGGKMEKKSLSDVAER---YQYKEGDVLPDEVDWRERGAVVPRVKRQGEC 149
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKG 210
GSCWAF+A AVEGI QIT G+L+ LSEQ+L+DC DN GC+GG AFE+I EN G
Sbjct: 150 GSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGG 209
Query: 211 LATEADYPYRHEE-GTCDNQKEKAV-AATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
+ ++ Y Y E+ C + K TI+ +E +P DE +L +AV+ QP+SV + A+
Sbjct: 210 IVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMISAA 269
Query: 269 GRAFHFYKSGVLNADCGNNC-DHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
+ YKSGV C N DH V +VG+GT+ +E YWLI+NSWG WGE GY+R+
Sbjct: 270 NMS--DYKSGVYKGACSNLWGDHNVLIVGYGTSSDE--GDYWLIRNSWGPEWGEGGYLRL 325
Query: 328 LRD----AGLCGIATAASYPV 344
R+ G C +A A YP+
Sbjct: 326 QRNFHEPTGKCAVAVAPVYPI 346
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
Length = 344
Score = 255 bits (651), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 142/352 (40%), Positives = 200/352 (56%), Gaps = 29/352 (8%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M V+ L + S + + E WM H ++Y E E R NIFK N++Y
Sbjct: 1 MKVLSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSE-EFGARYNIFKANMDY 59
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKYQNVTDV 131
+++ N +G+ T LG N F+D+TNEE+R Y G S + Q + T T
Sbjct: 60 VQQWNSKGSETV-LGLNNFADITNEEYRNTYLGTKFDASSLIGTQEEKVFT------TSS 112
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH 191
S DWR +GAVT +K+QGQCG CW+FS + EG ++G+L+ LSEQ L+DCST+N
Sbjct: 113 AASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTENS 172
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GGLM AFEYII N G+ TE+ YPY+ E G C+ + E + AT+S Y+ + G E +
Sbjct: 173 GCDGGLMTYAFEYIINNNGIDTESSYPYKAENGKCEYKSENS-GATLSSYKTVTAGSESS 231
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGA--- 306
L AV+ PVSV +DAS ++F Y SG+ +C + N DHGV VG+G+ +
Sbjct: 232 LESAVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSS 291
Query: 307 -------------KYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+YW++KNSWG +WG GYI + R+ CGIA++AS+PV
Sbjct: 292 GQSSGNLSASSSNEYWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSASFPV 343
>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
Length = 334
Score = 255 bits (651), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 194/311 (62%), Gaps = 18/311 (5%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
+W A HGR Y E+ R ++++N++ IE N+E G + + N F D+TNEEFR
Sbjct: 31 KWKATHGRLYGMN-EEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFR 89
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
+ G+ +++ + F V +VP S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90 QVMNGFQ------NQKHKKGKVFHESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSA 143
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
A+EG GKL+ LSEQ LVDCS N GC+GGLMD AF+Y+ +N GL TE YP
Sbjct: 144 TGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYP 203
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y E K + AA + + D+P+ E+AL++AV+ P+SV +DA +F FYKS
Sbjct: 204 YLGRETNSCTYKPECSAANDTGFVDIPQ-REKALMKAVATVGPISVAIDAGHSSFQFYKS 262
Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
G+ + DC + + DHGV VVG+G + N +K+W++KNSWG WG +GY+++ +D
Sbjct: 263 GIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKDQNNH 322
Query: 334 CGIATAASYPV 344
CGI+TAASYP
Sbjct: 323 CGISTAASYPT 333
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 254 bits (649), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 121/219 (55%), Positives = 153/219 (69%), Gaps = 8/219 (3%)
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
D+P SIDWRE GAV +K+QG CGSCWAFS VAAVEGI QI G LI LSEQQLVDC+T
Sbjct: 2 DLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA 61
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
NHGC GG M+ AF++I+ N G+ +E YPYR ++G C N A +I YE++P +E
Sbjct: 62 NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGIC-NSTVNAPVVSIDSYENVPSHNE 120
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
Q+L +AV+NQPVSV +DA+GR F Y+SG+ C + +H + VVG+GT EN +W
Sbjct: 121 QSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGT---ENDKDFW 177
Query: 310 LIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
++KNSWG+ WGESGYIR R+ G CGI ASYPV
Sbjct: 178 IVKNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216
>sp|P25975|CATL1_BOVIN Cathepsin L1 OS=Bos taurus GN=CTSL1 PE=1 SV=3
Length = 334
Score = 254 bits (648), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 204/343 (59%), Gaps = 22/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
P F + +L + V S +P++ QW A H R Y E+ R ++++N +
Sbjct: 3 PSFFLTVLCLG----VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNKK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
I+ N+E G +++ N F D+TNEEFR + G+ +++ + F +
Sbjct: 58 IIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQ------NQKHKKGKLFHEPLL 111
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
DVP S+DW +KG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 VDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR 171
Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD AF+YI +N GL +E YPY + N K + AA + + D+P+
Sbjct: 172 AQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ 231
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
E+AL++AV+ P+SV +DA +F FYKSG+ + DC + + DHGV VVG+G +
Sbjct: 232 -REKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTD 290
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
N K+W++KNSWG WG +GY+++ +D CGIATAASYP
Sbjct: 291 SNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 333
>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 254 bits (648), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 19/311 (6%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
QW + H R Y E+ R I+++N+ I+ N E G + + N F D+TNEEFR
Sbjct: 31 QWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFR 89
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
+ GY R P K +P S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90 QVVNGYRHQKHKKGRLFQEPLMLK------IPKSVDWREKGCVTPVKNQGQCGSCWAFSA 143
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
+EG + GKLI LSEQ LVDCS N GC+GGLMD AF+YI EN GL +E YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y ++G+C + E AV A + + D+P+ E+AL++AV+ P+SV +DAS + FY S
Sbjct: 204 YEAKDGSCKYRAEFAV-ANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261
Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
G+ +C + N DHGV +VG+G + N KYWL+KNSWG WG GYI+I +D
Sbjct: 262 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 321
Query: 334 CGIATAASYPV 344
CG+ATAASYPV
Sbjct: 322 CGLATAASYPV 332
>sp|P07711|CATL1_HUMAN Cathepsin L1 OS=Homo sapiens GN=CTSL1 PE=1 SV=2
Length = 333
Score = 253 bits (645), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 204/342 (59%), Gaps = 20/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M +IL C + S + S+ + +W A H R Y E+ R ++++N++
Sbjct: 1 MNPTLILAAFCLG-IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N +EG ++ + N F D+T+EEFR + G+ +R+ + F+
Sbjct: 59 IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLFY 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
+ P S+DWREKG VT +K+QGQCGSCWAFSA A+EG G+LI LSEQ LVDCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD AF+Y+ +N GL +E YPY E +C + +V A + + D+PK
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSV-ANDTGFVDIPK- 230
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+AL++AV+ P+SV +DA +F FYK G+ DC + + DHGV VVG+G + E
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+ KYWL+KNSWGE WG GY+++ +D CGIA+AASYP
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPT 332
>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 253 bits (645), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 191/311 (61%), Gaps = 19/311 (6%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
QW + H R Y E+ R ++++N+ I+ N E G + + N F D+TNEEFR
Sbjct: 31 QWKSTHRRLYGTN-EEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFR 89
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
+ GY R P + +P ++DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90 QIVNGYRHQKHKKGRLFQEPLMLQ------IPKTVDWREKGCVTPVKNQGQCGSCWAFSA 143
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYP 218
+EG + GKLI LSEQ LVDCS D N GC+GGLMD AF+YI EN GL +E YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y ++G+C + E AVA + + D+P+ E+AL++AV+ P+SV +DAS + FY S
Sbjct: 204 YEAKDGSCKYRAEYAVAND-TGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261
Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
G+ +C + + DHGV VVG+G + N KYWL+KNSWG+ WG GYI+I +D
Sbjct: 262 GIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNH 321
Query: 334 CGIATAASYPV 344
CG+ATAASYP+
Sbjct: 322 CGLATAASYPI 332
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 252 bits (643), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 134/357 (37%), Positives = 206/357 (57%), Gaps = 29/357 (8%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPS----IVEKHEQWMAQHGRTYKDE 56
M+ K + + + + + ++ + G S ++ + +++ E WM +H + YK+
Sbjct: 3 MIPSISKLLFVAICLFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNI 62
Query: 57 LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
EK R IFK NL+YI++ NK+ N +Y LG N F+D++N+EF+ YTG S++
Sbjct: 63 DEKIYRFEIFKDNLKYIDETNKK-NNSYWLGLNVFADMSNDEFKEKYTG------SIAGN 115
Query: 117 SSRPSTFKYQNV-----TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQIT 171
+ + Y+ V ++P +DWR+KGAVT +K+QG CGSCWAFSAV +EGI +I
Sbjct: 116 YT-TTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIR 174
Query: 172 RGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKE 231
G L E SEQ+L+DC ++GC+GG A + ++ G+ YPY + C ++++
Sbjct: 175 TGNLNEYSEQELLDCDRRSYGCNGGYPWSALQ-LVAQYGIHYRNTYPYEGVQRYCRSREK 233
Query: 232 KAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHG 291
AA + +E ALL +++NQPVSV ++A+G+ F Y+ G+ CGN DH
Sbjct: 234 GPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHA 293
Query: 292 VAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
VA VG+ G Y LIKNSWG WGE+GYIRI R G+CG+ T++ YPV
Sbjct: 294 VAAVGY-------GPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPV 343
>sp|Q5E998|CATL2_BOVIN Cathepsin L2 OS=Bos taurus GN=CTSL2 PE=2 SV=1
Length = 334
Score = 250 bits (639), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 141/343 (41%), Positives = 203/343 (59%), Gaps = 22/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
P F + +L + V S +P++ QW A H R Y E+ R ++++N +
Sbjct: 3 PSFFLTVLCLG----VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNKK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
I+ N+E G +++ N F D+TNEEFR + G+ +++ + F +
Sbjct: 58 IIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQ------NQKHKKGKLFHEPLL 111
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
DVP S+DW +KG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 VDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR 171
Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD AF+YI +N L +E YPY + N K + AA + + D+P+
Sbjct: 172 AQGNQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ 231
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
E+AL++AV+ P+SV +DA +F FYKSG+ + DC + + DHGV VVG+G +
Sbjct: 232 -REKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTD 290
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
N K+W++KNSWG WG +GY+++ +D CGIATAASYP
Sbjct: 291 SNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 333
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 249 bits (637), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 145/350 (41%), Positives = 206/350 (58%), Gaps = 31/350 (8%)
Query: 10 IIPMFVIIILVIT--CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
I +F +I+L I+ A V S + + I WM + + Y + E R FK
Sbjct: 5 ITLIFTLIVLSISFISAGNVFSHKQYQDSFI-----DWMRSNNKAYTHK-EFMPRYEEFK 58
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVP-------SVSRQSSRP 120
+N++Y+ N +G++T LG N+ +DL+NEE+R Y G + ++ + +RP
Sbjct: 59 KNMDYVHNWNSKGSKTV-LGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRLNRP 117
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
FK P ++DWREK AVT +KDQGQCGSC++FS +VEG+T I GKL+ LSE
Sbjct: 118 Q-FK------QPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSE 170
Query: 181 QQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
Q ++DCS+ N GC+GGLM AFEYII+N GL +E YPY + +E +VAA I
Sbjct: 171 QNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVAAKI 230
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVG 296
+ Y+++ GDE L A+ PVSV +DAS +F Y +GV A + DHGV VG
Sbjct: 231 TSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVG 290
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPVA 345
GT +NG Y+++KNSWG +WG +GYI + R+ CGI+T ASYP+A
Sbjct: 291 MGT---DNGEDYYIVKNSWGPSWGLNGYIHMARNKDNNCGISTMASYPIA 337
>sp|O60911|CATL2_HUMAN Cathepsin L2 OS=Homo sapiens GN=CTSL2 PE=1 SV=2
Length = 334
Score = 248 bits (633), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 199/341 (58%), Gaps = 19/341 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + ++L C + S + ++ K QW A H R Y E+ R ++++N++
Sbjct: 1 MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAN-EEGWRRAVWEKNMKM 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G + + N F D+TNEEFR + + +++ + F+
Sbjct: 59 IELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFR------NQKFRKGKVFREPLFL 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
D+P S+DWR+KG VT +K+Q QCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GG M +AF+Y+ EN GL +E YPY + C + E +V A + + + G
Sbjct: 173 QGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSV-ANDTGFTVVAPG 231
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+AL++AV+ P+SV +DA +F FYKSG+ DC + N DHGV VVG+G
Sbjct: 232 KEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANS 291
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYP 343
N +KYWL+KNSWG WG +GY++I +D CGIATAASYP
Sbjct: 292 NNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYP 332
>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
lycopersicum PE=2 SV=1
Length = 346
Score = 248 bits (632), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 117/219 (53%), Positives = 150/219 (68%), Gaps = 8/219 (3%)
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P SIDWREKG + +KDQG CGSCWAFSAVAA+E I I G LI LSEQ+LVDC
Sbjct: 18 LPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSY 77
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
N GC GGLMD AFE++I+N G+ TE DYPY+ G CD ++ A I YED+P +E
Sbjct: 78 NEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNNE 137
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
+AL +AV++QPVS+ ++A GR F YKSG+ CG DHGV + G+GT ENG YW
Sbjct: 138 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGT---ENGMDYW 194
Query: 310 LIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
+++NSWG E+GY+R+ R+ +GLCG+A SYPV
Sbjct: 195 IVRNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPV 233
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
SV=2
Length = 322
Score = 246 bits (629), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 137/313 (43%), Positives = 187/313 (59%), Gaps = 23/313 (7%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEF 99
E++ + GR Y D E+ RLN+F NL+YIE+ NK+ G TY L N+FSD+TNE+F
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80
Query: 100 RALYTGYNR-PVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
A+ GY + P P+ + F + T +DWR KGAVT +KDQGQCGSCWAF
Sbjct: 81 NAVMKGYKKGPRPA--------AVFTSTDAAPESTEVDWRTKGAVTPVKDQGQCGSCWAF 132
Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDC---STDNHGCSGGLMDKAFEYIIENKGLATEA 215
S +EG + G+L+ LSEQQLVDC S N GC+GG +++A Y+ +N G+ TE+
Sbjct: 133 STTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTES 192
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHF 274
YPY + TC + AT + Y + +G E AL A + P+SV +DAS R+F
Sbjct: 193 SYPYEARDNTC-RFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQS 251
Query: 275 YKSGV-LNADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
Y +GV C ++ DH V VG+G+ E G +WL+KNSW +WGESGYI++ R+
Sbjct: 252 YYTGVYYEPSCSSSQLDHAVLAVGYGS---EGGQDFWLVKNSWATSWGESGYIKMARNRN 308
Query: 332 GLCGIATAASYPV 344
CGIAT A YP
Sbjct: 309 NNCGIATDACYPT 321
>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 246 bits (629), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 120/218 (55%), Positives = 149/218 (68%), Gaps = 8/218 (3%)
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
+P SIDWREKGAV +K+QG CGSCWAF A+AAVEGI QI G LI LSEQQLVDCST N
Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTRN 62
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
HGC GG +AF+YII N G+ +E YPY GTCD KE A +I Y ++P DE+
Sbjct: 63 HGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCD-TKENAHVVSIDSYRNVPSNDEK 121
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
+L +AV+NQPVSV +DA+GR F Y++G+ C + +H V G E EN YW
Sbjct: 122 SLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTV---GGRETENDKDYWT 178
Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
+KNSWG+ WGESGYIR+ R+ +G CGIA + SYP+
Sbjct: 179 VKNSWGKNWGESGYIRVERNIAESSGKCGIAISPSYPI 216
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.132 0.398
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 131,356,968
Number of Sequences: 539616
Number of extensions: 5601380
Number of successful extensions: 15128
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 219
Number of HSP's successfully gapped in prelim test: 19
Number of HSP's that attempted gapping in prelim test: 14013
Number of HSP's gapped (non-prelim): 304
length of query: 346
length of database: 191,569,459
effective HSP length: 118
effective length of query: 228
effective length of database: 127,894,771
effective search space: 29160007788
effective search space used: 29160007788
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (28.5 bits)
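
Note on the statistics footer: the derived values above can be cross-checked from the primary ones. The sketch below is not part of the BLAST output. It assumes the usual length correction (subtract the effective HSP length from the query and from every database sequence) and the standard bit-score conversion S' = (lambda*S - ln K) / ln 2. It also assumes that X1 and S1 use the first (ungapped) Lambda/K line and that X2, X3 and S2 use the second (gapped) line; this pairing is inferred from the printed numbers, not stated in the report. The function names are hypothetical.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LENGTH = 346
DB_LETTERS = 191_569_459
DB_SEQUENCES = 539_616
EFFECTIVE_HSP_LENGTH = 118

# Assumption: first Lambda/K line is ungapped, second is gapped.
UNGAPPED_LAMBDA, UNGAPPED_K = 0.316, 0.132
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def dropoff_bits(raw, lam):
    """Convert a raw X-dropoff value to bits (no K term for dropoffs)."""
    return raw * lam / math.log(2)


def bit_score(raw, lam, k):
    """Convert a raw score to a bit score: (lambda*S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


if __name__ == "__main__":
    print(effective_search_space(QUERY_LENGTH, DB_LETTERS,
                                 DB_SEQUENCES, EFFECTIVE_HSP_LENGTH))
    print(round(dropoff_bits(16, UNGAPPED_LAMBDA), 1))        # X1
    print(round(dropoff_bits(38, GAPPED_LAMBDA), 1))          # X2
    print(round(bit_score(41, UNGAPPED_LAMBDA, UNGAPPED_K), 1))  # S1
    print(round(bit_score(62, GAPPED_LAMBDA, GAPPED_K), 1))      # S2
```

Under these assumptions the computed values agree with the footer: 228, 127,894,771 and 29,160,007,788 for the effective lengths and search space, and the bit equivalents printed for X1, X2, X3, S1 and S2. The per-hit bit scores may differ slightly from this conversion because the hits use compositional matrix adjustment and the printed Lambda and K are rounded.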