BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 042468
(346 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 380 bits (975), Expect = e-104, Method: Compositional matrix adjust.
Identities = 190/308 (61%), Positives = 222/308 (72%), Gaps = 6/308 (1%)
Query: 39 HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
+E W + + V R EK+ RF +FK N ++ + N +KPYKL +N+FAD TN EFR
Sbjct: 38 YERWRSHH-TVSRSLHEKQKRFNVFKHNAMHVHNANK--MDKPYKLKLNKFADMTNHEFR 94
Query: 99 APRNGYK-RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
+G K + R + +F YE +VPAS+DWRKKGAVT VKDQGQCG CWAFS
Sbjct: 95 NTYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFS 154
Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
+ A+EGIN I T KL SLSEQELVDCDT ++QGC GGLMD AFEFI G+ TEA Y
Sbjct: 155 TIVAVEGINQIKTNKLVSLSEQELVDCDTD-QNQGCNGGLMDYAFEFIKQRGGITTEANY 213
Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
PY+A DG+C+ + N A I G+E+VP N+E AL+KAVANQPVSVAIDA GSDFQFYS
Sbjct: 214 PYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSE 273
Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
GVFTG CGTELDHGV VGYGT DGTKYW VKNSWG WGE GYIRM+R I KEGLCG
Sbjct: 274 GVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCG 333
Query: 337 IAMQASYP 344
IAM+ASYP
Sbjct: 334 IAMEASYP 341
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 377 bits (967), Expect = e-104, Method: Compositional matrix adjust.
Identities = 192/342 (56%), Positives = 233/342 (68%), Gaps = 14/342 (4%)
Query: 10 LVLAAILVLGVWAPQSWSRTLNDATMNERHEMW-----MAQYGRVYRDNAEKEMRFKIFK 64
+VL+ LVLGV + S +D + +W + V R EK RF +FK
Sbjct: 9 VVLSFSLVLGV----ANSFDFHDKDLASEESLWDLYERWRSHHTVSRSLGEKHKRFNVFK 64
Query: 65 ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSV-RSSETTDVSFRY 123
N+ ++ N +KPYKL +N+FAD TN EFR+ G K P + R + + +F Y
Sbjct: 65 ANLMHV--HNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMY 122
Query: 124 ENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
E SVP S+DWRKKGAVT VKDQGQCG CWAFS V A+EGIN I T KL +LSEQELVD
Sbjct: 123 EKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVD 182
Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
CD E+QGC GGLM+ AFEFI G+ TE+ YPYKA +G+C+ + N A I G+E+
Sbjct: 183 CDKE-ENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHEN 241
Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
VP+N+E AL+KAVANQPVSVAIDA GSDFQFYS GVFTG C T+L+HGV VGYGT DG
Sbjct: 242 VPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDG 301
Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
T YW+V+NSWG WGE+GYIRMQR+I KEGLCGIAM SYP
Sbjct: 302 TNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYP 343
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 377 bits (967), Expect = e-104, Method: Compositional matrix adjust.
Identities = 186/308 (60%), Positives = 219/308 (71%), Gaps = 6/308 (1%)
Query: 39 HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
+E W + + V R EK RF +FK NV ++ N +KPYKL +N+FAD TN EFR
Sbjct: 40 YERWRSHH-TVSRSLGEKHKRFNVFKANVMHV--HNTNKMDKPYKLKLNKFADMTNHEFR 96
Query: 99 APRNGYK-RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
+ G K R S+ +F YE SVPAS+DWRKKGAVT VKDQGQCG CWAFS
Sbjct: 97 STYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFS 156
Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
+ A+EGIN I T KL SLSEQELVDCD E+QGC GGLM+ AFEFI G+ TE+ Y
Sbjct: 157 TIVAVEGINQIKTNKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGGITTESNY 215
Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
PY A +G+C++ + N A I G+E+VP N+E AL+KAVANQPVSVAIDA GSDFQFYS
Sbjct: 216 PYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSE 275
Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
GVFTG C T+L+HGV VGYGT DGT YW+V+NSWG WGE GYIRMQR+I KEGLCG
Sbjct: 276 GVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEGLCG 335
Query: 337 IAMQASYP 344
IAM ASYP
Sbjct: 336 IAMMASYP 343
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 376 bits (965), Expect = e-103, Method: Compositional matrix adjust.
Identities = 190/310 (61%), Positives = 221/310 (71%), Gaps = 6/310 (1%)
Query: 37 ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
E +E W + + V R EK RF +FK NV++I N K +K YKL +N+F D T+EE
Sbjct: 36 ELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKK--DKSYKLKLNKFGDMTSEE 92
Query: 97 FRAPRNGYKRRLPSVRSSETTDV-SFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWA 154
FR G + + E SF Y N ++P S+DWRK GAVT VK+QGQCG CWA
Sbjct: 93 FRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWA 152
Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
FS V A+EGIN I T+KLTSLSEQELVDCDT+ ++QGC GGLMD AFEFI GL +E
Sbjct: 153 FSTVVAVEGINQIRTKKLTSLSEQELVDCDTN-QNQGCNGGLMDLAFEFIKEKGGLTSEL 211
Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
YPYKASD +C+ + N I G+EDVP N+E LMKAVANQPVSVAIDA GSDFQFY
Sbjct: 212 VYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFY 271
Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
S GVFTG+CGTEL+HGV VGYGT DGTKYW+VKNSWG WGE GYIRMQR I KEGL
Sbjct: 272 SEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGL 331
Query: 335 CGIAMQASYP 344
CGIAM+ASYP
Sbjct: 332 CGIAMEASYP 341
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 374 bits (961), Expect = e-103, Method: Compositional matrix adjust.
Identities = 194/338 (57%), Positives = 234/338 (69%), Gaps = 7/338 (2%)
Query: 10 LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
LV + L + P + ++ ++ +E W + V RD EK RF +FKENV++
Sbjct: 11 LVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHH-TVARDLDEKNRRFNVFKENVKF 69
Query: 70 IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYENA-S 127
I FN K ++ PYKL +N+F D TN+EFR+ G K + S R + SF YEN S
Sbjct: 70 IHEFNQK-KDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYENVGS 128
Query: 128 VPA-SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
+PA SIDWR KGAVTGVKDQGQCG CWAFS +A++EGIN I T +L SLSEQELVDCDTS
Sbjct: 129 LPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQELVDCDTS 188
Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
++GC GGLMD AFEFI N G+ TE YPY DG+C N I G++DVP+N
Sbjct: 189 -YNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQDVPAN 246
Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
NE ALM+AVANQP+SV+I+ASG FQFYS GVFTG+CGTELDHGV VGYG DGTKYW
Sbjct: 247 NENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGTKYW 306
Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
+VKNSWG WGE+GYIRMQR I K G CGIAM+ASYP
Sbjct: 307 IVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYP 344
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 352 bits (904), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 174/311 (55%), Positives = 215/311 (69%), Gaps = 7/311 (2%)
Query: 39 HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
+E W + + RV R +AEK RF FK N +I S N + + PY+L +N F D EFR
Sbjct: 46 YERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRG-DHPYRLHLNRFGDMDQAEFR 103
Query: 99 APRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
A G RR + + N S +P S+DWR+KGAVTGVKDQG+CG CWAFS
Sbjct: 104 ATFVGDLRRDTPSKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFST 163
Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
V ++EGIN I T L SLSEQEL+DCDT+ D GC+GGLMD+AFE+I +N GL TEA YP
Sbjct: 164 VVSVEGINAIRTGSLVSLSEQELIDCDTADND-GCQGGLMDNAFEYIKNNGGLITEAAYP 222
Query: 218 YKASDGSCNKKEA---NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
Y+A+ G+CN A +P I G++DVP+N+E L +AVANQPVSVA++ASG F FY
Sbjct: 223 YRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFY 282
Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
S GVFTG+CGTELDHGV VGYG A+DG YW VKNSWG +WGE GYIR+++D A GL
Sbjct: 283 SEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGL 342
Query: 335 CGIAMQASYPT 345
CGIAM+ASYP
Sbjct: 343 CGIAMEASYPV 353
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 352 bits (902), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 175/311 (56%), Positives = 215/311 (69%), Gaps = 7/311 (2%)
Query: 39 HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
+E W + + RV R +AEK RF FK N +I S +NK + PY+L +N F D EFR
Sbjct: 46 YERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHS-HNKRGDHPYRLHLNRFGDMDQAEFR 103
Query: 99 APRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
A G RR + + N S +P S+DWR+KGAVTGVKDQG+CG CWAFS
Sbjct: 104 ATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFST 163
Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
V ++EGIN I T L SLSEQEL+DCDT+ D GC+GGLMD+AFE+I +N GL TEA YP
Sbjct: 164 VVSVEGINAIRTGSLVSLSEQELIDCDTADND-GCQGGLMDNAFEYIKNNGGLITEAAYP 222
Query: 218 YKASDGSCNKKEA---NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
Y+A+ G+CN A +P I G++DVP+N+E L +AVANQPVSVA++ASG F FY
Sbjct: 223 YRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFY 282
Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
S GVFTG CGTELDHGV VGYG A+DG YW VKNSWG +WGE GYIR+++D A GL
Sbjct: 283 SEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGL 342
Query: 335 CGIAMQASYPT 345
CGIAM+ASYP
Sbjct: 343 CGIAMEASYPV 353
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 344 bits (883), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 173/311 (55%), Positives = 220/311 (70%), Gaps = 11/311 (3%)
Query: 39 HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
++ W + + V R E+E RF +F+ NV ++ + N K N+ YKL +N+FAD T EF+
Sbjct: 38 YDRWRSHHS-VPRSLNEREKRFNVFRHNVMHVHNTNKK--NRSYKLKLNKFADLTINEFK 94
Query: 99 APRNG----YKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCW 153
G + R L + + + +EN S +P+S+DWRKKGAVT +K+QG+CG CW
Sbjct: 95 NAYTGSNIKHHRMLQGPKRG-SKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCW 153
Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
AFS VAA+EGIN I T KL SLSEQELVDCDT +++GC GGLM+ AFEFI N G+ TE
Sbjct: 154 AFSTVAAVEGINKIKTNKLVSLSEQELVDCDTK-QNEGCNGGLMEIAFEFIKKNGGITTE 212
Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
YPY+ DG C+ + N I G+EDVP N+E AL+KAVANQPVSVAIDA SDFQF
Sbjct: 213 DSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQF 272
Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
YS GVFTG CGTEL+HGV AVGYG+ + G KYW+V+NSWG WGE GYI+++R+ID EG
Sbjct: 273 YSEGVFTGSCGTELNHGVAAVGYGS-ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEG 331
Query: 334 LCGIAMQASYP 344
CGIAM+ASYP
Sbjct: 332 RCGIAMEASYP 342
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 342 bits (877), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 170/316 (53%), Positives = 220/316 (69%), Gaps = 7/316 (2%)
Query: 31 NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
N + E E WM+++ + Y+ EK RF++F+EN+ +I NN+ + Y LG+NEFA
Sbjct: 43 NTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS--YWLGLNEFA 100
Query: 91 DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQC 149
D T+EEF+ G + P +FRY + + +P S+DWRKKGAV VKDQGQC
Sbjct: 101 DLTHEEFKGRYLGLAK--PQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQC 158
Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
G CWAFS VAA+EGIN ITT L+SLSEQEL+DCDT+ + GC GGLMD AF++IIS G
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTF-NSGCNGGLMDYAFQYIISTGG 217
Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
L E YPY +G C +++ + ISGYEDVP N++ +L+KA+A+QPVSVAI+ASG
Sbjct: 218 LHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGR 277
Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
DFQFY GVF G+CGT+LDHGV AVGYG++ G+ Y +VKNSWG WGE G+IRM+R+
Sbjct: 278 DFQFYKGGVFNGKCGTDLDHGVAAVGYGSS-KGSDYVIVKNSWGPRWGEKGFIRMKRNTG 336
Query: 330 AKEGLCGIAMQASYPT 345
EGLCGI ASYPT
Sbjct: 337 KPEGLCGINKMASYPT 352
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 339 bits (869), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 172/310 (55%), Positives = 209/310 (67%), Gaps = 7/310 (2%)
Query: 39 HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
+E W + V R + E RF +F+ NV ++ N K NKPYKL IN FAD T+ EFR
Sbjct: 38 YERWRGHHS-VSRASHEAIKRFNVFRHNVLHVHRTNKK--NKPYKLKINRFADITHHEFR 94
Query: 99 APRNGYK-RRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
+ G + +R + F YEN + VP+S+DWR+KGAVT VK+Q CG CWAFS
Sbjct: 95 SSYAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFS 154
Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
VAA+EGIN I T KL SLSEQELVDCDT E+QGC GGLM+ AFEFI +N G+ TE Y
Sbjct: 155 TVAAVEGINKIRTNKLVSLSEQELVDCDTE-ENQGCAGGLMEPAFEFIKNNGGIKTEETY 213
Query: 217 PYKASDGS-CNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
PY +SD C I G+E VP N+E L+KAVA+QPVSVAIDA SDFQ YS
Sbjct: 214 PYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYS 273
Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
GVF G+CGT+L+HGV VGYG +GTKYW+V+NSWG WGE GY+R++R I EG C
Sbjct: 274 EGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRC 333
Query: 336 GIAMQASYPT 345
GIAM+ASYPT
Sbjct: 334 GIAMEASYPT 343
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 337 bits (863), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 165/311 (53%), Positives = 220/311 (70%), Gaps = 9/311 (2%)
Query: 39 HEMWMAQYGRVYRD--NAEKEMRFKIFKENVEYIASFNNKARNKP-YKLGINEFADQTNE 95
+++W+A+ G + E E RF +F +N++++ + N +A + ++LG+N FAD TNE
Sbjct: 52 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111
Query: 96 EFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWA 154
EFRA G K S + E +R++ +P S+DWR+KGAV VK+QGQCG CWA
Sbjct: 112 EFRATFLGAKVAERSRAAGE----RYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167
Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
FSAV+ +E IN + T ++ +LSEQELV+C T+G++ GC GGLMDDAF+FII N G+ TE
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227
Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
YPYKA DG C+ N I G+EDVP N+E +L KAVA+QPVSVAI+A G +FQ Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287
Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
SGVF+G+CGT LDHGV AVGYGT D+G YW+V+NSWG WGE+GY+RM+R+I+ G
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 346
Query: 335 CGIAMQASYPT 345
CGIAM ASYPT
Sbjct: 347 CGIAMMASYPT 357
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 334 bits (856), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 161/295 (54%), Positives = 207/295 (70%), Gaps = 7/295 (2%)
Query: 55 EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSS 114
+++ RF IFK+N+ +I N +N YKLG+ FA+ TN+E+R+ G R P R +
Sbjct: 24 QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLG-ARTEPVRRIT 82
Query: 115 ETTDVSFRYENA----SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTR 170
+ +V+ +Y A VP ++DWR+KGAV +KDQG CG CWAFS AA+EGIN I T
Sbjct: 83 KAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTG 142
Query: 171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
+L SLSEQELVDCD S +QGC GGLMD AF+FI+ N GL TE YPY ++G CN
Sbjct: 143 ELVSLSEQELVDCDKS-YNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNSLLK 201
Query: 231 NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHG 290
N I GYEDVPS +E AL +AV+ QPVSVAIDA G FQ Y SG+FTG+CGT +DH
Sbjct: 202 NSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGKCGTNMDHA 261
Query: 291 VTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
V AVGYG+ ++G YW+V+NSWGT WGE+GYIRM+R++ +K G CGIA++ASYP
Sbjct: 262 VVAVGYGS-ENGVDYWIVRNSWGTRWGEDGYIRMERNVASKSGKCGIAIEASYPV 315
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 333 bits (854), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 165/305 (54%), Positives = 210/305 (68%), Gaps = 6/305 (1%)
Query: 42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
W A++G+ Y E+E R+ F++N+ YI N A ++LG+N FAD TNEE+R
Sbjct: 43 WKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRD 102
Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
G + + R + +D +N ++P S+DWR KGAV +KDQG CG CWAFSA+A
Sbjct: 103 TYLGLRNK--PRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIA 160
Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
A+EGIN I T L SLSEQELVDCDTS ++GC GGLMD AF+FII+N G+ TE YPYK
Sbjct: 161 AVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYPYK 219
Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
D C+ N I YEDV N+E +L KAVANQPVSVAI+A G FQ YSSG+F
Sbjct: 220 GKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGIF 279
Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
TG+CGT LDHGV AVGYGT ++G YW+V+NSWG +WGE+GY+RM+R+I A G CGIA+
Sbjct: 280 TGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGIAV 338
Query: 340 QASYP 344
+ SYP
Sbjct: 339 EPSYP 343
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 331 bits (849), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 165/319 (51%), Positives = 217/319 (68%), Gaps = 14/319 (4%)
Query: 31 NDATMNERHEMWMAQYGRVYRDNA--EKEMRFKIFKENVEYIASFNNKARNKPYKLGINE 88
++A + +E W+ ++G+ N+ EK+ RF+IFK+N+ ++ N K N Y+LG+
Sbjct: 42 SEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK--NLSYRLGLTR 99
Query: 89 FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKD 145
FAD TN+E+R+ G K R + S RYE +P SIDWRKKGAV VKD
Sbjct: 100 FADLTNDEYRSKYLGAKMEKKGERRT-----SLRYEARVGDELPESIDWRKKGAVAEVKD 154
Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
QG CG CWAFS + A+EGIN I T L +LSEQELVDCDTS ++GC GGLMD AFEFII
Sbjct: 155 QGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFII 213
Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
N G+ T+ YPYK DG+C++ N I YEDVP+ +E +L KAVA+QP+S+AI+
Sbjct: 214 KNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIE 273
Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
A G FQ Y SG+F G CGT+LDHGV AVGYGT ++G YW+V+NSWG +WGE+GY+RM
Sbjct: 274 AGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMA 332
Query: 326 RDIDAKEGLCGIAMQASYP 344
R+I + G CGIA++ SYP
Sbjct: 333 RNIASSSGKCGIAIEPSYP 351
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 331 bits (848), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 163/310 (52%), Positives = 216/310 (69%), Gaps = 6/310 (1%)
Query: 37 ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
E E W++ + + Y EK +RF++FK+N+++I N K K Y LG+NEFAD ++EE
Sbjct: 49 ELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG--KSYWLGLNEFADLSHEE 106
Query: 97 FRAPRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
F+ G K + R E + F Y + +VP S+DWRKKGAV VK+QG CG CWAF
Sbjct: 107 FKKMYLGLKTDIVR-RDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAF 165
Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
S VAA+EGIN I T LT+LSEQEL+DCDT+ + GC GGLMD AFE+I+ N GL E
Sbjct: 166 STVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKEED 224
Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
YPY +G+C ++ I+G++DVP+N+E +L+KA+A+QP+SVAIDASG +FQFYS
Sbjct: 225 YPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYS 284
Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
GVF G+CG +LDHGV AVGYG++ G+ Y +VKNSWG WGE GYIR++R+ EGLC
Sbjct: 285 GGVFDGRCGVDLDHGVAAVGYGSS-KGSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLC 343
Query: 336 GIAMQASYPT 345
GI AS+PT
Sbjct: 344 GINKMASFPT 353
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 330 bits (847), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 166/294 (56%), Positives = 209/294 (71%), Gaps = 8/294 (2%)
Query: 55 EKEMRFKIFKENVEYIASFNNKARNKP-YKLGINEFADQTNEEFRAPRNGYKRRLPSVRS 113
E E RF++F +N++++ + N +A + ++LG+N FAD TN EFRA Y P+ R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRAT---YLGTTPAGRG 140
Query: 114 SETTDVSFRYENA-SVPASIDWRKKGAVTG-VKDQGQCGCCWAFSAVAAMEGINHITTRK 171
+ ++R++ ++P S+DWR KGAV VK+QGQCG CWAFSAVAA+EGIN I T +
Sbjct: 141 RRVGE-AYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 172 LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEAN 231
L SLSEQELV+C +G++ GC GG+MDDAF FI N GL TE YPY A DG CN + +
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 232 PSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGV 291
I G+EDVP N+E +L KAVA+QPVSVAIDA G +FQ Y SGVFTG+CGT LDHGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 292 TAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
AVGYGT A G YW V+NSWG WGENGYIRM+R++ A+ G CGIAM ASYP
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYP 373
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 327 bits (838), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 170/343 (49%), Positives = 223/343 (65%), Gaps = 19/343 (5%)
Query: 9 KLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
+LV + + +WA P + S M ++ E WMA+YGRVY+DN EK +RF+IFK NV
Sbjct: 6 QLVFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNV 65
Query: 68 EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-----RRLPSVRSSETTDVSFR 122
+I +FNN+ N Y LGIN+F D TN EF A G +R P V S + D+S
Sbjct: 66 NHIETFNNRNGNS-YTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVV-SFDDVDIS-- 121
Query: 123 YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
SVP SIDWR GAVT VK+QG+CG CWAF+++A +E I I L SLSEQ+++D
Sbjct: 122 ----SVPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLD 177
Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
C S GC+GG ++ A+ FIISNKG+A+ A YPYKA+ G+C K P++A I+ Y
Sbjct: 178 CAVS---YGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTC-KTNGVPNSAYITRYTY 233
Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
V NNE +M AV+NQP++ A+DASG +FQ Y GVFTG CGT L+H + +GYG G
Sbjct: 234 VQRNNERNMMYAVSNQPIAAALDASG-NFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSG 292
Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
K+W+V+NSWG WGE GYIR+ RD+ + GLCGIAM YPT
Sbjct: 293 KKFWIVRNSWGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYPT 335
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 324 bits (831), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 166/339 (48%), Positives = 223/339 (65%), Gaps = 11/339 (3%)
Query: 9 KLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
+LV + + +WA P + SR + M +R E WMA+YGRVY+D+ EK RF+IFK NV
Sbjct: 6 QLVFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNV 65
Query: 68 EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
++I +FN++ N Y LGIN+F D T EF A G L R VSF N S
Sbjct: 66 KHIETFNSRNENS-YTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPV---VSFDDVNIS 121
Query: 128 -VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
VP SIDWR GAV VK+Q CG CW+F+A+A +EGI I T L SLSEQE++DC S
Sbjct: 122 AVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVS 181
Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
GC+GG ++ A++FIISN G+ TE YPY A G+CN + P++A I+GY V N
Sbjct: 182 ---YGCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNAN-SFPNSAYITGYSYVRRN 237
Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
+E ++M AV+NQP++ IDAS +FQ+Y+ GVF+G CGT L+H +T +GYG GTKYW
Sbjct: 238 DERSMMYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYW 296
Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
+V+NSWG++WGE GY+RM R + + G+CGIAM +PT
Sbjct: 297 IVRNSWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFPT 335
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 323 bits (827), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 159/317 (50%), Positives = 210/317 (66%), Gaps = 6/317 (1%)
Query: 31 NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
N+ + +E W+ + + Y EKE RFKIFK+N++++ +N ++ +++G+ FA
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDE-HNSVPDRTFEVGLTRFA 94
Query: 91 DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
D TNEEFRA ++++ + S T+ E +P +DWR GAV VKDQG CG
Sbjct: 95 DLTNEEFRAIY--LRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCG 152
Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
CWAFSAV A+EGIN ITT +L SLSEQELVDCD + GC+GG+M+ AFEFI+ N G+
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Query: 211 ATEAKYPYKASD-GSCN-KKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
T+ YPY A+D G CN K N I GYEDVP ++E +L KAVA+QPVSVAI+AS
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272
Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
FQ Y SGV TG CG LDHGV VGYG+ G YW+++NSWG WG++GY+++QR+I
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGST-SGEDYWIIRNSWGLNWGDSGYVKLQRNI 331
Query: 329 DAKEGLCGIAMQASYPT 345
D G CGIAM SYPT
Sbjct: 332 DDPFGKCGIAMMPSYPT 348
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 322 bits (825), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 165/323 (51%), Positives = 214/323 (66%), Gaps = 12/323 (3%)
Query: 32 DATMNERHEMWMAQYGRVYRDNA----EKEMRFKIFKENVEYIASFNNKARNKPYKLGIN 87
D + + W A++G+ +N +++ RF IFK+N+ +I N +N YKLG+
Sbjct: 42 DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLT 101
Query: 88 EFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA----SVPASIDWRKKGAVTGV 143
+F D TN+E+R G R P+ R ++ +V+ +Y A VP ++DWR+KGAV +
Sbjct: 102 KFTDLTNDEYRKLYLG-ARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160
Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
KDQG CG CWAFS AA+EGIN I T +L SLSEQELVDCD S +QGC GGLMD AF+F
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKS-YNQGCNGGLMDYAFQF 219
Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
I+ N GL TE YPY+ G CN N I GYEDVP+ +E AL KA++ QPVSVA
Sbjct: 220 IMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVA 279
Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
I+A G FQ Y SG+FTG CGT LDH V AVGYG+ ++G YW+V+NSWG WGE GYIR
Sbjct: 280 IEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGS-ENGVDYWIVRNSWGPRWGEEGYIR 338
Query: 324 MQRDIDA-KEGLCGIAMQASYPT 345
M+R++ A K G CGIA++ASYP
Sbjct: 339 MERNLAASKSGKCGIAVEASYPV 361
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 317 bits (813), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 215/340 (63%), Gaps = 14/340 (4%)
Query: 10 LVLAAILVLGV-WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
L + +L+L + + ++ ++ ND + +E W+ +YG+ Y E E RF+IFKE +
Sbjct: 13 LFFSTLLILSLAFNAKNLTQRTNDE-VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLR 71
Query: 69 YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---N 125
+I +N N+ YK+G+N+FAD T+EEFR+ G+ S T VS RYE
Sbjct: 72 FIDE-HNADTNRSYKVGLNQFADLTDEEFRSTYLGF------TSGSNKTKVSNRYEPRVG 124
Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
+P+ +DWR GAV +K QG+CG CWAFSA+A +EGIN I T L SLSEQEL+DC
Sbjct: 125 QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184
Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
+ +GC GG + D F+FII+N G+ TE YPY A DG CN N I YE+VP
Sbjct: 185 TQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPY 244
Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
NNE AL AV QPVSVA+DA+G F+ YSSG+FTG CGT +DH VT VGYGT + G Y
Sbjct: 245 NNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT-EGGIDY 303
Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
W+VKNSW TTWGE GY+R+ R++ G CGIA SYP
Sbjct: 304 WIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPV 342
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 315 bits (806), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 214/340 (62%), Gaps = 14/340 (4%)
Query: 10 LVLAAILVLGV-WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
L + +L+L + + ++ ++ ND + +E W+ +YG+ Y E E RF+IFKE +
Sbjct: 13 LFFSTLLILSLAFNAKNLTQRTNDE-VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLR 71
Query: 69 YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---N 125
+I +N N+ YK+G+N+FAD T+EEFR+ L S T VS RYE
Sbjct: 72 FIDE-HNADTNRSYKVGLNQFADLTDEEFRS------TYLRFTSGSNKTKVSNRYEPRVG 124
Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
+P+ +DWR GAV +K QG+CG CWAFSA+A +EGIN I T L SLSEQEL+DC
Sbjct: 125 QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184
Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
+ +GC GG + D F+FII+N G+ TE YPY A DG CN N I YE+VP
Sbjct: 185 TQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPY 244
Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
NNE AL AV QPVSVA+DA+G F+ YSSG+FTG CGT +DH VT VGYGT + G Y
Sbjct: 245 NNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGT-EGGIDY 303
Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
W+VKNSW TTWGE GY+R+ R++ G CGIA SYP
Sbjct: 304 WIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPV 342
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 311 bits (798), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 163/352 (46%), Positives = 229/352 (65%), Gaps = 16/352 (4%)
Query: 2 AMILLENKLVLAAI-----LVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEK 56
AM++L +V+A+ + + + + ++ DA + E WM ++G+VY AEK
Sbjct: 7 AMLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAEK 66
Query: 57 EMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSET 116
E R IF++N+ +I N A N Y+LG+ FAD + E++ +G R P R+
Sbjct: 67 ERRLTIFEDNLRFIN--NRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPP--RNHVF 122
Query: 117 TDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLT 173
S RY+ ++ +P S+DWR +GAVT VKDQG C CWAFS V A+EG+N I T +L
Sbjct: 123 MTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELV 182
Query: 174 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK-EANP 232
+LSEQ+L++C+ E+ GC GG ++ A+EFI+ N GL T+ YPYKA +G C+ + + N
Sbjct: 183 TLSEQDLINCNK--ENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENN 240
Query: 233 SAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVT 292
I GYE++P+N+E+ALMKAVA+QPV+ ID+S +FQ Y SGVF G CGT L+HGV
Sbjct: 241 KNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVV 300
Query: 293 AVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
VGYGT ++G YWLVKNS G TWGE GY++M R+I GLCGIAM+ASYP
Sbjct: 301 VVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYP 351
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 309 bits (791), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 158/316 (50%), Positives = 210/316 (66%), Gaps = 9/316 (2%)
Query: 32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
DA E WM ++G+VY AEKE R IF++N+ +I N A N Y+LG+N FAD
Sbjct: 49 DAEATLMFESWMVKHGKVYDSVAEKERRLTIFEDNLRFIT--NRNAENLSYRLGLNRFAD 106
Query: 92 QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV-PASIDWRKKGAVTGVKDQGQCG 150
+ E+ +G R P T+ ++ + V P S+DWR +GAVT VKDQG C
Sbjct: 107 LSLHEYGEICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCR 166
Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
CWAFS V A+EG+N I T +L +LSEQ+L++C+ E+ GC GG ++ A+EFI++N GL
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMNNGGL 224
Query: 211 ATEAKYPYKASDGSCNK--KEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
T+ YPYKA +G C KE N + I GYE++P+N+EAALMKAVA+QPV+ +D+S
Sbjct: 225 GTDNDYPYKALNGVCEGRLKEDNKNVM-IDGYENLPANDEAALMKAVAHQPVTAVVDSSS 283
Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
+FQ Y SGVF G CGT L+HGV VGYGT ++G YW+VKNS G TWGE GY++M R+I
Sbjct: 284 REFQLYESGVFDGTCGTNLNHGVVVVGYGT-ENGRDYWIVKNSRGDTWGEAGYMKMARNI 342
Query: 329 DAKEGLCGIAMQASYP 344
GLCGIAM+ASYP
Sbjct: 343 ANPRGLCGIAMRASYP 358
>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
lycopersicum PE=2 SV=1
Length = 346
Score = 285 bits (730), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 134/219 (61%), Positives = 164/219 (74%), Gaps = 2/219 (0%)
Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
S+P SIDWR+KG + GVKDQG CG CWAFSAVAAME IN I T L SLSEQELVDCD S
Sbjct: 17 SLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRS 76
Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
++GC+GGLMD AFEF+I N G+ TE YPYK +G C++ N KI YEDVP N
Sbjct: 77 -YNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVN 135
Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
NE AL KAVA+QPVS+A++A G DFQ Y SG+FTG+CGT +DHGV GYGT ++G YW
Sbjct: 136 NEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGT-ENGMDYW 194
Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
+V+NSWG ENGY+R+QR++ + GLCG+A++ SYP
Sbjct: 195 IVRNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPV 233
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 278 bits (712), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 148/347 (42%), Positives = 206/347 (59%), Gaps = 14/347 (4%)
Query: 3 MILLENKLVLAAILVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRVYRDNAEKEM 58
+I L L++ L + +S+ +D T ER + WM ++ ++Y EK
Sbjct: 10 IIFLATCLIIHMGLSSADFYTVGYSQ--DDLTSIERLIQLFDSWMLKHNKIYESIDEKIY 67
Query: 59 RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGY-KRRLPSVRSSETT 117
RF+IF++N+ YI N K N Y LG+N FAD +N+EF+ G+ + +
Sbjct: 68 RFEIFRDNLMYIDETNKK--NNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNE 125
Query: 118 DVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
D ++++ + P SIDWR KGAVT VK+QG CG CWAFS +A +EGIN I T L LSE
Sbjct: 126 DFTYKHV-TNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSE 184
Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
QELVDCD GC+GG + +++ +N G+ T YPY+A C + KI
Sbjct: 185 QELVDCDK--HSYGCKGGYQTTSLQYV-ANNGVHTSKVYPYQAKQYKCRATDKPGPKVKI 241
Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
+GY+ VPSN E + + A+ANQP+SV ++A G FQ Y SGVF G CGT+LDH VTAVGYG
Sbjct: 242 TGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYG 301
Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
T+ DG Y ++KNSWG WGE GY+R++R +G CG+ + YP
Sbjct: 302 TS-DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 278 bits (710), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 152/320 (47%), Positives = 193/320 (60%), Gaps = 12/320 (3%)
Query: 35 MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFADQ 92
+ E + Q+ + Y + E+ R KIF EN IA N A+ K YKLG+N++AD
Sbjct: 24 IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83
Query: 93 TNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQC 149
+ EF+ NGY L + T V Y + +VP S+DWR+ GAVTGVKDQG C
Sbjct: 84 LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143
Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
G CWAFS+ A+EG + L SLSEQ LVDC T + GC GGLMD+AF +I N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASG 268
+ TE YPY+ D SC+ +A A +G+ D+P +E + KAVA PVSVAIDAS
Sbjct: 204 IDTEKSYPYEGIDDSCHFNKATIGATD-TGFVDIPEGDEEKMKKAVATMGPVSVAIDASH 262
Query: 269 SDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
FQ YS GV+ +C + LDHGV VGYGT + G YWLVKNSWGTTWGE GYI+M R
Sbjct: 263 ESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMAR 322
Query: 327 DIDAKEGLCGIAMQASYPTA 346
+ + + CGIA +SYPT
Sbjct: 323 NQNNQ---CGIATASSYPTV 339
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 272 bits (695), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 151/327 (46%), Positives = 196/327 (59%), Gaps = 18/327 (5%)
Query: 29 TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGI 86
+ D M E H + ++ + Y+D E+ R KIF EN IA N + A K +KL +
Sbjct: 50 SFADVVMEEWHTFKL-EHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAV 108
Query: 87 NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR------YENASVPASIDWRKKGAV 140
N++AD + EFR NG+ L + D SF+ + ++P S+DWR KGAV
Sbjct: 109 NKYADLLHHEFRQLMNGFNYTLH--KQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAV 166
Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
T VKDQG CG CWAFS+ A+EG + + L SLSEQ LVDC T + GC GGLMD+A
Sbjct: 167 TAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 226
Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QP 259
F +I N G+ TE YPY+A D SC+ + A G+ D+P +E + +AVA P
Sbjct: 227 FRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATD-RGFTDIPQGDEKKMAEAVATVGP 285
Query: 260 VSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
VSVAIDAS FQFYS GV+ QC + LDHGV VG+GT + G YWLVKNSWGTTWG
Sbjct: 286 VSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWG 345
Query: 318 ENGYIRMQRDIDAKEGLCGIAMQASYP 344
+ G+I+M R+ KE CGIA +SYP
Sbjct: 346 DKGFIKMLRN---KENQCGIASASSYP 369
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 271 bits (692), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 154/362 (42%), Positives = 206/362 (56%), Gaps = 36/362 (9%)
Query: 1 MAMILLENKLVLAAI-------LVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRV 49
MAMI +KL+ AI L G ++ +S+ ND T ER E WM ++ ++
Sbjct: 1 MAMIPSISKLLFVAICLFVYMGLSFGDFSIVGYSQ--NDLTSTERLIQLFESWMLKHNKI 58
Query: 50 YRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP 109
Y++ EK RF+IFK+N++YI N K N Y LG+N FAD +N+EF+ G
Sbjct: 59 YKNIDEKIYRFEIFKDNLKYIDETNKK--NNSYWLGLNVFADMSNDEFKEKYTG------ 110
Query: 110 SVRSSETTDVSFRYE------NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEG 163
S+ + TT YE + ++P +DWR+KGAVT VK+QG CG CWAFSAV +EG
Sbjct: 111 SIAGNYTT-TELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEG 169
Query: 164 INHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDG 223
I I T L SEQEL+DCD GC GG A + +++ G+ YPY+
Sbjct: 170 IIKIRTGNLNEYSEQELLDCDR--RSYGCNGGYPWSALQ-LVAQYGIHYRNTYPYEGVQR 226
Query: 224 SCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQC 283
C +E P AAK G V NE AL+ ++ANQPVSV ++A+G DFQ Y G+F G C
Sbjct: 227 YCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPC 286
Query: 284 GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASY 343
G ++DH V AVGY G Y L+KNSWGT WGENGYIR++R G+CG+ + Y
Sbjct: 287 GNKVDHAVAAVGY-----GPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFY 341
Query: 344 PT 345
P
Sbjct: 342 PV 343
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 270 bits (691), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 158/357 (44%), Positives = 207/357 (57%), Gaps = 23/357 (6%)
Query: 1 MAMILLENKLVLAAILVL-------GVWAPQSWSRTLNDATMNER----HEMWMAQYGRV 49
MAMI +KL+ AI + G ++ +S+ +D T ER WM + +
Sbjct: 1 MAMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQ--DDLTSTERLIQLFNSWMLNHNKF 58
Query: 50 YRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP 109
Y + EK RF+IFK+N+ YI N K N Y LG+NEFAD +N+EF Y L
Sbjct: 59 YENVDEKLYRFEIFKDNLNYIDETNKK--NNSYWLGLNEFADLSNDEFNEK---YVGSLI 113
Query: 110 SVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHIT 168
++ D F E+ ++P ++DWRKKGAVT V+ QG CG CWAFSAVA +EGIN I
Sbjct: 114 DATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIR 173
Query: 169 TRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK 228
T KL LSEQELVDC+ GC+GG A E++ N G+ +KYPYKA G+C K
Sbjct: 174 TGKLVELSEQELVDCER--RSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAK 230
Query: 229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
+ K SG V NNE L+ A+A QPVSV +++ G FQ Y G+F G CGT++D
Sbjct: 231 QVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVD 290
Query: 289 HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
H VTAVGYG + L+KNSWGT WGE GYIR++R G+CG+ + YPT
Sbjct: 291 HAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPT 346
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 260 bits (665), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 124/218 (56%), Positives = 154/218 (70%), Gaps = 4/218 (1%)
Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
+P SIDWR+ GAV VK+QG CG CWAFS VAA+EGIN I T L SLSEQ+LVDC T+
Sbjct: 3 LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA- 61
Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
+ GC GG M+ AF+FI++N G+ +E YPY+ DG CN N I YE+VPS+N
Sbjct: 62 -NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNST-VNAPVVSIDSYENVPSHN 119
Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
E +L KAVANQPVSV +DA+G DFQ Y SG+FTG C +H +T VGYGT +D +W+
Sbjct: 120 EQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTEND-KDFWI 178
Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
VKNSWG WGE+GYIR +R+I+ +G CGI ASYP
Sbjct: 179 VKNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 260 bits (664), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 146/310 (47%), Positives = 191/310 (61%), Gaps = 14/310 (4%)
Query: 40 EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEF 97
E + +YGR Y D E R IF++N +YI FN K N + L +N+F D T EEF
Sbjct: 21 EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80
Query: 98 RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
A G R RS+ + + E +DWR KGAVT VKDQGQCG CWAFS
Sbjct: 81 NAVMKGNIPR----RSAPVSVFYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFST 136
Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
++EG + + T L SL+EQ+LVDC QGC GG M+DAF++I +N G+ TEA YP
Sbjct: 137 TGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYP 196
Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSS 276
Y+A DGSC + ++N AA SG+ ++ S +E L +AV + P+SV IDA+ S FQFYSS
Sbjct: 197 YEARDGSC-RFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSS 255
Query: 277 GV-FTGQCG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
GV + C + LDH V AVGYG+ + G +WLVKNSW T+WG+ GYI+M R+ +
Sbjct: 256 GVYYEPSCSPSYLDHAVLAVGYGS-EGGQDFWLVKNSWATSWGDAGYIKMSRN---RNNN 311
Query: 335 CGIAMQASYP 344
CGIA ASYP
Sbjct: 312 CGIATVASYP 321
>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
Length = 215
Score = 258 bits (659), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 123/218 (56%), Positives = 155/218 (71%), Gaps = 5/218 (2%)
Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
+P+ +DWR KGAV +K+Q QCG CWAFSAVAA+E IN I T +L SLSEQELVDCDT+
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTA- 59
Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
GC GG M++AF++II+N G+ T+ YPY A GSC K I+G++ V NN
Sbjct: 60 -SHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSC--KPYRLRVVSINGFQRVTRNN 116
Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
E+AL AVA+QPVSV ++A+G+ FQ YSSG+FTG CGT +HGV VGYGT G YW+
Sbjct: 117 ESALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGT-QSGKNYWI 175
Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
V+NSWG WG GYI M+R++ + GLCGIA SYPT
Sbjct: 176 VRNSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYPT 213
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
SV=1
Length = 321
Score = 258 bits (658), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 151/318 (47%), Positives = 194/318 (61%), Gaps = 16/318 (5%)
Query: 33 ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFA 90
AT + + + QYGR Y D E+ R ++F++N + I FN K N +K+ +N+F
Sbjct: 14 ATASPSWDHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFG 73
Query: 91 DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
D TNEEF A GYK+ S F E + A +DWR K VT VKDQ QCG
Sbjct: 74 DMTNEEFNAVMKGYKKG-----SRGEPKAVFTAEAGPMAADVDWRTKALVTPVKDQEQCG 128
Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
CWAFSA A+EG + + +L SLSEQ+LVDC T + GC GG M AF++I N G+
Sbjct: 129 SCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI 188
Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGS 269
TE+ YPY+A D SC + +AN A +G +V + E AL +AV+ P+SVAIDAS
Sbjct: 189 DTESSYPYEAEDRSC-RFDANSIGAICTGSVEV-QHTEEALQEAVSGVGPISVAIDASHF 246
Query: 270 DFQFYSSGVFTGQ-CG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
FQFYSSGV+ Q C T LDHGV AVGYGT + YWLVKNSWG++WG+ GYI+M R+
Sbjct: 247 SFQFYSSGVYYEQNCSPTFLDHGVLAVGYGT-ESTKDYWLVKNSWGSSWGDAGYIKMSRN 305
Query: 328 IDAKEGLCGIAMQASYPT 345
D CGIA + SYPT
Sbjct: 306 RDNN---CGIASEPSYPT 320
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 256 bits (655), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 150/354 (42%), Positives = 206/354 (58%), Gaps = 19/354 (5%)
Query: 1 MAMILLENKLVLAAILVLGVWAPQSWSRTL-----NDATMNER----HEMWMAQYGRVYR 51
MA+I +KL+ AI + G + ++ +D T ER WM ++ + Y+
Sbjct: 1 MAIICSFSKLLFVAICLFGHMSLSYCDFSIVGYSQDDLTSTERLIQLFNSWMLKHNKNYK 60
Query: 52 DNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSV 111
+ EK RF+IFK+N++YI NK N Y LG+NEF+D +N+EF+ Y LP
Sbjct: 61 NVDEKLYRFEIFKDNLKYIDE-RNKMING-YWLGLNEFSDLSNDEFKEK---YVGSLPED 115
Query: 112 RSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTR 170
+++ D F E+ +P S+DWR KGAVT VK QG C CWAFS VA +EGIN I T
Sbjct: 116 YTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKIKTG 175
Query: 171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
L LSEQELVDCD + GC G + +++ N G+ AKYPY A +C +
Sbjct: 176 NLVELSEQELVDCDK--QSYGCNRGYQSTSLQYVAQN-GIHLRAKYPYIAKQQTCRANQV 232
Query: 231 NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHG 290
K +G V SNNE +L+ A+A+QPVSV ++++G DFQ Y G+F G CGT++DH
Sbjct: 233 GGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVDHA 292
Query: 291 VTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
VTAVGYG + L+KNSWG WGENGYIR++R G+CG+ + YP
Sbjct: 293 VTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGNSPGVCGVYRSSYYP 345
>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 254 bits (650), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 192/323 (59%), Gaps = 19/323 (5%)
Query: 32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
D T + W + + R+Y N E+E R I+++N+ I N + N + + +N F
Sbjct: 22 DQTFSAEWHQWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAF 80
Query: 90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
D TNEEFR NGY+ + + + +P S+DWR+KG VT VK+QGQC
Sbjct: 81 GDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-----IPKSVDWREKGCVTPVKNQGQC 135
Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
G CWAFSA +EG + T KL SLSEQ LVDC + +QGC GGLMD AF++I N G
Sbjct: 136 GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGG 195
Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
L +E YPY+A DGSC K A + A +G+ D+P E ALMKAVA P+SVA+DAS
Sbjct: 196 LDSEESYPYEAKDGSC-KYRAEFAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASH 253
Query: 269 SDFQFYSSGV-FTGQCGTE-LDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIR 323
QFYSSG+ + C ++ LDHGV VGY GT + KYWLVKNSWG+ WG GYI+
Sbjct: 254 PSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIK 313
Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
+ +D D CG+A ASYP
Sbjct: 314 IAKDRDNH---CGLATAASYPVV 333
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 254 bits (650), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 141/321 (43%), Positives = 193/321 (60%), Gaps = 15/321 (4%)
Query: 31 NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
N+ + +E W+ + G+ Y EKE RFKIFK+N++ I N+ N+ Y+ G+N+F+
Sbjct: 33 NEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDP-NRSYERGLNKFS 91
Query: 91 DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTG-VKDQ 146
D T +EF+A G K S+ +DV+ RY E +P +DWR++GAV VK Q
Sbjct: 92 DLTADEFQASYLGGKMEKKSL-----SDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQ 146
Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
G+CG CWAF+A A+EGIN ITT +L SLSEQEL+DCD ++ GC GG AFEFI
Sbjct: 147 GECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKE 206
Query: 207 NKGLATEAKYPYKASD-GSCNKKEANPS-AAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
N G+ ++ Y Y D +C E + I+G+E VP N+E +L KAVA QP+SV I
Sbjct: 207 NGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMI 266
Query: 265 DASGSDFQFYSSGVFTGQCGTEL-DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
A ++ Y SGV+ G C DH V VGYGT+ D YWL++NSWG WGE GY+R
Sbjct: 267 SA--ANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLR 324
Query: 324 MQRDIDAKEGLCGIAMQASYP 344
+QR+ G C +A+ YP
Sbjct: 325 LQRNFHEPTGKCAVAVAPVYP 345
>sp|Q9GKL8|CATL1_CHLAE Cathepsin L1 OS=Chlorocebus aethiops GN=CTSL1 PE=1 SV=1
Length = 333
Score = 253 bits (646), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 153/347 (44%), Positives = 200/347 (57%), Gaps = 23/347 (6%)
Query: 8 NKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
N + A L LG+ S + T N ++ + W A + R+Y N E+ R ++++N+
Sbjct: 2 NPTFILAALCLGI---ASATLTFNH-SLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNM 56
Query: 68 EYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
+ I N + + + +N F D T+EEFR NG++ R P R + YE
Sbjct: 57 KMIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQNRKP--RKGKVFQEPLFYE- 113
Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
P S+DWR+KG VT VK+QGQCG CWAFSA A+EG T KL SLSEQ LVDC
Sbjct: 114 --APRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
++GC GGLMD AF+++ N GL +E YPY+A++ SC K S A +G+ D+P
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESC-KYNPEYSVANDTGFVDIP- 229
Query: 246 NNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYG---TA 299
E ALMKAVA P+SVAIDA F FY G+ F C +E +DHGV VGYG T
Sbjct: 230 KQEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
D +KYWLVKNSWG WG GYI+M +D + CGIA ASYPT
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKD---RRNHCGIASAASYPTV 333
>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 253 bits (645), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 192/321 (59%), Gaps = 19/321 (5%)
Query: 32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
D T N + W + + R+Y N E+E R ++++N+ I N + N + + +N F
Sbjct: 22 DQTFNAQWHQWKSTHRRLYGTN-EEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAF 80
Query: 90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
D TNEEFR NGY+ + + + +P ++DWR+KG VT VK+QGQC
Sbjct: 81 GDMTNEEFRQIVNGYRHQKHKKGRLFQEPLMLQ-----IPKTVDWREKGCVTPVKNQGQC 135
Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
G CWAFSA +EG + T KL SLSEQ LVDC +QGC GGLMD AF++I N G
Sbjct: 136 GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGG 195
Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
L +E YPY+A DGSC K A + A +G+ D+P E ALMKAVA P+SVA+DAS
Sbjct: 196 LDSEESYPYEAKDGSC-KYRAEYAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASH 253
Query: 269 SDFQFYSSGV-FTGQCGT-ELDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIR 323
QFYSSG+ + C + +LDHGV VGY GT + KYWLVKNSWG WG +GYI+
Sbjct: 254 PSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIK 313
Query: 324 MQRDIDAKEGLCGIAMQASYP 344
+ +D + CG+A ASYP
Sbjct: 314 IAKD---RNNHCGLATAASYP 331
>sp|P07711|CATL1_HUMAN Cathepsin L1 OS=Homo sapiens GN=CTSL1 PE=1 SV=2
Length = 333
Score = 251 bits (642), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 152/348 (43%), Positives = 200/348 (57%), Gaps = 25/348 (7%)
Query: 8 NKLVLAAILVLGVWAPQSWSRTLN-DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
N ++ A LG+ S TL D ++ + W A + R+Y N E+ R ++++N
Sbjct: 2 NPTLILAAFCLGIA-----SATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKN 55
Query: 67 VEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
++ I N + R + + +N F D T+EEFR NG++ R P R + YE
Sbjct: 56 MKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKP--RKGKVFQEPLFYE 113
Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
P S+DWR+KG VT VK+QGQCG CWAFSA A+EG T +L SLSEQ LVDC
Sbjct: 114 ---APRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCS 170
Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
++GC GGLMD AF+++ N GL +E YPY+A++ SC K S A +G+ D+P
Sbjct: 171 GPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIP 229
Query: 245 SNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYG---T 298
E ALMKAVA P+SVAIDA F FY G+ F C +E +DHGV VGYG T
Sbjct: 230 K-QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFEST 288
Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
D KYWLVKNSWG WG GY++M +D + CGIA ASYPT
Sbjct: 289 ESDNNKYWLVKNSWGEEWGMGGYVKMAKD---RRNHCGIASAASYPTV 333
>sp|P25975|CATL1_BOVIN Cathepsin L1 OS=Bos taurus GN=CTSL1 PE=1 SV=3
Length = 334
Score = 251 bits (641), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 154/350 (44%), Positives = 202/350 (57%), Gaps = 28/350 (8%)
Query: 8 NKLVLAAILVLGVW--APQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
N +L LGV AP+ D ++ W A + R+Y N E+E R ++++
Sbjct: 2 NPSFFLTVLCLGVASAAPKL------DPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEK 54
Query: 66 NVEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
N + I N + +++ +N F D TNEEFR NG++ + + +
Sbjct: 55 NKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQNQ-----KHKKGKLFHEP 109
Query: 124 ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
VP S+DW KKG VT VK+QGQCG CWAFSA A+EG T KL SLSEQ LVDC
Sbjct: 110 LLVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDC 169
Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD-GSCNKKEANPSAAKISGYED 242
+ +QGC GGLMD+AF++I N GL +E YPY A+D SCN K SAA +G+ D
Sbjct: 170 SRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYK-PECSAANDTGFVD 228
Query: 243 VPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGY--- 296
+P E ALMKAVA P+SVAIDA + FQFY SG+ + C + +LDHGV VGY
Sbjct: 229 IPQ-REKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFE 287
Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
GT + K+W+VKNSWG WG NGY++M +D + CGIA ASYPT
Sbjct: 288 GTDSNNNKFWIVKNSWGPEWGWNGYVKMAKD---QNNHCGIATAASYPTV 334
>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
Length = 340
Score = 248 bits (633), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 154/343 (44%), Positives = 205/343 (59%), Gaps = 22/343 (6%)
Query: 13 AAILVLGVWAPQSWSRTL----NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
AAI L W P S + D T++ ++W + + Y+D E+E+R I+++N++
Sbjct: 7 AAIRWL-FWMPLVCSVAMEQLQRDPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLK 65
Query: 69 YIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR-YEN 125
+I N + Y++G+N+ D TNEE R G R+P R S T V+FR Y N
Sbjct: 66 FIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC-RMG-ALRIP--RQSPKT-VTFRSYSN 120
Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
++P ++DWR+KG VT VK QG CG CWAFSAV A+EG + T KL SLS Q LVDC
Sbjct: 121 RTLPDTVDWREKGCVTEVKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSN 180
Query: 186 SGE--DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
+ ++GC GG M +AF++II N G+ +A YPYKA+D C+ N AA S Y +
Sbjct: 181 EEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPYKATDEKCHYNSKN-RAATCSRYIQL 239
Query: 244 PSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QCGTELDHGVTAVGYGTADD 301
P +E AL +AVA + PVSV IDAS S F FY SGV+ C ++HGV VGYGT D
Sbjct: 240 PFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL-D 298
Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
G YWLVKNSWG +G+ GYIRM R+ + CGIA SYP
Sbjct: 299 GKDYWLVKNSWGLNFGDQGYIRMARN---NKNHCGIASYCSYP 338
>sp|Q5E998|CATL2_BOVIN Cathepsin L2 OS=Bos taurus GN=CTSL2 PE=2 SV=1
Length = 334
Score = 248 bits (632), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 153/350 (43%), Positives = 201/350 (57%), Gaps = 28/350 (8%)
Query: 8 NKLVLAAILVLGVW--APQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
N +L LGV AP+ D ++ W A + R+Y N E+E R ++++
Sbjct: 2 NPSFFLTVLCLGVASAAPKL------DPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEK 54
Query: 66 NVEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
N + I N + +++ +N F D TNEEFR NG++ + + +
Sbjct: 55 NKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQNQ-----KHKKGKLFHEP 109
Query: 124 ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
VP S+DW KKG VT VK+QGQCG CWAFSA A+EG T KL SLSEQ LVDC
Sbjct: 110 LLVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDC 169
Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD-GSCNKKEANPSAAKISGYED 242
+ +QGC GGLMD+AF++I N L +E YPY A+D SCN K SAA +G+ D
Sbjct: 170 SRAQGNQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYK-PECSAANDTGFVD 228
Query: 243 VPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGY--- 296
+P E ALMKAVA P+SVAIDA + FQFY SG+ + C + +LDHGV VGY
Sbjct: 229 IPQ-REKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFE 287
Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
GT + K+W+VKNSWG WG NGY++M +D + CGIA ASYPT
Sbjct: 288 GTDSNNNKFWIVKNSWGPEWGWNGYVKMAKD---QNNHCGIATAASYPTV 334
>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
Length = 334
Score = 248 bits (632), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 189/313 (60%), Gaps = 18/313 (5%)
Query: 42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
W A +GR+Y N E+ R ++++N++ I N + + + +N F D TNEEFR
Sbjct: 32 WKATHGRLYGMN-EEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQ 90
Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
NG++ + + + S E VP S+DWR+KG VT VK+QGQCG CWAFSA
Sbjct: 91 VMNGFQNQ--KHKKGKVFHESLVLE---VPKSVDWREKGYVTAVKNQGQCGSCWAFSATG 145
Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
A+EG T KL SLSEQ LVDC +QGC GGLMD+AF+++ N GL TE YPY
Sbjct: 146 ALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYL 205
Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV 278
+ + + SAA +G+ D+P E ALMKAVA P+SVAIDA S FQFY SG+
Sbjct: 206 GRETNSCTYKPECSAANDTGFVDIPQ-REKALMKAVATVGPISVAIDAGHSSFQFYKSGI 264
Query: 279 FTG-QCGT-ELDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
+ C + +LDHGV VGY GT + +K+W+VKNSWG WG NGY++M +D +
Sbjct: 265 YYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKD---QNN 321
Query: 334 LCGIAMQASYPTA 346
CGI+ ASYPT
Sbjct: 322 HCGISTAASYPTV 334
>sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium discoideum GN=cprB PE=2 SV=1
Length = 376
Score = 246 bits (628), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 141/343 (41%), Positives = 194/343 (56%), Gaps = 43/343 (12%)
Query: 42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
W ++ R Y ++E R+ IFK N++Y+ ++N+K ++ LG+N FAD TNEE+R
Sbjct: 39 WTLKFNRQY-SSSEFSNRYSIFKSNMDYVDNWNSKGDSQTV-LGLNNFADITNEEYRKTY 96
Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
G + S + +V + + P SIDWR K AVT +KDQGQCG CW+FS +
Sbjct: 97 LGTRVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGST 156
Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
EG + + T+KL SLSEQ LVDC E+ GC+GGLM++AF++II NKG+ TE+ YPY A
Sbjct: 157 EGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAE 216
Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV-FT 280
GS + A I GY ++ + +E +L + PVSVAIDAS + FQ Y+SG+ +
Sbjct: 217 TGSTCLFNKSDIGATIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYE 276
Query: 281 GQCG-TELDHGVTAVGYGTA---DDG---------------------------------T 303
+C TELDHGV VGYG D+G
Sbjct: 277 PKCSPTELDHGVLVVGYGVQGKDDEGPVLNRKQTIVIHKNEDNKVESSDDSSDSVRPKAN 336
Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
YW+VKNSWGT+WG GYI M +D ++ CGIA +SYP A
Sbjct: 337 NYWIVKNSWGTSWGIKGYILMSKD---RKNNCGIASVSSYPLA 376
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 246 bits (627), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 195/316 (61%), Gaps = 26/316 (8%)
Query: 42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR--- 98
WM + Y + E R++ FK+N++Y+ ++N+K LG+N+ AD +NEE+R
Sbjct: 37 WMRSNNKAYT-HKEFMPRYEEFKKNMDYVHNWNSKGSKTV--LGLNQHADLSNEEYRLNY 93
Query: 99 ------APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCC 152
NGY +R +R + F+ P ++DWR+K AVT VKDQGQCG C
Sbjct: 94 LGTRAHIKLNGYHKRNLGLRLNRP---QFKQ-----PLNVDWREKDAVTPVKDQGQCGSC 145
Query: 153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
++FS ++EG+ I T KL SLSEQ ++DC +S ++GC GGLM +AFE+II N GL +
Sbjct: 146 YSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNS 205
Query: 213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQ 272
E +YPY+ K + AAKI+ Y+++ + +E L A+ PVSVAIDAS + FQ
Sbjct: 206 EEQYPYEMKVNDECKFQEGSVAAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQ 265
Query: 273 FYSSGV-FTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
Y++GV + C +E LDHGV AVG GT D+G Y++VKNSWG +WG NGYI M R+
Sbjct: 266 LYTAGVYYEPACSSEDLDHGVLAVGMGT-DNGEDYYIVKNSWGPSWGLNGYIHMARN--- 321
Query: 331 KEGLCGIAMQASYPTA 346
K+ CGI+ ASYP A
Sbjct: 322 KDNNCGISTMASYPIA 337
>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 245 bits (626), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 120/217 (55%), Positives = 150/217 (69%), Gaps = 4/217 (1%)
Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
+P SIDWR+KGAV VK+QG CG CWAF A+AA+EGIN I T L SLSEQ+LVDC T
Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCST-- 60
Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
+ GCEGG AF++II+N G+ +E YPY ++G+C+ KE N I Y +VPSN+
Sbjct: 61 RNHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCDTKE-NAHVVSIDSYRNVPSND 119
Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
E +L KAVANQPVSV +DA+G DFQ Y +G+FTG C +H T G T +D YW
Sbjct: 120 EKSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTVGGRETEND-KDYWT 178
Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
VKNSWG WGE+GYIR++R+I G CGIA+ SYP
Sbjct: 179 VKNSWGKNWGESGYIRVERNIAESSGKCGIAISPSYP 215
>sp|Q9GL24|CATL1_CANFA Cathepsin L1 OS=Canis familiaris GN=CTSL1 PE=2 SV=1
Length = 333
Score = 244 bits (624), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 192/323 (59%), Gaps = 19/323 (5%)
Query: 32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
D ++N + W A + R+Y N E+ R ++++N++ I N + + + +N F
Sbjct: 22 DQSLNAQWYQWKATHRRLYGMN-EEGWRRAVWEKNMKMIELHNREYSQGKHGFTMAMNAF 80
Query: 90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
D TNEEFR NG++ + + + F A +P S+DWR+KG VT VK+QGQC
Sbjct: 81 GDMTNEEFRQVMNGFQNQ-KHKKGKMFQEPLF----AEIPKSVDWREKGYVTPVKNQGQC 135
Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
G CWAFSA A+EG T KL SLSEQ LVDC + ++GC GGLMD+AF ++ N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGG 195
Query: 210 LATEAKYPYKASDG-SCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDAS 267
L +E YPY D +CN K SAA +G+ D+P E ALMKAVA P+SVAIDA
Sbjct: 196 LDSEESYPYLGRDTETCNYK-PECSAANDTGFVDLPQ-REKALMKAVATLGPISVAIDAG 253
Query: 268 GSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYG--TADDGTKYWLVKNSWGTTWGENGYIR 323
FQFY SG+ F C + +LDHGV VGYG D K+W+VKNSWG WG NGY++
Sbjct: 254 HQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGWNGYVK 313
Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
M +D + CGIA ASYPT
Sbjct: 314 MAKD---QNNHCGIATAASYPTV 333
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
Length = 344
Score = 244 bits (623), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 189/323 (58%), Gaps = 33/323 (10%)
Query: 42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
WM + + Y E R+ IFK N++Y+ +N+K LG+N FAD TNEE+R
Sbjct: 33 WMITHQKSYTSE-EFGARYNIFKANMDYVQQWNSKGSETV--LGLNNFADITNEEYRNTY 89
Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
G K S+ ++ V S AS DWR +GAVT VK+QGQCG CW+FS +
Sbjct: 90 LGTKFDASSLIGTQEEKVF----TTSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGST 145
Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
EG + + +L SLSEQ L+DC T E+ GC+GGLM AFE+II+N G+ TE+ YPYKA
Sbjct: 146 EGAHFQSKGELVSLSEQNLIDCST--ENSGCDGGLMTYAFEYIINNNGIDTESSYPYKAE 203
Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV-FT 280
+G C K N S A +S Y+ V + +E++L AV PVSVAIDAS FQ Y+SG+ +
Sbjct: 204 NGKCEYKSEN-SGATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYE 262
Query: 281 GQCGTE-LDHGVTAVGY------------------GTADDGTKYWLVKNSWGTTWGENGY 321
+C +E LDHGV AVGY +A +YW+VKNSWGT+WG GY
Sbjct: 263 PECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTSWGIEGY 322
Query: 322 IRMQRDIDAKEGLCGIAMQASYP 344
I M R+ D CGIA AS+P
Sbjct: 323 ILMSRNRDNN---CGIASSASFP 342
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1 SV=2
Length = 322
Score = 244 bits (623), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 188/319 (58%), Gaps = 17/319 (5%)
Query: 33 ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFA 90
A N E + ++GR Y D E+ R +F +N++YI FN K Y L IN+F+
Sbjct: 14 AAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFS 73
Query: 91 DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
D TNE+F A GYK+ +TD A +DWR KGAVT VKDQGQCG
Sbjct: 74 DMTNEKFNAVMKGYKKGPRPAAVFTSTDA------APESTEVDWRTKGAVTPVKDQGQCG 127
Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDC-DTSGEDQGCEGGLMDDAFEFIISNKG 209
CWAFS +EG + + T +L SLSEQ+LVDC S +QGC GG ++ A ++ N G
Sbjct: 128 SCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGG 187
Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
+ TE+ YPY+A D +C + +N A +GY + +E+AL A + P+SVAIDAS
Sbjct: 188 VDTESSYPYEARDNTC-RFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASH 246
Query: 269 SDFQFYSSGV-FTGQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
FQ Y +GV + C ++LDH V AVGYG+ + G +WLVKNSW T+WGE+GYI+M R
Sbjct: 247 RSFQSYYTGVYYEPSCSSSQLDHAVLAVGYGS-EGGQDFWLVKNSWATSWGESGYIKMAR 305
Query: 327 DIDAKEGLCGIAMQASYPT 345
+ + CGIA A YPT
Sbjct: 306 N---RNNNCGIATDACYPT 321
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.315 0.130 0.392
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 129,859,603
Number of Sequences: 539616
Number of extensions: 5439372
Number of successful extensions: 12950
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 224
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 11911
Number of HSP's gapped (non-prelim): 276
length of query: 346
length of database: 191,569,459
effective HSP length: 118
effective length of query: 228
effective length of database: 127,894,771
effective search space: 29160007788
effective search space used: 29160007788
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (28.5 bits)