BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 047793
(324 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 363 bits (931), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 180/307 (58%), Positives = 218/307 (71%), Gaps = 7/307 (2%)
Query: 20 HEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKA 79
+E+W S + V ++ EK+KRF +FK N + + N +KPYKL +N+FAD TN EF+
Sbjct: 38 YERWRSHH-TVSRSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFRN 95
Query: 80 FRNGYRRPDGLTSRKGT----SFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSA 135
+G + R G +F YE V VPA++DWRK GAVT +K+QG CGSCWAFS
Sbjct: 96 TYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFST 155
Query: 136 VAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYP 195
+ A EGI Q+ T KL+SLSEQELV CDT + GC GG M+ AF+FI GITTEANYP
Sbjct: 156 IVAVEGINQIKTNKLVSLSEQELVDCDTD-QNQGCNGGLMDYAFEFIKQRGGITTEANYP 214
Query: 196 YQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSG 255
Y+A DGTC+ + E + I G+E VP N E ALLKAVANQPV+V+IDA GS FQFYS G
Sbjct: 215 YEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEG 274
Query: 256 VFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
VFTG CGTELDHGV VGYG T +GTKYW VKNSWG WGE+GYIRM+R I KEGLCGI
Sbjct: 275 VFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGI 334
Query: 316 AMDSSYP 322
AM++SYP
Sbjct: 335 AMEASYP 341
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 358 bits (918), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 183/316 (57%), Positives = 224/316 (70%), Gaps = 11/316 (3%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
E SL E +E+W S + V ++ EEK KRF +FK NV+ I N +K YKL +N+F D
Sbjct: 31 ENSLWELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKK-DKSYKLKLNKFGDM 88
Query: 73 TNQEFKAFRNG-----YRRPDGLTSRKGT-SFKYENVIDVPATMDWRKNGAVTPIKNQGP 126
T++EF+ G +R G +K T SF Y NV +P ++DWRKNGAVTP+KNQG
Sbjct: 89 TSEEFRRTYAGSNIKHHRMFQG--EKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQ 146
Query: 127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
CGSCWAFS V A EGI Q+ T KL SLSEQELV CDT+ + GC GG M+ AF+FI
Sbjct: 147 CGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQ-NQGCNGGLMDLAFEFIKEKG 205
Query: 187 GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASG 246
G+T+E YPY+A D TC+ E + V I G+E VP NSE+ L+KAVANQPV+V+IDA G
Sbjct: 206 GLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGG 265
Query: 247 SAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
S FQFYS GVFTG CGTEL+HGV VGYG T +GTKYW+VKNSWG WGE+GYIRM+R I
Sbjct: 266 SDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGI 325
Query: 307 DAKEGLCGIAMDSSYP 322
KEGLCGIAM++SYP
Sbjct: 326 RHKEGLCGIAMEASYP 341
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 357 bits (917), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 180/316 (56%), Positives = 226/316 (71%), Gaps = 10/316 (3%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
E SL +E+W + + V ++ +EK +RF +FK+NV+FI N + PYKL++N+F D
Sbjct: 33 EDSLWNLYEKWRTHH-TVARDLDEKNRRFNVFKENVKFIHEFNQKKDAPYKLALNKFGDM 91
Query: 73 TNQEFKAFRNG-----YRRPDGLTSRKGTSFKYENVIDVPA-TMDWRKNGAVTPIKNQGP 126
TNQEF++ G +R G+ G SF YENV +PA ++DWR GAVT +K+QG
Sbjct: 92 TNQEFRSKYAGSKIQHHRSQRGIQKNTG-SFMYENVGSLPAASIDWRAKGAVTGVKDQGQ 150
Query: 127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
CGSCWAFS +A+ EGI Q+ TG+L+SLSEQELV CDTS + GC GG M+ AF+FI N
Sbjct: 151 CGSCWAFSTIASVEGINQIKTGELVSLSEQELVDCDTS-YNEGCNGGLMDYAFEFIQKN- 208
Query: 187 GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASG 246
GITTE +YPY DGTC S V I G++ VPAN+E AL++AVANQP++VSI+ASG
Sbjct: 209 GITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQDVPANNENALMQAVANQPISVSIEASG 268
Query: 247 SAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
FQFYS GVFTG CGTELDHGV VGYGAT +GTKYW+VKNSWG WGE GYIRM+R I
Sbjct: 269 YGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGTKYWIVKNSWGEEWGESGYIRMQRGI 328
Query: 307 DAKEGLCGIAMDSSYP 322
K G CGIAM++SYP
Sbjct: 329 SDKRGKCGIAMEASYP 344
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 356 bits (914), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 176/314 (56%), Positives = 220/314 (70%), Gaps = 7/314 (2%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
E SL + +E+W S + V ++ EK KRF +FK N+ + + N +KPYKL +N+FAD
Sbjct: 33 EESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANLMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 73 TNQEFKAFRNG----YRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCG 128
TN EF++ G + R T + +F YE V+ VP ++DWRK GAVT +K+QG CG
Sbjct: 91 TNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCG 150
Query: 129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGI 188
SCWAFS V A EGI Q+ T KL++LSEQELV CD + GC GG ME AF+FI GI
Sbjct: 151 SCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEE-NQGCNGGLMESAFEFIKQKGGI 209
Query: 189 TTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSA 248
TTE+NYPY+A +GTC+ + I G+E VPAN E+ALLKAVANQPV+V+IDA GS
Sbjct: 210 TTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSD 269
Query: 249 FQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDA 308
FQFYS GVFTGDC T+L+HGV VGYG T +GT YW+V+NSWG WGE GYIRM+R+I
Sbjct: 270 FQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISK 329
Query: 309 KEGLCGIAMDSSYP 322
KEGLCGIAM SYP
Sbjct: 330 KEGLCGIAMLPSYP 343
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 356 bits (913), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 178/315 (56%), Positives = 220/315 (69%), Gaps = 9/315 (2%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
E SL + +E+W S + V ++ EK KRF +FK NV + + N +KPYKL +N+FAD
Sbjct: 33 EESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 73 TNQEFKAFR-----NGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPC 127
TN EF++ N ++ G GT F YE V VPA++DWRK GAVT +K+QG C
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHGSGT-FMYEKVGSVPASVDWRKKGAVTDVKDQGQC 149
Query: 128 GSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDG 187
GSCWAFS + A EGI Q+ T KL+SLSEQELV CD + GC GG ME AF+FI G
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEE-NQGCNGGLMESAFEFIKQKGG 208
Query: 188 ITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGS 247
ITTE+NYPY A +GTC+++ I G+E VP N E ALLKAVANQPV+V+IDA GS
Sbjct: 209 ITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGS 268
Query: 248 AFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
FQFYS GVFTGDC T+L+HGV VGYG T +GT YW+V+NSWG WGE+GYIRM+R+I
Sbjct: 269 DFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNIS 328
Query: 308 AKEGLCGIAMDSSYP 322
KEGLCGIAM +SYP
Sbjct: 329 KKEGLCGIAMMASYP 343
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 344 bits (883), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 170/317 (53%), Positives = 218/317 (68%), Gaps = 8/317 (2%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
E +L + +E+W S + +V ++ EK +RF FK N FI S N G+ PY+L +N F D
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 73 TNQEFKA-FRNGYRR--PDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
EF+A F RR P S G + NV D+P ++DWR+ GAVT +K+QG CGS
Sbjct: 98 DQAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGS 157
Query: 130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
CWAFS V + EGI + TG L+SLSEQEL+ CDT+ D GC+GG M++AF++I +N G+
Sbjct: 158 CWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADND-GCQGGLMDNAFEYIKNNGGLI 216
Query: 190 TEANYPYQAVDGTCNKTNEASH---VAKIKGYETVPANSEEALLKAVANQPVAVSIDASG 246
TEA YPY+A GTCN A + V I G++ VPANSEE L +AVANQPV+V+++ASG
Sbjct: 217 TEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASG 276
Query: 247 SAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
AF FYS GVFTGDCGTELDHGV VGYG +G YW VKNSWG SWGE+GYIR+++D
Sbjct: 277 KAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDS 336
Query: 307 DAKEGLCGIAMDSSYPT 323
A GLCGIAM++SYP
Sbjct: 337 GASGGLCGIAMEASYPV 353
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 343 bits (880), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 169/317 (53%), Positives = 218/317 (68%), Gaps = 8/317 (2%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
E +L + +E+W S + +V ++ EK +RF FK N FI S N G+ PY+L +N F D
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 73 TNQEFKA-FRNGYRR--PDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
EF+A F RR P S G + NV D+P ++DWR+ GAVT +K+QG CGS
Sbjct: 98 DQAEFRATFVGDLRRDTPSKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGS 157
Query: 130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
CWAFS V + EGI + TG L+SLSEQEL+ CDT+ D GC+GG M++AF++I +N G+
Sbjct: 158 CWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADND-GCQGGLMDNAFEYIKNNGGLI 216
Query: 190 TEANYPYQAVDGTCNKTNEASH---VAKIKGYETVPANSEEALLKAVANQPVAVSIDASG 246
TEA YPY+A GTCN A + V I G++ VPANSEE L +AVANQPV+V+++ASG
Sbjct: 217 TEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASG 276
Query: 247 SAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
AF FYS GVFTG+CGTELDHGV VGYG +G YW VKNSWG SWGE+GYIR+++D
Sbjct: 277 KAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDS 336
Query: 307 DAKEGLCGIAMDSSYPT 323
A GLCGIAM++SYP
Sbjct: 337 GASGGLCGIAMEASYPV 353
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 336 bits (862), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 172/316 (54%), Positives = 217/316 (68%), Gaps = 10/316 (3%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
E LS +++W S + V ++ E+EKRF +F+ NV + + N N+ YKL +N+FAD
Sbjct: 31 EEGLSTLYDRWRSHHS-VPRSLNEREKRFNVFRHNVMHVHNTNKK-NRSYKLKLNKFADL 88
Query: 73 TNQEFKAFRNG----YRRPDGLTSRKGTSFKY--ENVIDVPATMDWRKNGAVTPIKNQGP 126
T EFK G + R R F Y EN+ +P+++DWRK GAVT IKNQG
Sbjct: 89 TINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGK 148
Query: 127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
CGSCWAFS VAA EGI ++ T KL+SLSEQELV CDT + GC GG ME AF+FI N
Sbjct: 149 CGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTKQ-NEGCNGGLMEIAFEFIKKNG 207
Query: 187 GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASG 246
GITTE +YPY+ +DG C+ + + + I G+E VP N E ALLKAVANQPV+V+IDA
Sbjct: 208 GITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGS 267
Query: 247 SAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
S FQFYS GVFTG CGTEL+HGV AVGYG + G KYW+V+NSWG WGE GYI+++R+I
Sbjct: 268 SDFQFYSEGVFTGSCGTELNHGVAAVGYG-SERGKKYWIVRNSWGAEWGEGGYIKIEREI 326
Query: 307 DAKEGLCGIAMDSSYP 322
D EG CGIAM++SYP
Sbjct: 327 DEPEGRCGIAMEASYP 342
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 333 bits (855), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 162/315 (51%), Positives = 220/315 (69%), Gaps = 6/315 (1%)
Query: 13 EASLSEKHEQWMSKYGKVYKNP--EEKEKRFRIFKDNVEFIESLNAAGNKP--YKLSINE 68
EA ++ W+++ G N E E+RF +F DN++F+++ NA ++ ++L +N
Sbjct: 45 EAEARAAYDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNR 104
Query: 69 FADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCG 128
FAD TN+EF+A G + + + G ++++ V ++P ++DWR+ GAV P+KNQG CG
Sbjct: 105 FADLTNEEFRATFLGAKVAE-RSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCG 163
Query: 129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGI 188
SCWAFSAV+ E I QL TG++I+LSEQELV C T+G + GC GG M+DAF FII N GI
Sbjct: 164 SCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGI 223
Query: 189 TTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSA 248
TE +YPY+AVDG C+ E + V I G+E VP N E++L KAVA+QPV+V+I+A G
Sbjct: 224 DTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGRE 283
Query: 249 FQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDA 308
FQ Y SGVF+G CGT LDHGV AVGYG T NG YW+V+NSWG WGE GY+RM+R+I+
Sbjct: 284 FQLYHSGVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINV 342
Query: 309 KEGLCGIAMDSSYPT 323
G CGIAM +SYPT
Sbjct: 343 TTGKCGIAMMASYPT 357
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 330 bits (846), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 162/309 (52%), Positives = 213/309 (68%), Gaps = 4/309 (1%)
Query: 16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQ 75
L E E WMS++ K YK+ EEK RF +F++N+ I+ N N Y L +NEFAD T++
Sbjct: 47 LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS-YWLGLNEFADLTHE 105
Query: 76 EFKAFRNGYRRPDGLTSRKGTS-FKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFS 134
EFK G +P R+ ++ F+Y ++ D+P ++DWRK GAV P+K+QG CGSCWAFS
Sbjct: 106 EFKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFS 165
Query: 135 AVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANY 194
VAA EGI Q+TTG L SLSEQEL+ CDT+ + GC GG M+ AF++II G+ E +Y
Sbjct: 166 TVAAVEGINQITTGNLSSLSEQELIDCDTT-FNSGCNGGLMDYAFQYIISTGGLHKEDDY 224
Query: 195 PYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSS 254
PY +G C + E I GYE VP N +E+L+KA+A+QPV+V+I+ASG FQFY
Sbjct: 225 PYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKG 284
Query: 255 GVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCG 314
GVF G CGT+LDHGV AVGYG ++ G+ Y +VKNSWG WGE+G+IRMKR+ EGLCG
Sbjct: 285 GVFNGKCGTDLDHGVAAVGYG-SSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEGLCG 343
Query: 315 IAMDSSYPT 323
I +SYPT
Sbjct: 344 INKMASYPT 352
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 329 bits (844), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 155/322 (48%), Positives = 214/322 (66%), Gaps = 5/322 (1%)
Query: 2 AASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKP 61
A+ SR + ++ E+WM++YG+VYK+ +EK +RF+IFK+NV+ IE+ N+
Sbjct: 19 ASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENS 78
Query: 62 YKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPI 121
Y L IN+F D T EF A G P + SF N+ VP ++DWR GAV +
Sbjct: 79 YTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEV 138
Query: 122 KNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKF 181
KNQ PCGSCW+F+A+A EGI ++ TG L+SLSEQE++ C V +GC+GG + A+ F
Sbjct: 139 KNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDC---AVSYGCKGGWVNKAYDF 195
Query: 182 IIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVS 241
II N+G+TTE NYPY A GTCN N + A I GY V N E +++ AV+NQP+A
Sbjct: 196 IISNNGVTTEENYPYLAYQGTCN-ANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAAL 254
Query: 242 IDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIR 301
IDAS + FQ+Y+ GVF+G CGT L+H +T +GYG ++GTKYW+V+NSWG+SWGE GY+R
Sbjct: 255 IDASEN-FQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVR 313
Query: 302 MKRDIDAKEGLCGIAMDSSYPT 323
M R + + G+CGIAM +PT
Sbjct: 314 MARGVSSSSGVCGIAMAPLFPT 335
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 328 bits (841), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 160/291 (54%), Positives = 204/291 (70%), Gaps = 5/291 (1%)
Query: 36 EKEKRFRIFKDNVEFIESLNAAGNKP--YKLSINEFADQTNQEFKAFRNGYRRPDGLTSR 93
E E+RFR+F DN++F+++ NA ++ ++L +N FAD TN EF+A G P G R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142
Query: 94 KGTSFKYENVIDVPATMDWRKNGAVT-PIKNQGPCGSCWAFSAVAATEGITQLTTGKLIS 152
G +++++ V +P ++DWR GAV P+KNQG CGSCWAFSAVAA EGI ++ TG+L+S
Sbjct: 143 VGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVS 202
Query: 153 LSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHV 212
LSEQELV C +G + GC GG M+DAF FI N G+ TE +YPY A+DG CN + V
Sbjct: 203 LSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSRKV 262
Query: 213 AKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAV 272
I G+E VP N E +L KAVA+QPV+V+IDA G FQ Y SGVFTG CGT LDHGV AV
Sbjct: 263 VSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVVAV 322
Query: 273 GYGA-TANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
GYG A G YW V+NSWG WGE GYIRM+R++ A+ G CGIAM +SYP
Sbjct: 323 GYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYP 373
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 327 bits (839), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 162/314 (51%), Positives = 214/314 (68%), Gaps = 9/314 (2%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPE--EKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFA 70
EA + +E W+ K+GK EK++RF IFKDN+ F++ N N Y+L + FA
Sbjct: 43 EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK-NLSYRLGLTRFA 101
Query: 71 DQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVI--DVPATMDWRKNGAVTPIKNQGPCG 128
D TN E+++ G + R TS +YE + ++P ++DWRK GAV +K+QG CG
Sbjct: 102 DLTNDEYRSKYLGAKMEKKGERR--TSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCG 159
Query: 129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGI 188
SCWAFS + A EGI Q+ TG LI+LSEQELV CDTS + GC GG M+ AF+FII N GI
Sbjct: 160 SCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGGI 218
Query: 189 TTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSA 248
T+ +YPY+ VDGTC++ + + V I YE VP SEE+L KAVA+QP++++I+A G A
Sbjct: 219 DTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRA 278
Query: 249 FQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDA 308
FQ Y SG+F G CGT+LDHGV AVGYG T NG YW+V+NSWG SWGE GY+RM R+I +
Sbjct: 279 FQLYDSGIFDGSCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGKSWGESGYLRMARNIAS 337
Query: 309 KEGLCGIAMDSSYP 322
G CGIA++ SYP
Sbjct: 338 SSGKCGIAIEPSYP 351
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 324 bits (831), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 166/316 (52%), Positives = 207/316 (65%), Gaps = 8/316 (2%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
E ++ + +E+W + V + E KRF +F+ NV + N NKPYKL IN FAD
Sbjct: 31 EENVWKLYERWRGHH-SVSRASHEAIKRFNVFRHNVLHVHRTNKK-NKPYKLKINRFADI 88
Query: 73 TNQEFKAFRNG----YRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCG 128
T+ EF++ G + R R F YENV VP+++DWR+ GAVT +KNQ CG
Sbjct: 89 THHEFRSSYAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCG 148
Query: 129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGI 188
SCWAFS VAA EGI ++ T KL+SLSEQELV CDT + GC GG ME AF+FI +N GI
Sbjct: 149 SCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEE-NQGCAGGLMEPAFEFIKNNGGI 207
Query: 189 TTEANYPYQAVDGT-CNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGS 247
TE YPY + D C + I G+E VP N EE LLKAVA+QPV+V+IDA S
Sbjct: 208 KTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSS 267
Query: 248 AFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
FQ YS GVF G+CGT+L+HGV VGYG T NGTKYW+V+NSWG WGE GY+R++R I
Sbjct: 268 DFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGIS 327
Query: 308 AKEGLCGIAMDSSYPT 323
EG CGIAM++SYPT
Sbjct: 328 ENEGRCGIAMEASYPT 343
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 323 bits (828), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 159/319 (49%), Positives = 214/319 (67%), Gaps = 3/319 (0%)
Query: 7 TSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSI 66
T + E + +EQW+ + K Y EKE+RF+IFKDN++F++ N+ ++ +++ +
Sbjct: 31 TEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGL 90
Query: 67 NEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGP 126
FAD TN+EF+A + S K + Y+ +P +DWR NGAV +K+QG
Sbjct: 91 TRFADLTNEEFRAIYLRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGN 150
Query: 127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
CGSCWAFSAV A EGI Q+TTG+LISLSEQELV CD V+ GC+GG M AF+FI+ N
Sbjct: 151 CGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNG 210
Query: 187 GITTEANYPYQAVD-GTCN-KTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDA 244
GI T+ +YPY A D G CN N + V I GYE VP + E++L KAVA+QPV+V+I+A
Sbjct: 211 GIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEA 270
Query: 245 SGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKR 304
S AFQ Y SGV TG CG LDHGV VGYG+T+ G YW+++NSWG +WG+ GY++++R
Sbjct: 271 SSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQR 329
Query: 305 DIDAKEGLCGIAMDSSYPT 323
+ID G CGIAM SYPT
Sbjct: 330 NIDDPFGKCGIAMMPSYPT 348
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 322 bits (826), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 163/310 (52%), Positives = 211/310 (68%), Gaps = 13/310 (4%)
Query: 20 HEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA---GNKPYKLSINEFADQTNQE 76
+ +W +++GK Y E+E+R+ F+DN+ +I+ NAA G ++L +N FAD TN+E
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 77 FK----AFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWA 132
++ RN RR ++ R + + +P ++DWR GAV IK+QG CGSCWA
Sbjct: 100 YRDTYLGLRNKPRRERKVSDR----YLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155
Query: 133 FSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEA 192
FSA+AA EGI Q+ TG LISLSEQELV CDTS + GC GG M+ AF FII+N GI TE
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTED 214
Query: 193 NYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFY 252
+YPY+ D C+ + + V I YE V NSE +L KAVANQPV+V+I+A G AFQ Y
Sbjct: 215 DYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLY 274
Query: 253 SSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGL 312
SSG+FTG CGT LDHGV AVGYG T NG YW+V+NSWG SWGE GY+RM+R+I A G
Sbjct: 275 SSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGK 333
Query: 313 CGIAMDSSYP 322
CGIA++ SYP
Sbjct: 334 CGIAVEPSYP 343
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 321 bits (822), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 160/312 (51%), Positives = 210/312 (67%), Gaps = 9/312 (2%)
Query: 16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQ 75
L E E W+S + K Y+ EEK RF +FKDN++ I+ N G K Y L +NEFAD +++
Sbjct: 47 LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFADLSHE 105
Query: 76 EFKAFRNGYR----RPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCW 131
EFK G + R D R F Y +V VP ++DWRK GAV +KNQG CGSCW
Sbjct: 106 EFKKMYLGLKTDIVRRD--EERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163
Query: 132 AFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTE 191
AFS VAA EGI ++ TG L +LSEQEL+ CDT+ ++GC GG M+ AF++I+ N G+ E
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKE 222
Query: 192 ANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQF 251
+YPY +GTC + S I G++ VP N E++LLKA+A+QP++V+IDASG FQF
Sbjct: 223 EDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQF 282
Query: 252 YSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEG 311
YS GVF G CG +LDHGV AVGYG ++ G+ Y +VKNSWG WGE+GYIR+KR+ EG
Sbjct: 283 YSGGVFDGRCGVDLDHGVAAVGYG-SSKGSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEG 341
Query: 312 LCGIAMDSSYPT 323
LCGI +S+PT
Sbjct: 342 LCGINKMASFPT 353
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 319 bits (817), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 159/312 (50%), Positives = 209/312 (66%), Gaps = 12/312 (3%)
Query: 22 QWMSKYGKVYKNPE----EKEKRFRIFKDNVEFIESLNAAG-NKPYKLSINEFADQTNQE 76
+W ++GK N ++++RF IFKDN+ FI+ N N YKL + FA+ TN E
Sbjct: 6 RWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDE 65
Query: 77 FKAFRNGYRRP--DGLTSRKGTSFKYE---NVIDVPATMDWRKNGAVTPIKNQGPCGSCW 131
+++ G R +T K + KY NV +VP T+DWR+ GAV IK+QG CGSCW
Sbjct: 66 YRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCW 125
Query: 132 AFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTE 191
AFS AA EGI ++ TG+L+SLSEQELV CD S + GC GG M+ AF+FI+ N G+ TE
Sbjct: 126 AFSTAAAVEGINKIVTGELVSLSEQELVDCDKS-YNQGCNGGLMDYAFQFIMKNGGLNTE 184
Query: 192 ANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQF 251
+YPY +G CN + S V I GYE VP+ E AL +AV+ QPV+V+IDA G AFQ
Sbjct: 185 KDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQH 244
Query: 252 YSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEG 311
Y SG+FTG CGT +DH V AVGYG + NG YW+V+NSWGT WGE+GYIRM+R++ +K G
Sbjct: 245 YQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNVASKSG 303
Query: 312 LCGIAMDSSYPT 323
CGIA+++SYP
Sbjct: 304 KCGIAIEASYPV 315
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 315 bits (808), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 148/308 (48%), Positives = 204/308 (66%), Gaps = 5/308 (1%)
Query: 16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQ 75
+ ++ E+WM++YG+VYK+ +EK RF+IFK+NV IE+ N Y L IN+F D TN
Sbjct: 33 MMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNN 92
Query: 76 EFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSA 135
EF A G P + SF ++ VP ++DWR +GAVT +KNQG CGSCWAF++
Sbjct: 93 EFVAQYTGLSLPLNIKREPVVSFDDVDISSVPQSIDWRDSGAVTSVKNQGRCGSCWAFAS 152
Query: 136 VAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYP 195
+A E I ++ G L+SLSEQ+++ C V +GC+GG + A+ FII N G+ + A YP
Sbjct: 153 IATVESIYKIKRGNLVSLSEQQVLDC---AVSYGCKGGWINKAYSFIISNKGVASAAIYP 209
Query: 196 YQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSG 255
Y+A GTC KTN + A I Y V N+E ++ AV+NQP+A ++DASG+ FQ Y G
Sbjct: 210 YKAAKGTC-KTNGVPNSAYITRYTYVQRNNERNMMYAVSNQPIAAALDASGN-FQHYKRG 267
Query: 256 VFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
VFTG CGT L+H + +GYG ++G K+W+V+NSWG WGE GYIR+ RD+ + GLCGI
Sbjct: 268 VFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNSWGAGWGEGGYIRLARDVSSSFGLCGI 327
Query: 316 AMDSSYPT 323
AMD YPT
Sbjct: 328 AMDPLYPT 335
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 311 bits (796), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 160/313 (51%), Positives = 205/313 (65%), Gaps = 13/313 (4%)
Query: 22 QWMSKYGKVYKNPE----EKEKRFRIFKDNVEFIESLNAAG-NKPYKLSINEFADQTNQE 76
QW +++GK N +++KRF IFKDN+ FI+ N N YKL + +F D TN E
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDE 110
Query: 77 FKAFRNGYRRPDG--LTSRKGTSFKYENVI---DVPATMDWRKNGAVTPIKNQGPCGSCW 131
++ G R + K + KY + +VP T+DWR+ GAV PIK+QG CGSCW
Sbjct: 111 YRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCW 170
Query: 132 AFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTE 191
AFS AA EGI ++ TG+LISLSEQELV CD S + GC GG M+ AF+FI+ N G+ TE
Sbjct: 171 AFSTTAAVEGINKIVTGELISLSEQELVDCDKS-YNQGCNGGLMDYAFQFIMKNGGLNTE 229
Query: 192 ANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQF 251
+YPY+ G CN + S V I GYE VP E AL KA++ QPV+V+I+A G FQ
Sbjct: 230 KDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQH 289
Query: 252 YSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDA-KE 310
Y SG+FTG CGT LDH V AVGYG + NG YW+V+NSWG WGEEGYIRM+R++ A K
Sbjct: 290 YQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRMERNLAASKS 348
Query: 311 GLCGIAMDSSYPT 323
G CGIA+++SYP
Sbjct: 349 GKCGIAVEASYPV 361
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 310 bits (793), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 160/323 (49%), Positives = 208/323 (64%), Gaps = 9/323 (2%)
Query: 3 ASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPY 62
A +T R E + +E W+ KYGK Y + E E+RF IFK+ + FI+ NA N+ Y
Sbjct: 27 AKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSY 84
Query: 63 KLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVID--VPATMDWRKNGAVTP 120
K+ +N+FAD T++EF R+ Y R +++ S +YE + +P+ +DWR GAV
Sbjct: 85 KVGLNQFADLTDEEF---RSTYLRFTSGSNKTKVSNRYEPRVGQVLPSYVDWRSAGAVVD 141
Query: 121 IKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFK 180
IK+QG CG CWAFSA+A EGI ++ TG LISLSEQEL+ C + GC GG + D F+
Sbjct: 142 IKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQ 201
Query: 181 FIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAV 240
FII+N GI TE NYPY A DG CN + I YE VP N+E AL AV QPV+V
Sbjct: 202 FIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSV 261
Query: 241 SIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYI 300
++DA+G AF+ YSSG+FTG CGT +DH VT VGYG T G YW+VKNSW T+WGEEGY+
Sbjct: 262 ALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYG-TEGGIDYWIVKNSWDTTWGEEGYM 320
Query: 301 RMKRDIDAKEGLCGIAMDSSYPT 323
R+ R++ G CGIA SYP
Sbjct: 321 RILRNVGGA-GTCGIATMPSYPV 342
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 308 bits (790), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 158/323 (48%), Positives = 208/323 (64%), Gaps = 9/323 (2%)
Query: 3 ASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPY 62
A +T R E + +E W+ KYGK Y + E E+RF IFK+ + FI+ NA N+ Y
Sbjct: 27 AKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSY 84
Query: 63 KLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVID--VPATMDWRKNGAVTP 120
K+ +N+FAD T++EF++ G+ +++ S +YE + +P+ +DWR GAV
Sbjct: 85 KVGLNQFADLTDEEFRSTYLGFTSG---SNKTKVSNRYEPRVGQVLPSYVDWRSAGAVVD 141
Query: 121 IKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFK 180
IK+QG CG CWAFSA+A EGI ++ TG LISLSEQEL+ C + GC GG + D F+
Sbjct: 142 IKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQ 201
Query: 181 FIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAV 240
FII+N GI TE NYPY A DG CN + I YE VP N+E AL AV QPV+V
Sbjct: 202 FIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSV 261
Query: 241 SIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYI 300
++DA+G AF+ YSSG+FTG CGT +DH VT VGYG T G YW+VKNSW T+WGEEGY+
Sbjct: 262 ALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG-TEGGIDYWIVKNSWDTTWGEEGYM 320
Query: 301 RMKRDIDAKEGLCGIAMDSSYPT 323
R+ R++ G CGIA SYP
Sbjct: 321 RILRNVGGA-GTCGIATMPSYPV 342
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 304 bits (779), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 156/314 (49%), Positives = 210/314 (66%), Gaps = 10/314 (3%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
EASL E WM K+GKVY + EKE+R IF+DN+ FI + NA N Y+L + FAD
Sbjct: 44 EASLI--FESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADL 100
Query: 73 TNQEFKAFRNGYR-RPDGLTSRKGTSFKYENVID--VPATMDWRKNGAVTPIKNQGPCGS 129
+ E+K +G RP +S +Y+ D +P ++DWR GAVT +K+QG C S
Sbjct: 101 SLHEYKEVCHGADPRPPRNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRS 160
Query: 130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
CWAFS V A EG+ ++ TG+L++LSEQ+L++C+ ++GC GG++E A++FI+ N G+
Sbjct: 161 CWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE--NNGCGGGKLETAYEFIMKNGGLG 218
Query: 190 TEANYPYQAVDGTCN-KTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSA 248
T+ +YPY+AV+G C+ + E + I GYE +PAN E AL+KAVA+QPV ID+S
Sbjct: 219 TDNDYPYKAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSRE 278
Query: 249 FQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDA 308
FQ Y SGVF G CGT L+HGV VGYG T NG YWLVKNS G +WGE GY++M R+I
Sbjct: 279 FQLYESGVFDGSCGTNLNHGVVVVGYG-TENGRDYWLVKNSRGITWGEAGYMKMARNIAN 337
Query: 309 KEGLCGIAMDSSYP 322
GLCGIAM +SYP
Sbjct: 338 PRGLCGIAMRASYP 351
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 298 bits (762), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 208/314 (66%), Gaps = 8/314 (2%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
+A + E WM K+GKVY + EKE+R IF+DN+ FI + NA N Y+L +N FAD
Sbjct: 49 DAEATLMFESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADL 107
Query: 73 TNQEFKAFRNGYR-RP--DGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
+ E+ +G RP + + +K + +P ++DWR GAVT +K+QG C S
Sbjct: 108 SLHEYGEICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRS 167
Query: 130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
CWAFS V A EG+ ++ TG+L++LSEQ+L++C+ ++GC GG++E A++FI++N G+
Sbjct: 168 CWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE--NNGCGGGKVETAYEFIMNNGGLG 225
Query: 190 TEANYPYQAVDGTCN-KTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSA 248
T+ +YPY+A++G C + E + I GYE +PAN E AL+KAVA+QPV +D+S
Sbjct: 226 TDNDYPYKALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSRE 285
Query: 249 FQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDA 308
FQ Y SGVF G CGT L+HGV VGYG T NG YW+VKNS G +WGE GY++M R+I
Sbjct: 286 FQLYESGVFDGTCGTNLNHGVVVVGYG-TENGRDYWIVKNSRGDTWGEAGYMKMARNIAN 344
Query: 309 KEGLCGIAMDSSYP 322
GLCGIAM +SYP
Sbjct: 345 PRGLCGIAMRASYP 358
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 294 bits (752), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 145/309 (46%), Positives = 198/309 (64%), Gaps = 7/309 (2%)
Query: 16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQ 75
L + + WM K+ K+Y++ +EK RF IF+DN+ +I+ N N Y L +N FAD +N
Sbjct: 44 LIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNS-YWLGLNGFADLSND 102
Query: 76 EFKAFRNGYRRPD--GLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAF 133
EFK G+ D GL F Y++V + P ++DWR GAVTP+KNQG CGSCWAF
Sbjct: 103 EFKKKYVGFVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAF 162
Query: 134 SAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEAN 193
S +A EGI ++ TG L+ LSEQELV CD +GC+GG + +++ N+G+ T
Sbjct: 163 STIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGGYQTTSLQYVA-NNGVHTSKV 219
Query: 194 YPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYS 253
YPYQA C T++ KI GY+ VP+N E + L A+ANQP++V ++A G FQ Y
Sbjct: 220 YPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYK 279
Query: 254 SGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLC 313
SGVF G CGT+LDH VTAVGYG T++G Y ++KNSWG +WGE+GY+R+KR +G C
Sbjct: 280 SGVFDGPCGTKLDHAVTAVGYG-TSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTC 338
Query: 314 GIAMDSSYP 322
G+ S YP
Sbjct: 339 GVYKSSYYP 347
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 266 bits (679), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 125/219 (57%), Positives = 159/219 (72%), Gaps = 4/219 (1%)
Query: 105 DVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTS 164
D+P ++DWR+NGAV P+KNQG CGSCWAFS VAA EGI Q+ TG LISLSEQ+LV C T+
Sbjct: 2 DLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA 61
Query: 165 GVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPAN 224
+HGC GG M AF+FI++N GI +E YPY+ DG CN T A V I YE VP++
Sbjct: 62 --NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNSTVNAP-VVSIDSYENVPSH 118
Query: 225 SEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYW 284
+E++L KAVANQPV+V++DA+G FQ Y SG+FTG C +H +T VGYG T N +W
Sbjct: 119 NEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYG-TENDKDFW 177
Query: 285 LVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
+VKNSWG +WGE GYIR +R+I+ +G CGI +SYP
Sbjct: 178 IVKNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 266 bits (679), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 185/308 (60%), Gaps = 5/308 (1%)
Query: 16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQ 75
L + WM + K Y+N +EK RF IFKDN+ +I+ N N Y L +NEFAD +N
Sbjct: 44 LIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNS-YWLGLNEFADLSND 102
Query: 76 EFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSA 135
EF G + F E+ +++P +DWRK GAVTP+++QG CGSCWAFSA
Sbjct: 103 EFNEKYVGSLIDATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSA 162
Query: 136 VAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYP 195
VA EGI ++ TGKL+ LSEQELV C+ HGC+GG A +++ N GI + YP
Sbjct: 163 VATVEGINKIRTGKLVELSEQELVDCERR--SHGCKGGYPPYALEYVAKN-GIHLRSKYP 219
Query: 196 YQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSG 255
Y+A GTC + K G V N+E LL A+A QPV+V +++ G FQ Y G
Sbjct: 220 YKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGG 279
Query: 256 VFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
+F G CGT++DH VTAVGYG + L+KNSWGT+WGE+GYIR+KR G+CG+
Sbjct: 280 IFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGNSPGVCGL 338
Query: 316 AMDSSYPT 323
S YPT
Sbjct: 339 YKSSYYPT 346
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 263 bits (672), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 180/313 (57%), Gaps = 18/313 (5%)
Query: 16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQ 75
L + E WM K+ K+YKN +EK RF IFKDN+++I+ N N Y L +N FAD +N
Sbjct: 44 LIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKK-NNSYWLGLNVFADMSND 102
Query: 76 EFKAFRNGYRRPDGLTSRKGTSFKYENV-----IDVPATMDWRKNGAVTPIKNQGPCGSC 130
EFK G + T T YE V +++P +DWR+ GAVTP+KNQG CGSC
Sbjct: 103 EFKEKYTGSIAGNYTT----TELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSC 158
Query: 131 WAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITT 190
WAFSAV EGI ++ TG L SEQEL+ CD +GC GG A + + GI
Sbjct: 159 WAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR--SYGCNGGYPWSALQLVAQY-GIHY 215
Query: 191 EANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQ 250
YPY+ V C + + AK G V +E ALL ++ANQPV+V ++A+G FQ
Sbjct: 216 RNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQ 275
Query: 251 FYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKE 310
Y G+F G CG ++DH V AVGYG Y L+KNSWGT WGE GYIR+KR
Sbjct: 276 LYRGGIFVGPCGNKVDHAVAAVGYGPN-----YILIKNSWGTGWGENGYIRIKRGTGNSY 330
Query: 311 GLCGIAMDSSYPT 323
G+CG+ S YP
Sbjct: 331 GVCGLYTSSFYPV 343
>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
Length = 215
Score = 259 bits (661), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 124/218 (56%), Positives = 157/218 (72%), Gaps = 5/218 (2%)
Query: 106 VPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSG 165
+P+ +DWR GAV IKNQ CGSCWAFSAVAA E I ++ TG+LISLSEQELV CDT+
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTA- 59
Query: 166 VDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANS 225
HGC GG M +AF++II N GI T+ NYPY AV G+C V I G++ V N+
Sbjct: 60 -SHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYRL--RVVSINGFQRVTRNN 116
Query: 226 EEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWL 285
E AL AVA+QPV+V+++A+G+ FQ YSSG+FTG CGT +HGV VGYG T +G YW+
Sbjct: 117 ESALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYG-TQSGKNYWI 175
Query: 286 VKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
V+NSWG +WG +GYI M+R++ + GLCGIA SYPT
Sbjct: 176 VRNSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYPT 213
>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 256 bits (655), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 140/321 (43%), Positives = 199/321 (61%), Gaps = 18/321 (5%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
+ + S + QW S + ++Y EE+ +R I++ N+ I+ N + G + + +N F
Sbjct: 22 DQTFSAEWHQWKSTHRRLYGTNEEEWRR-AIWEKNMRMIQLHNGEYSNGQHGFSMEMNAF 80
Query: 70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
D TN+EF+ NGYR +KG F+ ++ +P ++DWR+ G VTP+KNQG CGS
Sbjct: 81 GDMTNEEFRQVVNGYRHQ---KHKKGRLFQEPLMLKIPKSVDWREKGCVTPVKNQGQCGS 137
Query: 130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
CWAFSA EG L TGKLISLSEQ LV C + + GC GG M+ AF++I N G+
Sbjct: 138 CWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLD 197
Query: 190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGSA 248
+E +YPY+A DG+C E + VA G+ +P E+AL+KAVA P++V++DAS +
Sbjct: 198 SEESYPYEAKDGSCKYRAEFA-VANDTGFVDIP-QQEKALMKAVATVGPISVAMDASHPS 255
Query: 249 FQFYSSGV-FTGDCGTE-LDHGVTAVGY---GATANGTKYWLVKNSWGTSWGEEGYIRMK 303
QFYSSG+ + +C ++ LDHGV VGY G +N KYWLVKNSWG+ WG EGYI++
Sbjct: 256 LQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIA 315
Query: 304 RDIDAKEGLCGIAMDSSYPTA 324
+D D CG+A +SYP
Sbjct: 316 KDRDNH---CGLATAASYPVV 333
>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
lycopersicum PE=2 SV=1
Length = 346
Score = 255 bits (652), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 117/218 (53%), Positives = 159/218 (72%), Gaps = 2/218 (0%)
Query: 106 VPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSG 165
+P ++DWR+ G + +K+QG CGSCWAFSAVAA E I + TG LISLSEQELV CD S
Sbjct: 18 LPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRS- 76
Query: 166 VDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANS 225
+ GC+GG M+ AF+F+I N GI TE +YPY+ +G C++ + + V KI YE VP N+
Sbjct: 77 YNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNN 136
Query: 226 EEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWL 285
E+AL KAVA+QPV+++++A G FQ Y SG+FTG CGT +DHGV GYG T NG YW+
Sbjct: 137 EKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYG-TENGMDYWI 195
Query: 286 VKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
V+NSWG + E GY+R++R++ + GLCG+A++ SYP
Sbjct: 196 VRNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPV 233
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 255 bits (651), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 136/320 (42%), Positives = 187/320 (58%), Gaps = 6/320 (1%)
Query: 7 TSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSI 66
T + E + +EQW+ + GK Y EKE+RF+IFKDN++ IE N+ N+ Y+ +
Sbjct: 28 TESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGL 87
Query: 67 NEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTP-IKNQG 125
N+F+D T EF+A G + S ++Y+ +P +DWR+ GAV P +K QG
Sbjct: 88 NKFSDLTADEFQASYLGGKMEKKSLSDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQG 147
Query: 126 PCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHN 185
CGSCWAF+A A EGI Q+TTG+L+SLSEQEL+ CD + GC GG AF+FI N
Sbjct: 148 ECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKEN 207
Query: 186 DGITTEANYPYQAVDGTCNKTNE--ASHVAKIKGYETVPANSEEALLKAVANQPVAVSID 243
GI ++ Y Y D K E + V I G+E VP N E +L KAVA QP++V I
Sbjct: 208 GGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMIS 267
Query: 244 ASGSAFQFYSSGVFTGDCGTEL-DHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRM 302
A+ Y SGV+ G C DH V VGYG +++ YWL++NSWG WGE GY+R+
Sbjct: 268 AAN--MSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRL 325
Query: 303 KRDIDAKEGLCGIAMDSSYP 322
+R+ G C +A+ YP
Sbjct: 326 QRNFHEPTGKCAVAVAPVYP 345
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 254 bits (650), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 144/321 (44%), Positives = 195/321 (60%), Gaps = 24/321 (7%)
Query: 21 EQWMS---KYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEFADQTN 74
E+W + ++ K Y N E+ R +IF +N I N A G YKL +N++AD +
Sbjct: 26 EEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADMLH 85
Query: 75 QEFKAFRNGY--------RRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGP 126
EFK NGY R GL G ++ + VP ++DWR++GAVT +K+QG
Sbjct: 86 HEFKETMNGYNHTLRQLMRERTGLV---GATYIPPAHVTVPKSVDWREHGAVTGVKDQGH 142
Query: 127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
CGSCWAFS+ A EG G L+SLSEQ LV C T ++GC GG M++AF++I N
Sbjct: 143 CGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 202
Query: 187 GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDAS 245
GI TE +YPY+ +D +C+ N+A+ A G+ +P EE + KAVA PV+V+IDAS
Sbjct: 203 GIDTEKSYPYEGIDDSCH-FNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDAS 261
Query: 246 GSAFQFYSSGVFT-GDCGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMK 303
+FQ YS GV+ +C + LDHGV VGYG +G YWLVKNSWGT+WGE+GYI+M
Sbjct: 262 HESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMA 321
Query: 304 RDIDAKEGLCGIAMDSSYPTA 324
R+ + + CGIA SSYPT
Sbjct: 322 RNQNNQ---CGIATASSYPTV 339
>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 252 bits (644), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 197/319 (61%), Gaps = 18/319 (5%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
+ + + + QW S + ++Y EE+ +R +++ N+ I+ N + G + + +N F
Sbjct: 22 DQTFNAQWHQWKSTHRRLYGTNEEEWRR-AVWEKNMRMIQLHNGEYSNGKHGFTMEMNAF 80
Query: 70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
D TN+EF+ NGYR +KG F+ ++ +P T+DWR+ G VTP+KNQG CGS
Sbjct: 81 GDMTNEEFRQIVNGYRHQ---KHKKGRLFQEPLMLQIPKTVDWREKGCVTPVKNQGQCGS 137
Query: 130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
CWAFSA EG L TGKLISLSEQ LV C + GC GG M+ AF++I N G+
Sbjct: 138 CWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLD 197
Query: 190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
+E +YPY+A DG+C E + VA G+ +P E+AL+KAVA P++V++DAS +
Sbjct: 198 SEESYPYEAKDGSCKYRAEYA-VANDTGFVDIP-QQEKALMKAVATVGPISVAMDASHPS 255
Query: 249 FQFYSSGV-FTGDCGT-ELDHGVTAVGY---GATANGTKYWLVKNSWGTSWGEEGYIRMK 303
QFYSSG+ + +C + +LDHGV VGY G +N KYWLVKNSWG WG +GYI++
Sbjct: 256 LQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIA 315
Query: 304 RDIDAKEGLCGIAMDSSYP 322
+D + CG+A +SYP
Sbjct: 316 KD---RNNHCGLATAASYP 331
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 252 bits (644), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 136/307 (44%), Positives = 181/307 (58%), Gaps = 5/307 (1%)
Query: 16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQ 75
L + WM K+ K YKN +EK RF IFKDN+++I+ N N Y L +NEF+D +N
Sbjct: 44 LIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMING-YWLGLNEFSDLSND 102
Query: 76 EFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSA 135
EFK G D F E+++D+P ++DWR GAVTP+K+QG C SCWAFS
Sbjct: 103 EFKEKYVGSLPEDYTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFST 162
Query: 136 VAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYP 195
VA EGI ++ TG L+ LSEQELV CD +GC G + +++ N GI A YP
Sbjct: 163 VATVEGINKIKTGNLVELSEQELVDCDKQ--SYGCNRGYQSTSLQYVAQN-GIHLRAKYP 219
Query: 196 YQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSG 255
Y A TC K G V +N+E +LL A+A+QPV+V ++++G FQ Y G
Sbjct: 220 YIAKQQTCRANQVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQNYKGG 279
Query: 256 VFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
+F G CGT++DH VTAVGYG + L+KNSWG WGE GYIR++R G+CG+
Sbjct: 280 IFEGSCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGNSPGVCGV 338
Query: 316 AMDSSYP 322
S YP
Sbjct: 339 YRSSYYP 345
>sp|O60911|CATL2_HUMAN Cathepsin L2 OS=Homo sapiens GN=CTSL2 PE=1 SV=2
Length = 334
Score = 252 bits (644), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 140/321 (43%), Positives = 192/321 (59%), Gaps = 17/321 (5%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
+ +L K QW + + ++Y EE +R +++ N++ IE N + G + +++N F
Sbjct: 22 DQNLDTKWYQWKATHRRLYGANEEGWRR-AVWEKNMKMIELHNGEYSQGKHGFTMAMNAF 80
Query: 70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
D TN+EF+ +R RKG F+ +D+P ++DWRK G VTP+KNQ CGS
Sbjct: 81 GDMTNEEFRQMMGCFRNQK---FRKGKVFREPLFLDLPKSVDWRKKGYVTPVKNQKQCGS 137
Query: 130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
CWAFSA A EG TGKL+SLSEQ LV C + GC GG M AF+++ N G+
Sbjct: 138 CWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLD 197
Query: 190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
+E +YPY AVD C E S VA G+ V E+AL+KAVA P++V++DA S+
Sbjct: 198 SEESYPYVAVDEICKYRPENS-VANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSS 256
Query: 249 FQFYSSGV-FTGDCGTE-LDHGVTAVGY---GATANGTKYWLVKNSWGTSWGEEGYIRMK 303
FQFY SG+ F DC ++ LDHGV VGY GA +N +KYWLVKNSWG WG GY+++
Sbjct: 257 FQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIA 316
Query: 304 RDIDAKEGLCGIAMDSSYPTA 324
+D K CGIA +SYP
Sbjct: 317 KD---KNNHCGIATAASYPNV 334
>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 252 bits (643), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 121/217 (55%), Positives = 153/217 (70%), Gaps = 4/217 (1%)
Query: 106 VPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSG 165
+P ++DWR+ GAV P+KNQG CGSCWAF A+AA EGI Q+ TG LISLSEQ+LV C T
Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTR- 61
Query: 166 VDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANS 225
+HGCEGG AF++II+N GI +E +YPY +GTC+ T E +HV I Y VP+N
Sbjct: 62 -NHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCD-TKENAHVVSIDSYRNVPSND 119
Query: 226 EEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWL 285
E++L KAVANQPV+V++DA+G FQ Y +G+FTG C +H T VG T N YW
Sbjct: 120 EKSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRT-VGGRETENDKDYWT 178
Query: 286 VKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
VKNSWG +WGE GYIR++R+I G CGIA+ SYP
Sbjct: 179 VKNSWGKNWGESGYIRVERNIAESSGKCGIAISPSYP 215
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 250 bits (638), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 138/319 (43%), Positives = 197/319 (61%), Gaps = 23/319 (7%)
Query: 21 EQWMS---KYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEFADQTN 74
E+W + ++ K Y++ E+ R +IF +N I N A G +KL++N++AD +
Sbjct: 57 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 116
Query: 75 QEFKAFRNGY--------RRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGP 126
EF+ NG+ R D S KG +F + +P ++DWR GAVT +K+QG
Sbjct: 117 HEFRQLMNGFNYTLHKQLRAAD--ESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174
Query: 127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
CGSCWAFS+ A EG +G L+SLSEQ LV C T ++GC GG M++AF++I N
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234
Query: 187 GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDAS 245
GI TE +YPY+A+D +C+ N+ + A +G+ +P E+ + +AVA PV+V+IDAS
Sbjct: 235 GIDTEKSYPYEAIDDSCH-FNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 293
Query: 246 GSAFQFYSSGVFT-GDCGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMK 303
+FQFYS GV+ C + LDHGV VG+G +G YWLVKNSWGT+WG++G+I+M
Sbjct: 294 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKML 353
Query: 304 RDIDAKEGLCGIAMDSSYP 322
R+ KE CGIA SSYP
Sbjct: 354 RN---KENQCGIASASSYP 369
>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
Length = 208
Score = 247 bits (631), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 127/218 (58%), Positives = 150/218 (68%), Gaps = 12/218 (5%)
Query: 106 VPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSG 165
+P +DWRK GAVTP+KNQG CGSCWAFS V+ E I Q+ TG LISLSEQELV CD
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59
Query: 166 VDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANS 225
+HGC GG A+++II+N GI T+ANYPY+AV G C AS V I GY VP +
Sbjct: 60 -NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPC---QAASKVVSIDGYNGVPFCN 115
Query: 226 EEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWL 285
E AL +AVA QP V+IDAS + FQ YSSG+F+G CGT+L+HGVT VGY A YW+
Sbjct: 116 EXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQA-----NYWI 170
Query: 286 VKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
V+NSWG WGE+GYIRM R GLCGIA YPT
Sbjct: 171 VRNSWGRYWGEKGYIRMLR--VGGCGLCGIARLPYYPT 206
>sp|P25975|CATL1_BOVIN Cathepsin L1 OS=Bos taurus GN=CTSL1 PE=1 SV=3
Length = 334
Score = 247 bits (630), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 136/322 (42%), Positives = 197/322 (61%), Gaps = 19/322 (5%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
+ +L QW + + ++Y EE+ +R +++ N + I+ N + G +++++N F
Sbjct: 22 DPNLDAHWHQWKATHRRLYGMNEEEWRR-AVWEKNKKIIDLHNQEYSEGKHGFRMAMNAF 80
Query: 70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
D TN+EF+ NG++ +KG F ++DVP ++DW K G VTP+KNQG CGS
Sbjct: 81 GDMTNEEFRQVMNGFQNQ---KHKKGKLFHEPLLVDVPKSVDWTKKGYVTPVKNQGQCGS 137
Query: 130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
CWAFSA A EG TGKL+SLSEQ LV C + + GC GG M++AF++I N G+
Sbjct: 138 CWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLD 197
Query: 190 TEANYPYQAVD-GTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGS 247
+E +YPY A D +CN E S A G+ +P E+AL+KAVA P++V+IDA +
Sbjct: 198 SEESYPYLATDTNSCNYKPECS-AANDTGFVDIP-QREKALMKAVATVGPISVAIDAGHT 255
Query: 248 AFQFYSSGVFTG-DCGT-ELDHGVTAVGY---GATANGTKYWLVKNSWGTSWGEEGYIRM 302
+FQFY SG++ DC + +LDHGV VGY G +N K+W+VKNSWG WG GY++M
Sbjct: 256 SFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKM 315
Query: 303 KRDIDAKEGLCGIAMDSSYPTA 324
+D + CGIA +SYPT
Sbjct: 316 AKD---QNNHCGIATAASYPTV 334
>sp|Q5E998|CATL2_BOVIN Cathepsin L2 OS=Bos taurus GN=CTSL2 PE=2 SV=1
Length = 334
Score = 244 bits (623), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 135/322 (41%), Positives = 196/322 (60%), Gaps = 19/322 (5%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
+ +L QW + + ++Y EE+ +R +++ N + I+ N + G +++++N F
Sbjct: 22 DPNLDAHWHQWKATHRRLYGMNEEEWRR-AVWEKNKKIIDLHNQEYSEGKHGFRMAMNAF 80
Query: 70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
D TN+EF+ NG++ +KG F ++DVP ++DW K G VTP+KNQG CGS
Sbjct: 81 GDMTNEEFRQVMNGFQNQ---KHKKGKLFHEPLLVDVPKSVDWTKKGYVTPVKNQGQCGS 137
Query: 130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
CWAFSA A EG TGKL+SLSEQ LV C + + GC GG M++AF++I N +
Sbjct: 138 CWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGCLD 197
Query: 190 TEANYPYQAVD-GTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGS 247
+E +YPY A D +CN E S A G+ +P E+AL+KAVA P++V+IDA +
Sbjct: 198 SEESYPYLATDTNSCNYKPECS-AANDTGFVDIP-QREKALMKAVATVGPISVAIDAGHT 255
Query: 248 AFQFYSSGVFTG-DCGT-ELDHGVTAVGY---GATANGTKYWLVKNSWGTSWGEEGYIRM 302
+FQFY SG++ DC + +LDHGV VGY G +N K+W+VKNSWG WG GY++M
Sbjct: 256 SFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKM 315
Query: 303 KRDIDAKEGLCGIAMDSSYPTA 324
+D + CGIA +SYPT
Sbjct: 316 AKD---QNNHCGIATAASYPTV 334
>sp|Q9GL24|CATL1_CANFA Cathepsin L1 OS=Canis familiaris GN=CTSL1 PE=2 SV=1
Length = 333
Score = 244 bits (622), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 196/321 (61%), Gaps = 18/321 (5%)
Query: 13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
+ SL+ + QW + + ++Y EE +R +++ N++ IE N + G + +++N F
Sbjct: 22 DQSLNAQWYQWKATHRRLYGMNEEGWRR-AVWEKNMKMIELHNREYSQGKHGFTMAMNAF 80
Query: 70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
D TN+EF+ NG++ +KG F+ ++P ++DWR+ G VTP+KNQG CGS
Sbjct: 81 GDMTNEEFRQVMNGFQNQ---KHKKGKMFQEPLFAEIPKSVDWREKGYVTPVKNQGQCGS 137
Query: 130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
CWAFSA A EG TGKL+SLSEQ LV C + + GC GG M++AF+++ N G+
Sbjct: 138 CWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLD 197
Query: 190 TEANYPYQAVDG-TCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGS 247
+E +YPY D TCN E S A G+ +P E+AL+KAVA P++V+IDA
Sbjct: 198 SEESYPYLGRDTETCNYKPECS-AANDTGFVDLP-QREKALMKAVATLGPISVAIDAGHQ 255
Query: 248 AFQFYSSGV-FTGDCGT-ELDHGVTAVGYG--ATANGTKYWLVKNSWGTSWGEEGYIRMK 303
+FQFY SG+ F DC + +LDHGV VGYG T + K+W+VKNSWG WG GY++M
Sbjct: 256 SFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGWNGYVKMA 315
Query: 304 RDIDAKEGLCGIAMDSSYPTA 324
+D + CGIA +SYPT
Sbjct: 316 KD---QNNHCGIATAASYPTV 333
>sp|Q9GKL8|CATL1_CHLAE Cathepsin L1 OS=Chlorocebus aethiops GN=CTSL1 PE=1 SV=1
Length = 333
Score = 243 bits (621), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 194/321 (60%), Gaps = 22/321 (6%)
Query: 15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEFAD 71
SL + +W + + ++Y EE +R +++ N++ IE N + G + +++N F D
Sbjct: 24 SLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYSQGKHSFTMAMNTFGD 82
Query: 72 QTNQEFKAFRNGY--RRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
T++EF+ NG+ R+P RKG F+ + P ++DWR+ G VTP+KNQG CGS
Sbjct: 83 MTSEEFRQVMNGFQNRKP-----RKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGS 137
Query: 130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
CWAFSA A EG TGKL+SLSEQ LV C + GC GG M+ AF+++ N G+
Sbjct: 138 CWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVADNGGLD 197
Query: 190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
+E +YPY+A + +C K N VA G+ +P E+AL+KAVA P++V+IDA +
Sbjct: 198 SEESYPYEATEESC-KYNPEYSVANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHES 255
Query: 249 FQFYSSGV-FTGDCGTE-LDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRMK 303
F FY G+ F DC +E +DHGV VGYG ++ +KYWLVKNSWG WG GYI+M
Sbjct: 256 FMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNSKYWLVKNSWGEEWGMGGYIKMA 315
Query: 304 RDIDAKEGLCGIAMDSSYPTA 324
+D + CGIA +SYPT
Sbjct: 316 KD---RRNHCGIASAASYPTV 333
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
SV=2
Length = 322
Score = 243 bits (620), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 137/320 (42%), Positives = 189/320 (59%), Gaps = 16/320 (5%)
Query: 11 LQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA---GNKPYKLSIN 67
L A+ + E++ K+G+ Y + EE+ R +F DN+++IE N G Y L+IN
Sbjct: 11 LALAAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAIN 70
Query: 68 EFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPC 127
+F+D TN++F A GY++ R F + +DWR GAVTP+K+QG C
Sbjct: 71 QFSDMTNEKFNAVMKGYKK----GPRPAAVFTSTDAAPESTEVDWRTKGAVTPVKDQGQC 126
Query: 128 GSCWAFSAVAATEGITQLTTGKLISLSEQELVSC-DTSGVDHGCEGGEMEDAFKFIIHND 186
GSCWAFS EG L TG+L+SLSEQ+LV C S + GC GG +E A ++ N
Sbjct: 127 GSCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNG 186
Query: 187 GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDAS 245
G+ TE++YPY+A D TC + N + A GY + SE AL A + P++V+IDAS
Sbjct: 187 GVDTESSYPYEARDNTC-RFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDAS 245
Query: 246 GSAFQFYSSGV-FTGDC-GTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMK 303
+FQ Y +GV + C ++LDH V AVGYG+ G +WLVKNSW TSWGE GYI+M
Sbjct: 246 HRSFQSYYTGVYYEPSCSSSQLDHAVLAVGYGSEG-GQDFWLVKNSWATSWGESGYIKMA 304
Query: 304 RDIDAKEGLCGIAMDSSYPT 323
R+ + CGIA D+ YPT
Sbjct: 305 RN---RNNNCGIATDACYPT 321
>sp|P83443|MDO1_PSEMR Macrodontain-1 OS=Pseudananas macrodontes PE=1 SV=1
Length = 213
Score = 241 bits (616), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 110/218 (50%), Positives = 150/218 (68%), Gaps = 8/218 (3%)
Query: 106 VPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSG 165
VP ++DWR GAV +KNQGPCG CWAF+A+A EGI ++ G L+ LSEQE++ C
Sbjct: 2 VPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDC---A 58
Query: 166 VDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANS 225
V +GC+GG + A+ FII N+G+TT+ NYPY+A GTCN N + A I GY V N
Sbjct: 59 VSYGCKGGWVNRAYDFIISNNGVTTDENYPYRAYQGTCN-ANYFPNSAYITGYSYVRRND 117
Query: 226 EEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWL 285
E ++ AV+NQP+A IDASG FQ+Y GV++G CG L+H +T +GYG + YW+
Sbjct: 118 ESHMMYAVSNQPIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYGRDS----YWI 173
Query: 286 VKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
V+NSWG+SWG+ GY+R++RD+ G+CGIAM +PT
Sbjct: 174 VRNSWGSSWGQGGYVRIRRDVSHSGGVCGIAMSPLFPT 211
>sp|P07711|CATL1_HUMAN Cathepsin L1 OS=Homo sapiens GN=CTSL1 PE=1 SV=2
Length = 333
Score = 240 bits (613), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 192/321 (59%), Gaps = 22/321 (6%)
Query: 15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA---GNKPYKLSINEFAD 71
SL + +W + + ++Y EE +R +++ N++ IE N G + +++N F D
Sbjct: 24 SLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGKHSFTMAMNAFGD 82
Query: 72 QTNQEFKAFRNGY--RRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
T++EF+ NG+ R+P RKG F+ + P ++DWR+ G VTP+KNQG CGS
Sbjct: 83 MTSEEFRQVMNGFQNRKP-----RKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGS 137
Query: 130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
CWAFSA A EG TG+LISLSEQ LV C + GC GG M+ AF+++ N G+
Sbjct: 138 CWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLD 197
Query: 190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
+E +YPY+A + +C K N VA G+ +P E+AL+KAVA P++V+IDA +
Sbjct: 198 SEESYPYEATEESC-KYNPKYSVANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHES 255
Query: 249 FQFYSSGV-FTGDCGTE-LDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRMK 303
F FY G+ F DC +E +DHGV VGYG ++ KYWLVKNSWG WG GY++M
Sbjct: 256 FLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMA 315
Query: 304 RDIDAKEGLCGIAMDSSYPTA 324
+D + CGIA +SYPT
Sbjct: 316 KD---RRNHCGIASAASYPTV 333
>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
Length = 331
Score = 239 bits (611), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/328 (41%), Positives = 192/328 (58%), Gaps = 13/328 (3%)
Query: 1 IAASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AA 57
+ S + ++ +L + W YGK YK E+ R I++ N++ + N +
Sbjct: 9 LLCSSAMAHVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVTLHNLEHSM 68
Query: 58 GNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA 117
G Y+L +N D T++E + + R P + ++K + +P +MDWR+ G
Sbjct: 69 GMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWP--RNVTYKSDPNQKLPDSMDWREKGC 126
Query: 118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGV-DHGCEGGEME 176
VT +K QG CGSCWAFSAV A E +L TGKL+SLS Q LV C T+ + GC GG M
Sbjct: 127 VTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMT 186
Query: 177 DAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ 236
+AF++II N+GI +EA+YPY+A+DG C + + + A Y +P SEEAL +AVAN+
Sbjct: 187 EAFQYIIDNNGIDSEASYPYKAMDGKC-QYDVKNRAATCSRYIELPFGSEEALKEAVANK 245
Query: 237 -PVAVSIDASGSAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSW 294
PV+V IDAS S+F Y +GV+ C ++HGV VGYG +G YWLVKNSWG +
Sbjct: 246 GPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYG-NLDGKDYWLVKNSWGLHF 304
Query: 295 GEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
G++GYIRM R+ CGIA SYP
Sbjct: 305 GDQGYIRMARN---SGNHCGIANYPSYP 329
>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
Length = 334
Score = 239 bits (611), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/313 (42%), Positives = 192/313 (61%), Gaps = 19/313 (6%)
Query: 22 QWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEFADQTNQEFK 78
+W + +G++Y EE +R +++ N++ IE N + G + +++N F D TN+EF+
Sbjct: 31 KWKATHGRLYGMNEEGWRR-AVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFR 89
Query: 79 AFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAA 138
NG++ +KG F V++VP ++DWR+ G VT +KNQG CGSCWAFSA A
Sbjct: 90 QVMNGFQNQ---KHKKGKVFHESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGA 146
Query: 139 TEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQA 198
EG TGKL+SLSEQ LV C + GC GG M++AF+++ N G+ TE +YPY
Sbjct: 147 LEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLG 206
Query: 199 VD-GTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGSAFQFYSSGV 256
+ +C E S A G+ +P E+AL+KAVA P++V+IDA S+FQFY SG+
Sbjct: 207 RETNSCTYKPECS-AANDTGFVDIP-QREKALMKAVATVGPISVAIDAGHSSFQFYKSGI 264
Query: 257 FTG-DCGT-ELDHGVTAVGY---GATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEG 311
+ DC + +LDHGV VGY G +N +K+W+VKNSWG WG GY++M +D +
Sbjct: 265 YYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKD---QNN 321
Query: 312 LCGIAMDSSYPTA 324
CGI+ +SYPT
Sbjct: 322 HCGISTAASYPTV 334
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
SV=1
Length = 321
Score = 239 bits (609), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 145/329 (44%), Positives = 197/329 (59%), Gaps = 17/329 (5%)
Query: 1 IAASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA--- 57
+AA + L AS S H + ++YG+ Y + +E+ R R+F+ N + IE N
Sbjct: 3 VAALFLCGLALATASPSWDH--FKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFEN 60
Query: 58 GNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA 117
G +K+++N+F D TN+EF A GY++ G F E + A +DWR
Sbjct: 61 GEVTFKVAMNQFGDMTNEEFNAVMKGYKK--GSRGEPKAVFTAE-AGPMAADVDWRTKAL 117
Query: 118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMED 177
VTP+K+Q CGSCWAFSA A EG L +L+SLSEQ+LV C T + GC GG M
Sbjct: 118 VTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTS 177
Query: 178 AFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ- 236
AF +I N GI TE++YPY+A D +C +A+ + I ++EEAL +AV+
Sbjct: 178 AFDYIKDNGGIDTESSYPYEAEDRSCRF--DANSIGAICTGSVEVQHTEEALQEAVSGVG 235
Query: 237 PVAVSIDASGSAFQFYSSGV-FTGDCG-TELDHGVTAVGYGATANGTKYWLVKNSWGTSW 294
P++V+IDAS +FQFYSSGV + +C T LDHGV AVGYG T + YWLVKNSWG+SW
Sbjct: 236 PISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVGYG-TESTKDYWLVKNSWGSSW 294
Query: 295 GEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
G+ GYI+M R+ D CGIA + SYPT
Sbjct: 295 GDAGYIKMSRNRDNN---CGIASEPSYPT 320
>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
Length = 323
Score = 238 bits (608), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 182/311 (58%), Gaps = 17/311 (5%)
Query: 21 EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNA---AGNKPYKLSINEFADQTNQEF 77
E + +K+GK Y N EE+ R +F D ++FI+ N G Y L IN F+D T++E
Sbjct: 21 ENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEV 80
Query: 78 KAFRNGYRRPDGLTSRKGTSFKYENVIDVP--ATMDWRKNGAVTPIKNQGPCGSCWAFSA 135
A + G R R S ++ P A +DWR GAVTP+K+QG CGSCWAFSA
Sbjct: 81 LATKTGMTR-----RRHPLSVLPKSAPTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFSA 135
Query: 136 VAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYP 195
VAA EG L TG L+SLSEQ LV C +S + GC GG A+++II N GI TE++YP
Sbjct: 136 VAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESSYP 195
Query: 196 YQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQFYSS 254
Y+A+D C + + + A + Y + E AL AV N+ PV+V IDA S+F Y
Sbjct: 196 YKAIDDNC-RYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSSFGSYGG 254
Query: 255 GV-FTGDCGTEL-DHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGL 312
GV + +C + +H VTAVGYG ANG YW+VKNSWG WGE GYI+M R+ D
Sbjct: 255 GVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARNRDNN--- 311
Query: 313 CGIAMDSSYPT 323
C IA S YP
Sbjct: 312 CAIATYSVYPV 322
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.312 0.129 0.385
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 123,154,346
Number of Sequences: 539616
Number of extensions: 5230097
Number of successful extensions: 12312
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 223
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 11327
Number of HSP's gapped (non-prelim): 276
length of query: 324
length of database: 191,569,459
effective HSP length: 118
effective length of query: 206
effective length of database: 127,894,771
effective search space: 26346322826
effective search space used: 26346322826
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 61 (28.1 bits)