BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019447
(341 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
lycopersicum PE=2 SV=1
Length = 346
Score = 310 bits (794), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 158/304 (51%), Positives = 202/304 (66%), Gaps = 11/304 (3%)
Query: 24 LLIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGG 83
+L+ +++ SC G+CWAFSA A+E IN IVTG+L+SLSEQEL+DCDRSYN GC GG
Sbjct: 29 VLVGVKDQGSC----GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSYNEGCDGG 84
Query: 84 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 143
LMDYA++FVIKN GIDTE+DYPY+ + G C++ + N +V ID Y+DVP NNEK L +AV
Sbjct: 85 LMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNNEKALQKAV 144
Query: 144 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSW 203
QPVS+ + R FQ Y SGIFTG C T++DH V+I GY +ENG+DYWI++NSWG +
Sbjct: 145 AHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGTENGMDYWIVRNSWGANC 204
Query: 204 GMNGYMHMQRNTGNSLGICGINMLASYPTKTG------QNPPPSPPPGPTRCSLLTYCAA 257
NGY+ +QRN +S G+CG+ + SYP KTG PPSP PT C + CA
Sbjct: 205 RENGYLRVQRNVSSSSGLCGLAIEPSYPVKTGPNPPKPAPSPPSPVKPPTECDEYSQCAV 264
Query: 258 GETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAA 317
G TCCC C SW CC A CC DH CCP +YPIC+ VR + GN
Sbjct: 265 GTTCCCILQFRRSCFSWGCCPLEGATCCEDHYSCCPHDYPICN-VRQGTCSMSKGNPLGV 323
Query: 318 EAIE 321
+A++
Sbjct: 324 KAMK 327
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 307 bits (787), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 149/275 (54%), Positives = 186/275 (67%), Gaps = 6/275 (2%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
G+CWAFS GA+EGIN+IVTG L++LSEQEL+DCD SYN GC GGLMDYA++F+IKN GI
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218
Query: 99 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
DT+KDYPY+G G C++ + N +VTID Y+DVP +E+ L +AV QP+S+ I RA
Sbjct: 219 DTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRA 278
Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
FQLY SGIF G C T LDH V+ VGY +ENG DYWI++NSWG+SWG +GY+ M RN +S
Sbjct: 279 FQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASS 338
Query: 219 LGICGINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICL 272
G CGI + SYP K G+ PPSP PT+C C TCCC C
Sbjct: 339 SGKCGIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEYGKYCF 398
Query: 273 SWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 307
+W CC +A CC D+ CCP YP+CD + CL
Sbjct: 399 AWGCCPLEAATCCDDNYSCCPHEYPVCDLDQGTCL 433
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 305 bits (782), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 152/275 (55%), Positives = 182/275 (66%), Gaps = 6/275 (2%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
G+CWAFSA A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA+ F+I N GI
Sbjct: 151 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGI 210
Query: 99 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
DTE DYPY+G+ +C+ + N +VTID Y+DV N+E L +AV QPVSV I RA
Sbjct: 211 DTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRA 270
Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
FQLYSSGIFTG C T+LDH V VGY +ENG DYWI++NSWG+SWG +GY+ M+RN S
Sbjct: 271 FQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKAS 330
Query: 219 LGICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICL 272
G CGI + SYP K G+NPP P P+ C C TCCC C
Sbjct: 331 SGKCGIAVEPSYPLKKGENPPNPGPTPPSPTPPPTVCDNYYTCPDSTTCCCIYEYGKYCY 390
Query: 273 SWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 307
+W CC A CC DH CCP YPIC+ + CL
Sbjct: 391 AWGCCPLEGATCCDDHYSCCPHEYPICNVQQGTCL 425
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 294 bits (753), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 146/291 (50%), Positives = 189/291 (64%), Gaps = 17/291 (5%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDY 87
+N+ C G+CWAFSA +E IN++VTG +++LSEQEL++C + NSGC GGLMD
Sbjct: 157 KNQGQC----GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDD 212
Query: 88 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 147
A+ F+IKN GIDTE DYPY+ G+C+ + N +V+IDG++DVP+N+EK L +AV QP
Sbjct: 213 AFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQP 272
Query: 148 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNG 207
VSV I R FQLY SG+F+G C TSLDH V+ VGY ++NG DYWI++NSWG WG +G
Sbjct: 273 VSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESG 332
Query: 208 YMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR------------CSLLTYC 255
Y+ M+RN + G CGI M+ASYPTK+G NPP P PT C C
Sbjct: 333 YVRMERNINVTTGKCGIAMMASYPTKSGANPPKPSPTPPTPPTPPPPSAPDHVCDDNFSC 392
Query: 256 AAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 306
AG TCCC +CL W CC A CC DH CCP +YP+C++ C
Sbjct: 393 PAGSTCCCAFGFRNLCLVWGCCPVEGATCCKDHASCCPPDYPVCNTRAGTC 443
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 291 bits (745), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 146/290 (50%), Positives = 185/290 (63%), Gaps = 11/290 (3%)
Query: 24 LLIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGG 82
++ +N+ C G+CWAFSA A+EGINKIVTG LVSLSEQEL++C R+ NSGC G
Sbjct: 167 VVAPVKNQGQC----GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNG 222
Query: 83 GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA 142
G+MD A+ F+ +N G+DTE+DYPY G+CN K +R +V+IDG++DVPEN+E L +A
Sbjct: 223 GIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSRKVVSIDGFEDVPENDELSLQKA 282
Query: 143 VVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWG 200
V QPVSV I R FQLY SG+FTG C T+LDH V+ VGY D+ G YW ++NSWG
Sbjct: 283 VAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWG 342
Query: 201 RSWGMNGYMHMQRNTGNSLGICGINMLASYP----TKTGQNPPPSPPPGPTRCSLLTYCA 256
WG NGY+ M+RN G CGI M+ASYP +PP P P +C + C
Sbjct: 343 PDWGENGYIRMERNVTARTGKCGIAMMASYPIKKGPNPKPSPPSPAPSPPQQCDRYSKCP 402
Query: 257 AGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 306
AG TCCC I C+ W CC A CC DH CCP YP+C++ C
Sbjct: 403 AGTTCCCNYGIRNHCIVWGCCPVEGATCCKDHSTCCPKEYPVCNAKARTC 452
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 275 bits (703), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 127/200 (63%), Positives = 151/200 (75%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
G+CWAFS A+EGINKIVTG LVSLSEQEL+DCD+SYN GC GGLMDYA+QF++KN G+
Sbjct: 122 GSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGL 181
Query: 99 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
+TEKDYPY G G+CN N +VTIDGY+DVP +E L +AV QPVSV I RA
Sbjct: 182 NTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRA 241
Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
FQ Y SGIFTG C T++DHAV+ VGY SENGVDYWI++NSWG WG +GY+ M+RN +
Sbjct: 242 FQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVASK 301
Query: 219 LGICGINMLASYPTKTGQNP 238
G CGI + ASYP K NP
Sbjct: 302 SGKCGIAIEASYPVKYSPNP 321
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 269 bits (687), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 127/201 (63%), Positives = 151/201 (75%), Gaps = 1/201 (0%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
G+CWAFS T A+EGINKIVTG L+SLSEQEL+DCD+SYN GC GGLMDYA+QF++KN G+
Sbjct: 167 GSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGL 226
Query: 99 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
+TEKDYPYRG G+CN N +V+IDGY+DVP +E L +A+ QPVSV I R
Sbjct: 227 NTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRI 286
Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
FQ Y SGIFTG C T+LDHAV+ VGY SENGVDYWI++NSWG WG GY+ M+RN S
Sbjct: 287 FQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMERNLAAS 346
Query: 219 L-GICGINMLASYPTKTGQNP 238
G CGI + ASYP K NP
Sbjct: 347 KSGKCGIAVEASYPVKYSPNP 367
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 254 bits (650), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 117/210 (55%), Positives = 150/210 (71%), Gaps = 4/210 (1%)
Query: 25 LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 84
+ + +N+ SC G+CWAFS A+EGINKIVTG+L +LSEQELIDCD +YN+GC GGL
Sbjct: 150 VAEVKNQGSC----GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGL 205
Query: 85 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 144
MDYA+++++KN G+ E+DYPY + G C QK VTI+G++DVP N+EK LL+A+
Sbjct: 206 MDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALA 265
Query: 145 AQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 204
QP+SV I S R FQ YS G+F G C LDH V VGY S G DY I+KNSWG WG
Sbjct: 266 HQPLSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWG 325
Query: 205 MNGYMHMQRNTGNSLGICGINMLASYPTKT 234
GY+ ++RNTG G+CGIN +AS+PTKT
Sbjct: 326 EKGYIRLKRNTGKPEGLCGINKMASFPTKT 355
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 253 bits (646), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 119/213 (55%), Positives = 159/213 (74%), Gaps = 7/213 (3%)
Query: 25 LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGG 83
++ +++ +C G+CWAFSA GA+EGIN+I TG L+SLSEQEL+DCDR + N+GC GG
Sbjct: 142 VVSVKDQGNC----GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGG 197
Query: 84 LMDYAYQFVIKNHGIDTEKDYPYRGQA-GQCNKQKLNR-HIVTIDGYKDVPENNEKQLLQ 141
+M+YA++F++KN GI+T++DYPY G CN K N +VTIDGY+DVP ++EK L +
Sbjct: 198 IMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKK 257
Query: 142 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 201
AV QPVSV I S +AFQLY SG+ TG C SLDH V++VGY S +G DYWII+NSWG
Sbjct: 258 AVAHQPVSVAIEASSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGL 317
Query: 202 SWGMNGYMHMQRNTGNSLGICGINMLASYPTKT 234
+WG +GY+ +QRN + G CGI M+ SYPTK+
Sbjct: 318 NWGDSGYVKLQRNIDDPFGKCGIAMMPSYPTKS 350
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 249 bits (637), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 120/204 (58%), Positives = 141/204 (69%), Gaps = 2/204 (0%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
G+CWAFS ++EGIN+I TG LVSLSEQEL+DCD SYN GC GGLMDYA++F+ KN GI
Sbjct: 152 GSCWAFSTIASVEGINQIKTGELVSLSEQELVDCDTSYNEGCNGGLMDYAFEFIQKN-GI 210
Query: 99 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
TE YPY Q G C LN +V+IDG++DVP NNE L+QAV QP+SV I S
Sbjct: 211 TTEDSYPYAEQDGTCASNLLNSPVVSIDGHQDVPANNENALMQAVANQPISVSIEASGYG 270
Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 217
FQ YS G+FTG C T LDH V IVGY + +G YWI+KNSWG WG +GY+ MQR +
Sbjct: 271 FQFYSEGVFTGRCGTELDHGVAIVGYGATRDGTKYWIVKNSWGEEWGESGYIRMQRGISD 330
Query: 218 SLGICGINMLASYPTKTGQNPPPS 241
G CGI M ASYP KT NP S
Sbjct: 331 KRGKCGIAMEASYPIKTSANPKNS 354
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 248 bits (633), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 114/196 (58%), Positives = 141/196 (71%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
G+CWAFS A+EGIN+I TG+L SLSEQELIDCD ++NSGC GGLMDYA+Q++I G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 99 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
E DYPY + G C +QK + VTI GY+DVPEN+++ L++A+ QPVSV I S R
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278
Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
FQ Y G+F G C T LDH V VGY S G DY I+KNSWG WG G++ M+RNTG
Sbjct: 279 FQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKP 338
Query: 219 LGICGINMLASYPTKT 234
G+CGIN +ASYPTKT
Sbjct: 339 EGLCGINKMASYPTKT 354
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 241 bits (614), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 117/227 (51%), Positives = 145/227 (63%), Gaps = 5/227 (2%)
Query: 25 LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 84
+ + +N+ C G+CWAFS A+EGINKI T LVSLSEQEL+DCD N GC GGL
Sbjct: 140 VTEIKNQGKC----GSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTKQNEGCNGGL 195
Query: 85 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 144
M+ A++F+ KN GI TE YPY G G+C+ K N +VTIDG++DVPEN+E LL+AV
Sbjct: 196 MEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVA 255
Query: 145 AQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 204
QPVSV I FQ YS G+FTG C T L+H V VGY SE G YWI++NSWG WG
Sbjct: 256 NQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGSERGKKYWIVRNSWGAEWG 315
Query: 205 MNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSL 251
GY+ ++R G CGI M ASYP K + P+P G + L
Sbjct: 316 EGGYIKIEREIDEPEGRCGIAMEASYPIKL-SSSNPTPKDGDVKDEL 361
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 237 bits (605), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 117/219 (53%), Positives = 147/219 (67%), Gaps = 6/219 (2%)
Query: 25 LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS-GCGGG 83
++ +++ C G CWAFSA +EGINKIVTG L+SLSEQELIDC R+ N+ GC GG
Sbjct: 139 VVDIKSQGEC----GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGG 194
Query: 84 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 143
+ +QF+I N GI+TE++YPY Q G+CN N VTID Y++VP NNE L AV
Sbjct: 195 YITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNNEWALQTAV 254
Query: 144 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSW 203
QPVSV + + AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI+KNSW +W
Sbjct: 255 TYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIVKNSWDTTW 314
Query: 204 GMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 242
G GYM + RN G + G CGI + SYP K P P
Sbjct: 315 GEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNHPKP 352
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 237 bits (604), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 117/219 (53%), Positives = 147/219 (67%), Gaps = 6/219 (2%)
Query: 25 LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS-GCGGG 83
++ +++ C G CWAFSA +EGINKIVTG L+SLSEQELIDC R+ N+ GC GG
Sbjct: 139 VVDIKSQGEC----GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGG 194
Query: 84 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 143
+ +QF+I N GI+TE++YPY Q G+CN N VTID Y++VP NNE L AV
Sbjct: 195 YITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPYNNEWALQTAV 254
Query: 144 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSW 203
QPVSV + + AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI+KNSW +W
Sbjct: 255 TYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWIVKNSWDTTW 314
Query: 204 GMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 242
G GYM + RN G + G CGI + SYP K P P
Sbjct: 315 GEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNHPKP 352
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 234 bits (597), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 109/209 (52%), Positives = 141/209 (67%), Gaps = 1/209 (0%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
G+CWAFS A+EGIN+I T LVSLSEQEL+DCD+ N GC GGLM+ A++F+ + GI
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209
Query: 99 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
TE +YPY Q G C++ K+N V+IDG+++VP N+E LL+AV QPVSV I
Sbjct: 210 TTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSD 269
Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 217
FQ YS G+FTG C+T L+H V IVGY + +G +YWI++NSWG WG GY+ MQRN
Sbjct: 270 FQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISK 329
Query: 218 SLGICGINMLASYPTKTGQNPPPSPPPGP 246
G+CGI M+ASYP K + P P
Sbjct: 330 KEGLCGIAMMASYPIKNSSDNPTGSLSSP 358
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 234 bits (597), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 108/202 (53%), Positives = 140/202 (69%), Gaps = 1/202 (0%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
G+CWAFS A+EGIN+I T LV+LSEQEL+DCD+ N GC GGLM+ A++F+ + GI
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209
Query: 99 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
TE +YPY+ Q G C+ K+N V+IDG+++VP N+E LL+AV QPVSV I
Sbjct: 210 TTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSD 269
Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 217
FQ YS G+FTG CST L+H V IVGY + +G +YWI++NSWG WG +GY+ MQRN
Sbjct: 270 FQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISK 329
Query: 218 SLGICGINMLASYPTKTGQNPP 239
G+CGI ML SYP K + P
Sbjct: 330 KEGLCGIAMLPSYPIKNSSDNP 351
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 233 bits (594), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 112/208 (53%), Positives = 145/208 (69%), Gaps = 4/208 (1%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
G+CWAFS ++EGIN I TGSLVSLSEQELIDCD + N GC GGLMD A++++ N G+
Sbjct: 156 GSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGL 215
Query: 99 DTEKDYPYRGQAGQCNKQKLNRH---IVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 155
TE YPYR G CN + ++ +V IDG++DVP N+E+ L +AV QPVSV + S
Sbjct: 216 ITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEAS 275
Query: 156 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRN 214
+AF YS G+FTG C T LDH V +VGY +E+G YW +KNSWG SWG GY+ ++++
Sbjct: 276 GKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKD 335
Query: 215 TGNSLGICGINMLASYPTKTGQNPPPSP 242
+G S G+CGI M ASYP KT P P+P
Sbjct: 336 SGASGGLCGIAMEASYPVKTYSKPKPTP 363
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 231 bits (589), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 111/213 (52%), Positives = 151/213 (70%), Gaps = 6/213 (2%)
Query: 25 LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 84
++ +N+ C G+CWAFS A+EGIN+IVTG L+SLSEQ+L+DC + N GC GG
Sbjct: 15 VVPVKNQGGC----GSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC-TTANHGCRGGW 69
Query: 85 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 144
M+ A+QF++ N GI++E+ YPYRGQ G CN +N +V+ID Y++VP +NE+ L +AV
Sbjct: 70 MNPAFQFIVNNGGINSEETYPYRGQDGICNS-TVNAPVVSIDSYENVPSHNEQSLQKAVA 128
Query: 145 AQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 204
QPVSV + + R FQLY SGIFTG C+ S +HA+ +VGY +EN D+WI+KNSWG++WG
Sbjct: 129 NQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIVKNSWGKNWG 188
Query: 205 MNGYMHMQRNTGNSLGICGINMLASYPTKTGQN 237
+GY+ +RN N G CGI ASYP K G N
Sbjct: 189 ESGYIRAERNIENPDGKCGITRFASYPVKKGTN 221
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 230 bits (587), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 111/209 (53%), Positives = 135/209 (64%), Gaps = 1/209 (0%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
G+CWAFS A+EGIN+I T LVSLSEQEL+DCD N GC GGLMDYA++F+ + GI
Sbjct: 148 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGI 207
Query: 99 DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
TE +YPY G C+ K N V+IDG+++VPEN+E LL+AV QPVSV I
Sbjct: 208 TTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSD 267
Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 217
FQ YS G+FTG C T LDH V IVGY + +G YW +KNSWG WG GY+ M+R +
Sbjct: 268 FQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISD 327
Query: 218 SLGICGINMLASYPTKTGQNPPPSPPPGP 246
G+CGI M ASYP K N P P
Sbjct: 328 KEGLCGIAMEASYPIKKSSNNPSGIKSSP 356
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 230 bits (587), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 107/196 (54%), Positives = 143/196 (72%), Gaps = 2/196 (1%)
Query: 40 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 99
+CWAFS GA+EG+NKIVTG LV+LSEQ+LI+C++ N+GCGGG ++ AY+F++KN G+
Sbjct: 160 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLG 218
Query: 100 TEKDYPYRGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
T+ DYPY+ G C+ + K N V IDGY+++P N+E L++AV QPV+ I S R
Sbjct: 219 TDNDYPYKAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSRE 278
Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
FQLY SG+F G C T+L+H V++VGY +ENG DYW++KNS G +WG GYM M RN N
Sbjct: 279 FQLYESGVFDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANP 338
Query: 219 LGICGINMLASYPTKT 234
G+CGI M ASYP K
Sbjct: 339 RGLCGIAMRASYPLKN 354
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 228 bits (582), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 111/206 (53%), Positives = 143/206 (69%), Gaps = 4/206 (1%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
G+CWAFS ++EGIN I TGSLVSLSEQELIDCD + N GC GGLMD A++++ N G+
Sbjct: 156 GSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGL 215
Query: 99 DTEKDYPYRGQAGQCNKQKLNRH---IVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 155
TE YPYR G CN + ++ +V IDG++DVP N+E+ L +AV QPVSV + S
Sbjct: 216 ITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEAS 275
Query: 156 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRN 214
+AF YS G+FTG C T LDH V +VGY +E+G YW +KNSWG SWG GY+ ++++
Sbjct: 276 GKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKD 335
Query: 215 TGNSLGICGINMLASYPTKTGQNPPP 240
+G S G+CGI M ASYP KT P P
Sbjct: 336 SGASGGLCGIAMEASYPVKTYNKPMP 361
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 226 bits (577), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 105/195 (53%), Positives = 141/195 (72%), Gaps = 2/195 (1%)
Query: 40 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 99
+CWAFS GA+EG+NKIVTG LV+LSEQ+LI+C++ N+GCGGG ++ AY+F++ N G+
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLG 225
Query: 100 TEKDYPYRGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
T+ DYPY+ G C + K + V IDGY+++P N+E L++AV QPV+ + S R
Sbjct: 226 TDNDYPYKALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSRE 285
Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
FQLY SG+F G C T+L+H V++VGY +ENG DYWI+KNS G +WG GYM M RN N
Sbjct: 286 FQLYESGVFDGTCGTNLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANP 345
Query: 219 LGICGINMLASYPTK 233
G+CGI M ASYP K
Sbjct: 346 RGLCGIAMRASYPLK 360
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 226 bits (576), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 110/216 (50%), Positives = 141/216 (65%), Gaps = 6/216 (2%)
Query: 25 LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 84
+ + +N+ C G+CWAFS A+EGINKI T LVSLSEQEL+DCD N GC GGL
Sbjct: 138 VTEVKNQQDC----GSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGL 193
Query: 85 MDYAYQFVIKNHGIDTEKDYPYRGQAGQ-CNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 143
M+ A++F+ N GI TE+ YPY Q C + VTIDG++ VPEN+E++LL+AV
Sbjct: 194 MEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAV 253
Query: 144 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRS 202
QPVSV I FQLYS G+F G C T L+H V+IVGY +++NG YWI++NSWG
Sbjct: 254 AHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPE 313
Query: 203 WGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP 238
WG GY+ ++R + G CGI M ASYPTK P
Sbjct: 314 WGEGGYVRIERGISENEGRCGIAMEASYPTKLSSTP 349
>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
Length = 215
Score = 224 bits (570), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 107/206 (51%), Positives = 146/206 (70%), Gaps = 7/206 (3%)
Query: 28 FRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDY 87
+N+ C G+CWAFSA A+E INKI TG L+SLSEQEL+DCD + + GC GG M+
Sbjct: 16 IKNQKQC----GSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTA-SHGCNGGWMNN 70
Query: 88 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 147
A+Q++I N GIDT+++YPY G C +L +V+I+G++ V NNE L AV +QP
Sbjct: 71 AFQYIITNGGIDTQQNYPYSAVQGSCKPYRL--RVVSINGFQRVTRNNESALQSAVASQP 128
Query: 148 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNG 207
VSV + + FQ YSSGIFTGPC T+ +H V+IVGY +++G +YWI++NSWG++WG G
Sbjct: 129 VSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWIVRNSWGQNWGNQG 188
Query: 208 YMHMQRNTGNSLGICGINMLASYPTK 233
Y+ M+RN +S G+CGI L SYPTK
Sbjct: 189 YIWMERNVASSAGLCGIAQLPSYPTK 214
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 221 bits (564), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 105/212 (49%), Positives = 137/212 (64%), Gaps = 5/212 (2%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
+N+ C G+CWAFS A+EGIN+I T L SLSEQEL+DCD + N GC GGLMD A
Sbjct: 142 KNQGQC----GSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLA 197
Query: 89 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 148
++F+ + G+ +E YPY+ C+ K N +V+IDG++DVP+N+E L++AV QPV
Sbjct: 198 FEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPV 257
Query: 149 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNG 207
SV I FQ YS G+FTG C T L+H V +VGY + +G YWI+KNSWG WG G
Sbjct: 258 SVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKG 317
Query: 208 YMHMQRNTGNSLGICGINMLASYPTKTGQNPP 239
Y+ MQR + G+CGI M ASYP K P
Sbjct: 318 YIRMQRGIRHKEGLCGIAMEASYPLKNSNTNP 349
>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 211 bits (538), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 103/209 (49%), Positives = 143/209 (68%), Gaps = 6/209 (2%)
Query: 25 LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 84
++ +N+ C G+CWAF A A+EGIN+IVTG L+SLSEQ+L+DC + N GC GG
Sbjct: 15 VVPVKNQGGC----GSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCS-TRNHGCEGGW 69
Query: 85 MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 144
A+Q++I N GI++E+ YPY G G C+ K N H+V+ID Y++VP N+EK L +AV
Sbjct: 70 PYRAFQYIINNGGINSEEHYPYTGTNGTCDT-KENAHVVSIDSYRNVPSNDEKSLQKAVA 128
Query: 145 AQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 204
QPVSV + + R FQLY +GIFTG C+ S +H + G ++EN DYW +KNSWG++WG
Sbjct: 129 NQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTVGGRETENDKDYWTVKNSWGKNWG 188
Query: 205 MNGYMHMQRNTGNSLGICGINMLASYPTK 233
+GY+ ++RN S G CGI + SYP K
Sbjct: 189 ESGYIRVERNIAESSGKCGIAISPSYPIK 217
>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
Length = 208
Score = 202 bits (513), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 106/206 (51%), Positives = 133/206 (64%), Gaps = 14/206 (6%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
+N+ SC G+CWAFS +E IN+I TG+L+SLSEQEL+DCD+ N GC GG +A
Sbjct: 17 KNQGSC----GSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK-NHGCLGGAFVFA 71
Query: 89 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 148
YQ++I N GIDT+ +YPY+ G C + +V+IDGY VP NE L QAV QP
Sbjct: 72 YQYIINNGGIDTQANYPYKAVQGPC---QAASKVVSIDGYNGVPFCNEXALKQAVAVQPS 128
Query: 149 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 208
+V I S FQ YSSGIF+GPC T L+H V IVGY + +YWI++NSWGR WG GY
Sbjct: 129 TVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQA----NYWIVRNSWGRYWGEKGY 184
Query: 209 MHMQRNTGNSLGICGINMLASYPTKT 234
+ M R G G+CGI L YPTK
Sbjct: 185 IRMLRVGG--CGLCGIARLPYYPTKA 208
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 193 bits (490), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 99/205 (48%), Positives = 131/205 (63%), Gaps = 6/205 (2%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
+N+ +C G+CWAFS +EGINKIVTG+L+ LSEQEL+DCD+ ++ GC GG +
Sbjct: 151 KNQGAC----GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDK-HSYGCKGGYQTTS 205
Query: 89 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 148
Q+V N+G+ T K YPY+ + +C V I GYK VP N E L A+ QP+
Sbjct: 206 LQYVA-NNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPL 264
Query: 149 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 208
SV + + FQLY SG+F GPC T LDHAV VGY + +G +Y IIKNSWG +WG GY
Sbjct: 265 SVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGY 324
Query: 209 MHMQRNTGNSLGICGINMLASYPTK 233
M ++R +GNS G CG+ + YP K
Sbjct: 325 MRLKRQSGNSQGTCGVYKSSYYPFK 349
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 190 bits (482), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 98/213 (46%), Positives = 138/213 (64%), Gaps = 10/213 (4%)
Query: 27 QFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 86
+ +N++ C G+CW+F+A +EGI KI TG LVSLSEQE++DC SY GC GG ++
Sbjct: 137 EVKNQNPC----GSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSY--GCKGGWVN 190
Query: 87 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 146
AY F+I N+G+ TE++YPY G CN + I GY V N+E+ ++ AV Q
Sbjct: 191 KAYDFIISNNGVTTEENYPYLAYQGTCNANSF-PNSAYITGYSYVRRNDERSMMYAVSNQ 249
Query: 147 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSEN-GVDYWIIKNSWGRSWGM 205
P++ I SE FQ Y+ G+F+GPC TSL+HA+ I+GY ++ G YWI++NSWG SWG
Sbjct: 250 PIAALIDASEN-FQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGE 308
Query: 206 NGYMHMQRNTGNSLGICGINMLASYPT-KTGQN 237
GY+ M R +S G+CGI M +PT ++G N
Sbjct: 309 GGYVRMARGVSSSSGVCGIAMAPLFPTLQSGAN 341
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 188 bits (478), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 92/196 (46%), Positives = 131/196 (66%), Gaps = 6/196 (3%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 97
G+C++FS TG++EG+ I TG LVSLSEQ ++DC S+ N GC GGLM A++++IKN+G
Sbjct: 143 GSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNG 202
Query: 98 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 157
+++E+ YPY + K + I YK++ +E L A++ PVSV I S
Sbjct: 203 LNSEEQYPYEMKVNDECKFQEGSVAAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHN 262
Query: 158 AFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 215
+FQLY++G++ P S LDH VL VG ++NG DY+I+KNSWG SWG+NGY+HM RN
Sbjct: 263 SFQLYTAGVYYEPACSSEDLDHGVLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHMARNK 322
Query: 216 GNSLGICGINMLASYP 231
N+ CGI+ +ASYP
Sbjct: 323 DNN---CGISTMASYP 335
>sp|P22895|P34_SOYBN P34 probable thiol protease OS=Glycine max PE=1 SV=1
Length = 379
Score = 187 bits (474), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 139/220 (63%), Gaps = 18/220 (8%)
Query: 24 LLIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGG 83
++ Q + + C G WAFSATGAIE + I TG LVSLSEQEL+DC + G G
Sbjct: 146 VITQVKYQGGC----GRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGSYNG 200
Query: 84 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV-------PENNE 136
++++V+++ GI T+ DYPYR + G+C K+ + VTIDGY+ + E
Sbjct: 201 WQYQSFEWVLEHGGIATDDDYPYRAKEGRCKANKI-QDKVTIDGYETLIMSDESTESETE 259
Query: 137 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTS---LDHAVLIVGYDSENGVDYW 193
+ L A++ QP+SV I + F LY+ GI+ G TS ++H VL+VGY S +GVDYW
Sbjct: 260 QAFLSAILEQPISVSIDAKD--FHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYW 317
Query: 194 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 233
I KNSWG WG +GY+ +QRNTGN LG+CG+N ASYPTK
Sbjct: 318 IAKNSWGFDWGEDGYIWIQRNTGNLLGVCGMNYFASYPTK 357
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 186 bits (473), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 99/201 (49%), Positives = 132/201 (65%), Gaps = 13/201 (6%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 97
G+CWAFS+TGA+EG + G LVSLSEQ L+DC Y N+GC GGLMD A++++ N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 98 IDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAVVAQ-PVSVGICG 154
IDTEK YPY G C+ N+ + T G+ D+PE +E+++ +AV PVSV I
Sbjct: 204 IDTEKSYPYEGIDDSCH---FNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDA 260
Query: 155 SERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHM 211
S +FQLYS G++ P +LDH VL+VGY + E+G+DYW++KNSWG +WG GY+ M
Sbjct: 261 SHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKM 320
Query: 212 QRNTGNSLGICGINMLASYPT 232
RN N CGI +SYPT
Sbjct: 321 ARNQNNQ---CGIATASSYPT 338
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 186 bits (471), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 102/206 (49%), Positives = 127/206 (61%), Gaps = 6/206 (2%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
R++ SC G+CWAFSA +EGINKI TG LV LSEQEL+DC+R + GC GG YA
Sbjct: 149 RHQGSC----GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYA 203
Query: 89 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 148
++V KN GI YPY+ + G C +++ IV G V NNE LL A+ QPV
Sbjct: 204 LEYVAKN-GIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPV 262
Query: 149 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 208
SV + R FQLY GIF GPC T +DHAV VGY G Y +IKNSWG +WG GY
Sbjct: 263 SVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGY 322
Query: 209 MHMQRNTGNSLGICGINMLASYPTKT 234
+ ++R GNS G+CG+ + YPTK
Sbjct: 323 IRIKRAPGNSPGVCGLYKSSYYPTKN 348
>sp|P84346|MEX1_JACME Mexicain OS=Jacaratia mexicana PE=1 SV=1
Length = 214
Score = 184 bits (468), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 97/206 (47%), Positives = 130/206 (63%), Gaps = 12/206 (5%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDY 87
+N++ C G+CWAFS IEGINKI+TG L+SLSEQEL+DC+ RS+ GC GG
Sbjct: 17 KNQNPC----GSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYRSH--GCDGGYQTP 70
Query: 88 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 147
+ Q+V+ N G+ TE++YPY + G+C + V I GYK VP N+E L+QA+ QP
Sbjct: 71 SLQYVVDN-GVHTEREYPYEKKQGRCRAKDKKGPKVYITGYKYVPANDEISLIQAIANQP 129
Query: 148 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNG 207
VSV R FQ Y GI+ GPC T+ DHAV VGY G Y ++KNSWG +WG G
Sbjct: 130 VSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGY----GKTYLLLKNSWGPNWGEKG 185
Query: 208 YMHMQRNTGNSLGICGINMLASYPTK 233
Y+ ++R +G S G CG+ + +P K
Sbjct: 186 YIRIKRASGRSKGTCGVYTSSFFPIK 211
>sp|P84347|MEX2_JACME Chymomexicain OS=Jacaratia mexicana PE=1 SV=1
Length = 215
Score = 184 bits (467), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 96/205 (46%), Positives = 125/205 (60%), Gaps = 9/205 (4%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
+N++ C G+CWAFS +EGINKI TG L+SLSEQEL+DCDR + GC GG +
Sbjct: 17 KNQNPC----GSCWAFSTVATVEGINKIRTGKLISLSEQELLDCDRR-SHGCKGGYQTGS 71
Query: 89 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 148
Q+V N G+ TEK+YPY + G+C ++ V I GYK VP N+E L+Q + QPV
Sbjct: 72 IQYVADNGGVHTEKEYPYEKKQGKCRAKEKKGTKVQITGYKRVPANDEISLIQGIGNQPV 131
Query: 149 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 208
SV RAFQLY GIF GPC DHAV +GY +D KNSWG +WG GY
Sbjct: 132 SVLHESKGRAFQLYKGGIFNGPCGYKNDHAVTAIGYGKAQLLD----KNSWGPNWGEKGY 187
Query: 209 MHMQRNTGNSLGICGINMLASYPTK 233
+ ++R +G S G CG+ + +P K
Sbjct: 188 IKIKRASGKSEGTCGVYKSSYFPIK 212
>sp|P83443|MDO1_PSEMR Macrodontain-1 OS=Pseudananas macrodontes PE=1 SV=1
Length = 213
Score = 179 bits (455), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 90/206 (43%), Positives = 129/206 (62%), Gaps = 10/206 (4%)
Query: 27 QFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 86
+ +N+ C G CWAF+A +EGI KI G+LV LSEQE++DC SY GC GG ++
Sbjct: 16 EVKNQGPC----GGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAVSY--GCKGGWVN 69
Query: 87 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 146
AY F+I N+G+ T+++YPYR G CN + I GY V N+E ++ AV Q
Sbjct: 70 RAYDFIISNNGVTTDENYPYRAYQGTCNANYF-PNSAYITGYSYVRRNDESHMMYAVSNQ 128
Query: 147 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMN 206
P++ I S FQ Y G+++GPC SL+HA+ I+GY ++ YWI++NSWG SWG
Sbjct: 129 PIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYGRDS---YWIVRNSWGSSWGQG 185
Query: 207 GYMHMQRNTGNSLGICGINMLASYPT 232
GY+ ++R+ +S G+CGI M +PT
Sbjct: 186 GYVRIRRDVSHSGGVCGIAMSPLFPT 211
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 179 bits (454), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 130/200 (65%), Gaps = 13/200 (6%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 97
G+CWAFS+TGA+EG + +G LVSLSEQ L+DC Y N+GC GGLMD A++++ N G
Sbjct: 176 GSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 235
Query: 98 IDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAV-VAQPVSVGICG 154
IDTEK YPY C+ N+ V T G+ D+P+ +EK++ +AV PVSV I
Sbjct: 236 IDTEKSYPYEAIDDSCH---FNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDA 292
Query: 155 SERAFQLYSSGIFTGPC--STSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHM 211
S +FQ YS G++ P + +LDH VL+VG+ + E+G DYW++KNSWG +WG G++ M
Sbjct: 293 SHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKM 352
Query: 212 QRNTGNSLGICGINMLASYP 231
RN N CGI +SYP
Sbjct: 353 LRNKENQ---CGIASASSYP 369
>sp|P09648|CATL1_CHICK Cathepsin L1 (Fragments) OS=Gallus gallus GN=CTSL1 PE=1 SV=1
Length = 218
Score = 179 bits (453), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 94/197 (47%), Positives = 122/197 (61%), Gaps = 7/197 (3%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 97
G+CWAFS TGA+EG + G LVSLSEQ L+DC R N GC GGLMD A+Q+V N G
Sbjct: 23 GSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGG 82
Query: 98 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSE 156
ID+E+ YPY + + + K + G+ D+P+ +E+ L++AV + PVSV I
Sbjct: 83 IDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGH 142
Query: 157 RAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 214
+FQ Y SGI+ P S LDH VL+VGY E G YWI+KNSWG WG GY++M ++
Sbjct: 143 SSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGGKKYWIVKNSWGEKWGDKGYIYMAKD 202
Query: 215 TGNSLGICGINMLASYP 231
N CGI ASYP
Sbjct: 203 RKNH---CGIATAASYP 216
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
Length = 344
Score = 176 bits (446), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 102/224 (45%), Positives = 130/224 (58%), Gaps = 30/224 (13%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
+N+ C G CW+FS TG+ EG + G LVSLSEQ LIDC NSGC GGLM YA
Sbjct: 128 KNQGQC----GGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYA 182
Query: 89 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 148
++++I N+GIDTE YPY+ + G+C + N T+ YK V +E L AV PV
Sbjct: 183 FEYIINNNGIDTESSYPYKAENGKCEYKSENSG-ATLSSYKTVTAGSESSLESAVNVNPV 241
Query: 149 SVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGV---------------- 190
SV I S ++FQLY+SGI+ P S +LDH VL VGY S +G
Sbjct: 242 SVAIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSAS 301
Query: 191 ---DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 231
+YWI+KNSWG SWG+ GY+ M RN N+ CGI AS+P
Sbjct: 302 SSNEYWIVKNSWGTSWGIEGYILMSRNRDNN---CGIASSASFP 342
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 176 bits (445), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 96/197 (48%), Positives = 127/197 (64%), Gaps = 8/197 (4%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN-SGCGGGLMDYAYQFVIKNHG 97
G+CWAFS TG++EG + + TGSL+SL+EQ+L+DC R Y GC GG M+ A+ ++ N+G
Sbjct: 129 GSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNG 188
Query: 98 IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSE 156
IDTE YPY + G C + N T G+ ++ +E L QAV P+SV I +
Sbjct: 189 IDTEAAYPYEARDGSC-RFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAH 247
Query: 157 RAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 214
+FQ YSSG++ P CS S LDHAVL VGY SE G D+W++KNSW SWG GY+ M RN
Sbjct: 248 SSFQFYSSGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRN 307
Query: 215 TGNSLGICGINMLASYP 231
N+ CGI +ASYP
Sbjct: 308 RNNN---CGIATVASYP 321
>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
Length = 331
Score = 174 bits (441), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 93/197 (47%), Positives = 125/197 (63%), Gaps = 8/197 (4%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDC--DRSYNSGCGGGLMDYAYQFVIKNH 96
GACWAFSA GA+E K+ TG LVSLS Q L+DC ++ N GC GG M A+Q++I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNN 196
Query: 97 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGS 155
GID+E YPY+ G+C R T Y ++P +E L +AV + PVSV I S
Sbjct: 197 GIDSEASYPYKAMNGKCRYDSKKR-AATCSKYTELPFGSEDALKEAVANKGPVSVAIDAS 255
Query: 156 ERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 214
+F LY SG++ P C+ +++H VL+VGY + NG DYW++KNSWG ++G GY+ M RN
Sbjct: 256 HYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARN 315
Query: 215 TGNSLGICGINMLASYP 231
+GN CGI SYP
Sbjct: 316 SGNH---CGIASYPSYP 329
>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
Length = 331
Score = 174 bits (440), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 92/197 (46%), Positives = 126/197 (63%), Gaps = 8/197 (4%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY--NSGCGGGLMDYAYQFVIKNH 96
G+CWAFSA GA+E K+ TG LVSLS Q L+DC + N GC GG M A+Q++I N+
Sbjct: 137 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNN 196
Query: 97 GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGS 155
GID+E YPY+ G+C NR T Y ++P +E+ L +AV + PVSVGI S
Sbjct: 197 GIDSEASYPYKAMDGKCQYDVKNR-AATCSRYIELPFGSEEALKEAVANKGPVSVGIDAS 255
Query: 156 ERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 214
+F LY +G++ P C+ +++H VL+VGY + +G DYW++KNSWG +G GY+ M RN
Sbjct: 256 HSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARN 315
Query: 215 TGNSLGICGINMLASYP 231
+GN CGI SYP
Sbjct: 316 SGNH---CGIANYPSYP 329
>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
Length = 329
Score = 172 bits (437), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 99/206 (48%), Positives = 126/206 (61%), Gaps = 12/206 (5%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
+N+ C G+CWAFS+ GA+EG K TG L++LS Q L+DC S N GCGGG M A
Sbjct: 131 KNQGQC----GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNA 185
Query: 89 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV-AQP 147
+Q+V KN GID+E YPY GQ C + GY+++PE NEK L +AV P
Sbjct: 186 FQYVQKNRGIDSEDAYPYVGQEESCMYNPTGK-AAKCRGYREIPEGNEKALKRAVARVGP 244
Query: 148 VSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 205
VSV I S +FQ YS G++ S +L+HAVL VGY + G +WIIKNSWG +WG
Sbjct: 245 VSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGN 304
Query: 206 NGYMHMQRNTGNSLGICGINMLASYP 231
GY+ M RN N+ CGI LAS+P
Sbjct: 305 KGYILMARNKNNA---CGIANLASFP 327
>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
Length = 329
Score = 172 bits (437), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 99/206 (48%), Positives = 126/206 (61%), Gaps = 12/206 (5%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
+N+ C G+CWAFS+ GA+EG K TG L++LS Q L+DC S N GCGGG M A
Sbjct: 131 KNQGQC----GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNA 185
Query: 89 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV-AQP 147
+Q+V KN GID+E YPY GQ C + GY+++PE NEK L +AV P
Sbjct: 186 FQYVQKNRGIDSEDAYPYVGQEESCMYNPTGK-AAKCRGYREIPEGNEKALKRAVARVGP 244
Query: 148 VSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 205
VSV I S +FQ YS G++ S +L+HAVL VGY + G +WIIKNSWG +WG
Sbjct: 245 VSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGN 304
Query: 206 NGYMHMQRNTGNSLGICGINMLASYP 231
GY+ M RN N+ CGI LAS+P
Sbjct: 305 KGYILMARNKNNA---CGIANLASFP 327
>sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1
Length = 329
Score = 172 bits (437), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 99/206 (48%), Positives = 126/206 (61%), Gaps = 12/206 (5%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
+N+ C G+CWAFS+ GA+EG K TG L++LS Q L+DC S N GCGGG M A
Sbjct: 131 KNQGQC----GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNA 185
Query: 89 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV-AQP 147
+Q+V KN GID+E YPY GQ C + GY+++PE NEK L +AV P
Sbjct: 186 FQYVQKNRGIDSEDAYPYVGQEESCMYNPTGK-AAKCRGYREIPEGNEKALKRAVARVGP 244
Query: 148 VSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 205
VSV I S +FQ YS G++ S +L+HAVL VGY + G +WIIKNSWG +WG
Sbjct: 245 VSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGN 304
Query: 206 NGYMHMQRNTGNSLGICGINMLASYP 231
GY+ M RN N+ CGI LAS+P
Sbjct: 305 KGYILMARNKNNA---CGIANLASFP 327
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 172 bits (436), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 97/202 (48%), Positives = 131/202 (64%), Gaps = 9/202 (4%)
Query: 39 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR-SYNSGCGGGLMDYAYQFVIKNHG 97
G+CWAF+ATGA+EGIN+I TG LVSLSEQELIDCDR + N GC GG +A++F+ +N G
Sbjct: 150 GSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGG 209
Query: 98 IDTEKDYPYRGQ---AGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 154
I +++ Y Y G+ A + + K R +VTI+G++ VP N+E L +AV QP+SV I
Sbjct: 210 IVSDEVYGYTGEDTAACKAIEMKTTR-VVTINGHEVVPVNDEMSLKKAVAYQPISVMISA 268
Query: 155 SERAFQLYSSGIFTGPCSTSL-DHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQ 212
+ Y SG++ G CS DH VLIVGY S + DYW+I+NSWG WG GY+ +Q
Sbjct: 269 AN--MSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQ 326
Query: 213 RNTGNSLGICGINMLASYPTKT 234
RN G C + + YP K+
Sbjct: 327 RNFHEPTGKCAVAVAPVYPIKS 348
>sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1
Length = 330
Score = 172 bits (436), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 99/206 (48%), Positives = 126/206 (61%), Gaps = 12/206 (5%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
+N+ C G+CWAFS+ GA+EG K TG L++LS Q L+DC S N GCGGG M A
Sbjct: 132 KNQGQC----GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNA 186
Query: 89 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV-AQP 147
+Q+V KN GID+E YPY GQ C + GY+++PE NEK L +AV P
Sbjct: 187 FQYVQKNRGIDSEDAYPYVGQDENCMYNPTGK-AAKCRGYREIPEGNEKALKRAVARVGP 245
Query: 148 VSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 205
VSV I S +FQ YS G++ S +L+HAVL VGY + G +WIIKNSWG +WG
Sbjct: 246 VSVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWGN 305
Query: 206 NGYMHMQRNTGNSLGICGINMLASYP 231
GY+ M RN N+ CGI LAS+P
Sbjct: 306 KGYILMARNKNNA---CGIANLASFP 328
>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
Length = 334
Score = 172 bits (436), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 100/212 (47%), Positives = 125/212 (58%), Gaps = 16/212 (7%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDY 87
+N+ C G+CWAFSATGA+EG TG LVSLSEQ L+DC R N GC GGLMD
Sbjct: 130 KNQGQC----GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDN 185
Query: 88 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-Q 146
A+Q+V N G+DTE+ YPY G+ K G+ D+P+ EK L++AV
Sbjct: 186 AFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDIPQ-REKALMKAVATVG 244
Query: 147 PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE----NGVDYWIIKNSWG 200
P+SV I +FQ Y SGI+ P S LDH VL+VGY E N +WI+KNSWG
Sbjct: 245 PISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWG 304
Query: 201 RSWGMNGYMHMQRNTGNSLGICGINMLASYPT 232
WG NGY+ M ++ N CGI+ ASYPT
Sbjct: 305 PEWGWNGYVKMAKDQNNH---CGISTAASYPT 333
>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
Length = 330
Score = 171 bits (434), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 98/206 (47%), Positives = 126/206 (61%), Gaps = 12/206 (5%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
+N+ C G+CWAFS+ GA+EG K TG L++LS Q L+DC S N GCGGG M A
Sbjct: 132 KNQGQC----GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNA 186
Query: 89 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV-AQP 147
+Q+V KN GID+E YPY GQ C + GY+++PE NEK L +AV P
Sbjct: 187 FQYVQKNRGIDSEDAYPYVGQDESCMYNPTGK-AAKCRGYREIPEGNEKALKRAVARVGP 245
Query: 148 VSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 205
+SV I S +FQ YS G++ S +L+HAVL VGY + G +WIIKNSWG +WG
Sbjct: 246 ISVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGN 305
Query: 206 NGYMHMQRNTGNSLGICGINMLASYP 231
GY+ M RN N+ CGI LAS+P
Sbjct: 306 KGYILMARNKNNA---CGIANLASFP 328
>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 171 bits (432), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 92/211 (43%), Positives = 130/211 (61%), Gaps = 17/211 (8%)
Query: 29 RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDY 87
+N+ C G+CWAFSA+G +EG + TG L+SLSEQ L+DC N GC GGLMD+
Sbjct: 130 KNQGQC----GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDF 185
Query: 88 AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-Q 146
A+Q++ +N G+D+E+ YPY + G C K + + G+ D+P+ EK L++AV
Sbjct: 186 AFQYIKENGGLDSEESYPYEAKDGSC-KYRAEYAVANDTGFVDIPQ-QEKALMKAVATVG 243
Query: 147 PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE----NGVDYWIIKNSWG 200
P+SV + S + Q YSSGI+ P S LDH VL+VGY E N YW++KNSWG
Sbjct: 244 PISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWG 303
Query: 201 RSWGMNGYMHMQRNTGNSLGICGINMLASYP 231
+ WGM+GY+ + ++ N CG+ ASYP
Sbjct: 304 KEWGMDGYIKIAKDRNNH---CGLATAASYP 331
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.321 0.137 0.453
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 139,470,684
Number of Sequences: 539616
Number of extensions: 6100936
Number of successful extensions: 22821
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 223
Number of HSP's successfully gapped in prelim test: 56
Number of HSP's that attempted gapping in prelim test: 21944
Number of HSP's gapped (non-prelim): 406
length of query: 341
length of database: 191,569,459
effective HSP length: 118
effective length of query: 223
effective length of database: 127,894,771
effective search space: 28520533933
effective search space used: 28520533933
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (28.1 bits)