BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 019447
         (341 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
           lycopersicum PE=2 SV=1
          Length = 346

 Score =  310 bits (794), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 158/304 (51%), Positives = 202/304 (66%), Gaps = 11/304 (3%)

Query: 24  LLIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGG 83
           +L+  +++ SC    G+CWAFSA  A+E IN IVTG+L+SLSEQEL+DCDRSYN GC GG
Sbjct: 29  VLVGVKDQGSC----GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSYNEGCDGG 84

Query: 84  LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 143
           LMDYA++FVIKN GIDTE+DYPY+ + G C++ + N  +V ID Y+DVP NNEK L +AV
Sbjct: 85  LMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNNEKALQKAV 144

Query: 144 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSW 203
             QPVS+ +    R FQ Y SGIFTG C T++DH V+I GY +ENG+DYWI++NSWG + 
Sbjct: 145 AHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGTENGMDYWIVRNSWGANC 204

Query: 204 GMNGYMHMQRNTGNSLGICGINMLASYPTKTG------QNPPPSPPPGPTRCSLLTYCAA 257
             NGY+ +QRN  +S G+CG+ +  SYP KTG         PPSP   PT C   + CA 
Sbjct: 205 RENGYLRVQRNVSSSSGLCGLAIEPSYPVKTGPNPPKPAPSPPSPVKPPTECDEYSQCAV 264

Query: 258 GETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAA 317
           G TCCC       C SW CC    A CC DH  CCP +YPIC+ VR    +   GN    
Sbjct: 265 GTTCCCILQFRRSCFSWGCCPLEGATCCEDHYSCCPHDYPICN-VRQGTCSMSKGNPLGV 323

Query: 318 EAIE 321
           +A++
Sbjct: 324 KAMK 327


>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
           SV=1
          Length = 462

 Score =  307 bits (787), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 149/275 (54%), Positives = 186/275 (67%), Gaps = 6/275 (2%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
           G+CWAFS  GA+EGIN+IVTG L++LSEQEL+DCD SYN GC GGLMDYA++F+IKN GI
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218

Query: 99  DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
           DT+KDYPY+G  G C++ + N  +VTID Y+DVP  +E+ L +AV  QP+S+ I    RA
Sbjct: 219 DTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRA 278

Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
           FQLY SGIF G C T LDH V+ VGY +ENG DYWI++NSWG+SWG +GY+ M RN  +S
Sbjct: 279 FQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASS 338

Query: 219 LGICGINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICL 272
            G CGI +  SYP K G+        PPSP   PT+C     C    TCCC       C 
Sbjct: 339 SGKCGIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEYGKYCF 398

Query: 273 SWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 307
           +W CC   +A CC D+  CCP  YP+CD  +  CL
Sbjct: 399 AWGCCPLEAATCCDDNYSCCPHEYPVCDLDQGTCL 433


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
           PE=1 SV=2
          Length = 458

 Score =  305 bits (782), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 152/275 (55%), Positives = 182/275 (66%), Gaps = 6/275 (2%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
           G+CWAFSA  A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA+ F+I N GI
Sbjct: 151 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGI 210

Query: 99  DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
           DTE DYPY+G+  +C+  + N  +VTID Y+DV  N+E  L +AV  QPVSV I    RA
Sbjct: 211 DTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRA 270

Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
           FQLYSSGIFTG C T+LDH V  VGY +ENG DYWI++NSWG+SWG +GY+ M+RN   S
Sbjct: 271 FQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKAS 330

Query: 219 LGICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICL 272
            G CGI +  SYP K G+NPP   P  P+       C     C    TCCC       C 
Sbjct: 331 SGKCGIAVEPSYPLKKGENPPNPGPTPPSPTPPPTVCDNYYTCPDSTTCCCIYEYGKYCY 390

Query: 273 SWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 307
           +W CC    A CC DH  CCP  YPIC+  +  CL
Sbjct: 391 AWGCCPLEGATCCDDHYSCCPHEYPICNVQQGTCL 425


>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
           PE=1 SV=2
          Length = 466

 Score =  294 bits (753), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 146/291 (50%), Positives = 189/291 (64%), Gaps = 17/291 (5%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDY 87
           +N+  C    G+CWAFSA   +E IN++VTG +++LSEQEL++C  +  NSGC GGLMD 
Sbjct: 157 KNQGQC----GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDD 212

Query: 88  AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 147
           A+ F+IKN GIDTE DYPY+   G+C+  + N  +V+IDG++DVP+N+EK L +AV  QP
Sbjct: 213 AFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQP 272

Query: 148 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNG 207
           VSV I    R FQLY SG+F+G C TSLDH V+ VGY ++NG DYWI++NSWG  WG +G
Sbjct: 273 VSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESG 332

Query: 208 YMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTR------------CSLLTYC 255
           Y+ M+RN   + G CGI M+ASYPTK+G NPP   P  PT             C     C
Sbjct: 333 YVRMERNINVTTGKCGIAMMASYPTKSGANPPKPSPTPPTPPTPPPPSAPDHVCDDNFSC 392

Query: 256 AAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 306
            AG TCCC      +CL W CC    A CC DH  CCP +YP+C++    C
Sbjct: 393 PAGSTCCCAFGFRNLCLVWGCCPVEGATCCKDHASCCPPDYPVCNTRAGTC 443


>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
           SV=2
          Length = 490

 Score =  291 bits (745), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 146/290 (50%), Positives = 185/290 (63%), Gaps = 11/290 (3%)

Query: 24  LLIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGG 82
           ++   +N+  C    G+CWAFSA  A+EGINKIVTG LVSLSEQEL++C R+  NSGC G
Sbjct: 167 VVAPVKNQGQC----GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNG 222

Query: 83  GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA 142
           G+MD A+ F+ +N G+DTE+DYPY    G+CN  K +R +V+IDG++DVPEN+E  L +A
Sbjct: 223 GIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSRKVVSIDGFEDVPENDELSLQKA 282

Query: 143 VVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY--DSENGVDYWIIKNSWG 200
           V  QPVSV I    R FQLY SG+FTG C T+LDH V+ VGY  D+  G  YW ++NSWG
Sbjct: 283 VAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWG 342

Query: 201 RSWGMNGYMHMQRNTGNSLGICGINMLASYP----TKTGQNPPPSPPPGPTRCSLLTYCA 256
             WG NGY+ M+RN     G CGI M+ASYP         +PP   P  P +C   + C 
Sbjct: 343 PDWGENGYIRMERNVTARTGKCGIAMMASYPIKKGPNPKPSPPSPAPSPPQQCDRYSKCP 402

Query: 257 AGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 306
           AG TCCC   I   C+ W CC    A CC DH  CCP  YP+C++    C
Sbjct: 403 AGTTCCCNYGIRNHCIVWGCCPVEGATCCKDHSTCCPKEYPVCNAKARTC 452


>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
          Length = 328

 Score =  275 bits (703), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 127/200 (63%), Positives = 151/200 (75%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
           G+CWAFS   A+EGINKIVTG LVSLSEQEL+DCD+SYN GC GGLMDYA+QF++KN G+
Sbjct: 122 GSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGL 181

Query: 99  DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
           +TEKDYPY G  G+CN    N  +VTIDGY+DVP  +E  L +AV  QPVSV I    RA
Sbjct: 182 NTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRA 241

Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
           FQ Y SGIFTG C T++DHAV+ VGY SENGVDYWI++NSWG  WG +GY+ M+RN  + 
Sbjct: 242 FQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVASK 301

Query: 219 LGICGINMLASYPTKTGQNP 238
            G CGI + ASYP K   NP
Sbjct: 302 SGKCGIAIEASYPVKYSPNP 321


>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
           GN=GCP1 PE=2 SV=2
          Length = 376

 Score =  269 bits (687), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 127/201 (63%), Positives = 151/201 (75%), Gaps = 1/201 (0%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
           G+CWAFS T A+EGINKIVTG L+SLSEQEL+DCD+SYN GC GGLMDYA+QF++KN G+
Sbjct: 167 GSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGL 226

Query: 99  DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
           +TEKDYPYRG  G+CN    N  +V+IDGY+DVP  +E  L +A+  QPVSV I    R 
Sbjct: 227 NTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRI 286

Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
           FQ Y SGIFTG C T+LDHAV+ VGY SENGVDYWI++NSWG  WG  GY+ M+RN   S
Sbjct: 287 FQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMERNLAAS 346

Query: 219 L-GICGINMLASYPTKTGQNP 238
             G CGI + ASYP K   NP
Sbjct: 347 KSGKCGIAVEASYPVKYSPNP 367


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
           SV=2
          Length = 356

 Score =  254 bits (650), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 117/210 (55%), Positives = 150/210 (71%), Gaps = 4/210 (1%)

Query: 25  LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 84
           + + +N+ SC    G+CWAFS   A+EGINKIVTG+L +LSEQELIDCD +YN+GC GGL
Sbjct: 150 VAEVKNQGSC----GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGL 205

Query: 85  MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 144
           MDYA+++++KN G+  E+DYPY  + G C  QK     VTI+G++DVP N+EK LL+A+ 
Sbjct: 206 MDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALA 265

Query: 145 AQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 204
            QP+SV I  S R FQ YS G+F G C   LDH V  VGY S  G DY I+KNSWG  WG
Sbjct: 266 HQPLSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWG 325

Query: 205 MNGYMHMQRNTGNSLGICGINMLASYPTKT 234
             GY+ ++RNTG   G+CGIN +AS+PTKT
Sbjct: 326 EKGYIRLKRNTGKPEGLCGINKMASFPTKT 355


>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
           GN=At3g19400 PE=2 SV=1
          Length = 362

 Score =  253 bits (646), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 119/213 (55%), Positives = 159/213 (74%), Gaps = 7/213 (3%)

Query: 25  LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGG 83
           ++  +++ +C    G+CWAFSA GA+EGIN+I TG L+SLSEQEL+DCDR + N+GC GG
Sbjct: 142 VVSVKDQGNC----GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGG 197

Query: 84  LMDYAYQFVIKNHGIDTEKDYPYRGQA-GQCNKQKLNR-HIVTIDGYKDVPENNEKQLLQ 141
           +M+YA++F++KN GI+T++DYPY     G CN  K N   +VTIDGY+DVP ++EK L +
Sbjct: 198 IMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKK 257

Query: 142 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 201
           AV  QPVSV I  S +AFQLY SG+ TG C  SLDH V++VGY S +G DYWII+NSWG 
Sbjct: 258 AVAHQPVSVAIEASSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGL 317

Query: 202 SWGMNGYMHMQRNTGNSLGICGINMLASYPTKT 234
           +WG +GY+ +QRN  +  G CGI M+ SYPTK+
Sbjct: 318 NWGDSGYVKLQRNIDDPFGKCGIAMMPSYPTKS 350


>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
          Length = 360

 Score =  249 bits (637), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 120/204 (58%), Positives = 141/204 (69%), Gaps = 2/204 (0%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
           G+CWAFS   ++EGIN+I TG LVSLSEQEL+DCD SYN GC GGLMDYA++F+ KN GI
Sbjct: 152 GSCWAFSTIASVEGINQIKTGELVSLSEQELVDCDTSYNEGCNGGLMDYAFEFIQKN-GI 210

Query: 99  DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
            TE  YPY  Q G C    LN  +V+IDG++DVP NNE  L+QAV  QP+SV I  S   
Sbjct: 211 TTEDSYPYAEQDGTCASNLLNSPVVSIDGHQDVPANNENALMQAVANQPISVSIEASGYG 270

Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 217
           FQ YS G+FTG C T LDH V IVGY  + +G  YWI+KNSWG  WG +GY+ MQR   +
Sbjct: 271 FQFYSEGVFTGRCGTELDHGVAIVGYGATRDGTKYWIVKNSWGEEWGESGYIRMQRGISD 330

Query: 218 SLGICGINMLASYPTKTGQNPPPS 241
             G CGI M ASYP KT  NP  S
Sbjct: 331 KRGKCGIAMEASYPIKTSANPKNS 354


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
           SV=1
          Length = 355

 Score =  248 bits (633), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 114/196 (58%), Positives = 141/196 (71%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
           G+CWAFS   A+EGIN+I TG+L SLSEQELIDCD ++NSGC GGLMDYA+Q++I   G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218

Query: 99  DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
             E DYPY  + G C +QK +   VTI GY+DVPEN+++ L++A+  QPVSV I  S R 
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278

Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
           FQ Y  G+F G C T LDH V  VGY S  G DY I+KNSWG  WG  G++ M+RNTG  
Sbjct: 279 FQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKP 338

Query: 219 LGICGINMLASYPTKT 234
            G+CGIN +ASYPTKT
Sbjct: 339 EGLCGINKMASYPTKT 354


>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
           GN=CEP2 PE=2 SV=1
          Length = 361

 Score =  241 bits (614), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 117/227 (51%), Positives = 145/227 (63%), Gaps = 5/227 (2%)

Query: 25  LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 84
           + + +N+  C    G+CWAFS   A+EGINKI T  LVSLSEQEL+DCD   N GC GGL
Sbjct: 140 VTEIKNQGKC----GSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTKQNEGCNGGL 195

Query: 85  MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 144
           M+ A++F+ KN GI TE  YPY G  G+C+  K N  +VTIDG++DVPEN+E  LL+AV 
Sbjct: 196 MEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVA 255

Query: 145 AQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 204
            QPVSV I      FQ YS G+FTG C T L+H V  VGY SE G  YWI++NSWG  WG
Sbjct: 256 NQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGSERGKKYWIVRNSWGAEWG 315

Query: 205 MNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSL 251
             GY+ ++R      G CGI M ASYP K   +  P+P  G  +  L
Sbjct: 316 EGGYIKIEREIDEPEGRCGIAMEASYPIKL-SSSNPTPKDGDVKDEL 361


>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
          Length = 380

 Score =  237 bits (605), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 117/219 (53%), Positives = 147/219 (67%), Gaps = 6/219 (2%)

Query: 25  LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS-GCGGG 83
           ++  +++  C    G CWAFSA   +EGINKIVTG L+SLSEQELIDC R+ N+ GC GG
Sbjct: 139 VVDIKSQGEC----GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGG 194

Query: 84  LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 143
            +   +QF+I N GI+TE++YPY  Q G+CN    N   VTID Y++VP NNE  L  AV
Sbjct: 195 YITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNNEWALQTAV 254

Query: 144 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSW 203
             QPVSV +  +  AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI+KNSW  +W
Sbjct: 255 TYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIVKNSWDTTW 314

Query: 204 GMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 242
           G  GYM + RN G + G CGI  + SYP K      P P
Sbjct: 315 GEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNHPKP 352


>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
          Length = 380

 Score =  237 bits (604), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 117/219 (53%), Positives = 147/219 (67%), Gaps = 6/219 (2%)

Query: 25  LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS-GCGGG 83
           ++  +++  C    G CWAFSA   +EGINKIVTG L+SLSEQELIDC R+ N+ GC GG
Sbjct: 139 VVDIKSQGEC----GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGG 194

Query: 84  LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 143
            +   +QF+I N GI+TE++YPY  Q G+CN    N   VTID Y++VP NNE  L  AV
Sbjct: 195 YITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPYNNEWALQTAV 254

Query: 144 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSW 203
             QPVSV +  +  AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI+KNSW  +W
Sbjct: 255 TYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWIVKNSWDTTW 314

Query: 204 GMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 242
           G  GYM + RN G + G CGI  + SYP K      P P
Sbjct: 315 GEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNHPKP 352


>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
          Length = 362

 Score =  234 bits (597), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 109/209 (52%), Positives = 141/209 (67%), Gaps = 1/209 (0%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
           G+CWAFS   A+EGIN+I T  LVSLSEQEL+DCD+  N GC GGLM+ A++F+ +  GI
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209

Query: 99  DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
            TE +YPY  Q G C++ K+N   V+IDG+++VP N+E  LL+AV  QPVSV I      
Sbjct: 210 TTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSD 269

Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 217
           FQ YS G+FTG C+T L+H V IVGY +  +G +YWI++NSWG  WG  GY+ MQRN   
Sbjct: 270 FQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISK 329

Query: 218 SLGICGINMLASYPTKTGQNPPPSPPPGP 246
             G+CGI M+ASYP K   + P      P
Sbjct: 330 KEGLCGIAMMASYPIKNSSDNPTGSLSSP 358


>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
          Length = 362

 Score =  234 bits (597), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 108/202 (53%), Positives = 140/202 (69%), Gaps = 1/202 (0%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
           G+CWAFS   A+EGIN+I T  LV+LSEQEL+DCD+  N GC GGLM+ A++F+ +  GI
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209

Query: 99  DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
            TE +YPY+ Q G C+  K+N   V+IDG+++VP N+E  LL+AV  QPVSV I      
Sbjct: 210 TTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSD 269

Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 217
           FQ YS G+FTG CST L+H V IVGY +  +G +YWI++NSWG  WG +GY+ MQRN   
Sbjct: 270 FQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISK 329

Query: 218 SLGICGINMLASYPTKTGQNPP 239
             G+CGI ML SYP K   + P
Sbjct: 330 KEGLCGIAMLPSYPIKNSSDNP 351


>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
          Length = 373

 Score =  233 bits (594), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 112/208 (53%), Positives = 145/208 (69%), Gaps = 4/208 (1%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
           G+CWAFS   ++EGIN I TGSLVSLSEQELIDCD + N GC GGLMD A++++  N G+
Sbjct: 156 GSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGL 215

Query: 99  DTEKDYPYRGQAGQCNKQKLNRH---IVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 155
            TE  YPYR   G CN  +  ++   +V IDG++DVP N+E+ L +AV  QPVSV +  S
Sbjct: 216 ITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEAS 275

Query: 156 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRN 214
            +AF  YS G+FTG C T LDH V +VGY  +E+G  YW +KNSWG SWG  GY+ ++++
Sbjct: 276 GKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKD 335

Query: 215 TGNSLGICGINMLASYPTKTGQNPPPSP 242
           +G S G+CGI M ASYP KT   P P+P
Sbjct: 336 SGASGGLCGIAMEASYPVKTYSKPKPTP 363


>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
          Length = 221

 Score =  231 bits (589), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 111/213 (52%), Positives = 151/213 (70%), Gaps = 6/213 (2%)

Query: 25  LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 84
           ++  +N+  C    G+CWAFS   A+EGIN+IVTG L+SLSEQ+L+DC  + N GC GG 
Sbjct: 15  VVPVKNQGGC----GSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC-TTANHGCRGGW 69

Query: 85  MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 144
           M+ A+QF++ N GI++E+ YPYRGQ G CN   +N  +V+ID Y++VP +NE+ L +AV 
Sbjct: 70  MNPAFQFIVNNGGINSEETYPYRGQDGICNS-TVNAPVVSIDSYENVPSHNEQSLQKAVA 128

Query: 145 AQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 204
            QPVSV +  + R FQLY SGIFTG C+ S +HA+ +VGY +EN  D+WI+KNSWG++WG
Sbjct: 129 NQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIVKNSWGKNWG 188

Query: 205 MNGYMHMQRNTGNSLGICGINMLASYPTKTGQN 237
            +GY+  +RN  N  G CGI   ASYP K G N
Sbjct: 189 ESGYIRAERNIENPDGKCGITRFASYPVKKGTN 221


>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
          Length = 360

 Score =  230 bits (587), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 111/209 (53%), Positives = 135/209 (64%), Gaps = 1/209 (0%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
           G+CWAFS   A+EGIN+I T  LVSLSEQEL+DCD   N GC GGLMDYA++F+ +  GI
Sbjct: 148 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGI 207

Query: 99  DTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
            TE +YPY    G C+  K N   V+IDG+++VPEN+E  LL+AV  QPVSV I      
Sbjct: 208 TTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSD 267

Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 217
           FQ YS G+FTG C T LDH V IVGY +  +G  YW +KNSWG  WG  GY+ M+R   +
Sbjct: 268 FQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISD 327

Query: 218 SLGICGINMLASYPTKTGQNPPPSPPPGP 246
             G+CGI M ASYP K   N P      P
Sbjct: 328 KEGLCGIAMEASYPIKKSSNNPSGIKSSP 356


>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
           GN=At4g11310 PE=2 SV=1
          Length = 364

 Score =  230 bits (587), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 107/196 (54%), Positives = 143/196 (72%), Gaps = 2/196 (1%)

Query: 40  ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 99
           +CWAFS  GA+EG+NKIVTG LV+LSEQ+LI+C++  N+GCGGG ++ AY+F++KN G+ 
Sbjct: 160 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLG 218

Query: 100 TEKDYPYRGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
           T+ DYPY+   G C+ + K N   V IDGY+++P N+E  L++AV  QPV+  I  S R 
Sbjct: 219 TDNDYPYKAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSRE 278

Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
           FQLY SG+F G C T+L+H V++VGY +ENG DYW++KNS G +WG  GYM M RN  N 
Sbjct: 279 FQLYESGVFDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANP 338

Query: 219 LGICGINMLASYPTKT 234
            G+CGI M ASYP K 
Sbjct: 339 RGLCGIAMRASYPLKN 354


>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
          Length = 371

 Score =  228 bits (582), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 111/206 (53%), Positives = 143/206 (69%), Gaps = 4/206 (1%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGI 98
           G+CWAFS   ++EGIN I TGSLVSLSEQELIDCD + N GC GGLMD A++++  N G+
Sbjct: 156 GSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGL 215

Query: 99  DTEKDYPYRGQAGQCNKQKLNRH---IVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 155
            TE  YPYR   G CN  +  ++   +V IDG++DVP N+E+ L +AV  QPVSV +  S
Sbjct: 216 ITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEAS 275

Query: 156 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWIIKNSWGRSWGMNGYMHMQRN 214
            +AF  YS G+FTG C T LDH V +VGY  +E+G  YW +KNSWG SWG  GY+ ++++
Sbjct: 276 GKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKD 335

Query: 215 TGNSLGICGINMLASYPTKTGQNPPP 240
           +G S G+CGI M ASYP KT   P P
Sbjct: 336 SGASGGLCGIAMEASYPVKTYNKPMP 361


>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
           GN=At4g11320 PE=2 SV=1
          Length = 371

 Score =  226 bits (577), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 105/195 (53%), Positives = 141/195 (72%), Gaps = 2/195 (1%)

Query: 40  ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 99
           +CWAFS  GA+EG+NKIVTG LV+LSEQ+LI+C++  N+GCGGG ++ AY+F++ N G+ 
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLG 225

Query: 100 TEKDYPYRGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERA 158
           T+ DYPY+   G C  + K +   V IDGY+++P N+E  L++AV  QPV+  +  S R 
Sbjct: 226 TDNDYPYKALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSRE 285

Query: 159 FQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 218
           FQLY SG+F G C T+L+H V++VGY +ENG DYWI+KNS G +WG  GYM M RN  N 
Sbjct: 286 FQLYESGVFDGTCGTNLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANP 345

Query: 219 LGICGINMLASYPTK 233
            G+CGI M ASYP K
Sbjct: 346 RGLCGIAMRASYPLK 360


>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
           GN=CEP3 PE=2 SV=1
          Length = 364

 Score =  226 bits (576), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 110/216 (50%), Positives = 141/216 (65%), Gaps = 6/216 (2%)

Query: 25  LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 84
           + + +N+  C    G+CWAFS   A+EGINKI T  LVSLSEQEL+DCD   N GC GGL
Sbjct: 138 VTEVKNQQDC----GSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGL 193

Query: 85  MDYAYQFVIKNHGIDTEKDYPYRGQAGQ-CNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 143
           M+ A++F+  N GI TE+ YPY     Q C    +    VTIDG++ VPEN+E++LL+AV
Sbjct: 194 MEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAV 253

Query: 144 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSENGVDYWIIKNSWGRS 202
             QPVSV I      FQLYS G+F G C T L+H V+IVGY +++NG  YWI++NSWG  
Sbjct: 254 AHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPE 313

Query: 203 WGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP 238
           WG  GY+ ++R    + G CGI M ASYPTK    P
Sbjct: 314 WGEGGYVRIERGISENEGRCGIAMEASYPTKLSSTP 349


>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
          Length = 215

 Score =  224 bits (570), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 107/206 (51%), Positives = 146/206 (70%), Gaps = 7/206 (3%)

Query: 28  FRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDY 87
            +N+  C    G+CWAFSA  A+E INKI TG L+SLSEQEL+DCD + + GC GG M+ 
Sbjct: 16  IKNQKQC----GSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTA-SHGCNGGWMNN 70

Query: 88  AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 147
           A+Q++I N GIDT+++YPY    G C   +L   +V+I+G++ V  NNE  L  AV +QP
Sbjct: 71  AFQYIITNGGIDTQQNYPYSAVQGSCKPYRL--RVVSINGFQRVTRNNESALQSAVASQP 128

Query: 148 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNG 207
           VSV +  +   FQ YSSGIFTGPC T+ +H V+IVGY +++G +YWI++NSWG++WG  G
Sbjct: 129 VSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWIVRNSWGQNWGNQG 188

Query: 208 YMHMQRNTGNSLGICGINMLASYPTK 233
           Y+ M+RN  +S G+CGI  L SYPTK
Sbjct: 189 YIWMERNVASSAGLCGIAQLPSYPTK 214


>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
           GN=CEP1 PE=2 SV=1
          Length = 361

 Score =  221 bits (564), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 105/212 (49%), Positives = 137/212 (64%), Gaps = 5/212 (2%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
           +N+  C    G+CWAFS   A+EGIN+I T  L SLSEQEL+DCD + N GC GGLMD A
Sbjct: 142 KNQGQC----GSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLA 197

Query: 89  YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 148
           ++F+ +  G+ +E  YPY+     C+  K N  +V+IDG++DVP+N+E  L++AV  QPV
Sbjct: 198 FEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPV 257

Query: 149 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNG 207
           SV I      FQ YS G+FTG C T L+H V +VGY +  +G  YWI+KNSWG  WG  G
Sbjct: 258 SVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKG 317

Query: 208 YMHMQRNTGNSLGICGINMLASYPTKTGQNPP 239
           Y+ MQR   +  G+CGI M ASYP K     P
Sbjct: 318 YIRMQRGIRHKEGLCGIAMEASYPLKNSNTNP 349


>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
          Length = 221

 Score =  211 bits (538), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 103/209 (49%), Positives = 143/209 (68%), Gaps = 6/209 (2%)

Query: 25  LIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGL 84
           ++  +N+  C    G+CWAF A  A+EGIN+IVTG L+SLSEQ+L+DC  + N GC GG 
Sbjct: 15  VVPVKNQGGC----GSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCS-TRNHGCEGGW 69

Query: 85  MDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV 144
              A+Q++I N GI++E+ YPY G  G C+  K N H+V+ID Y++VP N+EK L +AV 
Sbjct: 70  PYRAFQYIINNGGINSEEHYPYTGTNGTCDT-KENAHVVSIDSYRNVPSNDEKSLQKAVA 128

Query: 145 AQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWG 204
            QPVSV +  + R FQLY +GIFTG C+ S +H   + G ++EN  DYW +KNSWG++WG
Sbjct: 129 NQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTVGGRETENDKDYWTVKNSWGKNWG 188

Query: 205 MNGYMHMQRNTGNSLGICGINMLASYPTK 233
            +GY+ ++RN   S G CGI +  SYP K
Sbjct: 189 ESGYIRVERNIAESSGKCGIAISPSYPIK 217


>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
          Length = 208

 Score =  202 bits (513), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 106/206 (51%), Positives = 133/206 (64%), Gaps = 14/206 (6%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
           +N+ SC    G+CWAFS    +E IN+I TG+L+SLSEQEL+DCD+  N GC GG   +A
Sbjct: 17  KNQGSC----GSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK-NHGCLGGAFVFA 71

Query: 89  YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 148
           YQ++I N GIDT+ +YPY+   G C   +    +V+IDGY  VP  NE  L QAV  QP 
Sbjct: 72  YQYIINNGGIDTQANYPYKAVQGPC---QAASKVVSIDGYNGVPFCNEXALKQAVAVQPS 128

Query: 149 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 208
           +V I  S   FQ YSSGIF+GPC T L+H V IVGY +    +YWI++NSWGR WG  GY
Sbjct: 129 TVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQA----NYWIVRNSWGRYWGEKGY 184

Query: 209 MHMQRNTGNSLGICGINMLASYPTKT 234
           + M R  G   G+CGI  L  YPTK 
Sbjct: 185 IRMLRVGG--CGLCGIARLPYYPTKA 208


>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
          Length = 352

 Score =  193 bits (490), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 99/205 (48%), Positives = 131/205 (63%), Gaps = 6/205 (2%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
           +N+ +C    G+CWAFS    +EGINKIVTG+L+ LSEQEL+DCD+ ++ GC GG    +
Sbjct: 151 KNQGAC----GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDK-HSYGCKGGYQTTS 205

Query: 89  YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 148
            Q+V  N+G+ T K YPY+ +  +C         V I GYK VP N E   L A+  QP+
Sbjct: 206 LQYVA-NNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPL 264

Query: 149 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 208
           SV +    + FQLY SG+F GPC T LDHAV  VGY + +G +Y IIKNSWG +WG  GY
Sbjct: 265 SVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGY 324

Query: 209 MHMQRNTGNSLGICGINMLASYPTK 233
           M ++R +GNS G CG+   + YP K
Sbjct: 325 MRLKRQSGNSQGTCGVYKSSYYPFK 349


>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
          Length = 351

 Score =  190 bits (482), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 98/213 (46%), Positives = 138/213 (64%), Gaps = 10/213 (4%)

Query: 27  QFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 86
           + +N++ C    G+CW+F+A   +EGI KI TG LVSLSEQE++DC  SY  GC GG ++
Sbjct: 137 EVKNQNPC----GSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSY--GCKGGWVN 190

Query: 87  YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 146
            AY F+I N+G+ TE++YPY    G CN      +   I GY  V  N+E+ ++ AV  Q
Sbjct: 191 KAYDFIISNNGVTTEENYPYLAYQGTCNANSF-PNSAYITGYSYVRRNDERSMMYAVSNQ 249

Query: 147 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSEN-GVDYWIIKNSWGRSWGM 205
           P++  I  SE  FQ Y+ G+F+GPC TSL+HA+ I+GY  ++ G  YWI++NSWG SWG 
Sbjct: 250 PIAALIDASEN-FQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGE 308

Query: 206 NGYMHMQRNTGNSLGICGINMLASYPT-KTGQN 237
            GY+ M R   +S G+CGI M   +PT ++G N
Sbjct: 309 GGYVRMARGVSSSSGVCGIAMAPLFPTLQSGAN 341


>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
          Length = 337

 Score =  188 bits (478), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 92/196 (46%), Positives = 131/196 (66%), Gaps = 6/196 (3%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 97
           G+C++FS TG++EG+  I TG LVSLSEQ ++DC  S+ N GC GGLM  A++++IKN+G
Sbjct: 143 GSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNG 202

Query: 98  IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSER 157
           +++E+ YPY  +     K +       I  YK++   +E  L  A++  PVSV I  S  
Sbjct: 203 LNSEEQYPYEMKVNDECKFQEGSVAAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHN 262

Query: 158 AFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 215
           +FQLY++G++  P   S  LDH VL VG  ++NG DY+I+KNSWG SWG+NGY+HM RN 
Sbjct: 263 SFQLYTAGVYYEPACSSEDLDHGVLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHMARNK 322

Query: 216 GNSLGICGINMLASYP 231
            N+   CGI+ +ASYP
Sbjct: 323 DNN---CGISTMASYP 335


>sp|P22895|P34_SOYBN P34 probable thiol protease OS=Glycine max PE=1 SV=1
          Length = 379

 Score =  187 bits (474), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 102/220 (46%), Positives = 139/220 (63%), Gaps = 18/220 (8%)

Query: 24  LLIQFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGG 83
           ++ Q + +  C    G  WAFSATGAIE  + I TG LVSLSEQEL+DC    + G   G
Sbjct: 146 VITQVKYQGGC----GRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGSYNG 200

Query: 84  LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV-------PENNE 136
               ++++V+++ GI T+ DYPYR + G+C   K+ +  VTIDGY+ +           E
Sbjct: 201 WQYQSFEWVLEHGGIATDDDYPYRAKEGRCKANKI-QDKVTIDGYETLIMSDESTESETE 259

Query: 137 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTS---LDHAVLIVGYDSENGVDYW 193
           +  L A++ QP+SV I   +  F LY+ GI+ G   TS   ++H VL+VGY S +GVDYW
Sbjct: 260 QAFLSAILEQPISVSIDAKD--FHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYW 317

Query: 194 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 233
           I KNSWG  WG +GY+ +QRNTGN LG+CG+N  ASYPTK
Sbjct: 318 IAKNSWGFDWGEDGYIWIQRNTGNLLGVCGMNYFASYPTK 357


>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
          Length = 339

 Score =  186 bits (473), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 99/201 (49%), Positives = 132/201 (65%), Gaps = 13/201 (6%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 97
           G+CWAFS+TGA+EG +    G LVSLSEQ L+DC   Y N+GC GGLMD A++++  N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203

Query: 98  IDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAVVAQ-PVSVGICG 154
           IDTEK YPY G    C+    N+  +  T  G+ D+PE +E+++ +AV    PVSV I  
Sbjct: 204 IDTEKSYPYEGIDDSCH---FNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDA 260

Query: 155 SERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHM 211
           S  +FQLYS G++  P     +LDH VL+VGY + E+G+DYW++KNSWG +WG  GY+ M
Sbjct: 261 SHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKM 320

Query: 212 QRNTGNSLGICGINMLASYPT 232
            RN  N    CGI   +SYPT
Sbjct: 321 ARNQNNQ---CGIATASSYPT 338


>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
          Length = 348

 Score =  186 bits (471), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 102/206 (49%), Positives = 127/206 (61%), Gaps = 6/206 (2%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
           R++ SC    G+CWAFSA   +EGINKI TG LV LSEQEL+DC+R  + GC GG   YA
Sbjct: 149 RHQGSC----GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYA 203

Query: 89  YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 148
            ++V KN GI     YPY+ + G C  +++   IV   G   V  NNE  LL A+  QPV
Sbjct: 204 LEYVAKN-GIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPV 262

Query: 149 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 208
           SV +    R FQLY  GIF GPC T +DHAV  VGY    G  Y +IKNSWG +WG  GY
Sbjct: 263 SVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGY 322

Query: 209 MHMQRNTGNSLGICGINMLASYPTKT 234
           + ++R  GNS G+CG+   + YPTK 
Sbjct: 323 IRIKRAPGNSPGVCGLYKSSYYPTKN 348


>sp|P84346|MEX1_JACME Mexicain OS=Jacaratia mexicana PE=1 SV=1
          Length = 214

 Score =  184 bits (468), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 97/206 (47%), Positives = 130/206 (63%), Gaps = 12/206 (5%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCD-RSYNSGCGGGLMDY 87
           +N++ C    G+CWAFS    IEGINKI+TG L+SLSEQEL+DC+ RS+  GC GG    
Sbjct: 17  KNQNPC----GSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYRSH--GCDGGYQTP 70

Query: 88  AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQP 147
           + Q+V+ N G+ TE++YPY  + G+C  +      V I GYK VP N+E  L+QA+  QP
Sbjct: 71  SLQYVVDN-GVHTEREYPYEKKQGRCRAKDKKGPKVYITGYKYVPANDEISLIQAIANQP 129

Query: 148 VSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNG 207
           VSV      R FQ Y  GI+ GPC T+ DHAV  VGY    G  Y ++KNSWG +WG  G
Sbjct: 130 VSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGY----GKTYLLLKNSWGPNWGEKG 185

Query: 208 YMHMQRNTGNSLGICGINMLASYPTK 233
           Y+ ++R +G S G CG+   + +P K
Sbjct: 186 YIRIKRASGRSKGTCGVYTSSFFPIK 211


>sp|P84347|MEX2_JACME Chymomexicain OS=Jacaratia mexicana PE=1 SV=1
          Length = 215

 Score =  184 bits (467), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 96/205 (46%), Positives = 125/205 (60%), Gaps = 9/205 (4%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
           +N++ C    G+CWAFS    +EGINKI TG L+SLSEQEL+DCDR  + GC GG    +
Sbjct: 17  KNQNPC----GSCWAFSTVATVEGINKIRTGKLISLSEQELLDCDRR-SHGCKGGYQTGS 71

Query: 89  YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 148
            Q+V  N G+ TEK+YPY  + G+C  ++     V I GYK VP N+E  L+Q +  QPV
Sbjct: 72  IQYVADNGGVHTEKEYPYEKKQGKCRAKEKKGTKVQITGYKRVPANDEISLIQGIGNQPV 131

Query: 149 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 208
           SV      RAFQLY  GIF GPC    DHAV  +GY     +D    KNSWG +WG  GY
Sbjct: 132 SVLHESKGRAFQLYKGGIFNGPCGYKNDHAVTAIGYGKAQLLD----KNSWGPNWGEKGY 187

Query: 209 MHMQRNTGNSLGICGINMLASYPTK 233
           + ++R +G S G CG+   + +P K
Sbjct: 188 IKIKRASGKSEGTCGVYKSSYFPIK 212


>sp|P83443|MDO1_PSEMR Macrodontain-1 OS=Pseudananas macrodontes PE=1 SV=1
          Length = 213

 Score =  179 bits (455), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 90/206 (43%), Positives = 129/206 (62%), Gaps = 10/206 (4%)

Query: 27  QFRNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 86
           + +N+  C    G CWAF+A   +EGI KI  G+LV LSEQE++DC  SY  GC GG ++
Sbjct: 16  EVKNQGPC----GGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAVSY--GCKGGWVN 69

Query: 87  YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ 146
            AY F+I N+G+ T+++YPYR   G CN      +   I GY  V  N+E  ++ AV  Q
Sbjct: 70  RAYDFIISNNGVTTDENYPYRAYQGTCNANYF-PNSAYITGYSYVRRNDESHMMYAVSNQ 128

Query: 147 PVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMN 206
           P++  I  S   FQ Y  G+++GPC  SL+HA+ I+GY  ++   YWI++NSWG SWG  
Sbjct: 129 PIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYGRDS---YWIVRNSWGSSWGQG 185

Query: 207 GYMHMQRNTGNSLGICGINMLASYPT 232
           GY+ ++R+  +S G+CGI M   +PT
Sbjct: 186 GYVRIRRDVSHSGGVCGIAMSPLFPT 211


>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
          Length = 371

 Score =  179 bits (454), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 95/200 (47%), Positives = 130/200 (65%), Gaps = 13/200 (6%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 97
           G+CWAFS+TGA+EG +   +G LVSLSEQ L+DC   Y N+GC GGLMD A++++  N G
Sbjct: 176 GSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 235

Query: 98  IDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAV-VAQPVSVGICG 154
           IDTEK YPY      C+    N+  V  T  G+ D+P+ +EK++ +AV    PVSV I  
Sbjct: 236 IDTEKSYPYEAIDDSCH---FNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDA 292

Query: 155 SERAFQLYSSGIFTGPC--STSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHM 211
           S  +FQ YS G++  P   + +LDH VL+VG+ + E+G DYW++KNSWG +WG  G++ M
Sbjct: 293 SHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKM 352

Query: 212 QRNTGNSLGICGINMLASYP 231
            RN  N    CGI   +SYP
Sbjct: 353 LRNKENQ---CGIASASSYP 369


>sp|P09648|CATL1_CHICK Cathepsin L1 (Fragments) OS=Gallus gallus GN=CTSL1 PE=1 SV=1
          Length = 218

 Score =  179 bits (453), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 94/197 (47%), Positives = 122/197 (61%), Gaps = 7/197 (3%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 97
           G+CWAFS TGA+EG +    G LVSLSEQ L+DC R   N GC GGLMD A+Q+V  N G
Sbjct: 23  GSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGG 82

Query: 98  IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSE 156
           ID+E+ YPY  +  +  + K   +     G+ D+P+ +E+ L++AV +  PVSV I    
Sbjct: 83  IDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGH 142

Query: 157 RAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 214
            +FQ Y SGI+  P   S  LDH VL+VGY  E G  YWI+KNSWG  WG  GY++M ++
Sbjct: 143 SSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGGKKYWIVKNSWGEKWGDKGYIYMAKD 202

Query: 215 TGNSLGICGINMLASYP 231
             N    CGI   ASYP
Sbjct: 203 RKNH---CGIATAASYP 216


>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
          Length = 344

 Score =  176 bits (446), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 102/224 (45%), Positives = 130/224 (58%), Gaps = 30/224 (13%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
           +N+  C    G CW+FS TG+ EG +    G LVSLSEQ LIDC    NSGC GGLM YA
Sbjct: 128 KNQGQC----GGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYA 182

Query: 89  YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 148
           ++++I N+GIDTE  YPY+ + G+C  +  N    T+  YK V   +E  L  AV   PV
Sbjct: 183 FEYIINNNGIDTESSYPYKAENGKCEYKSENSG-ATLSSYKTVTAGSESSLESAVNVNPV 241

Query: 149 SVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGV---------------- 190
           SV I  S ++FQLY+SGI+  P   S +LDH VL VGY S +G                 
Sbjct: 242 SVAIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSAS 301

Query: 191 ---DYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 231
              +YWI+KNSWG SWG+ GY+ M RN  N+   CGI   AS+P
Sbjct: 302 SSNEYWIVKNSWGTSWGIEGYILMSRNRDNN---CGIASSASFP 342


>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
           SV=1
          Length = 323

 Score =  176 bits (445), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 96/197 (48%), Positives = 127/197 (64%), Gaps = 8/197 (4%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN-SGCGGGLMDYAYQFVIKNHG 97
           G+CWAFS TG++EG + + TGSL+SL+EQ+L+DC R Y   GC GG M+ A+ ++  N+G
Sbjct: 129 GSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNG 188

Query: 98  IDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-QPVSVGICGSE 156
           IDTE  YPY  + G C +   N    T  G+ ++   +E  L QAV    P+SV I  + 
Sbjct: 189 IDTEAAYPYEARDGSC-RFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAH 247

Query: 157 RAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 214
            +FQ YSSG++  P CS S LDHAVL VGY SE G D+W++KNSW  SWG  GY+ M RN
Sbjct: 248 SSFQFYSSGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRN 307

Query: 215 TGNSLGICGINMLASYP 231
             N+   CGI  +ASYP
Sbjct: 308 RNNN---CGIATVASYP 321


>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
          Length = 331

 Score =  174 bits (441), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 93/197 (47%), Positives = 125/197 (63%), Gaps = 8/197 (4%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDC--DRSYNSGCGGGLMDYAYQFVIKNH 96
           GACWAFSA GA+E   K+ TG LVSLS Q L+DC  ++  N GC GG M  A+Q++I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNN 196

Query: 97  GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGS 155
           GID+E  YPY+   G+C      R   T   Y ++P  +E  L +AV  + PVSV I  S
Sbjct: 197 GIDSEASYPYKAMNGKCRYDSKKR-AATCSKYTELPFGSEDALKEAVANKGPVSVAIDAS 255

Query: 156 ERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 214
             +F LY SG++  P C+ +++H VL+VGY + NG DYW++KNSWG ++G  GY+ M RN
Sbjct: 256 HYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARN 315

Query: 215 TGNSLGICGINMLASYP 231
           +GN    CGI    SYP
Sbjct: 316 SGNH---CGIASYPSYP 329


>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
          Length = 331

 Score =  174 bits (440), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 92/197 (46%), Positives = 126/197 (63%), Gaps = 8/197 (4%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY--NSGCGGGLMDYAYQFVIKNH 96
           G+CWAFSA GA+E   K+ TG LVSLS Q L+DC  +   N GC GG M  A+Q++I N+
Sbjct: 137 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNN 196

Query: 97  GIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGS 155
           GID+E  YPY+   G+C     NR   T   Y ++P  +E+ L +AV  + PVSVGI  S
Sbjct: 197 GIDSEASYPYKAMDGKCQYDVKNR-AATCSRYIELPFGSEEALKEAVANKGPVSVGIDAS 255

Query: 156 ERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRN 214
             +F LY +G++  P C+ +++H VL+VGY + +G DYW++KNSWG  +G  GY+ M RN
Sbjct: 256 HSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARN 315

Query: 215 TGNSLGICGINMLASYP 231
           +GN    CGI    SYP
Sbjct: 316 SGNH---CGIANYPSYP 329


>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
          Length = 329

 Score =  172 bits (437), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 99/206 (48%), Positives = 126/206 (61%), Gaps = 12/206 (5%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
           +N+  C    G+CWAFS+ GA+EG  K  TG L++LS Q L+DC  S N GCGGG M  A
Sbjct: 131 KNQGQC----GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNA 185

Query: 89  YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV-AQP 147
           +Q+V KN GID+E  YPY GQ   C      +      GY+++PE NEK L +AV    P
Sbjct: 186 FQYVQKNRGIDSEDAYPYVGQEESCMYNPTGK-AAKCRGYREIPEGNEKALKRAVARVGP 244

Query: 148 VSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 205
           VSV I  S  +FQ YS G++      S +L+HAVL VGY  + G  +WIIKNSWG +WG 
Sbjct: 245 VSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGN 304

Query: 206 NGYMHMQRNTGNSLGICGINMLASYP 231
            GY+ M RN  N+   CGI  LAS+P
Sbjct: 305 KGYILMARNKNNA---CGIANLASFP 327


>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
          Length = 329

 Score =  172 bits (437), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 99/206 (48%), Positives = 126/206 (61%), Gaps = 12/206 (5%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
           +N+  C    G+CWAFS+ GA+EG  K  TG L++LS Q L+DC  S N GCGGG M  A
Sbjct: 131 KNQGQC----GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNA 185

Query: 89  YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV-AQP 147
           +Q+V KN GID+E  YPY GQ   C      +      GY+++PE NEK L +AV    P
Sbjct: 186 FQYVQKNRGIDSEDAYPYVGQEESCMYNPTGK-AAKCRGYREIPEGNEKALKRAVARVGP 244

Query: 148 VSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 205
           VSV I  S  +FQ YS G++      S +L+HAVL VGY  + G  +WIIKNSWG +WG 
Sbjct: 245 VSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGN 304

Query: 206 NGYMHMQRNTGNSLGICGINMLASYP 231
            GY+ M RN  N+   CGI  LAS+P
Sbjct: 305 KGYILMARNKNNA---CGIANLASFP 327


>sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1
          Length = 329

 Score =  172 bits (437), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 99/206 (48%), Positives = 126/206 (61%), Gaps = 12/206 (5%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
           +N+  C    G+CWAFS+ GA+EG  K  TG L++LS Q L+DC  S N GCGGG M  A
Sbjct: 131 KNQGQC----GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNA 185

Query: 89  YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV-AQP 147
           +Q+V KN GID+E  YPY GQ   C      +      GY+++PE NEK L +AV    P
Sbjct: 186 FQYVQKNRGIDSEDAYPYVGQEESCMYNPTGK-AAKCRGYREIPEGNEKALKRAVARVGP 244

Query: 148 VSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 205
           VSV I  S  +FQ YS G++      S +L+HAVL VGY  + G  +WIIKNSWG +WG 
Sbjct: 245 VSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGN 304

Query: 206 NGYMHMQRNTGNSLGICGINMLASYP 231
            GY+ M RN  N+   CGI  LAS+P
Sbjct: 305 KGYILMARNKNNA---CGIANLASFP 327


>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
           GN=At3g43960 PE=2 SV=1
          Length = 376

 Score =  172 bits (436), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 97/202 (48%), Positives = 131/202 (64%), Gaps = 9/202 (4%)

Query: 39  GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDR-SYNSGCGGGLMDYAYQFVIKNHG 97
           G+CWAF+ATGA+EGIN+I TG LVSLSEQELIDCDR + N GC GG   +A++F+ +N G
Sbjct: 150 GSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGG 209

Query: 98  IDTEKDYPYRGQ---AGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICG 154
           I +++ Y Y G+   A +  + K  R +VTI+G++ VP N+E  L +AV  QP+SV I  
Sbjct: 210 IVSDEVYGYTGEDTAACKAIEMKTTR-VVTINGHEVVPVNDEMSLKKAVAYQPISVMISA 268

Query: 155 SERAFQLYSSGIFTGPCSTSL-DHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQ 212
           +      Y SG++ G CS    DH VLIVGY  S +  DYW+I+NSWG  WG  GY+ +Q
Sbjct: 269 AN--MSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQ 326

Query: 213 RNTGNSLGICGINMLASYPTKT 234
           RN     G C + +   YP K+
Sbjct: 327 RNFHEPTGKCAVAVAPVYPIKS 348


>sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1
          Length = 330

 Score =  172 bits (436), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 99/206 (48%), Positives = 126/206 (61%), Gaps = 12/206 (5%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
           +N+  C    G+CWAFS+ GA+EG  K  TG L++LS Q L+DC  S N GCGGG M  A
Sbjct: 132 KNQGQC----GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNA 186

Query: 89  YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV-AQP 147
           +Q+V KN GID+E  YPY GQ   C      +      GY+++PE NEK L +AV    P
Sbjct: 187 FQYVQKNRGIDSEDAYPYVGQDENCMYNPTGK-AAKCRGYREIPEGNEKALKRAVARVGP 245

Query: 148 VSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 205
           VSV I  S  +FQ YS G++      S +L+HAVL VGY  + G  +WIIKNSWG +WG 
Sbjct: 246 VSVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWGN 305

Query: 206 NGYMHMQRNTGNSLGICGINMLASYP 231
            GY+ M RN  N+   CGI  LAS+P
Sbjct: 306 KGYILMARNKNNA---CGIANLASFP 328


>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
          Length = 334

 Score =  172 bits (436), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 100/212 (47%), Positives = 125/212 (58%), Gaps = 16/212 (7%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDY 87
           +N+  C    G+CWAFSATGA+EG     TG LVSLSEQ L+DC R   N GC GGLMD 
Sbjct: 130 KNQGQC----GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDN 185

Query: 88  AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-Q 146
           A+Q+V  N G+DTE+ YPY G+       K         G+ D+P+  EK L++AV    
Sbjct: 186 AFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDIPQ-REKALMKAVATVG 244

Query: 147 PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE----NGVDYWIIKNSWG 200
           P+SV I     +FQ Y SGI+  P   S  LDH VL+VGY  E    N   +WI+KNSWG
Sbjct: 245 PISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWG 304

Query: 201 RSWGMNGYMHMQRNTGNSLGICGINMLASYPT 232
             WG NGY+ M ++  N    CGI+  ASYPT
Sbjct: 305 PEWGWNGYVKMAKDQNNH---CGISTAASYPT 333


>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
          Length = 330

 Score =  171 bits (434), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 98/206 (47%), Positives = 126/206 (61%), Gaps = 12/206 (5%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 88
           +N+  C    G+CWAFS+ GA+EG  K  TG L++LS Q L+DC  S N GCGGG M  A
Sbjct: 132 KNQGQC----GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNA 186

Query: 89  YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVV-AQP 147
           +Q+V KN GID+E  YPY GQ   C      +      GY+++PE NEK L +AV    P
Sbjct: 187 FQYVQKNRGIDSEDAYPYVGQDESCMYNPTGK-AAKCRGYREIPEGNEKALKRAVARVGP 245

Query: 148 VSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGM 205
           +SV I  S  +FQ YS G++      S +L+HAVL VGY  + G  +WIIKNSWG +WG 
Sbjct: 246 ISVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGN 305

Query: 206 NGYMHMQRNTGNSLGICGINMLASYP 231
            GY+ M RN  N+   CGI  LAS+P
Sbjct: 306 KGYILMARNKNNA---CGIANLASFP 328


>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
          Length = 334

 Score =  171 bits (432), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 92/211 (43%), Positives = 130/211 (61%), Gaps = 17/211 (8%)

Query: 29  RNKSSCLYLLGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDY 87
           +N+  C    G+CWAFSA+G +EG   + TG L+SLSEQ L+DC     N GC GGLMD+
Sbjct: 130 KNQGQC----GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDF 185

Query: 88  AYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVA-Q 146
           A+Q++ +N G+D+E+ YPY  + G C K +    +    G+ D+P+  EK L++AV    
Sbjct: 186 AFQYIKENGGLDSEESYPYEAKDGSC-KYRAEYAVANDTGFVDIPQ-QEKALMKAVATVG 243

Query: 147 PVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE----NGVDYWIIKNSWG 200
           P+SV +  S  + Q YSSGI+  P   S  LDH VL+VGY  E    N   YW++KNSWG
Sbjct: 244 PISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWG 303

Query: 201 RSWGMNGYMHMQRNTGNSLGICGINMLASYP 231
           + WGM+GY+ + ++  N    CG+   ASYP
Sbjct: 304 KEWGMDGYIKIAKDRNNH---CGLATAASYP 331


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.321    0.137    0.453 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 139,470,684
Number of Sequences: 539616
Number of extensions: 6100936
Number of successful extensions: 22821
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 223
Number of HSP's successfully gapped in prelim test: 56
Number of HSP's that attempted gapping in prelim test: 21944
Number of HSP's gapped (non-prelim): 406
length of query: 341
length of database: 191,569,459
effective HSP length: 118
effective length of query: 223
effective length of database: 127,894,771
effective search space: 28520533933
effective search space used: 28520533933
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (28.1 bits)