BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 048002
         (351 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
          Length = 362

 Score =  423 bits (1087), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 220/371 (59%), Positives = 258/371 (69%), Gaps = 41/371 (11%)

Query: 5   VGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLK 64
           V LS  LV GVA SFD+ + DLASEE LWDLYERWRSHHTVSR L EK  RFNVFK NL 
Sbjct: 9   VVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANLM 68

Query: 65  RIHKVNQMDKPYKLRLNRFADMTNHEFMSSRS-SKVSHHRMLHG-PRRQTGFMHGKTQDL 122
            +H  N+MDKPYKL+LN+FADMTNHEF S+ + SKV+H RM  G P     FM+ K   +
Sbjct: 69  HVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEKVVSV 128

Query: 123 PPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDK-DN 181
           PPSVDWRK+GAVT VKDQG+CGSCWAFSTVV+VEGIN+IKT +L +LSEQELVDCDK +N
Sbjct: 129 PPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEEN 188

Query: 182 HGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNA 241
            GC+GGLME A  FI +  G+TTE +YPY A++G+C+   S V               N 
Sbjct: 189 QGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCD--ASKV---------------ND 231

Query: 242 PEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE---------------- 285
             V +DG+E VP +DE+AL+KAVANQPV+VAIDAGG DFQFYSE                
Sbjct: 232 LAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVA 291

Query: 286 --GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKL---HP 340
             GYG T DGT YWIV+NSWG +W E GYIRM R I  +EGLCGI +  SYP+K    +P
Sbjct: 292 IVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIKNSSDNP 351

Query: 341 ENSRHPRKDEL 351
             S    KDEL
Sbjct: 352 TGSFSSPKDEL 362


>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
          Length = 362

 Score =  417 bits (1071), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 212/360 (58%), Positives = 253/360 (70%), Gaps = 41/360 (11%)

Query: 16  AESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQMDKP 75
           A SFD+ E DL SEE LWDLYERWRSHHTVSR L EK  RFNVFK N+  +H  N+MDKP
Sbjct: 20  ANSFDFHEKDLESEESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANVMHVHNTNKMDKP 79

Query: 76  YKLRLNRFADMTNHEFMSSRS-SKVSHHRMLHGPRRQTG-FMHGKTQDLPPSVDWRKQGA 133
           YKL+LN+FADMTNHEF S+ + SKV+HH+M  G +  +G FM+ K   +P SVDWRK+GA
Sbjct: 80  YKLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGA 139

Query: 134 VTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDK-DNHGCDGGLMEQA 192
           VT VKDQG+CGSCWAFST+V+VEGIN+IKT +L SLSEQELVDCDK +N GC+GGLME A
Sbjct: 140 VTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESA 199

Query: 193 LNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMV 252
             FI +  G+TTE +YPYTA++G+C+                     N   V +DG+E V
Sbjct: 200 FEFIKQKGGITTESNYPYTAQEGTCD-----------------ESKVNDLAVSIDGHENV 242

Query: 253 PESDENALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGT 294
           P +DENAL+KAVANQPV+VAIDAGG DFQFYSE                  GYG T DGT
Sbjct: 243 PVNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGT 302

Query: 295 KYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKL---HPENSRHPRKDEL 351
            YWIV+NSWG +W E+GYIRM R I  +EGLCGI + ASYP+K    +P  S    KDEL
Sbjct: 303 NYWIVRNSWGPEWGEQGYIRMQRNISKKEGLCGIAMMASYPIKNSSDNPTGSLSSPKDEL 362


>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
          Length = 360

 Score =  416 bits (1069), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 218/361 (60%), Positives = 255/361 (70%), Gaps = 41/361 (11%)

Query: 15  VAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQMDK 74
           + ESFD+ E +L SEE LW LYERWRSHHTVSR L EKQ RFNVFK N   +H  N+MDK
Sbjct: 17  ITESFDFHEKELESEESLWGLYERWRSHHTVSRSLHEKQKRFNVFKHNAMHVHNANKMDK 76

Query: 75  PYKLRLNRFADMTNHEFMSSRS-SKVSHHRMLH-GPRRQTGFMHGKTQDLPPSVDWRKQG 132
           PYKL+LN+FADMTNHEF ++ S SKV HHRM   GPR    FM+ K   +P SVDWRK+G
Sbjct: 77  PYKLKLNKFADMTNHEFRNTYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKG 136

Query: 133 AVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQ 191
           AVT VKDQG+CGSCWAFST+V+VEGIN+IKT +L SLSEQELVDCD D N GC+GGLM+ 
Sbjct: 137 AVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDY 196

Query: 192 ALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEM 251
           A  FI +  G+TTE +YPY A DG+C++                   +NAP V +DG+E 
Sbjct: 197 AFEFIKQRGGITTEANYPYEAYDGTCDVSK-----------------ENAPAVSIDGHEN 239

Query: 252 VPESDENALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDG 293
           VPE+DENAL+KAVANQPV+VAIDAGG DFQFYSE                  GYG T DG
Sbjct: 240 VPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDG 299

Query: 294 TKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKL---HPENSRHPRKDE 350
           TKYW VKNSWG +W EKGYIRM RGI  +EGLCGI +EASYP+K    +P   +   KDE
Sbjct: 300 TKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPIKKSSNNPSGIKSSPKDE 359

Query: 351 L 351
           L
Sbjct: 360 L 360


>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
           GN=CEP1 PE=2 SV=1
          Length = 361

 Score =  396 bits (1017), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 203/375 (54%), Positives = 250/375 (66%), Gaps = 42/375 (11%)

Query: 2   FFLVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQ 61
           F ++ L +++V    +  D+   D+ SE  LW+LYERWRSHHTV+R L+EK  RFNVFK 
Sbjct: 4   FIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLEEKAKRFNVFKH 63

Query: 62  NLKRIHKVNQMDKPYKLRLNRFADMTNHEFMSSRS-SKVSHHRMLHGPRRQT-GFMHGKT 119
           N+K IH+ N+ DK YKL+LN+F DMT+ EF  + + S + HHRM  G ++ T  FM+   
Sbjct: 64  NVKHIHETNKKDKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANV 123

Query: 120 QDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDK 179
             LP SVDWRK GAVT VK+QG+CGSCWAFSTVV+VEGIN+I+T +L SLSEQELVDCD 
Sbjct: 124 NTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDT 183

Query: 180 D-NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGD 238
           + N GC+GGLM+ A  FI +  GLT+E  YPY A D +C+                    
Sbjct: 184 NQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCD-----------------TNK 226

Query: 239 KNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE------------- 285
           +NAP V +DG+E VP++ E+ LMKAVANQPV+VAIDAGG DFQFYSE             
Sbjct: 227 ENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNH 286

Query: 286 -----GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKLHP 340
                GYG T DGTKYWIVKNSWG +W EKGYIRM RGI  +EGLCGI +EASYP+K   
Sbjct: 287 GVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPLKNSN 346

Query: 341 EN----SRHPRKDEL 351
            N    S    KDEL
Sbjct: 347 TNPSRLSLDSLKDEL 361


>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
          Length = 360

 Score =  392 bits (1008), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 202/373 (54%), Positives = 253/373 (67%), Gaps = 41/373 (10%)

Query: 2   FFLVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQ 61
           F  + L  +    +A+S  + E DLASE+ LW+LYE+WR+HHTV+RDL EK  RFNVFK+
Sbjct: 6   FIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVARDLDEKNRRFNVFKE 65

Query: 62  NLKRIHKVNQM-DKPYKLRLNRFADMTNHEFMSSRS-SKVSHHRMLHGPRRQTG-FMHGK 118
           N+K IH+ NQ  D PYKL LN+F DMTN EF S  + SK+ HHR   G ++ TG FM+  
Sbjct: 66  NVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYEN 125

Query: 119 TQDLPP-SVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDC 177
              LP  S+DWR +GAVTGVKDQG+CGSCWAFST+ SVEGIN+IKTGEL SLSEQELVDC
Sbjct: 126 VGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQELVDC 185

Query: 178 DKD-NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWN 236
           D   N GC+GGLM+ A  FI K+ G+TTE SYPY  +DG+C   ++++            
Sbjct: 186 DTSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDGTC--ASNLL------------ 230

Query: 237 GDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE----------- 285
              N+P V +DG++ VP ++ENALM+AVANQP++V+I+A G  FQFYSE           
Sbjct: 231 ---NSPVVSIDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTEL 287

Query: 286 -------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKL 338
                  GYGAT+DGTKYWIVKNSWG +W E GYIRM RGI  + G CGI +EASYP+K 
Sbjct: 288 DHGVAIVGYGATRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPIKT 347

Query: 339 HPENSRHPRKDEL 351
                    +DEL
Sbjct: 348 SANPKNSSTRDEL 360


>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
           GN=CEP3 PE=2 SV=1
          Length = 364

 Score =  375 bits (962), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 194/377 (51%), Positives = 246/377 (65%), Gaps = 43/377 (11%)

Query: 2   FFLVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQ 61
           FF+V +S + +   ++ FD+ E +L +EE +W LYERWR HH+VSR   E   RFNVF+ 
Sbjct: 4   FFIVLISFLSLLQASKGFDFDEKELETEENVWKLYERWRGHHSVSRASHEAIKRFNVFRH 63

Query: 62  NLKRIHKVNQMDKPYKLRLNRFADMTNHEFMSSRS-SKVSHHRMLHGPRRQT-GFMHGKT 119
           N+  +H+ N+ +KPYKL++NRFAD+T+HEF SS + S V HHRML GP+R + GFM+   
Sbjct: 64  NVLHVHRTNKKNKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGFMYENV 123

Query: 120 QDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCD- 178
             +P SVDWR++GAVT VK+Q  CGSCWAFSTV +VEGINKI+T +L SLSEQELVDCD 
Sbjct: 124 TRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDT 183

Query: 179 KDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGD 238
           ++N GC GGLME A  FI  + G+ TE++YPY + D               V  C  N  
Sbjct: 184 EENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSD---------------VQFCRAN-S 227

Query: 239 KNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE------------- 285
                V +DG+E VPE+DE  L+KAVA+QPV+VAIDAG  DFQ YSE             
Sbjct: 228 IGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNH 287

Query: 286 -----GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKLHP 340
                GYG T++GTKYWIV+NSWG +W E GY+R+ RGI   EG CGI +EASYP KL  
Sbjct: 288 GVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTKLSS 347

Query: 341 ENSRHPR------KDEL 351
             S H        KDEL
Sbjct: 348 TPSTHESVVRDDVKDEL 364


>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
           GN=CEP2 PE=2 SV=1
          Length = 361

 Score =  370 bits (951), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 195/377 (51%), Positives = 246/377 (65%), Gaps = 46/377 (12%)

Query: 2   FFLVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQ 61
             L+ L  +++   A  FDY + ++ SEE L  LY+RWRSHH+V R L E++ RFNVF+ 
Sbjct: 4   LLLIFLFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHSVPRSLNEREKRFNVFRH 63

Query: 62  NLKRIHKVNQMDKPYKLRLNRFADMTNHEFMSSRS-SKVSHHRMLHGPRR---QTGFMHG 117
           N+  +H  N+ ++ YKL+LN+FAD+T +EF ++ + S + HHRML GP+R   Q  + H 
Sbjct: 64  NVMHVHNTNKKNRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHE 123

Query: 118 KTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDC 177
               LP SVDWRK+GAVT +K+QG+CGSCWAFSTV +VEGINKIKT +L SLSEQELVDC
Sbjct: 124 NLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDC 183

Query: 178 D-KDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWN 236
           D K N GC+GGLME A  FI K+ G+TTE SYPY   DG C+                  
Sbjct: 184 DTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKD-------------- 229

Query: 237 GDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE----------- 285
              N   V +DG+E VPE+DENAL+KAVANQPV+VAIDAG  DFQFYSE           
Sbjct: 230 ---NGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTEL 286

Query: 286 -------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKL 338
                  GYG ++ G KYWIV+NSWG +W E GYI++ R ID  EG CGI +EASYP+KL
Sbjct: 287 NHGVAAVGYG-SERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPIKL 345

Query: 339 HPENSRHPR----KDEL 351
              N   P+    KDEL
Sbjct: 346 SSSNPT-PKDGDVKDEL 361


>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
          Length = 373

 Score =  325 bits (834), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 176/352 (50%), Positives = 215/352 (61%), Gaps = 43/352 (12%)

Query: 22  QESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQM-DKPYKLRL 80
           ++ DL SEE LWDLYERW+S H V R   EK  RF  FK N   IH  N+  D PY+L L
Sbjct: 32  EDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHL 91

Query: 81  NRFADMTNHEFMSSRSSKVSHHR--MLHGPRRQTGFMHG--KTQDLPPSVDWRKQGAVTG 136
           NRF DM   EF   R++ V   R      P    GFM+      DLPPSVDWR++GAVTG
Sbjct: 92  NRFGDMDQAEF---RATFVGDLRRDTPSKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTG 148

Query: 137 VKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCD-KDNHGCDGGLMEQALNF 195
           VKDQG+CGSCWAFSTVVSVEGIN I+TG L SLSEQEL+DCD  DN GC GGLM+ A  +
Sbjct: 149 VKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEY 208

Query: 196 IAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVI-LDGYEMVPE 254
           I  + GL TE +YPY A  G+C +  +                +N+P V+ +DG++ VP 
Sbjct: 209 IKNNGGLITEAAYPYRAARGTCNVARAA---------------QNSPVVVHIDGHQDVPA 253

Query: 255 SDENALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKY 296
           + E  L +AVANQPV+VA++A GK F FYSE                  GYG  +DG  Y
Sbjct: 254 NSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAY 313

Query: 297 WIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKLHPENSRHPRK 348
           W VKNSWG  W E+GYIR+ +   A  GLCGI +EASYPVK + +    PR+
Sbjct: 314 WTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTYSKPKPTPRR 365


>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
          Length = 371

 Score =  322 bits (826), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 177/352 (50%), Positives = 215/352 (61%), Gaps = 45/352 (12%)

Query: 22  QESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQM-DKPYKLRL 80
           ++ DL SEE LWDLYERW+S H V R   EK  RF  FK N   IH  N+  D PY+L L
Sbjct: 32  EDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHL 91

Query: 81  NRFADMTNHEFMSSRSSKVSHHR--MLHGPRRQTGFMHG--KTQDLPPSVDWRKQGAVTG 136
           NRF DM   EF   R++ V   R      P    GFM+      DLPPSVDWR++GAVTG
Sbjct: 92  NRFGDMDQAEF---RATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTG 148

Query: 137 VKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCD-KDNHGCDGGLMEQALNF 195
           VKDQG+CGSCWAFSTVVSVEGIN I+TG L SLSEQEL+DCD  DN GC GGLM+ A  +
Sbjct: 149 VKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEY 208

Query: 196 IAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVI-LDGYEMVPE 254
           I  + GL TE +YPY A  G+C +  +                +N+P V+ +DG++ VP 
Sbjct: 209 IKNNGGLITEAAYPYRAARGTCNVARAA---------------QNSPVVVHIDGHQDVPA 253

Query: 255 SDENALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKY 296
           + E  L +AVANQPV+VA++A GK F FYSE                  GYG  +DG  Y
Sbjct: 254 NSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAY 313

Query: 297 WIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKLHPENSRHPRK 348
           W VKNSWG  W E+GYIR+ +   A  GLCGI +EASYPVK +  N   PR+
Sbjct: 314 WTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTY--NKPMPRR 363


>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
          Length = 328

 Score =  275 bits (703), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 157/341 (46%), Positives = 197/341 (57%), Gaps = 51/341 (14%)

Query: 35  LYERWRSHHTVSRD-----LKEKQIRFNVFKQNLKRI--HKVNQMDKPYKLRLNRFADMT 87
           +Y RW   H  S       + ++  RFN+FK NL+ I  H  N  +  YKL L  FA++T
Sbjct: 3   IYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLT 62

Query: 88  NHEFMS----SRSSKVSHHRMLHGPRRQTGFMHGKTQD-LPPSVDWRKQGAVTGVKDQGR 142
           N E+ S    +R+  V   R+         +      D +P +VDWR++GAV  +KDQG 
Sbjct: 63  NDEYRSLYLGARTEPV--RRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGT 120

Query: 143 CGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQALNFIAKSEG 201
           CGSCWAFST  +VEGINKI TGEL SLSEQELVDCDK  N GC+GGLM+ A  FI K+ G
Sbjct: 121 CGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 180

Query: 202 LTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALM 261
           L TEK YPY   +G C       S++           KN+  V +DGYE VP  DE AL 
Sbjct: 181 LNTEKDYPYHGTNGKCN------SLL-----------KNSRVVTIDGYEDVPSKDETALK 223

Query: 262 KAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSW 303
           +AV+ QPV+VAIDAGG+ FQ Y                    GYG +++G  YWIV+NSW
Sbjct: 224 RAVSYQPVSVAIDAGGRAFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSW 282

Query: 304 GTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKLHPENSR 344
           GT W E GYIRM R + ++ G CGI +EASYPVK  P   R
Sbjct: 283 GTRWGEDGYIRMERNVASKSGKCGIAIEASYPVKYSPNPVR 323


>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
           SV=1
          Length = 462

 Score =  269 bits (687), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 150/333 (45%), Positives = 195/333 (58%), Gaps = 44/333 (13%)

Query: 28  SEECLWDLYERWRSHHTVSRD---LKEKQIRFNVFKQNLKRIHKVNQMDKPYKLRLNRFA 84
           SE  +  +YE W   H  ++    L EK  RF +FK NL+ + + N+ +  Y+L L RFA
Sbjct: 42  SEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFA 101

Query: 85  DMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQD-LPPSVDWRKQGAVTGVKDQGRC 143
           D+TN E+   RS  +       G RR +     +  D LP S+DWRK+GAV  VKDQG C
Sbjct: 102 DLTNDEY---RSKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGC 158

Query: 144 GSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQALNFIAKSEGL 202
           GSCWAFST+ +VEGIN+I TG+L +LSEQELVDCD   N GC+GGLM+ A  FI K+ G+
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218

Query: 203 TTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMK 262
            T+K YPY   DG+C+                    KNA  V +D YE VP   E +L K
Sbjct: 219 DTDKDYPYKGVDGTCDQIR-----------------KNAKVVTIDSYEDVPTYSEESLKK 261

Query: 263 AVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSWG 304
           AVA+QP+++AI+AGG+ FQ Y                    GYG T++G  YWIV+NSWG
Sbjct: 262 AVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG-TENGKDYWIVRNSWG 320

Query: 305 TDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
             W E GY+RM R I +  G CGI +E SYP+K
Sbjct: 321 KSWGESGYLRMARNIASSSGKCGIAIEPSYPIK 353


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
           SV=2
          Length = 356

 Score =  267 bits (683), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 148/337 (43%), Positives = 202/337 (59%), Gaps = 38/337 (11%)

Query: 21  YQESDLASEECLWDLYERWRSHHTVSRD-LKEKQIRFNVFKQNLKRIHKVNQMDKPYKLR 79
           Y   DL S + L +L+E W S+   + + ++EK +RF VFK NLK I + N+  K Y L 
Sbjct: 36  YSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLG 95

Query: 80  LNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKD 139
           LN FAD+++ EF        +        R    F +   + +P SVDWRK+GAV  VK+
Sbjct: 96  LNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKN 155

Query: 140 QGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQALNFIAK 198
           QG CGSCWAFSTV +VEGINKI TG L +LSEQEL+DCD   N+GC+GGLM+ A  +I K
Sbjct: 156 QGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVK 215

Query: 199 SEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDEN 258
           + GL  E+ YPY+ ++G+CE+                     +  V ++G++ VP +DE 
Sbjct: 216 NGGLRKEEDYPYSMEEGTCEMQKD-----------------ESETVTINGHQDVPTNDEK 258

Query: 259 ALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVK 300
           +L+KA+A+QP++VAIDA G++FQFYS                   GYG+++ G+ Y IVK
Sbjct: 259 SLLKALAHQPLSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVK 317

Query: 301 NSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
           NSWG  W EKGYIR+ R     EGLCGI   AS+P K
Sbjct: 318 NSWGPKWGEKGYIRLKRNTGKPEGLCGINKMASFPTK 354


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
           SV=1
          Length = 355

 Score =  266 bits (681), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 153/356 (42%), Positives = 204/356 (57%), Gaps = 42/356 (11%)

Query: 5   VGLSLVLVFGVAESFD---YQESDLASEECLWDLYERWRSHHTVS-RDLKEKQIRFNVFK 60
           +  S +L    A  F    Y    L + + L +L+E W S H+ + + ++EK  RF VF+
Sbjct: 17  ISASALLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFR 76

Query: 61  QNLKRIHKVNQMDKPYKLRLNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQ 120
           +NL  I + N     Y L LN FAD+T+ EF   R   ++  +     +    F +    
Sbjct: 77  ENLMHIDQRNNEINSYWLGLNEFADLTHEEF-KGRYLGLAKPQFSRKRQPSANFRYRDIT 135

Query: 121 DLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD 180
           DLP SVDWRK+GAV  VKDQG+CGSCWAFSTV +VEGIN+I TG L SLSEQEL+DCD  
Sbjct: 136 DLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTT 195

Query: 181 -NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDK 239
            N GC+GGLM+ A  +I  + GL  E  YPY  ++G C+                    +
Sbjct: 196 FNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQ-----------------EQKE 238

Query: 240 NAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFY---------------- 283
           +   V + GYE VPE+D+ +L+KA+A+QPV+VAI+A G+DFQFY                
Sbjct: 239 DVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGKCGTDLDHG 298

Query: 284 --SEGYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
             + GYG+++ G+ Y IVKNSWG  W EKG+IRM R     EGLCGI   ASYP K
Sbjct: 299 VAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGINKMASYPTK 353


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
           PE=1 SV=2
          Length = 458

 Score =  262 bits (669), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 148/338 (43%), Positives = 196/338 (57%), Gaps = 52/338 (15%)

Query: 28  SEECLWDLYERWRSHHTVSRD-LKEKQIRFNVFKQNLKRIHKVNQMDKP----YKLRLNR 82
           SEE    LY  W++ H  S + + E++ R+  F+ NL+ I + N         ++L LNR
Sbjct: 32  SEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNR 91

Query: 83  FADMTNHEFMSSRSSKVSHHRMLHGPRRQTG----FMHGKTQDLPPSVDWRKQGAVTGVK 138
           FAD+TN E+      + ++  + + PRR+      ++    + LP SVDWR +GAV  +K
Sbjct: 92  FADLTNEEY------RDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIK 145

Query: 139 DQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQALNFIA 197
           DQG CGSCWAFS + +VEGIN+I TG+L SLSEQELVDCD   N GC+GGLM+ A +FI 
Sbjct: 146 DQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFII 205

Query: 198 KSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDE 257
            + G+ TE  YPY  KD  C++                   KNA  V +D YE V  + E
Sbjct: 206 NNGGIDTEDDYPYKGKDERCDV-----------------NRKNAKVVTIDSYEDVTPNSE 248

Query: 258 NALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIV 299
            +L KAVANQPV+VAI+AGG+ FQ YS                   GYG T++G  YWIV
Sbjct: 249 TSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIV 307

Query: 300 KNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
           +NSWG  W E GY+RM R I A  G CGI +E SYP+K
Sbjct: 308 RNSWGKSWGESGYVRMERNIKASSGKCGIAVEPSYPLK 345


>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
           GN=GCP1 PE=2 SV=2
          Length = 376

 Score =  260 bits (665), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 152/351 (43%), Positives = 209/351 (59%), Gaps = 56/351 (15%)

Query: 28  SEECLWDLYERWRSHH-----TVSRDLKEKQIRFNVFKQNLKRI--HKVNQMDKPYKLRL 80
           ++E +  +Y +W + H       +  + ++  RFN+FK NL+ I  H  +  +  YKL L
Sbjct: 41  TDEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGL 100

Query: 81  NRFADMTNHEF----MSSRSS---KVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGA 133
            +F D+TN E+    + +R+    +++  + ++  ++ +  ++GK  ++P +VDWR++GA
Sbjct: 101 TKFTDLTNDEYRKLYLGARTEPARRIAKAKNVN--QKYSAAVNGK--EVPETVDWRQKGA 156

Query: 134 VTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQA 192
           V  +KDQG CGSCWAFST  +VEGINKI TGEL SLSEQELVDCDK  N GC+GGLM+ A
Sbjct: 157 VNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYA 216

Query: 193 LNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMV 252
             FI K+ GL TEK YPY    G C       S +           KN+  V +DGYE V
Sbjct: 217 FQFIMKNGGLNTEKDYPYRGFGGKCN------SFL-----------KNSRVVSIDGYEDV 259

Query: 253 PESDENALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGT 294
           P  DE AL KA++ QPV+VAI+AGG+ FQ Y                    GYG +++G 
Sbjct: 260 PTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGV 318

Query: 295 KYWIVKNSWGTDWEEKGYIRMLRGIDA-EEGLCGITLEASYPVKLHPENSR 344
            YWIV+NSWG  W E+GYIRM R + A + G CGI +EASYPVK  P   R
Sbjct: 319 DYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVKYSPNPVR 369


>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
           SV=2
          Length = 490

 Score =  258 bits (658), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 145/313 (46%), Positives = 183/313 (58%), Gaps = 45/313 (14%)

Query: 49  LKEKQIRFNVFKQNLKRIHKVNQMDKP---YKLRLNRFADMTNHEFMSSRSSKVSHHRML 105
           + E + RF VF  NLK +   N        ++L +NRFAD+TN EF   R++ +      
Sbjct: 82  IGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEF---RATYLGTTPAG 138

Query: 106 HGPRRQTGFMHGKTQDLPPSVDWRKQGAVTG-VKDQGRCGSCWAFSTVVSVEGINKIKTG 164
            G R    + H   + LP SVDWR +GAV   VK+QG+CGSCWAFS V +VEGINKI TG
Sbjct: 139 RGRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTG 198

Query: 165 ELWSLSEQELVDCDKD--NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTS 222
           EL SLSEQELV+C ++  N GC+GG+M+ A  FIA++ GL TE+ YPYTA DG C L   
Sbjct: 199 ELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKR 258

Query: 223 MVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQF 282
              +                 V +DG+E VPE+DE +L KAVA+QPV+VAIDAGG++FQ 
Sbjct: 259 SRKV-----------------VSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQL 301

Query: 283 YSE------------------GYGA-TQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEE 323
           Y                    GYG     G  YW V+NSWG DW E GYIRM R + A  
Sbjct: 302 YDSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTART 361

Query: 324 GLCGITLEASYPV 336
           G CGI + ASYP+
Sbjct: 362 GKCGIAMMASYPI 374


>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
           PE=1 SV=2
          Length = 466

 Score =  253 bits (646), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 188/311 (60%), Gaps = 46/311 (14%)

Query: 51  EKQIRFNVFKQNLKRIHKVNQMDKP---YKLRLNRFADMTNHEFMSS-RSSKVSHHRMLH 106
           E + RF VF  NLK +   N        ++L +NRFAD+TN EF ++   +KV+      
Sbjct: 70  EHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGAKVAERSRAA 129

Query: 107 GPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGEL 166
           G R    + H   ++LP SVDWR++GAV  VK+QG+CGSCWAFS V +VE IN++ TGE+
Sbjct: 130 GER----YRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEM 185

Query: 167 WSLSEQELVDCDKD--NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMV 224
            +LSEQELV+C  +  N GC+GGLM+ A +FI K+ G+ TE  YPY A DG C++     
Sbjct: 186 ITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDI----- 240

Query: 225 SIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFY- 283
                         +NA  V +DG+E VP++DE +L KAVA+QPV+VAI+AGG++FQ Y 
Sbjct: 241 ------------NRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYH 288

Query: 284 -----------------SEGYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLC 326
                            + GYG T +G  YWIV+NSWG  W E GY+RM R I+   G C
Sbjct: 289 SGVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKC 347

Query: 327 GITLEASYPVK 337
           GI + ASYP K
Sbjct: 348 GIAMMASYPTK 358


>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
           GN=At3g19400 PE=2 SV=1
          Length = 362

 Score =  252 bits (644), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 140/325 (43%), Positives = 190/325 (58%), Gaps = 40/325 (12%)

Query: 35  LYERWRSHHTVSRD-LKEKQIRFNVFKQNLKRIHKVNQM-DKPYKLRLNRFADMTNHEFM 92
           +YE+W   +  + + L EK+ RF +FK NLK + + N + D+ +++ L RFAD+TN EF 
Sbjct: 43  MYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR 102

Query: 93  SSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTV 152
           +    K    R     + +  +++ +   LP  VDWR  GAV  VKDQG CGSCWAFS V
Sbjct: 103 AIYLRK-KMERTKDSVKTER-YLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAV 160

Query: 153 VSVEGINKIKTGELWSLSEQELVDCDKD--NHGCDGGLMEQALNFIAKSEGLTTEKSYPY 210
            +VEGIN+I TGEL SLSEQELVDCD+   N GCDGG+M  A  FI K+ G+ T++ YPY
Sbjct: 161 GAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPY 220

Query: 211 TAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVA 270
            A D               + +C+ + + N   V +DGYE VP  DE +L KAVA+QPV+
Sbjct: 221 NAND---------------LGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVS 265

Query: 271 VAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSWGTDWEEKGY 312
           VAI+A  + FQ Y                    GYG+T  G  YWI++NSWG +W + GY
Sbjct: 266 VAIEASSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGY 324

Query: 313 IRMLRGIDAEEGLCGITLEASYPVK 337
           +++ R ID   G CGI +  SYP K
Sbjct: 325 VKLQRNIDDPFGKCGIAMMPSYPTK 349


>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
           GN=At4g11320 PE=2 SV=1
          Length = 371

 Score =  252 bits (643), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 143/329 (43%), Positives = 188/329 (57%), Gaps = 49/329 (14%)

Query: 35  LYERWR-SHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQMDKPYKLRLNRFADMTNHEFMS 93
           ++E W   H  V   + EK+ R  +F+ NL+ I   N  +  Y+L LNRFAD++ HE+  
Sbjct: 55  MFESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEY-- 112

Query: 94  SRSSKVSHHRMLHGPRRQTGFMHG----KTQD---LPPSVDWRKQGAVTGVKDQGRCGSC 146
               ++ H      PR    FM      KT D   LP SVDWR +GAVT VKDQG C SC
Sbjct: 113 ---GEICHGADPRPPRNHV-FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSC 168

Query: 147 WAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKSEGLTTEK 206
           WAFSTV +VEG+NKI TGEL +LSEQ+L++C+K+N+GC GG +E A  FI  + GL T+ 
Sbjct: 169 WAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDN 228

Query: 207 SYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVAN 266
            YPY A +G CE                    ++   V++DGYE +P +DE ALMKAVA+
Sbjct: 229 DYPYKALNGVCEGRLK----------------EDNKNVMIDGYENLPANDEAALMKAVAH 272

Query: 267 QPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSWGTDWE 308
           QPV   +D+  ++FQ Y                    GYG T++G  YWIVKNS G  W 
Sbjct: 273 QPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVVVVGYG-TENGRDYWIVKNSRGDTWG 331

Query: 309 EKGYIRMLRGIDAEEGLCGITLEASYPVK 337
           E GY++M R I    GLCGI + ASYP+K
Sbjct: 332 EAGYMKMARNIANPRGLCGIAMRASYPLK 360


>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
           GN=At4g11310 PE=2 SV=1
          Length = 364

 Score =  250 bits (638), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 141/329 (42%), Positives = 189/329 (57%), Gaps = 49/329 (14%)

Query: 35  LYERWRSHH-TVSRDLKEKQIRFNVFKQNLKRIHKVNQMDKPYKLRLNRFADMTNHEFMS 93
           ++E W   H  V   + EK+ R  +F+ NL+ I+  N  +  Y+L L  FAD++ HE+  
Sbjct: 48  IFESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEY-- 105

Query: 94  SRSSKVSHHRMLHGPRRQTGFM-----HGKTQD--LPPSVDWRKQGAVTGVKDQGRCGSC 146
               +V H      PR    FM     +  + D  LP SVDWR +GAVT VKDQG C SC
Sbjct: 106 ---KEVCHGADPRPPRNHV-FMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSC 161

Query: 147 WAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKSEGLTTEK 206
           WAFSTV +VEG+NKI TGEL +LSEQ+L++C+K+N+GC GG +E A  FI K+ GL T+ 
Sbjct: 162 WAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDN 221

Query: 207 SYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVAN 266
            YPY A +G                +C     +N   V++DGYE +P +DE+ALMKAVA+
Sbjct: 222 DYPYKAVNG----------------VCDGRLKENNKNVMIDGYENLPANDESALMKAVAH 265

Query: 267 QPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSWGTDWE 308
           QPV   ID+  ++FQ Y                    GYG T++G  YW+VKNS G  W 
Sbjct: 266 QPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYG-TENGRDYWLVKNSRGITWG 324

Query: 309 EKGYIRMLRGIDAEEGLCGITLEASYPVK 337
           E GY++M R I    GLCGI + ASYP+K
Sbjct: 325 EAGYMKMARNIANPRGLCGIAMRASYPLK 353


>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
          Length = 352

 Score =  246 bits (627), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 136/336 (40%), Positives = 182/336 (54%), Gaps = 38/336 (11%)

Query: 21  YQESDLASEECLWDLYERWR-SHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQMDKPYKLR 79
           Y + DL S E L  L++ W   H+ +   + EK  RF +F+ NL  I + N+ +  Y L 
Sbjct: 33  YSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLG 92

Query: 80  LNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKD 139
           LN FAD++N EF       V+             F +    + P S+DWR +GAVT VK+
Sbjct: 93  LNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKN 152

Query: 140 QGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKS 199
           QG CGSCWAFST+ +VEGINKI TG L  LSEQELVDCDK ++GC GG    +L ++A +
Sbjct: 153 QGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKHSYGCKGGYQTTSLQYVA-N 211

Query: 200 EGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENA 259
            G+ T K YPY AK   C                    DK  P+V + GY+ VP + E +
Sbjct: 212 NGVHTSKVYPYQAKQYKCRAT-----------------DKPGPKVKITGYKRVPSNCETS 254

Query: 260 LMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVKN 301
            + A+ANQP++V ++AGGK FQ Y                    GYG T DG  Y I+KN
Sbjct: 255 FLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYG-TSDGKNYIIIKN 313

Query: 302 SWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
           SWG +W EKGY+R+ R     +G CG+   + YP K
Sbjct: 314 SWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 349


>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
          Length = 380

 Score =  236 bits (601), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 148/374 (39%), Positives = 202/374 (54%), Gaps = 64/374 (17%)

Query: 4   LVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRD-LKEKQIRFNVFKQN 62
           L+  S +L+  +A  F+ +     + + +  +YE W   +  S + L E + RF +FK+ 
Sbjct: 12  LLFFSTLLILSLA--FNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKET 69

Query: 63  LKRIHKVN-QMDKPYKLRLNRFADMTNHEFMS--------SRSSKVSHHRMLHGPRRQTG 113
           L+ I + N   ++ YK+ LN+FAD+T+ EF S        S  +KVS+    + PR    
Sbjct: 70  LRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNR---YEPRVG-- 124

Query: 114 FMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQE 173
                 Q LP  VDWR  GAV  +K QG CG CWAFS + +VEGINKI TG L SLSEQE
Sbjct: 125 ------QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQE 178

Query: 174 LVDCDK--DNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVH 231
           L+DC +  +  GC+GG +     FI  + G+ TE++YPYTA+DG C L            
Sbjct: 179 LIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDL---------- 228

Query: 232 ICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE------ 285
                  +N   V +D YE VP ++E AL  AV  QPV+VA+DA G  F+ YS       
Sbjct: 229 -------QNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGP 281

Query: 286 ------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEAS 333
                       GYG T+ G  YWIVKNSW T W E+GY+R+LR +    G CGI    S
Sbjct: 282 CGTAIDHAVTIVGYG-TEGGIDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPS 339

Query: 334 YPVKLHPENSRHPR 347
           YPVK + +N  HP+
Sbjct: 340 YPVKYNNQN--HPK 351


>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
          Length = 380

 Score =  234 bits (597), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 147/374 (39%), Positives = 202/374 (54%), Gaps = 64/374 (17%)

Query: 4   LVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRD-LKEKQIRFNVFKQN 62
           L+  S +L+  +A  F+ +     + + +  +YE W   +  S + L E + RF +FK+ 
Sbjct: 12  LLFFSTLLILSLA--FNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKET 69

Query: 63  LKRIHKVN-QMDKPYKLRLNRFADMTNHEFMS--------SRSSKVSHHRMLHGPRRQTG 113
           L+ I + N   ++ YK+ LN+FAD+T+ EF S        S  +KVS+    + PR    
Sbjct: 70  LRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFTSGSNKTKVSNR---YEPRVG-- 124

Query: 114 FMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQE 173
                 Q LP  VDWR  GAV  +K QG CG CWAFS + +VEGINKI TG L SLSEQE
Sbjct: 125 ------QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQE 178

Query: 174 LVDCDK--DNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVH 231
           L+DC +  +  GC+GG +     FI  + G+ TE++YPYTA+DG C +            
Sbjct: 179 LIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDL---------- 228

Query: 232 ICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE------ 285
                  +N   V +D YE VP ++E AL  AV  QPV+VA+DA G  F+ YS       
Sbjct: 229 -------QNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGP 281

Query: 286 ------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEAS 333
                       GYG T+ G  YWIVKNSW T W E+GY+R+LR +    G CGI    S
Sbjct: 282 CGTAVDHAVTIVGYG-TEGGIDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPS 339

Query: 334 YPVKLHPENSRHPR 347
           YPVK + +N  HP+
Sbjct: 340 YPVKYNNQN--HPK 351


>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
          Length = 345

 Score =  233 bits (595), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 135/350 (38%), Positives = 186/350 (53%), Gaps = 37/350 (10%)

Query: 2   FFLVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWR-SHHTVSRDLKEKQIRFNVFK 60
           F  +GLS    FG      Y ++DL S E L  L+E W   H+ + +++ EK  RF +FK
Sbjct: 18  FVYMGLS----FGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFK 73

Query: 61  QNLKRIHKVNQMDKPYKLRLNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQ 120
            NLK I + N+ +  Y L LN FADM+N EF    +  ++ +        +     G   
Sbjct: 74  DNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGDV- 132

Query: 121 DLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD 180
           ++P  VDWR++GAVT VK+QG CGSCWAFS VV++EGI KI+TG L   SEQEL+DCD+ 
Sbjct: 133 NIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR 192

Query: 181 NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKN 240
           ++GC+GG    AL  +A+  G+    +YPY      C                  + +K 
Sbjct: 193 SYGCNGGYPWSALQLVAQY-GIHYRNTYPYEGVQRYCR-----------------SREKG 234

Query: 241 APEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSEGY------------- 287
                 DG   V   +E AL+ ++ANQPV+V ++A GKDFQ Y  G              
Sbjct: 235 PYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAV 294

Query: 288 GATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
            A   G  Y ++KNSWGT W E GYIR+ RG     G+CG+   + YPVK
Sbjct: 295 AAVGYGPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVK 344


>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
          Length = 348

 Score =  228 bits (581), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 132/343 (38%), Positives = 179/343 (52%), Gaps = 38/343 (11%)

Query: 13  FGVAESFDYQESDLASEECLWDLYERWR-SHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQ 71
           FG      Y + DL S E L  L+  W  +H+    ++ EK  RF +FK NL  I + N+
Sbjct: 25  FGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNK 84

Query: 72  MDKPYKLRLNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQ 131
            +  Y L LN FAD++N EF       +    +      +  F++  T +LP +VDWRK+
Sbjct: 85  KNNSYWLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEE--FINEDTVNLPENVDWRKK 142

Query: 132 GAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQ 191
           GAVT V+ QG CGSCWAFS V +VEGINKI+TG+L  LSEQELVDC++ +HGC GG    
Sbjct: 143 GAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPY 202

Query: 192 ALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEM 251
           AL ++AK+ G+     YPY AK G+C                        P V   G   
Sbjct: 203 ALEYVAKN-GIHLRSKYPYKAKQGTCRAKQV-----------------GGPIVKTSGVGR 244

Query: 252 VPESDENALMKAVANQPVAVAIDAGGKDFQFYSEGYGATQDGTKY--------------- 296
           V  ++E  L+ A+A QPV+V +++ G+ FQ Y  G      GTK                
Sbjct: 245 VQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGK 304

Query: 297 --WIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
              ++KNSWGT W EKGYIR+ R      G+CG+   + YP K
Sbjct: 305 GYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPTK 347


>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
          Length = 221

 Score =  227 bits (579), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 119/235 (50%), Positives = 144/235 (61%), Gaps = 37/235 (15%)

Query: 121 DLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD 180
           DLP S+DWR+ GAV  VK+QG CGSCWAFSTV +VEGIN+I TG+L SLSEQ+LVDC   
Sbjct: 2   DLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA 61

Query: 181 NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKN 240
           NHGC GG M  A  FI  + G+ +E++YPY  +DG C                  N   N
Sbjct: 62  NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGIC------------------NSTVN 103

Query: 241 APEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE--------------- 285
           AP V +D YE VP  +E +L KAVANQPV+V +DA G+DFQ Y                 
Sbjct: 104 APVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHAL 163

Query: 286 ---GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
              GYG T++   +WIVKNSWG +W E GYIR  R I+  +G CGIT  ASYPVK
Sbjct: 164 TVVGYG-TENDKDFWIVKNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPVK 217


>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
          Length = 351

 Score =  226 bits (575), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 123/320 (38%), Positives = 175/320 (54%), Gaps = 42/320 (13%)

Query: 36  YERWRSHH-TVSRDLKEKQIRFNVFKQNLKRIHKVNQMDK-PYKLRLNRFADMTNHEFMS 93
           +E W + +  V +D  EK  RF +FK N+K I   N  ++  Y L +N+F DMT  EF++
Sbjct: 37  FEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVA 96

Query: 94  SRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVV 153
             +       +   P     F       +P S+DWR  GAV  VK+Q  CGSCW+F+ + 
Sbjct: 97  QYTGVSLPLNIEREP--VVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIA 154

Query: 154 SVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAK 213
           +VEGI KIKTG L SLSEQE++DC   ++GC GG + +A +FI  + G+TTE++YPY A 
Sbjct: 155 TVEGIYKIKTGYLVSLSEQEVLDC-AVSYGCKGGWVNKAYDFIISNNGVTTEENYPYLAY 213

Query: 214 DGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAI 273
            G+C                  N +       + GY  V  +DE ++M AV+NQP+A  I
Sbjct: 214 QGTC------------------NANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALI 255

Query: 274 DAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRM 315
           DA  ++FQ+Y+                   GYG    GTKYWIV+NSWG+ W E GY+RM
Sbjct: 256 DA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRM 314

Query: 316 LRGIDAEEGLCGITLEASYP 335
            RG+ +  G+CGI +   +P
Sbjct: 315 ARGVSSSSGVCGIAMAPLFP 334


>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
          Length = 348

 Score =  224 bits (570), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 130/335 (38%), Positives = 179/335 (53%), Gaps = 38/335 (11%)

Query: 21  YQESDLASEECLWDLYERWRSHHTVS-RDLKEKQIRFNVFKQNLKRIHKVNQMDKPYKLR 79
           Y + DL S E L  L+  W   H  + +++ EK  RF +FK NLK I + N+M   Y L 
Sbjct: 33  YSQDDLTSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMINGYWLG 92

Query: 80  LNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKD 139
           LN F+D++N EF       +      + P  +  F++    DLP SVDWR +GAVT VK 
Sbjct: 93  LNEFSDLSNDEFKEKYVGSLPED-YTNQPYDEE-FVNEDIVDLPESVDWRAKGAVTPVKH 150

Query: 140 QGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKS 199
           QG C SCWAFSTV +VEGINKIKTG L  LSEQELVDCDK ++GC+ G    +L ++A++
Sbjct: 151 QGYCESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQSYGCNRGYQSTSLQYVAQN 210

Query: 200 EGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENA 259
            G+     YPY AK  +C                        P+V  +G   V  ++E +
Sbjct: 211 -GIHLRAKYPYIAKQQTCRA-----------------NQVGGPKVKTNGVGRVQSNNEGS 252

Query: 260 LMKAVANQPVAVAIDAGGKDFQFYSEGYGATQDGTKY-----------------WIVKNS 302
           L+ A+A+QPV+V +++ G+DFQ Y  G      GTK                   ++KNS
Sbjct: 253 LLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVDHAVTAVGYGKSGGKGYILIKNS 312

Query: 303 WGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
           WG  W E GYIR+ R      G+CG+   + YP+K
Sbjct: 313 WGPGWGENGYIRIRRASGNSPGVCGVYRSSYYPIK 347


>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
           lycopersicum PE=2 SV=1
          Length = 346

 Score =  224 bits (570), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 116/235 (49%), Positives = 150/235 (63%), Gaps = 37/235 (15%)

Query: 122 LPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD- 180
           LP S+DWR++G + GVKDQG CGSCWAFS V ++E IN I TG L SLSEQELVDCD+  
Sbjct: 18  LPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSY 77

Query: 181 NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKN 240
           N GCDGGLM+ A  F+ K+ G+ TE+ YPY  ++G C+         YR         KN
Sbjct: 78  NEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQ--------YR---------KN 120

Query: 241 APEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE--------------- 285
           A  V +D YE VP ++E AL KAVA+QPV++A++AGG+DFQ Y                 
Sbjct: 121 AKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGV 180

Query: 286 ---GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
              GYG T++G  YWIV+NSWG +  E GY+R+ R + +  GLCG+ +E SYPVK
Sbjct: 181 VIAGYG-TENGMDYWIVRNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPVK 234


>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
          Length = 345

 Score =  219 bits (557), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 121/320 (37%), Positives = 173/320 (54%), Gaps = 42/320 (13%)

Query: 36  YERWRSHH-TVSRDLKEKQIRFNVFKQNLKRIHKVNQMD-KPYKLRLNRFADMTNHEFMS 93
           +E W + +  V +D  EK +RF +FK N+  I   N  +   Y L +N+F DMTN+EF++
Sbjct: 37  FEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNEFVA 96

Query: 94  SRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVV 153
             +       +   P     F       +P S+DWR  GAVT VK+QGRCGSCWAF+++ 
Sbjct: 97  QYTGLSLPLNIKREP--VVSFDDVDISSVPQSIDWRDSGAVTSVKNQGRCGSCWAFASIA 154

Query: 154 SVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAK 213
           +VE I KIK G L SLSEQ+++DC   ++GC GG + +A +FI  ++G+ +   YPY A 
Sbjct: 155 TVESIYKIKRGNLVSLSEQQVLDC-AVSYGCKGGWINKAYSFIISNKGVASAAIYPYKAA 213

Query: 214 DGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAI 273
            G+C+                 NG  N+  +    Y  V  ++E  +M AV+NQP+A A+
Sbjct: 214 KGTCKT----------------NGVPNSAYITR--YTYVQRNNERNMMYAVSNQPIAAAL 255

Query: 274 DAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRM 315
           DA G +FQ Y                    GYG    G K+WIV+NSWG  W E GYIR+
Sbjct: 256 DASG-NFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNSWGAGWGEGGYIRL 314

Query: 316 LRGIDAEEGLCGITLEASYP 335
            R + +  GLCGI ++  YP
Sbjct: 315 ARDVSSSFGLCGIAMDPLYP 334


>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
          Length = 339

 Score =  217 bits (552), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 137/345 (39%), Positives = 180/345 (52%), Gaps = 61/345 (17%)

Query: 25  DLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQM----DKPYKLRL 80
           DL  EE  W  Y+    H     +  E++ R  +F +N  +I K NQ+       YKL L
Sbjct: 22  DLIKEE--WHTYKL--QHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGL 77

Query: 81  NRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQ------DLPPSVDWRKQGAV 134
           N++ADM +HEF  + +    +H +    R +TG + G T        +P SVDWR+ GAV
Sbjct: 78  NKYADMLHHEFKETMNG--YNHTLRQLMRERTGLV-GATYIPPAHVTVPKSVDWREHGAV 134

Query: 135 TGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD--NHGCDGGLMEQA 192
           TGVKDQG CGSCWAFS+  ++EG +  K G L SLSEQ LVDC     N+GC+GGLM+ A
Sbjct: 135 TGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 194

Query: 193 LNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMV 252
             +I  + G+ TEKSYPY   D SC    + +                       G+  +
Sbjct: 195 FRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIG------------------ATDTGFVDI 236

Query: 253 PESDENALMKAVANQ-PVAVAIDAGGKDFQFYSE--------------------GYGATQ 291
           PE DE  + KAVA   PV+VAIDA  + FQ YSE                    GYG  +
Sbjct: 237 PEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDE 296

Query: 292 DGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPV 336
            G  YW+VKNSWGT W E+GYI+M R  + +   CGI   +SYP 
Sbjct: 297 SGMDYWLVKNSWGTTWGEQGYIKMARNQNNQ---CGIATASSYPT 338


>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
          Length = 371

 Score =  215 bits (547), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 130/337 (38%), Positives = 177/337 (52%), Gaps = 56/337 (16%)

Query: 35  LYERWRS----HHTVSRDLKEKQIRFNVFKQNLKRIHKVNQM----DKPYKLRLNRFADM 86
           + E W +    H    +D  E++ R  +F +N  +I K NQ        +KL +N++AD+
Sbjct: 55  VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114

Query: 87  TNHEFMSSRSS-KVSHHRMLHGPR---RQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGR 142
            +HEF    +    + H+ L       +   F+      LP SVDWR +GAVT VKDQG 
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174

Query: 143 CGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD--NHGCDGGLMEQALNFIAKSE 200
           CGSCWAFS+  ++EG +  K+G L SLSEQ LVDC     N+GC+GGLM+ A  +I  + 
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234

Query: 201 GLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENAL 260
           G+ TEKSYPY A D SC      V    R                  G+  +P+ DE  +
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNKGTVGATDR------------------GFTDIPQGDEKKM 276

Query: 261 MKAVAN-QPVAVAIDAGGKDFQFYSE--------------------GYGATQDGTKYWIV 299
            +AVA   PV+VAIDA  + FQFYSE                    G+G  + G  YW+V
Sbjct: 277 AEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLV 336

Query: 300 KNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPV 336
           KNSWGT W +KG+I+MLR    +E  CGI   +SYP+
Sbjct: 337 KNSWGTTWGDKGFIKMLRN---KENQCGIASASSYPL 370


>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
          Length = 221

 Score =  211 bits (536), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 106/233 (45%), Positives = 138/233 (59%), Gaps = 35/233 (15%)

Query: 122 LPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDN 181
           LP S+DWR++GAV  VK+QG CGSCWAF  + +VEGIN+I TG+L SLSEQ+LVDC   N
Sbjct: 3   LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTRN 62

Query: 182 HGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNA 241
           HGC+GG   +A  +I  + G+ +E+ YPYT  +G+C+                    +NA
Sbjct: 63  HGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCDT------------------KENA 104

Query: 242 PEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSEGYGA------------ 289
             V +D Y  VP +DE +L KAVANQPV+V +DA G+DFQ Y  G               
Sbjct: 105 HVVSIDSYRNVPSNDEKSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRT 164

Query: 290 -----TQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
                T++   YW VKNSWG +W E GYIR+ R I    G CGI +  SYP+K
Sbjct: 165 VGGRETENDKDYWTVKNSWGKNWGESGYIRVERNIAESSGKCGIAISPSYPIK 217


>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
          Length = 215

 Score =  209 bits (533), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 107/234 (45%), Positives = 142/234 (60%), Gaps = 38/234 (16%)

Query: 122 LPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDN 181
           LP  VDWR +GAV  +K+Q +CGSCWAFS V +VE INKI+TG+L SLSEQELVDCD  +
Sbjct: 1   LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTAS 60

Query: 182 HGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNA 241
           HGC+GG M  A  +I  + G+ T+++YPY+A  GSC+         YR+ + S       
Sbjct: 61  HGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKP--------YRLRVVS------- 105

Query: 242 PEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE---------------- 285
               ++G++ V  ++E+AL  AVA+QPV+V ++A G  FQ YS                 
Sbjct: 106 ----INGFQRVTRNNESALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVV 161

Query: 286 --GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
             GYG TQ G  YWIV+NSWG +W  +GYI M R + +  GLCGI    SYP K
Sbjct: 162 IVGYG-TQSGKNYWIVRNSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYPTK 214


>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
           GN=At3g43960 PE=2 SV=1
          Length = 376

 Score =  206 bits (524), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 122/325 (37%), Positives = 173/325 (53%), Gaps = 39/325 (12%)

Query: 35  LYERWRSHHTVSRD-LKEKQIRFNVFKQNLKRIHKVNQ-MDKPYKLRLNRFADMTNHEFM 92
           +YE+W   +  + + L EK+ RF +FK NLKRI + N   ++ Y+  LN+F+D+T  EF 
Sbjct: 40  MYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQ 99

Query: 93  SSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTG-VKDQGRCGSCWAFST 151
           +S        + L     +  +  G    LP  VDWR++GAV   VK QG CGSCWAF+ 
Sbjct: 100 ASYLGGKMEKKSLSDVAERYQYKEGDV--LPDEVDWRERGAVVPRVKRQGECGSCWAFAA 157

Query: 152 VVSVEGINKIKTGELWSLSEQELVDCDK--DNHGCDGGLMEQALNFIAKSEGLTTEKSYP 209
             +VEGIN+I TGEL SLSEQEL+DCD+  DN GC GG    A  FI ++ G+ +++ Y 
Sbjct: 158 TGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYG 217

Query: 210 YTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPV 269
           YT +D +      M               K    V ++G+E+VP +DE +L KAVA QP+
Sbjct: 218 YTGEDTAACKAIEM---------------KTTRVVTINGHEVVPVNDEMSLKKAVAYQPI 262

Query: 270 AVAIDAGGK-----------------DFQFYSEGYGATQDGTKYWIVKNSWGTDWEEKGY 312
           +V I A                    D      GYG + D   YW+++NSWG +W E GY
Sbjct: 263 SVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGY 322

Query: 313 IRMLRGIDAEEGLCGITLEASYPVK 337
           +R+ R      G C + +   YP+K
Sbjct: 323 LRLQRNFHEPTGKCAVAVAPVYPIK 347


>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
          Length = 208

 Score =  205 bits (521), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 115/229 (50%), Positives = 133/229 (58%), Gaps = 35/229 (15%)

Query: 122 LPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDN 181
           LP  +DWRK+GAVT VK+QG CGSCWAFSTV +VE IN+I+TG L SLSEQELVDCDK N
Sbjct: 1   LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKN 60

Query: 182 HGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNA 241
           HGC GG    A  +I  + G+ T+ +YPY A  G C+  + +VSI               
Sbjct: 61  HGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAASKVVSI--------------- 105

Query: 242 PEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSEGYGATQDGTK------ 295
                DGY  VP  +E AL +AVA QP  VAIDA    FQ YS G  +   GTK      
Sbjct: 106 -----DGYNGVPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVT 160

Query: 296 -------YWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
                  YWIV+NSWG  W EKGYIRMLR      GLCGI     YP K
Sbjct: 161 IVGYQANYWIVRNSWGRYWGEKGYIRMLR--VGGCGLCGIARLPYYPTK 207


>sp|P84346|MEX1_JACME Mexicain OS=Jacaratia mexicana PE=1 SV=1
          Length = 214

 Score =  201 bits (512), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 103/228 (45%), Positives = 139/228 (60%), Gaps = 31/228 (13%)

Query: 123 PPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNH 182
           P S+DWR++GAVT VK+Q  CGSCWAFSTV ++EGINKI TG+L SLSEQEL+DC+  +H
Sbjct: 2   PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYRSH 61

Query: 183 GCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAP 242
           GCDGG    +L ++  + G+ TE+ YPY  K G C                    DK  P
Sbjct: 62  GCDGGYQTPSLQYVVDN-GVHTEREYPYEKKQGRCRAK-----------------DKKGP 103

Query: 243 EVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSEGY-----GATQD----- 292
           +V + GY+ VP +DE +L++A+ANQPV+V  D+ G+ FQFY  G      G   D     
Sbjct: 104 KVYITGYKYVPANDEISLIQAIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTA 163

Query: 293 ---GTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
              G  Y ++KNSWG +W EKGYIR+ R     +G CG+   + +P+K
Sbjct: 164 VGYGKTYLLLKNSWGPNWGEKGYIRIKRASGRSKGTCGVYTSSFFPIK 211


>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
          Length = 344

 Score =  196 bits (498), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 122/341 (35%), Positives = 161/341 (47%), Gaps = 63/341 (18%)

Query: 34  DLYERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQMDKPYKLRLNRFADMTNHEFMS 93
           + +  W   H  S   +E   R+N+FK N+  + + N       L LN FAD+TN E+ +
Sbjct: 28  NAFTDWMITHQKSYTSEEFGARYNIFKANMDYVQQWNSKGSETVLGLNNFADITNEEYRN 87

Query: 94  SRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVV 153
           +          L G + +  F    T     S DWR +GAVT VK+QG+CG CW+FST  
Sbjct: 88  TYLGTKFDASSLIGTQEEKVF----TTSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTG 143

Query: 154 SVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAK 213
           S EG +    GEL SLSEQ L+DC  +N GCDGGLM  A  +I  + G+ TE SYPY A+
Sbjct: 144 STEGAHFQSKGELVSLSEQNLIDCSTENSGCDGGLMTYAFEYIINNNGIDTESSYPYKAE 203

Query: 214 DGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAI 273
           +G CE  +                 +N+    L  Y+ V    E++L  AV   PV+VAI
Sbjct: 204 NGKCEYKS-----------------ENSG-ATLSSYKTVTAGSESSLESAVNVNPVSVAI 245

Query: 274 DAGGKDFQFYSEG-------------YGATQDG-------------------------TK 295
           DA  + FQ Y+ G             +G    G                          +
Sbjct: 246 DASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNE 305

Query: 296 YWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPV 336
           YWIVKNSWGT W  +GYI M R  D     CGI   AS+PV
Sbjct: 306 YWIVKNSWGTSWGIEGYILMSRNRDNN---CGIASSASFPV 343


>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
           SV=2
          Length = 322

 Score =  194 bits (494), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 118/317 (37%), Positives = 162/317 (51%), Gaps = 56/317 (17%)

Query: 48  DLKEKQIRFNVFKQNLKRIHKVNQM----DKPYKLRLNRFADMTNHEFMSSRSSKVSHHR 103
           DL+E++ R NVF  NL+ I + N+     +  Y L +N+F+DMTN +F +          
Sbjct: 33  DLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKFNAVMKG------ 86

Query: 104 MLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKT 163
              GPR    F           VDWR +GAVT VKDQG+CGSCWAFST   +EG + +KT
Sbjct: 87  YKKGPRPAAVFTSTDAAPESTEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGIEGQHFLKT 146

Query: 164 GELWSLSEQELVDCDKD---NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELP 220
           G L SLSEQ+LVDC      N GC+GG +E+A+ ++  + G+ TE SYPY A+D +C   
Sbjct: 147 GRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTESSYPYEARDNTCRF- 205

Query: 221 TSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVAN-QPVAVAIDAGGKD 279
                            + N       GY  + +  E+AL  A  +  P++VAIDA  + 
Sbjct: 206 -----------------NSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRS 248

Query: 280 FQFY--------------------SEGYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGI 319
           FQ Y                    + GYG ++ G  +W+VKNSW T W E GYI+M R  
Sbjct: 249 FQSYYTGVYYEPSCSSSQLDHAVLAVGYG-SEGGQDFWLVKNSWATSWGESGYIKMARNR 307

Query: 320 DAEEGLCGITLEASYPV 336
           +     CGI  +A YP 
Sbjct: 308 NNN---CGIATDACYPT 321


>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
          Length = 337

 Score =  193 bits (490), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 124/355 (34%), Positives = 177/355 (49%), Gaps = 46/355 (12%)

Query: 7   LSLVLVFG-VAESFDY-QESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLK 64
           LS+ L+F  +  S  +    ++ S +   D +  W   +  +   KE   R+  FK+N+ 
Sbjct: 3   LSITLIFTLIVLSISFISAGNVFSHKQYQDSFIDWMRSNNKAYTHKEFMPRYEEFKKNMD 62

Query: 65  RIHKVNQMDKPYKLRLNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQ-DLP 123
            +H  N       L LN+ AD++N E+  +     +H ++    +R  G    + Q   P
Sbjct: 63  YVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRLNRPQFKQP 122

Query: 124 PSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD--N 181
            +VDWR++ AVT VKDQG+CGSC++FST  SVEG+  IKTG+L SLSEQ ++DC     N
Sbjct: 123 LNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGN 182

Query: 182 HGCDGGLMEQALNFIAKSEGLTTEKSYPYTAK-DGSCELPTSMVSIIYRVHICSWNGDKN 240
            GC+GGLM  A  +I K+ GL +E+ YPY  K +  C+     V+               
Sbjct: 183 EGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVA--------------- 227

Query: 241 APEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSEG-------------Y 287
                +  Y+ +   DEN L  A+   PV+VAIDA    FQ Y+ G             +
Sbjct: 228 ---AKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDH 284

Query: 288 G------ATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPV 336
           G       T +G  Y+IVKNSWG  W   GYI M R  D     CGI+  ASYP+
Sbjct: 285 GVLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHMARNKDNN---CGISTMASYPI 336


>sp|O60911|CATL2_HUMAN Cathepsin L2 OS=Homo sapiens GN=CTSL2 PE=1 SV=2
          Length = 334

 Score =  192 bits (488), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 133/373 (35%), Positives = 184/373 (49%), Gaps = 83/373 (22%)

Query: 5   VGLSLVLV---FGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQ 61
           + LSLVL     G+A +    + +L ++      + +W++ H       E+  R  V+++
Sbjct: 1   MNLSLVLAAFCLGIASAVPKFDQNLDTK------WYQWKATHRRLYGANEEGWRRAVWEK 54

Query: 62  NLKRIHKVN----QMDKPYKLRLNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHG 117
           N+K I   N    Q    + + +N F DMTN EF           R + G  R   F  G
Sbjct: 55  NMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEF-----------RQMMGCFRNQKFRKG 103

Query: 118 KT------QDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSE 171
           K        DLP SVDWRK+G VT VK+Q +CGSCWAFS   ++EG    KTG+L SLSE
Sbjct: 104 KVFREPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSE 163

Query: 172 QELVDCDK--DNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYR 229
           Q LVDC +   N GC+GG M +A  ++ ++ GL +E+SYPY A D  C+         YR
Sbjct: 164 QNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICK---------YR 214

Query: 230 VHICSWNGDKNAPEVIL---DGYEMVPESDENALMKAVAN-QPVAVAIDAGGKDFQFYSE 285
                       PE  +    G+ +V    E ALMKAVA   P++VA+DAG   FQFY  
Sbjct: 215 ------------PENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKS 262

Query: 286 GY-----------------------GATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAE 322
           G                        GA  + +KYW+VKNSWG +W   GY+++ +  +  
Sbjct: 263 GIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNH 322

Query: 323 EGLCGITLEASYP 335
              CGI   ASYP
Sbjct: 323 ---CGIATAASYP 332


>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
           SV=1
          Length = 323

 Score =  191 bits (486), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 120/323 (37%), Positives = 165/323 (51%), Gaps = 67/323 (20%)

Query: 48  DLKEKQIRFNVFKQNLKRIHKVNQM----DKPYKLRLNRFADMTNHEF-------MSSRS 96
           D +E   R  +F+QN K I + N+     +  + L +N+F DMT  EF       +  RS
Sbjct: 33  DAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMKGNIPRRS 92

Query: 97  SKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVE 156
           + VS    +  P+++TG            VDWR +GAVT VKDQG+CGSCWAFST  S+E
Sbjct: 93  APVS----VFYPKKETG-------PQATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLE 141

Query: 157 GINKIKTGELWSLSEQELVDCDK--DNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKD 214
           G + +KTG L SL+EQ+LVDC +     GC+GG M  A ++I  + G+ TE +YPY A+D
Sbjct: 142 GQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARD 201

Query: 215 GSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVAN-QPVAVAI 273
           GSC                    D N+      G+  +    E  L +AV +  P++V I
Sbjct: 202 GSCRF------------------DSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTI 243

Query: 274 DAGGKDFQFYSE--------------------GYGATQDGTKYWIVKNSWGTDWEEKGYI 313
           DA    FQFYS                     GYG ++ G  +W+VKNSW T W + GYI
Sbjct: 244 DAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVGYG-SEGGQDFWLVKNSWATSWGDAGYI 302

Query: 314 RMLRGIDAEEGLCGITLEASYPV 336
           +M R  +     CGI   ASYP+
Sbjct: 303 KMSRNRNNN---CGIATVASYPL 322


>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
          Length = 334

 Score =  191 bits (485), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 123/331 (37%), Positives = 171/331 (51%), Gaps = 57/331 (17%)

Query: 36  YERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVN----QMDKPYKLRLNRFADMTNHEF 91
           + +W+S H       E++ R  ++++N++ I   N         + + +N F DMTN EF
Sbjct: 29  WHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEF 88

Query: 92  MSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFST 151
               +    H +   G   Q   M      +P SVDWR++G VT VK+QG+CGSCWAFS 
Sbjct: 89  RQVVNG-YRHQKHKKGRLFQEPLM----LKIPKSVDWREKGCVTPVKNQGQCGSCWAFSA 143

Query: 152 VVSVEGINKIKTGELWSLSEQELVDCD--KDNHGCDGGLMEQALNFIAKSEGLTTEKSYP 209
              +EG   +KTG+L SLSEQ LVDC   + N GC+GGLM+ A  +I ++ GL +E+SYP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203

Query: 210 YTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVAN-QP 268
           Y AKDGSC+         YR      N           G+  +P+  E ALMKAVA   P
Sbjct: 204 YEAKDGSCK---------YRAEFAVAND---------TGFVDIPQ-QEKALMKAVATVGP 244

Query: 269 VAVAIDAGGKDFQFYSEGY-----------------------GATQDGTKYWIVKNSWGT 305
           ++VA+DA     QFYS G                        G   +  KYW+VKNSWG+
Sbjct: 245 ISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGS 304

Query: 306 DWEEKGYIRMLRGIDAEEGLCGITLEASYPV 336
           +W  +GYI++ +  D     CG+   ASYPV
Sbjct: 305 EWGMEGYIKIAKDRDNH---CGLATAASYPV 332


>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
          Length = 330

 Score =  191 bits (485), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 129/360 (35%), Positives = 187/360 (51%), Gaps = 64/360 (17%)

Query: 6   GLSLVLVFGVAESFDYQESDLASEECLWDLYER-WRSHHTVSRDLKEKQIRFNVFKQNLK 64
           GL ++L+  +A    Y E  L ++   WDL+++ +R  +    D   +++   ++++NLK
Sbjct: 3   GLEVLLLLPMASFALYPEEILDTQ---WDLWKKTYRKQYNSKVDELSRRL---IWEKNLK 56

Query: 65  RIHKVNQMDKP-----YKLRLNRFADMTNHEF---MSSRSSKVSHHRMLHGPRRQTGFMH 116
            I  ++ ++       Y+L +N   DMT+ E    M+      SH R        T ++ 
Sbjct: 57  HI-SIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSHSR-----SNDTLYIP 110

Query: 117 GKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVD 176
                 P SVD+RK+G VT VK+QG+CGSCWAFS+V ++EG  K KTG+L +LS Q LVD
Sbjct: 111 DWESRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVD 170

Query: 177 CDKDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWN 236
           C  +N GC GG M  A  ++ K+ G+ +E +YPY  +D SC     M +   +   C   
Sbjct: 171 CVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESC-----MYNPTGKAAKCR-- 223

Query: 237 GDKNAPEVILDGYEMVPESDENALMKAVAN-QPVAVAIDAGGKDFQFYSE---------- 285
                      GY  +PE +E AL +AVA   P++VAIDA    FQFYS+          
Sbjct: 224 -----------GYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNS 272

Query: 286 ----------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYP 335
                     GYG  Q G K+WI+KNSWG +W  KGYI M R    +   CGI   AS+P
Sbjct: 273 DNLNHAVLAVGYG-IQKGNKHWIIKNSWGENWGNKGYILMARN---KNNACGIANLASFP 328


>sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2 SV=1
          Length = 329

 Score =  190 bits (483), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 124/338 (36%), Positives = 177/338 (52%), Gaps = 54/338 (15%)

Query: 26  LASEECLWDLYERWRSHHTVSRDLKEKQI-RFNVFKQNLKRIHKVNQMDKP-----YKLR 79
           L+ EE L   +E W+  H    + K  +I R  ++++NLK+I  V+ ++       Y+L 
Sbjct: 16  LSPEETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKI-SVHNLEASLGAHTYELA 74

Query: 80  LNRFADMTNHEFMSSRSS-KVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVK 138
           +N   DMT+ E +   +  +V   R        T    G+   +P S+D+RK+G VT VK
Sbjct: 75  MNHLGDMTSEEVVQKMTGLRVPPSRSFSNDTLYTPEWEGR---VPDSIDYRKKGYVTPVK 131

Query: 139 DQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAK 198
           +QG+CGSCWAFS+  ++EG  K KTG+L +LS Q LVDC  +N+GC GG M  A  ++ +
Sbjct: 132 NQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVSENYGCGGGYMTTAFQYVQQ 191

Query: 199 SEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDEN 258
           + G+ +E +YPY  +D SC     M +   +   C              GY  +P  +E 
Sbjct: 192 NGGIDSEDAYPYVGQDESC-----MYNATAKAAKCR-------------GYREIPVGNEK 233

Query: 259 ALMKAVAN-QPVAVAIDAGGKDFQFYSE--------------------GYGATQDGTKYW 297
           AL +AVA   PV+V+IDA    FQFYS                     GYG TQ G KYW
Sbjct: 234 ALKRAVARVGPVSVSIDASLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYG-TQKGNKYW 292

Query: 298 IVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYP 335
           I+KNSWG  W  KGY+ + R    +   CGIT  AS+P
Sbjct: 293 IIKNSWGESWGNKGYVLLARN---KNNACGITNLASFP 327


>sp|Q9GKL8|CATL1_CHLAE Cathepsin L1 OS=Chlorocebus aethiops GN=CTSL1 PE=1 SV=1
          Length = 333

 Score =  190 bits (482), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 133/369 (36%), Positives = 183/369 (49%), Gaps = 73/369 (19%)

Query: 1   TFFLVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFK 60
           TF L  L L    G+A +       L     L   + +W++ H     + E+  R  V++
Sbjct: 4   TFILAALCL----GIASA------TLTFNHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWE 53

Query: 61  QNLKRI----HKVNQMDKPYKLRLNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMH 116
           +N+K I     + +Q    + + +N F DMT+ EF      +V +      PR+   F  
Sbjct: 54  KNMKMIELHNQEYSQGKHSFTMAMNTFGDMTSEEF-----RQVMNGFQNRKPRKGKVFQE 108

Query: 117 GKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVD 176
               + P SVDWR++G VT VK+QG+CGSCWAFS   ++EG    KTG+L SLSEQ LVD
Sbjct: 109 PLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVD 168

Query: 177 CD--KDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICS 234
           C   + N GC+GGLM+ A  ++A + GL +E+SYPY A + SC                 
Sbjct: 169 CSGPQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESC----------------- 211

Query: 235 WNGDKNAPEVIL---DGYEMVPESDENALMKAVAN-QPVAVAIDAGGKDFQFYSE----- 285
               K  PE  +    G+  +P+  E ALMKAVA   P++VAIDAG + F FY E     
Sbjct: 212 ----KYNPEYSVANDTGFVDIPK-QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFE 266

Query: 286 ---------------GYG---ATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCG 327
                          GYG      D +KYW+VKNSWG +W   GYI+M +        CG
Sbjct: 267 PDCSSEDMDHGVLVVGYGFESTESDNSKYWLVKNSWGEEWGMGGYIKMAKD---RRNHCG 323

Query: 328 ITLEASYPV 336
           I   ASYP 
Sbjct: 324 IASAASYPT 332


>sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1
          Length = 330

 Score =  190 bits (482), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 129/360 (35%), Positives = 187/360 (51%), Gaps = 64/360 (17%)

Query: 6   GLSLVLVFGVAESFDYQESDLASEECLWDLYER-WRSHHTVSRDLKEKQIRFNVFKQNLK 64
           GL +VL+  V  S  Y E  L ++   W+L+++ +R  +    D   +++   ++++NLK
Sbjct: 3   GLKVVLLLPVMSSALYPEEILDTQ---WELWKKTYRKQYNSKVDEISRRL---IWEKNLK 56

Query: 65  RIHKVNQMDKP-----YKLRLNRFADMTNHEF---MSSRSSKVSHHRMLHGPRRQTGFMH 116
            I  ++ ++       Y+L +N   DMT+ E    M+      SH R        T ++ 
Sbjct: 57  HI-SIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSHSR-----SNDTLYIP 110

Query: 117 GKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVD 176
                 P S+D+RK+G VT VK+QG+CGSCWAFS+V ++EG  K KTG+L +LS Q LVD
Sbjct: 111 DWEGRTPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVD 170

Query: 177 CDKDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWN 236
           C  +N GC GG M  A  ++ K+ G+ +E +YPY  +D +C     M +   +   C   
Sbjct: 171 CVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENC-----MYNPTGKAAKCR-- 223

Query: 237 GDKNAPEVILDGYEMVPESDENALMKAVAN-QPVAVAIDAGGKDFQFYSE---------- 285
                      GY  +PE +E AL +AVA   PV+VAIDA    FQFYS+          
Sbjct: 224 -----------GYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNS 272

Query: 286 ----------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYP 335
                     GYG  Q G K+WI+KNSWG +W  KGYI M R    +   CGI   AS+P
Sbjct: 273 DNLNHAVLAVGYG-IQKGKKHWIIKNSWGENWGNKGYILMARN---KNNACGIANLASFP 328


>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
          Length = 329

 Score =  189 bits (481), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 128/340 (37%), Positives = 175/340 (51%), Gaps = 58/340 (17%)

Query: 26  LASEECLWDLYERWRSHHTVSRDLKEKQI-RFNVFKQNLKRIHKVNQMDKP-----YKLR 79
           L  EE L   +E W+  H    + K  +I R  ++++NLK I  ++ ++       Y+L 
Sbjct: 16  LYPEEILDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYI-SIHNLEASLGVHTYELA 74

Query: 80  LNRFADMTNHEF---MSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTG 136
           +N   DMTN E    M+      SH R        T ++       P SVD+RK+G VT 
Sbjct: 75  MNHLGDMTNEEVVQKMTGLKVPASHSR-----SNDTLYIPDWEGRAPDSVDYRKKGYVTP 129

Query: 137 VKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFI 196
           VK+QG+CGSCWAFS+V ++EG  K KTG+L +LS Q LVDC  +N GC GG M  A  ++
Sbjct: 130 VKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYV 189

Query: 197 AKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESD 256
            K+ G+ +E +YPY  ++ SC     M +   +   C              GY  +PE +
Sbjct: 190 QKNRGIDSEDAYPYVGQEESC-----MYNPTGKAAKCR-------------GYREIPEGN 231

Query: 257 ENALMKAVAN-QPVAVAIDAGGKDFQFYSE--------------------GYGATQDGTK 295
           E AL +AVA   PV+VAIDA    FQFYS+                    GYG  Q G K
Sbjct: 232 EKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-IQKGNK 290

Query: 296 YWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYP 335
           +WI+KNSWG +W  KGYI M R    +   CGI   AS+P
Sbjct: 291 HWIIKNSWGENWGNKGYILMARN---KNNACGIANLASFP 327


>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
          Length = 329

 Score =  189 bits (481), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 128/340 (37%), Positives = 175/340 (51%), Gaps = 58/340 (17%)

Query: 26  LASEECLWDLYERWRSHHTVSRDLKEKQI-RFNVFKQNLKRIHKVNQMDKP-----YKLR 79
           L  EE L   +E W+  H    + K  +I R  ++++NLK I  ++ ++       Y+L 
Sbjct: 16  LYPEEILDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYI-SIHNLEASLGVHTYELA 74

Query: 80  LNRFADMTNHEF---MSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTG 136
           +N   DMTN E    M+      SH R        T ++       P SVD+RK+G VT 
Sbjct: 75  MNHLGDMTNEEVVQKMTGLKVPASHSR-----SNDTLYIPDWEGRAPDSVDYRKKGYVTP 129

Query: 137 VKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFI 196
           VK+QG+CGSCWAFS+V ++EG  K KTG+L +LS Q LVDC  +N GC GG M  A  ++
Sbjct: 130 VKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYV 189

Query: 197 AKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESD 256
            K+ G+ +E +YPY  ++ SC     M +   +   C              GY  +PE +
Sbjct: 190 QKNRGIDSEDAYPYVGQEESC-----MYNPTGKAAKCR-------------GYREIPEGN 231

Query: 257 ENALMKAVAN-QPVAVAIDAGGKDFQFYSE--------------------GYGATQDGTK 295
           E AL +AVA   PV+VAIDA    FQFYS+                    GYG  Q G K
Sbjct: 232 EKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-IQKGNK 290

Query: 296 YWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYP 335
           +WI+KNSWG +W  KGYI M R    +   CGI   AS+P
Sbjct: 291 HWIIKNSWGENWGNKGYILMARN---KNNACGIANLASFP 327


>sp|P84347|MEX2_JACME Chymomexicain OS=Jacaratia mexicana PE=1 SV=1
          Length = 215

 Score =  189 bits (479), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 98/228 (42%), Positives = 134/228 (58%), Gaps = 30/228 (13%)

Query: 123 PPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNH 182
           P S+DWR +GAVT VK+Q  CGSCWAFSTV +VEGINKI+TG+L SLSEQEL+DCD+ +H
Sbjct: 2   PESIDWRDKGAVTPVKNQNPCGSCWAFSTVATVEGINKIRTGKLISLSEQELLDCDRRSH 61

Query: 183 GCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAP 242
           GC GG    ++ ++A + G+ TEK YPY  K G C                    +K   
Sbjct: 62  GCKGGYQTGSIQYVADNGGVHTEKEYPYEKKQGKCRAK-----------------EKKGT 104

Query: 243 EVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSEGY-----GATQD----- 292
           +V + GY+ VP +DE +L++ + NQPV+V  ++ G+ FQ Y  G      G   D     
Sbjct: 105 KVQITGYKRVPANDEISLIQGIGNQPVSVLHESKGRAFQLYKGGIFNGPCGYKNDHAVTA 164

Query: 293 ---GTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
              G    + KNSWG +W EKGYI++ R     EG CG+   + +P+K
Sbjct: 165 IGYGKAQLLDKNSWGPNWGEKGYIKIKRASGKSEGTCGVYKSSYFPIK 212


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.317    0.134    0.416 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 137,431,730
Number of Sequences: 539616
Number of extensions: 5804850
Number of successful extensions: 13455
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 216
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 12555
Number of HSP's gapped (non-prelim): 292
length of query: 351
length of database: 191,569,459
effective HSP length: 118
effective length of query: 233
effective length of database: 127,894,771
effective search space: 29799481643
effective search space used: 29799481643
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (28.5 bits)