BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 037516
         (330 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
          Length = 380

 Score =  302 bits (774), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 149/333 (44%), Positives = 218/333 (65%), Gaps = 12/333 (3%)

Query: 1   MLIIMVTWASLVMSRTLHEDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKF 60
           +LI+ + + +  +++  + D + A +E W+ +  ++Y +  E   RF+IFK+  RFI++ 
Sbjct: 18  LLILSLAFNAKNLTQRTN-DEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEH 76

Query: 61  NREGNQTYKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPR 120
           N + N++YK+ LN+FADLTDEEF +++ G+   + N +  S  Y       P   + LP 
Sbjct: 77  NADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS-NKTKVSNRYE------PRVGQVLPS 129

Query: 121 SIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC---SGSR 177
            +DWR+ GAV  +K+QG CG CW FSA+A VEGI KI TG LISLSEQ+++DC     +R
Sbjct: 130 YVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTR 189

Query: 178 GCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVP-TSELA 236
           GC GG++ D F +II + G+  E  YPY  ++G CN      K   I +Y++VP  +E A
Sbjct: 190 GCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPYNNEWA 249

Query: 237 LRYAVSRQPVSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKNS 296
           L+ AV+ QPVSVA+DA+   F++YS G+F GPCG  ++HAVTIVGYG+     YW++KNS
Sbjct: 250 LQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWIVKNS 309

Query: 297 WGQNWGEGGFIRMRRDVGGAGLCGIARKASYPI 329
           W   WGE G++R+ R+VGGAG CGIA   SYP+
Sbjct: 310 WDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
          Length = 380

 Score =  298 bits (762), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 148/333 (44%), Positives = 218/333 (65%), Gaps = 12/333 (3%)

Query: 1   MLIIMVTWASLVMSRTLHEDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKF 60
           +LI+ + + +  +++  + D + A +E W+ +  ++Y +  E   RF+IFK+  RFI++ 
Sbjct: 18  LLILSLAFNAKNLTQRTN-DEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEH 76

Query: 61  NREGNQTYKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPR 120
           N + N++YK+ LN+FADLTDEEF +++       R  S  +++  +N +  P   + LP 
Sbjct: 77  NADTNRSYKVGLNQFADLTDEEFRSTYL------RFTSGSNKTKVSNRYE-PRVGQVLPS 129

Query: 121 SIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC---SGSR 177
            +DWR+ GAV  +K+QG CG CW FSA+A VEGI KI TG LISLSEQ+++DC     +R
Sbjct: 130 YVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTR 189

Query: 178 GCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVP-TSELA 236
           GC GG++ D F +II + G+  E  YPY  ++G CN      K   I +Y++VP  +E A
Sbjct: 190 GCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNNEWA 249

Query: 237 LRYAVSRQPVSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKNS 296
           L+ AV+ QPVSVA+DA+   F+ YS G+F GPCG  ++HAVTIVGYG+     YW++KNS
Sbjct: 250 LQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIVKNS 309

Query: 297 WGQNWGEGGFIRMRRDVGGAGLCGIARKASYPI 329
           W   WGE G++R+ R+VGGAG CGIA   SYP+
Sbjct: 310 WDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
          Length = 351

 Score =  296 bits (757), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 154/333 (46%), Positives = 206/333 (61%), Gaps = 14/333 (4%)

Query: 1   MLIIMVTWASL-VMSRTLHEDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEK 59
            L +   WAS    SR    D +  + E WMA+  R YK+  EK  RF+IFK N + IE 
Sbjct: 11  FLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIET 70

Query: 60  FNREGNQTYKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSR-RGL 118
           FN     +Y L +N+F D+T  EF+A +TG  +P  NI  +          + D     +
Sbjct: 71  FNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPL-NIEREPV------VSFDDVNISAV 123

Query: 119 PRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCSGSRG 178
           P+SIDWR  GAV  VKNQ  CG CW F+A+A VEGI KI+TG L+SLSEQ+VLDC+ S G
Sbjct: 124 PQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSYG 183

Query: 179 CYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDV-PTSELAL 237
           C GGW++ A+ +II + G+T E  YPY   +G CN       +A I  Y  V    E ++
Sbjct: 184 CKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCN-ANSFPNSAYITGYSYVRRNDERSM 242

Query: 238 RYAVSRQPVSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGP-YWLIKNS 296
            YAVS QP++  IDAS   F+YY+GGVF+GPCG +LNHA+TI+GYG  + G  YW+++NS
Sbjct: 243 MYAVSNQPIAALIDASE-NFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNS 301

Query: 297 WGQNWGEGGFIRMRRDV-GGAGLCGIARKASYP 328
           WG +WGEGG++RM R V   +G+CGIA    +P
Sbjct: 302 WGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
           SV=2
          Length = 356

 Score =  293 bits (750), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 151/313 (48%), Positives = 203/313 (64%), Gaps = 9/313 (2%)

Query: 20  DSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFADLT 79
           D +    E W++   + Y+   EK +RF++FK N + I++ N++G ++Y L LNEFADL+
Sbjct: 45  DKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFADLS 103

Query: 80  DEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQGSC 139
            EEF   + G K       ++ +SYA   F Y D    +P+S+DWR +GAV  VKNQGSC
Sbjct: 104 HEEFKKMYLGLKTDIVR-RDEERSYAE--FAYRDVE-AVPKSVDWRKKGAVAEVKNQGSC 159

Query: 140 GCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCSGS--RGCYGGWMDDAFSYIIRSQGL 197
           G CW FS VAAVEGI KI TG L +LSEQ+++DC  +   GC GG MD AF YI+++ GL
Sbjct: 160 GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGL 219

Query: 198 TDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPTS-ELALRYAVSRQPVSVAIDASSPG 256
             E  YPY   EG C  Q+   +   I  +QDVPT+ E +L  A++ QP+SVAIDAS   
Sbjct: 220 RKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGRE 279

Query: 257 FRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMRRDVGG- 315
           F++YSGGVF G CG +L+H V  VGYGSS    Y ++KNSWG  WGE G+IR++R+ G  
Sbjct: 280 FQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGYIRLKRNTGKP 339

Query: 316 AGLCGIARKASYP 328
            GLCGI + AS+P
Sbjct: 340 EGLCGINKMASFP 352


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
           SV=1
          Length = 355

 Score =  292 bits (748), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 156/313 (49%), Positives = 199/313 (63%), Gaps = 10/313 (3%)

Query: 20  DSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFADLT 79
           D +    E WM++ ++ YK+  EK  RF++F++N   I++ N E N +Y L LNEFADLT
Sbjct: 45  DKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLT 103

Query: 80  DEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQGSC 139
            EEF   + G   P    S + Q  AN  F Y D    LP+S+DWR +GAV PVK+QG C
Sbjct: 104 HEEFKGRYLGLAKP--QFSRKRQPSAN--FRYRDIT-DLPKSVDWRKKGAVAPVKDQGQC 158

Query: 140 GCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCSGS--RGCYGGWMDDAFSYIIRSQGL 197
           G CW FS VAAVEGI +I TG L SLSEQ+++DC  +   GC GG MD AF YII + GL
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218

Query: 198 TDERVYPYQRREGYCNWQRGAMKAARIRSYQDVP-TSELALRYAVSRQPVSVAIDASSPG 256
             E  YPY   EG C  Q+  ++   I  Y+DVP   + +L  A++ QPVSVAI+AS   
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278

Query: 257 FRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMRRDVGGA 316
           F++Y GGVF G CG +L+H V  VGYGSS    Y ++KNSWG  WGE GFIRM+R+ G  
Sbjct: 279 FQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKP 338

Query: 317 -GLCGIARKASYP 328
            GLCGI + ASYP
Sbjct: 339 EGLCGINKMASYP 351


>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
          Length = 345

 Score =  291 bits (746), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 152/333 (45%), Positives = 205/333 (61%), Gaps = 14/333 (4%)

Query: 1   MLIIMVTWASL-VMSRTLHEDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEK 59
            L + V WAS    S     D +  + E WMA+  R YK+  EK +RF+IFK N   IE 
Sbjct: 11  FLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIET 70

Query: 60  FNREGNQTYKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPD-SRRGL 118
           FN     +Y L +N+F D+T+ EF+A +TG  +P  NI  +          + D     +
Sbjct: 71  FNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPL-NIKREPV------VSFDDVDISSV 123

Query: 119 PRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCSGSRG 178
           P+SIDWR  GAVT VKNQG CG CW F+++A VE I KI+ G L+SLSEQQVLDC+ S G
Sbjct: 124 PQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAVSYG 183

Query: 179 CYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVP-TSELAL 237
           C GGW++ A+S+II ++G+    +YPY+  +G C    G   +A I  Y  V   +E  +
Sbjct: 184 CKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCK-TNGVPNSAYITRYTYVQRNNERNM 242

Query: 238 RYAVSRQPVSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEG-PYWLIKNS 296
            YAVS QP++ A+DAS   F++Y  GVF GPCG  LNHA+ I+GYG  + G  +W+++NS
Sbjct: 243 MYAVSNQPIAAALDASG-NFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNS 301

Query: 297 WGQNWGEGGFIRMRRDVGGA-GLCGIARKASYP 328
           WG  WGEGG+IR+ RDV  + GLCGIA    YP
Sbjct: 302 WGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334


>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
           GN=CEP1 PE=2 SV=1
          Length = 361

 Score =  287 bits (734), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 153/339 (45%), Positives = 211/339 (62%), Gaps = 17/339 (5%)

Query: 1   MLIIMVTWASLVMSRTLHEDSISAKHELW-MAQSARTYKNQA----EKAMRFKIFKKNFR 55
           ML+++ T   L      H   + +++ LW + +  R++   A    EKA RF +FK N +
Sbjct: 11  MLMVLETTKGL----DFHNKDVESENSLWELYERWRSHHTVARSLEEKAKRFNVFKHNVK 66

Query: 56  FIEKFNREGNQTYKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSR 115
            I + N++ +++YKL LN+F D+T EEF  ++ G  +    +  Q +  A   F Y +  
Sbjct: 67  HIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMF-QGEKKATKSFMYANVN 124

Query: 116 RGLPRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC-- 173
             LP S+DWR  GAVTPVKNQG CG CW FS V AVEGI +IRT +L SLSEQ+++DC  
Sbjct: 125 T-LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDT 183

Query: 174 SGSRGCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVP-T 232
           + ++GC GG MD AF +I    GLT E VYPY+  +  C+  +       I  ++DVP  
Sbjct: 184 NQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKN 243

Query: 233 SELALRYAVSRQPVSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEG-PYW 291
           SE  L  AV+ QPVSVAIDA    F++YS GVF G CG  LNH V +VGYG++ +G  YW
Sbjct: 244 SEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYW 303

Query: 292 LIKNSWGQNWGEGGFIRMRRDV-GGAGLCGIARKASYPI 329
           ++KNSWG+ WGE G+IRM+R +    GLCGIA +ASYP+
Sbjct: 304 IVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342


>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
          Length = 360

 Score =  283 bits (723), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 152/344 (44%), Positives = 217/344 (63%), Gaps = 22/344 (6%)

Query: 1   MLIIMVTWASLVMSRTLHEDSISAKHELW-MAQSARTYKNQA----EKAMRFKIFKKNFR 55
           + ++ +++ S+  S    E  ++++  LW + +  RT+   A    EK  RF +FK+N +
Sbjct: 9   LALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVARDLDEKNRRFNVFKENVK 68

Query: 56  FIEKFNREGNQTYKLSLNEFADLTDEEFIASHTGYKM----PTRNISNQSQSYANNWFGY 111
           FI +FN++ +  YKL+LN+F D+T++EF + + G K+      R I   + S+     G 
Sbjct: 69  FIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYENVG- 127

Query: 112 PDSRRGLPR-SIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQV 170
                 LP  SIDWRA+GAVT VK+QG CG CW FS +A+VEGI +I+TG L+SLSEQ++
Sbjct: 128 -----SLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQEL 182

Query: 171 LDCSGS--RGCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQ 228
           +DC  S   GC GG MD AF + I+  G+T E  YPY  ++G C           I  +Q
Sbjct: 183 VDCDTSYNEGCNGGLMDYAFEF-IQKNGITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQ 241

Query: 229 DVP-TSELALRYAVSRQPVSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNE 287
           DVP  +E AL  AV+ QP+SV+I+AS  GF++YS GVF G CG  L+H V IVGYG++ +
Sbjct: 242 DVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGATRD 301

Query: 288 G-PYWLIKNSWGQNWGEGGFIRMRRDVGGA-GLCGIARKASYPI 329
           G  YW++KNSWG+ WGE G+IRM+R +    G CGIA +ASYPI
Sbjct: 302 GTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI 345


>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
           SV=1
          Length = 462

 Score =  279 bits (713), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 143/319 (44%), Positives = 204/319 (63%), Gaps = 17/319 (5%)

Query: 19  EDSISAKHELWMAQ--SARTYKNQAEKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFA 76
           E  + + +E W+ +   A++  +  EK  RF+IFK N RF+++ N E N +Y+L L  FA
Sbjct: 43  EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFA 101

Query: 77  DLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRG--LPRSIDWRARGAVTPVK 134
           DLT++E+ + + G KM  +     S  Y        ++R G  LP SIDWR +GAV  VK
Sbjct: 102 DLTNDEYRSKYLGAKMEKKGERRTSLRY--------EARVGDELPESIDWRKKGAVAEVK 153

Query: 135 NQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCSGS--RGCYGGWMDDAFSYII 192
           +QG CG CW FS + AVEGI +I TG LI+LSEQ+++DC  S   GC GG MD AF +II
Sbjct: 154 DQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFII 213

Query: 193 RSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SELALRYAVSRQPVSVAID 251
           ++ G+  ++ YPY+  +G C+  R   K   I SY+DVPT SE +L+ AV+ QP+S+AI+
Sbjct: 214 KNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIE 273

Query: 252 ASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMRR 311
           A    F+ Y  G+F G CG  L+H V  VGYG+ N   YW+++NSWG++WGE G++RM R
Sbjct: 274 AGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMAR 333

Query: 312 DVG-GAGLCGIARKASYPI 329
           ++   +G CGIA + SYPI
Sbjct: 334 NIASSSGKCGIAIEPSYPI 352


>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
           GN=GCP1 PE=2 SV=2
          Length = 376

 Score =  276 bits (707), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 148/315 (46%), Positives = 199/315 (63%), Gaps = 19/315 (6%)

Query: 29  WMAQSARTYKNQA----EKAMRFKIFKKNFRFIEKFNREG-NQTYKLSLNEFADLTDEEF 83
           W A+  +T  N      ++  RF IFK N RFI+  N +  N TYKL L +F DLT++E+
Sbjct: 52  WSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDEY 111

Query: 84  IASHTGYKM-PTRNIS---NQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQGSC 139
              + G +  P R I+   N +Q Y+    G     + +P ++DWR +GAV P+K+QG+C
Sbjct: 112 RKLYLGARTEPARRIAKAKNVNQKYSAAVNG-----KEVPETVDWRQKGAVNPIKDQGTC 166

Query: 140 GCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCSGS--RGCYGGWMDDAFSYIIRSQGL 197
           G CW FS  AAVEGI KI TG LISLSEQ+++DC  S  +GC GG MD AF +I+++ GL
Sbjct: 167 GSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGL 226

Query: 198 TDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SELALRYAVSRQPVSVAIDASSPG 256
             E+ YPY+   G CN      +   I  Y+DVPT  E AL+ A+S QPVSVAI+A    
Sbjct: 227 NTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRI 286

Query: 257 FRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMRRDVGG- 315
           F++Y  G+F G CG NL+HAV  VGYGS N   YW+++NSWG  WGE G+IRM R++   
Sbjct: 287 FQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMERNLAAS 346

Query: 316 -AGLCGIARKASYPI 329
            +G CGIA +ASYP+
Sbjct: 347 KSGKCGIAVEASYPV 361


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
           PE=1 SV=2
          Length = 458

 Score =  273 bits (698), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 139/310 (44%), Positives = 194/310 (62%), Gaps = 17/310 (5%)

Query: 29  WMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFN---REGNQTYKLSLNEFADLTDEEFIA 85
           W A+  ++Y    E+  R+  F+ N R+I++ N     G  +++L LN FADLT+EE+  
Sbjct: 43  WKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRD 102

Query: 86  SHTGYKMPTRNISNQSQSY--ANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQGSCGCCW 143
           ++ G +   R     S  Y  A+N          LP S+DWR +GAV  +K+QG CG CW
Sbjct: 103 TYLGLRNKPRRERKVSDRYLAADN--------EALPESVDWRTKGAVAEIKDQGGCGSCW 154

Query: 144 IFSAVAAVEGITKIRTGRLISLSEQQVLDCSGS--RGCYGGWMDDAFSYIIRSQGLTDER 201
            FSA+AAVEGI +I TG LISLSEQ+++DC  S   GC GG MD AF +II + G+  E 
Sbjct: 155 AFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDTED 214

Query: 202 VYPYQRREGYCNWQRGAMKAARIRSYQDV-PTSELALRYAVSRQPVSVAIDASSPGFRYY 260
            YPY+ ++  C+  R   K   I SY+DV P SE +L+ AV+ QPVSVAI+A    F+ Y
Sbjct: 215 DYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLY 274

Query: 261 SGGVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMRRDV-GGAGLC 319
           S G+F G CG  L+H V  VGYG+ N   YW+++NSWG++WGE G++RM R++   +G C
Sbjct: 275 SSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKC 334

Query: 320 GIARKASYPI 329
           GIA + SYP+
Sbjct: 335 GIAVEPSYPL 344


>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
          Length = 328

 Score =  273 bits (698), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 140/290 (48%), Positives = 185/290 (63%), Gaps = 8/290 (2%)

Query: 46  RFKIFKKNFRFIEKFNREG-NQTYKLSLNEFADLTDEEFIASHTGYKM-PTRNISNQSQS 103
           RF IFK N RFI+  N    N TYKL L  FA+LT++E+ + + G +  P R I+     
Sbjct: 28  RFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITKAKN- 86

Query: 104 YANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLI 163
             N  +    +   +P ++DWR +GAV  +K+QG+CG CW FS  AAVEGI KI TG L+
Sbjct: 87  -VNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGELV 145

Query: 164 SLSEQQVLDCSGS--RGCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKA 221
           SLSEQ+++DC  S  +GC GG MD AF +I+++ GL  E+ YPY    G CN      + 
Sbjct: 146 SLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNSLLKNSRV 205

Query: 222 ARIRSYQDVPT-SELALRYAVSRQPVSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVTIV 280
             I  Y+DVP+  E AL+ AVS QPVSVAIDA    F++Y  G+F G CG N++HAV  V
Sbjct: 206 VTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGKCGTNMDHAVVAV 265

Query: 281 GYGSSNEGPYWLIKNSWGQNWGEGGFIRMRRDVGG-AGLCGIARKASYPI 329
           GYGS N   YW+++NSWG  WGE G+IRM R+V   +G CGIA +ASYP+
Sbjct: 266 GYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVASKSGKCGIAIEASYPV 315


>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
           GN=CEP2 PE=2 SV=1
          Length = 361

 Score =  273 bits (697), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 138/315 (43%), Positives = 197/315 (62%), Gaps = 6/315 (1%)

Query: 19  EDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFADL 78
           E+ +S  ++ W +  +   ++  E+  RF +F+ N   +   N++ N++YKL LN+FADL
Sbjct: 31  EEGLSTLYDRWRSHHS-VPRSLNEREKRFNVFRHNVMHVHNTNKK-NRSYKLKLNKFADL 88

Query: 79  TDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQGS 138
           T  EF  ++TG  +    +    +  +  +    ++   LP S+DWR +GAVT +KNQG 
Sbjct: 89  TINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGK 148

Query: 139 CGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCSGSR--GCYGGWMDDAFSYIIRSQG 196
           CG CW FS VAAVEGI KI+T +L+SLSEQ+++DC   +  GC GG M+ AF +I ++ G
Sbjct: 149 CGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTKQNEGCNGGLMEIAFEFIKKNGG 208

Query: 197 LTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVP-TSELALRYAVSRQPVSVAIDASSP 255
           +T E  YPY+  +G C+  +       I  ++DVP   E AL  AV+ QPVSVAIDA S 
Sbjct: 209 ITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSS 268

Query: 256 GFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMRRDVGG 315
            F++YS GVF G CG  LNH V  VGYGS     YW+++NSWG  WGEGG+I++ R++  
Sbjct: 269 DFQFYSEGVFTGSCGTELNHGVAAVGYGSERGKKYWIVRNSWGAEWGEGGYIKIEREIDE 328

Query: 316 -AGLCGIARKASYPI 329
             G CGIA +ASYPI
Sbjct: 329 PEGRCGIAMEASYPI 343


>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
          Length = 373

 Score =  272 bits (696), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 149/322 (46%), Positives = 195/322 (60%), Gaps = 18/322 (5%)

Query: 19  EDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFADL 78
           E+++   +E W + + R  ++ AEK  RF  FK N  FI   N+ G+  Y+L LN F D+
Sbjct: 39  EEALWDLYERWQS-AHRVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97

Query: 79  TDEEFIASHTG---YKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKN 135
              EF A+  G      P++  S     YA       D    LP S+DWR +GAVT VK+
Sbjct: 98  DQAEFRATFVGDLRRDTPSKPPSVPGFMYAA--LNVSD----LPPSVDWRQKGAVTGVKD 151

Query: 136 QGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC--SGSRGCYGGWMDDAFSYIIR 193
           QG CG CW FS V +VEGI  IRTG L+SLSEQ+++DC  + + GC GG MD+AF YI  
Sbjct: 152 QGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKN 211

Query: 194 SQGLTDERVYPYQRREGYCNWQRGAMKA---ARIRSYQDVP-TSELALRYAVSRQPVSVA 249
           + GL  E  YPY+   G CN  R A  +     I  +QDVP  SE  L  AV+ QPVSVA
Sbjct: 212 NGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVA 271

Query: 250 IDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEG-PYWLIKNSWGQNWGEGGFIR 308
           ++AS   F +YS GVF G CG  L+H V +VGYG + +G  YW +KNSWG +WGE G+IR
Sbjct: 272 VEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIR 331

Query: 309 MRRDVGGA-GLCGIARKASYPI 329
           + +D G + GLCGIA +ASYP+
Sbjct: 332 VEKDSGASGGLCGIAMEASYPV 353


>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
          Length = 362

 Score =  272 bits (696), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 143/326 (43%), Positives = 207/326 (63%), Gaps = 13/326 (3%)

Query: 14  SRTLHEDSISAKHELW-MAQSARTY----KNQAEKAMRFKIFKKNFRFIEKFNREGNQTY 68
           S   HE  + ++  LW + +  R++    ++  EK  RF +FK N   +   N+  ++ Y
Sbjct: 22  SFDFHEKDLESEESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANVMHVHNTNKM-DKPY 80

Query: 69  KLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARG 128
           KL LN+FAD+T+ EF +++ G K+    +   SQ + +  F Y +    +P S+DWR +G
Sbjct: 81  KLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQ-HGSGTFMY-EKVGSVPASVDWRKKG 138

Query: 129 AVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCSG--SRGCYGGWMDD 186
           AVT VK+QG CG CW FS + AVEGI +I+T +L+SLSEQ+++DC    ++GC GG M+ 
Sbjct: 139 AVTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMES 198

Query: 187 AFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPTS-ELALRYAVSRQP 245
           AF +I +  G+T E  YPY  +EG C+  +    A  I  +++VP + E AL  AV+ QP
Sbjct: 199 AFEFIKQKGGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQP 258

Query: 246 VSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGP-YWLIKNSWGQNWGEG 304
           VSVAIDA    F++YS GVF G C  +LNH V IVGYG++ +G  YW+++NSWG  WGE 
Sbjct: 259 VSVAIDAGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQ 318

Query: 305 GFIRMRRDVG-GAGLCGIARKASYPI 329
           G+IRM+R++    GLCGIA  ASYPI
Sbjct: 319 GYIRMQRNISKKEGLCGIAMMASYPI 344


>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
          Length = 371

 Score =  271 bits (694), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 149/322 (46%), Positives = 194/322 (60%), Gaps = 18/322 (5%)

Query: 19  EDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFADL 78
           E+++   +E W + + R  ++ AEK  RF  FK N  FI   N+ G+  Y+L LN F D+
Sbjct: 39  EEALWDLYERWQS-AHRVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97

Query: 79  TDEEFIASHTG---YKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKN 135
              EF A+  G      P +  S     YA       D    LP S+DWR +GAVT VK+
Sbjct: 98  DQAEFRATFVGDLRRDTPAKPPSVPGFMYAA--LNVSD----LPPSVDWRQKGAVTGVKD 151

Query: 136 QGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC--SGSRGCYGGWMDDAFSYIIR 193
           QG CG CW FS V +VEGI  IRTG L+SLSEQ+++DC  + + GC GG MD+AF YI  
Sbjct: 152 QGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKN 211

Query: 194 SQGLTDERVYPYQRREGYCNWQRGAMKA---ARIRSYQDVP-TSELALRYAVSRQPVSVA 249
           + GL  E  YPY+   G CN  R A  +     I  +QDVP  SE  L  AV+ QPVSVA
Sbjct: 212 NGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVA 271

Query: 250 IDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEG-PYWLIKNSWGQNWGEGGFIR 308
           ++AS   F +YS GVF G CG  L+H V +VGYG + +G  YW +KNSWG +WGE G+IR
Sbjct: 272 VEASGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIR 331

Query: 309 MRRDVGGA-GLCGIARKASYPI 329
           + +D G + GLCGIA +ASYP+
Sbjct: 332 VEKDSGASGGLCGIAMEASYPV 353


>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
           GN=At3g19400 PE=2 SV=1
          Length = 362

 Score =  269 bits (688), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 138/318 (43%), Positives = 200/318 (62%), Gaps = 13/318 (4%)

Query: 18  HEDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFAD 77
           +E  +   +E W+ ++ + Y    EK  RFKIFK N +F+++ N   ++T+++ L  FAD
Sbjct: 36  NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95

Query: 78  LTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQG 137
           LT+EEF A +   KM     S +++ Y      Y +    LP  +DWRA GAV  VK+QG
Sbjct: 96  LTNEEFRAIYLRKKMERTKDSVKTERYL-----YKEGDV-LPDEVDWRANGAVVSVKDQG 149

Query: 138 SCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCSG---SRGCYGGWMDDAFSYIIRS 194
           +CG CW FSAV AVEGI +I TG LISLSEQ+++DC     + GC GG M+ AF +I+++
Sbjct: 150 NCGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKN 209

Query: 195 QGLTDERVYPYQRRE-GYCNWQRGA-MKAARIRSYQDVP-TSELALRYAVSRQPVSVAID 251
            G+  ++ YPY   + G CN  +    +   I  Y+DVP   E +L+ AV+ QPVSVAI+
Sbjct: 210 GGIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIE 269

Query: 252 ASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMRR 311
           ASS  F+ Y  GV  G CG +L+H V +VGYGS++   YW+I+NSWG NWG+ G+++++R
Sbjct: 270 ASSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQR 329

Query: 312 DVGGA-GLCGIARKASYP 328
           ++    G CGIA   SYP
Sbjct: 330 NIDDPFGKCGIAMMPSYP 347


>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
          Length = 362

 Score =  268 bits (686), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 145/342 (42%), Positives = 215/342 (62%), Gaps = 17/342 (4%)

Query: 2   LIIMVTWASLVM----SRTLHEDSISAKHELW-MAQSARTY----KNQAEKAMRFKIFKK 52
           L+ +V   SLV+    S   H+  ++++  LW + +  R++    ++  EK  RF +FK 
Sbjct: 6   LLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVSRSLGEKHKRFNVFKA 65

Query: 53  NFRFIEKFNREGNQTYKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYP 112
           N   +   N+  ++ YKL LN+FAD+T+ EF +++ G K+    +  +   + N  F Y 
Sbjct: 66  NLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMF-RGTPHENGAFMY- 122

Query: 113 DSRRGLPRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLD 172
           +    +P S+DWR +GAVT VK+QG CG CW FS V AVEGI +I+T +L++LSEQ+++D
Sbjct: 123 EKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVD 182

Query: 173 CSG--SRGCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDV 230
           C    ++GC GG M+ AF +I +  G+T E  YPY+ +EG C+  +    A  I  +++V
Sbjct: 183 CDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHENV 242

Query: 231 PTS-ELALRYAVSRQPVSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGP 289
           P + E AL  AV+ QPVSVAIDA    F++YS GVF G C  +LNH V IVGYG++ +G 
Sbjct: 243 PANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGT 302

Query: 290 -YWLIKNSWGQNWGEGGFIRMRRDVG-GAGLCGIARKASYPI 329
            YW+++NSWG  WGE G+IRM+R++    GLCGIA   SYPI
Sbjct: 303 NYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPI 344


>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
           PE=1 SV=2
          Length = 466

 Score =  266 bits (679), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 138/319 (43%), Positives = 196/319 (61%), Gaps = 16/319 (5%)

Query: 19  EDSISAKHELWMAQSARTYKNQ--AEKAMRFKIFKKNFRFIEKFNREGNQT--YKLSLNE 74
           E    A ++LW+A++     N    E   RF +F  N +F++  N   ++   ++L +N 
Sbjct: 45  EAEARAAYDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNR 104

Query: 75  FADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVK 134
           FADLT+EEF A+  G K+  R+ +   + Y +      D    LP S+DWR +GAV PVK
Sbjct: 105 FADLTNEEFRATFLGAKVAERSRA-AGERYRH------DGVEELPESVDWREKGAVAPVK 157

Query: 135 NQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCSGS---RGCYGGWMDDAFSYI 191
           NQG CG CW FSAV+ VE I ++ TG +I+LSEQ++++CS +    GC GG MDDAF +I
Sbjct: 158 NQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFI 217

Query: 192 IRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVP-TSELALRYAVSRQPVSVAI 250
           I++ G+  E  YPY+  +G C+  R   K   I  ++DVP   E +L+ AV+ QPVSVAI
Sbjct: 218 IKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAI 277

Query: 251 DASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMR 310
           +A    F+ Y  GVF+G CG +L+H V  VGYG+ N   YW+++NSWG  WGE G++RM 
Sbjct: 278 EAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRME 337

Query: 311 RDVG-GAGLCGIARKASYP 328
           R++    G CGIA  ASYP
Sbjct: 338 RNINVTTGKCGIAMMASYP 356


>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
           SV=2
          Length = 490

 Score =  264 bits (675), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 147/327 (44%), Positives = 196/327 (59%), Gaps = 23/327 (7%)

Query: 19  EDSISAKHELWMAQSARTYKNQ------AEKAMRFKIFKKNFRFIEKFNREGNQT--YKL 70
           E    A ++LW+A+  R            E   RF++F  N +F++  N   ++   ++L
Sbjct: 55  EAEARAAYDLWLARHRRGGGGGSRNGFIGEHERRFRVFWDNLKFVDAHNARADERGGFRL 114

Query: 71  SLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAV 130
            +N FADLT+ EF A++ G   P        ++Y +      D    LP S+DWR +GAV
Sbjct: 115 GMNRFADLTNGEFRATYLG-TTPAGRGRRVGEAYRH------DGVEALPDSVDWRDKGAV 167

Query: 131 T-PVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCS---GSRGCYGGWMDD 186
             PVKNQG CG CW FSAVAAVEGI KI TG L+SLSEQ++++C+    + GC GG MDD
Sbjct: 168 VAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDD 227

Query: 187 AFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVP-TSELALRYAVSRQP 245
           AF++I R+ GL  E  YPY   +G CN  + + K   I  ++DVP   EL+L+ AV+ QP
Sbjct: 228 AFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSRKVVSIDGFEDVPENDELSLQKAVAHQP 287

Query: 246 VSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGS--SNEGPYWLIKNSWGQNWGE 303
           VSVAIDA    F+ Y  GVF G CG NL+H V  VGYG+  +    YW ++NSWG +WGE
Sbjct: 288 VSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGE 347

Query: 304 GGFIRMRRDVGG-AGLCGIARKASYPI 329
            G+IRM R+V    G CGIA  ASYPI
Sbjct: 348 NGYIRMERNVTARTGKCGIAMMASYPI 374


>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
          Length = 360

 Score =  262 bits (670), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 137/293 (46%), Positives = 187/293 (63%), Gaps = 8/293 (2%)

Query: 42  EKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFADLTDEEFIASHTGYKMPTRNISNQS 101
           EK  RF +FK N   +   N+  ++ YKL LN+FAD+T+ EF  +++G K+    +  + 
Sbjct: 53  EKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFRNTYSGSKVKHHRMF-RG 110

Query: 102 QSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGR 161
               N  F Y +    +P S+DWR +GAVT VK+QG CG CW FS + AVEGI +I+T +
Sbjct: 111 GPRGNGTFMY-EKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNK 169

Query: 162 LISLSEQQVLDCSG--SRGCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAM 219
           L+SLSEQ+++DC    ++GC GG MD AF +I +  G+T E  YPY+  +G C+  +   
Sbjct: 170 LVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENA 229

Query: 220 KAARIRSYQDVP-TSELALRYAVSRQPVSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVT 278
            A  I  +++VP   E AL  AV+ QPVSVAIDA    F++YS GVF G CG  L+H V 
Sbjct: 230 PAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVA 289

Query: 279 IVGYGSSNEG-PYWLIKNSWGQNWGEGGFIRMRRDVGG-AGLCGIARKASYPI 329
           IVGYG++ +G  YW +KNSWG  WGE G+IRM R +    GLCGIA +ASYPI
Sbjct: 290 IVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPI 342


>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
           GN=At4g11320 PE=2 SV=1
          Length = 371

 Score =  253 bits (645), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 135/308 (43%), Positives = 186/308 (60%), Gaps = 10/308 (3%)

Query: 27  ELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFADLTDEEF-IA 85
           E WM +  + Y + AEK  R  IF+ N RFI   N E N +Y+L LN FADL+  E+   
Sbjct: 57  ESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEI 115

Query: 86  SHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQGSCGCCWIF 145
            H     P RN    + S   N +   D    LP+S+DWR  GAVT VK+QG C  CW F
Sbjct: 116 CHGADPRPPRNHVFMTSS---NRYKTSDGDV-LPKSVDWRNEGAVTEVKDQGLCRSCWAF 171

Query: 146 SAVAAVEGITKIRTGRLISLSEQQVLDCSG-SRGCYGGWMDDAFSYIIRSQGLTDERVYP 204
           S V AVEG+ KI TG L++LSEQ +++C+  + GC GG ++ A+ +I+ + GL  +  YP
Sbjct: 172 STVGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYP 231

Query: 205 YQRREGYCNWQ-RGAMKAARIRSYQDVPTS-ELALRYAVSRQPVSVAIDASSPGFRYYSG 262
           Y+   G C  + +   K   I  Y+++P + E AL  AV+ QPV+  +D+SS  F+ Y  
Sbjct: 232 YKALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYES 291

Query: 263 GVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMRRDVGGA-GLCGI 321
           GVF G CG NLNH V +VGYG+ N   YW++KNS G  WGE G+++M R++    GLCGI
Sbjct: 292 GVFDGTCGTNLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGI 351

Query: 322 ARKASYPI 329
           A +ASYP+
Sbjct: 352 AMRASYPL 359


>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
           GN=CEP3 PE=2 SV=1
          Length = 364

 Score =  252 bits (643), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 138/342 (40%), Positives = 199/342 (58%), Gaps = 17/342 (4%)

Query: 1   MLIIMVTWASLVMSRT---LHEDSISAKHELW-MAQSARTYKNQA----EKAMRFKIFKK 52
             I+++++ SL+ +       E  +  +  +W + +  R + + +    E   RF +F+ 
Sbjct: 4   FFIVLISFLSLLQASKGFDFDEKELETEENVWKLYERWRGHHSVSRASHEAIKRFNVFRH 63

Query: 53  NFRFIEKFNREGNQTYKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYP 112
           N   + + N++ N+ YKL +N FAD+T  EF +S+ G  +    +  +     +  F Y 
Sbjct: 64  NVLHVHRTNKK-NKPYKLKINRFADITHHEFRSSYAGSNVKHHRML-RGPKRGSGGFMYE 121

Query: 113 DSRRGLPRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLD 172
           +  R +P S+DWR +GAVT VKNQ  CG CW FS VAAVEGI KIRT +L+SLSEQ+++D
Sbjct: 122 NVTR-VPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVD 180

Query: 173 CSG--SRGCYGGWMDDAFSYIIRSQGLTDERVYPYQRRE-GYCNWQRGAMKAARIRSYQD 229
           C    ++GC GG M+ AF +I  + G+  E  YPY   +  +C       +   I  ++ 
Sbjct: 181 CDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEH 240

Query: 230 VP-TSELALRYAVSRQPVSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEG 288
           VP   E  L  AV+ QPVSVAIDA S  F+ YS GVF G CG  LNH V IVGYG +  G
Sbjct: 241 VPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNG 300

Query: 289 -PYWLIKNSWGQNWGEGGFIRMRRDVG-GAGLCGIARKASYP 328
             YW+++NSWG  WGEGG++R+ R +    G CGIA +ASYP
Sbjct: 301 TKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYP 342


>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
           GN=At4g11310 PE=2 SV=1
          Length = 364

 Score =  251 bits (641), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 135/310 (43%), Positives = 186/310 (60%), Gaps = 14/310 (4%)

Query: 27  ELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFADLTDEEFI-A 85
           E WM +  + Y + AEK  R  IF+ N RFI   N E N +Y+L L  FADL+  E+   
Sbjct: 50  ESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEV 108

Query: 86  SHTGYKMPTRN--ISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQGSCGCCW 143
            H     P RN      S  Y  +      +   LP+S+DWR  GAVT VK+QG C  CW
Sbjct: 109 CHGADPRPPRNHVFMTSSDRYKTS------ADDVLPKSVDWRNEGAVTEVKDQGHCRSCW 162

Query: 144 IFSAVAAVEGITKIRTGRLISLSEQQVLDCSG-SRGCYGGWMDDAFSYIIRSQGLTDERV 202
            FS V AVEG+ KI TG L++LSEQ +++C+  + GC GG ++ A+ +I+++ GL  +  
Sbjct: 163 AFSTVGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDND 222

Query: 203 YPYQRREGYCNWQ-RGAMKAARIRSYQDVPTS-ELALRYAVSRQPVSVAIDASSPGFRYY 260
           YPY+   G C+ + +   K   I  Y+++P + E AL  AV+ QPV+  ID+SS  F+ Y
Sbjct: 223 YPYKAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLY 282

Query: 261 SGGVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMRRDVGGA-GLC 319
             GVF G CG NLNH V +VGYG+ N   YWL+KNS G  WGE G+++M R++    GLC
Sbjct: 283 ESGVFDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLC 342

Query: 320 GIARKASYPI 329
           GIA +ASYP+
Sbjct: 343 GIAMRASYPL 352


>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
          Length = 329

 Score =  251 bits (640), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 142/326 (43%), Positives = 199/326 (61%), Gaps = 18/326 (5%)

Query: 12  VMSRTLHEDSISAKH-ELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNRE---GNQT 67
           VMS  L+ + I   H ELW     + Y ++ ++  R  I++KN ++I   N E   G  T
Sbjct: 11  VMSFALYPEEILDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHT 70

Query: 68  YKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRAR 127
           Y+L++N   D+T+EE +   TG K+P       S S +N+    PD     P S+D+R +
Sbjct: 71  YELAMNHLGDMTNEEVVQKMTGLKVPA------SHSRSNDTLYIPDWEGRAPDSVDYRKK 124

Query: 128 GAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC-SGSRGCYGGWMDD 186
           G VTPVKNQG CG CW FS+V A+EG  K +TG+L++LS Q ++DC S + GC GG+M +
Sbjct: 125 GYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTN 184

Query: 187 AFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SELALRYAVSRQ- 244
           AF Y+ +++G+  E  YPY  +E  C +     KAA+ R Y+++P  +E AL+ AV+R  
Sbjct: 185 AFQYVQKNRGIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVG 243

Query: 245 PVSVAIDASSPGFRYYSGGVFAGPCGN--NLNHAVTIVGYGSSNEGPYWLIKNSWGQNWG 302
           PVSVAIDAS   F++YS GV+     N  NLNHAV  VGYG      +W+IKNSWG+NWG
Sbjct: 244 PVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG 303

Query: 303 EGGFIRMRRDVGGAGLCGIARKASYP 328
             G+I M R+   A  CGIA  AS+P
Sbjct: 304 NKGYILMARNKNNA--CGIANLASFP 327


>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
          Length = 329

 Score =  251 bits (640), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 142/326 (43%), Positives = 199/326 (61%), Gaps = 18/326 (5%)

Query: 12  VMSRTLHEDSISAKH-ELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNRE---GNQT 67
           VMS  L+ + I   H ELW     + Y ++ ++  R  I++KN ++I   N E   G  T
Sbjct: 11  VMSFALYPEEILDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHT 70

Query: 68  YKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRAR 127
           Y+L++N   D+T+EE +   TG K+P       S S +N+    PD     P S+D+R +
Sbjct: 71  YELAMNHLGDMTNEEVVQKMTGLKVPA------SHSRSNDTLYIPDWEGRAPDSVDYRKK 124

Query: 128 GAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC-SGSRGCYGGWMDD 186
           G VTPVKNQG CG CW FS+V A+EG  K +TG+L++LS Q ++DC S + GC GG+M +
Sbjct: 125 GYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTN 184

Query: 187 AFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SELALRYAVSRQ- 244
           AF Y+ +++G+  E  YPY  +E  C +     KAA+ R Y+++P  +E AL+ AV+R  
Sbjct: 185 AFQYVQKNRGIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVG 243

Query: 245 PVSVAIDASSPGFRYYSGGVFAGPCGN--NLNHAVTIVGYGSSNEGPYWLIKNSWGQNWG 302
           PVSVAIDAS   F++YS GV+     N  NLNHAV  VGYG      +W+IKNSWG+NWG
Sbjct: 244 PVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG 303

Query: 303 EGGFIRMRRDVGGAGLCGIARKASYP 328
             G+I M R+   A  CGIA  AS+P
Sbjct: 304 NKGYILMARNKNNA--CGIANLASFP 327


>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
          Length = 352

 Score =  251 bits (640), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 129/304 (42%), Positives = 187/304 (61%), Gaps = 9/304 (2%)

Query: 29  WMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFADLTDEEFIASHT 88
           WM +  + Y++  EK  RF+IF+ N  +I++ N++ N +Y L LN FADL+++EF   + 
Sbjct: 51  WMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYV 109

Query: 89  GYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQGSCGCCWIFSAV 148
           G+           + + N  F Y       P+SIDWRA+GAVTPVKNQG+CG CW FS +
Sbjct: 110 GF---VAEDFTGLEHFDNEDFTYKHVTN-YPQSIDWRAKGAVTPVKNQGACGSCWAFSTI 165

Query: 149 AAVEGITKIRTGRLISLSEQQVLDCSG-SRGCYGGWMDDAFSYIIRSQGLTDERVYPYQR 207
           A VEGI KI TG L+ LSEQ+++DC   S GC GG+   +  Y+  + G+   +VYPYQ 
Sbjct: 166 ATVEGINKIVTGNLLELSEQELVDCDKHSYGCKGGYQTTSLQYV-ANNGVHTSKVYPYQA 224

Query: 208 REGYCNWQRGAMKAARIRSYQDVPTS-ELALRYAVSRQPVSVAIDASSPGFRYYSGGVFA 266
           ++  C          +I  Y+ VP++ E +   A++ QP+SV ++A    F+ Y  GVF 
Sbjct: 225 KQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFD 284

Query: 267 GPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMRRDVGGA-GLCGIARKA 325
           GPCG  L+HAVT VGYG+S+   Y +IKNSWG NWGE G++R++R  G + G CG+ + +
Sbjct: 285 GPCGTKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSS 344

Query: 326 SYPI 329
            YP 
Sbjct: 345 YYPF 348


>sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1
          Length = 329

 Score =  248 bits (634), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 141/326 (43%), Positives = 198/326 (60%), Gaps = 18/326 (5%)

Query: 12  VMSRTLHEDSISAKH-ELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNRE---GNQT 67
           V+S  L+ + I   H ELW     + Y N+ ++  R  I++KN ++I   N E   G  T
Sbjct: 11  VVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHT 70

Query: 68  YKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRAR 127
           Y+L++N   D+T EE +   TG K+P       S S +N+    P+     P S+D+R +
Sbjct: 71  YELAMNHLGDMTSEEVVQKMTGLKVPL------SHSRSNDTLYIPEWEGRAPDSVDYRKK 124

Query: 128 GAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC-SGSRGCYGGWMDD 186
           G VTPVKNQG CG CW FS+V A+EG  K +TG+L++LS Q ++DC S + GC GG+M +
Sbjct: 125 GYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTN 184

Query: 187 AFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SELALRYAVSRQ- 244
           AF Y+ +++G+  E  YPY  +E  C +     KAA+ R Y+++P  +E AL+ AV+R  
Sbjct: 185 AFQYVQKNRGIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVG 243

Query: 245 PVSVAIDASSPGFRYYSGGVFAGPCGN--NLNHAVTIVGYGSSNEGPYWLIKNSWGQNWG 302
           PVSVAIDAS   F++YS GV+     N  NLNHAV  VGYG      +W+IKNSWG+NWG
Sbjct: 244 PVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG 303

Query: 303 EGGFIRMRRDVGGAGLCGIARKASYP 328
             G+I M R+   A  CGIA  AS+P
Sbjct: 304 NKGYILMARNKNNA--CGIANLASFP 327


>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
          Length = 345

 Score =  247 bits (631), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 133/306 (43%), Positives = 184/306 (60%), Gaps = 14/306 (4%)

Query: 27  ELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFADLTDEEFIAS 86
           E WM +  + YKN  EK  RF+IFK N ++I++ N++ N +Y L LN FAD++++EF   
Sbjct: 49  ESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKK-NNSYWLGLNVFADMSNDEFKEK 107

Query: 87  HTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQGSCGCCWIFS 146
           +TG        +  S     N     D    +P  +DWR +GAVTPVKNQGSCG CW FS
Sbjct: 108 YTGSIAGNYTTTELSYEEVLN-----DGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFS 162

Query: 147 AVAAVEGITKIRTGRLISLSEQQVLDCS-GSRGCYGGWMDDAFSYIIRSQGLTDERVYPY 205
           AV  +EGI KIRTG L   SEQ++LDC   S GC GG+   A   ++   G+     YPY
Sbjct: 163 AVVTIEGIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSALQ-LVAQYGIHYRNTYPY 221

Query: 206 QRREGYCNWQRGAMKAARIRSYQDV-PTSELALRYAVSRQPVSVAIDASSPGFRYYSGGV 264
           +  + YC  +     AA+    + V P +E AL Y+++ QPVSV ++A+   F+ Y GG+
Sbjct: 222 EGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGI 281

Query: 265 FAGPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMRRDVGGA-GLCGIAR 323
           F GPCGN ++HAV  VGYG +    Y LIKNSWG  WGE G+IR++R  G + G+CG+  
Sbjct: 282 FVGPCGNKVDHAVAAVGYGPN----YILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYT 337

Query: 324 KASYPI 329
            + YP+
Sbjct: 338 SSFYPV 343


>sp|P43236|CATK_RABIT Cathepsin K OS=Oryctolagus cuniculus GN=CTSK PE=1 SV=1
          Length = 329

 Score =  246 bits (627), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 139/326 (42%), Positives = 201/326 (61%), Gaps = 18/326 (5%)

Query: 12  VMSRTLH-EDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNRE---GNQT 67
           V+S  LH E+ +  + ELW    ++ Y ++ ++  R  I++KN + I   N E   G  T
Sbjct: 11  VVSFALHPEEILDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHT 70

Query: 68  YKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRAR 127
           Y+L++N   D+T EE +   TG K+P       S+S++N+    PD     P SID+R +
Sbjct: 71  YELAMNHLGDMTSEEVVQKMTGLKVPP------SRSHSNDTLYIPDWEGRTPDSIDYRKK 124

Query: 128 GAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC-SGSRGCYGGWMDD 186
           G VTPVKNQG CG CW FS+V A+EG  K +TG+L++LS Q ++DC S + GC GG+M +
Sbjct: 125 GYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENYGCGGGYMTN 184

Query: 187 AFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SELALRYAVSRQ- 244
           AF Y+ R++G+  E  YPY  ++  C +     KAA+ R Y+++P  +E AL+ AV+R  
Sbjct: 185 AFQYVQRNRGIDSEDAYPYVGQDESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVG 243

Query: 245 PVSVAIDASSPGFRYYSGGVF--AGPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWG 302
           PVSVAIDAS   F++YS GV+       +N+NHAV  VGYG      +W+IKNSWG++WG
Sbjct: 244 PVSVAIDASLTSFQFYSKGVYYDENCSSDNVNHAVLAVGYGIQKGNKHWIIKNSWGESWG 303

Query: 303 EGGFIRMRRDVGGAGLCGIARKASYP 328
             G+I M R+   A  CGIA  AS+P
Sbjct: 304 NKGYILMARNKNNA--CGIANLASFP 327


>sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1
          Length = 330

 Score =  245 bits (626), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 141/326 (43%), Positives = 198/326 (60%), Gaps = 18/326 (5%)

Query: 12  VMSRTLH-EDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNRE---GNQT 67
           VMS  L+ E+ +  + ELW     + Y ++ ++  R  I++KN + I   N E   G  T
Sbjct: 12  VMSSALYPEEILDTQWELWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHT 71

Query: 68  YKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRAR 127
           Y+L++N   D+T EE +   TG K+P       S S +N+    PD     P SID+R +
Sbjct: 72  YELAMNHLGDMTSEEVVQKMTGLKVPP------SHSRSNDTLYIPDWEGRTPDSIDYRKK 125

Query: 128 GAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC-SGSRGCYGGWMDD 186
           G VTPVKNQG CG CW FS+V A+EG  K +TG+L++LS Q ++DC S + GC GG+M +
Sbjct: 126 GYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTN 185

Query: 187 AFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SELALRYAVSRQ- 244
           AF Y+ +++G+  E  YPY  ++  C +     KAA+ R Y+++P  +E AL+ AV+R  
Sbjct: 186 AFQYVQKNRGIDSEDAYPYVGQDENCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVG 244

Query: 245 PVSVAIDASSPGFRYYSGGVFAGPCGN--NLNHAVTIVGYGSSNEGPYWLIKNSWGQNWG 302
           PVSVAIDAS   F++YS GV+     N  NLNHAV  VGYG      +W+IKNSWG+NWG
Sbjct: 245 PVSVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWG 304

Query: 303 EGGFIRMRRDVGGAGLCGIARKASYP 328
             G+I M R+   A  CGIA  AS+P
Sbjct: 305 NKGYILMARNKNNA--CGIANLASFP 328


>sp|P25774|CATS_HUMAN Cathepsin S OS=Homo sapiens GN=CTSS PE=1 SV=3
          Length = 331

 Score =  244 bits (624), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 138/342 (40%), Positives = 198/342 (57%), Gaps = 30/342 (8%)

Query: 1   MLIIMVTWASLVMSRTLHEDSISAKH-ELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEK 59
           ++ +++  +S V    LH+D     H  LW     + YK + E+A+R  I++KN +F+  
Sbjct: 4   LVCVLLVCSSAVAQ--LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVML 61

Query: 60  FNRE---GNQTYKLSLNEFADLTDEEFIASHTGYKMPT---RNISNQSQSYANNWFGYPD 113
            N E   G  +Y L +N   D+T EE ++  +  ++P+   RNI+ +S           +
Sbjct: 62  HNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKS-----------N 110

Query: 114 SRRGLPRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC 173
             R LP S+DWR +G VT VK QGSCG CW FSAV A+E   K++TG+L+SLS Q ++DC
Sbjct: 111 PNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDC 170

Query: 174 S----GSRGCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQD 229
           S    G++GC GG+M  AF YII ++G+  +  YPY+  +  C +     +AA    Y +
Sbjct: 171 STEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYD-SKYRAATCSKYTE 229

Query: 230 VPTS-ELALRYAVSRQ-PVSVAIDASSPGFRYYSGGVFAGP-CGNNLNHAVTIVGYGSSN 286
           +P   E  L+ AV+ + PVSV +DA  P F  Y  GV+  P C  N+NH V +VGYG  N
Sbjct: 230 LPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLN 289

Query: 287 EGPYWLIKNSWGQNWGEGGFIRMRRDVGGAGLCGIARKASYP 328
              YWL+KNSWG N+GE G+IRM R+ G    CGIA   SYP
Sbjct: 290 GKEYWLVKNSWGHNFGEEGYIRMARNKGNH--CGIASFPSYP 329


>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
          Length = 331

 Score =  243 bits (621), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 138/340 (40%), Positives = 198/340 (58%), Gaps = 27/340 (7%)

Query: 2   LIIMVTWASLVMSRTLHEDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFN 61
           L+ ++   S  +++   + ++     LW    ++ YK + E+  R  I++KN +F+   N
Sbjct: 4   LVGLLPLCSYAVAQVHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHN 63

Query: 62  RE---GNQTYKLSLNEFADLTDEEFIASHTGYKMPT---RNISNQSQSYANNWFGYPDSR 115
            E   G  +Y L +N   D+T EE I+     ++P+   RN++ +S           +S 
Sbjct: 64  LEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVTYRS-----------NSN 112

Query: 116 RGLPRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCS- 174
           + LP S+DWR +G VT VK QGSCG CW FSAV A+E   K++TG+L+SLS Q ++DCS 
Sbjct: 113 QKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 175 ---GSRGCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVP 231
              G++GC GG+M  AF YII + G+  E  YPY+   G C +     +AA    Y ++P
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYD-SKKRAATCSKYTELP 231

Query: 232 -TSELALRYAVSRQ-PVSVAIDASSPGFRYYSGGVFAGP-CGNNLNHAVTIVGYGSSNEG 288
             SE AL+ AV+ + PVSVAIDAS   F  Y  GV+  P C  N+NH V +VGYG+ N  
Sbjct: 232 FGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGK 291

Query: 289 PYWLIKNSWGQNWGEGGFIRMRRDVGGAGLCGIARKASYP 328
            YWL+KNSWG N+G+ G+IRM R+ G    CGIA   SYP
Sbjct: 292 DYWLVKNSWGLNFGDQGYIRMARNSGNH--CGIASYPSYP 329


>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
          Length = 330

 Score =  243 bits (619), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 136/341 (39%), Positives = 196/341 (57%), Gaps = 29/341 (8%)

Query: 1   MLIIMVTWASLVMSRTLHEDSISAKH-ELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEK 59
           ++ ++   +S V    LH+D     H  LW     + YK + E+A+R  I++KN +F+  
Sbjct: 4   LVCVLFVCSSAVTQ--LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVML 61

Query: 60  FNRE---GNQTYKLSLNEFADLTDEEFIASHTGYKMPT---RNISNQSQSYANNWFGYPD 113
            N E   G  +Y L +N   D+T EE ++  +  ++P    RNI+ +S           +
Sbjct: 62  HNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPNQWQRNITYKS-----------N 110

Query: 114 SRRGLPRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC 173
             + LP S+DWR +G VT VK QGSCG CW FSAV A+E   K++TG+L+SLS Q ++DC
Sbjct: 111 PNQMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDC 170

Query: 174 S---GSRGCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDV 230
           S   G++GC GG+M +AF YII ++G+  E  YPY+  +  C +     +AA    Y ++
Sbjct: 171 SEKYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKATDQKCQYD-SKYRAATCSKYTEL 229

Query: 231 PTS-ELALRYAVSRQ-PVSVAIDASSPGFRYYSGGVFAGP-CGNNLNHAVTIVGYGSSNE 287
           P   E  L+ AV+ + PV V +DAS P F  Y  GV+  P C   +NH V ++GYG  N 
Sbjct: 230 PYGREDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYGDLNG 289

Query: 288 GPYWLIKNSWGQNWGEGGFIRMRRDVGGAGLCGIARKASYP 328
             YWL+KNSWG N+GE G+IRM R+ G    CGIA   SYP
Sbjct: 290 KEYWLVKNSWGSNFGEQGYIRMARNKGNH--CGIASYPSYP 328


>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
          Length = 330

 Score =  242 bits (617), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 136/335 (40%), Positives = 201/335 (60%), Gaps = 20/335 (5%)

Query: 2   LIIMVTWASLVMSRTLHEDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFN 61
           +++++  AS  +     E+ +  + +LW     + Y ++ ++  R  I++KN + I   N
Sbjct: 6   VLLLLPMASFAL---YPEEILDTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHN 62

Query: 62  RE---GNQTYKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGL 118
            E   G  TY+L++N   D+T EE +   TG K+P       S S +N+    PD     
Sbjct: 63  LEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPP------SHSRSNDTLYIPDWESRA 116

Query: 119 PRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC-SGSR 177
           P S+D+R +G VTPVKNQG CG CW FS+V A+EG  K +TG+L++LS Q ++DC S + 
Sbjct: 117 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSEND 176

Query: 178 GCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SELA 236
           GC GG+M +AF Y+ +++G+  E  YPY  ++  C +     KAA+ R Y+++P  +E A
Sbjct: 177 GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESCMYNPTG-KAAKCRGYREIPEGNEKA 235

Query: 237 LRYAVSRQ-PVSVAIDASSPGFRYYSGGVFAGPCGN--NLNHAVTIVGYGSSNEGPYWLI 293
           L+ AV+R  P+SVAIDAS   F++YS GV+     N  NLNHAV  VGYG      +W+I
Sbjct: 236 LKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWII 295

Query: 294 KNSWGQNWGEGGFIRMRRDVGGAGLCGIARKASYP 328
           KNSWG+NWG  G+I M R+   A  CGIA  AS+P
Sbjct: 296 KNSWGENWGNKGYILMARNKNNA--CGIANLASFP 328


>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
          Length = 331

 Score =  241 bits (616), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 138/340 (40%), Positives = 198/340 (58%), Gaps = 31/340 (9%)

Query: 6   VTWASLVMSRTL---HEDSISAKH-ELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFN 61
           + WA L+ S  +   H D     H +LW     + YK + E+  R  I++KN + +   N
Sbjct: 4   LVWALLLCSSAMAHVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVTLHN 63

Query: 62  RE---GNQTYKLSLNEFADLTDEEFIASHTGYKMPT---RNISNQSQSYANNWFGYPDSR 115
            E   G  +Y+L +N   D+T EE I+  +  ++P+   RN++ +S           D  
Sbjct: 64  LEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKS-----------DPN 112

Query: 116 RGLPRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCS- 174
           + LP S+DWR +G VT VK QG+CG CW FSAV A+E   K++TG+L+SLS Q ++DCS 
Sbjct: 113 QKLPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 172

Query: 175 ---GSRGCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVP 231
              G++GC GG+M +AF YII + G+  E  YPY+  +G C +     +AA    Y ++P
Sbjct: 173 AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDV-KNRAATCSRYIELP 231

Query: 232 -TSELALRYAVSRQ-PVSVAIDASSPGFRYYSGGVFAGP-CGNNLNHAVTIVGYGSSNEG 288
             SE AL+ AV+ + PVSV IDAS   F  Y  GV+  P C  N+NH V +VGYG+ +  
Sbjct: 232 FGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGK 291

Query: 289 PYWLIKNSWGQNWGEGGFIRMRRDVGGAGLCGIARKASYP 328
            YWL+KNSWG ++G+ G+IRM R+ G    CGIA   SYP
Sbjct: 292 DYWLVKNSWGLHFGDQGYIRMARNSGNH--CGIANYPSYP 329


>sp|Q5E968|CATK_BOVIN Cathepsin K OS=Bos taurus GN=CTSK PE=2 SV=2
          Length = 329

 Score =  239 bits (611), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 137/326 (42%), Positives = 198/326 (60%), Gaps = 18/326 (5%)

Query: 12  VMSRTLH-EDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNRE---GNQT 67
           V+S  L+ E+ +  + ELW     + Y ++ ++  R  I++KN + I   N E   G  T
Sbjct: 11  VVSFALYPEEILDTQWELWKKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEASLGVHT 70

Query: 68  YKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRAR 127
           Y+L++N   D+T EE +   TG K+P       S+S +N+    PD     P S+D+R +
Sbjct: 71  YELAMNHLGDMTSEEVVQKMTGLKVPA------SRSRSNDTLYIPDWEGRAPDSVDYRKK 124

Query: 128 GAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC-SGSRGCYGGWMDD 186
           G VTPVKNQG CG CW FS+V A+EG  K +TG+L++LS Q ++DC S + GC GG+M +
Sbjct: 125 GYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTN 184

Query: 187 AFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SELALRYAVSRQ- 244
           AF Y+ +++G+  E  YPY  ++  C +     KAA+ R Y+++P  +E AL+ AV+R  
Sbjct: 185 AFQYVQKNRGIDSEDAYPYVGQDENCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVG 243

Query: 245 PVSVAIDASSPGFRYYSGGVFAGPCGN--NLNHAVTIVGYGSSNEGPYWLIKNSWGQNWG 302
           P+SVAIDAS   F++Y  GV+     N  NLNHAV  VGYG      +W+IKNSWG+NWG
Sbjct: 244 PISVAIDASLTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG 303

Query: 303 EGGFIRMRRDVGGAGLCGIARKASYP 328
             G+I M R+   A  CGIA  AS+P
Sbjct: 304 NKGYILMARNKNNA--CGIANLASFP 327


>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
          Length = 348

 Score =  239 bits (609), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 131/303 (43%), Positives = 188/303 (62%), Gaps = 11/303 (3%)

Query: 29  WMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFADLTDEEFIASHT 88
           WM    + Y+N  EK  RF+IFK N  +I++ N++ N +Y L LNEFADL+++EF   + 
Sbjct: 51  WMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFADLSNDEFNEKYV 109

Query: 89  GYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQGSCGCCWIFSAV 148
           G  +     +   QSY   +    +    LP ++DWR +GAVTPV++QGSCG CW FSAV
Sbjct: 110 GSLID----ATIEQSYDEEFIN--EDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAV 163

Query: 149 AAVEGITKIRTGRLISLSEQQVLDCSG-SRGCYGGWMDDAFSYIIRSQGLTDERVYPYQR 207
           A VEGI KIRTG+L+ LSEQ+++DC   S GC GG+   A  Y+ ++ G+     YPY+ 
Sbjct: 164 ATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKA 222

Query: 208 REGYCNWQRGAMKAARIRSYQDV-PTSELALRYAVSRQPVSVAIDASSPGFRYYSGGVFA 266
           ++G C  ++      +      V P +E  L  A+++QPVSV +++    F+ Y GG+F 
Sbjct: 223 KQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFE 282

Query: 267 GPCGNNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMRRDVGGA-GLCGIARKA 325
           GPCG  ++HAVT VGYG S    Y LIKNSWG  WGE G+IR++R  G + G+CG+ + +
Sbjct: 283 GPCGTKVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSS 342

Query: 326 SYP 328
            YP
Sbjct: 343 YYP 345


>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
          Length = 340

 Score =  239 bits (609), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 135/339 (39%), Positives = 199/339 (58%), Gaps = 24/339 (7%)

Query: 2   LIIMVTWASLVMSRTLHEDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFN 61
           L  M    S+ M +   + ++    +LW     + YK++ E+ +R  I++KN +FI   N
Sbjct: 12  LFWMPLVCSVAMEQLQRDPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHN 71

Query: 62  RE---GNQTYKLSLNEFADLTDEEFIASHTGYKMPTRNISNQS-QSYANNWFGYPDSRRG 117
            E   G  TY++ +N+  D+T+EE +      ++P ++    + +SY+N         R 
Sbjct: 72  LEYSMGMHTYQVGMNDMGDMTNEEILCRMGALRIPRQSPKTVTFRSYSN---------RT 122

Query: 118 LPRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCS--- 174
           LP ++DWR +G VT VK QGSCG CW FSAV A+EG  K++TG+LISLS Q ++DCS   
Sbjct: 123 LPDTVDWREKGCVTEVKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEE 182

Query: 175 --GSRGCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVP- 231
             G++GC GG+M +AF YII + G+  +  YPY+  +  C++     +AA    Y  +P 
Sbjct: 183 KYGNKGCGGGYMTEAFQYIIDNGGIEADASYPYKATDEKCHYN-SKNRAATCSRYIQLPF 241

Query: 232 TSELALRYAV-SRQPVSVAIDASSPGFRYYSGGVFAGP-CGNNLNHAVTIVGYGSSNEGP 289
             E AL+ AV ++ PVSV IDAS   F +Y  GV+  P C  N+NH V +VGYG+ +   
Sbjct: 242 GDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLDGKD 301

Query: 290 YWLIKNSWGQNWGEGGFIRMRRDVGGAGLCGIARKASYP 328
           YWL+KNSWG N+G+ G+IRM R+      CGIA   SYP
Sbjct: 302 YWLVKNSWGLNFGDQGYIRMARN--NKNHCGIASYCSYP 338


>sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2 SV=1
          Length = 329

 Score =  238 bits (606), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 130/318 (40%), Positives = 197/318 (61%), Gaps = 17/318 (5%)

Query: 19  EDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNRE---GNQTYKLSLNEF 75
           E+++  + ELW     + Y ++ ++  R  I++KN + I   N E   G  TY+L++N  
Sbjct: 19  EETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEASLGAHTYELAMNHL 78

Query: 76  ADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKN 135
            D+T EE +   TG ++P       S+S++N+    P+    +P SID+R +G VTPVKN
Sbjct: 79  GDMTSEEVVQKMTGLRVPP------SRSFSNDTLYTPEWEGRVPDSIDYRKKGYVTPVKN 132

Query: 136 QGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC-SGSRGCYGGWMDDAFSYIIRS 194
           QG CG CW FS+  A+EG  K +TG+L++LS Q ++DC S + GC GG+M  AF Y+ ++
Sbjct: 133 QGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVSENYGCGGGYMTTAFQYVQQN 192

Query: 195 QGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SELALRYAVSRQ-PVSVAIDA 252
            G+  E  YPY  ++  C +   A KAA+ R Y+++P  +E AL+ AV+R  PVSV+IDA
Sbjct: 193 GGIDSEDAYPYVGQDESCMYNATA-KAAKCRGYREIPVGNEKALKRAVARVGPVSVSIDA 251

Query: 253 SSPGFRYYSGGVFAGP-CG-NNLNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMR 310
           S   F++YS GV+    C  +N+NHAV +VGYG+     YW+IKNSWG++WG  G++ + 
Sbjct: 252 SLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGESWGNKGYVLLA 311

Query: 311 RDVGGAGLCGIARKASYP 328
           R+   A  CGI   AS+P
Sbjct: 312 RNKNNA--CGITNLASFP 327


>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
          Length = 221

 Score =  238 bits (606), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 114/215 (53%), Positives = 151/215 (70%), Gaps = 4/215 (1%)

Query: 118 LPRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCS-GS 176
           LP SIDWR  GAV PVKNQG CG CW FS VAAVEGI +I TG LISLSEQQ++DC+  +
Sbjct: 3   LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTAN 62

Query: 177 RGCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SEL 235
            GC GGWM+ AF +I+ + G+  E  YPY+ ++G CN    A     I SY++VP+ +E 
Sbjct: 63  HGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNSTVNA-PVVSIDSYENVPSHNEQ 121

Query: 236 ALRYAVSRQPVSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKN 295
           +L+ AV+ QPVSV +DA+   F+ Y  G+F G C  + NHA+T+VGYG+ N+  +W++KN
Sbjct: 122 SLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIVKN 181

Query: 296 SWGQNWGEGGFIRMRRDVGGA-GLCGIARKASYPI 329
           SWG+NWGE G+IR  R++    G CGI R ASYP+
Sbjct: 182 SWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216


>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
           GN=At3g43960 PE=2 SV=1
          Length = 376

 Score =  234 bits (598), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 136/323 (42%), Positives = 194/323 (60%), Gaps = 20/323 (6%)

Query: 18  HEDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNREGNQTYKLSLNEFAD 77
           +E  +   +E W+ ++ + Y    EK  RFKIFK N + IE+ N + N++Y+  LN+F+D
Sbjct: 33  NEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSD 92

Query: 78  LTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTP-VKNQ 136
           LT +EF AS+ G KM  +++S+ ++ Y      Y +    LP  +DWR RGAV P VK Q
Sbjct: 93  LTADEFQASYLGGKMEKKSLSDVAERYQ-----YKEGDV-LPDEVDWRERGAVVPRVKRQ 146

Query: 137 GSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC---SGSRGCYGGWMDDAFSYIIR 193
           G CG CW F+A  AVEGI +I TG L+SLSEQ+++DC   + + GC GG    AF +I  
Sbjct: 147 GECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKE 206

Query: 194 SQGLTDERVYPYQRREGYCNWQRGAMKAAR---IRSYQDVPTS-ELALRYAVSRQPVSVA 249
           + G+  + VY Y   E     +   MK  R   I  ++ VP + E++L+ AV+ QP+SV 
Sbjct: 207 NGGIVSDEVYGYT-GEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVM 265

Query: 250 IDASSPGFRYYSGGVFAGPCGNNL-NHAVTIVGYG-SSNEGPYWLIKNSWGQNWGEGGFI 307
           I A++     Y  GV+ G C N   +H V IVGYG SS+EG YWLI+NSWG  WGEGG++
Sbjct: 266 ISAAN--MSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYL 323

Query: 308 RMRRDVGG-AGLCGIARKASYPI 329
           R++R+     G C +A    YPI
Sbjct: 324 RLQRNFHEPTGKCAVAVAPVYPI 346


>sp|P55097|CATK_MOUSE Cathepsin K OS=Mus musculus GN=Ctsk PE=2 SV=2
          Length = 329

 Score =  234 bits (598), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 132/336 (39%), Positives = 203/336 (60%), Gaps = 23/336 (6%)

Query: 1   MLIIMVTWASLVMSRTLHEDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKF 60
           +L+ MV++A         E+ +  + ELW     + Y ++ ++  R  I++KN + I   
Sbjct: 7   LLLPMVSFA------LSPEEMLDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAH 60

Query: 61  NRE---GNQTYKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRG 117
           N E   G  TY+L++N   D+T EE +   TG ++P       S+SY+N+    P+    
Sbjct: 61  NLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLRIPP------SRSYSNDTLYTPEWEGR 114

Query: 118 LPRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC-SGS 176
           +P SID+R +G VTPVKNQG CG CW FS+  A+EG  K +TG+L++LS Q ++DC + +
Sbjct: 115 VPDSIDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTEN 174

Query: 177 RGCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SEL 235
            GC GG+M  AF Y+ ++ G+  E  YPY  ++  C +   A KAA+ R Y+++P  +E 
Sbjct: 175 YGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATA-KAAKCRGYREIPVGNEK 233

Query: 236 ALRYAVSRQ-PVSVAIDASSPGFRYYSGGVFAGP-CG-NNLNHAVTIVGYGSSNEGPYWL 292
           AL+ AV+R  P+SV+IDAS   F++YS GV+    C  +N+NHAV +VGYG+     +W+
Sbjct: 234 ALKRAVARVGPISVSIDASLASFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGSKHWI 293

Query: 293 IKNSWGQNWGEGGFIRMRRDVGGAGLCGIARKASYP 328
           IKNSWG++WG  G+  + R+   A  CGI   AS+P
Sbjct: 294 IKNSWGESWGNKGYALLARNKNNA--CGITNMASFP 327


>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
          Length = 334

 Score =  232 bits (592), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 131/325 (40%), Positives = 183/325 (56%), Gaps = 26/325 (8%)

Query: 19  EDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNRE---GNQTYKLSLNEF 75
           + + SA+   W +   R Y    E+  R  I++KN R I+  N E   G   + + +N F
Sbjct: 22  DQTFSAEWHQWKSTHRRLYGTNEEEWRR-AIWEKNMRMIQLHNGEYSNGQHGFSMEMNAF 80

Query: 76  ADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKN 135
            D+T+EEF     GY+          +      F  P   + +P+S+DWR +G VTPVKN
Sbjct: 81  GDMTNEEFRQVVNGYR--------HQKHKKGRLFQEPLMLK-IPKSVDWREKGCVTPVKN 131

Query: 136 QGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCS---GSRGCYGGWMDDAFSYII 192
           QG CG CW FSA   +EG   ++TG+LISLSEQ ++DCS   G++GC GG MD AF YI 
Sbjct: 132 QGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIK 191

Query: 193 RSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPTSELALRYAVSR-QPVSVAID 251
            + GL  E  YPY+ ++G C + R     A    + D+P  E AL  AV+   P+SVA+D
Sbjct: 192 ENGGLDSEESYPYEAKDGSCKY-RAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMD 250

Query: 252 ASSPGFRYYSGGVFAGP--CGNNLNHAVTIVGYG----SSNEGPYWLIKNSWGQNWGEGG 305
           AS P  ++YS G++  P     NL+H V +VGYG     SN+  YWL+KNSWG  WG  G
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEG 310

Query: 306 FIRMRRDVGGAGLCGIARKASYPIA 330
           +I++ +D      CG+A  ASYP+ 
Sbjct: 311 YIKIAKDRDNH--CGLATAASYPVV 333


>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
          Length = 339

 Score =  232 bits (592), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 130/320 (40%), Positives = 183/320 (57%), Gaps = 15/320 (4%)

Query: 20  DSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNR---EGNQTYKLSLNEFA 76
           D I  +   +  Q  + Y N+ E+  R KIF +N   I K N+   +G  +YKL LN++A
Sbjct: 22  DLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYA 81

Query: 77  DLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKNQ 136
           D+   EF  +  GY    R +  +        +  P +   +P+S+DWR  GAVT VK+Q
Sbjct: 82  DMLHHEFKETMNGYNHTLRQLMRERTGLVGATY-IPPAHVTVPKSVDWREHGAVTGVKDQ 140

Query: 137 GSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCS---GSRGCYGGWMDDAFSYIIR 193
           G CG CW FS+  A+EG    + G L+SLSEQ ++DCS   G+ GC GG MD+AF YI  
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 200

Query: 194 SQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SELALRYAVSRQ-PVSVAID 251
           + G+  E+ YPY+  +  C++ +  + A     + D+P   E  ++ AV+   PVSVAID
Sbjct: 201 NGGIDTEKSYPYEGIDDSCHFNKATIGATDT-GFVDIPEGDEEKMKKAVATMGPVSVAID 259

Query: 252 ASSPGFRYYSGGVFAGP-CG-NNLNHAVTIVGYGSSNEG-PYWLIKNSWGQNWGEGGFIR 308
           AS   F+ YS GV+  P C   NL+H V +VGYG+   G  YWL+KNSWG  WGE G+I+
Sbjct: 260 ASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIK 319

Query: 309 MRRDVGGAGLCGIARKASYP 328
           M R+      CGIA  +SYP
Sbjct: 320 MARNQNNQ--CGIATASSYP 337


>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
          Length = 371

 Score =  231 bits (590), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 129/323 (39%), Positives = 191/323 (59%), Gaps = 16/323 (4%)

Query: 20  DSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNR---EGNQTYKLSLNEFA 76
           D +  +   +  +  + Y+++ E+  R KIF +N   I K N+   EG  ++KL++N++A
Sbjct: 53  DVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 112

Query: 77  DLTDEEFIASHTGYKMPT-RNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKN 135
           DL   EF     G+     + +    +S+    F  P +   LP+S+DWR +GAVT VK+
Sbjct: 113 DLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISP-AHVTLPKSVDWRTKGAVTAVKD 171

Query: 136 QGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCS---GSRGCYGGWMDDAFSYII 192
           QG CG CW FS+  A+EG    ++G L+SLSEQ ++DCS   G+ GC GG MD+AF YI 
Sbjct: 172 QGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIK 231

Query: 193 RSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SELALRYAVSRQ-PVSVAI 250
            + G+  E+ YPY+  +  C++ +G + A   R + D+P   E  +  AV+   PVSVAI
Sbjct: 232 DNGGIDTEKSYPYEAIDDSCHFNKGTVGATD-RGFTDIPQGDEKKMAEAVATVGPVSVAI 290

Query: 251 DASSPGFRYYSGGVFAGP-C-GNNLNHAVTIVGYGSSNEGP-YWLIKNSWGQNWGEGGFI 307
           DAS   F++YS GV+  P C   NL+H V +VG+G+   G  YWL+KNSWG  WG+ GFI
Sbjct: 291 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 350

Query: 308 RMRRDVGGAGLCGIARKASYPIA 330
           +M R+      CGIA  +SYP+ 
Sbjct: 351 KMLRN--KENQCGIASASSYPLV 371


>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
          Length = 337

 Score =  231 bits (590), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 202/348 (58%), Gaps = 39/348 (11%)

Query: 1   MLIIMVTWASL--VMSRTLHEDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIE 58
           ++++ +++ S   V S   ++DS       WM  + + Y ++ E   R++ FKKN  ++ 
Sbjct: 11  LIVLSISFISAGNVFSHKQYQDSFID----WMRSNNKAYTHK-EFMPRYEEFKKNMDYVH 65

Query: 59  KFNREGNQTYKLSLNEFADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGL 118
            +N +G++T  L LN+ ADL++EE+  ++ G +   +              GY     GL
Sbjct: 66  NWNSKGSKTV-LGLNQHADLSNEEYRLNYLGTRAHIK------------LNGYHKRNLGL 112

Query: 119 ---------PRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQ 169
                    P ++DWR + AVTPVK+QG CG C+ FS   +VEG+T I+TG+L+SLSEQ 
Sbjct: 113 RLNRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQN 172

Query: 170 VLDCS---GSRGCYGGWMDDAFSYIIRSQGLTDERVYPYQRR-EGYCNWQRGAMKAARIR 225
           +LDCS   G+ GC GG M +AF YII++ GL  E  YPY+ +    C +Q G++ AA+I 
Sbjct: 173 ILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSV-AAKIT 231

Query: 226 SYQDVPT-SELALRYAVSRQPVSVAIDASSPGFRYYSGGVFAGPC--GNNLNHAVTIVGY 282
           SY+++    E  L+ A+   PVSVAIDAS   F+ Y+ GV+  P     +L+H V  VG 
Sbjct: 232 SYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGM 291

Query: 283 GSSNEGPYWLIKNSWGQNWGEGGFIRMRRDVGGAGLCGIARKASYPIA 330
           G+ N   Y+++KNSWG +WG  G+I M R+      CGI+  ASYPIA
Sbjct: 292 GTDNGEDYYIVKNSWGPSWGLNGYIHMARNKDNN--CGISTMASYPIA 337


>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
          Length = 215

 Score =  230 bits (586), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 113/214 (52%), Positives = 149/214 (69%), Gaps = 5/214 (2%)

Query: 118 LPRSIDWRARGAVTPVKNQGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDC-SGS 176
           LP  +DWR++GAV  +KNQ  CG CW FSAVAAVE I KIRTG+LISLSEQ+++DC + S
Sbjct: 1   LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTAS 60

Query: 177 RGCYGGWMDDAFSYIIRSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVP-TSEL 235
            GC GGWM++AF YII + G+  ++ YPY   +G C   R  ++   I  +Q V   +E 
Sbjct: 61  HGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYR--LRVVSINGFQRVTRNNES 118

Query: 236 ALRYAVSRQPVSVAIDASSPGFRYYSGGVFAGPCGNNLNHAVTIVGYGSSNEGPYWLIKN 295
           AL+ AV+ QPVSV ++A+   F++YS G+F GPCG   NH V IVGYG+ +   YW+++N
Sbjct: 119 ALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWIVRN 178

Query: 296 SWGQNWGEGGFIRMRRDVG-GAGLCGIARKASYP 328
           SWGQNWG  G+I M R+V   AGLCGIA+  SYP
Sbjct: 179 SWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYP 212


>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
           SV=1
          Length = 323

 Score =  229 bits (583), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 132/315 (41%), Positives = 191/315 (60%), Gaps = 25/315 (7%)

Query: 27  ELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNRE---GNQTYKLSLNEFADLTDEEF 83
           E +  +  R Y +  E + R  IF++N ++IE+FN++   G  T+ L++N+F D+T EEF
Sbjct: 21  EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80

Query: 84  IASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRS--IDWRARGAVTPVKNQGSCGC 141
            A   G      NI  +S   +     YP    G P++  +DWR +GAVTPVK+QG CG 
Sbjct: 81  NAVMKG------NIPRRSAPVS---VFYPKKETG-PQATEVDWRTKGAVTPVKDQGQCGS 130

Query: 142 CWIFSAVAAVEGITKIRTGRLISLSEQQVLDCS---GSRGCYGGWMDDAFSYIIRSQGLT 198
           CW FS   ++EG   ++TG LISL+EQQ++DCS   G +GC GGWM+DAF YI  + G+ 
Sbjct: 131 CWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGID 190

Query: 199 DERVYPYQRREGYCNWQRGAMKAARIRSYQDVPT-SELALRYAVSR-QPVSVAIDASSPG 256
            E  YPY+ R+G C +   ++ AA    + ++ + SE  L+ AV    P+SV IDA+   
Sbjct: 191 TEAAYPYEARDGSCRFDSNSV-AATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSS 249

Query: 257 FRYYSGGVFAGP-CGNN-LNHAVTIVGYGSSNEGPYWLIKNSWGQNWGEGGFIRMRRDVG 314
           F++YS GV+  P C  + L+HAV  VGYGS     +WL+KNSW  +WG+ G+I+M R+  
Sbjct: 250 FQFYSSGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRN 309

Query: 315 GAGLCGIARKASYPI 329
               CGIA  ASYP+
Sbjct: 310 NN--CGIATVASYPL 322


>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
          Length = 334

 Score =  229 bits (583), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 128/325 (39%), Positives = 184/325 (56%), Gaps = 26/325 (8%)

Query: 19  EDSISAKHELWMAQSARTYKNQAEKAMRFKIFKKNFRFIEKFNRE---GNQTYKLSLNEF 75
           + + +A+   W +   R Y    E+  R  +++KN R I+  N E   G   + + +N F
Sbjct: 22  DQTFNAQWHQWKSTHRRLYGTNEEEWRR-AVWEKNMRMIQLHNGEYSNGKHGFTMEMNAF 80

Query: 76  ADLTDEEFIASHTGYKMPTRNISNQSQSYANNWFGYPDSRRGLPRSIDWRARGAVTPVKN 135
            D+T+EEF     GY+          +      F  P   + +P+++DWR +G VTPVKN
Sbjct: 81  GDMTNEEFRQIVNGYR--------HQKHKKGRLFQEPLMLQ-IPKTVDWREKGCVTPVKN 131

Query: 136 QGSCGCCWIFSAVAAVEGITKIRTGRLISLSEQQVLDCS---GSRGCYGGWMDDAFSYII 192
           QG CG CW FSA   +EG   ++TG+LISLSEQ ++DCS   G++GC GG MD AF YI 
Sbjct: 132 QGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIK 191

Query: 193 RSQGLTDERVYPYQRREGYCNWQRGAMKAARIRSYQDVPTSELALRYAVSR-QPVSVAID 251
            + GL  E  YPY+ ++G C + R     A    + D+P  E AL  AV+   P+SVA+D
Sbjct: 192 ENGGLDSEESYPYEAKDGSCKY-RAEYAVANDTGFVDIPQQEKALMKAVATVGPISVAMD 250

Query: 252 ASSPGFRYYSGGVFAGP--CGNNLNHAVTIVGYG----SSNEGPYWLIKNSWGQNWGEGG 305
           AS P  ++YS G++  P     +L+H V +VGYG     SN+  YWL+KNSWG+ WG  G
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDG 310

Query: 306 FIRMRRDVGGAGLCGIARKASYPIA 330
           +I++ +D      CG+A  ASYPI 
Sbjct: 311 YIKIAKDRNNH--CGLATAASYPIV 333


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.320    0.134    0.426 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 124,001,063
Number of Sequences: 539616
Number of extensions: 5174889
Number of successful extensions: 11463
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 216
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 10442
Number of HSP's gapped (non-prelim): 273
length of query: 330
length of database: 191,569,459
effective HSP length: 118
effective length of query: 212
effective length of database: 127,894,771
effective search space: 27113691452
effective search space used: 27113691452
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (28.1 bits)
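
Note on the statistics above: the effective lengths and the bit values in parentheses can be re-derived from the other numbers in this footer. The sketch below is a minimal check, not BLAST's own code. It assumes that the first Lambda/K/H row holds the ungapped parameters and the second the gapped ones (the report does not label them), and that drop-off values (X1, X2, X3) convert as lambda * raw / ln 2 while cutoff scores (S1, S2) convert as (lambda * raw - ln K) / ln 2. The function names are hypothetical.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 330            # "length of query"
DB_LETTERS = 191_569_459   # "length of database"
DB_SEQS = 539_616          # "Number of sequences in database"
HSP_LEN_ADJ = 118          # "effective HSP length"

# Assumption: first Lambda/K row is ungapped, second is gapped.
UNGAPPED = (0.320, 0.134)
GAPPED = (0.267, 0.0410)


def effective_search_space(query_len, db_letters, db_seqs, adj):
    """Subtract the HSP length adjustment from the query and from every
    database sequence, then multiply the two effective lengths."""
    eff_query = query_len - adj
    eff_db = db_letters - db_seqs * adj
    return eff_query, eff_db, eff_query * eff_db


def raw_to_bits(raw, lam, k=None):
    """Convert a raw score to bits. Leave k as None for drop-off (X)
    values, which are score differences and carry no ln K term."""
    log_k = math.log(k) if k is not None else 0.0
    return (lam * raw - log_k) / math.log(2)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN_ADJ
)
print("effective length of query:   ", eff_query)
print("effective length of database:", eff_db)
print("effective search space:      ", space)

# Cutoff scores and drop-offs, rounded to one decimal as in the report.
print("S1:", round(raw_to_bits(41, *UNGAPPED), 1))
print("S2:", round(raw_to_bits(61, *GAPPED), 1))
print("X1:", round(raw_to_bits(16, UNGAPPED[0]), 1))
print("X2:", round(raw_to_bits(38, GAPPED[0]), 1))
print("X3:", round(raw_to_bits(64, GAPPED[0]), 1))
```

The per-hit bit scores and Expect values in the alignments are not reproduced here: those hits use compositional matrix adjustment, and Lambda and K are printed to only three significant figures, so a recomputation from this footer alone may differ in the last digit.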