BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy12185
         (317 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|P43234|CATO_HUMAN Cathepsin O OS=Homo sapiens GN=CTSO PE=2 SV=1
          Length = 321

 Score =  167 bits (422), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 99/258 (38%), Positives = 139/258 (53%), Gaps = 21/258 (8%)

Query: 60  FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
           F +SL+    LN    S  S A YGI +FS L  EEFK  +LR   +K    S   H   
Sbjct: 44  FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH--- 100

Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
                        ++IP   +P++ DWR+  ++ +VRNQQ CG CWAFS V   ES +A+
Sbjct: 101 -------------MSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAI 147

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
           K   L  LSVQ+VIDC+ N N GC+GG     L+W++  +V L  +SEYP   ++  C  
Sbjct: 148 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 206

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
            + S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S 
Sbjct: 207 FSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 264

Query: 298 ANINHAVQIVGYDNYSRT 315
              NHAV I G+D    T
Sbjct: 265 GEANHAVLITGFDKTGST 282


>sp|Q8BM88|CATO_MOUSE Cathepsin O OS=Mus musculus GN=Ctso PE=2 SV=1
          Length = 312

 Score =  150 bits (379), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 86/256 (33%), Positives = 130/256 (50%), Gaps = 18/256 (7%)

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
             +SL     LN       +A YG+ +FS L  EEFK  +L    +K+     +      
Sbjct: 36  LRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEFKALYLG---SKYAWAPRYPAEG-- 90

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
                +R I         +P++ DWR+  ++  VRNQ+ CG CWAFS V   ES  A++ 
Sbjct: 91  -----QRPIPN-----VSLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQG 140

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            +L  LSVQ+VIDC+ N N GC GG     L W++  ++ L  +S+YP    +  C+   
Sbjct: 141 KSLDYLSVQQVIDCSFN-NSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQCRHFP 199

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            S  GV +K ++       E  +   + + GP++  V+A++WQ YLGG+IQ++C  S   
Sbjct: 200 QSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHHC--SSGE 257

Query: 300 INHAVQIVGYDNYSRT 315
            NHAV I G+D    T
Sbjct: 258 ANHAVLITGFDRTGNT 273


>sp|Q54TR1|CFAD_DICDI Counting factor associated protein D OS=Dictyostelium discoideum
           GN=cfaD PE=1 SV=1
          Length = 531

 Score =  143 bits (361), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 96/290 (33%), Positives = 143/290 (49%), Gaps = 31/290 (10%)

Query: 31  EQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           EQ   LF  ++ +Y K YS + EHD RF NF+ +  II   N       S + G+  ++D
Sbjct: 219 EQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKE---SSYKLGMNHYAD 275

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           LS +EF T      V   V        D  H+    RSI          P   DWR    
Sbjct: 276 LSNKEFNTL-----VKPKVARPSVTGADSVHDDESLRSI----------PSTVDWRNQNC 320

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCA 208
           +  V++Q  CG+CW F +  + E  + + NG L  LS Q+++DCA   G+ GC GG   +
Sbjct: 321 VTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASS 380

Query: 209 LLDW-MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
              + M++    L  ES YP L+++  C+ +  +P+GV I  Y  +    SES++   IA
Sbjct: 381 AFQYVMEIGS--LATESNYPYLMQNGLCRDRTVTPSGVSITGYV-NVTSGSESALQNAIA 437

Query: 268 THGPVIAAVNALT--WQYYLGGVIQYN---CDGSLANINHAVQIVGYDNY 312
           T GPV  A++A    ++YY+ GV  YN   C   L +++H V  +GY  Y
Sbjct: 438 TTGPVAIAIDASVDDFRYYMSGV--YNNPACKNGLDDLDHEVLAIGYGTY 485


>sp|Q9R014|CATJ_MOUSE Cathepsin J OS=Mus musculus GN=Ctsj PE=2 SV=2
          Length = 334

 Score =  134 bits (336), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 99/304 (32%), Positives = 154/304 (50%), Gaps = 30/304 (9%)

Query: 11  VALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           V L+ LCF +A   +   P L+ +   +  ++ +Y KSYS  E  +R   +E+++ +I+ 
Sbjct: 5   VLLLILCFGVASGAQAHDPKLDAE---WKDWKTKYAKSYSPKEEALRRAVWEENMRMIKL 61

Query: 70  LNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            NK N     +    + +F D + EEF     R S++ ++ +       H  NHV     
Sbjct: 62  HNKENSLGKNNFTMKMNKFGDQTSEEF-----RKSID-NIPIPAAMTDPHAQNHVS---- 111

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                   G+P  KDWRE G +  VRNQ  CG+CWAF+     E     K G L+ LSVQ
Sbjct: 112 -------IGLPDYKDWREEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQ 164

Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
            ++DC+   GN GC  G      +++  NK  LE E+ YP   KD  C+ ++ + +   I
Sbjct: 165 NLLDCSKTVGNKGCQSGTAHQAFEYVLKNK-GLEAEATYPYEGKDGPCRYRSENAS-ANI 222

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQ 305
             Y    L P+E  +   +A+ GPV AA++A   ++++Y GG I Y  + S   +NHAV 
Sbjct: 223 TDYV--NLPPNELYLWVAVASIGPVSAAIDASHDSFRFYNGG-IYYEPNCSSYFVNHAVL 279

Query: 306 IVGY 309
           +VGY
Sbjct: 280 VVGY 283


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
           SV=1
          Length = 355

 Score =  133 bits (334), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 90/287 (31%), Positives = 139/287 (48%), Gaps = 31/287 (10%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           N ++ LELF S+   + K+Y   E  + RF+ F ++L  I++ N    S      G+ EF
Sbjct: 43  NTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS---YWLGLNEF 99

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DL+ EEFK R+L       +             + + R IT        +P   DWR+ 
Sbjct: 100 ADLTHEEFKGRYL------GLAKPQFSRKRQPSANFRYRDITD-------LPKSVDWRKK 146

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V++Q  CG+CWAFSTV   E ++ +  G LS LS QE+IDC    N GC+GG   
Sbjct: 147 GAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGG--- 203

Query: 208 ALLDWMD---VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
            L+D+     ++   L  E +YP L+++  C+ +      V I  Y  + +  ++   L 
Sbjct: 204 -LMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGY--EDVPENDDESLV 260

Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               H PV  A+ A    +Q+Y GGV    C     +++H V  VGY
Sbjct: 261 KALAHQPVSVAIEASGRDFQFYKGGVFNGKCG---TDLDHGVAAVGY 304


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
           SV=2
          Length = 356

 Score =  132 bits (332), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 87/279 (31%), Positives = 142/279 (50%), Gaps = 24/279 (8%)

Query: 34  LELFSSFQQRYKKSYSKSEHD-IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           +ELF ++   ++K+Y   E   +RF+ F+ +L  I+E NK  +S      G+ EF+DLS 
Sbjct: 48  IELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKS---YWLGLNEFADLSH 104

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEFK  +L   +   ++    +  +  +     R +         +P   DWR+ G + +
Sbjct: 105 EEFKKMYL--GLKTDIV---RRDEERSYAEFAYRDVEA-------VPKSVDWRKKGAVAE 152

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG      ++
Sbjct: 153 VKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEY 212

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           + V    L  E +YP  +++  C+ +      V I  +  D     E S+L  +A H P+
Sbjct: 213 I-VKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQ-DVPTNDEKSLLKALA-HQPL 269

Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A++A    +Q+Y GGV    C     +++H V  VGY
Sbjct: 270 SVAIDASGREFQFYSGGVFDGRCG---VDLDHGVAAVGY 305


>sp|O97397|CATLL_PHACE Cathepsin L-like proteinase OS=Phaedon cochleariae PE=2 SV=1
          Length = 324

 Score =  131 bits (330), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 95/310 (30%), Positives = 158/310 (50%), Gaps = 33/310 (10%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN 71
           +IAL  L + +     N     EL++ F++ + ++Y S  E  +RF  F+ +L  I E N
Sbjct: 4   IIALAALIVVI-----NAASDQELWADFKKTHARTYKSLREEKLRFNIFQDTLRQIAEHN 58

Query: 72  KNRQSPESARY-GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
              ++ ES  Y  I +FSD+++EEF+   +++  ++  L             ++   +T 
Sbjct: 59  VKYENGESTYYLAINKFSDITDEEFRDMLMKNEASRPNL-----------EGLEVADLTV 107

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
           G       P   DWR  G++  VRNQ  CG+CWA ST    ES  A+K+G+   LS Q++
Sbjct: 108 GAA-----PESIDWRSKGVVLPVRNQGECGSCWALSTAAAIESQSAIKSGSKVPLSPQQL 162

Query: 191 IDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           +DC+ + GN GC+GG      +++  N   LE +++YP   K+  CK    S + V++  
Sbjct: 163 VDCSTSYGNHGCNGGFAVNGFEYVKDNG--LESDADYPYSGKEDKCKANDKSRSVVELTG 220

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGSLANINHAVQIVG 308
           Y    +  SE+S+   + T GP+ A V     + Y GG+    +C G   N++H V +VG
Sbjct: 221 YK--KVTASETSLKEAVGTIGPISAVVFGKPMKSYGGGIFDDSSCLGD--NLHHGVNVVG 276

Query: 309 Y--DNYSRTW 316
           Y  +N  + W
Sbjct: 277 YGIENGQKYW 286


>sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2 SV=1
          Length = 363

 Score =  131 bits (330), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 91/283 (32%), Positives = 147/283 (51%), Gaps = 34/283 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+SF+ ++ KSY +K EHD RF  F+ +L I  +L++NR    +A +GIT+FSDL+  EF
Sbjct: 48  FTSFKSKFSKSYATKEEHDYRFGVFKSNL-IKAKLHQNRDP--TAEHGITKFSDLTASEF 104

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           + + L   + K + +  H                  I   T +P   DWRE G +  V++
Sbjct: 105 RRQFL--GLKKRLRLPAHAQK-------------APILPTTNLPEDFDWREKGAVTPVKD 149

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
           Q +CG+CWAFST    E  H L  G L  LS Q+++DC        AG+ + GC+GG   
Sbjct: 150 QGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMN 209

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
              +++  +  V++ E +Y    +D +CK    S     + +++  TL   E  I  ++ 
Sbjct: 210 NAFEYLLESGGVVQ-EKDYAYTGRDGSCKFD-KSKVVASVSNFSVVTL--DEDQIAANLV 265

Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            +GP+  A+NA   Q Y+ GV   Y C  + + ++H V +VG+
Sbjct: 266 KNGPLAVAINAAWMQTYMSGVSCPYVC--AKSRLDHGVLLVGF 306


>sp|P04988|CYSP1_DICDI Cysteine proteinase 1 OS=Dictyostelium discoideum GN=cprA PE=1 SV=2
          Length = 343

 Score =  127 bits (320), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 90/313 (28%), Positives = 152/313 (48%), Gaps = 35/313 (11%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           ++ L  L    + V      LE++ + F  FQ ++ K YS  E+  RF+ F+ +L  IEE
Sbjct: 3   VILLFVLAVFTVFVSSRGIPLEEQSQ-FLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61

Query: 70  LNK---NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           LN    N ++    ++G+ +F+DLS +EFK  +L    NK  + +         +++   
Sbjct: 62  LNLIAINHKA--DTKFGVNKFADLSSDEFKNYYLN---NKEAIFTDDLPV---ADYLDDE 113

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
            I +       IP   DWR  G +  V+NQ  CG+CW+FST    E  H +    L  LS
Sbjct: 114 FINS-------IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLS 166

Query: 187 VQEVIDC---------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
            Q ++DC             + GC+GG      +++  N  + + ES YP   +      
Sbjct: 167 EQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKNGGI-QTESSYPYTAETGTQCN 225

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTD-IATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
             ++  G KI ++   T+IP   +++   I + GP+  A +A+ WQ+Y+GGV    C+ +
Sbjct: 226 FNSANIGAKISNF---TMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 282

Query: 297 LANINHAVQIVGY 309
             +++H + IVGY
Sbjct: 283 --SLDHGILIVGY 293


>sp|Q63088|CATJ_RAT Cathepsin J OS=Rattus norvegicus GN=Ctsj PE=2 SV=2
          Length = 334

 Score =  127 bits (318), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 98/305 (32%), Positives = 156/305 (51%), Gaps = 32/305 (10%)

Query: 11  VALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           V L+ LCF +A       PNL+ + +    ++ +Y KSYS  E +++   +E++L +I+ 
Sbjct: 5   VFLVILCFGVASGAPARDPNLDAEWQ---DWKTKYAKSYSPVEEELKRAVWEENLKMIQL 61

Query: 70  LNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            NK N          +  F+D + EEF     R S++  ++ +           V   S 
Sbjct: 62  HNKENGLGKNGFTMEMNAFADTTGEEF-----RKSLSDILIPAA----------VTNPSA 106

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
              ++I  G+P  KDWR+ G +  VRNQ  CG+CWAF+ V   E     K G L+ LSVQ
Sbjct: 107 QKQVSI--GLPNFKDWRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQ 164

Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
            ++DC+   GN GC  G      +++  NK  LE E+ YP   KD  C+  + + +   I
Sbjct: 165 NLLDCSKSEGNNGCRWGTAHQAFNYVLKNK-GLEAEATYPYEGKDGPCRYHSENAS-ANI 222

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHAV 304
             +    L P+E  +   +A+ GPV AA++A   ++++Y GGV  + NC   +  +NHAV
Sbjct: 223 TGFV--NLPPNELYLWVAVASIGPVSAAIDASHDSFRFYSGGVYHEPNCSSYV--VNHAV 278

Query: 305 QIVGY 309
            +VGY
Sbjct: 279 LVVGY 283


>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 OS=Drosophila melanogaster
           GN=CG12163 PE=2 SV=2
          Length = 614

 Score =  126 bits (316), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 89/284 (31%), Positives = 142/284 (50%), Gaps = 38/284 (13%)

Query: 36  LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  FQ R+ + Y S +E  +R + F ++L  IEELN N     SA+YGITEF+D++  E
Sbjct: 307 LFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMG--SAKYGITEFADMTSSE 364

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT---GIPVKKDWREAGIIG 151
           +K R      ++                  K +  +   +P     +P + DWR+   + 
Sbjct: 365 YKERTGLWQRDE-----------------AKATGGSAAVVPAYHGELPKEFDWRQKDAVT 407

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD 211
           +V+NQ +CG+CWAFS     E ++A+K G L   S QE++DC    +  C+GG    L+D
Sbjct: 408 QVKNQGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDC-DTTDSACNGG----LMD 462

Query: 212 WMDVNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
             +  K +     LE E+EYP   K   C    T  + V++  +  D    +E+++   +
Sbjct: 463 --NAYKAIKDIGGLEYEAEYPYKAKKNQCHFNRTLSH-VQVAGFV-DLPKGNETAMQEWL 518

Query: 267 ATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
             +GP+   +NA   Q+Y GGV   +    S  N++H V +VGY
Sbjct: 519 LANGPISIGINANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGY 562


>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1 PE=2 SV=1
          Length = 371

 Score =  124 bits (312), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 86/284 (30%), Positives = 136/284 (47%), Gaps = 32/284 (11%)

Query: 37  FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F SF QR+ KSY  + EH  R   F+   D +    +++    SA +G+T+FSDL+  EF
Sbjct: 48  FLSFVQRFGKSYKDADEHAYRLSVFK---DNLRRARRHQLLDPSAEHGVTKFSDLTPAEF 104

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
           +  +L    ++  L+       H               +PT G+P   DWR+ G +G V+
Sbjct: 105 RRTYLGLRKSRRALLRELGESAHE-----------APVLPTDGLPDDFDWRDHGAVGPVK 153

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FS     E  H L  G L +LS Q+ +DC          + + GC+GG  
Sbjct: 154 NQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLM 213

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
                ++      LE E +YP    D  CK    S     +++++  ++   E+ I  ++
Sbjct: 214 TTAFSYLQ-KAGGLESEKDYPYTGSDGKCKFD-KSKIVASVQNFSVVSV--DEAQISANL 269

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             HGP+   +NA   Q Y+GGV   Y C     +++H V +VGY
Sbjct: 270 IKHGPLAIGINAAYMQTYIGGVSCPYICG---RHLDHGVLLVGY 310


>sp|P36400|LMCPB_LEIME Cysteine proteinase B OS=Leishmania mexicana GN=LMCPB PE=2 SV=2
          Length = 443

 Score =  124 bits (310), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 87/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 37  LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS V   E    L    L  LS Q+++ C  + N GC GG      DW+ 
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201

Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N    L  E  YP +  +     C   +    G +I  +    +  SE ++   +A +G
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  A++A ++  Y  GV+   C G    +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296


>sp|Q05094|CYSP2_LEIPI Cysteine proteinase 2 OS=Leishmania pifanoi GN=CYS2 PE=1 SV=1
          Length = 444

 Score =  122 bits (307), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 135/281 (48%), Gaps = 26/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 37  LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS V   E    L    L  LS Q+++ C  + N GC GG      DW+ 
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201

Query: 215 VNKVV-LEPESEYPLLLKDAACKRKATSPN----GVKIKSYTCDTLIPSESSILTDIATH 269
            N    L  E  YP +  +      + S      G +I  +    +  SE ++   +A +
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSEELVVGAQIDGHV--LIGSSEKAMAAWLAKN 259

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  A++A ++  Y  GV+   C G    +NH V +VGYD
Sbjct: 260 GPIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 297


>sp|P43295|A494_ARATH Probable cysteine proteinase A494 OS=Arabidopsis thaliana
           GN=At2g21430 PE=2 SV=2
          Length = 361

 Score =  122 bits (307), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/325 (28%), Positives = 158/325 (48%), Gaps = 42/325 (12%)

Query: 1   MFDVKNVLFIVALIALC-----FLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHD 54
           +F V +++F+   +++C      +   V  ++P +    + F+ F++++ K Y S  EH 
Sbjct: 8   LFSV-SLIFVFVSVSVCGDEDVLIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHY 66

Query: 55  IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
            RF  F+ +L  +  +   +  P SAR+G+T+FSDL+  EF+ +HL       V      
Sbjct: 67  YRFSVFKANL--LRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHL------GVKGGFKL 117

Query: 115 HHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
             D +   +          +PT  +P + DWR+ G +  V+NQ +CG+CW+FST    E 
Sbjct: 118 PKDANQAPI----------LPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEG 167

Query: 174 MHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESE 225
            H L  G L  LS Q+++DC         G+ + GC+GG   +  ++  +    L  E +
Sbjct: 168 AHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYT-LKTGGLMREKD 226

Query: 226 YPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYL 285
           YP    D    +   S     + +++  ++  +E  I  ++  +GP+  A+NA   Q Y+
Sbjct: 227 YPYTGTDGGSCKLDRSKIVASVSNFSVVSI--NEDQIAANLIKNGPLAVAINAAYMQTYI 284

Query: 286 GGV-IQYNCDGSLANINHAVQIVGY 309
           GGV   Y C   L   NH V +VGY
Sbjct: 285 GGVSCPYICSRRL---NHGVLLVGY 306


>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
           PE=2 SV=1
          Length = 358

 Score =  119 bits (298), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 90/278 (32%), Positives = 133/278 (47%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F  RY K Y S  E  +RF  F+++LD+I   NK   S    +  + +F+DL+ +EF
Sbjct: 59  FSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLS---YKLSLNQFADLTWQEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L   HK                 IT  T +P  KDWRE GI+  V+
Sbjct: 116 QRYKLGAAQNCSATLKGSHK-----------------ITEAT-VPDTKDWREDGIVSPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
            Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC GG      +++
Sbjct: 158 EQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYI 217

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP   KD  CK  A +  GV+++  + +  + +E  +   +    PV 
Sbjct: 218 KYNG-GLDTEEAYPYTGKDGGCKFSAKNI-GVQVRD-SVNITLGAEDELKHAVGLVRPVS 274

Query: 274 AAVNAL-TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
            A   +  +++Y  GV   N C  +  ++NHAV  VGY
Sbjct: 275 VAFEVVHEFRFYKKGVFTSNTCGNTPMDVNHAVLAVGY 312


>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
          Length = 344

 Score =  119 bits (298), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 86/301 (28%), Positives = 142/301 (47%), Gaps = 29/301 (9%)

Query: 13  LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
           L  LC L + V  +K      Q    F+ +   ++KSY+  E   R+  F+ ++D +++ 
Sbjct: 4   LSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSEEFGARYNIFKANMDYVQQW 63

Query: 71  NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           N   +  E+   G+  F+D++ EE++  +L    +   L+   +                
Sbjct: 64  NS--KGSETVL-GLNNFADITNEEYRNTYLGTKFDASSLIGTQEEK-------------- 106

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
                T     KDWR  G +  V+NQ  CG CW+FST  + E  H    G L  LS Q +
Sbjct: 107 --VFTTSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNL 164

Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
           IDC+   N GC GG      +++ +N   ++ ES YP   ++  C+ K+ + +G  + SY
Sbjct: 165 IDCSTE-NSGCDGGLMTYAFEYI-INNNGIDTESSYPYKAENGKCEYKSEN-SGATLSSY 221

Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
              T   SESS+ + +  + PV  A++A   ++Q Y  G I Y  + S  N++H V  VG
Sbjct: 222 KTVT-AGSESSLESAVNVN-PVSVAIDASHQSFQLYTSG-IYYEPECSSENLDHGVLAVG 278

Query: 309 Y 309
           Y
Sbjct: 279 Y 279


>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
          Length = 380

 Score =  119 bits (298), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 150/312 (48%), Gaps = 27/312 (8%)

Query: 2   FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
           F   ++LF   L+ L        +++   ++   ++ S+  +Y KSY S  E + RF+ F
Sbjct: 7   FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           +++L  I+E N +     S + G+ +F+DL++EEF++ +LR +   +     +++     
Sbjct: 67  KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLRFTSGSNKTKVSNRYEPR-- 122

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                     G  +P+ +    DWR AG +  +++Q  CG CWAFS + T E ++ +  G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169

Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS QE+IDC    N  GC+GG       ++ +N   +  E  YP   +D  C    
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNVDL 228

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
            +   V I +Y  + +  +    L    T+ PV  A++A    ++ Y  G+    C  + 
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA- 285

Query: 298 ANINHAVQIVGY 309
             ++HAV IVGY
Sbjct: 286 --VDHAVTIVGY 295


>sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium discoideum GN=cprB PE=2 SV=1
          Length = 376

 Score =  119 bits (298), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 86/281 (30%), Positives = 133/281 (47%), Gaps = 21/281 (7%)

Query: 32  QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLS 91
           Q    F+ +  ++ + YS SE   R+  F+ ++D ++  N N +       G+  F+D++
Sbjct: 31  QYRTAFTEWTLKFNRQYSSSEFSNRYSIFKSNMDYVD--NWNSKGDSQTVLGLNNFADIT 88

Query: 92  EEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
            EE++  +L   VN H            +N    R +     + T  P   DWR    + 
Sbjct: 89  NEEYRKTYLGTRVNAH-----------SYNGYDGREVLNVEDLQTN-PKSIDWRTKNAVT 136

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG-NGNMGCSGGDFCALL 210
            +++Q  CG+CW+FST  + E  HALK   L  LS Q ++DC+G   N GC GG      
Sbjct: 137 PIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAF 196

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           D++  NK + + ES YP   +  +      S  G  IK Y  +    SE S L + A HG
Sbjct: 197 DYIIKNKGI-DTESSYPYTAETGSTCLFNKSDIGATIKGY-VNITAGSEIS-LENGAQHG 253

Query: 271 PVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           PV  A++A   ++Q Y  G I Y    S   ++H V +VGY
Sbjct: 254 PVSVAIDASHNSFQLYTSG-IYYEPKCSPTELDHGVLVVGY 293


>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
          Length = 352

 Score =  119 bits (298), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 91/285 (31%), Positives = 133/285 (46%), Gaps = 28/285 (9%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++E+ ++LF S+  ++ K Y   +  I RF+ F  +L  I+E NK   S      G+  F
Sbjct: 40  SIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNS---YWLGLNGF 96

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DLS +EFK +++         + H  + D  + HV            T  P   DWR  
Sbjct: 97  ADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKHV------------TNYPQSIDWRAK 144

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG+CWAFST+ T E ++ +  G L  LS QE++DC  + + GC GG   
Sbjct: 145 GAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQT 203

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
             L ++  N V       YP   K   C  +AT   G K+K  T    +PS  E+S L  
Sbjct: 204 TSLQYVANNGV--HTSKVYPYQAKQYKC--RATDKPGPKVK-ITGYKRVPSNCETSFLGA 258

Query: 266 IATHG-PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A     V+       +Q Y  GV    C   L   +HAV  VGY
Sbjct: 259 LANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKL---DHAVTAVGY 300


>sp|Q9JIA9|CATR_MOUSE Cathepsin R OS=Mus musculus GN=Ctsr PE=2 SV=1
          Length = 334

 Score =  119 bits (297), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 90/306 (29%), Positives = 147/306 (48%), Gaps = 28/306 (9%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
           + A++ + FL + V    P L+  L+  +  ++ +Y KSYS  E  ++   +E+ L +I+
Sbjct: 1   MAAVVFIAFLYLGVASGVPVLDSSLDAEWQDWKIKYNKSYSLKEEKLKRVVWEEKLKMIK 60

Query: 69  ELNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
             N+ N          + EF D ++EEF+   +  SV  H               + KR 
Sbjct: 61  LHNRENSLGKNGFTMKMNEFGDQTDEEFRKMMIEISVWTH----------REGKSIMKRE 110

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
              G  +P  +    DWR+ G +  VR Q  C ACWAF+     E+    + G L+ LSV
Sbjct: 111 --AGSILPKFV----DWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSV 164

Query: 188 QEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
           Q ++DC+   GN GC GGD      ++ ++   LE E+ YP   KD  C+    +P   K
Sbjct: 165 QNLVDCSKPQGNNGCLGGDTYNAFQYV-LHNGGLESEATYPYEGKDGPCR---YNPKNSK 220

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHA 303
            +     +L  SE  ++  +AT GP+ A ++A   +++ Y GG+  + NC  S   + H 
Sbjct: 221 AEITGFVSLPQSEDILMAAVATIGPITAGIDASHESFKNYKGGIYHEPNC--SSDTVTHG 278

Query: 304 VQIVGY 309
           V +VGY
Sbjct: 279 VLVVGY 284


>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
          Length = 360

 Score =  118 bits (295), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 83/315 (26%), Positives = 145/315 (46%), Gaps = 29/315 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKK-------SYSKSEHDIR 56
            K     +AL+AL FL+I   +  P  E+ L    S    Y+K       +    E + R
Sbjct: 2   AKPKFIALALVALSFLSIAQSI--PFTEKDLASEDSLWNLYEKWRTHHTVARDLDEKNRR 59

Query: 57  FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
           F  F++++  I E N+ + +P   +  + +F D++ +EF++++    +       HH+  
Sbjct: 60  FNVFKENVKFIHEFNQKKDAP--YKLALNKFGDMTNQEFRSKYAGSKIQ------HHRSQ 111

Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
                +          ++P       DWR  G +  V++Q  CG+CWAFST+ + E ++ 
Sbjct: 112 RGIQKNTGSFMYENVGSLPA---ASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQ 168

Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
           +K G L  LS QE++DC  + N GC+GG      +++  N +    E  YP   +D  C 
Sbjct: 169 IKTGELVSLSEQELVDCDTSYNEGCNGGLMDYAFEFIQKNGITT--EDSYPYAEQDGTCA 226

Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCD 294
               +   V I  +  D    +E++++  +A   P+  ++ A    +Q+Y  GV    C 
Sbjct: 227 SNLLNSPVVSIDGHQ-DVPANNENALMQAVANQ-PISVSIEASGYGFQFYSEGVFTGRCG 284

Query: 295 GSLANINHAVQIVGY 309
             L   +H V IVGY
Sbjct: 285 TEL---DHGVAIVGY 296


>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis thaliana GN=RD19A PE=2
           SV=1
          Length = 368

 Score =  118 bits (295), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 86/297 (28%), Positives = 143/297 (48%), Gaps = 34/297 (11%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
           V  ++P +    + FS F++++ K Y S  EHD RF  F+ +L       ++++   SA 
Sbjct: 37  VGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANL---RRARRHQKLDPSAT 93

Query: 82  YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
           +G+T+FSDL+  EF+ +HL        + S  K        + K +    I     +P  
Sbjct: 94  HGVTQFSDLTRSEFRKKHLG-------VRSGFK--------LPKDANKAPILPTENLPED 138

Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-------- 193
            DWR+ G +  V+NQ +CG+CW+FS     E  + L  G L  LS Q+++DC        
Sbjct: 139 FDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198

Query: 194 AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
           A + + GC+GG   +  ++  +    L  E +YP   KD    +   S     + +++  
Sbjct: 199 ADSCDSGCNGGLMNSAFEYT-LKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVI 257

Query: 254 TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           ++   E  I  ++  +GP+  A+NA   Q Y+GGV   Y C      +NH V +VGY
Sbjct: 258 SI--DEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYIC---TRRLNHGVLLVGY 309


>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
          Length = 380

 Score =  117 bits (294), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 150/312 (48%), Gaps = 27/312 (8%)

Query: 2   FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
           F   ++LF   L+ L        +++   ++   ++ S+  +Y KSY S  E + RF+ F
Sbjct: 7   FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           +++L  I+E N +     S + G+ +F+DL++EEF++ +L  +   +     +++     
Sbjct: 67  KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPR-- 122

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                     G  +P+ +    DWR AG +  +++Q  CG CWAFS + T E ++ +  G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169

Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS QE+IDC    N  GC+GG       ++ +N   +  E  YP   +D  C    
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNLDL 228

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
            +   V I +Y  + +  +    L    T+ PV  A++A    +++Y  G+    C  + 
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTA- 285

Query: 298 ANINHAVQIVGY 309
             I+HAV IVGY
Sbjct: 286 --IDHAVTIVGY 295


>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
           PE=1 SV=2
          Length = 466

 Score =  117 bits (293), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 85/271 (31%), Positives = 125/271 (46%), Gaps = 26/271 (9%)

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
            EH+ RF  F  +L  ++  N         R G+  F+DL+ EEF+   L   V +    
Sbjct: 69  GEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGAKVAERSRA 128

Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
           +  ++    H+ V++            +P   DWRE G +  V+NQ  CG+CWAFS V T
Sbjct: 129 AGERYR---HDGVEE------------LPESVDWREKGAVAPVKNQGQCGSCWAFSAVST 173

Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
            ES++ L  G +  LS QE+++C+ NG N GC+GG      D++ +    ++ E +YP  
Sbjct: 174 VESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFI-IKNGGIDTEDDYPYK 232

Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
             D  C     +   V I  +  D     E S+   +A H PV  A+ A    +Q Y  G
Sbjct: 233 AVDGKCDINRENAKVVSIDGFE-DVPQNDEKSLQKAVA-HQPVSVAIEAGGREFQLYHSG 290

Query: 288 VIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
           V    C  SL   +H V  VGY  DN    W
Sbjct: 291 VFSGRCGTSL---DHGVVAVGYGTDNGKDYW 318


>sp|P14658|CYSP_TRYBB Cysteine proteinase OS=Trypanosoma brucei brucei PE=1 SV=1
          Length = 450

 Score =  116 bits (291), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
           V+ V   V L+A+      V +   ++E+ LE+ F++F+++Y K Y  + E   RF+ FE
Sbjct: 7   VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E+      +   A +G+T FSD++ EEF+ R+              ++   +  
Sbjct: 67  ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +KR   T + + TG  P   DWRE G +  V+ Q  CG+CWAFST+   E    +   
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGN 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q ++ C    + GC+GG      +W+ + N   +  E+ YP +  +   ++  
Sbjct: 169 PLVSLSEQMLVSC-DTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225

Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
              NG +I +   D   L   E +I   +A +GP+  AV+A ++  Y GG++  +C  + 
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDAESFMDYNGGILT-SC--TS 282

Query: 298 ANINHAVQIVGYDNYS 313
             ++H V +VGY++ S
Sbjct: 283 KQLDHGVLLVGYNDNS 298


>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
          Length = 351

 Score =  116 bits (290), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 89/308 (28%), Positives = 146/308 (47%), Gaps = 31/308 (10%)

Query: 7   VLFIVALIALCFL-AIPVKVSK--PNLEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEK 62
           V  +   + LC + A P   S+  PN +  ++ F  +   Y + Y   +  +R F+ F+ 
Sbjct: 5   VQLVFLFLFLCAMWASPSAASRDEPN-DPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKN 63

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           ++  IE  N   ++  S   GI +F+D+++ EF  ++   S+  ++        D     
Sbjct: 64  NVKHIETFNSRNEN--SYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDD---- 117

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
                    + I + +P   DWR+ G + +V+NQ  CG+CW+F+ + T E ++ +K G L
Sbjct: 118 ---------VNI-SAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYL 167

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
             LS QEV+DCA   + GC GG      D++  N  V   E  YP L     C   +  P
Sbjct: 168 VSLSEQEVLDCA--VSYGCKGGWVNKAYDFIISNNGVTT-EENYPYLAYQGTCNANSF-P 223

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQYNCDGSLANIN 301
           N   I  Y+       E S++  ++   P+ A ++A   +QYY GGV    C  SL   N
Sbjct: 224 NSAYITGYSY-VRRNDERSMMYAVSNQ-PIAALIDASENFQYYNGGVFSGPCGTSL---N 278

Query: 302 HAVQIVGY 309
           HA+ I+GY
Sbjct: 279 HAITIIGY 286


>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
          Length = 360

 Score =  115 bits (289), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 129/278 (46%), Gaps = 27/278 (9%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY KSY S +E   RF+ F +SL ++   N+   S    R GI  F+D+S EEF
Sbjct: 59  FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLS---YRLGINRFADMSWEEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L  +H+                       +P  KDWRE GI+  V+
Sbjct: 116 RATRLGAAQNCSATLTGNHRMR----------------AAAVALPETKDWREDGIVSPVK 159

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
           NQ  CG+CW FST    E+ +    G    LS Q+++DC     N GC+GG      +++
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYI 219

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP    +  CK K  +  GVK+   + +  + +E  +   +    PV 
Sbjct: 220 KYNG-GLDTEESYPYQGVNGICKFKNENV-GVKVLD-SVNITLGAEDELKDAVGLVRPVS 276

Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
            A   +T ++ Y  GV   + C  +  ++NHAV  VGY
Sbjct: 277 VAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGY 314


>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
           SV=1
          Length = 321

 Score =  115 bits (289), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 83/305 (27%), Positives = 153/305 (50%), Gaps = 38/305 (12%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEE 69
           VA + LC LA+    + P+ +        F+ +Y + Y  ++ ++ R + F+++  +IE+
Sbjct: 3   VAALFLCGLALAT--ASPSWDH-------FKTQYGRKYGDAKEELYRQRVFQQNEQLIED 53

Query: 70  LNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            NK  ++ E + +  + +F D++ EEF           + +M  +K      +  + +++
Sbjct: 54  FNKKFENGEVTFKVAMNQFGDMTNEEF-----------NAVMKGYKKG----SRGEPKAV 98

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
            T    P    V  DWR   ++  V++Q+ CG+CWAFS     E  H LKN  L  LS Q
Sbjct: 99  FTAEAGPMAADV--DWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQ 156

Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
           +++DC+ + GN GC GG   +  D++  N  + + ES YP   +D +C+  A S   +  
Sbjct: 157 QLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI-DTESSYPYEAEDRSCRFDANSIGAICT 215

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINHAV 304
            S     +  +E ++   ++  GP+  A++A   ++Q+Y  GV  + NC  +   ++H V
Sbjct: 216 GSV---EVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTF--LDHGV 270

Query: 305 QIVGY 309
             VGY
Sbjct: 271 LAVGY 275


>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
           GN=CEP1 PE=2 SV=1
          Length = 361

 Score =  115 bits (288), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 90/311 (28%), Positives = 139/311 (44%), Gaps = 31/311 (9%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQK--------LELFSSFQQRYKKSYSKSEHDIRFKNF 60
           FIV  +ALC L +       +   K         EL+  ++  +  + S  E   RF  F
Sbjct: 4   FIV--LALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLEEKAKRFNVF 61

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           + ++  I E NK     +S +  + +F D++ EEF+  +   ++       HH+      
Sbjct: 62  KHNVKHIHETNK---KDKSYKLKLNKFGDMTSEEFRRTYAGSNI------KHHRMFQGEK 112

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
              K        T+PT +    DWR+ G +  V+NQ  CG+CWAFSTV   E ++ ++  
Sbjct: 113 KATKSFMYANVNTLPTSV----DWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTK 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L+ LS QE++DC  N N GC+GG      +++   K  L  E  YP    D  C     
Sbjct: 169 KLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIK-EKGGLTSELVYPYKASDETCDTNKE 227

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLA 298
           +   V I  +  D    SE  ++  +A   PV  A++A    +Q+Y  GV    C   L 
Sbjct: 228 NAPVVSIDGHE-DVPKNSEDDLMKAVANQ-PVSVAIDAGGSDFQFYSEGVFTGRCGTEL- 284

Query: 299 NINHAVQIVGY 309
             NH V +VGY
Sbjct: 285 --NHGVAVVGY 293


>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
          Length = 358

 Score =  114 bits (286), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y    E  +RF  F+++LD+I   NK   S    + G+ +F+DL+ +EF
Sbjct: 59  FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADLTWQEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L   HK                       +P  KDWRE GI+  V+
Sbjct: 116 QRTKLGAAQNCSATLKGSHK------------------VTEAALPETKDWREDGIVSPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC+GG      +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP   KD  CK  A +  GV++ + + +  + +E  +   +    PV 
Sbjct: 218 KSNG-GLDTEKAYPYTGKDETCKFSAENV-GVQVLN-SVNITLGAEDELKHAVGLVRPVS 274

Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            A   + +++ Y  GV    +C  +  ++NHAV  VGY
Sbjct: 275 IAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
           PE=1 SV=2
          Length = 458

 Score =  114 bits (285), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 82/291 (28%), Positives = 135/291 (46%), Gaps = 24/291 (8%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSP-ESA 80
           V   + + E+   L++ ++  + KSY+   E + R+  F  +L  I+E N    +   S 
Sbjct: 26  VSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSF 85

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
           R G+  F+DL+ EE++  +L             ++       V  R +         +P 
Sbjct: 86  RLGLNRFADLTNEEYRDTYL-----------GLRNKPRRERKVSDRYLAAD---NEALPE 131

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
             DWR  G + ++++Q  CG+CWAFS +   E ++ +  G L  LS QE++DC  + N G
Sbjct: 132 SVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEG 191

Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
           C+GG      D++ +N   ++ E +YP   KD  C     +   V I SY  D    SE+
Sbjct: 192 CNGGLMDYAFDFI-INNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYE-DVTPNSET 249

Query: 261 SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           S+   +A   PV  A+ A    +Q Y  G+    C  +L   +H V  VGY
Sbjct: 250 SLQKAVANQ-PVSVAIEAGGRAFQLYSSGIFTGKCGTAL---DHGVAAVGY 296


>sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. japonica GN=Os09g0442300
           PE=2 SV=2
          Length = 362

 Score =  114 bits (284), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 85/279 (30%), Positives = 129/279 (46%), Gaps = 30/279 (10%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  R+ K Y   +E   RF+ F +SL+++   N+ R  P   R GI  F+D+S EEF
Sbjct: 62  FARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNR-RGLPY--RLGINRFADMSWEEF 118

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L  +H+  D                    +P  KDWRE GI+  V+
Sbjct: 119 QASRLGAAQNCSATLAGNHRMRD-----------------AAALPETKDWREDGIVSPVK 161

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST  + E+ +    G    LS Q+++DCA    N GCSGG      +++
Sbjct: 162 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 221

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
             N   L+ E  YP    +  C  K   P  V +K   + +  + +E  +   +    PV
Sbjct: 222 KYNG-GLDTEEAYPYTGVNGICHYK---PENVGVKVLDSVNITLGAEDELKNAVGLVRPV 277

Query: 273 IAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
             A   +  ++ Y  GV   + C  S  ++NHAV  VGY
Sbjct: 278 SVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGY 316


>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
          Length = 334

 Score =  114 bits (284), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 86/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)

Query: 25  VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
           ++ P  +Q     +  ++  +++ Y  +E + R   +EK++ +I+  N    + +     
Sbjct: 16  LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75

Query: 83  GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
            +  F D++ EEF     R  VN +           H  H K R     + +   IP   
Sbjct: 76  EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118

Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
           DWRE G +  V+NQ  CG+CWAFS     E    LK G L  LS Q ++DC+   GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
           +GG       ++  N   L+ E  YP   KD +CK +A       + + T    IP  E 
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233

Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +++  +AT GP+  A++A   + Q+Y  G I Y  + S  N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283


>sp|P54639|CYSP4_DICDI Cysteine proteinase 4 OS=Dictyostelium discoideum GN=cprD PE=2 SV=2
          Length = 442

 Score =  114 bits (284), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/306 (29%), Positives = 151/306 (49%), Gaps = 34/306 (11%)

Query: 13  LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
           L  LC L +    +K      Q    F+++ Q ++++YS  E + R++ F+ ++D + + 
Sbjct: 4   LSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQIFKSNMDYVHQW 63

Query: 71  NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           N   +  E+   G+  F+D++ +E++T +L    +   L+   +         +K   T 
Sbjct: 64  NS--KGGETV-LGLNVFADITNQEYRTTYLGTPFDGSALIGTEE---------EKIFSTP 111

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT---LSLLSV 187
             T+        DWR  G +  ++NQ  CG CW+FST  + E  H + +GT   L  LS 
Sbjct: 112 APTV--------DWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSE 163

Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA-ACKRKATSPNGV 245
           Q +IDC+ + GN GC GG      +++ +N   ++ ES YP   +D   CK K TS  G 
Sbjct: 164 QNLIDCSKSYGNNGCEGGLMTLAFEYI-INNKGIDTESSYPYTAEDGKECKFK-TSNIGA 221

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHA 303
           +I SY  +    SE+S L   + + PV  A++A   ++Q Y  G I Y    S   ++H 
Sbjct: 222 QIVSYQ-NVTSGSEAS-LQSASNNAPVSVAIDASNESFQLYESG-IYYEPACSPTQLDHG 278

Query: 304 VQIVGY 309
           V +VGY
Sbjct: 279 VLVVGY 284


>sp|Q94503|CYSP6_DICDI Cysteine proteinase 6 OS=Dictyostelium discoideum GN=cprF PE=2 SV=1
          Length = 434

 Score =  113 bits (283), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 87/305 (28%), Positives = 143/305 (46%), Gaps = 33/305 (10%)

Query: 13  LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
           L ALC L + V  +K  L   Q    F+++   +++ YS  E + RF  F+ ++D I E 
Sbjct: 4   LSALCVLLVSVATAKQQLSELQYRNAFTNWMIAHQRHYSSEEFNGRFNIFKANMDYINEW 63

Query: 71  NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           N   +  E+   G+  F+D++ EE++  +L    +   L       +     V+  S+  
Sbjct: 64  NT--KGSETV-LGLNVFADITNEEYRATYLGTPFDASSL--EMTPSEKVFGGVQANSV-- 116

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV--Q 188
                       DWR  G +  ++NQ  CG CW+FS     E    + NG   L SV  Q
Sbjct: 117 ------------DWRAKGAVTPIKNQGECGGCWSFSATGATEGAQYIANGDSDLTSVSEQ 164

Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
           ++IDC+G+ GN GC GG      +++ +N   ++ ES YP       CK   ++  G ++
Sbjct: 165 QLIDCSGSYGNNGCEGGLMTLAFEYI-INNGGIDTESSYPFTANTEKCKYNPSNI-GAEL 222

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDG-SLANINHAV 304
            SY  +    SES +   + T GP   A++A   ++Q+Y  G+  YN    S   ++H V
Sbjct: 223 SSYV-NVTSGSESDLAAKV-TQGPTSVAIDASQPSFQFYSSGI--YNEPACSSTQLDHGV 278

Query: 305 QIVGY 309
             VG+
Sbjct: 279 LAVGF 283


>sp|O10364|CATV_NPVOP Viral cathepsin OS=Orgyia pseudotsugata multicapsid polyhedrosis
           virus GN=VCATH PE=3 SV=1
          Length = 324

 Score =  113 bits (283), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 149/301 (49%), Gaps = 32/301 (10%)

Query: 14  IALCFLAIPV-KVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELN 71
           I LC L   V   +  +L +    F  F  ++ K+YS +SE   RFK F+ +L+  E +N
Sbjct: 4   IMLCLLVCGVVHAATYDLLKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLE--EIIN 61

Query: 72  KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS-HHKHHDHHHNHVKKRSITT 130
           KN Q+  +A+Y I +FSDLS+EE        +++K+  +S  H+  +     +  R    
Sbjct: 62  KN-QNDSTAQYEINKFSDLSKEE--------AISKYTGLSLPHQTQNFCEVVILDRPPDR 112

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
           G       P++ DWR+   +  V+NQ  CGACWAF+T+ + ES  A+K   L  LS Q+ 
Sbjct: 113 G-------PLEFDWRQFNKVTSVKNQGVCGACWAFATLGSLESQFAIKYNRLINLSEQQF 165

Query: 191 IDCAGNGNMGCSGGDF-CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           IDC    N GC GG    A    M++  V +  ES+YP    +  C+    +PN   +  
Sbjct: 166 IDC-DRVNAGCDGGLLHTAFESAMEMGGVQM--ESDYPYETANGQCR---INPNRFVVGV 219

Query: 250 YTCDTLIPSESSILTDIATH-GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
            +C   I      L D+    GP+  A++A     Y  G+++   +  L   NHAV +VG
Sbjct: 220 RSCRRYIVMFEEKLKDLLRAVGPIPVAIDASDIVNYRRGIMRQCANHGL---NHAVLLVG 276

Query: 309 Y 309
           Y
Sbjct: 277 Y 277


>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
          Length = 348

 Score =  113 bits (282), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 79/280 (28%), Positives = 134/280 (47%), Gaps = 28/280 (10%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           E+ ++LF+S+   + K Y   +  + RF+ F+ +L+ I+E NK   S      G+ EF+D
Sbjct: 42  ERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNS---YWLGLNEFAD 98

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           LS +EF  +++   ++  +  S+ +   +                   +P   DWR+ G 
Sbjct: 99  LSNDEFNEKYVGSLIDATIEQSYDEEFINEDT--------------VNLPENVDWRKKGA 144

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  VR+Q +CG+CWAFS V T E ++ ++ G L  LS QE++DC    + GC GG     
Sbjct: 145 VTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSH-GCKGGYPPYA 203

Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
           L+++  N + L   S+YP   K   C+ K     G  +K+     + P+    L +    
Sbjct: 204 LEYVAKNGIHL--RSKYPYKAKQGTCRAKQVG--GPIVKTSGVGRVQPNNEGNLLNAIAK 259

Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
            PV   V +    +Q Y GG+ +  C      ++HAV  V
Sbjct: 260 QPVSVVVESKGRPFQLYKGGIFEGPCG---TKVDHAVTAV 296


>sp|P25775|LMCPA_LEIME Cysteine proteinase A OS=Leishmania mexicana GN=LMCPA PE=2 SV=1
          Length = 354

 Score =  113 bits (282), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 149/316 (47%), Gaps = 32/316 (10%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSYS-KSEHDIRFKNFEKS 63
            + +  L  +C+ +  +  + P ++  +    + SF++R+ K++   +E   RF  F+++
Sbjct: 10  AIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAFKQN 69

Query: 64  LDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           +     LN   Q+P  A Y ++ +F+DL+ +EF   +L    N      H K+H      
Sbjct: 70  MQTAYFLNT--QNPH-AHYDVSGKFADLTPQEFAKLYL----NPDYYARHLKNH------ 116

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
             K  +    + P+G+ +  DWR+ G +  V+NQ  CG+CWAFS +   E   A    +L
Sbjct: 117 --KEDVHVDDSAPSGV-MSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSL 173

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAA---CKRK 238
             LS Q ++ C  N + GC+GG     ++W M  +   +  E+ YP          C  +
Sbjct: 174 VSLSEQMLVSC-DNIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCHDE 232

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
                G KI  +   +L   E  I   +   GPV  AV+A TWQ Y GGV+      SL 
Sbjct: 233 GEV--GAKITGFL--SLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSLCLAWSL- 287

Query: 299 NINHAVQIVGYDNYSR 314
             NH V IVG++  ++
Sbjct: 288 --NHGVLIVGFNKNAK 301


>sp|Q9YWK4|CATV_NPVBS Viral cathepsin OS=Buzura suppressaria nuclear polyhedrosis virus
           GN=VCATH PE=3 SV=1
          Length = 331

 Score =  112 bits (281), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 90/278 (32%), Positives = 137/278 (49%), Gaps = 28/278 (10%)

Query: 35  ELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           + F +F   Y K Y+  SE + RF  F+++L   EE+N   +  +SA Y I +F+DLS+ 
Sbjct: 29  DYFETFLANYNKMYNDTSEKERRFSIFQQTL---EEINYKNRLNDSAVYQINKFADLSKN 85

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI-PVKKDWREAGIIGK 152
           E  +++    +N  V  +         N  K    T  I  P G  P+  DWR+   +  
Sbjct: 86  EIISKYT--GLNMPVQTT---------NFCK----TIVIDQPPGKGPLNFDWRQQNKVTS 130

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           ++NQ+ CGACWAF+T+ + ES +A+KN     LS Q++IDC    +MGC GG      + 
Sbjct: 131 IKNQKACGACWAFATLASIESQYAIKNNVHIDLSEQQMIDC-DYVDMGCDGGLLHTAFEQ 189

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH-GP 271
           M +    L  E EYP    +  C+ +      VK+K   C   +      L D+    GP
Sbjct: 190 M-IQMGELVQEHEYPYAGVNKPCELRGDETGVVKVKG--CYRYVVFREEKLKDLLRAVGP 246

Query: 272 VIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +  A++A     Y  G+I Y C+     +NHAV +VGY
Sbjct: 247 IPMAIDASGIVNYHHGIIHY-CENY--GLNHAVLLVGY 281


>sp|P35591|CYSP1_LEIPI Cysteine proteinase 1 OS=Leishmania pifanoi GN=CYS1 PE=2 SV=2
          Length = 354

 Score =  112 bits (281), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 148/316 (46%), Gaps = 32/316 (10%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSYS-KSEHDIRFKNFEKS 63
            + +  L  +C+ +  +  + P ++  +    + SF++R+ K++   +E   RF  F+++
Sbjct: 10  AIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAFKQN 69

Query: 64  LDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           +     LN   Q+P  A Y ++ +F+DL+ +EF   +L    N      H K H      
Sbjct: 70  MQTAYFLNT--QNPH-AHYDVSGKFADLTPQEFAKLYL----NPDYYARHLKDH------ 116

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
             K  +    + P+G+ +  DWR+ G +  V+NQ  CG+CWAFS +   E   A    +L
Sbjct: 117 --KEDVHVDDSAPSGV-MSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSL 173

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAA---CKRK 238
             LS Q ++ C  N + GC+GG     ++W M  +   +  E+ YP          C  +
Sbjct: 174 VSLSEQMLVSC-DNIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCHDE 232

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
                G KI  +   +L   E  I   +   GPV  AV+A TWQ Y GGV+      SL 
Sbjct: 233 GEV--GAKITGFL--SLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSLCLAWSL- 287

Query: 299 NINHAVQIVGYDNYSR 314
             NH V IVG++  ++
Sbjct: 288 --NHGVLIVGFNKNAK 301


>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
          Length = 362

 Score =  112 bits (280), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 76/281 (27%), Positives = 134/281 (47%), Gaps = 21/281 (7%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
           E   +L+  ++  +  S S  E   RF  F+ ++  +   NK     +  +  + +F+D+
Sbjct: 34  ESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANVMHVHNTNK---MDKPYKLKLNKFADM 90

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           +  EF++ +    VN H +    +H      + K  S+          P   DWR+ G +
Sbjct: 91  TNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSV----------PASVDWRKKGAV 140

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
             V++Q  CG+CWAFST+   E ++ +K   L  LS QE++DC    N GC+GG   +  
Sbjct: 141 TDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAF 200

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           +++   K  +  ES YP   ++  C     +   V I  +  +  +  E+++L  +A   
Sbjct: 201 EFIK-QKGGITTESNYPYTAQEGTCDESKVNDLAVSIDGHE-NVPVNDENALLKAVANQ- 257

Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           PV  A++A    +Q+Y  GV   +C+    ++NH V IVGY
Sbjct: 258 PVSVAIDAGGSDFQFYSEGVFTGDCN---TDLNHGVAIVGY 295


>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
          Length = 334

 Score =  112 bits (280), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 87/294 (29%), Positives = 139/294 (47%), Gaps = 35/294 (11%)

Query: 25  VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
           ++ P  +Q     +  ++  +++ Y  +E + R   +EK++ +I+  N       + ++G
Sbjct: 16  LATPKFDQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEY---SNGKHG 72

Query: 84  IT----EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
            T     F D++ EEF     R  VN +           H  H K R     + +   IP
Sbjct: 73  FTMEMNAFGDMTNEEF-----RQIVNGY----------RHQKHKKGRLFQEPLMLQ--IP 115

Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGN 198
              DWRE G +  V+NQ  CG+CWAFS     E    LK G L  LS Q ++DC+   GN
Sbjct: 116 KTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGN 175

Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP- 257
            GC+GG       ++  N   L+ E  YP   KD +CK +A       + + T    IP 
Sbjct: 176 QGCNGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----YAVANDTGFVDIPQ 230

Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            E +++  +AT GP+  A++A   + Q+Y  G I Y  + S  +++H V +VGY
Sbjct: 231 QEKALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKDLDHGVLVVGY 283


>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
           SV=1
          Length = 462

 Score =  112 bits (279), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 79/284 (27%), Positives = 141/284 (49%), Gaps = 34/284 (11%)

Query: 34  LELFSSFQQRYKKSYSKS---EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
           + ++ ++  ++ K+ S++   E D RF+ F+ +L  ++E N+   S    R G+T F+DL
Sbjct: 47  MSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLS---YRLGLTRFADL 103

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           + +E+++++L   + K                 ++ S+     +   +P   DWR+ G +
Sbjct: 104 TNDEYRSKYLGAKMEKK--------------GERRTSLRYEARVGDELPESIDWRKKGAV 149

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
            +V++Q  CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG    L+
Sbjct: 150 AEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGG----LM 205

Query: 211 DW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
           D+     +    ++ + +YP    D  C +   +   V I SY  D    SE S+   +A
Sbjct: 206 DYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYE-DVPTYSEESLKKAVA 264

Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            H P+  A+ A    +Q Y  G+   +C   L   +H V  VGY
Sbjct: 265 -HQPISIAIEAGGRAFQLYDSGIFDGSCGTQL---DHGVVAVGY 304


>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
          Length = 362

 Score =  112 bits (279), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 128/277 (46%), Gaps = 26/277 (9%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY KSY S +E   RF+ F +SL   EE+    +     R GI  FSD+S EEF
Sbjct: 61  FARFAVRYGKSYESAAEVRRRFRIFSESL---EEVRSTNRKGLPYRLGINRFSDMSWEEF 117

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +   L  +      ++         NH+ + +          +P  KDWRE GI+  V+N
Sbjct: 118 QATRLGAAQTCSATLAG--------NHLMRDA--------AALPETKDWREDGIVSPVKN 161

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
           Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC+GG      +++ 
Sbjct: 162 QAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIK 221

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
            N  + + E  YP    +  C  KA +   V++   + +  + +E  +   +    PV  
Sbjct: 222 YNGGI-DTEESYPYKGVNGVCHYKAENA-AVQVLD-SVNITLNAEDELKNAVGLVRPVSV 278

Query: 275 AVNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
           A   +    QY  G     +C  +  ++NHAV  VGY
Sbjct: 279 AFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGY 315


>sp|Q9YMP9|CATV_NPVLD Viral cathepsin OS=Lymantria dispar multicapsid nuclear
           polyhedrosis virus GN=VCATH PE=3 SV=1
          Length = 356

 Score =  112 bits (279), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 137/284 (48%), Gaps = 25/284 (8%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           NL++  + F SF + Y K+Y+   E + R+  F+ +L  I   N N     +A Y I +F
Sbjct: 48  NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKF 107

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           SDLS+ E   +    S+ + V            N  K   +      P   P+  DWRE 
Sbjct: 108 SDLSKSELIAKFTGLSIPERV-----------SNFCKTIILNQP---PDKGPLHFDWREQ 153

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF- 206
             +  ++NQ  CGACWAF+T+ + ES  A+++  L  LS Q++IDC  + +MGC+GG   
Sbjct: 154 NKVTSIKNQGACGACWAFATLASVESQFAMRHNRLIDLSEQQLIDC-DSVDMGCNGGLLH 212

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
            A  + M +  V  + E +YP + ++  C      P  V +    C   +      L D+
Sbjct: 213 TAFEEIMRMGGV--QTELDYPFVGRNRRCGLDRHRPYVVSLVG--CYRYVMVNEEKLKDL 268

Query: 267 ATH-GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               GP+  A++A     Y  GVI  +C+ +   +NHAV +VGY
Sbjct: 269 LRAVGPIPMAIDAADIVNYYRGVIS-SCENN--GLNHAVLLVGY 309


>sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1
          Length = 467

 Score =  110 bits (276), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 82/294 (27%), Positives = 134/294 (45%), Gaps = 24/294 (8%)

Query: 21  IPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE 78
           +P   +  + E+ L   F+ F+Q++ + Y S +E   R   F ++L  +  L+    +  
Sbjct: 21  VPAATASLHAEETLTSQFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHA--AANP 77

Query: 79  SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
            A +G+T FSDL+ EEF++R+             H    H     ++  +   + +  G 
Sbjct: 78  HATFGVTPFSDLTREEFRSRY-------------HNGAAHFAAAQERARVPVKVEV-VGA 123

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
           P   DWR  G +  V++Q  CG+CWAFS +   E    L    L+ LS Q ++ C    +
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSC-DKTD 182

Query: 199 MGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
            GCSGG      +W+   N   +  E  YP    +       TS + V         L  
Sbjct: 183 SGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQ 242

Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
            E+ I   +A +GPV  AV+A +W  Y GGV+  +C      ++H V +VGY++
Sbjct: 243 DEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT-SCVSE--QLDHGVLLVGYND 293


>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
          Length = 334

 Score =  110 bits (275), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 87/304 (28%), Positives = 144/304 (47%), Gaps = 33/304 (10%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
           L ALC   + +  + P L+Q L+  +  ++  + + Y  +E   R   +EK++ +IE  N
Sbjct: 7   LTALC---LGIASAAPKLDQNLDADWYKWKATHGRLYGMNEEGWRRAVWEKNMKMIELHN 63

Query: 72  KN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           +   Q        +  F D++ EEF+            +M+  ++  H    V   S+  
Sbjct: 64  QEYSQGKHGFSMAMNAFGDMTNEEFRQ-----------VMNGFQNQKHKKGKVFHESLV- 111

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
            + +P  +    DWRE G +  V+NQ  CG+CWAFS     E     K G L  LS Q +
Sbjct: 112 -LEVPKSV----DWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166

Query: 191 IDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA-ACKRKATSPNGVKIK 248
           +DC+   GN GC+GG       ++  N   L+ E  YP L ++  +C  K          
Sbjct: 167 VDCSRPQGNQGCNGGLMDNAFQYVKDNG-GLDTEESYPYLGRETNSCTYKPE----CSAA 221

Query: 249 SYTCDTLIPS-ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
           + T    IP  E +++  +AT GP+  A++A   ++Q+Y  G I Y+ D S  +++H V 
Sbjct: 222 NDTGFVDIPQREKALMKAVATVGPISVAIDAGHSSFQFYKSG-IYYDPDCSSKDLDHGVL 280

Query: 306 IVGY 309
           +VGY
Sbjct: 281 VVGY 284


>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
          Length = 331

 Score =  110 bits (274), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 86/311 (27%), Positives = 142/311 (45%), Gaps = 34/311 (10%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDII 67
           ++V L+ LC  A+      P L+    L+   ++ Y K Y +   ++ R   +EK+L  +
Sbjct: 3   WLVGLLPLCSYAVAQVHKDPTLDHHWNLW---KKTYSKQYKEENEEVARRLIWEKNLKFV 59

Query: 68  EELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
              N ++     S   G+    D++ EE  +           LM   +       +V  R
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTGEEVIS-----------LMGSLRVPSQWQRNVTYR 108

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
           S +        +P   DWRE G + +V+ Q +CGACWAFS V   E+   LK G L  LS
Sbjct: 109 SNSN-----QKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLS 163

Query: 187 VQEVIDCAGN--GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
            Q ++DC+    GN GC+GG       ++ ++   ++ E+ YP    +  C+  +     
Sbjct: 164 AQNLVDCSTEKYGNKGCNGGFMTTAFQYI-IDNNGIDSEASYPYKAMNGKCRYDS----- 217

Query: 245 VKIKSYTCD--TLIP--SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
            K ++ TC   T +P  SE ++   +A  GPV  A++A  + ++L     Y       N+
Sbjct: 218 -KKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNV 276

Query: 301 NHAVQIVGYDN 311
           NH V +VGY N
Sbjct: 277 NHGVLVVGYGN 287


>sp|O60911|CATL2_HUMAN Cathepsin L2 OS=Homo sapiens GN=CTSL2 PE=1 SV=2
          Length = 334

 Score =  110 bits (274), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 87/319 (27%), Positives = 147/319 (46%), Gaps = 39/319 (12%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           + L A C L I   V  P  +Q L+  +  ++  +++ Y  +E   R   +EK++ +IE 
Sbjct: 5   LVLAAFC-LGIASAV--PKFDQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIEL 61

Query: 70  LNKN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            N    Q        +  F D++ EEF+            +M   ++       V +  +
Sbjct: 62  HNGEYSQGKHGFTMAMNAFGDMTNEEFRQ-----------MMGCFRNQKFRKGKVFREPL 110

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
              + +P  +    DWR+ G +  V+NQ+ CG+CWAFS     E     K G L  LS Q
Sbjct: 111 F--LDLPKSV----DWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164

Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
            ++DC+   GN GC+GG       ++  N   L+ E  YP +  D  CK +  +     +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMARAFQYVKENG-GLDSEESYPYVAVDEICKYRPEN----SV 219

Query: 248 KSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
            + T  T++    E +++  +AT GP+  A++A   ++Q+Y  G I +  D S  N++H 
Sbjct: 220 ANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSG-IYFEPDCSSKNLDHG 278

Query: 304 VQIVGY------DNYSRTW 316
           V +VGY       N S+ W
Sbjct: 279 VLVVGYGFEGANSNNSKYW 297


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.319    0.133    0.406 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 117,397,920
Number of Sequences: 539616
Number of extensions: 4778598
Number of successful extensions: 20595
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 228
Number of HSP's successfully gapped in prelim test: 88
Number of HSP's that attempted gapping in prelim test: 19037
Number of HSP's gapped (non-prelim): 971
length of query: 317
length of database: 191,569,459
effective HSP length: 117
effective length of query: 200
effective length of database: 128,434,387
effective search space: 25686877400
effective search space used: 25686877400
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (28.1 bits)