BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy2558
         (348 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 OS=Drosophila melanogaster
           GN=CG12163 PE=2 SV=2
          Length = 614

 Score =  287 bits (735), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 143/321 (44%), Positives = 200/321 (62%), Gaps = 9/321 (2%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
           H    V H  LF  F  +  + Y +  E   RL IF  NL+ I+ L   E GS  YG+ E
Sbjct: 299 HRFDKVDH--LFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITE 356

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCG 148
           F+D++++E++ +   ++   + A     A++P     LP+ FDWR+ DAVT VK+Q  CG
Sbjct: 357 FADMTSSEYKERTGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCG 416

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
           S WAFS TGNIEG+YA KT +L   SEQEL+DCD  D  C GG + NA+  I  K  GGL
Sbjct: 417 SCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGL 474

Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQ 267
           E E  YPY+     C  N+  + V++ G+V + + +ET M ++L+ NGP+++ INA A+Q
Sbjct: 475 EYEAEYPYKAKKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANAMQ 534

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
           FY  GVSHP +  C    +NL H VL+VGYGV      HK +PYWI+KNSWG  WGE+GY
Sbjct: 535 FYRGGVSHPWKALC--SKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGY 592

Query: 328 FRLYRGDGSCGINDYVRSALV 348
           +R+YRGD +CG+++   SA++
Sbjct: 593 YRVYRGDNTCGVSEMATSAVL 613


>sp|Q26534|CATL_SCHMA Cathepsin L OS=Schistosoma mansoni GN=CL1 PE=2 SV=1
          Length = 319

 Score =  266 bits (680), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 138/289 (47%), Positives = 183/289 (63%), Gaps = 11/289 (3%)

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAM 120
           R +IF  N+ K QL Q    GS +YG+  +SDL+T EF   +L    + PS    +  ++
Sbjct: 39  RFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTTDEFARTHLTASWVVPSSRSNTPTSL 98

Query: 121 IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
              +  +P+ FDWRE  AVT VK+Q MCGS WAFSTTGN+E  +  KT KL+SLSEQ+L+
Sbjct: 99  GKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLV 158

Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
           DCD  DDGC GG  SNA+++I+    GGL  E  YPY   ++ C L      V IN  V+
Sbjct: 159 DCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPYDAKNEKCHLKTDGVAVYINSSVN 216

Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
           +++DET++A +L  N  ++V +NA  LQFY  G+SHP   FC      L H+VL+VGYGV
Sbjct: 217 LTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPWWIFCS--KYLLDHAVLLVGYGV 274

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                + K  P+WI+KNSWG  WGE GYFR+YRGDGSCGIN    SA++
Sbjct: 275 -----SEKNEPFWIVKNSWGVEWGENGYFRMYRGDGSCGINTVATSAMI 318


>sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2 SV=1
          Length = 363

 Score =  253 bits (646), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 140/336 (41%), Positives = 202/336 (60%), Gaps = 25/336 (7%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
            VV +E+ H L+   H   F  F  + +K+YAT  E+  R  +F  NL K +L Q+ +  
Sbjct: 32  QVVDNEEDHLLNAEHH---FTSFKSKFSKSYATKEEHDYRFGVFKSNLIKAKLHQNRD-P 87

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           +  +G+ +FSDL+ +EF+ ++LG K +   P++A ++   ++P   LP  FDWRE  AVT
Sbjct: 88  TAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKA--PILPTTNLPEDFDWREKGAVT 145

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
            VKDQ  CGS WAFSTTG +EG +   T KLVSLSEQ+L+DCD           D GC G
Sbjct: 146 PVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNG 205

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
           G ++NAF+ ++    GG+ +EK Y Y G D +C+ +K      ++ +  V+ DE  +A  
Sbjct: 206 GLMNNAFEYLLE--SGGVVQEKDYAYTGRDGSCKFDKSKVVASVSNFSVVTLDEDQIAAN 263

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
           LV+NGP+AVAINA  +Q Y++GVS P  + C      L H VL+VG+G         K  
Sbjct: 264 LVKNGPLAVAINAAWMQTYMSGVSCP--YVC--AKSRLDHGVLLVGFGKGAYAPIRLKEK 319

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           PYWIIKNSWG+ WGE+GY+++ RG   CG++  V +
Sbjct: 320 PYWIIKNSWGQNWGEQGYYKICRGRNVCGVDSMVST 355


>sp|Q9R013|CATF_MOUSE Cathepsin F OS=Mus musculus GN=Ctsf PE=2 SV=1
          Length = 462

 Score =  251 bits (640), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 142/333 (42%), Positives = 197/333 (59%), Gaps = 11/333 (3%)

Query: 17  VSVSSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
            + SSF+ + D + L     VK   LF  F+  +N+TY +  E   RL +F+ N+ + Q 
Sbjct: 139 ATFSSFLPLLDKDPLPQDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQK 198

Query: 76  LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
           +Q  + G+  YG+ +FSDL+  EF   YL   L+     +  PA   N   P  +DWR+ 
Sbjct: 199 IQALDRGTAQYGITKFSDLTEEEFHTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKK 258

Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISN 195
            AVT VK+Q MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SN
Sbjct: 259 GAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSN 318

Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
           A+  I  K  GGLE E  Y Y+G  + C  + +  +V IN  V +SR+E  +A +L + G
Sbjct: 319 AYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKG 376

Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
           P++VAINA+ +QFY  G++HP +  C      + H+VL+VGYG          +PYW IK
Sbjct: 377 PISVAINAFGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIK 428

Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           NSWG  WGE+GY+ LYRG G+CG+N    SA+V
Sbjct: 429 NSWGSDWGEEGYYYLYRGSGACGVNTMASSAVV 461


>sp|P43295|A494_ARATH Probable cysteine proteinase A494 OS=Arabidopsis thaliana
           GN=At2g21430 PE=2 SV=2
          Length = 361

 Score =  242 bits (618), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 145/359 (40%), Positives = 209/359 (58%), Gaps = 37/359 (10%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL--------FNYFLEQHNKTYATLVEYYS 61
           V+L+ + VSVS   V GDE +     V  T          F  F ++  K Y ++ E+Y 
Sbjct: 11  VSLIFVFVSVS---VCGDEDVLIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHYY 67

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSV 117
           R  +F  NL +    Q  +  S  +G+ +FSDL+ +EF+ K+LG    FKL P  A+++ 
Sbjct: 68  RFSVFKANLLRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHLGVKGGFKL-PKDANQA- 124

Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
             ++P   LP  FDWR+  AVT VK+Q  CGS W+FSTTG +EG +   T KLVSLSEQ+
Sbjct: 125 -PILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQ 183

Query: 178 LIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNK 227
           L+DCD E         D GC GG +++AF+  +    GGL  EK YPY G D  +C+L++
Sbjct: 184 LVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKT--GGLMREKDYPYTGTDGGSCKLDR 241

Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNEN 287
                 ++ +  VS +E  +A  L++NGP+AVAINA  +Q Y+ GVS P  + C   +  
Sbjct: 242 SKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCP--YIC---SRR 296

Query: 288 LSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           L+H VL+VGYG    ++   K  PYWIIKNSWGE WGE G++++ +G   CG++  V +
Sbjct: 297 LNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVST 355


>sp|Q9UBX1|CATF_HUMAN Cathepsin F OS=Homo sapiens GN=CTSF PE=1 SV=1
          Length = 484

 Score =  242 bits (618), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 136/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 165 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 224

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 225 DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 284

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 285 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 344

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y+G  ++C  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 345 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 402

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKNSWG
Sbjct: 403 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 454

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 455 TDWGEKGYYYLHRGSGACGVNTMASSAVV 483


>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis thaliana GN=RD19A PE=2
           SV=1
          Length = 368

 Score =  240 bits (613), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 141/340 (41%), Positives = 199/340 (58%), Gaps = 33/340 (9%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
            VVG  +   L    H   F+ F  +  K YA+  E+  R  +F  NLR+ +  Q  +  
Sbjct: 35  QVVGGAEPQVLTSEDH---FSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDP- 90

Query: 83  SGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
           S  +G+ +FSDL+ +EF+ K+LG    FKL P  A+++   ++P   LP  FDWR++ AV
Sbjct: 91  SATHGVTQFSDLTRSEFRKKHLGVRSGFKL-PKDANKA--PILPTENLPEDFDWRDHGAV 147

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCE 189
           T VK+Q  CGS W+FS TG +EG     T KLVSLSEQ+L+DCD E         D GC 
Sbjct: 148 TPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCN 207

Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMA 248
           GG +++AF+  +    GGL +E+ YPY G D K C+L+K      ++ +  +S DE  +A
Sbjct: 208 GGLMNSAFEYTLKT--GGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIA 265

Query: 249 KYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFT 305
             LV+NGP+AVAINA  +Q Y+ GVS P  + C      L+H VL+VGYG       +F 
Sbjct: 266 ANLVKNGPLAVAINAGYMQTYIGGVSCP--YIC---TRRLNHGVLLVGYGAAGYAPARFK 320

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            K  PYWIIKNSWGE WGE G++++ +G   CG++  V +
Sbjct: 321 EK--PYWIIKNSWGETWGENGFYKICKGRNICGVDSMVST 358


>sp|P04988|CYSP1_DICDI Cysteine proteinase 1 OS=Dictyostelium discoideum GN=cprA PE=1 SV=2
          Length = 343

 Score =  230 bits (586), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 137/355 (38%), Positives = 192/355 (54%), Gaps = 36/355 (10%)

Query: 11  ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
            L   TV VSS  +  +E+   L           F ++ NK Y+   EY  R  IF  NL
Sbjct: 8   VLAVFTVFVSSRGIPLEEQSQFLE----------FQDKFNKKYSH-EEYLERFEIFKSNL 56

Query: 71  RKIQ---LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN---I 124
            KI+   L+         +G+N+F+DLS+ EF+  YL  K      D  V   + +    
Sbjct: 57  GKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFIN 116

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
           ++P AFDWR   AVT VK+Q  CGS W+FSTTGN+EG +     KLVSLSEQ L+DCD E
Sbjct: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176

Query: 185 ----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVK 233
                     D+GC GG   NA++ I+    GG++ E +YPY  +    C  N      K
Sbjct: 177 CMEYEGEQACDEGCNGGLQPNAYNYIIKN--GGIQTESSYPYTAETGTQCNFNSANIGAK 234

Query: 234 INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
           I+ +  + ++ET MA Y+V  GP+A+A +A   QFY+ GV     F       +L H +L
Sbjct: 235 ISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGV-----FDIPCNPNSLDHGIL 289

Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           IVGY    T F  K +PYWI+KNSWG  WGE+GY  L RG  +CG++++V ++++
Sbjct: 290 IVGYSAKNTIF-RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343


>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1 PE=2 SV=1
          Length = 371

 Score =  223 bits (567), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 125/321 (38%), Positives = 176/321 (54%), Gaps = 27/321 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+++  K+Y    E+  RL +F  NLR+ +  Q  +  S  +G+ +FSDL+ AEF+ 
Sbjct: 48  FLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQLLDP-SAEHGVTKFSDLTPAEFRR 106

Query: 102 KYLGFKLKPSYADRSV------PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            YLG +       R +        ++P   LP  FDWR++ AV  VK+Q  CGS W+FS 
Sbjct: 107 TYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSA 166

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGG 206
           +G +EG +   T KL  LSEQ+ +DCD E         D GC GG ++ AF  +     G
Sbjct: 167 SGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQK--AG 224

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           GLE EK YPY G D  C+ +K      +  +  VS DE  ++  L+++GP+A+ INA  +
Sbjct: 225 GLESEKDYPYTGSDGKCKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYM 284

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEK 325
           Q Y+ GVS P  + C     +L H VL+VGYG         K  PYWIIKNSWGE WGE 
Sbjct: 285 QTYIGGVSCP--YIC---GRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGEN 339

Query: 326 GYFRLYRGD---GSCGINDYV 343
           GY+++ RG      CG++  V
Sbjct: 340 GYYKICRGSNVRNKCGVDSMV 360


>sp|Q8V5U0|CATV_NPVHZ Viral cathepsin OS=Heliothis zea nuclear polyhedrosis virus
           GN=VCATH PE=3 SV=1
          Length = 367

 Score =  221 bits (563), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 120/324 (37%), Positives = 182/324 (56%), Gaps = 31/324 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ-----------LLQDTEHGSGVYGLNE 90
           F +FL+Q+NK+Y    EY  R ++F  NL KI               D+   S  +G+N+
Sbjct: 57  FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116

Query: 91  FSDLSTAEFQAKYLGFKLKPS----YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTM 146
           FSD +  E      GF L  S      +  +    P+I LP  +DWR+ + VT +KDQ +
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQHYTLCENRIVKGAPDIRLPDYYDWRDTNKVTPIKDQGV 176

Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGG 206
           CGS WAF   GNIE  YA +  KL+ LSEQ+L+DCD+ D GC GG +  AF  ++  L G
Sbjct: 177 CGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQELL--LMG 234

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINAYA 265
           G+E E  YPY+G ++ C L+ +   VK+N       RDE  + + +   GP+A+A++A  
Sbjct: 235 GVETEADYPYQGSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKELVYTTGPVAIAVDAMD 294

Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
           +  Y  G+ +    +      +L+H+VL++G+G++        VPYWIIKNSWGE WGE 
Sbjct: 295 IINYRRGILNQCHIY------DLNHAVLLIGWGIENN------VPYWIIKNSWGEDWGEN 342

Query: 326 GYFRLYRGDGSCG-INDYVRSALV 348
           G+ R+ R   +CG +N++  S+++
Sbjct: 343 GFLRVRRNVNACGLLNEFGASSVI 366


>sp|Q80LP4|CATV_NPVAH Viral cathepsin OS=Adoxophyes honmai nucleopolyhedrovirus GN=VCATH
           PE=3 SV=1
          Length = 337

 Score =  212 bits (539), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 126/348 (36%), Positives = 188/348 (54%), Gaps = 29/348 (8%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTA--LFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
           ++L +  +  +V   +   HL    H A   F  F+  +NK Y        R  IF  NL
Sbjct: 1   MTLLMIFTILLVASSQIEGHLKFDIHDAQHYFETFIINYNKQYPDTKTKNYRFKIFKQNL 60

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMI-------- 121
             I   ++  + S +Y +N+FSDLS  E   KY G    KPS   RS             
Sbjct: 61  EDINE-KNKLNDSAIYNINKFSDLSKNELLTKYTGLTSKKPSNMVRSTSNFCNVIHLDAP 119

Query: 122 PNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
           P++   LP+ FDWR  + +T VKDQ  CGS WA +  G +E +YA K   L++LSEQ+LI
Sbjct: 120 PDVHDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLI 179

Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
           DCD  +  C+GG +  AF+ +M+   GGL EE  YPY+G    C+++ K   + ++    
Sbjct: 180 DCDSANMACDGGLMHTAFEQLMN--AGGLMEEIDYPYQGTKGVCKIDNKKFALSVSSCKR 237

Query: 240 -VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
            + ++E ++ K L+  GP+A+AI+A ++  Y  G+ H    FC+  N  L+H+VL+VGYG
Sbjct: 238 YIFQNEENLKKELITMGPIAMAIDAASISTYSKGIIH----FCE--NLGLNHAVLLVGYG 291

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
                 T   V YW +KNSWG  WGE GYFR+ R   +CG+N+ + ++
Sbjct: 292 ------TEGGVSYWTLKNSWGSDWGEDGYFRVKRNINACGLNNQLAAS 333


>sp|Q91CL9|CATV_NPVAP Viral cathepsin OS=Antheraea pernyi nuclear polyhedrosis virus
           GN=VCATH PE=3 SV=1
          Length = 324

 Score =  210 bits (534), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 119/315 (37%), Positives = 182/315 (57%), Gaps = 20/315 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K  + F  FL + NK Y++  E   R  IF  NL +I + ++    S  Y +N+FSDLS
Sbjct: 22  LKAPSYFEEFLHKFNKNYSSESEKLRRFKIFQHNLEEI-INKNQNDTSAQYEINKFSDLS 80

Query: 96  TAEFQAKYLGFKL---KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  +KY G  L   K ++ +  V    P+   P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 81  KDETISKYTGLSLPLQKQNFCEVVVLDRPPDKG-PLEFDWRRLNKVTSVKNQGMCGACWA 139

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  +L++LSEQ+LIDCD  D GC+GG +  A++ +M+   GG++ E 
Sbjct: 140 FATLGSLESQFAIKHDQLINLSEQQLIDCDFVDVGCDGGLLHTAYEAVMNM--GGIQAEN 197

Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  ++  CR+N     V++   Y  V+  E  +   L   GP+ VAI+A  +  Y  
Sbjct: 198 DYPYEANNGPCRVNAAKFVVRVKKCYRYVTLFEEKLKDLLRIVGPIPVAIDASDIVGYKR 257

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+      +C+  N  L+H+VL+VGYGV+        +P+WI+KN+WG  WGE+GYFR+ 
Sbjct: 258 GIIR----YCE--NHGLNHAVLLVGYGVE------NGIPFWILKNTWGADWGEQGYFRVQ 305

Query: 332 RGDGSCGINDYVRSA 346
           +   +CGI + + S+
Sbjct: 306 QNINACGIKNELPSS 320


>sp|Q91BH1|CATV_NPVST Viral cathepsin OS=Spodoptera litura multicapsid
           nucleopolyhedrovirus GN=VCATH PE=3 SV=1
          Length = 337

 Score =  209 bits (531), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 120/316 (37%), Positives = 173/316 (54%), Gaps = 27/316 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           F++QHNK Y T  +  +    F  NL  +  + +  +   VYG+N+FSD+    F  ++ 
Sbjct: 36  FIKQHNKEYTTPDQRDAAFVNFKRNLADMNAMNNVSN-QAVYGINKFSDIDKITFVNEHA 94

Query: 105 GF----------KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           G              P      V    P+   P +FDWR+ + VT VK+Q +CGS WAF+
Sbjct: 95  GLVSNLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLNKVTKVKEQGVCGSCWAFA 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIE  YA     L+ LSEQ+L+DCD+ D GC+GG +  AF  I+    GG+E E  Y
Sbjct: 155 AIGNIESQYAIMHDSLIDLSEQQLLDCDRVDQGCDGGLMHLAFQEIIRI--GGVEHEIDY 212

Query: 215 PYRGDDKACRLNKKATQVKIN-GYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
           PY+G + ACRL      V+++  Y    RDE  + + L +NGP+AVAI+   +  Y +G+
Sbjct: 213 PYQGIEYACRLAPSKLAVRLSHCYQYDLRDERKLLELLYKNGPIAVAIDCVDIIDYRSGI 272

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
           +       D G   L+H+VL+VGYG++         PYWI KNSWG  WGE GYFR  R 
Sbjct: 273 ATVCN---DNG---LNHAVLLVGYGIEND------TPYWIFKNSWGSNWGENGYFRARRN 320

Query: 334 DGSCG-INDYVRSALV 348
             +CG +N++  SA++
Sbjct: 321 INACGMLNEFAASAVL 336


>sp|P56203|CATW_MOUSE Cathepsin W OS=Mus musculus GN=Ctsw PE=2 SV=2
          Length = 371

 Score =  207 bits (528), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 122/333 (36%), Positives = 177/333 (53%), Gaps = 38/333 (11%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F  + N++Y    EY  RL IF+ NL + Q LQ  + G+  +G   FSDL+  EF 
Sbjct: 39  VFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTEEEFG 98

Query: 101 AKYLGFKLKPSYADRSVPAMIPNIT-----------LPRAFDWRE-YDAVTGVKDQTMCG 148
                      Y     P   PN+T           +PR  DWR+  + ++ VK+Q  C 
Sbjct: 99  Q---------LYGQERSPERTPNMTKKVESNTWGESVPRTCDWRKAKNIISSVKNQGSCK 149

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
             WA +   NI+ ++  K ++ V +S QEL+DC++  +GC GG + +A+ T+++    GL
Sbjct: 150 CCWAMAAADNIQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLN--NSGL 207

Query: 209 EEEKTYPYRGDDKACR-LNKKATQVK-INGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
             EK YP++GD K  R L KK  +V  I  +  +S +E  +A YL  +GP+ V IN   L
Sbjct: 208 ASEKDYPFQGDRKPHRCLAKKYKKVAWIQDFTMLSNNEQAIAHYLAVHGPITVTINMKLL 267

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR------TKFTHK-----AVPYWIIK 315
           Q Y  GV       CD     + HSVL+VG+G ++      T  +H      + PYWI+K
Sbjct: 268 QHYQKGVIKATPSSCDP--RQVDHSVLLVGFGKEKEGMQTGTVLSHSRKRRHSSPYWILK 325

Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           NSWG  WGEKGYFRLYRG+ +CG+  Y  +A V
Sbjct: 326 NSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQV 358


>sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1
          Length = 467

 Score =  207 bits (526), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 117/315 (37%), Positives = 166/315 (52%), Gaps = 18/315 (5%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
           T+ F  F ++H + Y +  E   RL +F  NL  +  L    +    +G+  FSDL+  E
Sbjct: 35  TSQFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREE 93

Query: 99  FQAKYL-GFKLKPSYADRS-VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           F+++Y  G     +  +R+ VP  +  +  P A DWR   AVT VKDQ  CGS WAFS  
Sbjct: 94  FRSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GN+E  +      L +LSEQ L+ CD+ D GC GG ++NAF+ I+ +  G +  E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213

Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
               G    C  +       I G+V + +DE  +A +L  NGP+AVA++A +   Y  GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                      +E L H VL+VGY          AVPYWIIKNSW   WGE+GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 321

Query: 334 DGSCGINDYVRSALV 348
              C + +   SA+V
Sbjct: 322 SNQCLVKEEASSAVV 336


>sp|P41721|CATV_NPVBM Viral cathepsin OS=Bombyx mori nuclear polyhedrosis virus GN=VCATH
           PE=1 SV=1
          Length = 323

 Score =  207 bits (526), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 118/317 (37%), Positives = 176/317 (55%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y++ VE   R  IF  NL +I  +   ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P+        ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  +L++LSEQ++IDCD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  +   E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLPLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  D G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE G+FR+ 
Sbjct: 257 GI---IKYCFDSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>sp|P14658|CYSP_TRYBB Cysteine proteinase OS=Trypanosoma brucei brucei PE=1 SV=1
          Length = 450

 Score =  207 bits (526), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 125/346 (36%), Positives = 179/346 (51%), Gaps = 27/346 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
           V LL++   ++S        L  LH  +   + F  F +++ K Y    E   R   F  
Sbjct: 14  VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
           N+ + ++ Q   +    +G+  FSD++  EF+A+Y       + A + +   + N+T   
Sbjct: 68  NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125

Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            P A DWRE  AVT VK Q  CGS WAFST GNIEG +      LVSLSEQ L+ CD  D
Sbjct: 126 APAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
            GC GG + NAF+ I++  GG +  E +YPY    G+   C++N       I  +V + +
Sbjct: 186 SGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           DE  +A YL ENGP+A+A++A +   Y  G+           ++ L H VL+VGY  +  
Sbjct: 246 DEDAIAAYLAENGPLAIAVDAESFMDYNGGI------LTSCTSKQLDHGVLLVGYNDNSN 299

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                  PYWIIKNSW   WGE GY R+ +G   C +N  V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339


>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
          Length = 339

 Score =  206 bits (525), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 130/308 (42%), Positives = 179/308 (58%), Gaps = 27/308 (8%)

Query: 48  QHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYL 104
           QH K YA  VE   R+ IF+ N  KI +  Q    G   Y  GLN+++D+   EF+    
Sbjct: 34  QHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADMLHHEFKETMN 93

Query: 105 GFK--LKPSYADRS--VPAM-IP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G+   L+    +R+  V A  IP  ++T+P++ DWRE+ AVTGVKDQ  CGS WAFS+TG
Sbjct: 94  GYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHCGSCWAFSSTG 153

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
            +EG +  K   LVSLSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ EK+YP
Sbjct: 154 ALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYP 211

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTG 272
           Y G D +C  NK        G+V +   DE  M K +   GP++VAI+A   + Q Y  G
Sbjct: 212 YEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEG 271

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V +  +  CD   +NL H VL+VGYG D +      + YW++KNSWG  WGE+GY ++ R
Sbjct: 272 VYNEPE--CD--EQNLDHGVLVVGYGTDES-----GMDYWLVKNSWGTTWGEQGYIKMAR 322

Query: 333 G-DGSCGI 339
             +  CGI
Sbjct: 323 NQNNQCGI 330


>sp|Q91GE3|CATV_NPVEP Viral cathepsin OS=Epiphyas postvittana nucleopolyhedrovirus
           GN=VCATH PE=3 SV=1
          Length = 323

 Score =  206 bits (524), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 118/316 (37%), Positives = 175/316 (55%), Gaps = 25/316 (7%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ Q+NK Y +  E   R  IF  NL  I  +    + + VY +N+FSDLS
Sbjct: 22  LKAPNYFEEFVRQYNKQYDSEYEKLRRYKIFQHNLNDI--ITKNRNDTAVYKINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P +       ++   P    P  FDWR ++ +T VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PLHTQNFCEVVVLDRPPGKGPLEFDWRRFNKITSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T  ++E  +A    +L++LSEQ++IDCD  D GCEGG +  AF+ I+S   GG++ E 
Sbjct: 139 FATLASLESQFAIAHDRLINLSEQQMIDCDSVDVGCEGGLLHTAFEAIISM--GGVQIEN 196

Query: 213 TYPYRGDDKACRLNKKATQVKI---NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
            YPY   +  CR++     V +   N Y+++   E  +   L   GP+ VAI+A  +  Y
Sbjct: 197 DYPYESSNNYCRMDPTKFVVGVKQCNRYITIY--EEKLKDVLRLAGPIPVAIDASDILNY 254

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             G+      +C   N  L+H+VL+VGYGV+        VPYWI+KNSWG  WGE+G+F+
Sbjct: 255 EQGIIK----YC--ANNGLNHAVLLVGYGVENN------VPYWILKNSWGTDWGEQGFFK 302

Query: 330 LYRGDGSCGINDYVRS 345
           + +   +CGI + + S
Sbjct: 303 IQQNVNACGIKNELAS 318


>sp|Q05094|CYSP2_LEIPI Cysteine proteinase 2 OS=Leishmania pifanoi GN=CYS2 PE=1 SV=1
          Length = 444

 Score =  206 bits (523), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 128/320 (40%), Positives = 177/320 (55%), Gaps = 23/320 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIEG +     +LVSLSEQ+L+ CD  +DGC+GG +  AFD ++    G L  E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDS 213

Query: 214 YPY---RGDDKACRLNKKATQV--KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
           YPY    G    C  + +   V  +I+G+V +   E  MA +L +NGP+A+A++A +   
Sbjct: 214 YPYVSGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMS 273

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +GV       C G  + L+H VL+VGY  D T      VPYW+IKNSWG  WGE+GY 
Sbjct: 274 YKSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYV 321

Query: 329 RLYRGDGSCGINDYVRSALV 348
           R+  G  +C +++Y  SA V
Sbjct: 322 RVVMGVNACLLSEYPVSAHV 341


>sp|P36400|LMCPB_LEIME Cysteine proteinase B OS=Leishmania mexicana GN=LMCPB PE=2 SV=2
          Length = 443

 Score =  205 bits (522), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 125/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIEG +     +LVSLSEQ+L+ CD  +DGC+GG +  AFD ++    G L  E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDS 213

Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY   +    +    ++     +I+G+V +   E  MA +L +NGP+A+A++A +   Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  + L+H VL+VGY  D T      VPYW+IKNSWG  WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +++Y  SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340


>sp|P41715|CATV_NPVCF Viral cathepsin OS=Choristoneura fumiferana nuclear polyhedrosis
           virus GN=Vcath PE=3 SV=1
          Length = 324

 Score =  205 bits (521), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 116/315 (36%), Positives = 175/315 (55%), Gaps = 20/315 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  FL + NK+Y++  E   R  IF  NL +I + ++    +  Y +N+F+DLS
Sbjct: 22  LKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEI-INKNHNDSTAQYEINKFADLS 80

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  +KY G  L P         ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 81  KDETISKYTGLSL-PLQTQNFCEVVVLDRPPDKGPLEFDWRRLNKVTSVKNQGMCGACWA 139

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  + ++LSEQ+LIDCD  D GC+GG +  AF+ +M+   GG++ E 
Sbjct: 140 FATLGSLESQFAIKHNQFINLSEQQLIDCDFVDAGCDGGLLHTAFEAVMNM--GGIQAES 197

Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  ++  CR N     VK+   Y  ++  E  +   L   GP+ VAI+A  +  Y  
Sbjct: 198 DYPYEANNGDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPIPVAIDASDIVNYKR 257

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+      +C   N  L+H+VL+VGY V+        VP+WI+KN+WG  WGE+GYFR+ 
Sbjct: 258 GIMK----YC--ANHGLNHAVLLVGYAVE------NGVPFWILKNTWGADWGEQGYFRVQ 305

Query: 332 RGDGSCGINDYVRSA 346
           +   +CGI + + S+
Sbjct: 306 QNINACGIQNELPSS 320


>sp|Q6VTL7|CATV_NPVCD Viral cathepsin OS=Choristoneura fumiferana defective polyhedrosis
           virus GN=Vcath PE=3 SV=1
          Length = 324

 Score =  205 bits (521), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 122/316 (38%), Positives = 175/316 (55%), Gaps = 22/316 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI--QLLQDTEHGSGVYGLNEFSD 93
           +K  + F  FL   NK Y++  E   R  IF  NL +I  + L DT   S  Y +N+FSD
Sbjct: 22  LKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDT---SAQYEINKFSD 78

Query: 94  LSTAEFQAKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
           LS  E  +KY G  L     +     ++  P    P  FDWR  + VT VK+Q  CG+ W
Sbjct: 79  LSKDETISKYTGLSLPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGACW 138

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
           AF+T G++E  +A K  +L++LSEQ+LIDCD  D GC+GG +  A++ +M+   GG++ E
Sbjct: 139 AFATLGSLESQFAIKHDQLINLSEQQLIDCDFVDMGCDGGLLHTAYEAVMNM--GGIQAE 196

Query: 212 KTYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
             YPY  ++  CRLN     VK+   Y  V   E  +   L   GP+ VAI+A  +  Y 
Sbjct: 197 NDYPYEANNGDCRLNAAKFVVKVKKCYRYVLMFEEKLKDLLRIVGPLPVAIDASDIVNYK 256

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
            GV      +C   N  L+H+VL+VGY V+        VP+WI+KN+WG  WGE+GYFR+
Sbjct: 257 RGVIR----YC--ANHGLNHAVLLVGYAVEN------GVPFWILKNTWGTDWGEQGYFRV 304

Query: 331 YRGDGSCGINDYVRSA 346
            +   +CGI + + S+
Sbjct: 305 QQNINACGIQNELPSS 320


>sp|O91466|CATV_GVCPM Viral cathepsin OS=Cydia pomonella granulosis virus (isolate
           Mexico/1963) GN=VCATH PE=3 SV=1
          Length = 333

 Score =  204 bits (518), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 127/348 (36%), Positives = 189/348 (54%), Gaps = 29/348 (8%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           LL+  +  S   V      + L++     LF  F  ++NKTY +  E   +L  F  NL+
Sbjct: 4   LLNFVILASVLTVTAHALTYDLNNSDE--LFKNFAIKYNKTYVSDEERAIKLENFKNNLK 61

Query: 72  KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL----KPSYADRSVPAMI-----P 122
            I   ++      V+ +NE+SDL+      +  GF+L     PS    +  +++     P
Sbjct: 62  MINE-KNMASKYAVFDINEYSDLNKNALLRRTTGFRLGLKKNPSAFTMTECSVVVIKDEP 120

Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
              LP   DWR+   VT VK+Q  CGS WAFST  NIE +Y  K  K ++LSEQ L++CD
Sbjct: 121 QALLPETLDWRDKHGVTPVKNQMECGSCWAFSTIANIESLYNIKYDKALNLSEQHLVNCD 180

Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS-VS 241
             ++GC GG +  A ++I+ +  GG+   +  PY G D  C+  K   ++ I+G    V 
Sbjct: 181 NINNGCAGGLMHWALESILQE--GGVVSAENEPYYGFDGVCK--KSPFELSISGSRRYVL 236

Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
           ++E  + + LV NGP++VAI+   L  Y  G++      C+  NE L+H+VL+VGYGV  
Sbjct: 237 QNENKLRELLVVNGPISVAIDVSDLINYKAGIAD----ICE-NNEGLNHAVLLVGYGVKN 291

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG-INDYVRSALV 348
                  VPYWI+KNSWG  WGE+GYFR+ R   SCG +N+Y  SA++
Sbjct: 292 D------VPYWILKNSWGAEWGEEGYFRVQRDKNSCGMMNEYASSAIL 333


>sp|Q8QLK1|CATV_NPVMC Viral cathepsin OS=Mamestra configurata nucleopolyhedrovirus
           GN=VCATH PE=3 SV=1
          Length = 337

 Score =  203 bits (516), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 127/343 (37%), Positives = 183/343 (53%), Gaps = 21/343 (6%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
           LL   V  S   VV      +L+++    L F  F+ Q+NK Y++  E   R +IF  N+
Sbjct: 9   LLVSAVLTSHDQVVAVTIKPNLYNINSAPLYFEKFISQYNKQYSSEDEKKYRYNIFRHNI 68

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLP 127
             I   +++ + S VY +N F+D++  E   ++ G     +  ++ +  V         P
Sbjct: 69  ESINA-KNSRNDSAVYKINRFADMTKNEVVNRHTGLASGDIGANFCETIVVDGPGQRQRP 127

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
             FDWR Y+ VT VKDQ MCG+ WAF+  G +E  YA K  +L+ L+EQ+L+DCD  D G
Sbjct: 128 ANFDWRNYNKVTSVKDQGMCGACWAFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMG 187

Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETD 246
           C+GG I  A++ IM    GG+E+E  YPY+     C +      V + N Y  V   E  
Sbjct: 188 CDGGLIHTAYEQIMHI--GGVEQEYDYPYKAVRLPCAVKPHKFAVGVRNCYRYVLLSEER 245

Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
           +   L   GP+A+A++A  L  Y  GV      FC+  N  L+H+VL+VGYG++      
Sbjct: 246 LEDLLRHVGPIAIAVDAVDLTDYYGGVIS----FCE--NNGLNHAVLLVGYGIENN---- 295

Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG-INDYVRSALV 348
             VPYW IKNSWG  +GE GY R+ RG  SCG IN+   SA +
Sbjct: 296 --VPYWTIKNSWGSDYGENGYVRIRRGVNSCGMINELASSAQI 336


>sp|Q9J8B9|CATV_NPVSE Viral cathepsin OS=Spodoptera exigua nuclear polyhedrosis virus
           (strain US) GN=VCATH PE=3 SV=1
          Length = 337

 Score =  203 bits (516), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 125/323 (38%), Positives = 182/323 (56%), Gaps = 23/323 (7%)

Query: 33  LHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
           L+++    L F  F+ Q+NK Y +  E   R +IF  N+  I   +++ + S VY +N F
Sbjct: 30  LYNINSAPLYFEKFITQYNKQYKSEDEKKYRYNIFRHNIESINQ-KNSRNDSAVYKINRF 88

Query: 92  SDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
           +D+   E   ++ G    +L  ++ +  V         P +FDWR  + +T VKDQ MCG
Sbjct: 89  ADMPKNEIVIRHTGLASGELGLNFCETIVVDGPAQRQRPVSFDWRSMNKITSVKDQGMCG 148

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
           + W F++ G +E  YA K  +L+ LSEQ+L+DCD  D GC+GG I  A++ IM    GG+
Sbjct: 149 ACWRFASLGALESQYAIKYDRLIDLSEQQLVDCDFVDMGCDGGLIHTAYEQIMKM--GGV 206

Query: 209 EEEKTYPYRGDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           E+E  Y Y+ + + C L  +K AT V+ N Y  V  +E  +   L   GP+A+A++A  L
Sbjct: 207 EQEFDYSYKAERQPCALKPHKFATGVR-NCYRYVILNEERLEDLLRYVGPIAIAVDAVDL 265

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
             Y  G+      FC+  N  L+H+VL+VGYGV+        VPYWIIKNSWG  +GE G
Sbjct: 266 TDYYGGIVS----FCE--NNGLNHAVLLVGYGVENN------VPYWIIKNSWGSDYGEDG 313

Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
           Y R+ RG  SCG IN+   SA V
Sbjct: 314 YVRVRRGVNSCGMINELASSAQV 336


>sp|P56202|CATW_HUMAN Cathepsin W OS=Homo sapiens GN=CTSW PE=1 SV=2
          Length = 376

 Score =  203 bits (516), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 119/328 (36%), Positives = 176/328 (53%), Gaps = 27/328 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  Q N++Y +  E+  RL IF+ NL + Q LQ+ + G+  +G+  FSDL+  EF  
Sbjct: 42  FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101

Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
            Y G++      PS   R + +  P  ++P + DWR+   A++ +KDQ  C   WA +  
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMAAA 159

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIE ++       V +S QEL+DC +  DGC GG + +AF T+++    GL  EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPF 217

Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
           +G  +A R + K  Q    I  ++ +  +E  +A+YL   GP+ V IN   LQ Y  GV 
Sbjct: 218 QGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVI 277

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
                 CD   + + HSVL+VG+G  +++                    PYWI+KNSWG 
Sbjct: 278 KATPTTCD--PQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGA 335

Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
            WGEKGYFRL+RG  +CGI  +  +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363


>sp|P25783|CATV_NPVAC Viral cathepsin OS=Autographa californica nuclear polyhedrosis
           virus GN=VCATH PE=1 SV=1
          Length = 323

 Score =  202 bits (515), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 116/317 (36%), Positives = 174/317 (54%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y + VE   R  IF  NL +I  +   ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P         ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T  ++E  +A K  +L++LSEQ++IDCD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  ++  E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  + G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE G+FR+ 
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>sp|Q9YMP9|CATV_NPVLD Viral cathepsin OS=Lymantria dispar multicapsid nuclear
           polyhedrosis virus GN=VCATH PE=3 SV=1
          Length = 356

 Score =  202 bits (514), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 114/314 (36%), Positives = 173/314 (55%), Gaps = 21/314 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD--TEHGSGVYGLNEFSDLSTAEF 99
           F  F+E +NK Y +  E   R  IF  NL +I       T+  +  Y +N+FSDLS +E 
Sbjct: 56  FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKFSDLSKSEL 115

Query: 100 QAKYLGFKLKPSYAD--RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
            AK+ G  +    ++  +++    P    P  FDWRE + VT +K+Q  CG+ WAF+T  
Sbjct: 116 IAKFTGLSIPERVSNFCKTIILNQPPDKGPLHFDWREQNKVTSIKNQGACGACWAFATLA 175

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++E  +A +  +L+ LSEQ+LIDCD  D GC GG +  AF+ IM    GG++ E  YP+ 
Sbjct: 176 SVESQFAMRHNRLIDLSEQQLIDCDSVDMGCNGGLLHTAFEEIMRM--GGVQTELDYPFV 233

Query: 218 GDDKACRLNKKATQVK--INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           G ++ C L++    V   +  Y  V  +E  +   L   GP+ +AI+A  +  Y  GV  
Sbjct: 234 GRNRRCGLDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPIPMAIDAADIVNYYRGVIS 293

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
             +      N  L+H+VL+VGYGV+        VPYW+ KN+WG+ WGE GYFR+ +   
Sbjct: 294 SCE------NNGLNHAVLLVGYGVE------NGVPYWVFKNTWGDDWGENGYFRVRQNVN 341

Query: 336 SCG-INDYVRSALV 348
           +CG +ND   +A++
Sbjct: 342 ACGMVNDLASTAVL 355


>sp|Q8B9D5|CATV_NPVR1 Viral cathepsin OS=Rachiplusia ou multiple nucleopolyhedrovirus
           (strain R1) GN=VCATH PE=3 SV=1
          Length = 323

 Score =  202 bits (514), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 116/317 (36%), Positives = 175/317 (55%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y + VE   R  IF  NL +I +    ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIII--KNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P         ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T  ++E  +A K  +L++LSEQ++IDCD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  ++  E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNTNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  + G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE+G+FR+ 
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEEGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>sp|Q9WGE0|CATV_NPVHC Viral cathepsin OS=Hyphantria cunea nuclear polyhedrosis virus
           GN=VCATH PE=3 SV=1
          Length = 324

 Score =  201 bits (511), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 120/318 (37%), Positives = 178/318 (55%), Gaps = 21/318 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K  + F  FL + NK Y++  E   R  IF  NL +I ++++    +  Y +N+FSDLS
Sbjct: 22  LKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEI-IIKNQNDTTAQYEINKFSDLS 80

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  +KY G  L P         ++   P    P  FDWR  + VT VK+Q +CG+ WA
Sbjct: 81  KDETISKYTGLAL-PLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGACWA 139

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T  ++E  +A K  +L++LSEQ+LIDCD  D GC GG +  A++ +M    GG++ E 
Sbjct: 140 FATLASLESQFAIKHNQLINLSEQQLIDCDYVDAGCNGGLLHTAYEAVMQM--GGVQAEN 197

Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G D  CR++     VK+   Y  ++  E  +   L   GP+ VAI+A  +  Y  
Sbjct: 198 DYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPIPVAIDASDIVNYRR 257

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+      +C   N   +H+VL+VGYGV+        VPYWI+KN+WGE WGE+GYFR+ 
Sbjct: 258 GIMR----YC--SNYGFNHAVLLVGYGVENN------VPYWILKNTWGEDWGEQGYFRVQ 305

Query: 332 RGDGSCGI-NDYVRSALV 348
           +   +CGI N+ + SA +
Sbjct: 306 QNINACGIRNELLASAEI 323


>sp|O10364|CATV_NPVOP Viral cathepsin OS=Orgyia pseudotsugata multicapsid polyhedrosis
           virus GN=VCATH PE=3 SV=1
          Length = 324

 Score =  200 bits (509), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 116/318 (36%), Positives = 172/318 (54%), Gaps = 21/318 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  FL + NK Y++  E   R  IF  NL +I + ++    +  Y +N+FSDLS
Sbjct: 22  LKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEI-INKNQNDSTAQYEINKFSDLS 80

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  +KY G  L P         +I   P    P  FDWR+++ VT VK+Q +CG+ WA
Sbjct: 81  KEEAISKYTGLSL-PHQTQNFCEVVILDRPPDRGPLEFDWRQFNKVTSVKNQGVCGACWA 139

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  +L++LSEQ+ IDCD+ + GC+GG +  AF++ M    GG++ E 
Sbjct: 140 FATLGSLESQFAIKYNRLINLSEQQFIDCDRVNAGCDGGLLHTAFESAMEM--GGVQMES 197

Query: 213 TYPYRGDDKACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY   +  CR+N     V +      +   E  +   L   GP+ VAI+A  +  Y  
Sbjct: 198 DYPYETANGQCRINPNRFVVGVRSCRRYIVMFEEKLKDLLRAVGPIPVAIDASDIVNYRR 257

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+           N  L+H+VL+VGY V+        +PYWI+KN+WG  WGE GYFR+ 
Sbjct: 258 GIMRQC------ANHGLNHAVLLVGYAVENN------IPYWILKNTWGTDWGEDGYFRVQ 305

Query: 332 RGDGSCGI-NDYVRSALV 348
           +   +CGI N+ V SA +
Sbjct: 306 QNINACGIRNELVSSAEI 323


>sp|Q9PYY5|CATV_GVXN Viral cathepsin OS=Xestia c-nigrum granulosis virus GN=VCATH PE=3
           SV=1
          Length = 346

 Score =  197 bits (502), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 121/337 (35%), Positives = 175/337 (51%), Gaps = 30/337 (8%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           VALL+L V   S++       + + + +   LFN F+ ++NK Y    E  +R  IF  N
Sbjct: 19  VALLTLNVCAVSYIA------YDMSNAQE--LFNEFVVKYNKVYKDDQEKEARFEIFKQN 70

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT---- 125
           L  I      E  S ++ +N  +D+S+ E   K  G KL     ++      P +     
Sbjct: 71  LADINARNALED-SAMFEINSRADISSNELLQKLTGLKLSLMRGEKKNSFCTPTVISGDS 129

Query: 126 ---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
              +P +FDWR+ ++VT VK Q  CGS WAFS   NIE +Y  K    + LSEQ+L+DCD
Sbjct: 130 SGKVPDSFDWRDRNSVTSVKMQKECGSCWAFSAVANIESLYHIKHNVSLDLSEQQLVDCD 189

Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR 242
           + ++GC GG +S AF+ I+    GG+  E  YPY G D  C+   +  Q+    Y    R
Sbjct: 190 KVNNGCNGGLMSWAFEGIIR--AGGISYEAPYPYTGVDGVCKNTTRYVQLS-GCYAYDLR 246

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
            E  + + L E GP++VAI+   L  Y +GV+          +  L+H VL+VGYG +  
Sbjct: 247 SEKKLRQVLHEKGPVSVAIDVVDLTNYKSGVAKHCSV-----DHGLNHGVLLVGYGQEND 301

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
                 V YW +KNSWG  WGE+G+FR+ R   SCGI
Sbjct: 302 ------VKYWTLKNSWGSDWGEQGFFRIKRDVNSCGI 332


>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
           GN=CEP1 PE=2 SV=1
          Length = 361

 Score =  197 bits (501), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 129/309 (41%), Positives = 171/309 (55%), Gaps = 36/309 (11%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           H+    +L E   R ++F  N++ I      +  S    LN+F D+++ EF+  Y G  +
Sbjct: 44  HHTVARSLEEKAKRFNVFKHNVKHIHETNKKDK-SYKLKLNKFGDMTSEEFRRTYAGSNI 102

Query: 109 K-------PSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           K          A +S   M  N+ TLP + DWR+  AVT VK+Q  CGS WAFST   +E
Sbjct: 103 KHHRMFQGEKKATKSF--MYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVE 160

Query: 161 GVYAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           G+   +TKKL SLSEQEL+DCD  ++ GC GG +  AF+ I  K  GGL  E  YPY+  
Sbjct: 161 GINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEK--GGLTSELVYPYKAS 218

Query: 220 DKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHP 276
           D+ C  NK+ A  V I+G+  V ++  D     V N P++VAI+A     QFY  GV   
Sbjct: 219 DETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGV--- 275

Query: 277 IQFFCDGGNENLSHSVLIVGYG--VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG- 333
             F    G E L+H V +VGYG  +D TK       YWI+KNSWGE WGEKGY R+ RG 
Sbjct: 276 --FTGRCGTE-LNHGVAVVGYGTTIDGTK-------YWIVKNSWGEEWGEKGYIRMQRGI 325

Query: 334 ---DGSCGI 339
              +G CGI
Sbjct: 326 RHKEGLCGI 334


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
           SV=1
          Length = 355

 Score =  196 bits (498), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 129/331 (38%), Positives = 177/331 (53%), Gaps = 28/331 (8%)

Query: 22  FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
           F +VG    H  +  K   LF  ++ +H+K Y ++ E   R  +F  NL  I   ++ E 
Sbjct: 31  FSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQ-RNNEI 89

Query: 82  GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM---IPNIT-LPRAFDWREYDA 137
            S   GLNEF+DL+  EF+ +YLG   KP ++ +  P+      +IT LP++ DWR+  A
Sbjct: 90  NSYWLGLNEFADLTHEEFKGRYLGLA-KPQFSRKRQPSANFRYRDITDLPKSVDWRKKGA 148

Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNA 196
           V  VKDQ  CGS WAFST   +EG+    T  L SLSEQELIDCD   + GC GG +  A
Sbjct: 149 VAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYA 208

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENG 255
           F  I+S   GGL +E  YPY  ++  C+  K+   +V I+GY  V  ++ +     + + 
Sbjct: 209 FQYIIST--GGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQ 266

Query: 256 PMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
           P++VAI A     QFY  GV      F      +L H V  VGYG      + K   Y I
Sbjct: 267 PVSVAIEASGRDFQFYKGGV------FNGKCGTDLDHGVAAVGYG------SSKGSDYVI 314

Query: 314 IKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
           +KNSWG  WGEKG+ R+ R     +G CGIN
Sbjct: 315 VKNSWGPRWGEKGFIRMKRNTGKPEGLCGIN 345


>sp|Q9YWK4|CATV_NPVBS Viral cathepsin OS=Buzura suppressaria nuclear polyhedrosis virus
           GN=VCATH PE=3 SV=1
          Length = 331

 Score =  194 bits (494), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 116/311 (37%), Positives = 169/311 (54%), Gaps = 19/311 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  FL  +NK Y    E   R  IF   L +I   ++  + S VY +N+F+DLS  E  +
Sbjct: 31  FETFLANYNKMYNDTSEKERRFSIFQQTLEEINY-KNRLNDSAVYQINKFADLSKNEIIS 89

Query: 102 KYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
           KY G  +     +     +I  P    P  FDWR+ + VT +K+Q  CG+ WAF+T  +I
Sbjct: 90  KYTGLNMPVQTTNFCKTIVIDQPPGKGPLNFDWRQQNKVTSIKNQKACGACWAFATLASI 149

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           E  YA K    + LSEQ++IDCD  D GC+GG +  AF+ ++    G L +E  YPY G 
Sbjct: 150 ESQYAIKNNVHIDLSEQQMIDCDYVDMGCDGGLLHTAFEQMIQM--GELVQEHEYPYAGV 207

Query: 220 DKACRLNKKAT-QVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           +K C L    T  VK+ G Y  V   E  +   L   GP+ +AI+A  +  Y  G+ H  
Sbjct: 208 NKPCELRGDETGVVKVKGCYRYVVFREEKLKDLLRAVGPIPMAIDASGIVNYHHGIIH-- 265

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
             +C+  N  L+H+VL+VGYGV+        VP+W  KN+WG+ WGE+GYFR+ +   +C
Sbjct: 266 --YCE--NYGLNHAVLLVGYGVENN------VPFWTFKNTWGKDWGEEGYFRVRQNVDAC 315

Query: 338 GINDYVRSALV 348
           G+ + + S+ V
Sbjct: 316 GMTNELASSAV 326


>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
          Length = 371

 Score =  194 bits (493), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 123/312 (39%), Positives = 171/312 (54%), Gaps = 28/312 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
           F  +H K Y    E   RL IF+ N  KI +  Q    G   + L  N+++DL   EF+ 
Sbjct: 62  FKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 121

Query: 102 KYLGFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
              GF      +   AD S   +      ++TLP++ DWR   AVT VKDQ  CGS WAF
Sbjct: 122 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 181

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           S+TG +EG +  K+  LVSLSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ E
Sbjct: 182 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 239

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
           K+YPY   D +C  NK        G+  + + DE  MA+ +   GP++VAI+A   + QF
Sbjct: 240 KSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQF 299

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GV +  Q  CD   +NL H VL+VG+G D +        YW++KNSWG  WG+KG+ 
Sbjct: 300 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GEDYWLVKNSWGTTWGDKGFI 350

Query: 329 RLYRG-DGSCGI 339
           ++ R  +  CGI
Sbjct: 351 KMLRNKENQCGI 362


>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
          Length = 360

 Score =  194 bits (492), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 130/351 (37%), Positives = 181/351 (51%), Gaps = 37/351 (10%)

Query: 7   FAGVALLSLT-VSVSSFMVVGDEKLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRL 63
           F  +AL++L+ +S++  +   ++ L         +L+N +     H+     L E   R 
Sbjct: 6   FIALALVALSFLSIAQSIPFTEKDL-----ASEDSLWNLYEKWRTHHTVARDLDEKNRRF 60

Query: 64  HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA---- 119
           ++F  N++ I      +       LN+F D++  EF++KY G K++   + R +      
Sbjct: 61  NVFKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGS 120

Query: 120 -MIPNI-TLPRA-FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
            M  N+ +LP A  DWR   AVTGVKDQ  CGS WAFST  ++EG+   KT +LVSLSEQ
Sbjct: 121 FMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQ 180

Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLN-KKATQVKI 234
           EL+DCD   ++GC GG +  AF+ I      G+  E +YPY   D  C  N   +  V I
Sbjct: 181 ELVDCDTSYNEGCNGGLMDYAFEFIQKN---GITTEDSYPYAEQDGTCASNLLNSPVVSI 237

Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSV 292
           +G+  V  +  +     V N P++V+I A  Y  QFY  GV     F    G E L H V
Sbjct: 238 DGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGV-----FTGRCGTE-LDHGV 291

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
            IVGYG      T     YWI+KNSWGE WGE GY R+ RG     G CGI
Sbjct: 292 AIVGYGA-----TRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGI 337


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
           PE=1 SV=2
          Length = 458

 Score =  192 bits (487), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 122/315 (38%), Positives = 167/315 (53%), Gaps = 33/315 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           L+  +  +H K+Y  + E   R   F  NLR I    +    +GV+    GLN F+DL+ 
Sbjct: 39  LYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE-HNAAADAGVHSFRLGLNRFADLTN 97

Query: 97  AEFQAKYLGFKLKP----SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
            E++  YLG + KP      +DR + A   N  LP + DWR   AV  +KDQ  CGS WA
Sbjct: 98  EEYRDTYLGLRNKPRRERKVSDRYLAA--DNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           FS    +EG+    T  L+SLSEQEL+DCD   ++GC GG +  AFD I++   GG++ E
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTE 213

Query: 212 KTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
             YPY+G D+ C +N+K A  V I+ Y  V+ +     +  V N P++VAI A   A Q 
Sbjct: 214 DDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQL 273

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +G+      F       L H V  VGYG +  K       YWI++NSWG+ WGE GY 
Sbjct: 274 YSSGI------FTGKCGTALDHGVAAVGYGTENGK------DYWIVRNSWGKSWGESGYV 321

Query: 329 RLYRG----DGSCGI 339
           R+ R      G CGI
Sbjct: 322 RMERNIKASSGKCGI 336


>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
          Length = 351

 Score =  190 bits (483), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 109/306 (35%), Positives = 159/306 (51%), Gaps = 23/306 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  ++ ++ + Y    E   R  IF  N++ I+        S   G+N+F+D++ +EF A
Sbjct: 37  FEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVA 96

Query: 102 KYLGFKLKPSYADRSVPAMIPNITL---PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           +Y G  L P   +R       ++ +   P++ DWR+Y AV  VK+Q  CGS W+F+    
Sbjct: 97  QYTGVSL-PLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIAT 155

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG+Y  KT  LVSLSEQE++DC     GC+GG ++ A+D I+S    G+  E+ YPY  
Sbjct: 156 VEGIYKIKTGYLVSLSEQEVLDC-AVSYGCKGGWVNKAYDFIISN--NGVTTEENYPYLA 212

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPI 277
               C  N       I GY  V R++     Y V N P+A  I+A    Q+Y  GV    
Sbjct: 213 YQGTCNANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGGV---- 268

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
             F      +L+H++ I+GYG D +        YWI++NSWG  WGE GY R+ RG    
Sbjct: 269 --FSGPCGTSLNHAITIIGYGQDSS-----GTKYWIVRNSWGSSWGEGGYVRMARGVSSS 321

Query: 334 DGSCGI 339
            G CGI
Sbjct: 322 SGVCGI 327


>sp|P35591|CYSP1_LEIPI Cysteine proteinase 1 OS=Leishmania pifanoi GN=CYS1 PE=2 SV=2
          Length = 354

 Score =  190 bits (483), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 119/319 (37%), Positives = 168/319 (52%), Gaps = 25/319 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
           +A +  F ++H K +    E   R + F  N++    L +T++    Y ++ +F+DL+  
Sbjct: 39  SAHYGSFKKRHGKAFGGDAEEGHRFNAFKQNMQTAYFL-NTQNPHAHYDVSGKFADLTPQ 97

Query: 98  EFQAKYL-----GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           EF   YL        LK    D  V    P+  +  + DWR+  AVT VK+Q +CGS WA
Sbjct: 98  EFAKLYLNPDYYARHLKDHKEDVHVDDSAPSGVM--SVDWRDKGAVTPVKNQGLCGSCWA 155

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           FS  GNIEG +AA    LVSLSEQ L+ CD  D+GC GG +  A + IM    G +  E 
Sbjct: 156 FSAIGNIEGQWAASGHSLVSLSEQMLVSCDNIDEGCNGGLMDQAMNWIMQSHNGSVFTEA 215

Query: 213 TYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           +YPY    G    C  ++     KI G++S+  DE  +A+++ + GP+AVA++A   Q Y
Sbjct: 216 SYPYTSGGGTRPPCH-DEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLY 274

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GV       C     +L+H VLIVG+  +         PYWI+KNSWG  WGEKGY R
Sbjct: 275 FGGVVS----LCLA--WSLNHGVLIVGFNKNAKP------PYWIVKNSWGSSWGEKGYIR 322

Query: 330 LYRGDGSCGINDYVRSALV 348
           L  G   C + +Y  SA V
Sbjct: 323 LAMGSNQCMLKNYPVSATV 341


>sp|P25775|LMCPA_LEIME Cysteine proteinase A OS=Leishmania mexicana GN=LMCPA PE=2 SV=1
          Length = 354

 Score =  190 bits (483), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 119/319 (37%), Positives = 168/319 (52%), Gaps = 25/319 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
           +A +  F ++H K +    E   R + F  N++    L +T++    Y ++ +F+DL+  
Sbjct: 39  SAHYGSFKKRHGKAFGGDAEEGHRFNAFKQNMQTAYFL-NTQNPHAHYDVSGKFADLTPQ 97

Query: 98  EFQAKYL-----GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           EF   YL        LK    D  V    P+  +  + DWR+  AVT VK+Q +CGS WA
Sbjct: 98  EFAKLYLNPDYYARHLKNHKEDVHVDDSAPSGVM--SVDWRDKGAVTPVKNQGLCGSCWA 155

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           FS  GNIEG +AA    LVSLSEQ L+ CD  D+GC GG +  A + IM    G +  E 
Sbjct: 156 FSAIGNIEGQWAASGHSLVSLSEQMLVSCDNIDEGCNGGLMDQAMNWIMQSHNGSVFTEA 215

Query: 213 TYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           +YPY    G    C  ++     KI G++S+  DE  +A+++ + GP+AVA++A   Q Y
Sbjct: 216 SYPYTSGGGTRPPCH-DEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLY 274

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GV       C     +L+H VLIVG+  +         PYWI+KNSWG  WGEKGY R
Sbjct: 275 FGGVVS----LCLA--WSLNHGVLIVGFNKNAKP------PYWIVKNSWGSSWGEKGYIR 322

Query: 330 LYRGDGSCGINDYVRSALV 348
           L  G   C + +Y  SA V
Sbjct: 323 LAMGSNQCMLKNYPVSATV 341


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
           SV=2
          Length = 356

 Score =  189 bits (480), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 130/326 (39%), Positives = 173/326 (53%), Gaps = 43/326 (13%)

Query: 35  HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY-GLNEFSD 93
           H K   LF  ++    K Y T+ E + R  +F  NL+ I   +  + G   + GLNEF+D
Sbjct: 44  HDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHID--ETNKKGKSYWLGLNEFAD 101

Query: 94  LSTAEFQAKYLGFKL-------KPSYAD---RSVPAMIPNITLPRAFDWREYDAVTGVKD 143
           LS  EF+  YLG K        + SYA+   R V A      +P++ DWR+  AV  VK+
Sbjct: 102 LSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA------VPKSVDWRKKGAVAEVKN 155

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMS 202
           Q  CGS WAFST   +EG+    T  L +LSEQELIDCD   ++GC GG +  AF+ I+ 
Sbjct: 156 QGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVK 215

Query: 203 KLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVA 260
              GGL +E+ YPY  ++  C + K  ++ V ING+  V + DE  + K L    P++VA
Sbjct: 216 N--GGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQ-PLSVA 272

Query: 261 INAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           I+A     QFY  GV      F      +L H V  VGYG      + K   Y I+KNSW
Sbjct: 273 IDASGREFQFYSGGV------FDGRCGVDLDHGVAAVGYG------SSKGSDYIIVKNSW 320

Query: 319 GEGWGEKGYFRLYRG----DGSCGIN 340
           G  WGEKGY RL R     +G CGIN
Sbjct: 321 GPKWGEKGYIRLKRNTGKPEGLCGIN 346


>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
          Length = 362

 Score =  189 bits (479), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 116/305 (38%), Positives = 166/305 (54%), Gaps = 28/305 (9%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           H+    +L E + R ++F  NL  +      +    +  LN+F+D++  EF++ Y G K+
Sbjct: 46  HHTVSRSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLK-LNKFADMTNHEFRSTYAGSKV 104

Query: 109 KPSYADRSVP----AMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
                 R  P    A +    +++P + DWR+  AVT VKDQ  CGS WAFST   +EG+
Sbjct: 105 NHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGI 164

Query: 163 YAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
              KT KLV+LSEQEL+DCD+E++ GC GG + +AF+ I  K  GG+  E  YPY+  + 
Sbjct: 165 NQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQK--GGITTESNYPYKAQEG 222

Query: 222 ACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQ 278
            C  +K     V I+G+ +V  ++ D     V N P++VAI+A     QFY  GV     
Sbjct: 223 TCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV----- 277

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
            F    + +L+H V IVGYG      T     YWI++NSWG  WGE GY R+ R     +
Sbjct: 278 -FTGDCSTDLNHGVAIVGYGT-----TVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKE 331

Query: 335 GSCGI 339
           G CGI
Sbjct: 332 GLCGI 336


>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
           SV=1
          Length = 462

 Score =  188 bits (477), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 125/351 (35%), Positives = 186/351 (52%), Gaps = 40/351 (11%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHT---------ALFNYFLEQHNKTYA--TLVE 58
           +A+++++ +V   ++  DEK    H V  T         +++  +L +H K  +  +LVE
Sbjct: 13  LAMVAVSSAVDMSIISYDEK----HGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVE 68

Query: 59  YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
              R  IF  NLR +    + ++ S   GL  F+DL+  E+++KYLG K++     R+  
Sbjct: 69  KDRRFEIFKDNLRFVDE-HNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSL 127

Query: 119 AMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
                +   LP + DWR+  AV  VKDQ  CGS WAFST G +EG+    T  L++LSEQ
Sbjct: 128 RYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQ 187

Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKI 234
           EL+DCD   ++GC GG +  AF+ I+    GG++ +K YPY+G D  C ++ K A  V I
Sbjct: 188 ELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245

Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSV 292
           + Y  V     +  K  V + P+++AI A   A Q Y +G+      F       L H V
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGI------FDGSCGTQLDHGV 299

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
           + VGYG +  K       YWI++NSWG+ WGE GY R+ R      G CGI
Sbjct: 300 VAVGYGTENGK------DYWIVRNSWGKSWGESGYLRMARNIASSSGKCGI 344


>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
          Length = 333

 Score =  188 bits (477), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 123/339 (36%), Positives = 178/339 (52%), Gaps = 33/339 (9%)

Query: 8   AGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFS 67
           AG  LLS T + +   V   EK H          F  +++QH KTY++ VEY  RL +F+
Sbjct: 10  AGAWLLS-TGATAELTVNAIEKFH----------FKSWMKQHQKTYSS-VEYNHRLQMFA 57

Query: 68  GNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRS--VPAMIPNIT 125
            N RKIQ      H   +  LN+FSD+S AE + K+L  + +   A +S  +    P   
Sbjct: 58  NNWRKIQAHNQRNHTFKM-ALNQFSDMSFAEIKHKFLWSEPQNCSATKSNYLRGTGP--- 113

Query: 126 LPRAFDWREY-DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ- 183
            P + DWR+  + V+ VK+Q  CGS W FSTTG +E   A  + K++SL+EQ+L+DC Q 
Sbjct: 114 YPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQA 173

Query: 184 -EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS- 241
             + GC+GG  S AF+ I+     G+ EE +YPY G D +CR N +     +   V+++ 
Sbjct: 174 FNNHGCKGGLPSQAFEYIL--YNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITL 231

Query: 242 RDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
            DE  M + +    P++ A         Y +GV       C    + ++H+VL VGYG  
Sbjct: 232 NDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKS--CHKTPDKVNHAVLAVGYG-- 287

Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
                   + YWI+KNSWG  WGE GYF + RG   CG+
Sbjct: 288 ----EQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGL 322


>sp|P43234|CATO_HUMAN Cathepsin O OS=Homo sapiens GN=CTSO PE=2 SV=1
          Length = 321

 Score =  187 bits (475), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 113/284 (39%), Positives = 156/284 (54%), Gaps = 20/284 (7%)

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIPNITLP 127
           R +  L  +E+ +  YG+N+FS L   EF+A YL  + KPS   R    V   IPN++LP
Sbjct: 52  RYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYL--RSKPSKFPRYSAEVHMSIPNVSLP 109

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
             FDWR+   VT V++Q MCG  WAFS  G +E  YA K K L  LS Q++IDC   + G
Sbjct: 110 LRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYG 169

Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVS--VSRDE 244
           C GGS  NA +  ++K+   L ++  YP++  +  C   +   +   I GY +   S  E
Sbjct: 170 CNGGSTLNALN-WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQE 228

Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
            +MAK L+  GP+ V ++A + Q Y+ G+   IQ  C  G  N  H+VLI G+  D+T  
Sbjct: 229 DEMAKALLTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEAN--HAVLITGF--DKTGS 281

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           T    PYWI++NSWG  WG  GY  +  G   CGI D V S  V
Sbjct: 282 T----PYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 321


>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
          Length = 371

 Score =  187 bits (474), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 120/275 (43%), Positives = 152/275 (55%), Gaps = 43/275 (15%)

Query: 88  LNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-----------LPRAFDWREYD 136
           LN F D+  AEF+A ++G  L+     R  PA  P++            LP + DWR+  
Sbjct: 91  LNRFGDMDQAEFRATFVG-DLR-----RDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKG 144

Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED-DGCEGGSISN 195
           AVTGVKDQ  CGS WAFST  ++EG+ A +T  LVSLSEQELIDCD  D DGC+GG + N
Sbjct: 145 AVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDN 204

Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ----VKINGYVSV-SRDETDMAKY 250
           AF+ I  K  GGL  E  YPYR     C + + A      V I+G+  V +  E D+A+ 
Sbjct: 205 AFEYI--KNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLAR- 261

Query: 251 LVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKA 308
            V N P++VA+ A   A  FY  GV     F  D G E L H V +VGYGV         
Sbjct: 262 AVANQPVSVAVEASGKAFMFYSEGV-----FTGDCGTE-LDHGVAVVGYGV-----AEDG 310

Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGS----CGI 339
             YW +KNSWG  WGE+GY R+ +  G+    CGI
Sbjct: 311 KAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGI 345


>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
           GN=GCP1 PE=2 SV=2
          Length = 376

 Score =  186 bits (473), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 117/296 (39%), Positives = 161/296 (54%), Gaps = 33/296 (11%)

Query: 62  RLHIFSGNLRKIQLL-QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-------A 113
           R +IF  NLR I L  +D ++ +   GL +F+DL+  E++  YLG + +P+         
Sbjct: 73  RFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNV 132

Query: 114 DRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSL 173
           ++   A +    +P   DWR+  AV  +KDQ  CGS WAFSTT  +EG+    T +L+SL
Sbjct: 133 NQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISL 192

Query: 174 SEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQ 231
           SEQEL+DCD+  + GC GG +  AF  IM    GGL  EK YPYRG    C    K +  
Sbjct: 193 SEQELVDCDKSYNQGCNGGLMDYAFQFIMKN--GGLNTEKDYPYRGFGGKCNSFLKNSRV 250

Query: 232 VKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENL 288
           V I+GY  V ++DET + K  +   P++VAI A     Q Y +G+      F      NL
Sbjct: 251 VSIDGYEDVPTKDETALKK-AISYQPVSVAIEAGGRIFQHYQSGI------FTGSCGTNL 303

Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
            H+V+ VGYG      +   V YWI++NSWG  WGE+GY R+ R       G CGI
Sbjct: 304 DHAVVAVGYG------SENGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGI 353


>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
          Length = 330

 Score =  186 bits (472), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 117/336 (34%), Positives = 180/336 (53%), Gaps = 33/336 (9%)

Query: 15  LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
           L + ++SF +  +E L           ++ + + + K Y + V+  SR  I+  NL+ I 
Sbjct: 8   LLLPMASFALYPEEIL--------DTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHIS 59

Query: 75  LLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPR 128
           +  + E   GV+     +N   D+++ E   K  G K+ PS++  +    IP+     P 
Sbjct: 60  I-HNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWESRAPD 118

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           + D+R+   VT VK+Q  CGS WAFS+ G +EG    KT KL++LS Q L+DC  E+DGC
Sbjct: 119 SVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGC 178

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDM 247
            GG ++NAF  +      G++ E  YPY G D++C  N      K  GY  +   +E  +
Sbjct: 179 GGGYMTNAFQYVQKNR--GIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKAL 236

Query: 248 AKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGN-ENLSHSVLIVGYGVDRTKF 304
            + +   GP++VAI+A   + QFY  GV     ++ +  N +NL+H+VL VGYG+     
Sbjct: 237 KRAVARVGPISVAIDASLTSFQFYSKGV-----YYDENCNSDNLNHAVLAVGYGI----- 286

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
             K   +WIIKNSWGE WG KGY  + R  + +CGI
Sbjct: 287 -QKGNKHWIIKNSWGENWGNKGYILMARNKNNACGI 321


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.319    0.136    0.414 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 133,978,118
Number of Sequences: 539616
Number of extensions: 5768565
Number of successful extensions: 13044
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 217
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 11996
Number of HSP's gapped (non-prelim): 264
length of query: 348
length of database: 191,569,459
effective HSP length: 118
effective length of query: 230
effective length of database: 127,894,771
effective search space: 29415797330
effective search space used: 29415797330
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (28.5 bits)