BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy4960
         (341 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
          Length = 884

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 111/325 (34%), Positives = 168/325 (51%), Gaps = 31/325 (9%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSD 85
           + IK    F+ +I K+ +TY   +E   RF+ FKQ+ K  +E          YG +  +D
Sbjct: 571 EEIKDETLFEAFIKKFGKTYNSADEKLDRFKIFKQNLKIIEELQTFERGTAEYGVTMFAD 630

Query: 86  RSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVE 144
            +P+E   R  GLR   K +  +      +           LP   DWR   V  + PV+
Sbjct: 631 LTPKEFKARYLGLRPELKHENEIPLPEAEIPDV-------SLPLKFDWRDHSV--VTPVK 681

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ 204
            QG+CGSCWAF+ T  +E Q A+    L  LS+ +LV+CD  +  CNGG+++ A++ +++
Sbjct: 682 DQGQCGSCWAFSVTGNVEGQYAIKHNQLLSLSEQELVDCDSLDEGCNGGDMENAYKAIER 741

Query: 205 Y-GLESQADYPYRNKENITFRCTYEKEKAKV-FVQDTWVTSGVDHMMH-LLQSGPIGVYL 261
             GLE ++DYPY  K+    +C + + KAKV  V    +TS    M   L+++GPI V +
Sbjct: 742 LGGLELESDYPYDAKDE---KCHFLQNKAKVQVVSAVNITSDEKRMAQWLVKNGPISVGI 798

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT------WIVRNSWGDIGP 315
           N   ++ Y G      ++ CNP  LDH V IVGYG     L       WI++NSWG    
Sbjct: 799 NANAMQFYFGGVSHPLNFLCNPKNLDHGVLIVGYGISKYPLFHKELPYWIIKNSWGPRWG 858

Query: 316 DHGYFQIERGANACGIESYAYLASV 340
           + GY+++ RG   CG+ + A  A V
Sbjct: 859 ERGYYRVYRGDGTCGVNTMATSAVV 883


>gi|328788558|ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163-like [Apis
           mellifera]
          Length = 881

 Score =  164 bits (415), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 113/328 (34%), Positives = 173/328 (52%), Gaps = 32/328 (9%)

Query: 32  LAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSG 82
           LA D IK    F+ +I+K+N+T++  NE + RF+ FKQ+ K  +E          YG + 
Sbjct: 566 LAQD-IKDEMLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIINELQTFEQGTAEYGVTM 624

Query: 83  SSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLN 141
            +D +P+E   R  G R   K++  +   +  V           LP   DWR     V+ 
Sbjct: 625 FADLTPKEFKTRYLGFRPELKQENEIPLAKIEVSDIF-------LPLKFDWRD--YNVVT 675

Query: 142 PVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEY 201
           PV+ QG CGSCWAF+ T  +E Q A+  K L  LS+ +L++CD  +  CNGG ++ A++ 
Sbjct: 676 PVKDQGLCGSCWAFSVTGNVEGQYAIKYKKLLSLSEQELLDCDTLDEGCNGGYMENAYKA 735

Query: 202 VKQY-GLESQADYPYRNKENITFRCTYEKEKAKV-FVQDTWVTSGVDHMMH-LLQSGPIG 258
           +++  GLE ++DYPY  +     +C + K+ AKV  V    +TS    M   L+++GPI 
Sbjct: 736 IEKLGGLELESDYPYDGRNE---KCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPIS 792

Query: 259 VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT------WIVRNSWGD 312
           + +N   ++ Y G       + CNP  LDH V IVGYG     L       WI++NSWG 
Sbjct: 793 IGINANAMQFYIGGVSHPFHFLCNPKDLDHGVLIVGYGISKYPLFHKKLPYWIIKNSWGS 852

Query: 313 IGPDHGYFQIERGANACGIESYAYLASV 340
              ++GY+++ RG   CG+ + A  A V
Sbjct: 853 RWGENGYYRVYRGDGTCGVNAMASSAIV 880


>gi|380025691|ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12163-like [Apis florea]
          Length = 881

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 109/324 (33%), Positives = 169/324 (52%), Gaps = 31/324 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           +IK    F+ +I+K+N+T++  NE + RF+ FKQ+ K   E          YG +  +D 
Sbjct: 569 NIKYETLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIIKELQTFEQGTAEYGVTMFADL 628

Query: 87  SPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
           +P+E   R  G R   K++  +   +  V           LP   DWR      + PV+ 
Sbjct: 629 TPKEFKTRYLGFRPELKQENEIPLAKIEVSDIF-------LPPKFDWRD--YNAVTPVKD 679

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+ T  +E Q A+  K L  LS+ +L++CD  +  CNGG ++ A++ +++ 
Sbjct: 680 QGLCGSCWAFSVTGNVEGQYAIKYKKLLSLSEQELLDCDTLDEGCNGGYMENAYKAIEKL 739

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKV-FVQDTWVTSGVDHMMH-LLQSGPIGVYLN 262
            GLE ++DYPY  +     +C + K+ AKV  V    +TS    M   L+++GPI + +N
Sbjct: 740 GGLELESDYPYDGRNE---KCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGIN 796

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT------WIVRNSWGDIGPD 316
              ++ Y G       + CNP  LDH V IVGYG     L       WI++NSWG    +
Sbjct: 797 ANAMQFYIGGVSHPFHFLCNPKDLDHGVLIVGYGISKYPLFHKELPYWIIKNSWGSRWGE 856

Query: 317 HGYFQIERGANACGIESYAYLASV 340
           +GY+++ RG   CG+ + A  A V
Sbjct: 857 NGYYRVYRGDGTCGVNAMASSAIV 880


>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
          Length = 774

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 105/325 (32%), Positives = 166/325 (51%), Gaps = 32/325 (9%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQ------DGKETDE---YYGTSGSSD 85
           + +K    F  ++  +NRTY+   E   RF+ F++      + +ET++    YG +  +D
Sbjct: 462 EDMKAERLFNNFMTTYNRTYSSL-ERNLRFKIFRENLNFIEELRETEQGTGIYGVNMFAD 520

Query: 86  RSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVE 144
            S +E   R  GLR   + +  +   +  +           LP S DWRQ  V  + PV+
Sbjct: 521 MSQKEFRTRYLGLRPDLQSENEIPLPKAEIPDI-------DLPSSFDWRQKGV--VTPVK 571

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ 204
           +QG+CGSCWAF+ T  +E Q A+    L  LS+ +LV+CDH +  CNGG  D A+  ++Q
Sbjct: 572 NQGQCGSCWAFSVTGNVEGQYAIKHGQLLSLSEQELVDCDHLDEGCNGGLPDNAYRAIEQ 631

Query: 205 Y-GLESQADYPYRNKENITFRCTYEKEKAKV-FVQDTWVTSGVDHMMH-LLQSGPIGVYL 261
             GLE ++DYPY  +     +C +++   KV       +TS    +   L+Q+GPI + +
Sbjct: 632 LGGLELESDYPYEAENE---KCHFKQNLVKVELASAVNITSNETQIAQWLVQNGPIAIGI 688

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK------NGILTWIVRNSWGDIGP 315
           N   ++ Y G         CNP+ L+H V IVGYG          +  WI++NSWG    
Sbjct: 689 NANAMQFYMGGVSHPLKILCNPNNLNHGVLIVGYGTSRYPLFHKNLPYWIIKNSWGKSWG 748

Query: 316 DHGYFQIERGANACGIESYAYLASV 340
           + GY+++ RG   CG+ + A  A V
Sbjct: 749 EQGYYRVYRGDGTCGLNTMASSAVV 773


>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
          Length = 887

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 102/325 (31%), Positives = 158/325 (48%), Gaps = 31/325 (9%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD---------GKETDEYYGTSGSSD 85
           + ++    F  ++V +NRTY+   E   R   F+++          +    +Y  +  +D
Sbjct: 574 EDVRSEQLFNNFVVTYNRTYSTPEERNLRLRIFRENLGIIQLLRKTERGTAHYDVNMFAD 633

Query: 86  RSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVE 144
            SP+E   R  GLR   + +  +      +           LP   DWR+  V  + PV+
Sbjct: 634 MSPEEFRSRYLGLRPDLRSENDIPLREAEIPDV-------ELPPKFDWREKSV--VTPVK 684

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ 204
            QG CGSCWAF+ T  +E Q A+    L  LS+ +LV+CD  +  CNGG  D A+  +++
Sbjct: 685 DQGMCGSCWAFSVTGNIEGQYAIKHGRLLSLSEQELVDCDDLDEGCNGGLPDNAYRAIEK 744

Query: 205 Y-GLESQADYPYRNKENITFRCTYEKEKAKV-FVQDTWVTSGVDHMMH-LLQSGPIGVYL 261
             GLE ++DYPY  +     +C ++K  AKV       +TS    M   L+Q+GPI + +
Sbjct: 745 LGGLELESDYPYEAENE---KCHFKKNLAKVQLASAVNITSNETQMAQWLVQNGPISIGI 801

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT------WIVRNSWGDIGP 315
           N   ++ Y G       + CNP  LDH V IVGYG  +  L       W ++NSWG    
Sbjct: 802 NANAMQFYVGGVSHPFKFLCNPKNLDHGVLIVGYGTSDYPLFHKKLPYWTIKNSWGKRWG 861

Query: 316 DHGYFQIERGANACGIESYAYLASV 340
           + GY+++ RG   CG+ + A  A V
Sbjct: 862 EQGYYRVYRGDGTCGLNTLATSAVV 886


>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
          Length = 1032

 Score =  160 bits (406), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 105/317 (33%), Positives = 158/317 (49%), Gaps = 31/317 (9%)

Query: 43   FKTYIVKWNRTYTDDNEIKTRFEYFKQD---------GKETDEYYGTSGSSDRSPQEILQ 93
            F+ ++  +NRTY  + E   R   F+++          ++    YG +  +D S +E   
Sbjct: 727  FENFVNTYNRTYATEEERNLRLSIFRENLGIIRLLRKNEQGTGQYGVNQFADVSTEEFHA 786

Query: 94   -RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
               GLR   + +  +   +  +           LP S DWRQ     + PV++QG CGSC
Sbjct: 787  FYLGLRPDLRTENNIPLRQAEIPDI-------ELPNSFDWRQKGA--VTPVKNQGMCGSC 837

Query: 153  WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQA 211
            WAF+ T  +E Q A+    L  LS+ +LV+CD  +  CNGG  D A+  +++  GLE ++
Sbjct: 838  WAFSVTGNVEGQYAIKHNKLLSLSEQELVDCDDLDEGCNGGLPDNAYRAIEKLGGLELES 897

Query: 212  DYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMH-LLQSGPIGVYLNHRLIESY 269
            DYPY  +     RC ++K  AKV V     +TS    +   L+ +GPI + +N   ++ Y
Sbjct: 898  DYPYEAENE---RCHFKKNMAKVQVGSAVNITSNETQIAQWLVANGPISIGINANAMQFY 954

Query: 270  DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT------WIVRNSWGDIGPDHGYFQIE 323
             G       + CNP  LDH V IVGYG  N  L       WIV+NSWGD   + GY+++ 
Sbjct: 955  MGGVSHPFKFLCNPKNLDHGVLIVGYGTSNYPLFHKKLPYWIVKNSWGDRWGEQGYYRVY 1014

Query: 324  RGANACGIESYAYLASV 340
            RG   CG+ + A  A V
Sbjct: 1015 RGDGTCGLNTMASSAVV 1031


>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
          Length = 475

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 103/318 (32%), Positives = 161/318 (50%), Gaps = 24/318 (7%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSD 85
           +S++ +  FK ++VK+N+ Y+  +E   R   F ++ K  ++          YG +  SD
Sbjct: 169 ESVELLGQFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSD 228

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            + +E       R T       +    R  K  +   KGP P S DWR      ++ V++
Sbjct: 229 LTEEE------FRSTYLNPLLSQWTLHRPMKPASP-AKGPAPASWDWRDHGA--VSSVKN 279

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+ T  +E Q  L   TL  LS+ +LV+CD  +  CNGG    A+E +++ 
Sbjct: 280 QGMCGSCWAFSVTGNIEGQWFLKNGTLVSLSEQELVDCDGLDQACNGGLPSNAYEAIEKL 339

Query: 206 G-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN 262
           G LE++ DY Y  K+     C +  +K   ++  +   S  +  +   L ++GP+ V LN
Sbjct: 340 GGLETETDYSYIGKKQ---SCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVSVALN 396

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
              ++ Y           CNP  +DHAV +VGYGE+ GI  W ++NSWG+   + GY+ +
Sbjct: 397 AFAMQFYRKGVSHPLKIFCNPWMIDHAVLMVGYGERKGIPFWAIKNSWGEDYGEQGYYYL 456

Query: 323 ERGANACGIESYAYLASV 340
            RG+NACGI      A V
Sbjct: 457 HRGSNACGINKMCSSAVV 474


>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
          Length = 475

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 103/318 (32%), Positives = 161/318 (50%), Gaps = 24/318 (7%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSD 85
           +S++ +  FK ++VK+N+ Y+  +E   R   F ++ K  ++          YG +  SD
Sbjct: 169 ESVELLGQFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSD 228

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            + +E       R T       +    R  K  +   KGP P S DWR      ++ V++
Sbjct: 229 LTEEE------FRSTYLNPLLSQWTLHRPMKPASP-AKGPAPASWDWRDHGA--VSSVKN 279

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+ T  +E Q  L   TL  LS+ +LV+CD  +  CNGG    A+E +++ 
Sbjct: 280 QGMCGSCWAFSVTGNIEGQWFLKNGTLVSLSEQELVDCDGLDQACNGGLPSNAYEAIEKL 339

Query: 206 G-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN 262
           G LE++ DY Y  K+     C +  +K   ++  +   S  +  +   L ++GP+ V LN
Sbjct: 340 GGLETETDYSYIGKKQ---SCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVSVALN 396

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
              ++ Y           CNP  +DHAV +VGYGE+ GI  W ++NSWG+   + GY+ +
Sbjct: 397 AFAMQFYRKGVSHPLKIFCNPWMIDHAVLMVGYGERKGIPFWAIKNSWGEDYGEQGYYNL 456

Query: 323 ERGANACGIESYAYLASV 340
            RG+NACGI      A V
Sbjct: 457 YRGSNACGINKMCSSAVV 474


>gi|96979798|ref|YP_611001.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
 gi|37077647|sp|Q91CL9.1|CATV_NPVAP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|16041073|dbj|BAB69773.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
 gi|94983331|gb|ABF50271.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
 gi|146229694|gb|ABQ12259.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
          Length = 324

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 108/312 (34%), Positives = 166/312 (53%), Gaps = 25/312 (8%)

Query: 33  AYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKE--------TDEYYGTSGSS 84
           AYD +K    F+ ++ K+N+ Y+ ++E   RF+ F+ + +E        T   Y  +  S
Sbjct: 18  AYDLLKAPSYFEEFLHKFNKNYSSESEKLRRFKIFQHNLEEIINKNQNDTSAQYEINKFS 77

Query: 85  DRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           D S  E + + TGL L  +++   E     V        KGPL    DWR  ++  +  V
Sbjct: 78  DLSKDETISKYTGLSLPLQKQNFCE-----VVVLDRPPDKGPL--EFDWR--RLNKVTSV 128

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           ++QG CG+CWAFAT   LESQ A+    L  LS+ QL++CD  ++ C+GG +  A+E V 
Sbjct: 129 KNQGMCGACWAFATLGSLESQFAIKHDQLINLSEQQLIDCDFVDVGCDGGLLHTAYEAVM 188

Query: 204 QY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQ-SGPIGVYL 261
              G++++ DYPY    N   R    K   +V     +VT   + +  LL+  GPI V +
Sbjct: 189 NMGGIQAENDYPYE-ANNGPCRVNAAKFVVRVKKCYRYVTLFEEKLKDLLRIVGPIPVAI 247

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           +   I  Y    IR     C  H L+HAV +VGYG +NGI  WI++N+WG    + GYF+
Sbjct: 248 DASDIVGYKRGIIRY----CENHGLNHAVLLVGYGVENGIPFWILKNTWGADWGEQGYFR 303

Query: 322 IERGANACGIES 333
           +++  NACGI++
Sbjct: 304 VQQNINACGIKN 315


>gi|405977658|gb|EKC42097.1| Cathepsin F [Crassostrea gigas]
          Length = 715

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 103/311 (33%), Positives = 154/311 (49%), Gaps = 25/311 (8%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYF---------KQDGKETDEYYGTSGSSDRSPQEIL 92
            F+ +   + R Y    E KTRF+ F          QD ++    YG +  +D S  E  
Sbjct: 417 VFQQFQAAFKRLYMSKQEEKTRFKIFCENMRKAKKLQDVEKGTAVYGVTKFADMSESEFK 476

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           Q  G        +  + +  +  K     +   LP S DWR+     +  V++QG CGSC
Sbjct: 477 QYVG--------KVWDQNANKGMKKAKIPEMNSLPNSFDWREHGA--VTEVKNQGSCGSC 526

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLESQA 211
           WAF+TT  +E Q A+ KK L  LS+ +LV+CD  +  CNGG    A+ E ++  GLE++ 
Sbjct: 527 WAFSTTGNIEGQWAISKKKLVSLSEQELVDCDKVDEGCNGGLPSQAYKEIIRLGGLETET 586

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIESY 269
           DY YR       +C+ +K K +V +  +   S  +  M   L+++GPI + +N   ++ Y
Sbjct: 587 DYKYRGHNE---KCSMDKSKIRVKINGSVSISSNETEMAAWLVKNGPISIGINAFAMQFY 643

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
            G         CNP +LDH V IVGYG K     WI++NSWG    + GY+ + RGA  C
Sbjct: 644 MGGISHPWKIFCNPKELDHGVLIVGYGVKGSKPYWIIKNSWGPDWGEKGYYLVYRGAGVC 703

Query: 330 GIESYAYLASV 340
           G+ +    A V
Sbjct: 704 GLNTMCTSAVV 714


>gi|30387350|ref|NP_848429.1| cathepsin [Choristoneura fumiferana MNPV]
 gi|1168799|sp|P41715.1|CATV_NPVCF RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|332509|gb|AAA96732.1| cathepsin [Choristoneura fumiferana MNPV]
 gi|30270084|gb|AAP29900.1| cathepsin [Choristoneura fumiferana MNPV]
          Length = 324

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 103/312 (33%), Positives = 167/312 (53%), Gaps = 25/312 (8%)

Query: 33  AYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKE--------TDEYYGTSGSS 84
           AYD +K  + F+ ++ K+N++Y+ ++E   RF+ F+ + +E        +   Y  +  +
Sbjct: 18  AYDVLKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEIINKNHNDSTAQYEINKFA 77

Query: 85  DRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           D S  E + + TGL L  + +   E     V        KGPL    DWR  ++  +  V
Sbjct: 78  DLSKDETISKYTGLSLPLQTQNFCE-----VVVLDRPPDKGPL--EFDWR--RLNKVTSV 128

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           ++QG CG+CWAFAT   LESQ A+       LS+ QL++CD  +  C+GG +  AFE V 
Sbjct: 129 KNQGMCGACWAFATLGSLESQFAIKHNQFINLSEQQLIDCDFVDAGCDGGLLHTAFEAVM 188

Query: 204 QY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS-GPIGVYL 261
              G+++++DYPY    N   R    K   KV     ++T   + +  LL+S GPI V +
Sbjct: 189 NMGGIQAESDYPYE-ANNGDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPIPVAI 247

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           +   I +Y    ++     C  H L+HAV +VGY  +NG+  WI++N+WG    + GYF+
Sbjct: 248 DASDIVNYKRGIMKY----CANHGLNHAVLLVGYAVENGVPFWILKNTWGADWGEQGYFR 303

Query: 322 IERGANACGIES 333
           +++  NACGI++
Sbjct: 304 VQQNINACGIQN 315


>gi|387015020|gb|AFJ49629.1| Cathepsin H [Crotalus adamanteus]
          Length = 337

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 102/302 (33%), Positives = 150/302 (49%), Gaps = 16/302 (5%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           FK +  +  R Y  + E + R + F  D K+  + +    SS R    + Q + +  T  
Sbjct: 36  FKAWASQHRRAYRSEEEFRHRLQIF-LDNKQKIDKHNAGNSSFR--MGLNQFSDMTFTEF 92

Query: 103 EKERLEADRERVKKFLNE--RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
            K+ L  + +     +    R  GP PK++DWR+ K K ++PV++QG CGSCW F+TT  
Sbjct: 93  RKKYLWQEPQNCSATMGNFPRSAGPCPKAIDWRK-KGKFVSPVKNQGSCGSCWTFSTTGC 151

Query: 161 LESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRN 217
           LES +A+    L  L++ QL++C  +  N  C+GG    AFEY+    GL  +  YPYR 
Sbjct: 152 LESAIAIKTGKLLNLAEQQLIDCAQNFNNFGCSGGLPSQAFEYILYNKGLMDEEAYPYRA 211

Query: 218 KENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSG---PIGVYLNHRL-IESYDGNP 273
           +      C ++ +KA  F++D    S  D    +   G   P+ +    R     Y    
Sbjct: 212 QNGT---CKFQPQKAVAFIKDVVNISLYDEQGLVQAVGTYNPVSIAFEVREDFVHYQEGV 268

Query: 274 IRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIES 333
               D    P K++HAV  VGYGE+ G+  WIV+NSWG      GYF IERG N CG+  
Sbjct: 269 YTSTDCDKTPDKVNHAVLAVGYGEEGGVPFWIVKNSWGTSWGLDGYFNIERGKNMCGLAD 328

Query: 334 YA 335
            A
Sbjct: 329 CA 330


>gi|156389068|ref|XP_001634814.1| predicted protein [Nematostella vectensis]
 gi|156221901|gb|EDO42751.1| predicted protein [Nematostella vectensis]
          Length = 276

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 83/220 (37%), Positives = 127/220 (57%), Gaps = 8/220 (3%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G +P+S+DWR   V  + PV++QG CGSCWAF+TT  +E Q A+    L  LS+ +LV+C
Sbjct: 61  GDIPESVDWRDKGV--VTPVKNQGSCGSCWAFSTTGNIEGQYAIKTGKLVSLSEQELVDC 118

Query: 184 DHGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
           D  +  C GG    A++ +++ G LES++DYPY+  ++   +C + K + KV +  + V 
Sbjct: 119 DTIDKGCEGGLPSNAYKQIEKLGGLESESDYPYKGADS---KCKFNKAEVKVTINSSVVI 175

Query: 243 SGVDHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
           S  +  +   L ++GPI + +N   ++ Y G         CNP  L+H V IVGYG KNG
Sbjct: 176 SKDEKEIAAWLAKNGPISIGINANAMQFYMGGIAHPWKIFCNPSSLNHGVLIVGYGVKNG 235

Query: 301 ILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
              WI++NSWG    + GY+ I RG   CG+ +    A +
Sbjct: 236 TPYWIIKNSWGPSWGEKGYYLIYRGGGCCGLNTMCTSAVI 275


>gi|85068708|gb|ABC69434.1| cysteine protease [Clonorchis sinensis]
 gi|85068710|gb|ABC69435.1| cysteine protease [Clonorchis sinensis]
          Length = 328

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 108/321 (33%), Positives = 159/321 (49%), Gaps = 32/321 (9%)

Query: 28  VWRDLAYDSIKQVD----AFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKET 74
           +W  LA  +  + D     ++ + +K+ +TY++D++ + RFE FK         Q+ ++ 
Sbjct: 13  IWSALARTTQVEPDNARALYEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQG 71

Query: 75  DEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLP-KSLDWR 133
              YG +  SD + +E   R           R+  D   V + L   +   +  +  DWR
Sbjct: 72  TAQYGVTQFSDLTSEEFKTRY---------LRMRFDGPIVSEDLTPEEDVTMDNEKFDWR 122

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
           +     + PV  QG+CGSCWAF+    +E Q       L  LS+ QLV+CDH +  CNGG
Sbjct: 123 EHGA--VGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGG 180

Query: 194 NIDVAF-EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--H 250
                + E  K  GLE  +DYPY   + I   C   + K   +V D+ V    + +    
Sbjct: 181 YPPKTYGEIEKMGGLELASDYPYTGVDGI---CYMNQSKFVAYVNDSTVLPLSEKIQAQK 237

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L + GP+   LN  L++ Y G  I    + CNPH L+HAV  VGYG + GI  WIV+NSW
Sbjct: 238 LKEIGPLSSALNAVLLQFYLGGIIFPIPFLCNPHGLNHAVLTVGYGTEFGIPYWIVKNSW 297

Query: 311 GDIGPDHGYFQIERGANACGI 331
           G    + GYF+I RGA  CGI
Sbjct: 298 GVGFGEKGYFRIFRGAGTCGI 318


>gi|113195461|ref|YP_717598.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
 gi|66968272|gb|AAY59557.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
          Length = 325

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 99/321 (30%), Positives = 157/321 (48%), Gaps = 29/321 (9%)

Query: 25  AIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD--------E 76
           AI       YD +K  D F++++  +N+ Y D  E   R++ FK + +E +         
Sbjct: 9   AIITSSVCGYDLLKAPDYFESFVANYNKMYNDTQEKAYRYKIFKHNLEEINIKNQVEDHA 68

Query: 77  YYGTSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQS 135
            +  +  SD S  EI+ + TGL L    +E         +  + +      P + DWRQ 
Sbjct: 69  VFSINKFSDMSKSEIISKYTGLSLPSLMQENF------CRAIILDGPPNKAPINFDWRQ- 121

Query: 136 KVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI 195
               + PV  QG CGSCWAF+T A +ESQ ++       LS  QLV+CD  N+ C GG +
Sbjct: 122 -YNAVTPVRVQGNCGSCWAFSTLAGIESQYSIKYNKQISLSVQQLVDCDTSNMGCAGGLL 180

Query: 196 DVAFEYVKQY--GLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHL 251
             A E +     G+  + DYPY+  +    +C        V V     ++    + +  +
Sbjct: 181 HTALEQIINAGGGVLQEEDYPYKGVDK---QCNLPHNNFAVQVLGCYRYIVMNEEKLKDV 237

Query: 252 LQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L++ GPI V ++   I  Y    IR     C  + L+HAV +VGYG ++G+  W ++N+W
Sbjct: 238 LRAVGPIPVAIDAASIVDYSRGIIR----TCTYYGLNHAVLLVGYGVQDGVPYWTLKNTW 293

Query: 311 GDIGPDHGYFQIERGANACGI 331
           GD   +HGYF++ +  N+CGI
Sbjct: 294 GDDWGEHGYFRVRQNVNSCGI 314


>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
          Length = 1036

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 101/323 (31%), Positives = 157/323 (48%), Gaps = 31/323 (9%)

Query: 37   IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
            +K+   F  ++ K+ + Y +  E + RF+ FK +    +E          YG +  +D +
Sbjct: 725  LKEEILFHEFMGKYKKMYHNKEEKEMRFQIFKDNLNLIEELQRNEMGTGRYGVTQFTDLT 784

Query: 88   PQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
              E   R  GL+ T K +  +      +           LP   DWR   V  + PV+ Q
Sbjct: 785  KAEFKARHLGLKPTLKSENDIPMPMATIPDI-------ELPSDYDWRHHNV--VTPVKDQ 835

Query: 147  GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY- 205
            G CGSCWAF+ T  +E Q A+    L  LS+ +LV+CD  +  CNGG  D A+  +++  
Sbjct: 836  GSCGSCWAFSVTGNIEGQYAIKHGELLSLSEQELVDCDKLDSGCNGGLPDTAYRAIEELG 895

Query: 206  GLESQADYPYRNKENITFRCTYEKEKAKV-FVQDTWVTSGVDHMMH-LLQSGPIGVYLNH 263
            GLE ++DYPY  ++    +C + K K KV  V    +TS    M   L+++GP+ + +N 
Sbjct: 896  GLELESDYPYDAEDE---KCHFNKNKVKVNIVSGLNITSNETQMAQWLVKNGPMSIGINA 952

Query: 264  RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE------KNGILTWIVRNSWGDIGPDH 317
              ++ Y G       + C+P  LDH V IVGYG       K  +  WI++NSWG    + 
Sbjct: 953  NAMQFYMGGVSHPFKFLCSPDSLDHGVLIVGYGVKFYPIFKKTMPYWIIKNSWGPRWGEQ 1012

Query: 318  GYFQIERGANACGIESYAYLASV 340
            GY+++ RG   CG+      A V
Sbjct: 1013 GYYRVYRGDGTCGVNKMVTSAVV 1035


>gi|2351557|gb|AAB68595.1| cathepsin [Choristoneura fumiferana MNPV]
          Length = 324

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 104/312 (33%), Positives = 164/312 (52%), Gaps = 25/312 (8%)

Query: 33  AYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKE--------TDEYYGTSGSS 84
           AYD +K    F+ ++  +N+ Y+  +E   RF+ F+ + +E        T   Y  +  S
Sbjct: 18  AYDLLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDTSAQYEINKFS 77

Query: 85  DRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           D S  E + + TGL L  + +   E     V        KGPL    DWR  ++  +  V
Sbjct: 78  DLSKDETISKYTGLSLPLQNQNFCE-----VVVLNRPPDKGPL--EFDWR--RLNKVTSV 128

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           ++QG CG+CWAFAT   LESQ A+    L  LS+ QL++CD  ++ C+GG +  A+E V 
Sbjct: 129 KNQGTCGACWAFATLGSLESQFAIKHDQLINLSEQQLIDCDFVDMGCDGGLLHTAYEAVM 188

Query: 204 QY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS-GPIGVYL 261
              G++++ DYPY    N   R    K   KV     ++T   + +  LL+S GPI V +
Sbjct: 189 NMGGIQAENDYPYE-ANNGDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPIPVAI 247

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           +   I +Y    ++     C  H L+HAV +VGY  +NG+  WI++N+WG    + GYF+
Sbjct: 248 DASDIVNYKRGIMKY----CANHGLNHAVLLVGYAVQNGVPFWILKNTWGADWGEQGYFR 303

Query: 322 IERGANACGIES 333
           +++  NACGI++
Sbjct: 304 VQQNINACGIQN 315


>gi|242014216|ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
 gi|212512256|gb|EEB15049.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
          Length = 434

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 100/310 (32%), Positives = 153/310 (49%), Gaps = 34/310 (10%)

Query: 40  VDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD---------EYYGTSGSSDRSPQE 90
           + +FK +++K+N+ Y    E K RF  F+ + K+ +           YG +  SD S  E
Sbjct: 131 LQSFKDFVLKFNKVYFSKEEFKKRFRIFRANMKKINFLNKAEKGTAQYGITEFSDLSVTE 190

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
                GL+    E +   A+   VK          LP + DWR      + PV++QG CG
Sbjct: 191 FKNYLGLK-KKPESKLPTAEIPDVK----------LPDNFDWRH--YNAVTPVKNQGSCG 237

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLES 209
           SCWAF+ T  +E   A+ K  L  LS+ +L++CD  +  CNGG +   +E + +  GLE+
Sbjct: 238 SCWAFSVTGNIEGLWAIKKHELLSLSEQELIDCDKIDNGCNGGYMPETYEAIMKLGGLET 297

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQSGPIGVYLNHRLIE 267
           + DYPY  +     +C   K + KV +        S +D    L ++GP+   LN   ++
Sbjct: 298 ETDYPYEAENE---KCNLNKTEIKVKINGAVNLTKSELDIAKWLYKNGPVSAGLNANAMQ 354

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILT-----WIVRNSWGDIGPDHGYFQ 321
            Y G         CNP + DH + IVGYG  K+ IL      WI++NSWG    + GY++
Sbjct: 355 FYLGGISHPPKILCNPEEQDHGILIVGYGIHKSSILKRTIPYWIIKNSWGKHWGEKGYYR 414

Query: 322 IERGANACGI 331
           + RG+  CGI
Sbjct: 415 LYRGSGVCGI 424


>gi|9635308|ref|NP_059206.1| ORF58 [Xestia c-nigrum granulovirus]
 gi|13124001|sp|Q9PYY5.1|CATV_GVXN RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|6175702|gb|AAF05172.1|AF162221_58 ORF58 [Xestia c-nigrum granulovirus]
          Length = 346

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 96/311 (30%), Positives = 156/311 (50%), Gaps = 21/311 (6%)

Query: 32  LAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTSGS 83
           +AYD     + F  ++VK+N+ Y DD E + RFE FKQ+          E    +  +  
Sbjct: 32  IAYDMSNAQELFNEFVVKYNKVYKDDQEKEARFEIFKQNLADINARNALEDSAMFEINSR 91

Query: 84  SDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +D S  E+LQ+ TGL+L+    E+   +       ++    G +P S DWR      +  
Sbjct: 92  ADISSNELLQKLTGLKLSLMRGEK--KNSFCTPTVISGDSSGKVPDSFDWRDRNS--VTS 147

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFE-Y 201
           V+ Q  CGSCWAF+  A +ES   +       LS+ QLV+CD  N  CNGG +  AFE  
Sbjct: 148 VKMQKECGSCWAFSAVANIESLYHIKHNVSLDLSEQQLVDCDKVNNGCNGGLMSWAFEGI 207

Query: 202 VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYL 261
           ++  G+  +A YPY   + +    T   + +  +  D      +  ++H  + GP+ V +
Sbjct: 208 IRAGGISYEAPYPYTGVDGVCKNTTRYVQLSGCYAYDLRSEKKLRQVLH--EKGPVSVAI 265

Query: 262 NHRLIESYDGNPIRRNDWACN-PHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           +   + +Y     +     C+  H L+H V +VGYG++N +  W ++NSWG    + G+F
Sbjct: 266 DVVDLTNYKSGVAKH----CSVDHGLNHGVLLVGYGQENDVKYWTLKNSWGSDWGEQGFF 321

Query: 321 QIERGANACGI 331
           +I+R  N+CGI
Sbjct: 322 RIKRDVNSCGI 332


>gi|12597541|ref|NP_075125.1| cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
 gi|15426394|ref|NP_203611.1| cathepsin [Helicoverpa armigera NPV]
 gi|12483807|gb|AAG53799.1|AF271059_56 cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
 gi|15384470|gb|AAK96381.1|AF303045_123 cathepsin [Helicoverpa armigera NPV]
 gi|18027090|gb|AAL55725.1|AF268612_1 cathepsin [Helicoverpa armigera NPV]
          Length = 365

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 99/316 (31%), Positives = 162/316 (51%), Gaps = 43/316 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQ--------------------DGKETDEYYGTSG 82
           FK ++ ++N++Y D  E + R+  FK                     D   T   +G + 
Sbjct: 55  FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 114

Query: 83  SSDRSPQEILQ-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLN 141
            SD++P E+L   TG  L   +   L  +R  VK   N R    LP   DWR +    + 
Sbjct: 115 FSDKTPDEVLHSNTGFFLNLSQHYTLCENR-IVKGAPNIR----LPDYYDWRDTNK--VT 167

Query: 142 PVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-E 200
           P++ QG CGSCWAF     +ESQ A+    L  LS+ QL++CD  +L CNGG + +AF E
Sbjct: 168 PIKDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQE 227

Query: 201 YVKQYGLESQADYPYRNKENITFRCTYEKEKAKV-----FVQDTWVTSGVDHMMHLLQSG 255
            +   G+E++ADYPY+  E +   CT +  K  V     F  D    + +  +++   +G
Sbjct: 228 LLLMGGVETEADYPYQGSEQM---CTLDNRKIAVKLNSCFKYDIRDENKLKELVY--TTG 282

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           P+ + ++   I +Y    + +    C+ + L+HAV ++G+G +N +  WI++NSWG+   
Sbjct: 283 PVAIAVDAMDIINYRRGILNQ----CHIYDLNHAVLLIGWGIENNVPYWIIKNSWGEDWG 338

Query: 316 DHGYFQIERGANACGI 331
           ++GY ++ R  NACG+
Sbjct: 339 ENGYLRVRRNVNACGL 354


>gi|344310882|gb|AEN03980.1| cathepsin-like cysteine proteinase [Helicoverpa armigera NPV strain
           Australia]
          Length = 367

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 99/316 (31%), Positives = 162/316 (51%), Gaps = 43/316 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQ--------------------DGKETDEYYGTSG 82
           FK ++ ++N++Y D  E + R+  FK                     D   T   +G + 
Sbjct: 57  FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116

Query: 83  SSDRSPQEILQ-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLN 141
            SD++P E+L   TG  L   +   L  +R  VK   N R    LP   DWR +    + 
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQHYTLCENR-IVKGAPNIR----LPDYYDWRDTNK--VT 169

Query: 142 PVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-E 200
           P++ QG CGSCWAF     +ESQ A+    L  LS+ QL++CD  +L CNGG + +AF E
Sbjct: 170 PIKDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQE 229

Query: 201 YVKQYGLESQADYPYRNKENITFRCTYEKEKAKV-----FVQDTWVTSGVDHMMHLLQSG 255
            +   G+E++ADYPY+  E +   CT +  K  V     F  D    + +  +++   +G
Sbjct: 230 LLLMGGVETEADYPYQGSEQM---CTLDNRKIAVKLNSCFKYDIRDENKLKELVY--TTG 284

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           P+ + ++   I +Y    + +    C+ + L+HAV ++G+G +N +  WI++NSWG+   
Sbjct: 285 PVAIAVDAMDIINYRRGILNQ----CHIYDLNHAVLLIGWGIENNVPYWIIKNSWGEDWG 340

Query: 316 DHGYFQIERGANACGI 331
           ++GY ++ R  NACG+
Sbjct: 341 ENGYLRVRRNVNACGL 356


>gi|118395092|ref|XP_001029901.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89284178|gb|EAR82238.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 344

 Score =  154 bits (389), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 100/323 (30%), Positives = 173/323 (53%), Gaps = 27/323 (8%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQ 89
           +++  F+ +  K+N+ Y +++E  + F  +K   +   ++        +G +  SD SP+
Sbjct: 28  QRLAEFEEFKSKFNKYYHNEHEHHSSFHNYKTSREHIVKHQMENPNAKFGHTKFSDMSPE 87

Query: 90  EI---LQRTGLRLTGKEKER-LEADRERVKKFLNERKK---GPLPKSLDWRQSKVKVLNP 142
           E    +      L  K K + ++   E +K +L + +      LP+S DWR   +  + P
Sbjct: 88  EFENKMLNFDFSLFKKAKSQGIKLKAEPMKGYLRQGENVDNSDLPESFDWRDKGI--ITP 145

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV 202
            + Q  CGSCW FATT ++ESQ AL    L   S+  L++CD+ N  C GG +  A++++
Sbjct: 146 AKFQNTCGSCWTFATTGVIESQYALKYGELLHFSEQMLLDCDNINQGCRGGLMTDAYQFL 205

Query: 203 KQYGLESQADY--PYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIG 258
           +Q G    AD    Y+NK++I   C ++K K K  V D +     +  +   L+++GP+ 
Sbjct: 206 QQSGGIQTADTYGDYKNKKDI---CNFDKAKVKAKVVDWYQIPENEETIRRELVKNGPVA 262

Query: 259 VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHG 318
           V +N R ++ Y+G  +   +  C+  K++HAV IVGYG + GI  W+++N WG      G
Sbjct: 263 VGINARTLQFYEGGIVDPKN--CDD-KINHAVLIVGYGVEEGIPYWLIKNQWGAEWGIKG 319

Query: 319 YFQIERGANACGIESYAYLASVK 341
           +F++ RG   CGI +YA +A V+
Sbjct: 320 FFKLIRGKKQCGIHTYASIAYVE 342


>gi|4760897|gb|AAD29130.1| cysteine proteinase 1 precursor [Clonorchis sinensis]
          Length = 328

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 109/321 (33%), Positives = 159/321 (49%), Gaps = 32/321 (9%)

Query: 28  VWRDLAYDSIKQVD----AFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKET 74
           +W  LA  +  + D     ++ + +K+ +TY++D++ + RFE FK         Q+ ++ 
Sbjct: 13  IWSALARTTQVEPDNARALYEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQG 71

Query: 75  DEYYGTSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWR 133
              YG +  SD + +E   R   +R  G       +  E V    NE+         DWR
Sbjct: 72  TAQYGVTQFSDLTSEEFKTRYLRMRFDGPIVSEDPSPEEDVT-MDNEK--------FDWR 122

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
           +     + PV  QG+CGSCWAF+    +E Q       L  LS+ QLV+CDH +  CNGG
Sbjct: 123 EHGA--VGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGG 180

Query: 194 NIDVAF-EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--H 250
                + E  K  GLE  +DYPY   + I   C   + K   +V ++ V    + +    
Sbjct: 181 YPPKTYGEIEKMGGLELASDYPYTGVDGI---CYMNQSKFVAYVNESTVLPLSEKIQAQK 237

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L + GP+   LN  L++ Y G  I    + CNPH L+HAV  VGYG + GI  WIV+NSW
Sbjct: 238 LKEIGPLSSALNAVLLQFYLGGIIFPIPFLCNPHGLNHAVLTVGYGTEFGIPYWIVKNSW 297

Query: 311 GDIGPDHGYFQIERGANACGI 331
           G    + GYF+I RGA  CGI
Sbjct: 298 GVGFGEKGYFRIFRGAGTCGI 318


>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
          Length = 475

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 158/318 (49%), Gaps = 24/318 (7%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSD 85
           +S++ +  FK ++ K+N+ Y+   E+  R   F ++ K  ++          YG +  SD
Sbjct: 169 ESVELLGQFKEFMTKYNKVYSSQEEVDRRLRIFHENLKTAEKLQALDQGSAEYGVTKFSD 228

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            + +E  + T L     +    +  +           KGP P S DWR      ++PV++
Sbjct: 229 LTEEE-FRSTYLNPLLSQWTLHQPMKPATPA------KGPSPDSWDWRDHGA--VSPVKN 279

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+    +E Q  L   TL  LS+ +LV+CD  +  C GG    A+E +++ 
Sbjct: 280 QGMCGSCWAFSVIGNIEGQWFLKNGTLLSLSEQELVDCDGLDQACRGGLPSNAYEAIEKL 339

Query: 206 G-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN 262
           G LE+++DY Y   +    RC +   K   ++  +      +  +   L ++GP+ V LN
Sbjct: 340 GGLETESDYSYTGHKQ---RCDFTTGKVAAYINSSVELPKDEKEIAAWLAENGPVSVALN 396

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
              ++ Y           CNP  +DHAV +VGYGE+ GI  W ++NSWG+   + GY+ +
Sbjct: 397 AFAMQFYRKGISHPLKIFCNPWMIDHAVLLVGYGERKGIPFWAIKNSWGEDYGEQGYYYL 456

Query: 323 ERGANACGIESYAYLASV 340
            RG+NACGI      A V
Sbjct: 457 YRGSNACGINKMCSSAVV 474


>gi|228861649|ref|YP_002854669.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
 gi|226425097|gb|ACO53509.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
          Length = 334

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 165/315 (52%), Gaps = 24/315 (7%)

Query: 28  VWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYG 79
           +++   YD +K  D F+ ++  +N+ YTD  E   R+  FK + +E +          Y 
Sbjct: 22  IFQSDTYDPLKAADYFELFVANYNKNYTDPLEKTKRYHIFKDNLEEINNKNKSNDTAVYR 81

Query: 80  TSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVK 138
            +  SD S  E++ + TGL + G+      A+  ++        KGPL  + DWRQ    
Sbjct: 82  INKFSDLSTNELISKYTGLNVPGET-----ANFCKIVVLDQPPGKGPL--NFDWRQQNK- 133

Query: 139 VLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVA 198
            + P+++QG CG+CWAFAT A +ESQ A+       LS+ Q+++CD+ ++ C GG +  A
Sbjct: 134 -VTPIKNQGACGACWAFATLASIESQYAIRNNVHLDLSEQQMIDCDYVDMGCYGGLLHTA 192

Query: 199 FEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS-GP 256
           FE + Q  G+E +  YPY    N     + E+   KV     ++    + +  LL++ GP
Sbjct: 193 FEQMIQMGGVEEERQYPYEGVNNNCRLKSDERFVVKVKGCYRYLVMREEKLKDLLRAVGP 252

Query: 257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPD 316
           + + ++   I +Y    I      C  + L+HAV +VGYG +NG+  W  +N+WGD   +
Sbjct: 253 LPMAIDASSIFNYYRGVINY----CGNNGLNHAVLLVGYGVENGVPFWTFKNTWGDDWGE 308

Query: 317 HGYFQIERGANACGI 331
            GYF++ +  +ACG+
Sbjct: 309 DGYFRVRQNVDACGM 323


>gi|223648298|gb|ACN10907.1| Cathepsin F precursor [Salmo salar]
          Length = 474

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 96/311 (30%), Positives = 158/311 (50%), Gaps = 24/311 (7%)

Query: 33  AYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGS 83
           + DS++ +  FK ++V++NRTY+   E   R   F ++ K  ++          YG +  
Sbjct: 166 SVDSVELLGQFKEFMVRYNRTYSSQEEADRRLRVFHENLKTAEKLQSLDQGTAEYGVTKF 225

Query: 84  SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           SD + +E   RT        ++ L+   +          +GP P S DWR+     ++PV
Sbjct: 226 SDLTEEEF--RTLYLNPLLSQQNLQQSMKPAA-----MPRGPAPPSWDWREHGA--VSPV 276

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           ++QG CGSCWAF+ T  +E Q       L  LS+ +LV+CD  +  C GG    A+E ++
Sbjct: 277 KNQGMCGSCWAFSVTGNIEGQWFAKTGKLVSLSEQELVDCDTVDQACGGGLPSNAYEAIE 336

Query: 204 QYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVY 260
           + G LE++ DY Y  K+     C +  +K   ++  +   S  ++ +   L ++GP+ V 
Sbjct: 337 KLGGLETETDYSYTGKKQ---SCDFTTDKVIAYINSSVELSTDENEIAAWLAENGPVSVA 393

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           LN   ++ Y           CNP  +DHAV +VGYGE+ G   W ++NSWG+   + GY+
Sbjct: 394 LNAFAMQFYRKGVSHPLKIFCNPWMIDHAVLLVGYGERQGKPFWAIKNSWGEDYGEQGYY 453

Query: 321 QIERGANACGI 331
            + RG+  CGI
Sbjct: 454 YLYRGSRLCGI 464


>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
          Length = 451

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 100/325 (30%), Positives = 164/325 (50%), Gaps = 39/325 (12%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD---GKETDEY------YGTSGSSD 85
           DS++ +  FK ++  +N++Y +  E + R   F ++    ++  E       YG +  SD
Sbjct: 146 DSVQLISLFKDFLTTYNKSYANATETQRRLGIFARNLELARKVQELDRGSAEYGVTKFSD 205

Query: 86  RSPQEI----LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLN 141
            + +E     L      L G+      A R            GP P S DWR      + 
Sbjct: 206 LTEEEFRTSYLNPLLSSLPGRALRPGPATR------------GPAPASWDWRDHGA--VT 251

Query: 142 PVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEY 201
            V++QG CGSCWAF+ T  +E Q  L +  L  LS+ +LV+CD  +  C GG    A+  
Sbjct: 252 GVKNQGACGSCWAFSVTGNVEGQWFLRRGALLALSEQELVDCDTLDQACGGGLPSNAYTA 311

Query: 202 VKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIG 258
           +++ G LE++ DY Y  ++    RC++  +KA+V++  +   S  +  +   L ++GP+ 
Sbjct: 312 IEKLGGLETEKDYSYEGRKE---RCSFSPDKARVYINSSVDLSRDEEELATWLAENGPVS 368

Query: 259 VYLNHRLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           + LN   ++ Y     +P R     C+P  +DHAV +VGYG ++GI  W ++NSWG    
Sbjct: 369 IALNAFAMQFYRRGVSHPFRP---LCSPWFIDHAVLLVGYGHRSGIPFWAIKNSWGPDWG 425

Query: 316 DHGYFQIERGANACGIESYAYLASV 340
           + GY+ + RGA ACG+ + A  A V
Sbjct: 426 EEGYYYLYRGARACGVNAMASSAIV 450


>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
 gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
          Length = 335

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 101/316 (31%), Positives = 168/316 (53%), Gaps = 33/316 (10%)

Query: 33  AYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK--------QDGKETD---EYYGTS 81
           AY+  +  D F++++  +N+ YT D E   R+  FK        ++G  TD     YG +
Sbjct: 25  AYNLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYGIN 84

Query: 82  GSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWR-QSKVKVL 140
             SD S  E++ +     TG    +  ++  +         KGPL    DWR Q+KV   
Sbjct: 85  KFSDLSKSELIAK----FTGLSIPQRASNFCKTIVLNQPPDKGPL--HFDWREQNKVT-- 136

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF- 199
             +++QG CG+CWAFAT A +ESQ A+    L  LS+ QL++CD  ++ CNGG +  AF 
Sbjct: 137 -SIKNQGACGACWAFATLASVESQFAMRHNRLVDLSEQQLIDCDSVDMGCNGGLLHTAFE 195

Query: 200 EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQD---TWVTSGVDHMMHLLQS-G 255
           E ++  G++++ DYP+  ++    RC  ++ +  V        +V    + +  LL++ G
Sbjct: 196 EIIRMGGVQAELDYPFVGRDR---RCGVDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVG 252

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           PI + ++   I +Y    I     +C  + L+HAV +VGYG +NG+  W  +N+WGD   
Sbjct: 253 PIPMAIDAADIVNYYRGVIS----SCENNGLNHAVLLVGYGVENGVPYWAFKNTWGDDWG 308

Query: 316 DHGYFQIERGANACGI 331
           ++GYF++ +  NACG+
Sbjct: 309 ENGYFRVRQNINACGM 324


>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
 gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
          Length = 475

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 95/318 (29%), Positives = 160/318 (50%), Gaps = 24/318 (7%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSD 85
           D ++ +  FK ++V++NRTY+   +   R   F ++ K  ++          YG +  SD
Sbjct: 169 DFVELLGQFKEFMVRYNRTYSSQEDTDRRLRIFHENLKTAEKLQSLDLGTAEYGVTKFSD 228

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            + +E   RT        +++L+   +           GP P S DWR+     ++PV++
Sbjct: 229 LTEEEF--RTLYLNPLLSQQKLQRSMKPAA-----MPHGPAPPSWDWREHGA--VSPVKN 279

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+ T  +E Q  +    L  LS+ +LV+CD  +  C GG    A+E +++ 
Sbjct: 280 QGMCGSCWAFSVTGNIEGQWFVKTGKLVSLSEQELVDCDTADQACGGGLPSNAYEAIEKL 339

Query: 206 G-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN 262
           G +E++ DY Y  K+     C +  +K   ++  +   S  ++ +   L ++GP+ V LN
Sbjct: 340 GGVETETDYSYTGKKQ---SCDFTTDKVTAYINSSVELSKDENEIAAWLAENGPVSVALN 396

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
              ++ Y           CNP  +DHAV +VGYGE+ G   W ++NSWG+   + GY+ +
Sbjct: 397 AFAMQFYRKGVSHPLKIFCNPWMIDHAVLLVGYGERQGKPFWAIKNSWGEDYGEQGYYYL 456

Query: 323 ERGANACGIESYAYLASV 340
            RG+  CGI +    A V
Sbjct: 457 YRGSRLCGINTMCSSAIV 474


>gi|55735421|gb|AAV59468.1| cathepsin [Bombyx mori NPV]
          Length = 323

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 101/318 (31%), Positives = 164/318 (51%), Gaps = 28/318 (8%)

Query: 28  VWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET-------DEYYGT 80
           V +  AYD +K  + F+ ++ ++N+ Y+ + E   RF+ F+ +  E           Y  
Sbjct: 13  VVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQNDSAKYEI 72

Query: 81  SGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKV 139
           +  SD S  E + + TGL L  +        +   K  L ++  G  P   DWR  ++  
Sbjct: 73  NKFSDLSKDETIAKYTGLSLPTQT-------QNFCKVILLDQPPGKGPLEFDWR--RLNK 123

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF 199
           +  V++QG CG+CWAFAT A LESQ A+    L  LS+ Q+++CD  +  CNGG +  AF
Sbjct: 124 VTSVKNQGMCGACWAFATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAF 183

Query: 200 E-YVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SG 255
           E  +K  G++ ++DYPY    N    C     K  V V+D   ++T   + +  LL+  G
Sbjct: 184 EAIIKMGGVQLESDYPYEADNN---NCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVG 240

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           PI + ++   I +Y    I+     C    L+HAV +VGYG +N I  W  +N+WG    
Sbjct: 241 PIPMAIDAADIVNYKQGIIKY----CFDSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG 296

Query: 316 DHGYFQIERGANACGIES 333
           + G+F++++  NACG+ +
Sbjct: 297 EDGFFRVQQNINACGMRN 314


>gi|37651368|ref|NP_932731.1| cathepsin [Choristoneura fumiferana DEF MNPV]
 gi|82024252|sp|Q6VTL7.1|CATV_NPVCD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|37499277|gb|AAQ91676.1| cathepsin [Choristoneura fumiferana DEF MNPV]
          Length = 324

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/312 (33%), Positives = 162/312 (51%), Gaps = 25/312 (8%)

Query: 33  AYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKE--------TDEYYGTSGSS 84
           AYD +K    F+ ++  +N+ Y+  +E   RF+ F+ + +E        T   Y  +  S
Sbjct: 18  AYDLLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDTSAQYEINKFS 77

Query: 85  DRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           D S  E + + TGL L  + +   E     V        KGPL    DWR  ++  +  V
Sbjct: 78  DLSKDETISKYTGLSLPLQNQNFCE-----VVVLNRPPDKGPL--EFDWR--RLNKVTSV 128

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           ++QG CG+CWAFAT   LESQ A+    L  LS+ QL++CD  ++ C+GG +  A+E V 
Sbjct: 129 KNQGTCGACWAFATLGSLESQFAIKHDQLINLSEQQLIDCDFVDMGCDGGLLHTAYEAVM 188

Query: 204 QY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQ-SGPIGVYL 261
              G++++ DYPY    N   R    K   KV     +V    + +  LL+  GP+ V +
Sbjct: 189 NMGGIQAENDYPYE-ANNGDCRLNAAKFVVKVKKCYRYVLMFEEKLKDLLRIVGPLPVAI 247

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           +   I +Y    IR     C  H L+HAV +VGY  +NG+  WI++N+WG    + GYF+
Sbjct: 248 DASDIVNYKRGVIRY----CANHGLNHAVLLVGYAVENGVPFWILKNTWGTDWGEQGYFR 303

Query: 322 IERGANACGIES 333
           +++  NACGI++
Sbjct: 304 VQQNINACGIQN 315


>gi|194898683|ref|XP_001978897.1| GG11133 [Drosophila erecta]
 gi|190650600|gb|EDV47855.1| GG11133 [Drosophila erecta]
          Length = 615

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 102/315 (32%), Positives = 154/315 (48%), Gaps = 30/315 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           F  + V++ R Y    E + R   F+Q+ K  +E          YG +  +D +  E  +
Sbjct: 309 FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADLTSSEYKE 368

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           RTGL       +R EA        +     G LPK  DWRQ     + PV++QG CGSCW
Sbjct: 369 RTGLW------QRDEAKATGGSAAVVPAYHGELPKEFDWRQKNA--VTPVKNQGSCGSCW 420

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQAD 212
           AF+ T  +E   A+    L   S+ +L++CD  +  CNGG +D A++ +K   GLE +A+
Sbjct: 421 AFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAE 480

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGVYLNHRLIESY 269
           YPY+ K+N   +C + +  + V V     +  G +  M   LL  GPI + +N   ++ Y
Sbjct: 481 YPYKAKKN---QCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTKGPISIGINANAMQFY 537

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKN------GILTWIVRNSWGDIGPDHGYFQIE 323
            G         C+   LDH V +VGYG  +       +  WIV+NSWG    + GY+++ 
Sbjct: 538 RGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVY 597

Query: 324 RGANACGIESYAYLA 338
           RG N CG+   A  A
Sbjct: 598 RGDNTCGVSEMATSA 612


>gi|334347644|ref|XP_001379528.2| PREDICTED: cathepsin W-like [Monodelphis domestica]
          Length = 619

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 100/322 (31%), Positives = 171/322 (53%), Gaps = 27/322 (8%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD----GKETDEY-----YGTS 81
           DL   +   +D FK + +++N++Y D  E + RFE F  +     + T+++     +G +
Sbjct: 254 DLPPATQDLMDQFKAFQIQYNKSYADPAEQERRFEIFADNLAWAQQLTEKHGGMAQFGVT 313

Query: 82  GSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLN 141
             SD + +E  Q      +  ++  L+  +       + R + PL +S DWR  K  VL 
Sbjct: 314 QFSDLTEEEFHQHYQPAQSSYKEPSLKTRK-------HPRLQRPLIRSCDWR--KAGVLT 364

Query: 142 PVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAFE 200
           PV  Q +C SCWA A    +E+  A+  +  + LS  ++++CD     C GG + D    
Sbjct: 365 PVRKQKKCRSCWAIAAVGNVEALWAIHYEQHFELSVQEVLDCDRCGKACKGGFVWDAFLT 424

Query: 201 YVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIG 258
            ++Q GL  + DYPY+++  ++ +   +K+    ++QD  +    ++ M  HL   GPI 
Sbjct: 425 ILRQRGLARERDYPYQDQ--LSRKGCQKKQNRTGWIQDFLMLPKEENAMAEHLALKGPIT 482

Query: 259 VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE--KNGILTWIVRNSWGDIGPD 316
           V +N  L+++Y    IR  D  C+P+++DH+V +VG+G+  K+G   WI++NSWG    +
Sbjct: 483 VTINQALLKTYRKGVIRPKD-DCDPNQVDHSVLLVGFGQNTKDGAY-WILKNSWGSDWGE 540

Query: 317 HGYFQIERGANACGIESYAYLA 338
            GYF++ RG NACGI  Y   A
Sbjct: 541 EGYFRLRRGTNACGITKYPVTA 562


>gi|387915132|gb|AFK11175.1| cathspsin H [Callorhinchus milii]
          Length = 330

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 103/314 (32%), Positives = 160/314 (50%), Gaps = 31/314 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRS 87
           + +++ +FKT++ + N+ Y+ + E   R   F Q+ ++ +E+         G +  SD +
Sbjct: 23  TFQEIVSFKTWMTQHNKHYSSE-EYSYRLRTFIQNKRKVEEHNSGRHSYRMGLNQFSDMT 81

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
             E  +   LR    E +   A R       +    GP P  +DWR +K   + PV++QG
Sbjct: 82  FSEFKKLYLLR----EPQNCSATRGN-----HVLSMGPYPDFVDWR-TKGNYVTPVKNQG 131

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-Q 204
            CGSCW F+TT  LES +A+    L  L++ QLV+C   + N  CNGG    AFEY+K  
Sbjct: 132 GCGSCWTFSTTGCLESAIAIKTGKLLSLAEQQLVDCAGAYKNHGCNGGLPSQAFEYIKYN 191

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT-----WVTSGVDHMMHLLQSGPIGV 259
            GLE++ DYPY  ++     C Y+  KA  FV++      +  +G+   +  L    I  
Sbjct: 192 GGLEAEKDYPYTAQDQ---HCQYQPNKAVAFVKEVVNITQYDENGIVDAVARLNPVSIAF 248

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGY 319
            +     + Y+G     ++    P K++HAV  VGYG +NG   WIV+NSWG     +GY
Sbjct: 249 EVTDDFFQ-YEGGVYSNSNCDSTPDKVNHAVLAVGYGVQNGTKYWIVKNSWGPEWGLNGY 307

Query: 320 FQIERGANACGIES 333
           F I RG N CG+ +
Sbjct: 308 FYIIRGKNMCGLAA 321


>gi|46309423|ref|YP_006313.1| ORF31 [Agrotis segetum granulovirus]
 gi|46200640|gb|AAS82707.1| ORF31 [Agrotis segetum granulovirus]
          Length = 327

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 102/308 (33%), Positives = 165/308 (53%), Gaps = 25/308 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGS--------SDRSPQEILQR 94
           F+ ++ K+N++Y+ + E + +F+ FK + +  +E    S S        SD +  E+L++
Sbjct: 25  FEDFVQKYNKSYSSEEERQIKFDNFKNNIRSINEKNSLSNSAVYDINFYSDMNKNELLRK 84

Query: 95  -TGLRLTGKEKER-LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
            TG ++  K+    L  + +  KK +N      LP S DWR   V  +  V++Q  CGSC
Sbjct: 85  QTGFKINLKKNNLDLSWNIKCNKKLINGNPAVLLPDSFDWRDRHV--ITSVKNQRDCGSC 142

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLESQA 211
           WAF+T A +ES  A+    L  LS+ QLV CD  N  CNGG +  A  E ++Q G+ ++ 
Sbjct: 143 WAFSTIANIESLYAIKYNKLLDLSEQQLVNCDEQNNGCNGGLMHWAMEEIIRQGGVSNET 202

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQ-SGPIGVYLNHRLIESYD 270
           D+PY   +     C  ++    +   + ++ S  D +  LL  +GPI + ++   +  Y 
Sbjct: 203 DFPYTASDGF---CKRKQGFVNINGCNQFILSNEDRLRELLIFNGPISIAIDVIDVIDYS 259

Query: 271 G--NPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
              +   RND     + L+HAV +VGYG KN I  WI++NSWG    ++GYF+++R  N+
Sbjct: 260 QGISSTCRND-----NGLNHAVLLVGYGVKNNIPYWILKNSWGSQWGENGYFRVQRNINS 314

Query: 329 CG-IESYA 335
           CG I  YA
Sbjct: 315 CGMINDYA 322


>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
           rotundata]
          Length = 884

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 98/319 (30%), Positives = 160/319 (50%), Gaps = 35/319 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           F+ ++  +N+TY    E   R++ F+++ K  ++          YG +  +D +P+E   
Sbjct: 579 FEDFVKTYNKTYLSAKEKADRYKVFRKNLKMIEKLRKFEQGTAVYGVTMFADLTPEEFKT 638

Query: 94  R-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           +  GL+    ++  +      +           LP   DWR+     + PV+ QG+CGSC
Sbjct: 639 KYLGLKTNLNQENDIPLQEAVIPDI-------DLPPKFDWRE--YNAVTPVKDQGQCGSC 689

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQA 211
           WAF+    +E Q A+  K L  LS+ +LV+CD+ +  C GG +  A++ V++  GLE + 
Sbjct: 690 WAFSAIGNIEGQYAIKHKKLLSLSEQELVDCDNLDDGCGGGYMINAYKTVEKLGGLELET 749

Query: 212 DYPY--RNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIE 267
           DYPY  RN+     +C + K KAKV V      +  +  M   L+++GPI V +N   ++
Sbjct: 750 DYPYDARNE-----KCHFLKNKAKVQVASALNITNDEKKMAQWLVKNGPISVGINANAMQ 804

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGE------KNGILTWIVRNSWGDIGPDHGYFQ 321
            Y G       + C+P  LDH V IVGY        K  +  WI++NSWG    + GY++
Sbjct: 805 FYFGGVSHPFKFLCDPANLDHGVLIVGYATSTYPLFKKKLPYWIIKNSWGPKWGEQGYYR 864

Query: 322 IERGANACGIESYAYLASV 340
           + RG   CG+ + A  A V
Sbjct: 865 VYRGDGTCGVNAMASSAIV 883


>gi|86355549|ref|YP_473217.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
 gi|86198154|dbj|BAE72318.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
          Length = 324

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 106/317 (33%), Positives = 168/317 (52%), Gaps = 25/317 (7%)

Query: 28  VWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKE--------TDEYYG 79
           V    AYD +K    F+ ++ K+N+ Y+ ++E   RF+ F+ + +E        T   Y 
Sbjct: 13  VAHSAAYDLLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDTTAQYE 72

Query: 80  TSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVK 138
            +  SD S  E + + TGL L  + +   E     V        KGPL    DWR  ++ 
Sbjct: 73  INKFSDLSKDETISKYTGLALPLQTQNFCE-----VVVLNRPPDKGPL--EFDWR--RLN 123

Query: 139 VLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVA 198
            +  V++QG CG+CWAFAT A LESQ A+    L  LS+ QL++CD+ +  CNGG +  A
Sbjct: 124 KVTSVKNQGICGACWAFATLASLESQFAIKHNQLINLSEQQLIDCDYVDAGCNGGLLHTA 183

Query: 199 FEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQ-SGP 256
           +E V Q  G++++ DYPY   +    R    K   KV     ++    + +  LL+  GP
Sbjct: 184 YEAVMQMGGVQAENDYPYEGSDG-NCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGP 242

Query: 257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPD 316
           I V ++   I +Y    +R     C+ + L+HAV +VGYG +N +  WI++N+WG+   +
Sbjct: 243 IPVAIDASDIVNYRRGIMRY----CSNYGLNHAVLLVGYGVENNVPYWILKNTWGEDWGE 298

Query: 317 HGYFQIERGANACGIES 333
            GYF++++  NACGI +
Sbjct: 299 QGYFRVQQNINACGIRN 315


>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
          Length = 567

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 98/325 (30%), Positives = 160/325 (49%), Gaps = 39/325 (12%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSD 85
           DS++ +  FK ++  +N++Y +  E + R   F          Q+  +    YG +  SD
Sbjct: 262 DSVELISLFKDFLTTYNKSYANATETQRRLGIFARNLELAHKLQELDQGSAQYGVTKFSD 321

Query: 86  RSPQEI----LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLN 141
            + +E     L      L G+                  R +GP P S DWR      L 
Sbjct: 322 LTEEEFRMFYLNPLLSSLPGRALRP------------APRARGPAPASWDWRDHGA--LT 367

Query: 142 PVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEY 201
             ++QG CGSCWAF+ T  +E Q  L +  L  LS+ +LV+CD  +  C GG    A+  
Sbjct: 368 AAKNQGMCGSCWAFSVTGNVEGQWFLRRGALLTLSEQELVDCDTLDQACGGGLPSNAYTA 427

Query: 202 VKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIG 258
           ++  G LE++ DY Y  ++    RC++  +KA+ ++  +   S  +  +   L ++GP+ 
Sbjct: 428 IETLGGLETEKDYSYEGRKE---RCSFSPDKARAYINSSVDLSRDEQEIAAWLAENGPVS 484

Query: 259 VYLNHRLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           + LN   ++ Y     +P R     C+P  +DHAV +VGYG+++GI  W ++NSWG    
Sbjct: 485 IALNAFAMQFYRRGVSHPFRP---LCSPWFIDHAVLLVGYGDRSGIPFWAIKNSWGPDWG 541

Query: 316 DHGYFQIERGANACGIESYAYLASV 340
           + GY+ + RGA ACG+ + A  A V
Sbjct: 542 EEGYYYLYRGARACGMNTMASSAIV 566


>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
 gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
          Length = 356

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 108/324 (33%), Positives = 173/324 (53%), Gaps = 51/324 (15%)

Query: 34  YDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK--------QDGKETD---EYYGTSG 82
           Y+  +  D F++++  +N+ YT D E   R+  FK        ++G  TD     Y  + 
Sbjct: 47  YNLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINK 106

Query: 83  SSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKF-----LNERK-KGPLPKSLDWR-Q 134
            SD S  E++ + TGL +            ERV  F     LN+   KGPL    DWR Q
Sbjct: 107 FSDLSKSELIAKFTGLSIP-----------ERVSNFCKTIILNQPPDKGPL--HFDWREQ 153

Query: 135 SKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGN 194
           +KV     +++QG CG+CWAFAT A +ESQ A+    L  LS+ QL++CD  ++ CNGG 
Sbjct: 154 NKVT---SIKNQGACGACWAFATLASVESQFAMRHNRLIDLSEQQLIDCDSVDMGCNGGL 210

Query: 195 IDVAFEYVKQY-GLESQADYPY--RNKENITFRCTYEKEKAKVFVQD---TWVTSGVDHM 248
           +  AFE + +  G++++ DYP+  RN+     RC  ++ +  V        +V    + +
Sbjct: 211 LHTAFEEIMRMGGVQTELDYPFVGRNR-----RCGLDRHRPYVVSLVGCYRYVMVNEEKL 265

Query: 249 MHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVR 307
             LL++ GPI + ++   I +Y    I     +C  + L+HAV +VGYG +NG+  W+ +
Sbjct: 266 KDLLRAVGPIPMAIDAADIVNYYRGVIS----SCENNGLNHAVLLVGYGVENGVPYWVFK 321

Query: 308 NSWGDIGPDHGYFQIERGANACGI 331
           N+WGD   ++GYF++ +  NACG+
Sbjct: 322 NTWGDDWGENGYFRVRQNVNACGM 345


>gi|438000427|ref|YP_007250532.1| v-cath protein [Thysanoplusia orichalcea NPV]
 gi|429842964|gb|AGA16276.1| v-cath protein [Thysanoplusia orichalcea NPV]
          Length = 323

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 106/313 (33%), Positives = 165/313 (52%), Gaps = 28/313 (8%)

Query: 33  AYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET-------DEYYGTSGSSD 85
           AYD +K  + F+ ++ ++N+ Y+ + E   RF+ F+ +  E           Y  +  SD
Sbjct: 18  AYDLLKAPNYFEEFVHRFNKNYSSETEKLRRFKIFQHNLNEIINKNQNDSAKYEINKFSD 77

Query: 86  RSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVE 144
            S  E + + TGL L  + +          K  + ++  G  P   DWR+   KV N V+
Sbjct: 78  LSKDETIAKYTGLSLPTQTQNF-------CKVIILDQPPGKGPLDFDWRRLN-KVTN-VK 128

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFE-YVK 203
           +QG CG+CWAFAT A LESQ A+    L  LS+ Q+++CD  +  CNGG +  AFE  +K
Sbjct: 129 NQGTCGACWAFATLASLESQYAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIK 188

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SGPIGVY 260
             G++ ++DYPY   E     C     K  V V+D   +VT   + +  LL+ +GPI + 
Sbjct: 189 MGGVQLESDYPY---EANNNNCRMNGNKFAVRVKDCYRYVTVYEEKLKDLLRVAGPIPMA 245

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           ++   I +Y    IR     C    L+HAV +VGYG +N I  WI +N+WG    + GYF
Sbjct: 246 IDAADIVNYKQGVIRY----CFNSGLNHAVLLVGYGVENNIPFWIFKNTWGTDWGEDGYF 301

Query: 321 QIERGANACGIES 333
           ++++  NACG+ +
Sbjct: 302 RVQQNINACGMRN 314


>gi|18138384|ref|NP_542680.1| cathepsin [Helicoverpa zea SNPV]
 gi|209401110|ref|YP_002273979.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
 gi|37077430|sp|Q8V5U0.1|CATV_NPVHZ RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|18028766|gb|AAL56202.1|AF334030_127 ORF57 [Helicoverpa zea SNPV]
 gi|209364362|dbj|BAG74621.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
          Length = 367

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 97/316 (30%), Positives = 162/316 (51%), Gaps = 43/316 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQ--------------------DGKETDEYYGTSG 82
           FK ++ ++N++Y D  E + R+  FK                     D   T   +G + 
Sbjct: 57  FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116

Query: 83  SSDRSPQEILQ-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLN 141
            SD++P E+L   TG  L   +   L  +R  VK   + R    LP   DWR +    + 
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQHYTLCENR-IVKGAPDIR----LPDYYDWRDTNK--VT 169

Query: 142 PVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-E 200
           P++ QG CGSCWAF     +ESQ A+    L  LS+ QL++CD  +L CNGG + +AF E
Sbjct: 170 PIKDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQE 229

Query: 201 YVKQYGLESQADYPYRNKENITFRCTYEKEKAKV-----FVQDTWVTSGVDHMMHLLQSG 255
            +   G+E++ADYPY+  E +   CT +  K  V     F  D    + +  +++   +G
Sbjct: 230 LLLMGGVETEADYPYQGSEQM---CTLDNRKIAVKLNSCFKYDIRDENKLKELVY--TTG 284

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           P+ + ++   I +Y    + +    C+ + L+HAV ++G+G +N +  WI++NSWG+   
Sbjct: 285 PVAIAVDAMDIINYRRGILNQ----CHIYDLNHAVLLIGWGIENNVPYWIIKNSWGEDWG 340

Query: 316 DHGYFQIERGANACGI 331
           ++G+ ++ R  NACG+
Sbjct: 341 ENGFLRVRRNVNACGL 356


>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
          Length = 476

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 99/309 (32%), Positives = 159/309 (51%), Gaps = 24/309 (7%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSD 85
           +S++ +  FK ++ K+N+ Y+   E   R + FK++ K  ++          YG +  SD
Sbjct: 170 ESVELLGLFKEFMTKYNKVYSSQEEADRRLQIFKENLKTAEKIQSLDEGSAEYGVTKFSD 229

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            + +E       RLT       +    R  K  +   + P P S DWR      ++PV++
Sbjct: 230 LTEEE------FRLTYLNPLLSQWTLRRPMKPASP-ARSPAPASWDWRDHGA--VSPVKN 280

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+ T  +E Q  L    L  LS+ +LV+CD  +  C GG    A+E ++  
Sbjct: 281 QGLCGSCWAFSVTGNIEGQWFLKHGKLLSLSEQELVDCDGLDHACRGGLPSNAYEAIEGL 340

Query: 206 G-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN 262
           G LE++ DY Y   +    +C++  EK   ++  +      ++ M   L ++GP+ V LN
Sbjct: 341 GGLEAENDYTYSGHKQ---KCSFATEKVAAYINSSVELPSDENEMAAWLAENGPVSVALN 397

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
              ++ Y           CNP  +DHAV +VGYGE+NGI  W ++NSWG+   + GY+ +
Sbjct: 398 AFAMQFYKKGVSHPWMILCNPWMIDHAVLLVGYGERNGIPFWAIKNSWGEDYGEEGYYYL 457

Query: 323 ERGANACGI 331
            +G+NACGI
Sbjct: 458 YKGSNACGI 466


>gi|85068712|gb|ABC69436.1| cysteine protease [Clonorchis sinensis]
          Length = 328

 Score =  150 bits (380), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 109/321 (33%), Positives = 157/321 (48%), Gaps = 32/321 (9%)

Query: 28  VWRDLAYDSIKQVD----AFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKET 74
           +W  LA  +  + D     ++ + +K+ +TY++D++ + RFE FK         Q+ ++ 
Sbjct: 13  IWSALARTTQVEPDNARALYEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQG 71

Query: 75  DEYYGTSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWR 133
              YG +  SD + +E   R   +R  G       +  E V    NE+         DWR
Sbjct: 72  TAQYGVTQFSDLTSEEFKTRYLRMRFDGPIVSEDPSPEEDVT-MDNEK--------FDWR 122

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
           +     + PV  QG+CGSCWAF+    +E Q       L  LS+ QLV+CDH    CNGG
Sbjct: 123 EHGA--VGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDHLEKGCNGG 180

Query: 194 NIDVAF-EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--H 250
                + E  K  GLE  +DYPY   + I   C   + K   +V D+ V    + +    
Sbjct: 181 YPPKTYGEIEKMGGLELASDYPYTGVDGI---CYMNQSKFVAYVNDSTVLPLSEKIQAQK 237

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L + GP+   LN  L++ Y G  I    + CNPH L+HAV  VGYG + GI  WIV+NS 
Sbjct: 238 LKEIGPLSSALNAVLLQFYLGGIIFPIPFLCNPHGLNHAVLTVGYGTEFGIPYWIVKNSL 297

Query: 311 GDIGPDHGYFQIERGANACGI 331
           G    + GYF+I RGA  CGI
Sbjct: 298 GVGFGEKGYFRIFRGAGTCGI 318


>gi|42407296|dbj|BAD10859.1| cysteine protease [Aster tripolium]
          Length = 363

 Score =  150 bits (380), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 100/318 (31%), Positives = 158/318 (49%), Gaps = 41/318 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           FK +  K+ RTY  + E + R   FK + +    +        +G +  SD +P E  ++
Sbjct: 50  FKLFKNKFGRTYDTEEEHEYRLTVFKSNLRRAKRHQVLDPTAKHGVTKFSDLTPSEFRKK 109

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
               L  K K +L AD  +            LP+  DWR      + PV++QG CGSCW+
Sbjct: 110 ---YLGLKSKLKLPADANKAPILPTSN----LPQDFDWRDKGA--VTPVKNQGSCGSCWS 160

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-KQ 204
           F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+ K 
Sbjct: 161 FSTTGALEGSHFLQTGELVSLSEQQLVDCDHECDPAEYNSCDSGCNGGLMNNAFEYILKA 220

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            GL+ +ADYPY  ++     C ++K K    V +  V S  +  +  +L+ +GP+ + +N
Sbjct: 221 GGLQKEADYPYTGRDGT---CKFDKSKIAASVANFSVVSTDEDQIAANLVTNGPLAIGIN 277

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y G       + C+  K+DH V +VGYG              WI++NSWG+   
Sbjct: 278 AAWMQTYIGQ--VSCPYICSKTKMDHGVLLVGYGSAGYAPLRFKEKPYWIIKNSWGEDWG 335

Query: 316 DHGYFQIERGANACGIES 333
           + GY+++  G NACG+++
Sbjct: 336 EDGYYKLCSGYNACGMDT 353


>gi|13124026|sp|Q9WGE0.1|CATV_NPVHC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|4884631|gb|AAD31760.1|AF120926_1 cysteine proteinase [Hyphantria cunea nucleopolyhedrovirus]
          Length = 324

 Score =  150 bits (379), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 105/317 (33%), Positives = 167/317 (52%), Gaps = 25/317 (7%)

Query: 28  VWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKE--------TDEYYG 79
           V    AYD +K    F+ ++ K+N+ Y+ ++E   RF+ F+ + +E        T   Y 
Sbjct: 13  VAHSAAYDLLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDTTAQYE 72

Query: 80  TSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVK 138
            +  SD S  E + + TGL L  + +   E     V        KGPL    DWR  ++ 
Sbjct: 73  INKFSDLSKDETISKYTGLALPLQTQNFCE-----VVVLNRPPDKGPL--EFDWR--RLN 123

Query: 139 VLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVA 198
            +  V++QG CG+CWAFAT A LESQ A+    L  LS+ QL++CD+ +  CNGG +  A
Sbjct: 124 KVTSVKNQGICGACWAFATLASLESQFAIKHNQLINLSEQQLIDCDYVDAGCNGGLLHTA 183

Query: 199 FEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQ-SGP 256
           +E V Q  G++++ DYPY   +    R    K   KV     ++    + +  LL+  GP
Sbjct: 184 YEAVMQMGGVQAENDYPYEGSDG-NCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGP 242

Query: 257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPD 316
           I V ++   I +Y    +R     C+ +  +HAV +VGYG +N +  WI++N+WG+   +
Sbjct: 243 IPVAIDASDIVNYRRGIMRY----CSNYGFNHAVLLVGYGVENNVPYWILKNTWGEDWGE 298

Query: 317 HGYFQIERGANACGIES 333
            GYF++++  NACGI +
Sbjct: 299 QGYFRVQQNINACGIRN 315


>gi|15320768|ref|NP_203280.1| V-CATH [Epiphyas postvittana NPV]
 gi|37077652|sp|Q91GE3.1|CATV_NPVEP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|15213236|gb|AAK85675.1| V-CATH [Epiphyas postvittana NPV]
          Length = 323

 Score =  150 bits (379), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 165/313 (52%), Gaps = 28/313 (8%)

Query: 33  AYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD-------GKETDEYYGTSGSSD 85
           AYD +K  + F+ ++ ++N+ Y  + E   R++ F+ +        +     Y  +  SD
Sbjct: 18  AYDILKAPNYFEEFVRQYNKQYDSEYEKLRRYKIFQHNLNDIITKNRNDTAVYKINKFSD 77

Query: 86  RSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVE 144
            S  E + + TGL L    +   E         + +R  G  P   DWR  +   +  V+
Sbjct: 78  LSKDETIAKYTGLSLPLHTQNFCEV-------VVLDRPPGKGPLEFDWR--RFNKITSVK 128

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFE-YVK 203
           +QG CG+CWAFAT A LESQ A+    L  LS+ Q+++CD  ++ C GG +  AFE  + 
Sbjct: 129 NQGMCGACWAFATLASLESQFAIAHDRLINLSEQQMIDCDSVDVGCEGGLLHTAFEAIIS 188

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ--DTWVTSGVDHMMHLLQ-SGPIGVY 260
             G++ + DYPY +  N    C  +  K  V V+  + ++T   + +  +L+ +GPI V 
Sbjct: 189 MGGVQIENDYPYESSNN---YCRMDPTKFVVGVKQCNRYITIYEEKLKDVLRLAGPIPVA 245

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           ++   I +Y+   I+     C  + L+HAV +VGYG +N +  WI++NSWG    + G+F
Sbjct: 246 IDASDILNYEQGIIKY----CANNGLNHAVLLVGYGVENNVPYWILKNSWGTDWGEQGFF 301

Query: 321 QIERGANACGIES 333
           +I++  NACGI++
Sbjct: 302 KIQQNVNACGIKN 314


>gi|195111686|ref|XP_002000409.1| GI10216 [Drosophila mojavensis]
 gi|193917003|gb|EDW15870.1| GI10216 [Drosophila mojavensis]
          Length = 605

 Score =  150 bits (379), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 104/320 (32%), Positives = 156/320 (48%), Gaps = 31/320 (9%)

Query: 36  SIKQVD-AFKTYIVKWNRTYTDDNEIKTRFEYFKQ---------DGKETDEYYGTSGSSD 85
           S+ +VD  F  + +K+ R Y +  E + R   F+Q         D ++    YG +  +D
Sbjct: 291 SLNKVDHLFHVFQIKYKRRYANSMEHQMRLRIFRQNLRTIQELNDNEQGSAKYGITEFAD 350

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            +  E  QR GL       +R        K  +    KG LPK  DWR+     +  V++
Sbjct: 351 MTSSEYTQRAGLW------QRSANKPTGGKPAVVPAYKGELPKEFDWREKNA--VTQVKN 402

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+ T  +E   A+    L   S+ +L++CD  +  CNGG +D A++ +K  
Sbjct: 403 QGSCGSCWAFSVTGNIEGLYAIKTGELREFSEQELLDCDSTDSACNGGLMDNAYKAIKDI 462

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGVYL 261
            GLE +++YPY  K+    +C + K  + V V D   +  G +  M   LL +GPI + L
Sbjct: 463 GGLEYESEYPYLAKKK---QCHFNKTLSHVQVADFVDLPKGNETAMQEWLLANGPISIGL 519

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN------GILTWIVRNSWGDIGP 315
           N   ++ Y G         C+   LDH V IVGYG  +       +  WIV+NSWG    
Sbjct: 520 NANAMQFYRGGVSHPWGPLCSKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWG 579

Query: 316 DHGYFQIERGANACGIESYA 335
           + GY++I RG N CG+   A
Sbjct: 580 EQGYYRIYRGDNTCGVSEMA 599


>gi|195343593|ref|XP_002038380.1| GM10654 [Drosophila sechellia]
 gi|194133401|gb|EDW54917.1| GM10654 [Drosophila sechellia]
          Length = 615

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 102/322 (31%), Positives = 158/322 (49%), Gaps = 32/322 (9%)

Query: 33  AYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGS 83
           A+D +  +  F  + V++ R Y    E + R   F+Q+ K  +E          YG +  
Sbjct: 301 AFDKVDHL--FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEF 358

Query: 84  SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           +D +  E  +RTGL       +R EA        +     G LPK  DWRQ     +  V
Sbjct: 359 ADMTSSEYKERTGLW------QRDEAKATGGSAAVVPAYHGELPKEFDWRQKDA--VTQV 410

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           ++QG CGSCWAF+ T  +E   A+    L   S+ +L++CD  +  CNGG +D A++ +K
Sbjct: 411 KNQGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIK 470

Query: 204 QY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGV 259
              GLE +A+YPY+ K+N   +C + +  + V V     +  G +  M   LL +GPI +
Sbjct: 471 DIGGLEYEAEYPYKAKKN---QCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGPISI 527

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN------GILTWIVRNSWGDI 313
            +N   ++ Y G         C+   LDH V +VGYG  +       +  WIV+NSWG  
Sbjct: 528 GINANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPR 587

Query: 314 GPDHGYFQIERGANACGIESYA 335
             + GY+++ RG N CG+   A
Sbjct: 588 WGEQGYYRVYRGDNTCGVSEMA 609


>gi|302774134|ref|XP_002970484.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
 gi|300162000|gb|EFJ28614.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
          Length = 343

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 97/321 (30%), Positives = 167/321 (52%), Gaps = 47/321 (14%)

Query: 37  IKQVDA-FKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTSGSSDRS 87
           +K V+  FK ++ K+ + Y    E   R + F+ +         ++    +G +  +D +
Sbjct: 39  VKDVEGHFKHFMQKFGKVYGTTEEYVHRLKVFQANLAHVMSLKKQDPTAIHGITSFADLT 98

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVE 144
           P+E+ +  G R         +A   RV   +N+    P   LP++ DWR+     + PV+
Sbjct: 99  PEELSRFLGFR---------KAYSNRV---VNQAPLLPTDNLPEAFDWREHGA--VTPVK 144

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ 204
            QGRCGSCW F+TT ++E    L    L  LS+ QL++CD+ +  C GG++  A+EYVK 
Sbjct: 145 FQGRCGSCWTFSTTGVVEGANFLKTGKLISLSEEQLIDCDYKDNGCEGGDMLSAYEYVKA 204

Query: 205 YGLESQADYPYRNKENITFR-------CTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSG 255
            GLE++ DYPY   E + +R       C Y+  K    + + + V+   D +  +L+++G
Sbjct: 205 RGLEAEEDYPY---EELGYRHKPVRGPCRYQPSKVVATIANYSRVSEDEDQIAANLVKNG 261

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACN---PHKLDHAVAIVGYGEKNGILTWIVRNSWGD 312
           P+ + L   ++ +Y+G        AC    P +++H V +VGYG +NG+  W  +N+W D
Sbjct: 262 PLSIALRGNVLFTYEGG------VACPRICPGEINHGVLLVGYGVENGLRYWTFKNTWTD 315

Query: 313 IGPDHGYFQIERGANACGIES 333
              ++GYF++ RG   C + S
Sbjct: 316 EFGENGYFRLCRGVGVCDMNS 336


>gi|9630063|ref|NP_046281.1| cathepsin [Orgyia pseudotsugata MNPV]
 gi|2499880|sp|O10364.1|CATV_NPVOP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|7435821|pir||T10394 cathepsin - Orgyia pseudotsugata nuclear polyhedrosis virus
 gi|1911371|gb|AAC59124.1| cathepsin [Orgyia pseudotsugata MNPV]
          Length = 324

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 163/313 (52%), Gaps = 29/313 (9%)

Query: 34  YDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKE--------TDEYYGTSGSSD 85
           YD +K  + F+ ++ K+N+ Y+ ++E   RF+ F+ + +E        +   Y  +  SD
Sbjct: 19  YDLLKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKNQNDSTAQYEINKFSD 78

Query: 86  RSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVE 144
            S +E + + TGL L  + +   E         + +R     P   DWRQ     +  V+
Sbjct: 79  LSKEEAISKYTGLSLPHQTQNFCEV-------VILDRPPDRGPLEFDWRQ--FNKVTSVK 129

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ 204
           +QG CG+CWAFAT   LESQ A+    L  LS+ Q ++CD  N  C+GG +  AFE   +
Sbjct: 130 NQGVCGACWAFATLGSLESQFAIKYNRLINLSEQQFIDCDRVNAGCDGGLLHTAFESAME 189

Query: 205 Y-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQS-GPIGVY 260
             G++ ++DYPY   E    +C     +  V V+    ++    + +  LL++ GPI V 
Sbjct: 190 MGGVQMESDYPY---ETANGQCRINPNRFVVGVRSCRRYIVMFEEKLKDLLRAVGPIPVA 246

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           ++   I +Y    +R+    C  H L+HAV +VGY  +N I  WI++N+WG    + GYF
Sbjct: 247 IDASDIVNYRRGIMRQ----CANHGLNHAVLLVGYAVENNIPYWILKNTWGTDWGEDGYF 302

Query: 321 QIERGANACGIES 333
           ++++  NACGI +
Sbjct: 303 RVQQNINACGIRN 315


>gi|37788267|gb|AAO64473.1| cathepsin H precursor [Fundulus heteroclitus]
          Length = 345

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 104/320 (32%), Positives = 156/320 (48%), Gaps = 37/320 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           FK+++ ++N+ Y D NE   R + F ++ +  D++   +  +      + Q +G+     
Sbjct: 27  FKSWMAQYNKAY-DFNEYYRRLQIFTENKRRIDKH---NEGNHSFTMGLNQFSGMTFNEF 82

Query: 103 EKERLEADRER------------VKKF----LNERKK------GPLPKSLDWRQSKVKVL 140
            K  L ++ +             + +F     NE +K      GP P S+DWR+ K   +
Sbjct: 83  RKAFLMSEPQNCSATKGNYLSSNLNQFSGMTFNEFRKAFLMSEGPQPDSIDWRK-KGNYI 141

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVA 198
            PV++QG CGSCW F+TT  LES  A+    L PLS+ QLV+C  D  N  CNGG    A
Sbjct: 142 TPVKTQGSCGSCWTFSTTGCLESVTAIATVKLVPLSEQQLVDCAQDFNNHGCNGGLPSQA 201

Query: 199 FEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPI 257
           FEY+    GL ++ DYPY+  E I   C+Y+   A  FV++    +  D M  +   G +
Sbjct: 202 FEYIMYNKGLMTEQDYPYKFVEGI---CSYKPSLAAAFVKEVRNITAYDEMGMVDAVGTL 258

Query: 258 G-VYLNHRLIES---YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDI 313
             V     + +    Y               K++HAV  VGYG++ G   WIV+NSWG  
Sbjct: 259 NPVSFAFEVTDDFMHYREGVYTSTTCHNTTDKVNHAVLAVGYGQEKGTPYWIVKNSWGSS 318

Query: 314 GPDHGYFQIERGANACGIES 333
               GYF IERG N CG+ +
Sbjct: 319 WGIDGYFLIERGKNMCGLAA 338


>gi|302793594|ref|XP_002978562.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
 gi|300153911|gb|EFJ20548.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
          Length = 343

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 94/311 (30%), Positives = 158/311 (50%), Gaps = 40/311 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTSGSSDRSPQEILQR 94
           FK ++ K+ + Y    E   R + F+ +         ++    +G +  +D +P+E+ + 
Sbjct: 46  FKHFMQKFGKVYGTTEEYVHRLKVFQANLVHVMSLKKQDPTAIHGITSFADLTPEELSRF 105

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
            G R         +A   RV           LP++ DWR+     + PV+ QGRCGSCW 
Sbjct: 106 LGFR---------KAYSNRVVNQAPLLPTDNLPEAFDWREHGA--VTPVKFQGRCGSCWT 154

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYP 214
           F+TT ++E    L    L  LS+ QL++CD+ +  C GG++  A+EYVK  GLE+  DYP
Sbjct: 155 FSTTGVVEGANFLKTGKLISLSEEQLIDCDYKDNGCEGGDMLSAYEYVKARGLEADEDYP 214

Query: 215 YRNKENITFR-------CTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRL 265
           Y   E + +R       C Y+  K    + + + V+   D +  +L+++GP+ + L   +
Sbjct: 215 Y---EELGYRHKPVRGPCRYQPSKVVATIANYSRVSEDEDQIAANLVKNGPLSIALRGNV 271

Query: 266 IESYDGNPIRRNDWACN---PHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
           + +Y+G        AC    P +++H V +VGYG +NG+  W  +NSW D   ++GYF++
Sbjct: 272 LFTYEGG------VACPRICPGEINHGVLLVGYGVENGLRYWTFKNSWTDEFGENGYFRL 325

Query: 323 ERGANACGIES 333
            RG   C + S
Sbjct: 326 CRGVGVCDMTS 336


>gi|196014793|ref|XP_002117255.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
 gi|190580220|gb|EDV20305.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
          Length = 353

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 104/328 (31%), Positives = 160/328 (48%), Gaps = 30/328 (9%)

Query: 30  RDLAYDSIKQVDA-FKTY---IVKWNRTYTDDNEIKTRFEYFKQDGKETDEY-------- 77
           R  + D+    D  FK Y   I ++N++Y +  E+  R++ F ++      +        
Sbjct: 38  RRFSQDTATHHDPMFKNYLQFIKEYNKSYNNIQELNYRYQVFTKNMARAMLFQKHDNATG 97

Query: 78  -YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSK 136
            YG +  SD + QE+     ++   K  ++L   ++     LN      LP+S DWR   
Sbjct: 98  RYGFTKLSDLTDQEVKSFYAMK---KWPQQLYPTKKANIPQLNS-----LPQSFDWRSKG 149

Query: 137 VKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNID 196
              +  V+ Q RCG+CWAFATT  +E Q  L K  LY LS+ +LV+CD  +  C GG   
Sbjct: 150 A--VTAVKDQKRCGACWAFATTGNIEGQWYLNKGKLYSLSEQELVDCDKIDEGCKGGLPL 207

Query: 197 VAFEYV--KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--VTSGVDHMMHLL 252
            A+  +  +  GLE++ DYPY  K     +C   K +  V++  +    T+  D    L+
Sbjct: 208 NAYHSIMNRLGGLETEKDYPYVAKNG---KCKLNKSEEVVYINSSVKVSTNETDLAAWLV 264

Query: 253 QSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGD 312
             GP+ + +N   +  Y G      +  CNP  LDH V IVGYGE+     WI++NSWG 
Sbjct: 265 AHGPVAIGINSVNMLHYKGGIAHPTNKDCNPKLLDHGVLIVGYGEEKSTPYWIIKNSWGT 324

Query: 313 IGPDHGYFQIERGANACGIESYAYLASV 340
              + GY+++ RG  ACG+   A  A V
Sbjct: 325 DWGEKGYYRVVRGIGACGLNKSATSAIV 352


>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
 gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
          Length = 2676

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 103/355 (29%), Positives = 170/355 (47%), Gaps = 49/355 (13%)

Query: 6    CDHQETNTEQVTYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDD-NEIKTRF 64
            CD+ E  T +V +++  +   Y                  ++  +   Y DD ++++ RF
Sbjct: 2351 CDYHEAATAEVYHHLQAEHLFY-----------------EFLSTYKPEYIDDRHQMRQRF 2393

Query: 65   EYFKQDGKETDEY---------YGTSGSSDRSPQEI-LQRTGLRLTGKEKERLEADRERV 114
            E FK++ ++  E          YG +  +D + +E   +  G++ + ++  +++  +  +
Sbjct: 2394 EIFKENVRKMHELNTHERGTATYGVTRFADLTYEEFSTKHMGMKASLRDPNQVQFRKAVI 2453

Query: 115  KKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYP 174
                        P S DWR      +  V+ QG CGSCWAF+ T  +E Q  +    L  
Sbjct: 2454 PNVT-------APDSFDWRDHGA--VTGVKDQGSCGSCWAFSVTGNIEGQWKMKTGDLVS 2504

Query: 175  LSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAK 233
            LS+ +LV+CD  +  CNGG  D A+  ++Q  GLES+ DYPY   ++   +C++ K  A+
Sbjct: 2505 LSEQELVDCDKLDQGCNGGLPDNAYRAIEQLGGLESEDDYPYEGSDD---KCSFNKTLAR 2561

Query: 234  VFVQDTW-VTSGVDHMMH-LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVA 291
            V +     +TS    M   L++ GPI + +N   ++ Y G         CNP  LDH V 
Sbjct: 2562 VQISGAVNITSNETDMAKWLVKHGPISIGINANAMQFYMGGISHPWRMLCNPSNLDHGVL 2621

Query: 292  IVGYGEKNGILT------WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
            IVGYG K+  L       WI++NSWG    + GY+++ RG   CG+   A  A V
Sbjct: 2622 IVGYGAKDYPLFHKHLPYWIIKNSWGTSWGEQGYYRVYRGDGTCGVNQMASSAVV 2676


>gi|417401303|gb|JAA47542.1| Putative cathepsin f [Desmodus rotundus]
          Length = 459

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 103/317 (32%), Positives = 156/317 (49%), Gaps = 25/317 (7%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K V  FK +I  +NRTY  + E + R   F  +     E          YG +  SD 
Sbjct: 155 SMKMVSLFKHFIATYNRTYETEEEAQWRMSIFINNMVRAQEIQALDRGTAQYGVTKFSDL 214

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E   RT   L    KE L   + R+ K +++    P P   DWR      +  V++Q
Sbjct: 215 TEEEF--RT-FYLNPLLKEGL-GKKMRLAKPVDD----PAPPEWDWRNKGA--VTKVKNQ 264

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L +  L  LS+ +LV+CD  +  C GG    A+  +K  G
Sbjct: 265 GMCGSCWAFSVTGNVEGQWFLKQGDLLSLSEQELVDCDTLDKACMGGLPSNAYSAIKTLG 324

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y         C++  EK KV++ D+   S  +  +   L + GPI + +N 
Sbjct: 325 GLETEDDYSYHGHLQT---CSFTAEKVKVYINDSVELSKDEQKLAAWLAKKGPISIAINA 381

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
             ++ Y     R     C+P  +DHAV +VGYG ++ +  W ++NSWG    + GY+ + 
Sbjct: 382 FGMQFYRRGISRPLRLLCSPWFIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEEGYYYLH 441

Query: 324 RGANACGIESYAYLASV 340
           RG+ ACG+   A  A V
Sbjct: 442 RGSRACGVNVMASSAVV 458


>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
           kowalevskii]
          Length = 352

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 99/316 (31%), Positives = 152/316 (48%), Gaps = 24/316 (7%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSP 88
           K  D F+ ++  +++ Y  + E + R++ F+         Q  ++    YG +   D S 
Sbjct: 49  KTQDLFQDFMKTYDKKYDTEEEHQLRYQIFQDNLLKAERLQQTEQATGQYGVTKFMDLSE 108

Query: 89  QEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           +E  +     LT             +KK   E  KG  P + DWR +    +  V++QG 
Sbjct: 109 EEFRK---YYLT----PVWRGSDPHMKK--AEIPKGTPPAAFDWRDADKNAVTKVKNQGT 159

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GL 207
           CGSCWAF+TT  +E Q  + K TL  LS+ +LV+CD  +  CNGG    A++ + ++ G+
Sbjct: 160 CGSCWAFSTTGNIEGQWKIKKGTLVSLSEQELVDCDKLDQGCNGGLPSNAYQEIMRFGGI 219

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRL 265
            S+ DYPY  ++     C       KV++  +   S  +  M   L  +GPI + +N   
Sbjct: 220 MSEDDYPYTGRDQ---DCKLNATLNKVYINGSMNISKDEGDMASWLAANGPISIGINANA 276

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
           ++ Y G         CNP  LDH V IVGYG K+G   WI++NSWG      GY+ + RG
Sbjct: 277 MQFYFGGVSHPWKIFCNPENLDHGVLIVGYGTKDGTPYWIIKNSWGRSWGVEGYYLVYRG 336

Query: 326 ANACGIESYAYLASVK 341
              CG+      A VK
Sbjct: 337 GGVCGLNEMCTSAIVK 352


>gi|324522685|gb|ADY48108.1| Cathepsin L, partial [Ascaris suum]
          Length = 308

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 100/314 (31%), Positives = 155/314 (49%), Gaps = 24/314 (7%)

Query: 40  VDAFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSPQE 90
           VD F   I ++NRTY++  E+  RF  +K         Q  ++    YG +  SD +  E
Sbjct: 7   VDGF---IGRYNRTYSNKKEMLKRFRIYKRNLRAAKIWQANEQGTAIYGETQFSDLTQAE 63

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
              R  +     E  ++       K+F     +  +P+S DWR+     +  V++QG CG
Sbjct: 64  F--RKIMLPYKWETPKVPNKMANFKEF--GIAQNDIPESFDWREKNA--VTEVKNQGSCG 117

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLES 209
           SCWAF+ T  +E   A+    L  LS+ +LV+CD  +  CNGG    A+ E ++  GLE+
Sbjct: 118 SCWAFSVTGNIEGAWAIKTSKLVSLSEQELVDCDIIDQGCNGGLPSNAYREIIRMGGLEA 177

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIE 267
           ++DYPY  +     +C   K+   V++ D+      +  M   L+  GPI + LN   ++
Sbjct: 178 ESDYPYDGRGE---KCHLMKKDIAVYINDSLQLPHDEEKMAAWLVAKGPISIGLNANPLQ 234

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            Y           C+P  LDH V IVGYG +     WI++NSWG    + GYF++ RG N
Sbjct: 235 FYRHGIAHPWRVFCSPKHLDHGVLIVGYGSETDKPYWIIKNSWGTKWGEEGYFRLFRGKN 294

Query: 328 ACGIESYAYLASVK 341
            CGI+  A  A ++
Sbjct: 295 VCGIQEMATTAIIE 308


>gi|393717160|gb|AFN21082.1| V-Cath [Bombyx mori NPV]
 gi|393717442|gb|AFN21362.1| V-Cath [Bombyx mori NPV]
          Length = 323

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 99/318 (31%), Positives = 162/318 (50%), Gaps = 28/318 (8%)

Query: 28  VWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET-------DEYYGT 80
           V +  AYD +K  + F+ ++ ++N+ Y+ + E   RF+ F+ +  E           Y  
Sbjct: 13  VVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQNDSAKYEI 72

Query: 81  SGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKV 139
           +  SD S  E + + TGL L  +        +   K  L ++  G  P   DWR  ++  
Sbjct: 73  NKFSDLSKDETIAKYTGLSLPTQT-------QNFCKVILLDQPPGKGPLEFDWR--RLNK 123

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF 199
           +  V++QG CG+CWAFAT   LESQ A+    L  LS+ Q+++CD  +  CNGG +  AF
Sbjct: 124 VTSVKNQGMCGACWAFATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAF 183

Query: 200 E-YVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SG 255
           E  +K  G++ ++DYPY    N    C     K  V V+D   ++    + +  LL+  G
Sbjct: 184 EAIIKMGGVQLESDYPYEADNN---NCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVG 240

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           PI + ++   I +Y    I+     C    L+HAV +VGYG +N I  W  +N+WG    
Sbjct: 241 PIPMAIDAADIVNYKQGIIKY----CFDSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG 296

Query: 316 DHGYFQIERGANACGIES 333
           + G+F++++  NACG+ +
Sbjct: 297 EDGFFRVQQNINACGMRN 314


>gi|24644153|ref|NP_649521.1| CG12163, isoform B [Drosophila melanogaster]
 gi|23170426|gb|AAN13266.1| CG12163, isoform B [Drosophila melanogaster]
 gi|378548248|gb|AFC17498.1| FI18603p1 [Drosophila melanogaster]
          Length = 475

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 102/315 (32%), Positives = 152/315 (48%), Gaps = 30/315 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           F  + V++ R Y    E + R   F+Q+ K  +E          YG +  +D +  E  +
Sbjct: 169 FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKE 228

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           RTGL       +R EA        +     G LPK  DWRQ     +  V++QG CGSCW
Sbjct: 229 RTGLW------QRDEAKATGGSAAVVPAYHGELPKEFDWRQKDA--VTQVKNQGSCGSCW 280

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQAD 212
           AF+ T  +E   A+    L   S+ +L++CD  +  CNGG +D A++ +K   GLE +A+
Sbjct: 281 AFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAE 340

Query: 213 YPYRNKEN-ITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIESY 269
           YPY+ K+N   F  T    +   FV    +  G +  M   LL +GPI + +N   ++ Y
Sbjct: 341 YPYKAKKNQCHFNRTLSHVQVAGFVD---LPKGNETAMQEWLLANGPISIGINANAMQFY 397

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKN------GILTWIVRNSWGDIGPDHGYFQIE 323
            G         C+   LDH V +VGYG  +       +  WIV+NSWG    + GY+++ 
Sbjct: 398 RGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVY 457

Query: 324 RGANACGIESYAYLA 338
           RG N CG+   A  A
Sbjct: 458 RGDNTCGVSEMATSA 472


>gi|194746631|ref|XP_001955780.1| GF16067 [Drosophila ananassae]
 gi|190628817|gb|EDV44341.1| GF16067 [Drosophila ananassae]
          Length = 620

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/315 (32%), Positives = 154/315 (48%), Gaps = 30/315 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           F  + V++ R Y    E + R   F+Q+ K  +E          YG +  +D +  E  +
Sbjct: 314 FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSTEYKE 373

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           RTGL       +R EA        +     G LPK  DWR      +  V++QG+CGSCW
Sbjct: 374 RTGLW------QRDEAKATGGSPAVVPAYSGELPKEFDWRSKNA--VTGVKNQGQCGSCW 425

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQAD 212
           AF+ T  +E   AL    L   S+ +L++CD  +  CNGG +D A++ +K   GLE +A+
Sbjct: 426 AFSVTGNIEGLYALKYGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAE 485

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGVYLNHRLIESY 269
           YPY  K+    +C + K  + V V+D   +  G +  M   L+ +GPI + +N   ++ Y
Sbjct: 486 YPYEAKKK---QCHFNKTMSHVQVKDFVDLPKGNETAMQEWLVSNGPISIGINANAMQFY 542

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKN------GILTWIVRNSWGDIGPDHGYFQIE 323
            G         C+   LDH V +VGYG  +       +  WIV+NSWG    + GY+++ 
Sbjct: 543 RGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNYHKTLPYWIVKNSWGPRWGEQGYYRVY 602

Query: 324 RGANACGIESYAYLA 338
           RG N CG+   A  A
Sbjct: 603 RGDNTCGVSEMATSA 617


>gi|393660044|gb|AFN09033.1| V-Cath [Bombyx mori NPV]
          Length = 323

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 99/318 (31%), Positives = 162/318 (50%), Gaps = 28/318 (8%)

Query: 28  VWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET-------DEYYGT 80
           V +  AYD +K  + F+ ++ ++N+ Y+ + E   RF+ F+ +  E           Y  
Sbjct: 13  VVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQNDSAKYEI 72

Query: 81  SGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKV 139
           +  SD S  E + + TGL L  +        +   K  L ++  G  P   DWR  ++  
Sbjct: 73  NKFSDLSKDETIAKYTGLSLPTQT-------QNFCKVILLDQPPGKGPLEFDWR--RLNK 123

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF 199
           +  V++QG CG+CWAFAT   LESQ A+    L  LS+ Q+++CD  +  CNGG +  AF
Sbjct: 124 VTSVKNQGMCGACWAFATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAF 183

Query: 200 E-YVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SG 255
           E  +K  G++ ++DYPY    N    C     K  V V+D   ++    + +  LL+  G
Sbjct: 184 EAIIKMGGVQLESDYPYEADNN---NCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVG 240

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           PI + ++   I +Y    I+     C    L+HAV +VGYG +N I  W  +N+WG    
Sbjct: 241 PIPMAIDAADIVNYKQGIIKY----CFNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG 296

Query: 316 DHGYFQIERGANACGIES 333
           + G+F++++  NACG+ +
Sbjct: 297 EDGFFRVQQNINACGMRN 314


>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
          Length = 537

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 100/327 (30%), Positives = 161/327 (49%), Gaps = 31/327 (9%)

Query: 34  YDSIKQVDAFKTYIVKWNRTYTDDN-EIKTRFEYFKQDGKETDEY---------YGTSGS 83
           Y  ++    F  +I  +   Y +D+ E+  RFE FK++ K+  E          Y  +  
Sbjct: 222 YHHVQAEQLFFNFITTYKPEYINDHVEMTKRFEIFKENVKKIHELNTHERGTGVYAVTRF 281

Query: 84  SDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +D + +E   +  GL    K+  ++   +  + K         LP S DWR   +  +  
Sbjct: 282 TDLTYEEFKSKYLGLNPNLKKPNQIPMRQAEIPKVHQ------LPASFDWR--PLGAVTE 333

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV 202
           V+ QG CGSCWAF+ T  +E Q  L    L  LS+ +LV+CD  +  C+GG +D A+  +
Sbjct: 334 VKDQGACGSCWAFSVTGNIEGQWKLKTGKLLSLSEQELVDCDKMDDGCDGGYMDNAYRAI 393

Query: 203 KQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGV 259
           +Q  GLE++ +YPY  +++   +C++ K  +KV +      S  +  M   L+ +GPI +
Sbjct: 394 EQLGGLETEEEYPYEAEDD---KCSFNKSLSKVQISGAVNISSNETNMAKWLVHNGPISI 450

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT------WIVRNSWGDI 313
            +N   ++ Y G         CNP  +DH V IVGYG K   L       W+V+NSWG  
Sbjct: 451 GINANAMQFYVGGVSHPWKALCNPKNIDHGVLIVGYGIKEYPLFNKQLPYWVVKNSWGPG 510

Query: 314 GPDHGYFQIERGANACGIESYAYLASV 340
             + GY+++ RG   CG+ + A  A V
Sbjct: 511 WGEQGYYRVFRGDGTCGVNTMASSAVV 537


>gi|124484383|dbj|BAF46302.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 369

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 105/322 (32%), Positives = 154/322 (47%), Gaps = 43/322 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ ++Y    E   R   FK + +    +        +G +  SD +P+E  +R
Sbjct: 47  FTLFKSKYGKSYATQEEHDYRLSVFKANLRRAKRHQLLDPSAVHGVTKFSDLTPKE-FRR 105

Query: 95  TGLRL----TGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
           T L +    +GK K +L AD    +          LP   DWR      +  V+ QG CG
Sbjct: 106 TFLGIRKSSSGKRKLKLPADAHAAEIL----PTSDLPSDFDWRD--YGAVTGVKDQGSCG 159

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY 201
           SCW+F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG +  A+EY
Sbjct: 160 SCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHLCDPEEAGACDSGCNGGLMTTAYEY 219

Query: 202 VKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIG 258
           V Q  GLE + DYPY  K+     C ++K K    V +  V S  +  +  +L++ GP+ 
Sbjct: 220 VLQSGGLEKEKDYPYTGKDGT---CKFDKSKIAAAVANFSVVSLDEDQIAANLVKHGPLS 276

Query: 259 VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWG 311
           V +N   +++Y G       + C+   LDH V +VGYG              WIV+NSWG
Sbjct: 277 VGINAVFMQTYIGG--VSCPYICSKRNLDHGVLLVGYGAAGYAPIRFKDKPYWIVKNSWG 334

Query: 312 DIGPDHGYFQIERGANACGIES 333
           +   + GY++I RG N CGI+S
Sbjct: 335 ENWGEEGYYKICRGNNICGIDS 356


>gi|24644155|ref|NP_730901.1| CG12163, isoform A [Drosophila melanogaster]
 gi|32699625|sp|Q9VN93.2|CPR1_DROME RecName: Full=Putative cysteine proteinase CG12163; Flags:
           Precursor
 gi|23170427|gb|AAF52055.2| CG12163, isoform A [Drosophila melanogaster]
 gi|27819876|gb|AAO24986.1| LP08529p [Drosophila melanogaster]
          Length = 614

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 100/312 (32%), Positives = 153/312 (49%), Gaps = 30/312 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           F  + V++ R Y    E + R   F+Q+ K  +E          YG +  +D +  E  +
Sbjct: 308 FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKE 367

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           RTGL       +R EA        +     G LPK  DWRQ     +  V++QG CGSCW
Sbjct: 368 RTGLW------QRDEAKATGGSAAVVPAYHGELPKEFDWRQKDA--VTQVKNQGSCGSCW 419

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQAD 212
           AF+ T  +E   A+    L   S+ +L++CD  +  CNGG +D A++ +K   GLE +A+
Sbjct: 420 AFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAE 479

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGVYLNHRLIESY 269
           YPY+ K+N   +C + +  + V V     +  G +  M   LL +GPI + +N   ++ Y
Sbjct: 480 YPYKAKKN---QCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANAMQFY 536

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKN------GILTWIVRNSWGDIGPDHGYFQIE 323
            G         C+   LDH V +VGYG  +       +  WIV+NSWG    + GY+++ 
Sbjct: 537 RGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVY 596

Query: 324 RGANACGIESYA 335
           RG N CG+   A
Sbjct: 597 RGDNTCGVSEMA 608


>gi|443696723|gb|ELT97360.1| hypothetical protein CAPTEDRAFT_147978 [Capitella teleta]
          Length = 274

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 84/221 (38%), Positives = 125/221 (56%), Gaps = 12/221 (5%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P + DWR      + PV++QG CGSCWAF+ T  +E Q A+ KK L  LS+ +LV+CD  
Sbjct: 58  PDAFDWRDHDA--VTPVKNQGSCGSCWAFSVTGNVEGQWAIQKKKLLSLSEQELVDCDKV 115

Query: 187 NLNCNGGNIDVAF-EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSG 244
           +L CNGG    A+ E ++  GLE++ DYPY  K +   +C +EK + +V +     ++S 
Sbjct: 116 DLGCNGGLPLQAYKEIMRIGGLETEKDYPYEGKGD---KCVFEKAEVEVNITGAVNISSN 172

Query: 245 VDHM-MHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT 303
            D M   L ++GPI + LN   ++ Y G       + C+P  LDH V I GYG K G ++
Sbjct: 173 EDDMKAWLWKNGPISIGLNANAMQFYMGGVSHPFSFLCSPSSLDHGVLITGYGIKQGWMS 232

Query: 304 ----WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
               W ++NSWG+   + GY+ + RGA  CG+      A+V
Sbjct: 233 DSPFWAIKNSWGESWGEKGYYLLYRGAGVCGVNQMPTSATV 273


>gi|397133545|gb|AFO10079.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus S2]
          Length = 323

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 100/318 (31%), Positives = 163/318 (51%), Gaps = 28/318 (8%)

Query: 28  VWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET-------DEYYGT 80
           V +  AYD +K  + F+ ++ ++N+ Y  + E   RF+ F+ +  E           Y  
Sbjct: 13  VVKSAAYDLLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIINKDQNDSAKYEI 72

Query: 81  SGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKV 139
           +  SD S  E + + TGL L  +        +   K  + ++  G  P   DWR  ++  
Sbjct: 73  NKFSDLSKDETIAKYTGLSLPIQT-------QNFCKVIVLDQPPGKGPLEFDWR--RLNK 123

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF 199
           +  V++QG CG+CWAFAT A LESQ A+    L  LS+ Q+++CD  +  CNGG +  AF
Sbjct: 124 VTSVKNQGMCGACWAFATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAF 183

Query: 200 E-YVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SG 255
           E  +K  G++ ++DYPY    N    C     K  V V+D   ++T   + +  LL+  G
Sbjct: 184 EAIIKMGGVQLESDYPYEADNN---NCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVG 240

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           PI + ++   I +Y    I+     C    L+HAV +VGYG +N I  W  +N+WG    
Sbjct: 241 PIPMAIDAADIVNYKQGIIKY----CFNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG 296

Query: 316 DHGYFQIERGANACGIES 333
           + G+F++++  NACG+ +
Sbjct: 297 EDGFFRVQQNINACGMRN 314


>gi|393717301|gb|AFN21222.1| V-Cath [Bombyx mori NPV]
          Length = 323

 Score =  148 bits (374), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 162/318 (50%), Gaps = 28/318 (8%)

Query: 28  VWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET-------DEYYGT 80
           V +  AYD +K  + F+ ++ ++N+ Y+ + E   RF+ F+ +  E           Y  
Sbjct: 13  VVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQNDSAKYEI 72

Query: 81  SGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKV 139
           +  SD S  E + + TGL L  +        +   K  L ++  G  P   DWR  ++  
Sbjct: 73  NKFSDLSKDETIAKYTGLSLPTQT-------QNFCKVILLDQPPGKGPLEFDWR--RLNK 123

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF 199
           +  V++QG CG+CWAFAT   LESQ A+    L  LS+ Q+++CD  +  CNGG +  AF
Sbjct: 124 VTSVKNQGMCGACWAFATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAF 183

Query: 200 E-YVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SG 255
           E  +K  G++ ++DYPY    N    C     K  V V+D   ++    + +  LL+  G
Sbjct: 184 EAIIKMGGVQLESDYPYEADNN---NCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVG 240

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           PI + ++   I +Y    I+     C    L+HAV +VGYG +N +  W  +N+WG    
Sbjct: 241 PIPMAIDAADIVNYKQGIIKY----CFDSGLNHAVLLVGYGVENNVPYWTFKNTWGTDWG 296

Query: 316 DHGYFQIERGANACGIES 333
           + G+F++++  NACG+ +
Sbjct: 297 EDGFFRVQQNINACGMRN 314


>gi|203341|gb|AAA63484.1| cathepsin H [Rattus norvegicus]
          Length = 298

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 87/223 (39%), Positives = 124/223 (55%), Gaps = 13/223 (5%)

Query: 121 RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQL 180
           R  GP P S+DWR+ K  V++PV++QG CGSCW F+TT  LES VA+    +  L++ QL
Sbjct: 74  RGTGPYPSSMDWRK-KGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQL 132

Query: 181 VEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           V+C  +  N  C GG    AFEY+    G+  +  YPY  K     +C +  EKA  FV+
Sbjct: 133 VDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNG---QCKFNPEKAVAFVK 189

Query: 238 DTWVTSGVDHMMHLLQSGPI--GVYLNHRLIES---YDGNPIRRNDWACNPHKLDHAVAI 292
           +  V   ++    ++++  +   V     + E    Y       N     P K++HAV  
Sbjct: 190 NV-VNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLA 248

Query: 293 VGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           VGYGE+NG+L WIV+NSWG    ++GYF IERG N CG+ + A
Sbjct: 249 VGYGEQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAACA 291


>gi|9630927|ref|NP_047524.1| Cystein Protease [Bombyx mori NPV]
 gi|1168798|sp|P41721.1|CATV_NPVBM RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|540066|gb|AAB49542.1| cysteine protease [Bombyx mori NPV]
 gi|3745946|gb|AAC63793.1| Cystein Protease [Bombyx mori NPV]
          Length = 323

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 99/318 (31%), Positives = 161/318 (50%), Gaps = 28/318 (8%)

Query: 28  VWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET-------DEYYGT 80
           V +  AYD +K  + F+ ++ ++N+ Y+ + E   RF+ F+ +  E           Y  
Sbjct: 13  VVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQNDSAKYEI 72

Query: 81  SGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKV 139
           +  SD S  E + + TGL L  +        +   K  L ++  G  P   DWR  ++  
Sbjct: 73  NKFSDLSKDETIAKYTGLSLPTQT-------QNFCKVILLDQPPGKGPLEFDWR--RLNK 123

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF 199
           +  V++QG CG+CWAFAT   LESQ A+    L  LS+ Q+++CD  +  CNGG +  AF
Sbjct: 124 VTSVKNQGMCGACWAFATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAF 183

Query: 200 E-YVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SG 255
           E  +K  G++ ++DYPY    N    C     K  V V+D   ++    + +  LL   G
Sbjct: 184 EAIIKMGGVQLESDYPYEADNN---NCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLPLVG 240

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           PI + ++   I +Y    I+     C    L+HAV +VGYG +N I  W  +N+WG    
Sbjct: 241 PIPMAIDAADIVNYKQGIIKY----CFDSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG 296

Query: 316 DHGYFQIERGANACGIES 333
           + G+F++++  NACG+ +
Sbjct: 297 EDGFFRVQQNINACGMRN 314


>gi|198453932|ref|XP_002137768.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
 gi|198132577|gb|EDY68326.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
          Length = 629

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 103/323 (31%), Positives = 156/323 (48%), Gaps = 31/323 (9%)

Query: 36  SIKQVD-AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSD 85
           ++ +VD  F  + +++ R Y +  E + R   F+Q+ K  +E          YG +  +D
Sbjct: 315 ALDKVDHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFAD 374

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            +  E  +RTGL    ++K    A        +    +G  PK  DWRQ     + PV++
Sbjct: 375 MTSTEYKERTGLWQRDEQKPTGGAPA------VVPAYEGEFPKEFDWRQKNA--VTPVKN 426

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+ T  +E   A+    L   S+ +L++CD  +  CNGG +D A++ +K  
Sbjct: 427 QGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDI 486

Query: 206 -GLESQADYPYR-NKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYL 261
            GLE +A+YPY   K+   F  T    +   FV    +  G +  M   LL  GPI + L
Sbjct: 487 GGLEYEAEYPYEAKKQQCHFNRTLSHVQVSGFVD---LPKGNETAMQEWLLTHGPISIGL 543

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN------GILTWIVRNSWGDIGP 315
           N   ++ Y G         C+   LDH V IVGYG  +       +  WIV+NSWG    
Sbjct: 544 NANAMQFYRGGVSHPWKALCSKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWG 603

Query: 316 DHGYFQIERGANACGIESYAYLA 338
           + GY+++ RG N CG+   A  A
Sbjct: 604 EQGYYRVYRGDNTCGVSEMATSA 626


>gi|195152617|ref|XP_002017233.1| GL22196 [Drosophila persimilis]
 gi|194112290|gb|EDW34333.1| GL22196 [Drosophila persimilis]
          Length = 627

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 103/323 (31%), Positives = 156/323 (48%), Gaps = 31/323 (9%)

Query: 36  SIKQVD-AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSD 85
           ++ +VD  F  + +++ R Y +  E + R   F+Q+ K  +E          YG +  +D
Sbjct: 313 ALDKVDHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFAD 372

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            +  E  +RTGL    ++K    A        +    +G  PK  DWRQ     + PV++
Sbjct: 373 MTSTEYKERTGLWQRDEQKPTGGAPA------VVPAYEGEFPKEFDWRQKNA--VTPVKN 424

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+ T  +E   A+    L   S+ +L++CD  +  CNGG +D A++ +K  
Sbjct: 425 QGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDI 484

Query: 206 -GLESQADYPYR-NKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYL 261
            GLE +A+YPY   K+   F  T    +   FV    +  G +  M   LL  GPI + L
Sbjct: 485 GGLEYEAEYPYEAKKQQCHFNRTLSHVQVSGFVD---LPKGNETAMQEWLLTHGPISIGL 541

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN------GILTWIVRNSWGDIGP 315
           N   ++ Y G         C+   LDH V IVGYG  +       +  WIV+NSWG    
Sbjct: 542 NANAMQFYRGGVSHPWKALCSKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWG 601

Query: 316 DHGYFQIERGANACGIESYAYLA 338
           + GY+++ RG N CG+   A  A
Sbjct: 602 EQGYYRVYRGDNTCGVSEMATSA 624


>gi|6978721|ref|NP_037071.1| pro-cathepsin H precursor [Rattus norvegicus]
 gi|115729|sp|P00786.1|CATH_RAT RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
           mini chain; Contains: RecName: Full=Cathepsin H;
           Contains: RecName: Full=Cathepsin H heavy chain;
           Contains: RecName: Full=Cathepsin H light chain; Flags:
           Precursor
 gi|55886|emb|CAA68699.1| cathepsin H pre-pro-peptide [Rattus norvegicus]
 gi|55391460|gb|AAH85352.1| Cathepsin H [Rattus norvegicus]
 gi|149018921|gb|EDL77562.1| cathepsin H, isoform CRA_a [Rattus norvegicus]
 gi|226475|prf||1514114A cathepsin H
          Length = 333

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 87/223 (39%), Positives = 124/223 (55%), Gaps = 13/223 (5%)

Query: 121 RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQL 180
           R  GP P S+DWR+ K  V++PV++QG CGSCW F+TT  LES VA+    +  L++ QL
Sbjct: 109 RGTGPYPSSMDWRK-KGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQL 167

Query: 181 VEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           V+C  +  N  C GG    AFEY+    G+  +  YPY  K     +C +  EKA  FV+
Sbjct: 168 VDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNG---QCKFNPEKAVAFVK 224

Query: 238 DTWVTSGVDHMMHLLQSGPI--GVYLNHRLIES---YDGNPIRRNDWACNPHKLDHAVAI 292
           +  V   ++    ++++  +   V     + E    Y       N     P K++HAV  
Sbjct: 225 NV-VNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLA 283

Query: 293 VGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           VGYGE+NG+L WIV+NSWG    ++GYF IERG N CG+ + A
Sbjct: 284 VGYGEQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAACA 326


>gi|390178852|ref|XP_003736743.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
 gi|388859612|gb|EIM52816.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
          Length = 477

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 103/325 (31%), Positives = 157/325 (48%), Gaps = 31/325 (9%)

Query: 36  SIKQVD-AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSD 85
           ++ +VD  F  + +++ R Y +  E + R   F+Q+ K  +E          YG +  +D
Sbjct: 163 ALDKVDHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFAD 222

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            +  E  +RTGL    ++K    A        +    +G  PK  DWRQ     + PV++
Sbjct: 223 MTSTEYKERTGLWQRDEQKPTGGAPA------VVPAYEGEFPKEFDWRQKNA--VTPVKN 274

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+ T  +E   A+    L   S+ +L++CD  +  CNGG +D A++ +K  
Sbjct: 275 QGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDI 334

Query: 206 -GLESQADYPYR-NKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYL 261
            GLE +A+YPY   K+   F  T    +   FV    +  G +  M   LL  GPI + L
Sbjct: 335 GGLEYEAEYPYEAKKQQCHFNRTLSHVQVSGFVD---LPKGNETAMQEWLLTHGPISIGL 391

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN------GILTWIVRNSWGDIGP 315
           N   ++ Y G         C+   LDH V IVGYG  +       +  WIV+NSWG    
Sbjct: 392 NANAMQFYRGGVSHPWKALCSKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWG 451

Query: 316 DHGYFQIERGANACGIESYAYLASV 340
           + GY+++ RG N CG+   A  A +
Sbjct: 452 EQGYYRVYRGDNTCGVSEMATSAVL 476


>gi|341886805|gb|EGT42740.1| hypothetical protein CAEBREN_23878 [Caenorhabditis brenneri]
          Length = 396

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 99/308 (32%), Positives = 162/308 (52%), Gaps = 20/308 (6%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           FK +  K+ R +    E K RFE F+++ +E +E         YG +  SD++  E L+ 
Sbjct: 88  FKDFNKKFGREHKSLEEYKMRFEVFQKNLREFEELNQKNPSVQYGINKFSDKTESE-LKN 146

Query: 95  TGLRLTGKEKERLEADRERVKKFLNER---KKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
             +     +     +  + +  + N R   K    P  +DWR    KV++ V+ QG+CGS
Sbjct: 147 LLMDKKFLDSSLSNSTLKTLSSYRNPRNIIKNVQRPDYIDWRNDG-KVMS-VKDQGQCGS 204

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQA 211
           CWAFAT A +ESQ A+ K TL+ LS+ +LV+CD  +  C GG +  A  ++   GLE++ 
Sbjct: 205 CWAFATVAAVESQYAIRKGTLWSLSEQELVDCDGASYGCGGGFLTSALGFILGNGLETED 264

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQS-GPIGVYLN-HRLIES 268
           DYPY   ++   +C    +K +V++ + + +T   D +   + + GP+   ++  +   +
Sbjct: 265 DYPYSATKHD--QCWINGDKTRVWIDEGYQLTMSEDDVAEWVANVGPVSFAMSVPKSFPA 322

Query: 269 YDGNPIRRNDWACNPHKLD-HAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
           Y       ++  C    L  HA+AI+GYG++ G   WIV+NSWG    D GY ++ RG N
Sbjct: 323 YHDGIYSPSEHECKDESLGYHAMAIIGYGQEGGQNYWIVKNSWGGSWGDQGYMRLARGVN 382

Query: 328 ACGIESYA 335
           ACG+  Y 
Sbjct: 383 ACGMNDYV 390


>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
          Length = 603

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 98/312 (31%), Positives = 154/312 (49%), Gaps = 28/312 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSPQEI-L 92
           ++ +  K+ +TY +D++ + RF  FK         Q  ++    YG +   D + QE  +
Sbjct: 307 YEEFKQKYKKTYVNDDD-EYRFSVFKENLLRAHQLQTMEQGTAEYGVTQFFDLTSQEFQI 365

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           Q  G +    +     +   RV   ++E        S DWR      + PV  QG+CGSC
Sbjct: 366 QYLGFKYEDMQDTEEMSPSTRV--VMDE-------DSFDWRDHGA--VGPVLDQGKCGSC 414

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLESQA 211
           WAF+T   +E Q  L    L  LS+ QL++CD+ +  CNGG     +   +K  GLE  +
Sbjct: 415 WAFSTIGNIEGQWFLKTGELLSLSEQQLIDCDNVDEGCNGGYPPKTYGAVIKMGGLELNS 474

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIESY 269
           DYPY+    +  +C  +++K KV++ D+ V    +H+    L   GP+   LN   ++ Y
Sbjct: 475 DYPYKA---LAEKCHMDRQKLKVYINDSVVFPRNEHLQAEALKLMGPLSSALNANPLKFY 531

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
               +     +C P  L+HAV  VGYG +NG+  W V+NSWG    + GYF+I RG   C
Sbjct: 532 KTGIMHLPVASCFPRALNHAVLTVGYGTENGLPYWTVKNSWGTAFGEDGYFRIYRGGGTC 591

Query: 330 GIESYAYLASVK 341
           GI      A+++
Sbjct: 592 GINRLVSTAAIR 603



 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 67/185 (36%), Positives = 96/185 (51%), Gaps = 8/185 (4%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNL 188
           + DWRQ     + PV +QG CGSCWAF+    +E Q  L    L  LS  Q+++CDH + 
Sbjct: 42  NFDWRQHGA--VGPVWNQGPCGSCWAFSAVGNIEGQWFLKSGELLHLSVQQVLDCDHVDH 99

Query: 189 NCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
            CNGG     +  V Q  GL+  ADY Y+       +C  ++ K + +V  + + S  + 
Sbjct: 100 GCNGGYPPQVYRQVNQMGGLQLDADYSYKAAVG---KCHTDRSKFRAYVNSSVILSQNEQ 156

Query: 248 MM--HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWI 305
                L   GP+   LN R ++ Y    +     ACNP +L+HAV  VGYG + G+  WI
Sbjct: 157 FQANKLKTIGPLASTLNARTLQFYRKGIMHPTPSACNPGQLNHAVLTVGYGTEQGMPYWI 216

Query: 306 VRNSW 310
           V+NSW
Sbjct: 217 VKNSW 221


>gi|237643659|ref|YP_002884349.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
 gi|229358205|gb|ACQ57300.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
          Length = 323

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 162/318 (50%), Gaps = 28/318 (8%)

Query: 28  VWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET-------DEYYGT 80
           V +  AYD +K  + F+ ++ ++N+ Y+ + E   RF+ F+ +  E           Y  
Sbjct: 13  VVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQNDSAKYEI 72

Query: 81  SGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKV 139
           +  SD S  E + + TGL L  +        +   K  + ++  G  P   DWR  ++  
Sbjct: 73  NKFSDLSKDETIAKYTGLSLPTQT-------QNFCKVIILDQPPGKGPLEFDWR--RLNK 123

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF 199
           +  V++QG CG+CWAFAT   LESQ A+    L  LS+ Q+++CD  +  CNGG +  AF
Sbjct: 124 VTSVKNQGMCGACWAFATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAF 183

Query: 200 E-YVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SG 255
           E  +K  G++ ++DYPY    N    C     K  V V+D   ++    + +  LL+  G
Sbjct: 184 EAIIKMGGVQLESDYPYEADNN---NCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVG 240

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           PI + ++   I +Y    I+     C    L+HAV +VGYG +N I  W  +N+WG    
Sbjct: 241 PIPMAIDAADIVNYKQGIIKY----CFNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG 296

Query: 316 DHGYFQIERGANACGIES 333
           + G+F++++  NACG+ +
Sbjct: 297 EDGFFRVQQNINACGMRN 314


>gi|223049408|gb|ACM80348.1| cysteine proteinase [Solanum lycopersicum]
          Length = 368

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 99/317 (31%), Positives = 155/317 (48%), Gaps = 40/317 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEILQR 94
           F  +  ++ +TY  D E   RF  FK + +        +    +G +  SD +P E  Q+
Sbjct: 54  FTLFKKRFGKTYASDEEHHYRFSVFKANLRRAMRHQKLDPSAVHGVTQFSDMTPDEFSQK 113

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
               L    + R  +D  +      E     LP   DWR+     + PV++QG CGSCW+
Sbjct: 114 F---LGVNRRLRFPSDANKAPILPTED----LPSDFDWREHGA--VTPVKNQGSCGSCWS 164

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  LE    L    L  LS+ QLV+CDH          +  C+GG ++ AFEY +K 
Sbjct: 165 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEKDSCDSGCSGGLMNSAFEYTLKA 224

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            GL  + DYPY   +  T  C ++  K    V +  V S  +  +  +L+++GP+ V +N
Sbjct: 225 GGLMREEDYPYTGTDKAT--CKFDNTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAIN 282

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT------WIVRNSWGDIGPD 316
              +++Y G       + C+  +LDH V +VGYG     +       WI++NSWG+   +
Sbjct: 283 AVFMQTYVGG--VSCPYICS-KQLDHGVLLVGYGTGFSPIRMKEKPYWIIKNSWGEKWGE 339

Query: 317 HGYFQIERGANACGIES 333
            GY++I RG N CG++S
Sbjct: 340 SGYYKIRRGRNVCGVDS 356


>gi|23577865|ref|NP_703114.1| viral cathepsin [Rachiplusia ou MNPV]
 gi|37077115|sp|Q8B9D5.1|CATV_NPVR1 RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|23476510|gb|AAN28057.1| viral cathepsin [Rachiplusia ou MNPV]
          Length = 323

 Score =  147 bits (372), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 161/313 (51%), Gaps = 28/313 (8%)

Query: 33  AYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET-------DEYYGTSGSSD 85
           AYD +K  + F+ ++ ++N+ Y  + E   RF+ F+ +  E           Y  +  SD
Sbjct: 18  AYDLLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIIIKNQNDSAKYEINKFSD 77

Query: 86  RSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVE 144
            S  E + + TGL L  +        +   K  + ++  G  P   DWR  ++  +  V+
Sbjct: 78  LSKDETIAKYTGLSLPIQT-------QNFCKVIVLDQPPGKGPLEFDWR--RLNKVTSVK 128

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFE-YVK 203
           +QG CG+CWAFAT A LESQ A+    L  LS+ Q+++CD  +  CNGG +  AFE  +K
Sbjct: 129 NQGMCGACWAFATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIK 188

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SGPIGVY 260
             G++ ++DYPY    N    C     K  V V+D   ++T   + +  LL+  GPI + 
Sbjct: 189 MGGVQLESDYPYEADNN---NCRMNTNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMA 245

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           ++   I +Y    I+     C    L+HAV +VGYG +N I  W  +N+WG    + G+F
Sbjct: 246 IDAADIVNYKQGIIKY----CFNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWGEEGFF 301

Query: 321 QIERGANACGIES 333
           ++++  NACG+ +
Sbjct: 302 RVQQNINACGMRN 314


>gi|324513891|gb|ADY45690.1| Cysteine proteinase [Ascaris suum]
          Length = 398

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 105/314 (33%), Positives = 159/314 (50%), Gaps = 26/314 (8%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY----------YGTSGSSDR 86
           ++ +D+F  ++ K+++ Y D  +   RF  +  +    D            YG +  +D 
Sbjct: 85  LRLLDSFMEFMHKYDKVYVDSAQFVKRFRIYVNNMANIDALNERNYGRSIIYGENQFADW 144

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           S  E  Q    R   K   +     ++  + +  RK+  +P+  DWR     V+ PV++Q
Sbjct: 145 SEDEFRQILLPRGFYKNFHKRAIFIDQPDEIMMPRKE-IIPEHFDWR--PYNVVTPVKAQ 201

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
             CGSCWAFATT  +ES  A+    L  LS+ QL++C+  N  C+GG+ID A  YV + G
Sbjct: 202 LNCGSCWAFATTGTVESAYAIGTGELKSLSEQQLLDCNVENNACDGGDIDKALRYVYEEG 261

Query: 207 LESQADYPY--RNKENITFRCTYEKEKAKVFV-QDTWVTSGVDHMMHLLQSGPIGVYLNH 263
           L ++ DYPY    +E    R    + KA VF+ QD    S +D ++H   +GP+ V +N 
Sbjct: 262 LMTEYDYPYVAHRQETCYLRGETTRIKAAVFLHQDE--ASIIDWLIH---NGPVNVGVNV 316

Query: 264 RL-IESYDGNPIRRNDWACNPHKL-DHAVAIVGYG--EKNGILTWIVRNSWG-DIGPDHG 318
              +++Y G     N W C    +  HA+ IVGYG   K     WIV+NSWG   G ++G
Sbjct: 317 TADMKAYKGGVYTPNKWECENKIIGTHAMNIVGYGTWNKTNEKYWIVKNSWGQSYGVENG 376

Query: 319 YFQIERGANACGIE 332
           Y    RG N+CGIE
Sbjct: 377 YVYFARGINSCGIE 390


>gi|431910221|gb|ELK13294.1| Cathepsin F [Pteropus alecto]
          Length = 458

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 109/345 (31%), Positives = 168/345 (48%), Gaps = 34/345 (9%)

Query: 13  TEQVTYNVNTDSAIYVWRDLAYDSIKQVDA-FKTYIVKWNRTYTDDNEIKTRFEYFKQDG 71
           T++ + N  + S ++    L  D + QV + FK +++ +NRTY    E + R   F  + 
Sbjct: 130 TDEKSGNFRSFSPLFNKDTLPEDFVMQVASIFKEFVITYNRTYETKEEAQWRMSVFINNM 189

Query: 72  KETDEY---------YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKF-LNER 121
               +          YG +  SD + +E   RT + L    KE       R K+  L   
Sbjct: 190 MRAQKIQALDRGTARYGVTKFSDLTEEEF--RT-IYLNPLLKEL------RSKRMPLAMS 240

Query: 122 KKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLV 181
             GP P   DWR      +  V+ QG CGSCWAF+ T  +E Q  L +  L  LS+ +LV
Sbjct: 241 VSGPAPPEWDWRNKGA--VTKVKDQGMCGSCWAFSVTGNVEGQWFLKRGDLLSLSEQELV 298

Query: 182 ECDHGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           +CD  +  C GG    A+  +K  G LE++ DY Y         C +  EKAKV++ D+ 
Sbjct: 299 DCDKLDKACLGGLPSNAYSAIKTLGGLETEDDYGYNGHLQT---CNFSAEKAKVYINDSV 355

Query: 241 VTSGVDHMMH--LLQSGPIGVYLNHRLIESYD---GNPIRRNDWACNPHKLDHAVAIVGY 295
             S  +  +   L ++GPI + +N   ++ Y     +P+R     C+P  +DHAV +VGY
Sbjct: 356 ELSQNEQKLAAWLAKNGPISIAINAFGMQFYRHGISHPLRP---LCSPWLIDHAVLLVGY 412

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
           G ++ I  W ++NSWG    + GY+ + RG+ ACG+   A  A V
Sbjct: 413 GNRSDIPFWAIKNSWGTDWGEEGYYYLHRGSGACGVNIMASSAVV 457


>gi|27819101|gb|AAO23117.1| cysteine proteinase [Bombyx mori NPV]
          Length = 323

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 99/318 (31%), Positives = 161/318 (50%), Gaps = 28/318 (8%)

Query: 28  VWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET-------DEYYGT 80
           V +  AYD +K  + F+ ++ ++N+ Y+ + E   RF+ F+ +  E           Y  
Sbjct: 13  VVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQNDSAKYEI 72

Query: 81  SGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKV 139
           +  SD S  E + + TGL L  +        +   K  L ++  G  P   DWR  ++  
Sbjct: 73  NKFSDLSKDETIAKYTGLSLPTQT-------QNFCKVILLDQPPGKGPLEFDWR--RLNK 123

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF 199
           +  V++QG CG+CWAFAT   LESQ A+    L  LS+ Q++ CD  +  CNGG +  AF
Sbjct: 124 VTSVKNQGMCGACWAFATLGSLESQFAIKHNELINLSEQQMIGCDFVDAGCNGGLLHTAF 183

Query: 200 E-YVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SG 255
           E  +K  G++ ++DYPY    N    C     K  V V+D   ++    + +  LL+  G
Sbjct: 184 EAIIKMGGVQLESDYPYEADNN---NCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVG 240

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           PI + ++   I +Y    I+     C    L+HAV +VGYG +N I  W  +N+WG    
Sbjct: 241 PIPMAIDAADIVNYKQGIIKY----CFDSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG 296

Query: 316 DHGYFQIERGANACGIES 333
           + G+F++++  NACG+ +
Sbjct: 297 EDGFFRVQQNINACGMRN 314


>gi|9627870|ref|NP_054157.1| viral cathepsin-like protein [Autographa californica
           nucleopolyhedrovirus]
 gi|114680178|ref|YP_758591.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
 gi|115751|sp|P25783.1|CATV_NPVAC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|332491|gb|AAA46752.1| viral cathepsin [Autographa californica nucleopolyhedrovirus]
 gi|559196|gb|AAA66757.1| viral cathepsin-like protein [Autographa californica
           nucleopolyhedrovirus]
 gi|113015253|gb|ABE68510.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
          Length = 323

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 161/313 (51%), Gaps = 28/313 (8%)

Query: 33  AYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET-------DEYYGTSGSSD 85
           AYD +K  + F+ ++ ++N+ Y  + E   RF+ F+ +  E           Y  +  SD
Sbjct: 18  AYDLLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIINKNQNDSAKYEINKFSD 77

Query: 86  RSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVE 144
            S  E + + TGL L  +        +   K  + ++  G  P   DWR  ++  +  V+
Sbjct: 78  LSKDETIAKYTGLSLPIQT-------QNFCKVIVLDQPPGKGPLEFDWR--RLNKVTSVK 128

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFE-YVK 203
           +QG CG+CWAFAT A LESQ A+    L  LS+ Q+++CD  +  CNGG +  AFE  +K
Sbjct: 129 NQGMCGACWAFATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIK 188

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SGPIGVY 260
             G++ ++DYPY    N    C     K  V V+D   ++T   + +  LL+  GPI + 
Sbjct: 189 MGGVQLESDYPYEADNN---NCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMA 245

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           ++   I +Y    I+     C    L+HAV +VGYG +N I  W  +N+WG    + G+F
Sbjct: 246 IDAADIVNYKQGIIKY----CFNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWGEDGFF 301

Query: 321 QIERGANACGIES 333
           ++++  NACG+ +
Sbjct: 302 RVQQNINACGMRN 314


>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
          Length = 367

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 105/319 (32%), Positives = 160/319 (50%), Gaps = 45/319 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEILQR 94
           F ++  K+ + Y    E   RF  FK + +        +    +G +  SD +P E  ++
Sbjct: 53  FASFKAKFGKKYATKEEHDRRFGVFKSNLRRARLHAKLDPSAVHGVTKFSDLTPAEFRRQ 112

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G +  RL A+ ++      +     LPK  DWR  K  V N V+ QG CGSCW+
Sbjct: 113 ----FLGFKPLRLPANAQKAPILPTKD----LPKDFDWRD-KGAVTN-VKDQGACGSCWS 162

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYVKQY 205
           F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+ Q 
Sbjct: 163 FSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQS 222

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            G++ + DYPY  ++     C ++K K    V +  V S  +  +  +L+++GP+ V +N
Sbjct: 223 GGVQKEKDYPYTGRDGT---CKFDKTKVAATVSNYSVVSLDEDQIAANLVKNGPLAVGIN 279

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE--------KNGILTWIVRNSWGDIG 314
              +++Y G       + C  H LDH V IVGYGE        KN    WI++NSWG+  
Sbjct: 280 AVFMQTYIGG--VSCPYICGKH-LDHGVLIVGYGEGAYAPIRFKNKPY-WIIKNSWGESW 335

Query: 315 PDHGYFQIERGANACGIES 333
            ++GY++I RG N CG++S
Sbjct: 336 GENGYYKICRGRNVCGVDS 354


>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus
           pulchellus]
          Length = 475

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 102/316 (32%), Positives = 155/316 (49%), Gaps = 28/316 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK---------ETDEYYGTSGSSDRSPQEILQ 93
           F  +   +N+TY D  E + RF  FK + K         E   +YG +  SD SP E  +
Sbjct: 166 FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSE-FE 224

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           R  L L    K+ L   +  VK         PLP   DWR      +  V++QG CGSCW
Sbjct: 225 RHYLGL----KKDLAEHKAEVKPIKVGPVNEPLPDLFDWRTKGA--VTEVKNQGMCGSCW 278

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQAD 212
           AF+ T  +E Q  L +  L  LS+ +LV+CDHG+  C GG +  A + V +  GLE++++
Sbjct: 279 AFSVTGNVEGQWFLSRSKLLSLSEQELVDCDHGDHGCKGGYMGQAMKAVIEMGGLETESE 338

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQD--TWVTSGVDHMMHLLQSGPIGVYLNHRLIESYD 270
           YPY+  +     C + K ++K  VQ       +  +    L++ GP+ + +N   ++ Y 
Sbjct: 339 YPYKGVDGT---CEFNKTESKARVQSFVGLPQNETELAYWLMKHGPVSIGINANAMQFYF 395

Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYG------EKNGILTWIVRNSWGDIGPDHGYFQIER 324
           G       + C+P  LDH V +VG+G       +  +  WIV+NSWG    + GY+++ R
Sbjct: 396 GGISHPWKFLCSPTDLDHGVLLVGFGVDKRSFRRKPVPYWIVKNSWGKYWGEKGYYRVYR 455

Query: 325 GANACGIESYAYLASV 340
           G   CG+   A  A V
Sbjct: 456 GDGTCGVNQMALSAVV 471


>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
          Length = 325

 Score =  147 bits (371), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 100/295 (33%), Positives = 151/295 (51%), Gaps = 29/295 (9%)

Query: 50  WNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQRTGLRLT 100
           + + Y ++++ K RF  FK +     +Y         YG +  SD +P+E      + L 
Sbjct: 39  YGKAYANEDDQK-RFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTPEEF---AAMYLG 94

Query: 101 GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
            +  ER++    RV+  LN+ +  P   S+DWR  K   + PVE QG CGSCWAF+ TA 
Sbjct: 95  SRIDERVD----RVQ--LNDLQTAP--ASVDWR--KKGAVGPVEDQGSCGSCWAFSVTAN 144

Query: 161 LESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKE 219
           +E Q  L    L  LSK QLV+CD  +  C+GG     ++ +K+ G LE Q+ YPY + +
Sbjct: 145 VEGQWFLKTGRLVSLSKQQLVDCDRLDHGCSGGYPPYTYKEIKRMGGLELQSAYPYTSWK 204

Query: 220 NITFRCTYEKEKAKVFVQDTWV--TSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRN 277
                C  ++ K    + D+ V  T        L + GP+   LN   ++ Y    +  +
Sbjct: 205 QA---CRIDRSKLVAKIDDSIVLETDEEKQAAWLAEHGPMSTCLNAGPLQFYQSGILHPS 261

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIE 332
              C+P  L+HAV  VGY  ++G+  W VRNSWG    ++GYF+I RG   CGI+
Sbjct: 262 KAMCSPEGLNHAVLTVGYDTEHGVPYWTVRNSWGTRWGENGYFRIYRGDGTCGID 316


>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
          Length = 473

 Score =  147 bits (371), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 99/309 (32%), Positives = 157/309 (50%), Gaps = 24/309 (7%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSD 85
           +S++ +  FK ++VK+ + Y+   E + R + F+++ K  ++          YG +  SD
Sbjct: 167 ESVQLLGQFKDFMVKYKKDYSSQEEAERRLQIFQENLKTAEKLQALDQGSAEYGVTKFSD 226

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            + +E       R T       +    R         K P P S DWR      ++PV++
Sbjct: 227 LTEEE------FRSTYLNPLLSQWTLHR-GMKPAPPAKTPAPDSWDWRDHGA--VSPVKN 277

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+ T  +E Q  L   TL  LS+ +LV+CD  +  C GG    A+E +++ 
Sbjct: 278 QGMCGSCWAFSVTGNIEGQWFLKNGTLLSLSEQELVDCDGLDQACRGGLPSNAYEAIEKL 337

Query: 206 G-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN 262
           G LES+ DY Y   +    +C +   K   ++  +      +  +   L ++GPI V LN
Sbjct: 338 GGLESETDYSYTGHKQ---KCDFTNRKVAAYINSSVELPKDEREIAAWLAENGPISVALN 394

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
              ++ Y           CNP  +DHAV +VGYGE+NGI  W ++NSWG+   + GY+ +
Sbjct: 395 AFAMQFYKKGVSHPWKIFCNPWMIDHAVLLVGYGERNGIPFWAIKNSWGEDYGEQGYYYL 454

Query: 323 ERGANACGI 331
           +RG+NACGI
Sbjct: 455 QRGSNACGI 463


>gi|119964630|ref|YP_950826.1| cathepsin [Maruca vitrata MNPV]
 gi|119514473|gb|ABL76048.1| cathepsin [Maruca vitrata MNPV]
          Length = 324

 Score =  147 bits (371), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 103/314 (32%), Positives = 162/314 (51%), Gaps = 29/314 (9%)

Query: 33  AYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKE--------TDEYYGTSGSS 84
           AYD +K  + F+ +++++N+ Y  + E   RF+ F+ +  E        +   Y  +  S
Sbjct: 18  AYDLLKAPNYFEEFVLQFNKNYGSEIEKLRRFKIFQHNLNEIINKNQNDSAAKYEINKFS 77

Query: 85  DRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           D S  E + + TGL L  +        +   K  + ++  G  P   DWR+   KV N V
Sbjct: 78  DLSKDETIAKYTGLSLPIQT-------QNFCKVIVLDQPPGKGPFEFDWRRLN-KVTN-V 128

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV- 202
           ++QG CG+CWAFA  A LESQ A+    L  LS+ Q+++CD  +  CNGG +  AFE V 
Sbjct: 129 KNQGVCGACWAFAALASLESQFAMKHNQLIDLSEQQMIDCDSVDAGCNGGLLHTAFEAVI 188

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQS-GPIGV 259
           K  G++ + DYPY    N    C     K  V V+D   ++    + +  LL+S GPI +
Sbjct: 189 KMGGVQLEKDYPYEAANN---NCRMNSNKFLVKVKDCYRYIIVYEEKLKDLLRSVGPIPM 245

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGY 319
            ++   I +Y    I+     C    L+HAV +VGYG +N I  W  +N+WG    + GY
Sbjct: 246 AIDAADIVNYKQGIIKY----CLNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWGESGY 301

Query: 320 FQIERGANACGIES 333
           F++++  NACG+ +
Sbjct: 302 FRLQQNINACGMRN 315


>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
          Length = 325

 Score =  147 bits (371), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 96/304 (31%), Positives = 149/304 (49%), Gaps = 29/304 (9%)

Query: 50  WNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQRTGLRLT 100
           + ++Y +D++ K RF  FK +      Y         YG +  SD +P+E   +      
Sbjct: 39  YGKSYANDDDEK-RFAIFKDNLVRAQNYQLQEQGTARYGVTQFSDLTPEEFAAKF----- 92

Query: 101 GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
                R +   ERV+  LN+ K  P  +S+DWR+  +  + PVE QG CGSCWAF+    
Sbjct: 93  --LSSRFDDQVERVQ--LNDLKAAP--ESVDWRE--LGAVAPVEDQGSCGSCWAFSVAGN 144

Query: 161 LESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLESQADYPYRNKE 219
           +E Q  L    L  LSK QLV+CD  +  C+GG     + E ++  GLE+Q DYPY  +E
Sbjct: 145 VEGQWFLKTGQLVSLSKQQLVDCDVQDSGCDGGYPPTTYGEIIRMGGLEAQRDYPYVGRE 204

Query: 220 NITFRCTYEKEKAKVFVQDTWVTSGVD--HMMHLLQSGPIGVYLNHRLIESYDGNPIRRN 277
                C  ++ K    +  + V    +     ++ + GP+   +N   ++ Y       +
Sbjct: 205 Q---PCKLDESKLLAKINSSIVLEANEKKQAAYIAEHGPMSSGINAVTLQFYQSGISHPS 261

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYL 337
              C P  L+H V  VGYG ++G+  WI++NSWG    + GYF++ RG   CGIE     
Sbjct: 262 KSQCQPDWLNHGVLSVGYGTEDGVPYWIIKNSWGTGWGEKGYFRLYRGDGTCGIEKVVSS 321

Query: 338 ASVK 341
           A ++
Sbjct: 322 AIIR 325


>gi|195497262|ref|XP_002096026.1| GE25302 [Drosophila yakuba]
 gi|194182127|gb|EDW95738.1| GE25302 [Drosophila yakuba]
          Length = 615

 Score =  147 bits (370), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 153/315 (48%), Gaps = 30/315 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           F  + V++ R Y    E + R   F+Q+ K  ++          YG +  +D +  E  +
Sbjct: 309 FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAKYGITEFADMTSSEYKE 368

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           RTGL       +R EA        +     G LPK  DWRQ     +  V++QG CGSCW
Sbjct: 369 RTGLW------QRNEAKATGGSVAVVPAYHGELPKEFDWRQKNA--VTQVKNQGSCGSCW 420

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQAD 212
           AF+ T  +E   A+    L   S+ +L++CD  +  CNGG +D A++ +K   GLE +A+
Sbjct: 421 AFSVTGNIEGLHAVKTGDLKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAE 480

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGVYLNHRLIESY 269
           YPY+ K+N   +C + +  + V V     +  G +  M   LL +GPI + +N   ++ Y
Sbjct: 481 YPYKAKKN---QCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGPISIGINANAMQFY 537

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEK------NGILTWIVRNSWGDIGPDHGYFQIE 323
            G         C+   LDH V +VGYG          +  WIV+NSWG    + GY+++ 
Sbjct: 538 RGGVSHPWKALCSKKNLDHGVLVVGYGVSEYPNFHKTLPYWIVKNSWGPRWGEQGYYRVY 597

Query: 324 RGANACGIESYAYLA 338
           RG N CG+   A  A
Sbjct: 598 RGDNTCGVSEMATSA 612


>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
          Length = 360

 Score =  147 bits (370), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 100/319 (31%), Positives = 152/319 (47%), Gaps = 43/319 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F +++ ++ ++Y D+ E   RF  FK + +    +        +G +  +D +P E  +R
Sbjct: 45  FSSFLSRYGKSYADEAEHAYRFSVFKSNLRRARRHQRLDPTAVHGVTRFADLTPSE-FRR 103

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           T L L  + + R                   LP   DWR      + PV++QG CGSCW+
Sbjct: 104 TYLGL--RRRPRTAGSTHDAPILPTNE----LPADFDWRDHGA--VTPVKNQGSCGSCWS 155

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-KQ 204
           F+    LE    L    L  LS+ QLV+CDH          +  CNGG +  AFEY+ K 
Sbjct: 156 FSAAGALEGANYLSTGNLVSLSEQQLVDCDHECDSSEPDSCDQGCNGGLMTTAFEYILKS 215

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GLE +ADYPY   +  T  C + K K      +  V S +D      +L++ GP+ V +
Sbjct: 216 GGLEREADYPYTGTDRGT--CKFNKAKISAVASNFSVVS-IDEDQIAANLVKHGPLAVGI 272

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G       + C  H LDH V +VGYG              WI++NSWG+  
Sbjct: 273 NAVFMQTYVGG--VSCPYICGKH-LDHGVLLVGYGSAGFAPIRFKEKPYWIIKNSWGENW 329

Query: 315 PDHGYFQIERGANACGIES 333
            ++GY++I RG N CG++S
Sbjct: 330 GENGYYKICRGRNVCGVDS 348


>gi|285002340|ref|YP_003422404.1| cathepsin [Pseudaletia unipuncta granulovirus]
 gi|197343600|gb|ACH69415.1| cathepsin [Pseudaletia unipuncta granulovirus]
          Length = 338

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 104/321 (32%), Positives = 159/321 (49%), Gaps = 37/321 (11%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK--------QDGKETDEYYGTSG 82
           +L YD       F  ++ K+ + Y +D E K+RF+ FK        ++ +E    +G + 
Sbjct: 25  NLQYDLSNSEVLFDEFVTKYGKVYANDAERKSRFDVFKANLAIINERNAQEESATFGINF 84

Query: 83  SSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERK-KGP----LPKSLDWRQSK 136
            SD S  E+L++ TG       K  L  D E+  K+   R   GP    LP++ +WR S 
Sbjct: 85  YSDLSSNELLRKQTGF------KTALHNDNEKKSKYCTRRVITGPSTRLLPEAFNWRDSD 138

Query: 137 VKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNID 196
              +  V+ Q  CGSCWAF+  A +ESQ  +  K    LS+ Q+V+CD  N  CNGG + 
Sbjct: 139 A--VTSVKQQRDCGSCWAFSAVANIESQYYIKNKQYVDLSEQQIVDCDPINNGCNGGLMS 196

Query: 197 VAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS----GVDHMMHL 251
            A EYV +  G++ + DY Y   E +       K  +   VQ +   S      + +  L
Sbjct: 197 WAMEYVMRSGGVQLEEDYQYVGNEGVC------KNNSANVVQISGCVSYDLRNEERLREL 250

Query: 252 LQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L S GPI V ++   + +Y     +    A   H L+HAV +VGYG +N    W+ +NSW
Sbjct: 251 LVSNGPISVAIDVMDVTNYQSGIAKHCSVA---HGLNHAVLLVGYGVQNNTPYWVFKNSW 307

Query: 311 GDIGPDHGYFQIERGANACGI 331
           G    ++GYF++ R  N+CG+
Sbjct: 308 GSDWGENGYFRVLRDVNSCGM 328


>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
          Length = 370

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 104/319 (32%), Positives = 159/319 (49%), Gaps = 45/319 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEILQR 94
           F ++  K+ +TY    E   RF  FK + +        +    +G +  SD +P E  ++
Sbjct: 56  FASFKAKFAKTYATKEEHDHRFGVFKSNLRRARLHAKLDPSAVHGVTKFSDLTPAEFRRQ 115

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G +  R  A  ++      +     LPK  DWR  K  V N V+ QG CGSCW+
Sbjct: 116 ----FLGLKPLRFPAHAQKAPILPTKD----LPKDFDWRD-KGAVTN-VKDQGACGSCWS 165

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYVKQY 205
           F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+ Q 
Sbjct: 166 FSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQS 225

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            G++ + DYPY  ++     C ++K K    V +  V S  +  +  +L+++GP+ V +N
Sbjct: 226 GGVQKEKDYPYTGRDGT---CKFDKTKVAATVSNYSVVSLDEEQIAANLVKNGPLAVAIN 282

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE--------KNGILTWIVRNSWGDIG 314
              +++Y G       + C  H LDH V +VGYGE        KN    WI++NSWG+  
Sbjct: 283 AVFMQTYVGG--VSCPYICGKH-LDHGVLLVGYGEGAYAPIRFKNKPY-WIIKNSWGESW 338

Query: 315 PDHGYFQIERGANACGIES 333
            ++GY++I RG N CG++S
Sbjct: 339 GENGYYKICRGRNVCGVDS 357


>gi|68304200|ref|YP_249668.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
 gi|67973029|gb|AAY83995.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
          Length = 344

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 163/313 (52%), Gaps = 28/313 (8%)

Query: 32  LAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK--------QDGKETDEYYGTSGS 83
           L Y+  +    F+T+  K+ + Y DDNE   R++ FK        ++ +     Y  +  
Sbjct: 36  LQYNLERAPQYFETFQTKYKKVYADDNERDYRYKIFKTNLEIINLKNQQNDSAVYNINKF 95

Query: 84  SDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +D +  E++ + TGL +      R  A +   +  + +       ++ DWRQ     +  
Sbjct: 96  ADLTKNEVIAKFTGLGI------RSPALKNSCEPVIVDGPSKYTQETFDWRQ--FNKITS 147

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV 202
           V+ QG CGSCWAF+T A LESQ A+       LS+ QLV+CD  ++ C GG +  A+E +
Sbjct: 148 VKDQGFCGSCWAFSTIAGLESQYAIKYNEHVDLSEQQLVDCDTIDMGCAGGLLHTAYEEI 207

Query: 203 KQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLL-QSGPIG 258
               GLE + DYPYR+ +     C  + +K +V V +   +V    D +  +L + GPI 
Sbjct: 208 MAMGGLEYEEDYPYRSVQG---PCRLQSDKFEVSVDNCYRYVLYSEDKLKDVLHEMGPIA 264

Query: 259 VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHG 318
           V ++   +  Y G  I     +C  + L+HAV +VGYG +NG+  W+++NSWG    ++G
Sbjct: 265 VAVDAVDLTDYYGGIIT----SCKNYGLNHAVLLVGYGIENGVPFWVLKNSWGSDYGENG 320

Query: 319 YFQIERGANACGI 331
           + +++R  N+CG+
Sbjct: 321 FVRVKRNVNSCGM 333


>gi|395502422|ref|XP_003755580.1| PREDICTED: pro-cathepsin H [Sarcophilus harrisii]
          Length = 334

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 124/229 (54%), Gaps = 25/229 (10%)

Query: 121 RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQL 180
           R+ GP P+S+DWR+ K   ++PV++QG CGSCW F+TT  LES VA+    L  L++ QL
Sbjct: 110 RRLGPYPESVDWRK-KGNFVSPVKNQGGCGSCWTFSTTGGLESAVAIATGKLLSLAEQQL 168

Query: 181 VEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           V+C  D  N  CNGG    AFEY+    G+  +  YPY  K+     C ++  KA  FV+
Sbjct: 169 VDCAQDFNNHGCNGGLPSQAFEYIMYNKGIMGEDTYPYEGKDGT---CKFQPNKAIAFVK 225

Query: 238 DTWVTSGVDHMMH---LLQSGPIGVYLN--------HRLIESYDGNPIRRNDWACNPHKL 286
           D    +  D       +    P+             H+ I S   NP      + +P K+
Sbjct: 226 DVANITAYDEEAMTEAVAHHNPVSFAFEVTDDFLSYHKGIYS---NP----KCSKSPDKV 278

Query: 287 DHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           +HAV  VGYG++NGI  WIV+NSWG    ++GYF IERG N CG+   A
Sbjct: 279 NHAVLAVGYGKENGIPYWIVKNSWGTSWGNNGYFLIERGKNMCGLADCA 327


>gi|195395906|ref|XP_002056575.1| GJ11017 [Drosophila virilis]
 gi|194143284|gb|EDW59687.1| GJ11017 [Drosophila virilis]
          Length = 599

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 150/315 (47%), Gaps = 30/315 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           F  + VK+ R Y +  E + R   F+Q  K   E          YG +  +D +  E  Q
Sbjct: 293 FHKFQVKYKRRYANSAEHQMRLRIFRQSLKTIQELNANEQGSAKYGITEFADMTSTEYAQ 352

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           R GL       +R E         +     G LPK  DWRQ     +  V++QG+CGSCW
Sbjct: 353 RAGLW------QRSEGKPTGGAAAVVPAYAGELPKEFDWRQKNA--VTHVKNQGQCGSCW 404

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQAD 212
           AF+ T  +E   A+    L   S+ +L++CD  +  CNGG +D A++ +K   GLE +++
Sbjct: 405 AFSVTGNIEGAYAIKTGDLQEFSEQELLDCDSKDSACNGGLMDNAYKAIKDIGGLEYESE 464

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGVYLNHRLIESY 269
           YPY  K+    +C + +  + V V     +  G +  M   LL +GPI + +N   ++ Y
Sbjct: 465 YPYEGKKK---QCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTNGPISIGINANAMQFY 521

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKN------GILTWIVRNSWGDIGPDHGYFQIE 323
            G         C+   LDH V IVGYG  +       +  WIV+NSWG    + GY+++ 
Sbjct: 522 RGGVSHPWSPLCSKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVY 581

Query: 324 RGANACGIESYAYLA 338
           RG N CG+   A  A
Sbjct: 582 RGDNTCGVSEMATSA 596


>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
          Length = 322

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/305 (31%), Positives = 152/305 (49%), Gaps = 32/305 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------------YGTSGSSDRSPQE 90
           F+ + +K  +TY +  E   RF  FK + +  +++             G +  +D + +E
Sbjct: 25  FQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQEE 84

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
              R  L L+  +K         +           +P S+DWR +K +V   V+ QG CG
Sbjct: 85  F--RAFLTLSSSKKPHFNTTEHVLTGL-------AVPDSIDWR-TKGQVTG-VKDQGNCG 133

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQYGLES 209
           SCWAF+ T   E+        L  LS+ QLV+C    N  CNGG +D  F YVK  GLE+
Sbjct: 134 SCWAFSVTGSTEAAYYRKAGKLVSLSEQQLVDCSTDINAGCNGGYLDETFTYVKSKGLEA 193

Query: 210 QADYPYRNKENITFRCTYEKEKA--KVFVQDTWVTSGVDHMMHLLQS-GPIGVYLNHRLI 266
           ++ YPY+  +     C Y   K   KV    +  +   + ++  + + GP+ V ++   +
Sbjct: 194 ESTYPYKGTDG---SCKYSASKVVTKVSGHKSLKSEDENALLDAVGNVGPVSVAIDATYL 250

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
            SY+ + I  +DW C+P +L+H V +VGYG  NG   WIV+NSWG    + GYF++ RG 
Sbjct: 251 SSYE-SGIYEDDW-CSPSELNHGVLVVGYGTSNGKKYWIVKNSWGGSFGESGYFRLLRGK 308

Query: 327 NACGI 331
           N CG+
Sbjct: 309 NECGV 313


>gi|1353726|gb|AAB01769.1| cysteine proteinase homolog, partial [Naegleria fowleri]
          Length = 347

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 105/320 (32%), Positives = 153/320 (47%), Gaps = 37/320 (11%)

Query: 43  FKTYIVKWNRTYTD---DNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEI 91
            K   +K++R Y       E   R++ FK + +++  Y        +G +  SD +P+E 
Sbjct: 29  MKKLFIKFSRKYAKVYGTEEHNNRYQIFKANVEKSRYYNHVGKRENFGITKFSDLTPEE- 87

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
            +R  L  T   +E  +         L+E++    P S DWRQ     +  V++QG CGS
Sbjct: 88  FKRMFLMKTYTPEEAKKILAAPQHAVLSEKEVQTAPTSFDWRQHGA--VTRVKNQGACGS 145

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLN----------CNGGNIDVAFEY 201
           CW F+TT  +E Q A+ K  L  LS+ QLV+CDH  +           CNGG +  AF+Y
Sbjct: 146 CWTFSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGGLMWSAFQY 205

Query: 202 V-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIG 258
           V K  GL+++  YPY   E +   C + K      +      S  ++ M   L  +GPI 
Sbjct: 206 VIKNGGLDTEDSYPY---EGVDDTCRFNKSNVAATISSWTSISSDENQMAAWLAANGPIS 262

Query: 259 VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-----WIVRNSWGDI 313
           + +N   ++ Y       + W CNP  LDH V IVGYG     L      WIV+NSWG  
Sbjct: 263 IAINAEWLQYYTSGI--SDPWFCNPQDLDHGVLIVGYGVGKSWLGSEENYWIVKNSWGSD 320

Query: 314 GPDHGYFQIERGANACGIES 333
             + GYF+I RG   CG+ S
Sbjct: 321 WGEDGYFRIIRGKGKCGLNS 340


>gi|334314327|ref|XP_001368532.2| PREDICTED: cathepsin H-like [Monodelphis domestica]
          Length = 344

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 105/310 (33%), Positives = 152/310 (49%), Gaps = 45/310 (14%)

Query: 49  KWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLE 108
           ++N+TY + NE   R   F  + +  DE+     S       + Q + +     +K+ L 
Sbjct: 50  QFNKTY-NLNEYHRRLHNFLNNKRRIDEHNAGKHSYTLG---LNQFSDMSFDEFKKQYLM 105

Query: 109 ADRERVK--KFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVA 166
           ++ +     K  + R+ GP P  +DWR+ K   ++PV++QG CGSCW F+TT  LES VA
Sbjct: 106 SEPQNCSATKGSHVRRVGPYPDFMDWRK-KGNYVSPVKNQGGCGSCWTFSTTGGLESAVA 164

Query: 167 LLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITF 223
           +    L  L++ QLV+C     N  CNGG    AFEY+    G+  +  YPY  K+    
Sbjct: 165 IATGKLLSLAEQQLVDCAQAFNNHGCNGGLPSQAFEYIMYNNGIMGEDTYPYEGKDGT-- 222

Query: 224 RCTYEKEKAKVFVQDT---------WVTSGVDH---------MMHLLQSGPIGVYLNHRL 265
            C ++ +KA  FV+D           +T  V H         +     S   G+Y N R 
Sbjct: 223 -CRFKPDKAIAFVKDVVNITIYDEEAMTEAVAHHNPVSFAFEVTEDFMSYRDGIYSNPRC 281

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
            +S              P K++HAV  VGYG+ NGIL WIV+NSWG    ++GYF IERG
Sbjct: 282 DKS--------------PDKVNHAVLAVGYGKNNGILYWIVKNSWGTSWGNNGYFLIERG 327

Query: 326 ANACGIESYA 335
            N CG+   A
Sbjct: 328 KNMCGLADCA 337


>gi|332374900|gb|AEE62591.1| unknown [Dendroctonus ponderosae]
          Length = 359

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/323 (30%), Positives = 161/323 (49%), Gaps = 26/323 (8%)

Query: 24  SAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------ 77
           S I  +   A +     + F T+  K+ + Y +D+E+  R E FK++  + +E+      
Sbjct: 5   SLILFFMLTAKNGAFATETFVTFQQKYGKVYQNDSELSVREEIFKENLAKIEEHNKQFQQ 64

Query: 78  ------YGTSGSSDRSPQEILQRTGLR-LTGKEKERLEADRERVKKFLNERKKGPLPKSL 130
                  G +  SD +  E      +  LT +  +++E       K+ +E      P S+
Sbjct: 65  NLVSYELGLNQFSDLTEAEFQALLTMSPLTDQLTKQME-------KYNSEFDIKTAPVSV 117

Query: 131 DWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNC 190
           +W +  V  + PV++QG CGSCW F TT  +ES++AL   +L  LS+ QL++C+  N  C
Sbjct: 118 NWAEKGV--VTPVKNQGNCGSCWTFTTTGTIESRLALKTGSLVSLSEQQLLDCNRVNAGC 175

Query: 191 NGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH 250
           +GG +  A +YV+  GL ++ +YPY+   N T   T++   A         T     +M 
Sbjct: 176 DGGVLSYALQYVESAGLTTEDEYPYK-AWNGTCNSTHKPVAAYTKGYTLIYTRSESDLMK 234

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
            +  GP+ V LN  L++ Y       N  AC+   ++H   +VGY E   +  WI++NSW
Sbjct: 235 AVAEGPVAVALNADLLQYYSKGIF--NPSACS-STVNHGGLVVGYEENATLPYWIIKNSW 291

Query: 311 GDIGPDHGYFQIERGANACGIES 333
           G    ++GYF++ +G N CGI S
Sbjct: 292 GATWGENGYFRMAKGYNLCGITS 314


>gi|341887744|gb|EGT43679.1| hypothetical protein CAEBREN_04647 [Caenorhabditis brenneri]
          Length = 394

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 77/219 (35%), Positives = 123/219 (56%), Gaps = 6/219 (2%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P S DWR SK  ++ PV++QG CGSCWAFA  A +E+Q AL K  L  LS+ +LV+CD 
Sbjct: 177 VPDSFDWRSSKSPMVTPVKNQGDCGSCWAFAVVAAIETQFALKKGALLSLSEQELVDCDV 236

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSG 244
            +  CNGG ++ A  +  + GLE++ADYPY   +    +C+ + +K +V + D + + + 
Sbjct: 237 LSYGCNGGYLNTALLFAIEKGLETEADYPYVAIQQK--QCSIQTQKIRVKIDDGYHLKAN 294

Query: 245 VDHMMH-LLQSGPIGVYLN-HRLIESYDGNPIRRNDWACNPHKL-DHAVAIVGYGEKNGI 301
            D +   + + GP+   +   + I  Y G     +   C    + +H +AIVG+G +   
Sbjct: 295 EDQIADWVAREGPVSFLMPVPKSIMFYRGGIFNPSMAECRAQAVGNHVMAIVGFGREGNQ 354

Query: 302 LTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
             WIV+NSWG    + GY ++ RG N CG  +Y +   +
Sbjct: 355 KFWIVKNSWGTRWGEQGYLKMARGVNICGFTNYVFAPHI 393


>gi|94556727|gb|ABF46642.1| papain-like cysteine proteinase [Pachysandra terminalis]
          Length = 374

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 100/318 (31%), Positives = 155/318 (48%), Gaps = 41/318 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEILQR 94
           F ++  ++ + YT  +E   RF  FK + +        +    +G +   D +P E  +R
Sbjct: 58  FSSFKKRFGKAYTSCDEHDRRFGVFKANLRRAKRNQILDPSAVHGVTQFFDLTPAE-FRR 116

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           T L   G ++ RL AD               LP   DWR      + PV++QG CGSCW+
Sbjct: 117 TYL---GLKRLRLPADTHEAPIL----PTNDLPADFDWRDHGA--VTPVKNQGSCGSCWS 167

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+ T  LE    L    L  LS+ QLV+CDH          +  CNGG +  AFEY +K 
Sbjct: 168 FSATGALEGANFLATGKLVSLSEQQLVDCDHVCDSEDPSSCDSGCNGGLMTSAFEYTLKA 227

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            GLE + DYPY   ++   +C ++K K  V   +  V S  ++ +  +L+ +GP+ + +N
Sbjct: 228 GGLEREEDYPYTGTDHS--KCKFDKTKIAVSASNFSVVSLDENQIAANLVTNGPLAIGIN 285

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y G       + C+   LDH V +VGYG              WI++NSWG+   
Sbjct: 286 AMFMQTYIGG--VSCPYICSKRLLDHGVLLVGYGSAGFAPIRFKEKPYWIIKNSWGESWG 343

Query: 316 DHGYFQIERGANACGIES 333
           + GY++I RG N CG++S
Sbjct: 344 EKGYYKICRGRNICGMDS 361


>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
          Length = 325

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 94/294 (31%), Positives = 141/294 (47%), Gaps = 29/294 (9%)

Query: 50  WNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQRTGLRLT 100
           + + Y +D++ K RF  FK +     +          YG +  SD +P+E   +   R  
Sbjct: 39  YGKVYANDDDQK-RFAIFKDNLVRAQKLQLKDRGTARYGVTQFSDLTPEEFAAKYLSRPM 97

Query: 101 GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
             + ER+     +             P+ +DWR+     + PVE+QG CGSCWAF+    
Sbjct: 98  NDQVERVRPTGLKAA-----------PERMDWRE--WGAVGPVENQGSCGSCWAFSVAGN 144

Query: 161 LESQVALLKKTLYPLSKSQLVECDHGNLNCNGG-NIDVAFEYVKQYGLESQADYPYRNKE 219
           +E Q  L    L  LSK QLV+CD  +  C GG   +   E ++  GLE Q+DYPY   +
Sbjct: 145 VEGQWFLKTGQLVSLSKQQLVDCDVMDYGCGGGWPTNAYMEIMRMGGLELQSDYPYVGVQ 204

Query: 220 NITFRCTYEKEKAKVFVQDTWVTSGVD--HMMHLLQSGPIGVYLNHRLIESYDGNPIRRN 277
               +C   KEK    + D  V    +  H  +L + GP+   LN   ++ Y       +
Sbjct: 205 Q---QCYLNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSSALNAGYLQFYQSGISHPS 261

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGI 331
              C+P  L+HAV  VGY  +NG+  WI++NSWG    ++GYF++ RG   CGI
Sbjct: 262 YEECSPASLNHAVLTVGYDTENGVPYWIIKNSWGTGWGENGYFRLYRGDGTCGI 315


>gi|268570635|ref|XP_002640795.1| Hypothetical protein CBG15672 [Caenorhabditis briggsae]
          Length = 396

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 101/314 (32%), Positives = 165/314 (52%), Gaps = 24/314 (7%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQR 94
           DS+++   FK +  K+ R + + +++  RF+ F ++ KE +     +  +     E   R
Sbjct: 90  DSLRK---FKEFNQKFQRIHENSDDLNFRFQLFSKNLKEIEILNSQNSGAKFEINEFTDR 146

Query: 95  TGLRLTGKEKERLEADRERVK-----KFLNER-KKGPLPKS--LDWRQSKVKVLNPVESQ 146
           +      +E  R   D++ VK     KF N     G L +S   DWR    KV++ V++Q
Sbjct: 147 SE-----EELRRYSMDQKFVKNLSNLKFANSTILAGSLNRSGYRDWRNDG-KVMS-VKNQ 199

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G+CGSCWAF+  + +ESQ A+ K TL+ LS+ +LV+CD  +  CNGG +D A  ++   G
Sbjct: 200 GQCGSCWAFSIVSAVESQFAIKKGTLWSLSEQELVDCDRDSYGCNGGFMDKALSWILGNG 259

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQS-GPIGVYLN-H 263
           LE++ DYPY    +   +C     K +V+V + + + +  D +   + S GP+   +   
Sbjct: 260 LETEDDYPYDAVRHD--QCYLNGRKTRVWVDEGYRLANNEDFIADWVDSVGPVSFAMKLP 317

Query: 264 RLIESYDGNPIRRNDWACN-PHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
           +   SY       ++  CN P+   HA+ ++GYG + G L WIV+NSWG    D GY ++
Sbjct: 318 KSFYSYSKGIYHPSERECNDPNNGYHAMTLIGYGNEGGQLYWIVKNSWGSGWGDQGYMRL 377

Query: 323 ERGANACGIESYAY 336
            RG N CG   Y +
Sbjct: 378 ARGQNVCGAGEYVF 391


>gi|47779249|gb|AAT38521.1| cysteine protease [Bombyx mori NPV]
          Length = 323

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 161/318 (50%), Gaps = 28/318 (8%)

Query: 28  VWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET-------DEYYGT 80
           V +  AYD +K  + F+ ++ ++N+ Y+ + E   RF+ F+ +  E           Y  
Sbjct: 13  VVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQNDSAKYEI 72

Query: 81  SGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKV 139
           +  SD S  E + + TGL L  + +          K  L ++  G  P   DWR  ++  
Sbjct: 73  NKFSDLSKDETIAKYTGLSLPTQTQNF-------CKVILLDQPPGKGPLEFDWR--RLNK 123

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF 199
           +  V++QG CG+CWAFAT   LESQ A+    L  LS+ Q+++CD  +  CNGG +  AF
Sbjct: 124 VTSVKNQGMCGACWAFATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAF 183

Query: 200 EY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SG 255
           E   +  G++ ++DYPY    N    C     K  V V+D   ++    + +  LL+  G
Sbjct: 184 EANCRMGGVQLESDYPYEADNN---NCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVG 240

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           PI + ++   I +Y    I+     C    L+HAV +VGYG +N I  W  +N+WG    
Sbjct: 241 PIPMAIDAADIVNYKQGIIKY----CFNSGLNHAVLLVGYGVENNIPYWTFKNTWGTDWG 296

Query: 316 DHGYFQIERGANACGIES 333
           + G+F++++  NACG+ +
Sbjct: 297 EDGFFRVQQNINACGMRN 314


>gi|341888721|gb|EGT44656.1| hypothetical protein CAEBREN_22029 [Caenorhabditis brenneri]
          Length = 396

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 98/308 (31%), Positives = 160/308 (51%), Gaps = 20/308 (6%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           FK +  K+ R +    E K RFE F+++ ++ +E         YG +  SD++  E L+ 
Sbjct: 88  FKDFNAKFQREHKTLEEYKMRFEIFQKNLRDIEELNLKNPSVQYGINKFSDKTESE-LKN 146

Query: 95  TGLRLTGKEKERLEADRERVKKFLNER---KKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
             +     +     +  + +  + N R   K    P  +DWR    KV++ V+ QG+CGS
Sbjct: 147 LLMDKKFLDSSLSNSTLKTLSSYRNPRNIIKNVQRPDYIDWRNDG-KVMS-VKDQGQCGS 204

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQA 211
           CWAFAT A +ESQ A+ K TL+ LS+ +LV+CD  +  C GG +  A  ++   GLE++ 
Sbjct: 205 CWAFATVAAVESQYAIRKGTLWSLSEQELVDCDGASYGCGGGFLTSALGFILGNGLETED 264

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQS-GPIGVYLN-HRLIES 268
           DYPY    +   +C    +K +V++ + + +T   D +   + + GP+   ++  +    
Sbjct: 265 DYPYSATRHD--QCWINGDKTRVWIDEGYQLTMSEDDVAEWVANVGPVSFAMSVPKSFPY 322

Query: 269 YDGNPIRRNDWACNPHKLD-HAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
           Y       ++  C    L  HA+AI+GYG++ G   WIV+NSWG    D GY ++ RG N
Sbjct: 323 YHDGIYSPSEHECKDESLGYHAMAIIGYGQEGGQNYWIVKNSWGGSWGDQGYMRLARGVN 382

Query: 328 ACGIESYA 335
           ACG+  Y 
Sbjct: 383 ACGMNDYV 390


>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
          Length = 365

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 103/318 (32%), Positives = 156/318 (49%), Gaps = 43/318 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEILQR 94
           F T+  K+ +TY    E   RF  FK + +        +    +G +  SD +P E  ++
Sbjct: 51  FSTFKAKFGKTYATKEEHDHRFGVFKSNMRRARLHAQLDPSAVHGVTKFSDLTPAEFHRK 110

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G +  RL A  ++            LPK  DWR  K  V N V+ QG CGSCW+
Sbjct: 111 ----FLGLKPLRLPAHAQKAPILPTNN----LPKDFDWRD-KGAVTN-VKDQGSCGSCWS 160

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +  
Sbjct: 161 FSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLMNNAFEYLIGS 220

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            G++ + DYPY  ++     C ++K K    V +  V S  +  +  +L+++GP+ V +N
Sbjct: 221 GGVQREKDYPYTGRDGT---CKFDKSKIAASVSNYSVISLDEEQIAANLVKNGPLAVAIN 277

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y G       + C  H LDH V +VGYGE             WI++NSWG+   
Sbjct: 278 AVYMQTYVGG--VSCPYICGKH-LDHGVLLVGYGEGAYAPIRFKEKPYWIIKNSWGENWG 334

Query: 316 DHGYFQIERGANACGIES 333
           ++GY++I RG N CG++S
Sbjct: 335 ENGYYKICRGRNVCGVDS 352


>gi|29567137|ref|NP_818699.1| cathepsin [Adoxophyes honmai NPV]
 gi|37076951|sp|Q80LP4.1|CATV_NPVAH RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|29467913|dbj|BAC67303.1| cathepsin [Adoxophyes honmai NPV]
          Length = 337

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 94/308 (30%), Positives = 154/308 (50%), Gaps = 28/308 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQEILQR 94
           F+T+I+ +N+ Y D      RF+ FKQ+ ++ +E         Y  +  SD S  E+L +
Sbjct: 32  FETFIINYNKQYPDTKTKNYRFKIFKQNLEDINEKNKLNDSAIYNINKFSDLSKNELLTK 91

Query: 95  -TGLRLTGKEKERLEADRERVKKFLNERK----KGPLPKSLDWRQSKVKVLNPVESQGRC 149
            TGL  T K+   +          ++          LP++ DWR +    +  V+ QG C
Sbjct: 92  YTGL--TSKKPSNMVRSTSNFCNVIHLDAPPDVHDELPQNFDWRVNNK--MTSVKDQGAC 147

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLE 208
           GSCWA A    LE+  A+    L  LS+ QL++CD  N+ C+GG +  AFE +    GL 
Sbjct: 148 GSCWAHAAVGTLETLYAIKHNYLINLSEQQLIDCDSANMACDGGLMHTAFEQLMNAGGLM 207

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHM-MHLLQSGPIGVYLNHRL 265
            + DYPY+  + +   C  + +K  + V     ++    +++   L+  GPI + ++   
Sbjct: 208 EEIDYPYQGTKGV---CKIDNKKFALSVSSCKRYIFQNEENLKKELITMGPIAMAIDAAS 264

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
           I +Y    I      C    L+HAV +VGYG + G+  W ++NSWG    + GYF+++R 
Sbjct: 265 ISTYSKGIIH----FCENLGLNHAVLLVGYGTEGGVSYWTLKNSWGSDWGEDGYFRVKRN 320

Query: 326 ANACGIES 333
            NACG+ +
Sbjct: 321 INACGLNN 328


>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
          Length = 586

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 100/318 (31%), Positives = 150/318 (47%), Gaps = 30/318 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSPQEILQ 93
           F+ +I+  N+ YT   E   RF  F          Q+ ++    YG +  +D +  E  +
Sbjct: 280 FENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFADLTKNEFKK 339

Query: 94  R-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           +  GL  +   K+ L              +   +P   DWR   V  + PV++QG CGSC
Sbjct: 340 KYLGLDSSMTSKKTLP--------MAVIPQSASIPNEFDWRNHNV--VTPVKNQGACGSC 389

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQA 211
           WAF+  A +E Q AL  K L  LS+ +L++CD+ +  C GG +  AFE V+   GLE+++
Sbjct: 390 WAFSAIANIEGQYALKSKELLSLSEQELIDCDNLDNGCGGGLMTQAFEAVENLGGLETES 449

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQSGPIGVYLNHRLIESY 269
           DYPY    +    C  +K   KV +       T   D    L++ GP+ V +N   ++ Y
Sbjct: 450 DYPYEGHADRK-GCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGVNANAMQFY 508

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEK-----NGILT-WIVRNSWGDIGPDHGYFQIE 323
            G         C+P  LDH VAIVGYG       N  L  W ++NSWGD     GY+ + 
Sbjct: 509 MGGVSHPIHALCSPKSLDHGVAIVGYGVHKYPYLNATLPFWTIKNSWGDKWGMQGYYLLY 568

Query: 324 RGANACGIESYAYLASVK 341
           RG  +CG+      A ++
Sbjct: 569 RGDGSCGVNQMVSSAIIE 586


>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
          Length = 325

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 98/305 (32%), Positives = 153/305 (50%), Gaps = 31/305 (10%)

Query: 50  WNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQR-TGLRL 99
           + + Y ++++ K RF  FK +     +Y         YG +  SD +P+E   +  GLR+
Sbjct: 39  YGKAYANEDDQK-RFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTPEEFEAKYLGLRI 97

Query: 100 TGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
                   +   +RV+  LN+ +  P   S+DWR+     + P+E+QG CGSCWAF+   
Sbjct: 98  --------DEQVDRVQ--LNDLQTAP--ASVDWREKGA--VGPIENQGSCGSCWAFSVVG 143

Query: 160 ILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNK 218
            +E Q  L    L  LSK QLV+CD  +  C GG     ++ +K+ G LE Q+DYPY   
Sbjct: 144 NIEGQWFLKTGYLVSLSKQQLVDCDTVDNGCYGGYPPYTYKEIKRMGGLELQSDYPYTGW 203

Query: 219 ENITFRCTYEKEKAKVFVQDTWVTSGVD--HMMHLLQSGPIGVYLNHRLIESYDGNPIRR 276
            +    C  ++ K    + D+ V    +      L + GP+   LN + ++ Y    +  
Sbjct: 204 GH---GCRLDRSKLFAKIDDSIVLEADEEKQAAWLAEHGPMSTCLNAKYLQFYQSGILHP 260

Query: 277 NDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAY 336
           +   C+P  L+HAV  VGY  K+GI  WI++NSWG    + GYF+I RG   CGI+    
Sbjct: 261 SKAMCSPEGLNHAVLTVGYDTKHGIPYWIIKNSWGTSWGEDGYFRIYRGDGTCGIDRLTT 320

Query: 337 LASVK 341
            A ++
Sbjct: 321 SAIIR 325


>gi|403376023|gb|EJY87990.1| Cathepsin L [Oxytricha trifallax]
          Length = 343

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 160/319 (50%), Gaps = 38/319 (11%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSS---------DRSPQEIL 92
           AF  Y+ K+ ++Y    E + R+E ++++  +  +Y G +G++         D +P+E  
Sbjct: 42  AFANYLAKYGKSYGTKEEFQFRYEQYQKNMAKVAQYNGQNGNTFRLGINKFTDYTPEEYK 101

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
              G +   K    LEA       +L+E      P S+DWR+     + PV+ QG+CGSC
Sbjct: 102 VLLGYKPQSKPM-TLEAS------YLSEENT---PASIDWREKGA--VTPVKDQGQCGSC 149

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDH-GNLNCNGGNIDVAFEYVKQYGLESQA 211
           WAF+ T  LE    +    L  +S+ QLV+C H GN  CNGG + +AF+Y  +  +E ++
Sbjct: 150 WAFSATGALEGHYQISNNKLISISEQQLVDCSHDGNNGCNGGEMYLAFDYASKNKMELES 209

Query: 212 DYPYRNKENITFRCTYEKEKAKV---FVQDTWVTSGVDHMMHLLQSGPIGVYL--NHRLI 266
           DY Y  K+    +C+YE  K K+     Q     S    +   L +GP+ V +  ++ + 
Sbjct: 210 DYVYHAKDE---KCSYEASKGKMEADHFQRVPKNSPA-QLKAALANGPVSVAIEADNEVF 265

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYG--EKNGILTWIVRNSWGDIGPDHGYFQIER 324
           ++YDG  +   +   N   LDH V  VG+G  E +    +IV+NSWG    DHG+ +I  
Sbjct: 266 QAYDGGILNSKECGTN---LDHGVLAVGFGHDEASKQDYFIVKNSWGQYWGDHGFIKIAA 322

Query: 325 --GANACGIESYAYLASVK 341
             G   CGI+  A    VK
Sbjct: 323 VDGEGICGIQMDAVYPIVK 341


>gi|340503366|gb|EGR29962.1| hypothetical protein IMG5_145110 [Ichthyophthirius multifiliis]
          Length = 1095

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 96/273 (35%), Positives = 156/273 (57%), Gaps = 20/273 (7%)

Query: 78   YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNE--RKKGPLPKSLDWRQS 135
            +G +  SD SPQ+  Q+  L+L  K+  +++ + +++   + +    +  +P+  DWR  
Sbjct: 834  FGHTKFSDLSPQQFAQKH-LKLNQKKLLQVKKETKKLTTPIQQDITVEENVPEQFDWRDR 892

Query: 136  KVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI 195
             V V  P + Q  CGSCW F+TT ++ESQ A+  + L P S+ QLV+CD  N  C+GG +
Sbjct: 893  NV-VTEP-KYQNTCGSCWTFSTTGVIESQYAIKHQKLVPFSEQQLVDCDDINDGCHGGLM 950

Query: 196  DVAFEYVKQY-GLESQADY-PYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM---H 250
              A++Y++Q  GLE   DY  Y+NK+    +C ++  K +  +++ W     D  +    
Sbjct: 951  TDAYKYLQQSGGLEFAEDYGDYKNKKE---KCKFDLNKVQAKIKE-WQQIDEDEEIIKKQ 1006

Query: 251  LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNS 309
            L Q+GPI   +N RL++ Y        +  C+   ++HA+ IVGYG EK+G   WI++N 
Sbjct: 1007 LYQNGPIAAGVNARLLQFYKSGIFDPKE--CD-SDINHAILIVGYGVEKDGQKYWIIKNQ 1063

Query: 310  WG-DIGPDHGYFQIERGANACGIESYAYLASVK 341
            WG D G D GYF++ RG   CGI +YA +A ++
Sbjct: 1064 WGKDWGMD-GYFKLARGKKQCGIHTYASIAFIE 1095


>gi|209978824|ref|YP_002300567.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
 gi|192758806|gb|ACF05341.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
          Length = 337

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 94/306 (30%), Positives = 153/306 (50%), Gaps = 28/306 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQEILQR 94
           F+T+IV +N+ Y D      RF+ F Q+ +  +E         Y  +  SD S  E+L +
Sbjct: 32  FETFIVNYNKQYADTKTKNYRFKIFVQNLEYINEKNKLNDSAIYNINKFSDLSKNELLTK 91

Query: 95  -TGLRLTGKEKERLEADRERVKKFLN----ERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
            TGL  T ++   +          ++       +  LP++ DWR +    +  V+ QG C
Sbjct: 92  YTGL--TSRKPSNMVKSTSNFCNVIHLDAPPDARDELPQNFDWRVNNK--MTSVKDQGAC 147

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLE 208
           GSCWA A    LE+  A+    L  LS+ QL++CD  N+ C+GG +  AFE +    GL 
Sbjct: 148 GSCWAHAAVGTLETLYAIKHNYLINLSEQQLIDCDSANMACDGGLMHTAFEQLMNAGGLM 207

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHM-MHLLQSGPIGVYLNHRL 265
            + DYPY+  + I   C  + +K  + V     ++    +++   L+ +GPI + ++   
Sbjct: 208 EEIDYPYQGTKGI---CKIDNKKFALSVSSCKRYIFQNEENLKKELITTGPIAMAIDAAS 264

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
           I +Y    I      C    L+HAV +VGYG + G+  W ++NSWG    + GYF+++R 
Sbjct: 265 ISTYSKGIIHF----CENLGLNHAVLLVGYGTEGGVSYWTLKNSWGSDWGEDGYFRVKRN 320

Query: 326 ANACGI 331
            NACG+
Sbjct: 321 INACGL 326


>gi|341876229|gb|EGT32164.1| hypothetical protein CAEBREN_11106 [Caenorhabditis brenneri]
          Length = 389

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 102/304 (33%), Positives = 154/304 (50%), Gaps = 17/304 (5%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTG- 96
           K    F  +I+K+NR Y    E+  R+  F ++ KE +         D    E    T  
Sbjct: 83  KYFRMFNDFILKYNRRYEQPGELSRRYLIFVKNVKEFEAEEKKHLGVDLDVNEYTDWTDD 142

Query: 97  -LRLTGKEKERLEADRERVK---KFLNERKKGPLPKSLDWR-QSKVKVLNPVESQGRCGS 151
            L+    EK+ +  D E V+    +L    K   P S+DWR Q K   L P+++QG+CGS
Sbjct: 143 ELKRMVIEKKNVITDLEAVRFEGSYLESGVK--RPASIDWRDQGK---LTPIKNQGQCGS 197

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQA 211
           CWAFAT A +E+Q A+ K  L  LS+ ++V+CD  N  C+GG    A  +VK+ GLES+ 
Sbjct: 198 CWAFATVAAVEAQHAIKKGQLVSLSEQEMVDCDGRNNGCSGGYRPYAMRFVKENGLESEK 257

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQD--TWVTSGVDHMMHLLQSGPIGVYLN-HRLIES 268
           +YPY   ++   +C  ++   +VF+ D     T+  D    +   GP+   +N  + + S
Sbjct: 258 EYPYSALKHD--QCFLKQNDTRVFIDDFRMLSTNEEDIANWVGTKGPVTFGMNVVKAMYS 315

Query: 269 YDGNPIRRNDWACNPHKLD-HAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
           Y       +   C    +  HA+ IVGYG +     WIV+NSWG      GYF++ RG N
Sbjct: 316 YRSGIFNPSSEDCAEKSMGAHALTIVGYGGEGSSAFWIVKNSWGTSWGSSGYFRLARGVN 375

Query: 328 ACGI 331
           +CG+
Sbjct: 376 SCGL 379


>gi|301775254|ref|XP_002923050.1| PREDICTED: cathepsin H-like [Ailuropoda melanoleuca]
          Length = 307

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 84/223 (37%), Positives = 127/223 (56%), Gaps = 13/223 (5%)

Query: 121 RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQL 180
           R  GP P  +DWR+ K K ++PV++QG CGSCW F+TT  LES +A+    L  L++ QL
Sbjct: 83  RGTGPYPPFVDWRK-KGKFVSPVKNQGGCGSCWTFSTTGALESAIAIKTGKLLSLAEQQL 141

Query: 181 VEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           V+C  D  N  C GG    AFEY++   G+  +  YPY+ ++     C ++  KA  FV+
Sbjct: 142 VDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYPYKGQDG---DCKFQPSKAIAFVK 198

Query: 238 DTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDW---ACN--PHKLDHAVAI 292
           D      ++    ++++  +   ++     + D    R+  +   +C+  P K++HAV  
Sbjct: 199 DV-ANITINDEQAMVEAVALFNPVSFAFEVTGDFMMYRKGVYSSTSCHKTPDKVNHAVLA 257

Query: 293 VGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           VGYGE+NG+  WIV+NSWG     HGYF IERG N CG+ + A
Sbjct: 258 VGYGEQNGVPYWIVKNSWGPQWGMHGYFLIERGKNMCGLAACA 300


>gi|281350252|gb|EFB25836.1| hypothetical protein PANDA_012122 [Ailuropoda melanoleuca]
          Length = 294

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 84/223 (37%), Positives = 127/223 (56%), Gaps = 13/223 (5%)

Query: 121 RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQL 180
           R  GP P  +DWR+ K K ++PV++QG CGSCW F+TT  LES +A+    L  L++ QL
Sbjct: 70  RGTGPYPPFVDWRK-KGKFVSPVKNQGGCGSCWTFSTTGALESAIAIKTGKLLSLAEQQL 128

Query: 181 VEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           V+C  D  N  C GG    AFEY++   G+  +  YPY+ ++     C ++  KA  FV+
Sbjct: 129 VDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYPYKGQDG---DCKFQPSKAIAFVK 185

Query: 238 DTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDW---ACN--PHKLDHAVAI 292
           D      ++    ++++  +   ++     + D    R+  +   +C+  P K++HAV  
Sbjct: 186 DV-ANITINDEQAMVEAVALFNPVSFAFEVTGDFMMYRKGVYSSTSCHKTPDKVNHAVLA 244

Query: 293 VGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           VGYGE+NG+  WIV+NSWG     HGYF IERG N CG+ + A
Sbjct: 245 VGYGEQNGVPYWIVKNSWGPQWGMHGYFLIERGKNMCGLAACA 287


>gi|290997496|ref|XP_002681317.1| cysteine protease [Naegleria gruberi]
 gi|284094941|gb|EFC48573.1| cysteine protease [Naegleria gruberi]
          Length = 350

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 98/289 (33%), Positives = 142/289 (49%), Gaps = 34/289 (11%)

Query: 66  YFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKER--LEADRERVKKFLNERKK 123
           Y+   GK   E +G S   D +P+E  +   ++    E+ R  L A +E V   +  ++ 
Sbjct: 68  YYNHVGKR--ETFGVSKFMDLTPEEFKRMFLMKTYTPEEARKILAAPKEAV---VTAQQV 122

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
              P S DWRQ     + PV++QG CGSCW F+TT  +E    +    L  LS+ QLV+C
Sbjct: 123 KDTPTSWDWRQKGA--VTPVKNQGACGSCWTFSTTGNVEGIHQIKTGKLVSLSEQQLVDC 180

Query: 184 DHG----------NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKA 232
           DH           +  CNGG +  AF+YV K  GL ++  YPY   E +   C + K   
Sbjct: 181 DHNCVTYQGQQACDAGCNGGLMWSAFQYVIKTGGLVTEDSYPY---EGVDDTCRFNKSNV 237

Query: 233 KVFVQDTWVTSGVDH---MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHA 289
            V + ++W +   D       L  +GPI + +N   +++Y       N W CNP  LDH 
Sbjct: 238 AVTI-NSWTSIPSDEGKMAAWLAANGPISIAINAEWLQTYTSG--ISNPWFCNPQDLDHG 294

Query: 290 VAIVGYGEKNGILT-----WIVRNSWGDIGPDHGYFQIERGANACGIES 333
           V IVG+G  +  L      WI++NSWG    + GYF+I RG   CG+ S
Sbjct: 295 VLIVGFGTGSNWLGEKEDYWIIKNSWGADWGESGYFRIVRGKGKCGLNS 343


>gi|432091081|gb|ELK24293.1| Cathepsin F, partial [Myotis davidii]
          Length = 410

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 102/316 (32%), Positives = 158/316 (50%), Gaps = 25/316 (7%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYF------KQDGKETDE---YYGTSGSSDRS 87
           ++    FK +I  +NRTY  + E + R   F       Q  +  D     YG +  SD +
Sbjct: 107 LRMASLFKYFITTYNRTYETEEEAQWRMSVFINNMIRAQKIQALDRGTAQYGVTKFSDLT 166

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
            +E   RT + L    KE L   + R+ KF+ +    P P   DWR  K   +  V++QG
Sbjct: 167 EEEF--RT-MYLNPLLKEEL-GKKMRLVKFVGD----PAPPEWDWR--KKGAVTKVKNQG 216

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-G 206
            CGSCWAF+ T  +E Q  L +  L  LS+ +LV+CD  +  C GG    A+  +K   G
Sbjct: 217 MCGSCWAFSVTGNVEGQWFLKRGDLLSLSEQELVDCDKVDKACMGGLPSNAYSAIKTLGG 276

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHR 264
           LE++ DY Y         C++  +KAKV++ D+   S  +  +   L ++GPI + +N  
Sbjct: 277 LETEDDYSYSGHLQT---CSFSAQKAKVYINDSVELSHNEQELAAWLAKNGPISIAINAF 333

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
            ++ Y     R     C+   +DHAV +VGYG ++ +  W ++NSWG    + GY+ + R
Sbjct: 334 GMQFYRHGISRPLRPLCSRWFIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEEGYYYLHR 393

Query: 325 GANACGIESYAYLASV 340
           G+ ACG+   A  A V
Sbjct: 394 GSGACGVNVMASSAVV 409


>gi|308476152|ref|XP_003100293.1| hypothetical protein CRE_21852 [Caenorhabditis remanei]
 gi|308265817|gb|EFP09770.1| hypothetical protein CRE_21852 [Caenorhabditis remanei]
          Length = 391

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 99/303 (32%), Positives = 156/303 (51%), Gaps = 24/303 (7%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           F  +I+K++R Y    E + R++ F Q+ KE +         D    E         T +
Sbjct: 89  FNDFILKYDRRYPSLEEFQYRYQVFLQNVKEFEAEEAKHFGLDLDVNEFTD-----WTNE 143

Query: 103 EKERLEADRERVKKFLNE--RKKGPL-------PKSLDWR-QSKVKVLNPVESQGRCGSC 152
           E +R+  D + VK   +E  R +G         P S+DWR Q K   L P+++QG+CGSC
Sbjct: 144 ELQRIVYDNKNVKTDGSEEVRFEGSYLESGVKRPASIDWRDQGK---LTPIKNQGQCGSC 200

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQAD 212
           WAFAT A +E+Q A+ K  L  LS+ ++V+CD  N  C+GG    A  +VK+ GLES+ +
Sbjct: 201 WAFATVAAVEAQHAIRKNQLVSLSEQEMVDCDDKNNGCSGGYRPYAMRFVKENGLESEKE 260

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN-HRLIESY 269
           YPY   ++   +C  ++   +VF+ D  + S  +  +   +   GP+   ++  + + SY
Sbjct: 261 YPYSALKHD--QCMLKQNDTRVFIDDFRMLSQNEEEIANWVGTKGPVTFGMSVTKAMYSY 318

Query: 270 DGNPIRRNDWACNPHKL-DHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
                  +   C    +  HA+ IVGYG +     WIV+NSWG      GYF++ RG N+
Sbjct: 319 RSGIFNPSADDCAEKSMGSHALTIVGYGGEGEAAFWIVKNSWGTSWGASGYFRLARGVNS 378

Query: 329 CGI 331
           CG+
Sbjct: 379 CGL 381


>gi|195997891|ref|XP_002108814.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
 gi|190589590|gb|EDV29612.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
          Length = 333

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 95/299 (31%), Positives = 147/299 (49%), Gaps = 20/299 (6%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------YGTSGSSDRSPQEILQRTG 96
           FK++I  +NR YT   E + RF+ FK++ +           YG +  +D + +E  +  G
Sbjct: 36  FKSFITDYNRNYTTKEEHEFRFQTFKKNFRRIASTNANGATYGVNKFADWTDEEFKELLG 95

Query: 97  LRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
            R    +    E     +   L+  K    P SLDWR+ K  ++ PV +QGRCG CWAF+
Sbjct: 96  NRQVPTQ----EIVNSELHHSLSTAK---FPSSLDWREHKRNIVGPVRNQGRCGCCWAFS 148

Query: 157 TTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV--KQYGLESQADYP 214
           T   + S  AL   +   LS  QL+ CD+ +  C GG+  +A  ++   +  LE+++  P
Sbjct: 149 TVETIASAWALAGNSFTELSVQQLLSCDNMDGGCRGGSFYLACNWLTKNRVPLETESANP 208

Query: 215 YRNKENITFR-CTYEKEKAKVFVQDTWVTSGVDHMMHLL-QSGPIGVYLNHRLIESYDGN 272
           Y  K +   +  T      K F    ++      M+  L Q+GP+ + ++      Y G 
Sbjct: 209 YLGKRDKCVKHATNTGIILKKFTTSNFIYQESSSMIAALNQNGPLSIAVDATSWRDYVGG 268

Query: 273 PIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGI 331
            I+ +   C+   L+HAV +VGY     +  WIVRNSWG+   DHGY  I+ G N CGI
Sbjct: 269 IIQHH---CDGKVLNHAVQVVGYKLDAPVPYWIVRNSWGEDFGDHGYIYIKMGKNVCGI 324


>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
          Length = 472

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 93/303 (30%), Positives = 150/303 (49%), Gaps = 20/303 (6%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
           ++F  +I K+ R Y+   E   RF+ + Q+    ++          YG +  SD SP+E 
Sbjct: 168 NSFLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSDMSPEE- 226

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
            Q+T L     ++         +KKF        LP+  DWR   V  + PV++QG CGS
Sbjct: 227 FQKTMLPSLWWDRVVSNGVEYDLKKF--NLTFNNLPEQFDWRTKGV--VTPVKNQGSCGS 282

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG-LESQ 210
           CWAF+ T  +E   A+    L  LS+ +L++CD  +  CNGG    AF  +++ G LE +
Sbjct: 283 CWAFSVTGNIEGLWAIKTGKLISLSEQELIDCDRIDKGCNGGLPINAFREIQRMGGLEPE 342

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIES 268
             YPY+ +      C   +    V + D       + +M   ++Q GP+ V ++ +L+  
Sbjct: 343 DQYPYKARNG---TCHLIRSAIAVTIDDAVEIPRNETVMKAWIVQRGPLSVGIDAKLLAY 399

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y    +  +   C P  +DH V I GYG +NG+  W ++NSWGD   + GYF++  G + 
Sbjct: 400 YKSGILHPSRSRCPPSGIDHGVLITGYGVENGLPYWTIKNSWGDQWGEDGYFRLMLGKDV 459

Query: 329 CGI 331
           CG+
Sbjct: 460 CGV 462


>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 365

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 103/318 (32%), Positives = 155/318 (48%), Gaps = 43/318 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEILQR 94
           F T+  K+ +TY    E   RF  FK + +        +    +G +  SD +P E  ++
Sbjct: 51  FSTFKSKFGKTYATKEEHDHRFGVFKSNMRRARLHAQLDPSAVHGVTKFSDLTPAEFHRK 110

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G +  RL A  ++            LPK  DWR  K  V N V+ QG CGSCW+
Sbjct: 111 ----FLGLKPLRLPAHAQKAPILPTNN----LPKDFDWRD-KGAVTN-VKDQGSCGSCWS 160

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +  
Sbjct: 161 FSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLMNNAFEYLIGS 220

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            G++ + DYPY  ++     C ++K K    V +  V S  +  +  +L+++GP+ V +N
Sbjct: 221 GGVQREKDYPYTGRDGT---CKFDKSKIAASVSNYSVISLDEEQIAANLVKNGPLAVAIN 277

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y G       + C  H LDH V +VGYGE             WI++NSWG+   
Sbjct: 278 AVYMQTYVGG--VSCPYICGKH-LDHGVLLVGYGEGAYAPIRFKEKPYWIIKNSWGENWG 334

Query: 316 DHGYFQIERGANACGIES 333
            +GY++I RG N CG++S
Sbjct: 335 GNGYYKICRGRNVCGVDS 352


>gi|74229746|ref|YP_308950.1| cathepsin [Trichoplusia ni SNPV]
 gi|72259660|gb|AAZ67431.1| cathepsin [Trichoplusia ni SNPV]
          Length = 344

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 99/316 (31%), Positives = 162/316 (51%), Gaps = 34/316 (10%)

Query: 32  LAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK--------QDGKETDEYYGTSGS 83
           L Y+  +    F+T+  K+ + Y DDNE   R++ FK        ++ +     Y  +  
Sbjct: 36  LQYNLERAPQYFETFQTKYKKVYADDNERDYRYKIFKTNLEIINLKNQQNDSAVYNINKF 95

Query: 84  SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNER-KKGP---LPKSLDWRQSKVKV 139
           +D +  E++ +     TG     L      +K F +     GP     ++ DWRQ     
Sbjct: 96  ADLTKNEVIAK----FTG-----LGVKSPNLKNFCDPLIVDGPSKYTQETFDWRQ--FNK 144

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF 199
           +  V+ QG CGSCWAF+T A LESQ A+       LS+ QLV+CD  ++ C GG +  A+
Sbjct: 145 ITSVKDQGFCGSCWAFSTIAGLESQYAIKYNEHIDLSEQQLVDCDTIDMGCAGGLLHTAY 204

Query: 200 EYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLL-QSG 255
           E +    G+E + DYPYR+ +     C  E +K +V V +   ++    D +  +L + G
Sbjct: 205 EEIMSMGGVEYEEDYPYRSVQG---PCRIENDKFQVSVDNCYRYILYSEDKLKDVLHEMG 261

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           PI V ++   +  Y G  I     +C  + L+HAV +VGYG +NGI  W+++NSWG    
Sbjct: 262 PIAVAVDAVDLTDYYGGIIT----SCKNYGLNHAVLLVGYGTENGIPFWVLKNSWGTDYG 317

Query: 316 DHGYFQIERGANACGI 331
           ++G+ +++R  N+CG+
Sbjct: 318 ENGFVRVKRNVNSCGM 333


>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 387

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 100/331 (30%), Positives = 162/331 (48%), Gaps = 41/331 (12%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSG 82
           D  + ++     F  +  ++ ++Y  + E   RF+ FK + +  + +        +G + 
Sbjct: 47  DFNHHALGAEHHFSLFKRRFGKSYATEEEHDRRFKIFKANMRRAERHQSFDPSAIHGVTQ 106

Query: 83  SSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
            SD +P E  ++  L L G  + RL  D         E     LP   DWRQ     +  
Sbjct: 107 FSDLTPFE-FRKAFLGLRG-HRLRLPVDTNAAPILPTEN----LPIDFDWRQHGG--VTR 158

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGG 193
           V++QG CGSCW+F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG
Sbjct: 159 VKNQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEEDACDSGCNGG 218

Query: 194 NIDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MM 249
            ++ AFEY +K  GL  + DYPY   +  T  C ++K K    + +  V + +D      
Sbjct: 219 LMNSAFEYTLKAGGLMKEQDYPYAGIDRNT--CNFDKSKIAASIANFSVVNSIDEDQIAA 276

Query: 250 HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT------ 303
           +L+++GP+ + +N   +++Y G       + C+  +LDH V +VGYG             
Sbjct: 277 NLVKNGPLAIAINAVFMQTYIGGV--SCPFICSK-RLDHGVLLVGYGSAGYAPIRMRDKD 333

Query: 304 -WIVRNSWGDIGPDHGYFQIERGANACGIES 333
            WI++NSWG+   ++GY++I RG N CG++S
Sbjct: 334 YWIIKNSWGESWGENGYYKICRGRNICGVDS 364


>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
          Length = 325

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 99/295 (33%), Positives = 151/295 (51%), Gaps = 29/295 (9%)

Query: 50  WNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQRTGLRLT 100
           + + Y ++++ K RF  FK +     +Y         YG +  SD + +E      + L 
Sbjct: 39  YGKAYANEDDQK-RFAIFKDNLVRAQQYQTQEQGTAKYGVTQFSDLTNEEF---AAMYLG 94

Query: 101 GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
            +  ER++    RV+  LN+ +  P   S+DWR+     + PVE QG CGSCWAF+ TA 
Sbjct: 95  SRIDERVD----RVQ--LNDLQTAP--ASVDWREKGA--VGPVEHQGSCGSCWAFSVTAN 144

Query: 161 LESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKE 219
           +E Q  L    L  LSK QLV+CD  +  C+GG     ++ +K+ G LE Q+ YPY   E
Sbjct: 145 VEGQWFLKTGRLVSLSKQQLVDCDRLDHGCSGGYPPYTYKEIKRMGGLELQSAYPYTGWE 204

Query: 220 NITFRCTYEKEKAKVFVQDTWVTSGVD--HMMHLLQSGPIGVYLNHRLIESYDGNPIRRN 277
                C  ++ K    + D+ V    +      L + GP+   LN   ++ Y    +  +
Sbjct: 205 QA---CRLDRSKLFAKIDDSIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQFYRYGILHPS 261

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIE 332
           ++AC+P  L+HAV  VGY  + G+  W VRNSWG    ++GYF+I RG   CGI+
Sbjct: 262 EYACSPEGLNHAVLTVGYDTERGVPYWTVRNSWGTRWGENGYFRIYRGDGTCGID 316


>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
          Length = 437

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 94/312 (30%), Positives = 152/312 (48%), Gaps = 20/312 (6%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
           ++F  +I K+ R Y+   E   RF+ + Q+    ++          YG +  SD SP+E 
Sbjct: 133 NSFLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSDMSPEE- 191

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
            Q+T L     ++         +KKF        LP+  DWR   V  + PV++QG CGS
Sbjct: 192 FQKTMLPSLWWDRVVSNGVEYDLKKF--NLTFNNLPEQFDWRTKGV--VTPVKNQGSCGS 247

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG-LESQ 210
           CWAF+ T  +E   A+    L  LS+ +L++CD  +  CNGG    AF  +++ G LE +
Sbjct: 248 CWAFSVTGNIEGLWAIKTGKLISLSEQELIDCDRIDKGCNGGLPINAFREIQRMGGLEPE 307

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIES 268
             YPY+ +      C   +    V + D       + +M   ++Q GP+ V ++ +L+  
Sbjct: 308 DQYPYKARNG---TCHLIRSAIAVTIDDAVEIPRNETVMKAWIVQRGPLSVGIDAKLLAY 364

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y    +  +   C P  +DH V I GYG +NG+  W ++NSWGD   + GYF++  G + 
Sbjct: 365 YKSGILHPSRSRCPPSGIDHGVLITGYGVENGLPYWTIKNSWGDQWGEDGYFRLMLGKDV 424

Query: 329 CGIESYAYLASV 340
           CG+      A +
Sbjct: 425 CGVSDLVSSAII 436


>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
 gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
          Length = 463

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 101/330 (30%), Positives = 159/330 (48%), Gaps = 41/330 (12%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGK------ETDE---YYGTS 81
           +L  + +K +  FK ++  +N+ Y+D  E   R + F Q+ K      E D+    YG +
Sbjct: 154 ELEDEMLKTLTLFKDFVTTYNKKYSDQEEAARRLQIFSQNLKKAQMIQEMDQGTAEYGVT 213

Query: 82  GSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPK-----SLDWRQSK 136
             SD             LT  E   L  +     K L + KK  +P        DWR   
Sbjct: 214 KYSD-------------LTEDEFRSLYLNPLLSSKPLYQMKKAIVPNMSAPDQWDWRDHG 260

Query: 137 VKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNID 196
              +  V++QG CGSCWAF+    +E Q  L K +L  LS+ +LV+CD  +  C GG   
Sbjct: 261 A--VTEVKNQGMCGSCWAFSVIGNIEGQWFLKKGSLVSLSEQELVDCDGVDHACAGGLPS 318

Query: 197 VAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQ 253
            A+E +++ G +E++ +Y Y   +N    C++   K   ++  +      ++ +   L Q
Sbjct: 319 NAYEAIEKLGGIETEQEYSYEGHKNT---CSFSTSKVSAYINSSVEIPKDENEIAAWLAQ 375

Query: 254 SGPIGVYLNHRLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           +GPI + LN   ++ Y     +P R     CNP  +DHAV +VGYGE+NG   W ++NSW
Sbjct: 376 NGPISIALNAFAMQFYRKGISHPFRI---LCNPWMIDHAVLLVGYGERNGTPFWAIKNSW 432

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASV 340
           G    + GY+ + RG  ACG+ +    A V
Sbjct: 433 GTDWGEQGYYYLYRGTGACGMNTMCSSAVV 462


>gi|332252750|ref|XP_003275518.1| PREDICTED: pro-cathepsin H [Nomascus leucogenys]
          Length = 335

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 104/321 (32%), Positives = 165/321 (51%), Gaps = 32/321 (9%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSG 82
           +L+ +S+++   FK+++ K ++TY+ + E   R + F  + ++ + +           + 
Sbjct: 24  ELSVNSLEKFH-FKSWMSKHHKTYSTE-EYHHRLQMFASNWRKINAHNNGNHTFKMALNQ 81

Query: 83  SSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
            SD S  EI  +        E +   A +         R  GP P S+DWR+ K   ++P
Sbjct: 82  FSDMSFAEIKHK----YLWSEPQNCSATKSNYL-----RGTGPYPPSMDWRK-KGNFVSP 131

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFE 200
           V++QG CGSCW F+TT  LES +A+    +  L++ QLV+C  D  N  C GG    AFE
Sbjct: 132 VKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFE 191

Query: 201 YV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGV 259
           Y+    G+  +  YPY+ K+     C +   KA  FV+D    +  D    ++++  +  
Sbjct: 192 YILYNKGIMGEDTYPYQGKDGY---CKFRPGKAIGFVKDVANITIYDEEA-MVEAVALYN 247

Query: 260 YLNHRLIESYDGNPIRRNDW---ACN--PHKLDHAVAIVGYGEKNGILTWIVRNSWGDIG 314
            ++     + D    RR  +   +C+  P K++HAV  VGYGEKNGI  WIV+NSWG   
Sbjct: 248 PVSFAFEVTQDFMMYRRGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQW 307

Query: 315 PDHGYFQIERGANACGIESYA 335
             +GYF IERG N CG+ + A
Sbjct: 308 GMNGYFLIERGKNMCGLAACA 328


>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
          Length = 586

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 150/318 (47%), Gaps = 30/318 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSPQEILQ 93
           F+ +I+  N+ YT   E   RF  F          Q+ ++    YG +  +D +  E  +
Sbjct: 280 FENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFADLTKNEFKK 339

Query: 94  R-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           +  GL  +   K+ L              +   +P   DWR   V  + PV++QG CGSC
Sbjct: 340 KYLGLDSSMTSKKTLP--------MAVIPQSASIPNEFDWRNHNV--VTPVKNQGACGSC 389

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQA 211
           WAF+  A +E Q AL  K L  LS+ +L++CD+ +  C GG +  AFE V+   GLE+++
Sbjct: 390 WAFSAIANIEGQYALKSKELLSLSEQELIDCDNLDNGCGGGLMTQAFEAVENLGGLETES 449

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQSGPIGVYLNHRLIESY 269
           DYPY    +    C  +K   KV +       T   D    L++ GP+ V +N   ++ Y
Sbjct: 450 DYPYEGHADRK-GCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGVNANAMQFY 508

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT------WIVRNSWGDIGPDHGYFQIE 323
            G         C+P  LDH VAIVGYG      T      W+++NSWG    + GY+ + 
Sbjct: 509 MGGVSHPIHALCSPKSLDHGVAIVGYGVHRTKYTHKNLPYWLIKNSWGPGWGEKGYYLLY 568

Query: 324 RGANACGIESYAYLASVK 341
           RG  +CG+      A ++
Sbjct: 569 RGDGSCGVNQMVSSAIIE 586


>gi|74834619|sp|O97397.1|CATLL_PHACE RecName: Full=Cathepsin L-like proteinase; Flags: Precursor
 gi|4210800|emb|CAA76927.1| thiol protease [Phaedon cochleariae]
          Length = 324

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 116/338 (34%), Positives = 167/338 (49%), Gaps = 44/338 (13%)

Query: 24  SAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYF------------KQDG 71
           +A+ V  + A D     D  KT+     RTY    E K RF  F            K + 
Sbjct: 8   AALIVVINAASDQELWADFKKTHA----RTYKSLREEKLRFNIFQDTLRQIAEHNVKYEN 63

Query: 72  KETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKF-LNERKKGPLPKSL 130
            E+  Y   +  SD + +E   R  L        + EA R  ++   + +   G  P+S+
Sbjct: 64  GESTYYLAINKFSDITDEEF--RDMLM-------KNEASRPNLEGLEVADLTVGAAPESI 114

Query: 131 DWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNL 188
           DWR   V +  PV +QG CGSCWA +T A +ESQ A+   +  PLS  QLV+C   +GN 
Sbjct: 115 DWRSKGVVL--PVRNQGECGSCWALSTAAAIESQSAIKSGSKVPLSPQQLVDCSTSYGNH 172

Query: 189 NCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGV 245
            CNGG     FEYVK  GLES ADYPY  KE+   +C    +K++  V+ T    VT+  
Sbjct: 173 GCNGGFAVNGFEYVKDNGLESDADYPYSGKED---KCK-ANDKSRSVVELTGYKKVTASE 228

Query: 246 DHMMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTW 304
             +   + + GPI   +  + ++SY G     +D +C    L H V +VGYG +NG   W
Sbjct: 229 TSLKEAVGTIGPISAVVFGKPMKSYGGGIF--DDSSCLGDNLHHGVNVVGYGIENGQKYW 286

Query: 305 IVRNSWGDIGPDHGYFQIERGAN-ACGIE---SYAYLA 338
           I++N+WG    + GY ++ R  + +CG+E   SY  LA
Sbjct: 287 IIKNTWGADWGESGYIRLIRDTDHSCGVEKMASYPILA 324


>gi|358339045|dbj|GAA32724.2| cathepsin F, partial [Clonorchis sinensis]
          Length = 271

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 84/207 (40%), Positives = 111/207 (53%), Gaps = 8/207 (3%)

Query: 128 KSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGN 187
           +  DWR+     + PV  QG+CGSCWAF+    +E Q       L  LS+ QLV+CDH +
Sbjct: 60  EKFDWREHGA--VGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDHLD 117

Query: 188 LNCNGGNIDVAF-EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD 246
             CNGG     + E  K  GLE  +DYPY   + I   C   + K   +V D+ V    +
Sbjct: 118 KGCNGGYPPKTYGEIEKMGGLELASDYPYTGVDGI---CYMNQSKFVAYVNDSTVLPLSE 174

Query: 247 HMM--HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTW 304
            +    L + GP+   LN  L++ Y G  I    + CNPH L+HAV  VGYG + GI  W
Sbjct: 175 KIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPIPFLCNPHGLNHAVLTVGYGTEFGIPYW 234

Query: 305 IVRNSWGDIGPDHGYFQIERGANACGI 331
           IV+NSWG    + GYF+I RGA  CGI
Sbjct: 235 IVKNSWGVGFGEKGYFRIFRGAGTCGI 261


>gi|332375406|gb|AEE62844.1| unknown [Dendroctonus ponderosae]
          Length = 320

 Score =  144 bits (362), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 100/314 (31%), Positives = 156/314 (49%), Gaps = 37/314 (11%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------------YGTSGS 83
           S++Q DAF+ + +K N+TY    E  TR+  F+    E +E+             G +  
Sbjct: 17  SLEQ-DAFQAFKLKQNKTYKTPVEETTRYGIFQAKLLEIEEHNSRFEQGLETYKKGVNKF 75

Query: 84  SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           SD +  E     GL     +  +L      VK  ++      +P S+DWR      +  V
Sbjct: 76  SDWTQDEFNAYLGLH---PKPAKLGKGIPYVKTGVS------VPASVDWRTEGY--VTGV 124

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKT--LYPLSKSQLVECDHGNLN--CNGGNIDVAF 199
           ++QG CGSCWAF+ T  +E   AL K T  L  LS+ QLV+C +G +N  C+GG ++  F
Sbjct: 125 KNQGDCGSCWAFSLTGSVEG--ALFKSTGKLVSLSEQQLVDCTYGTVNFGCDGGYLEETF 182

Query: 200 EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQSGPI 257
            Y+++ GLE++A YPY+ ++     C ++  K    + D   W       +      GPI
Sbjct: 183 PYIQETGLEAEASYPYKARDG---TCKFDASKVVTKINDYVYWYGDEEALLEATATIGPI 239

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            V ++   I+SY           C+   L+H V +VGYG +NG+  W+V+NSW +   + 
Sbjct: 240 SVAMDANYIDSYASGVFSSR--LCSSDDLNHGVLVVGYGSENGVNYWLVKNSWAEDWGES 297

Query: 318 GYFQIERGANACGI 331
           GY ++ RG N CGI
Sbjct: 298 GYLKLLRGQNECGI 311


>gi|268581031|ref|XP_002645498.1| Hypothetical protein CBG22748 [Caenorhabditis briggsae]
          Length = 379

 Score =  144 bits (362), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 97/302 (32%), Positives = 154/302 (50%), Gaps = 23/302 (7%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           F  ++ K+NR Y+   E K R+  F  + +E +E        D    E         + +
Sbjct: 78  FDEFLYKFNRLYSSQEEYKYRYHIFVHNVREFEEEERKHPGLDFDINEFTD-----WSEE 132

Query: 103 EKERLEADRERVKKFLNE-RKKGPL-------PKSLDWR-QSKVKVLNPVESQGRCGSCW 153
           E  ++  D++ VK+  N  R +G +       P S+DWR Q K   L P+++QG+CGSCW
Sbjct: 133 ELRKMIVDKKNVKEEKNAVRFEGSVLSSGIKRPASIDWRDQGK---LTPIKNQGQCGSCW 189

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADY 213
           AFAT A +E+Q A+ K  L  LS+ ++V+CD  N  C+GG    A  +VK+ GLE++  Y
Sbjct: 190 AFATVAAIEAQHAIKKGILVSLSEQEMVDCDGRNNGCSGGYRPYAMRFVKENGLETEKSY 249

Query: 214 PYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN-HRLIESYD 270
           PY   ++   +C   +   KV++ D  + S  +  +   +   GP+   +N  + + SY 
Sbjct: 250 PYSALKHD--QCMLHQNDTKVYIDDYRMLSTSEENIADWVGTKGPVTFGMNVVKAMYSYR 307

Query: 271 GNPIRRNDWACNPHKLD-HAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
                 +   C    +  HA+ IVGYG +     WIV+NSWG      GYF++ RG N+C
Sbjct: 308 SGIFNPSAEDCAEKSMGAHALTIVGYGGEGTSAYWIVKNSWGTSWGSDGYFRLARGVNSC 367

Query: 330 GI 331
           G+
Sbjct: 368 GL 369


>gi|395545396|ref|XP_003774588.1| PREDICTED: cathepsin W [Sarcophilus harrisii]
          Length = 358

 Score =  144 bits (362), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 97/330 (29%), Positives = 162/330 (49%), Gaps = 26/330 (7%)

Query: 24  SAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD----GKETDEYYG 79
           S + V RDL        + FK + +++N++Y D  E + R + F  +     + T+E+ G
Sbjct: 32  SLLPVTRDLR-------ERFKAFQIQYNKSYPDAAEQECRLKIFADNLARAQQLTEEHQG 84

Query: 80  TSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKV 139
            +        ++ +    RL    +      R + +     R +    +S DWR  K +V
Sbjct: 85  LAQFGVTRFSDLTEEEFRRLYQPSQPNYLGLRVKTEGGGYPRLQRLKTRSCDWR--KARV 142

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVA 198
           L PV  Q  C SCWA +    +E+  A+  + L+ LS  +L++C      C GG + D  
Sbjct: 143 LTPVRDQKNCNSCWAISAVGNVEALWAINYQQLFKLSVQELLDCRRCGQGCEGGFVWDAY 202

Query: 199 FEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWV-------TSGVDHMMHL 251
              + Q GL  + DYPYR +  ++  C  +K+K + ++ D  +        S  D   +L
Sbjct: 203 MTILNQSGLAEEQDYPYRPQ--LSKGC--QKKKKRAWIHDFLMLHKEENSPSPPDMAQYL 258

Query: 252 LQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG 311
            + GPI V +N RL++SY    I+  +  C+P  +DH V +VG+G+ +    WI++NSWG
Sbjct: 259 AEKGPITVTINSRLLKSYIRGVIKPGN-NCDPKYVDHVVQLVGFGQIHNFTYWILKNSWG 317

Query: 312 DIGPDHGYFQIERGANACGIESYAYLASVK 341
               + GYF++ RG NACGI  +   A +K
Sbjct: 318 SSWGEKGYFRLHRGRNACGITKFPLTAVLK 347


>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
          Length = 477

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 103/314 (32%), Positives = 151/314 (48%), Gaps = 23/314 (7%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
           ++F  +I +  + Y++  E+  RF  FK++ K   E          YG +  SD +  E 
Sbjct: 172 NSFLDFIDRHEKRYSNKREVLKRFRTFKKNAKVIRELQKNEQGSAVYGFTKFSDMTTMEF 231

Query: 92  LQRTGLRLTGKEK--ERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
            Q T L    ++      EAD E+    ++E     LP S DWR      +  V++QG C
Sbjct: 232 KQ-TMLPYQWEQPVYPMAEADFEKEGVTISE---DDLPDSFDWRDHGA--VTQVKNQGNC 285

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLE 208
           GSCWAF+TT  +E    L KK L  LS+ +LV+CD  +  CNGG    A+ E ++  GLE
Sbjct: 286 GSCWAFSTTGNVEGAWYLAKKKLVSLSEQELVDCDSVDQGCNGGLPSNAYKEIMRMGGLE 345

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQSGPIGVYLNHRLI 266
            +  YPY  K      C   ++   V++  +       V     L+  GPI + LN   +
Sbjct: 346 PEDAYPYDGKGET---CHIVRKDIAVYINGSVELPHDEVKIQKWLVTKGPISIGLNANTL 402

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
           + Y    +      C P  L+H V IVGYG+      WIV+NSWG    + GYF++ RG 
Sbjct: 403 QFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPTWGESGYFRLYRGK 462

Query: 327 NACGIESYAYLASV 340
           N CG++  A  A V
Sbjct: 463 NVCGVQEMATSALV 476


>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
          Length = 322

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 94/297 (31%), Positives = 147/297 (49%), Gaps = 20/297 (6%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           F+T+ V+  ++Y +  E   RF  F+ +  E +++         S ++ + +     T  
Sbjct: 26  FETFKVENGKSYRNQVEEVQRFNIFRANVLEIEQHNALYEQGLVSYKKAINQ----FTDL 81

Query: 103 EKERLEADRE-RVKKFLN-----ERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
            +E  +A     VK  LN     E K   +P S+DWR +    +  V++QG CGSCW+FA
Sbjct: 82  TQEEFKAYLGLHVKPVLNNTIQYELKGLEVPTSVDWRSAGQ--VTGVKNQGSCGSCWSFA 139

Query: 157 TTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQYGLESQADYPY 215
            T   E       K L  LS+ QLV+C    N  CNGG +D  F Y++QYGL++++ YPY
Sbjct: 140 LTGSTEGAYYRKHKQLVSLSEQQLVDCSTSINYGCNGGFLDATFPYIEQYGLQTESSYPY 199

Query: 216 RNKENITFRCTYEKEKAKVFVQDTWVTSGVDH--MMHLLQSGPIGVYLNHRLIESYDGNP 273
              +     C Y+  K    + +     G +   +  +   GP+ + ++   + SY    
Sbjct: 200 TGVDG---SCKYDSSKVVTKISNYVSLHGSESKVLEPVGSIGPVAITMDASYLSSYSSGI 256

Query: 274 IRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACG 330
              N   C    L+HAV +VGYG +NG   WIV+NSWG    + GYF++ RG+N CG
Sbjct: 257 YAANK--CTTTNLNHAVLVVGYGSQNGQNYWIVKNSWGSGWGEQGYFRLLRGSNECG 311


>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
          Length = 478

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 95/307 (30%), Positives = 148/307 (48%), Gaps = 20/307 (6%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
           ++F  +I +  + Y +  E+  RF  FK++ K   E          YG +  SD +  E 
Sbjct: 174 NSFLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEF 233

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
            +     L  + ++ +  D+   +K      +  LP S DWR+     +  V++QG CGS
Sbjct: 234 KETM---LPYQWEQPVPMDQANFEKEGVTISEEDLPDSFDWREHGA--VTQVKNQGSCGS 288

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLESQ 210
           CWAF+TT  +E    L KK L  LS+ +LV+CD  +  CNGG    A+ E ++  GLE +
Sbjct: 289 CWAFSTTGNIEGAWFLAKKKLVSLSEQELVDCDSVDQGCNGGLPSNAYKEIIRMGGLEPE 348

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQSGPIGVYLNHRLIES 268
             YPY  +      C   ++   V++  +       V+    L+  GPI + LN   ++ 
Sbjct: 349 DAYPYDGRGET---CHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTLQF 405

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y    +      C P  L+H V IVGYG+      WIV+NSWG    + GYF++ RG N 
Sbjct: 406 YRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPTWGEAGYFKLYRGKNV 465

Query: 329 CGIESYA 335
           CG++  A
Sbjct: 466 CGVQEMA 472


>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
 gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
          Length = 475

 Score =  143 bits (361), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 101/314 (32%), Positives = 152/314 (48%), Gaps = 23/314 (7%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
           ++F  +I +  + Y++  E+  RF  FK++ K   E          YG +  SD +  E 
Sbjct: 170 NSFLDFIDRHEKRYSNKREVLKRFRTFKKNAKAIRELQKNEQGTAVYGFTKFSDMTTMEF 229

Query: 92  LQRTGLRLTGKEK--ERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
            Q T L    ++      +AD E+    ++E     LP+S DWR      +  V++QG C
Sbjct: 230 KQ-TMLPYQWEQPVYPMDQADFEKEGITISEED---LPESFDWRDKGA--VTQVKNQGNC 283

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLE 208
           GSCWAF+TT  +E    L K  L  LS+ +LV+CD  +  CNGG    A+ E ++  GLE
Sbjct: 284 GSCWAFSTTGNVEGAWFLAKNKLVSLSEQELVDCDGVDQGCNGGLPSNAYKEIIRMGGLE 343

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQSGPIGVYLNHRLI 266
            +  YPY  K      C   ++   V++  +       V+    L+  GPI + LN   +
Sbjct: 344 PEDAYPYDGKGET---CHLVRKDIAVYINGSIELPHDEVEMQKWLVTKGPISIGLNANTL 400

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
           + Y    +      C P  L+H V IVGYG+      WIV+NSWG    + GYF++ RG 
Sbjct: 401 QFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPTWGESGYFKLYRGK 460

Query: 327 NACGIESYAYLASV 340
           N CG++  A  A V
Sbjct: 461 NVCGVQEMATSALV 474


>gi|118197532|ref|YP_874244.1| cathepsin [Ectropis obliqua NPV]
 gi|113472527|gb|ABI35734.1| cathepsin [Ectropis obliqua NPV]
          Length = 299

 Score =  143 bits (361), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 96/300 (32%), Positives = 159/300 (53%), Gaps = 28/300 (9%)

Query: 46  YIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGS--------SDRSPQEI-LQRTG 96
           ++  +N+ Y DD E   R+  F+ + ++ +     +GS        SD S  EI L+ TG
Sbjct: 2   FVANYNKMYDDDLEKTKRYSIFRDNLRDINIKNKLNGSAVYRINKFSDLSTSEIVLKYTG 61

Query: 97  LRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWR-QSKVKVLNPVESQGRCGSCWAF 155
           L +     ERL  +  +         KGPL  + DWR Q+KV     +++QG CG+CWAF
Sbjct: 62  LSV--PPTERLTTNFCKTIVLDQPPGKGPL--NFDWRHQNKVT---SIKNQGVCGACWAF 114

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFE-YVKQYGLESQADYP 214
           AT A +ESQ A+       LS+ Q+++CD+ ++ C+GG +  AFE  ++  G++ + +YP
Sbjct: 115 ATLASIESQYAIKHNVQINLSEQQMIDCDYVDMGCDGGLLHTAFEQMIEMGGVKHEHEYP 174

Query: 215 YRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQS-GPIGVYLNHRLIESYDG 271
           Y   E I   C    +   V +     ++    + +  LL++ GPI + ++   I +Y  
Sbjct: 175 Y---EGINMNCRLNDDNFAVKIIGCYRYIVLQEEKLKDLLRAVGPIPIAIDASGIANYYQ 231

Query: 272 NPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGI 331
             I      C  H L+HAV +VGYG +N I  W ++N+WG+   ++GYF++ +  NACG+
Sbjct: 232 GVINY----CENHGLNHAVLLVGYGVENNIPYWTIKNTWGEDWGENGYFRVRQNINACGM 287


>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
          Length = 471

 Score =  143 bits (361), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 98/310 (31%), Positives = 152/310 (49%), Gaps = 37/310 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           F  + +K+ R Y    E + RF  FKQ+ +  +E          YG +  +D +  E  Q
Sbjct: 166 FAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYGITEFADMTSPEYKQ 225

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGRCG 150
           RTGL             R+  K   N + + P   LPK  DWR+     ++ V++QG CG
Sbjct: 226 RTGL-----------WQRDPQKAASNPKAEIPNIDLPKEFDWREKGA--ISAVKNQGNCG 272

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLES 209
           SCWAF+ T  +E   A+    L   S+ +L++CD  +  CNGG  D A+E +++  GLE 
Sbjct: 273 SCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDCDTSDSACNGGLPDNAYEAIEKIGGLEL 332

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIE 267
           ++DYPY  +++   +C +   K  V V+        +  +   L+ +GPI + +N   ++
Sbjct: 333 ESDYPYHARKD---QCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPISIGINANAMQ 389

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGE------KNGILTWIVRNSWGDIGPDHGYFQ 321
            Y G         C+   LDH V IVGYG       K  +  WIV+NSWG    + GY++
Sbjct: 390 FYRGGVSHPPHILCSRKNLDHGVLIVGYGVSDYPMFKKTLPYWIVKNSWGKKWGEQGYYR 449

Query: 322 IERGANACGI 331
           + RG N CG+
Sbjct: 450 VYRGDNTCGV 459


>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 369

 Score =  143 bits (361), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 105/356 (29%), Positives = 166/356 (46%), Gaps = 63/356 (17%)

Query: 14  EQVTYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD--- 70
           +QVT  V  D ++  +      + KQ   F+++I ++ + Y    E + RF+ FK +   
Sbjct: 30  QQVTDGVRVDGSVEQFAHALLGAEKQ---FESFIKEFGKVYHTVEEYEHRFKVFKSNLLR 86

Query: 71  -----GKETDEYYGTSGSSDRSPQEI------LQRTGLRLTGKEKERLEADRERVKKFLN 119
                  +    +G +  SD + +E       L+R     T    E L            
Sbjct: 87  ALKHQALDPTASHGVTMFSDLTEEEFATQYLGLKRPSALSTAPTAEPL------------ 134

Query: 120 ERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQ 179
               G LP S DWR+     + PV++QG CGSCWAF+TT  +E    L    L  LS+ Q
Sbjct: 135 --PTGDLPPSFDWREKGA--VGPVKNQGSCGSCWAFSTTGAVEGAHFLATGKLLSLSEQQ 190

Query: 180 LVECDH---------GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEK 229
           LV+CDH          +  C GG +  A++YV++  GLE ++DYPY+ ++    +C +  
Sbjct: 191 LVDCDHQCDPEEAQACDAGCGGGLMTNAYKYVEEAGGLELESDYPYKGRDG---KCQFNP 247

Query: 230 EKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRLIESYDGN---PIRRNDWACNPH 284
            K    V + T +    D +  +L++SGP+ + +N   +++Y      PI      CN  
Sbjct: 248 NKVAAKVSNFTNIPIDEDQVAAYLIKSGPLAIGINAEFMQTYVAGVSCPIF-----CNKR 302

Query: 285 KLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGPDHGYFQIERGANACGIES 333
            LDH V +VGY E             WI++NSWG +  D GY++I RG   CG+ +
Sbjct: 303 NLDHGVLLVGYAEHGFAPARLAYKPYWIIKNSWGPMWGDKGYYKICRGHGECGLNT 358


>gi|344295816|ref|XP_003419606.1| PREDICTED: cathepsin F [Loxodonta africana]
          Length = 473

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 106/323 (32%), Positives = 158/323 (48%), Gaps = 37/323 (11%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDR 86
           S K    FK ++  +NRTY    E K R   F          Q   +    YG +  SD 
Sbjct: 169 SGKMASIFKNFVTTYNRTYETKEETKWRMSVFANNMIRAQKLQALDQGTAQYGITKFSDL 228

Query: 87  SPQE---ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           + +E   I     LR    +K RL            +  KGP+P   DWR      +  V
Sbjct: 229 TEEEFRTIYLNPLLREDPGQKMRL-----------GKAPKGPVPPDWDWRTKGA--VTKV 275

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           + QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K
Sbjct: 276 KDQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACMGGVPSNAYSAIK 335

Query: 204 QYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVY 260
             G LE++ DY Y         C++  EKAKV++ D+   S  ++ +   L ++GPI V 
Sbjct: 336 TLGGLETEEDYSYHGHLQA---CSFSAEKAKVYINDSVELSQNEYKLAAWLAKNGPISVA 392

Query: 261 LNHRLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
           +N   ++ Y     +P+R     C+P  +DHAV IVGYG ++ +  W ++NSWG    + 
Sbjct: 393 INAFGMQFYRHGIAHPLRP---LCSPWLIDHAVLIVGYGNRSDVPFWAIKNSWGTDWGEE 449

Query: 318 GYFQIERGANACGIESYAYLASV 340
           GY+ + RG+ ACG+ + A  A V
Sbjct: 450 GYYYLHRGSGACGVNTMASSAVV 472


>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
          Length = 478

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 95/307 (30%), Positives = 148/307 (48%), Gaps = 20/307 (6%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
           ++F  +I +  + Y +  E+  RF  FK++ K   E          YG +  SD +  E 
Sbjct: 174 NSFLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEF 233

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
            +     L  + ++ +  D+   +K      +  LP S DWR+     +  V++QG CGS
Sbjct: 234 KETM---LPYQWEQPVPMDQANFEKEGVTISEEDLPDSFDWREHGA--VTQVKNQGSCGS 288

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLESQ 210
           CWAF+TT  +E    L KK L  LS+ +LV+CD  +  CNGG    A+ E ++  GLE +
Sbjct: 289 CWAFSTTGNIEGAWFLAKKKLVSLSEQELVDCDSVDQGCNGGLPSNAYKEIIRMGGLEPE 348

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQSGPIGVYLNHRLIES 268
             YPY  +      C   ++   V++  +       V+    L+  GPI + LN   ++ 
Sbjct: 349 DAYPYDGRGET---CHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTLQF 405

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y    +      C P  L+H V IVGYG+      WIV+NSWG    + GYF++ RG N 
Sbjct: 406 YRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPTWGEAGYFKLYRGKNV 465

Query: 329 CGIESYA 335
           CG++  A
Sbjct: 466 CGVQEMA 472


>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
          Length = 1165

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 101/317 (31%), Positives = 159/317 (50%), Gaps = 33/317 (10%)

Query: 43   FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
            F+ + +K +R Y    E + RF  FK +  + ++          YG +  +D +  E  Q
Sbjct: 858  FEKFKLKHSREYQSTLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGITHFADMTSAEYRQ 917

Query: 94   RTGLRLTGKEKERLEADRERV---KKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
            RTGL +   E      DR  V   K  ++E  +  LP+S DWR+  +  ++PV++QG CG
Sbjct: 918  RTGLVIPRDE------DRNHVGNPKAEIDENME--LPESFDWRE--LGAVSPVKNQGNCG 967

Query: 151  SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLES 209
            SCWAF+    +E    +  K L   S+ +L++CD  +  C GG +D A++ +++  GLE 
Sbjct: 968  SCWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCDAVDSACQGGYMDDAYKAIEKIGGLEL 1027

Query: 210  QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIE 267
            +++YPY  K+  T  C +   +  V V+        +  M  +L+ +GPI + LN   ++
Sbjct: 1028 ESEYPYLAKKQKT--CHFNSTEVHVRVKGAVDLPKNETAMAQYLVANGPISIGLNANAMQ 1085

Query: 268  SYDGNPIRRNDWACNPHKLDHAVAIVGYGEK------NGILTWIVRNSWGDIGPDHGYFQ 321
             Y G         C+   LDH V IVGYG K        +  WIV+NSWG    + GY++
Sbjct: 1086 FYRGGISHPWKPLCSKKNLDHGVLIVGYGVKEYPMFNKTMPYWIVKNSWGPKWGEQGYYR 1145

Query: 322  IERGANACGIESYAYLA 338
            I RG N CG+   A  A
Sbjct: 1146 IFRGDNTCGVSEMASSA 1162


>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
          Length = 362

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 113/326 (34%), Positives = 163/326 (50%), Gaps = 41/326 (12%)

Query: 41  DAFKTYIVKWNRTY-TDDNEIKTRFEYFKQDGKETDE------------YYGTSGSSDRS 87
           + +K +   + + Y T + EIK RF+ F+   +  +E            Y G +  SD S
Sbjct: 52  ETWKEFKTLFGKVYDTVEEEIK-RFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFSDMS 110

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
             E L+  GLR   ++  + E        +    K+  L   +DWR      + PV++QG
Sbjct: 111 HDEYLRHNGLRRGNRKYSKGEG----CDSYTKSGKQ--LDDKVDWRDKGY--VTPVKNQG 162

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQY 205
           +CGSCW+F+TT  LE Q       L  LS+ QLV+C    GN  CNGG +D AFEY+K  
Sbjct: 163 QCGSCWSFSTTGSLEGQHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDNAFEYIKSI 222

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT---SGVDHMMH--LLQSGPIGV 259
            GLE + DYPY  K+    +C  +K   K    DT  T   SG +  +   L   GPI V
Sbjct: 223 GGLEGEDDYPYTAKQG---KCHLKKSLFK--ANDTGCTDVESGDEDALKDALASVGPISV 277

Query: 260 YLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPD 316
            ++  H   +SYDG      +  C+   LDH V  VGYG E+NG   W+V+NSWG++  +
Sbjct: 278 AIDASHASFQSYDGGVYDEEE--CSSQNLDHGVLTVGYGTEENGGDYWLVKNSWGEMWGE 335

Query: 317 HGYFQIERGA-NACGIESYAYLASVK 341
            GY ++ R   N CGI + A   +V+
Sbjct: 336 EGYIKMSRNKDNQCGIATQASYPNVQ 361


>gi|348551380|ref|XP_003461508.1| PREDICTED: pro-cathepsin H-like [Cavia porcellus]
          Length = 335

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 85/223 (38%), Positives = 121/223 (54%), Gaps = 13/223 (5%)

Query: 121 RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQL 180
           R  GP P S+DWR+ K   ++PV++QG CGSCW F+TT  LES VA+    +  L++ QL
Sbjct: 111 RGTGPYPTSVDWRK-KGNFVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQL 169

Query: 181 VEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           V+C  D  N  C GG    AFEY+    G+  +  YPY+ K+     C ++ +KA  FV+
Sbjct: 170 VDCAQDFNNHGCEGGLPSQAFEYILYNKGIMGEDTYPYQGKDG---HCRFQPQKAIAFVK 226

Query: 238 DTWVTSGVDHMMHLLQSGPI--GVYLNHRLIE---SYDGNPIRRNDWACNPHKLDHAVAI 292
           D  V   ++    ++++  +   V     + E   SY             P K++HAV  
Sbjct: 227 DV-VNITLNDEEAMVEAVALYNPVSFAFEVTEDFISYQSGIYSSTSCHKTPDKVNHAVLA 285

Query: 293 VGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           VGYG +NG+  WIV+NSWG      GYF IERG N CG+ + A
Sbjct: 286 VGYGVQNGVPYWIVKNSWGTAWGQDGYFLIERGKNMCGLAACA 328


>gi|167427529|gb|ABZ80401.1| cathepsin L4, partial [Fasciola hepatica]
          Length = 303

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 126/224 (56%), Gaps = 16/224 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           +P+S+DWR+     +  V+ QG CGSCWAF+TT  +E Q     K     S+ QLV+C  
Sbjct: 85  VPESIDWRE--FGYVTEVKDQGDCGSCWAFSTTGAVEGQYMKNPKANISFSEQQLVDCSG 142

Query: 184 DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKV-----FVQD 238
           D+GN  CNGG ++ A+EY+++ GLE+++ YPY+ +E     C Y+     V     F++ 
Sbjct: 143 DYGNHGCNGGFMENAYEYLERRGLETESSYPYKAEEG---PCKYDSRLGVVEVFGYFIEH 199

Query: 239 TWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
           + + S + H++       + V +    +    G    RN   C+  KL+HA+ +VGYG +
Sbjct: 200 SGIESKLAHLVGDKGPAAVAVDVESDFLMYRGGIYASRN---CSSEKLNHAMLVVGYGTQ 256

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASVK 341
           +G   WIV+NSWG +  DHGY ++ R   N CGI S A +  V+
Sbjct: 257 DGTDYWIVKNSWGSLWGDHGYIRMARNRDNMCGIASAASVPVVE 300


>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
          Length = 1761

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 88/272 (32%), Positives = 136/272 (50%), Gaps = 21/272 (7%)

Query: 78   YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKV 137
            YG +  +D + +E  +  GLR   + +      + ++           LPK  DWR  K 
Sbjct: 1501 YGITRFADMTQKEFSRSLGLRTDLRNENETPFAQAKIPNI-------ELPKEFDWR--KK 1551

Query: 138  KVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDV 197
             V+  V++Q +CGSCWAF+ T  +E Q AL    L   S+ +LV+CD  +  CNGG +D 
Sbjct: 1552 NVVTEVKNQEQCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTDDQGCNGGLMDT 1611

Query: 198  AFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG--VDHMMHLLQS 254
            A+  +++  GLE++ DYPY  ++    +C + +  A+V V      S    D    L+ +
Sbjct: 1612 AYRSIEKIGGLETEQDYPYDAEDE---KCHFNRTLARVQVTGALNISHNETDMAKWLVAN 1668

Query: 255  GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN------GILTWIVRN 308
            GPI + +N   ++ Y G       + C+P  LDH V IVGYG  N       +  WIV+N
Sbjct: 1669 GPISIAINANAMQFYMGGVSHPFKFLCSPKNLDHGVLIVGYGVHNYPLFKKSLPYWIVKN 1728

Query: 309  SWGDIGPDHGYFQIERGANACGIESYAYLASV 340
            SWG    + GY+++ RG   CG+      A V
Sbjct: 1729 SWGTGWGEQGYYRVYRGDGTCGLNQTPSSAIV 1760


>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
            castaneum]
          Length = 1726

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 88/272 (32%), Positives = 136/272 (50%), Gaps = 21/272 (7%)

Query: 78   YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKV 137
            YG +  +D + +E  +  GLR   + +      + ++           LPK  DWR  K 
Sbjct: 1466 YGITRFADMTQKEFSRSLGLRTDLRNENETPFAQAKIPNI-------ELPKEFDWR--KK 1516

Query: 138  KVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDV 197
             V+  V++Q +CGSCWAF+ T  +E Q AL    L   S+ +LV+CD  +  CNGG +D 
Sbjct: 1517 NVVTEVKNQEQCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTDDQGCNGGLMDT 1576

Query: 198  AFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG--VDHMMHLLQS 254
            A+  +++  GLE++ DYPY  ++    +C + +  A+V V      S    D    L+ +
Sbjct: 1577 AYRSIEKIGGLETEQDYPYDAEDE---KCHFNRTLARVQVTGALNISHNETDMAKWLVAN 1633

Query: 255  GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN------GILTWIVRN 308
            GPI + +N   ++ Y G       + C+P  LDH V IVGYG  N       +  WIV+N
Sbjct: 1634 GPISIAINANAMQFYMGGVSHPFKFLCSPKNLDHGVLIVGYGVHNYPLFKKSLPYWIVKN 1693

Query: 309  SWGDIGPDHGYFQIERGANACGIESYAYLASV 340
            SWG    + GY+++ RG   CG+      A V
Sbjct: 1694 SWGTGWGEQGYYRVYRGDGTCGLNQTPSSAIV 1725


>gi|358255476|dbj|GAA57175.1| cathepsin L [Clonorchis sinensis]
          Length = 385

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 103/319 (32%), Positives = 158/319 (49%), Gaps = 30/319 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------------YGTSGSSDRSPQE 90
           +K ++  + R Y D +E + RF+ F  +     ++             G +  SD+    
Sbjct: 66  WKHFMTTYKRNYIDPSEHERRFKIFANNFVRISKHNVRFIQGQVSYTMGINEFSDKVIGL 125

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNERK----KGPLPKSLDWRQSKVKVLNPVESQ 146
           I+     + T +E +RL   R  +    +  K      P P  +DWR      + PV++Q
Sbjct: 126 IIHTICFQ-TDEELKRLRCFRGSLNASRDGSKYITIAAPPPSEIDWRNKGA--VTPVKNQ 182

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQ 204
           G CGSCWAF+ T  +E Q  L    L  LS+ QLV+C  ++GN  CNGG +D AF+YVK 
Sbjct: 183 GNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYVKD 242

Query: 205 Y-GLESQADYPYRNKE--NITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS----GPI 257
             G++++A YPY + E  +    C +  ++A V V   ++      +  L Q+    GPI
Sbjct: 243 SNGIDTEASYPYVSGETGDANPTCRFNLKEAVVRVTG-YIDLPRGQVSELKQAVGHYGPI 301

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            V +N  L           +D  C+   LDH V +VGYGE+NGI  W+++NSWG    ++
Sbjct: 302 SVAINAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYGEENGIPYWLIKNSWGPHWGEN 361

Query: 318 GYFQIERGA-NACGIESYA 335
           GY +I R   N CG+ S A
Sbjct: 362 GYVKILRDHNNLCGVASMA 380


>gi|311247276|ref|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa]
          Length = 367

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 91/323 (28%), Positives = 152/323 (47%), Gaps = 31/323 (9%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
           + F  + +++NR+Y++  E   R + F Q+  +             +G +  SD + +E 
Sbjct: 40  EVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEF 99

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
            Q  G      +   +        K  +E     +P+S DWR+ K  V++ ++ Q  C  
Sbjct: 100 GQLHGHHWGAGKAPSMGI------KVGSEESGETVPQSCDWRK-KPGVISAIKHQKDCNC 152

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAFEYVKQYGLESQ 210
           CWA A    +E+Q A+       LS  Q+++CD     CNGG + D     +   GL S+
Sbjct: 153 CWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRCGNGCNGGFVWDAFLTVLNTSGLASE 212

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIES 268
            DYPY+     T RC  ++ +   ++QD  +    +  +  +L   GPI V +N  L++ 
Sbjct: 213 QDYPYKGTVK-THRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVTINAGLLQQ 271

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEK-----------NGILTWIVRNSWGDIGPDH 317
           Y    IR     C+PH ++H+V +VG+G+            + I  WI++NSWG    + 
Sbjct: 272 YKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDWGEE 331

Query: 318 GYFQIERGANACGIESYAYLASV 340
           GYF++ RG+N CGI  Y   A V
Sbjct: 332 GYFRLHRGSNTCGITKYPVTARV 354


>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
          Length = 461

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 95/303 (31%), Positives = 149/303 (49%), Gaps = 24/303 (7%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDG---------KETDEYYGTSGSSDRSPQEILQ 93
           F T+I K+ R Y+   E   RF  + Q+          ++    YG +  SD + +E  +
Sbjct: 159 FMTFIKKFKREYSSIEEQLDRFRIYLQNMNFAKKLQFEEKGTAIYGATKFSDMTAEEFQK 218

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGP--LPKSLDWRQSKVKVLNPVESQGRCGS 151
              + L     +R+E++   +   LN+       LP   DWR   V  + PV+ QG CGS
Sbjct: 219 ---IMLPSIWWDRVESNG--ITFNLNDFNLSIYNLPSKFDWRTEGV--VTPVKDQGSCGS 271

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG-LESQ 210
           CWAF+ T  +ES  A+    L  LS+ +L++CD  +  CNGG    AF  +K+ G LE +
Sbjct: 272 CWAFSVTGNIESLWAIKTGKLISLSEQELIDCDVIDKGCNGGLPINAFREIKRMGGLEPE 331

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIES 268
             YPY  K      C   + +  V + D       + +M   + Q GP+ V ++  L+  
Sbjct: 332 DQYPYEAKNG---TCHLVRAQIAVSIDDAVEIPRNETVMKAWIAQRGPLSVGIDAELLSY 388

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y    +  +   C P K++H V I GYG +N +  W ++NSWG+   ++GYFQ+ RG N 
Sbjct: 389 YKSGILHPSKSRCPPSKINHGVLITGYGIENNLPYWTIKNSWGEQWGENGYFQLMRGKNI 448

Query: 329 CGI 331
           CG+
Sbjct: 449 CGV 451


>gi|327358519|gb|AEA51106.1| cathepsin F, partial [Oryzias melastigma]
          Length = 255

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 83/216 (38%), Positives = 118/216 (54%), Gaps = 10/216 (4%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNL 188
           S DWR      ++PV++QG CGSCWAF+ T  +E Q  L   TL  LS+ +LV+CD  + 
Sbjct: 45  SWDWRDHGA--VSPVKNQGMCGSCWAFSVTGNIEGQWFLKNGTLLSLSEQELVDCDGLDQ 102

Query: 189 NCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
            C GG    A+E +++ G LE++ DY Y  K+    RC +   K   ++  + V    D 
Sbjct: 103 ACRGGLPSNAYEAIEKLGGLETETDYSYTGKKQ---RCDFTNRKVAAYINSS-VELPKDE 158

Query: 248 ---MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTW 304
                 L ++GPI V LN   ++ Y           CNP  +DHAV +VGYGE+NGI  W
Sbjct: 159 KEIAAWLAENGPISVALNAFAMQFYKKGVSHPWKIFCNPWMIDHAVLLVGYGERNGIPFW 218

Query: 305 IVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
            ++NSWG+   + GY+ + RG+NACGI      A V
Sbjct: 219 AIKNSWGEDYGEQGYYYLHRGSNACGINKMGSSAVV 254


>gi|195054270|ref|XP_001994049.1| GH22731 [Drosophila grimshawi]
 gi|193895919|gb|EDV94785.1| GH22731 [Drosophila grimshawi]
          Length = 617

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 101/320 (31%), Positives = 156/320 (48%), Gaps = 40/320 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           F  + +K+ R Y +  E + R   F+Q+ +  +E          YG +  +D +  E   
Sbjct: 311 FHKFQLKYKRQYANTAEHQMRLRIFRQNLRTIEELNANERGSAKYGITQFADMTSTEYKL 370

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GL    ++K    A        +     G +PK  DWRQ K   +  V++QG+CGSCW
Sbjct: 371 HAGLWQRSEDKPTGGA------AAVVPPYAGEMPKEFDWRQKKA--VTHVKNQGQCGSCW 422

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQAD 212
           AF+ T  +E   A+    L   S+ +L++CD  +  CNGG +D A++ +K   GLE +++
Sbjct: 423 AFSVTGNIEGLYAIKTGELEEFSEQELLDCDSTDSACNGGLMDNAYKAIKDIGGLEYESE 482

Query: 213 YPYRNKENITFRCTYEKEKAKV----FVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLI 266
           YPY  K+    +C + +  + V    FV    +  G +  M   LL +GPI + LN   +
Sbjct: 483 YPYAAKK---MQCHFNRTMSHVQLSGFVD---LPKGNETAMQEWLLSNGPISIGLNANAM 536

Query: 267 ESYDGNPIRRNDWA--CNPHKLDHAVAIVGYGEKN------GILTWIVRNSWGDIGPDHG 318
           + Y G     + WA  C+   LDH V IVGYG  +       +  WIV+NSWG    + G
Sbjct: 537 QFYRGG--VSHPWAPLCSKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQG 594

Query: 319 YFQIERGANACGIESYAYLA 338
           Y++I RG N CG+   A  A
Sbjct: 595 YYRIYRGDNTCGVSEMATSA 614


>gi|114679921|ref|YP_758371.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
 gi|39598652|gb|AAR28838.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
          Length = 359

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 100/313 (31%), Positives = 159/313 (50%), Gaps = 24/313 (7%)

Query: 34  YDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY-------YGTSGSSDR 86
           Y+  +  D F+ ++  +NRTY D  E + R+E F Q+ K  +         Y  +  SD 
Sbjct: 45  YEPDRMRDYFERFVRDYNRTYIDSVEREQRYETFVQNLKNINRLNQKSQASYDINKFSDL 104

Query: 87  SPQEILQR-TGLRLTGKEKERLEAD---RERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +  E++ R TGL  +       + +    +  K  + +   G +P   DWR S+   +  
Sbjct: 105 TKDEVVARFTGLDPSLAAAAYTDNNGTQYQLCKVVVVDGTPGRVPDLWDWRNSQK--VTS 162

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV 202
           V+ QG CGSCWAFA+ A +ESQ A+    L  LS+ QLV+CD  +  C+GG + +AF+ +
Sbjct: 163 VKQQGVCGSCWAFASVANIESQYAIRHDRLLDLSEQQLVDCDQIDQGCSGGLMHLAFQEI 222

Query: 203 KQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH--MMHLLQS-GPIG 258
            Q  GLES+  YPY   + + + C     K  V + D       D   +  L+ + GPI 
Sbjct: 223 LQMGGLESELVYPY---QGVDYACRLNPRKFDVKLSDCHRYDLRDERKLRELVYTVGPIA 279

Query: 259 VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHG 318
           V ++   I  Y    +      CN + L+HAV +VG+G +     WI++NSWG+   + G
Sbjct: 280 VAIDCIDIIDYKSGIVS----MCNNNGLNHAVLLVGFGIEFDTPYWILKNSWGNDWGEKG 335

Query: 319 YFQIERGANACGI 331
           YF+++R  N CG+
Sbjct: 336 YFRLKRNINGCGM 348


>gi|341888719|gb|EGT44654.1| hypothetical protein CAEBREN_19265 [Caenorhabditis brenneri]
          Length = 396

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 94/309 (30%), Positives = 163/309 (52%), Gaps = 22/309 (7%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           FK +  K+ R +    E K RFE F+++ ++ +E         YG +  SD++  E L+ 
Sbjct: 88  FKDFNKKFGREHKSLEEYKMRFEVFQKNLRDIEELNLKNPSVQYGINRFSDKTESE-LKN 146

Query: 95  TGLRLTGKEKERLEADRERVKKFLNER---KKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
             +     +     +  + +  + N R   K    P  +DWR   V  +  V+ QG+CGS
Sbjct: 147 LLMDKKFMDSSLSNSSLKTLSSYRNPRNIIKNVQRPDYIDWRN--VGKVMSVKDQGQCGS 204

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQA 211
           CWAFAT A +ESQ A+ K TL+ LS+ +LV+CD  +  C+GG +  A E++   GLE++ 
Sbjct: 205 CWAFATVAAVESQYAIRKGTLWSLSEQELVDCDGASYGCSGGFLTSALEFILGNGLETED 264

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQS-GPIGVYLN--HRLIE 267
           DYPY   ++   +C    +K +V++ + + +T   D +   + + GP+   +   +  I 
Sbjct: 265 DYPYTATKHD--QCWINGDKTRVWIDEGYQLTMNEDDIAEWVANVGPVSFAMRAPYSFIA 322

Query: 268 SYDGNPIRRNDWACNPHKLDHA-VAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
            ++G     +++ C    + +  +AI+GYG++ G   WIV+NSWGD   + GY ++ RG 
Sbjct: 323 YHNG-IYSPSEYQCKHEAMGYVMMAIIGYGQEGGQNYWIVKNSWGDSWGNQGYMRLARGV 381

Query: 327 NACGIESYA 335
           N C + +Y 
Sbjct: 382 NTCEMANYV 390


>gi|13124011|sp|Q9YWK4.1|CATV_NPVBS RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|3882976|gb|AAC77812.1| cathepsin [Buzura suppressaria NPV]
          Length = 331

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 94/311 (30%), Positives = 156/311 (50%), Gaps = 24/311 (7%)

Query: 32  LAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD--------EYYGTSGS 83
            AYD +K  D F+T++  +N+ Y D +E + RF  F+Q  +E +          Y  +  
Sbjct: 20  FAYDLLKAGDYFETFLANYNKMYNDTSEKERRFSIFQQTLEEINYKNRLNDSAVYQINKF 79

Query: 84  SDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +D S  EI+ + TGL +  +            K  + ++  G  P + DWRQ     +  
Sbjct: 80  ADLSKNEIISKYTGLNMPVQTTNF-------CKTIVIDQPPGKGPLNFDWRQQNK--VTS 130

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV 202
           +++Q  CG+CWAFAT A +ESQ A+       LS+ Q+++CD+ ++ C+GG +  AFE +
Sbjct: 131 IKNQKACGACWAFATLASIESQYAIKNNVHIDLSEQQMIDCDYVDMGCDGGLLHTAFEQM 190

Query: 203 KQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS-GPIGVY 260
            Q G L  + +YPY            E    KV     +V    + +  LL++ GPI + 
Sbjct: 191 IQMGELVQEHEYPYAGVNKPCELRGDETGVVKVKGCYRYVVFREEKLKDLLRAVGPIPMA 250

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           ++   I +Y    I      C  + L+HAV +VGYG +N +  W  +N+WG    + GYF
Sbjct: 251 IDASGIVNYHHGIIHY----CENYGLNHAVLLVGYGVENNVPFWTFKNTWGKDWGEEGYF 306

Query: 321 QIERGANACGI 331
           ++ +  +ACG+
Sbjct: 307 RVRQNVDACGM 317


>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
          Length = 373

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 102/315 (32%), Positives = 157/315 (49%), Gaps = 34/315 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------------YGTSGSSDRSPQE 90
           +K ++  + R Y D +E + RF+ F  +     ++             G +  SD++ +E
Sbjct: 66  WKHFMTTYKRNYIDPSEHERRFKIFANNFVRISKHNVRFIQGQVSYTMGINEFSDKTDEE 125

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
           + +    R +      L A R+  K         P P  +DWR      + PV++QG CG
Sbjct: 126 LKRLRCFRGS------LNASRDGSKYI---TIAAPPPSEIDWRNKGA--VTPVKNQGNCG 174

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQY-GL 207
           SCWAF+ T  +E Q  L    L  LS+ QLV+C  ++GN  CNGG +D AF+YVK   G+
Sbjct: 175 SCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYVKDSNGI 234

Query: 208 ESQADYPYRNKE--NITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS----GPIGVYL 261
           +++A YPY + E  +    C +  ++A V V   ++      +  L Q+    GPI V +
Sbjct: 235 DTEASYPYVSGETGDANPTCRFNLKEAVVRVTG-YIDLPRGQVSELKQAVGHYGPISVAI 293

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           N  L           +D  C+   LDH V +VGYGE+NGI  W+++NSWG    ++GY +
Sbjct: 294 NAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYGEENGIPYWLIKNSWGPHWGENGYVK 353

Query: 322 IERGA-NACGIESYA 335
           I R   N CG+ S A
Sbjct: 354 ILRDHNNLCGVASMA 368


>gi|392873946|gb|AFM85805.1| cathepsin H [Callorhinchus milii]
          Length = 259

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 84/218 (38%), Positives = 120/218 (55%), Gaps = 13/218 (5%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           GP P  +DWR +K   + PV++QG CGSCW F+TT  LES +A+    L  L++ QLV+C
Sbjct: 38  GPYPDFVDWR-TKGNYVTPVKNQGGCGSCWTFSTTGCLESAIAIKTGKLLSLAEQQLVDC 96

Query: 184 D--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT- 239
              + N  CNGG    AFEY+K   GLE++ DYPY  ++     C Y+  KA  FV++  
Sbjct: 97  AGAYKNHGCNGGLPSQAFEYIKYNGGLEAEKDYPYTAQDQ---HCQYQPNKAVAFVKEVV 153

Query: 240 ----WVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
               +  +G+   +  L    I   +     + Y+G     ++    P K++HAV  VGY
Sbjct: 154 NITQYDENGIVDAVARLNPVSIAFEVTDDFFQ-YEGGVYSNSNCDSTPDKVNHAVLAVGY 212

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIES 333
           G +NG   WIV+NSWG     +GYF I RG N CG+ +
Sbjct: 213 GVQNGTKYWIVKNSWGPEWGLNGYFYIIRGKNMCGLAA 250


>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
          Length = 371

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 99/312 (31%), Positives = 153/312 (49%), Gaps = 31/312 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------------YGTSGSSDRSPQE 90
           ++ ++ K+ R Y    E + R   F ++     E+             G +  SD++  E
Sbjct: 67  WQAFLEKYKRVYDSKLEEERRLGIFTENFIRISEHNLLFEKGEVSYSMGINAFSDKTNSE 126

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
           +    G R + K      A R   +    +      P  +DWR      + PV++QG CG
Sbjct: 127 LDVLRGFRHSSK------ASRSGSQYIPFDAAP---PAEVDWRTKGA--VTPVKNQGDCG 175

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLES 209
           SCWAF+ T  +E Q  L    L  LS+ QLV+C   N  C+GG +D+AFEYVK++ G+++
Sbjct: 176 SCWAFSATGGIEGQHYLATGKLVSLSEQQLVDCSSSNDGCDGGLMDLAFEYVKEHKGIDT 235

Query: 210 QADYPYRNKENITFR-CTYEKEKAKVFVQDTWVTSGVDHMMHLLQS----GPIGVYLNHR 264
           +  YPY +      R C+++ + A V V   +V       + L Q+    GPI V +N  
Sbjct: 236 EVHYPYVSGNTGYARQCSFDPKYAAVNVTG-YVDIPEGQELLLQQAVGFHGPISVGINAG 294

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
           L           +D  CNPH LDH V +VGYG  NG+  W+++NSWG+   ++GY +I R
Sbjct: 295 LPSFMAYESGIYSDHRCNPHDLDHGVLVVGYGVDNGVPYWLIKNSWGEDWGENGYVRILR 354

Query: 325 GA-NACGIESYA 335
              N CG+ + A
Sbjct: 355 NHNNLCGVATMA 366


>gi|426252044|ref|XP_004019728.1| PREDICTED: cathepsin W [Ovis aries]
          Length = 375

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 92/332 (27%), Positives = 152/332 (45%), Gaps = 41/332 (12%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
           + F+ + +++NR+Y +  E   R + F Q+  +             +G +  SD + +E 
Sbjct: 40  EVFRLFQMQYNRSYPNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEF 99

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
           +Q  G R+ G   E L   R+   +   E +    P + DWR +K   ++PV +Q  C  
Sbjct: 100 VQLYGSRVAG---EALGVSRKVGSEEWGESQ----PPTCDWR-NKPNTISPVRNQRHCNC 151

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAFEYVKQYGLESQ 210
           CWA A    +E+  A+           +L++CD     C GG + D     +K  GL S+
Sbjct: 152 CWAMAAAGNIEALWAIKFNRSVEERGGELLDCDRCGNGCKGGFVWDAFLTVLKNRGLASE 211

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIES 268
            DYP+ +    T RC  EK K   ++QD  +    +  +  HL   GPI V +N +L++ 
Sbjct: 212 TDYPF-DGSGKTHRCLAEKHKKVAWIQDFIMLQACEQSIARHLATQGPITVTINVKLLQQ 270

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGE--------------------KNGILTWIVRN 308
           Y    I+     C+P  +DH+V +VG+G+                    +  +  W ++N
Sbjct: 271 YQKGVIKATPTTCDPRHVDHSVLLVGFGKTKSVEGRQGKAASFRSYTRPRRSMAYWTLKN 330

Query: 309 SWGDIGPDHGYFQIERGANACGIESYAYLASV 340
           SWG    + GYF++ RG+N CGI  Y   A V
Sbjct: 331 SWGPHWGEEGYFRLHRGSNTCGITKYPVTAIV 362


>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
 gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
          Length = 477

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 97/314 (30%), Positives = 153/314 (48%), Gaps = 23/314 (7%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
           ++F  ++ +  + YT+  E+  RF  FK++ K   E          YG +  SD +  E 
Sbjct: 172 NSFLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTME- 230

Query: 92  LQRTGLRLTGKEK--ERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
            ++  L    ++      +A+ E+    +NE     LP+S DWR+     +  V++QG C
Sbjct: 231 FKKIMLPYQWEQPVYPMEQANFEKHDVTINEED---LPESFDWREKGA--VTQVKNQGNC 285

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLE 208
           GSCWAF+TT  +E    + K  L  LS+ +LV+CD  +  CNGG    A+ E ++  GLE
Sbjct: 286 GSCWAFSTTGNVEGAWFIAKNKLVSLSEQELVDCDSMDQGCNGGLPSNAYKEIIRMGGLE 345

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQSGPIGVYLNHRLI 266
            +  YPY  +      C   ++   V++  +       V+    L+  GPI + LN   +
Sbjct: 346 PEDAYPYDGRGET---CHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTL 402

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
           + Y    +      C P  L+H V IVGYG+      WIV+NSWG    + GYF++ RG 
Sbjct: 403 QFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPNWGEAGYFKLYRGK 462

Query: 327 NACGIESYAYLASV 340
           N CG++  A  A V
Sbjct: 463 NVCGVQEMATSALV 476


>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
          Length = 360

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 96/318 (30%), Positives = 155/318 (48%), Gaps = 43/318 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F T+  K+ ++Y    E   RF  F+ + +    +        +G +  SD +P+E  ++
Sbjct: 44  FTTFKTKFGKSYATQEEHDYRFGVFRANLRRAKLHAKLDPSAEHGVTKFSDLTPEEFKRQ 103

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G +  RL +   +            LP++ DWR      + PV++QG CGSCWA
Sbjct: 104 ----YLGLKPLRLPSTANKAPILPTSD----LPENFDWRDKGA--VTPVKNQGSCGSCWA 153

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYVKQY 205
           F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AF+Y+ Q 
Sbjct: 154 FSTTGALEGAHYLSTGELVSLSEQQLVDCDHVCDPEEYGACDAGCNGGLMNNAFDYILQA 213

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            G++++ DYPY  ++     C ++K K    V +  V S  +  +  +L++ GP+ V +N
Sbjct: 214 GGVQTEKDYPYSGRDET---CKFDKSKVAATVANFSVVSLDEDQIAANLVKHGPLAVGIN 270

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y G       + C    LDH V +VGYG              WI++NSWG+   
Sbjct: 271 AIFMQTYIGG--VSCPYICG-KNLDHGVLLVGYGAAGYAPIRFKDKPFWIIKNSWGESWG 327

Query: 316 DHGYFQIERGANACGIES 333
           + GY++I RG N CG++S
Sbjct: 328 EDGYYKICRGKNVCGVDS 345


>gi|66730453|ref|NP_001019413.1| cathepsin W precursor [Rattus norvegicus]
 gi|62531092|gb|AAH93401.1| Cathepsin W [Rattus norvegicus]
 gi|149062072|gb|EDM12495.1| cathepsin W [Rattus norvegicus]
          Length = 371

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 93/340 (27%), Positives = 166/340 (48%), Gaps = 37/340 (10%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGT 80
           +D     ++  + FK + +++NR+Y++  E   R   F  +  +             +G 
Sbjct: 27  KDAGPRPLELKEVFKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTAEFGQ 86

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           +  SD + +E  Q  G +   +  ER+    ++VK   +ER    +P + DWR+ K  ++
Sbjct: 87  TPFSDLTEEEFGQLYGHQ---RAPERILNMAKKVK---SERWGESVPPTCDWRKVK-NII 139

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAF 199
           + +++QG C  CWA A    +++   +  +    +S  +L++CD     CNGG + D   
Sbjct: 140 SSIKNQGNCRCCWAIAAADNIQTLWRIKTQQFVDVSVQELLDCDRCGNGCNGGFVWDAYI 199

Query: 200 EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPI 257
             +   GL S+ DYP++  +    RC  +K +   ++QD  + S  + ++  +L   GPI
Sbjct: 200 TVLNNSGLASEEDYPFQGHQK-PHRCLADKYRKVAWIQDFTMLSSNEQVIAGYLAIHGPI 258

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILT------------- 303
            V +N +L++ Y    I+     C+PH ++H+V +VG+G EK G+ T             
Sbjct: 259 TVTINMKLLQYYQKGVIKATPSTCDPHLVNHSVLLVGFGKEKGGMQTGTLLSHSRKPRRS 318

Query: 304 ---WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
              WI++NSWG    + GYF++ RG N CGI  Y   A V
Sbjct: 319 TPYWILKNSWGAEWGEKGYFRLYRGNNTCGIAKYPITARV 358


>gi|209731972|gb|ACI66855.1| Cathepsin H precursor [Salmo salar]
          Length = 328

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 100/300 (33%), Positives = 146/300 (48%), Gaps = 17/300 (5%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           FK ++ ++N+ Y D  E   R + F ++ +  D  Y   G+  +    + Q + L     
Sbjct: 28  FKLWMSQYNKVY-DMEEYYHRLQIFIENKRRID--YHNEGN-HKFTMGLNQFSDLTFAEF 83

Query: 103 EKERL--EADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
            K  L  E       K  +    GP P+S+DWR+ K   +  V++QG CGSCW F+TT  
Sbjct: 84  RKSFLLTEPQNCSATKGSHVSSNGPYPESVDWRK-KGNYVTAVKNQGSCGSCWTFSTTGC 142

Query: 161 LESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRN 217
           LES  A+    L  LS+ QLV+C     N  CNGG    AFEY+K   G+ ++ DYPY  
Sbjct: 143 LESVTAIATGKLLQLSEQQLVDCAQAFNNHGCNGGLPSQAFEYIKFNKGIMTEDDYPYTA 202

Query: 218 KENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGV-YLNHRLIESYDGNP 273
            ++    C ++ + A  FV+D    +  D M  +    +  P+ + Y        YDG  
Sbjct: 203 HDDT---CKFKTDLAAAFVKDVVNITKYDEMGMVDAVARFNPVSLAYEVTSDFMHYDGGV 259

Query: 274 IRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIES 333
               +       ++HAV  VGYGE+ G   WIV+NSWG      GYF IERG N CG+ +
Sbjct: 260 YTSKECHNTTDTVNHAVLAVGYGEEKGTPYWIVKNSWGSSWGMKGYFFIERGKNMCGLAA 319


>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 330

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 105/321 (32%), Positives = 151/321 (47%), Gaps = 43/321 (13%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTG 101
           AFK +   +N+ Y+ +     R   FK++ +  + +      +D +   I Q   L    
Sbjct: 29  AFKKFTQTYNKKYSSEEHYNARLSIFKENLRRIELF----NKNDEAQHGITQFADLT--- 81

Query: 102 KEKERLEADRERVKKFLNERKKGPL-------PKSLDWRQSKVKVLNPVESQGRCGSCWA 154
             +E  +       +  N + K  L       P ++DW  +    + PV++QG CGSCWA
Sbjct: 82  -HEEFADMYLGYKPQLRNSQAKVSLSSTPFTAPTAIDW--TTKGAVTPVKNQGSCGSCWA 138

Query: 155 FATTAILESQVAL-LKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQYGLESQAD 212
           F+TT  +E Q  L LK+ L   S+ QLV+CD   +  CNGG +D AF Y++   LE+++ 
Sbjct: 139 FSTTGSIEGQYVLQLKQNLTSFSEQQLVDCDTKEDQGCNGGLMDNAFTYLESAKLETESA 198

Query: 213 YPYRNKENITFRCTYEKEKAKV------------FVQDTWVTSGVDHMMHLLQSGPIGVY 260
           YPY   +     C Y +    V             V DT  T GV     L   GP+ V 
Sbjct: 199 YPYTAVDG---SCKYNQSLGVVGVASFVDIEQGKTVADTENTMGV----ALDNIGPLSVA 251

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           +N   ++ Y G     N   CNP+ L+H V IVG G +NG   W V+NSWG    + GYF
Sbjct: 252 INANNLQFYAGGI--SNPLICNPNGLNHGVLIVGLGSENGKDFWKVKNSWGASWGEKGYF 309

Query: 321 QIERGANACGIE---SYAYLA 338
           +I RG   CGI    SY  LA
Sbjct: 310 RIVRGKGKCGINRAVSYPVLA 330


>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
          Length = 326

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 103/331 (31%), Positives = 160/331 (48%), Gaps = 34/331 (10%)

Query: 28  VWRDLAYDSIKQVD----AFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKET 74
           +W  LA  +  + D     ++ + +K+ +TY++D++ + RFE FK         Q+ ++ 
Sbjct: 13  IWSALARTTQVEPDNARALYEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQG 71

Query: 75  DEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLP-KSLDWR 133
              YG +  SD + +E   R           R+  D   V + L   +   +  +  DWR
Sbjct: 72  TAQYGVTQFSDLTSEEFKTRY---------LRMRFDGPIVSEDLTPEEDVTMDNEKFDWR 122

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
           +     + PV  QG+CGSCWAF+    +E Q       L  LS+ QLV+CD+ +  C+GG
Sbjct: 123 EHGA--VGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDYLDGGCDGG 180

Query: 194 NIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--H 250
                +  + K  GLE  +DYPY     I   C  +K K   ++  + +    + +    
Sbjct: 181 YPPQTYTAIQKMGGLELASDYPYTGVGGI---CYMDKSKFVAYINGSTILPLSEKVQAQK 237

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L   GP+   LN   ++ Y G  I R  W C+P  ++HAV  VGYG +NG   WIV+NSW
Sbjct: 238 LRAIGPLSSALNADTLQLYKGG-IMRPKW-CDPAGVNHAVLTVGYGVQNGKPYWIVKNSW 295

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASVK 341
           G+   + GYF+I RG   CGI S    A +K
Sbjct: 296 GEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326


>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
          Length = 803

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 94/302 (31%), Positives = 144/302 (47%), Gaps = 38/302 (12%)

Query: 52  RTYTDDNEIKTRFEYFKQDGKETD---------EYYGTSGSSDRSPQEILQR-TGLRLTG 101
           R+Y    E+K RF  F+ + K+ D           YG +  SD S +E  +   GL+   
Sbjct: 509 RSYKTTEELKKRFRIFRANMKKADYLQKTEQGTAKYGVTIFSDISSKEFKKHYLGLK--- 565

Query: 102 KEKERLEADRERVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
                    R    KF  E  + P   LP+  DWR      + PV++QG CGSCWAF+ T
Sbjct: 566 --------KRTPDIKFKQEMAQIPNITLPEEYDWRN--YNAVTPVKNQGMCGSCWAFSVT 615

Query: 159 AILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRN 217
             +E Q A+    L  LS+ +LV+CD  +  C GG  + A+  +++  GLE ++DYPY  
Sbjct: 616 GNIEGQYAIKTGNLVSLSEQELVDCDKYDDGCEGGLFETAYHAIEELGGLELESDYPYSG 675

Query: 218 KENITFRCTYEKEKAKVFVQDTWVTSG--VDHMMHLLQSGPIGVYLNHRLIESYDGNPIR 275
           ++N    C +   + +V +  +   S    D    L+ +GPI + +N   ++ Y G    
Sbjct: 676 RDNT---CHFNSSEVRVSITSSVNISNDETDMAKWLVANGPISIGINANAMQFYLGGVSH 732

Query: 276 RNDWACNPHKLDHAVAIVGYGEKNGILT------WIVRNSWGDIGPDHGYFQIERGANAC 329
              + C+P  LDH V IVGYG     L       W+++NSW       GY+ + RG  +C
Sbjct: 733 PLKFLCDPKTLDHGVLIVGYGIHRTWLLHRHLPYWLIKNSWSSYWGAKGYYMLYRGDGSC 792

Query: 330 GI 331
           G+
Sbjct: 793 GV 794


>gi|242061538|ref|XP_002452058.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
 gi|241931889|gb|EES05034.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
          Length = 371

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 107/332 (32%), Positives = 162/332 (48%), Gaps = 48/332 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F +++ ++ ++Y D  E   R   FK + +    +        +G +  SD +P E  +R
Sbjct: 48  FLSFVQRFGKSYKDAEEHAYRLSIFKANLRRARRHQLLDPSAEHGVTKFSDLTPAE-FRR 106

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGRCGS 151
           T L   G  K R    RE + K  NE    P   LP   DWR      + PV++QG CGS
Sbjct: 107 TYL---GLRKSRRALLRE-LGKSANEAPVLPTDGLPDDFDWRDHGA--VTPVKNQGSCGS 160

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV 202
           CW+F+T+  LE    L    L  LS+ Q+V+CDH          +  CNGG +  AF Y+
Sbjct: 161 CWSFSTSGALEGAHYLATGKLEVLSEQQMVDCDHVCDTSEPDSCDSGCNGGLMTNAFSYL 220

Query: 203 -KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIG 258
            K  GLES+ DYPY   ++   +C ++K K    VQ+  V S VD      +L++ GP+ 
Sbjct: 221 QKAGGLESEKDYPYTGSDD---KCKFDKSKIVASVQNFSVVS-VDEGQIAANLIKHGPLA 276

Query: 259 VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWG 311
           + +N   +++Y G       + C    LDH V +VGYG              WI++NSWG
Sbjct: 277 IGINAAYMQTYIGG--VSCPYICG-RTLDHGVLLVGYGAAGFAPIRLKDKPYWIIKNSWG 333

Query: 312 DIGPDHGYFQIERGANA---CGIESYAYLASV 340
           +   ++GY++I RG+N    CG++S     S 
Sbjct: 334 ENWGENGYYKICRGSNVRNKCGVDSMVSTVSA 365


>gi|307169691|gb|EFN62267.1| Cathepsin O [Camponotus floridanus]
          Length = 358

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 98/325 (30%), Positives = 154/325 (47%), Gaps = 36/325 (11%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDN-EIKTRFEYFKQDGKETDE-----------YYGTSGS 83
           +++    F+ YIV++N++Y +D+ E K RFE F++  +  ++           YYG +  
Sbjct: 29  NVEDAKLFENYIVQYNKSYRNDSTEYKKRFECFQKSLRHIEKMNSFQSSQESAYYGLTKF 88

Query: 84  SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKG-------PLPKSLDWRQSK 136
           SD S  E LQ+T L       ++        + F N    G       P+P  +DWR   
Sbjct: 89  SDLSEDEFLQQTLLPDLSLRNQKHTTASYYHQYFTNSSNHGKRAIIPPPIPSKVDWRNRG 148

Query: 137 VKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNID 196
           V  + PV+ Q  CG+CWAF+T  ++ES  A+   TLYP S  ++++C  G+  C GG+  
Sbjct: 149 V--VGPVQYQDNCGACWAFSTIGVVESMYAIKNGTLYPFSVQEMIDCMPGSYGCQGGDTC 206

Query: 197 VAFEYVKQYGLESQADYPYRNKENITFR---CTYEKEKAKV-------FVQDTWVTSGVD 246
               ++    LES+      N   +T R   C   K  AK        F  +++V +  +
Sbjct: 207 ALLSWL----LESKTKIISENVYPLTLRNDPCKLSKTSAKTTGVKITDFTCNSFVNAESN 262

Query: 247 HMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIV 306
            +  L   GP+   +N    ++Y G  I+ +      H L+HAV IVGY     I  +I+
Sbjct: 263 LLTLLGTHGPVVAGVNAISWQNYLGGIIQYHCDGSFSH-LNHAVQIVGYDMAARIPHYII 321

Query: 307 RNSWGDIGPDHGYFQIERGANACGI 331
           +NSWG    + GY  I  G N CGI
Sbjct: 322 KNSWGSTFGNKGYIYIAIGKNLCGI 346


>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
 gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
          Length = 368

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 101/322 (31%), Positives = 155/322 (48%), Gaps = 50/322 (15%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEIL-Q 93
           F  +  K+ ++Y    E   RF  FK + +    +        +G +  SD +P E   Q
Sbjct: 53  FSLFKSKFKKSYGSQEEHDYRFSVFKANLRRAARHQELDPTASHGVTQFSDLTPAEFRKQ 112

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGRCG 150
             GLR            R R+ K  NE    P   LP+  DWR      + P+++QG CG
Sbjct: 113 VLGLR------------RLRLPKDANEAPILPTSDLPEDFDWRDKGA--VGPIKNQGSCG 158

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY 201
           SCW+F+ T  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY
Sbjct: 159 SCWSFSATGALEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 218

Query: 202 -VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIG 258
            +K  GL  + DYPY   +     C ++K K    V +  V S  +  +  +L+++GP+ 
Sbjct: 219 TLKAGGLMREEDYPYTGTDRDA--CKFDKNKVAARVANFSVVSLDEDQIAANLVKNGPLA 276

Query: 259 VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWG 311
           V +N   +++Y G       + C+  +LDH V +VGYG              WI++NSWG
Sbjct: 277 VAINAVFMQTYIGG--VSCPYICS-RRLDHGVLLVGYGSAGYSPVRMKEKPFWIIKNSWG 333

Query: 312 DIGPDHGYFQIERGANACGIES 333
           +   ++G+++I RG N CG++S
Sbjct: 334 EKWGENGFYKICRGRNVCGVDS 355


>gi|167427527|gb|ABZ80400.1| cathepsin L4, partial [Fasciola hepatica]
          Length = 303

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 79/224 (35%), Positives = 125/224 (55%), Gaps = 16/224 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           +P+S+DWR+     +  V+ QG CGSCWAF+TT  +E Q    +K     S+ QLV+C  
Sbjct: 85  VPESIDWRE--FGYVTEVKDQGDCGSCWAFSTTGAVEGQYTKNQKANISFSEQQLVDCSG 142

Query: 184 DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKV-----FVQD 238
           D+GN  CNGG ++ A+EY+++ GLE+++ YPY+ +E     C Y+     V     F++ 
Sbjct: 143 DYGNHGCNGGFMENAYEYLERRGLETESSYPYKAEEG---PCKYDSRLGVVEVFGYFIEH 199

Query: 239 TWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
           + + S + H++       + V +    +    G    RN   C+   L+H + +VGYG +
Sbjct: 200 SGIESKLAHLVGDKGPAAVAVDVESDFLMYRGGIYASRN---CSSESLNHGILVVGYGTQ 256

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASVK 341
           +G   WIV+NSWG +  DHGY ++ R   N CGI S A +  V+
Sbjct: 257 DGTDYWIVKNSWGSLWGDHGYIRMARNRDNMCGIASAASVPVVE 300


>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
          Length = 324

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 91/300 (30%), Positives = 149/300 (49%), Gaps = 19/300 (6%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           F+++ +K  +TY +  E   RF  F+++ ++ + +         S  + + +    +T  
Sbjct: 26  FQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFA-DMTRA 84

Query: 103 EKERLEADRERVKKFLNERKKGPL------PKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
           E + + A + + K  +   K   L      P+S+DWR   V  + P++ Q +CGSCWAFA
Sbjct: 85  EFKAMLATQVKTKPSIVATKTFQLADGVSVPESIDWRSRNV--VTPIKDQAQCGSCWAFA 142

Query: 157 TTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQYGLESQADYPY 215
                E   AL    L   S+ QLV+C    N  C+GG +D  F Y++  GLE ++DYPY
Sbjct: 143 VVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYGCDGGYLDDTFPYIQTNGLELESDYPY 202

Query: 216 RNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQ---SGPIGVYLNHRLIESYDGN 272
              +     C+YE  K    V  ++V+   +    L     +GP+ + +N   ++ Y   
Sbjct: 203 TGYDGY---CSYESSKVVTKVS-SYVSVPANEQALLEAVGTAGPVAIAINADDLQFYFSG 258

Query: 273 PIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIE 332
            I  +D  C+P  LDH V  VGY  +NG   W+++NSWG    + GYF+  RG N CG++
Sbjct: 259 II--DDKYCDPEYLDHGVLAVGYDSENGRDYWLIKNSWGADWGESGYFRFLRGQNICGVK 316


>gi|357473427|ref|XP_003606998.1| Cysteine proteinase [Medicago truncatula]
 gi|355508053|gb|AES89195.1| Cysteine proteinase [Medicago truncatula]
          Length = 362

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 101/321 (31%), Positives = 163/321 (50%), Gaps = 48/321 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ + Y+  +E   RF+ FK +      +        +G +  SD +P+E  + 
Sbjct: 48  FNLFKHKFGKVYSSKDEHDYRFKIFKSNLNRAKRHQLMDPSAVHGVTRFSDLTPREFRKS 107

Query: 95  T-GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GLR  G  K+   A+   +    N      LPK  DWR+     +  V++QG CGSCW
Sbjct: 108 VLGLRGVGLPKD---ANAAPILPTDN------LPKDFDWREKGA--VTAVKNQGSCGSCW 156

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-K 203
           +F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+ K
Sbjct: 157 SFSTTGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEQPGSCDAGCNGGLMNSAFEYILK 216

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYL 261
             G+  + DYPY   +  +  C ++K+K    V +  V S  +  +  +L+++GP+ + L
Sbjct: 217 SGGVMREEDYPYSGTDRGS--CKFDKKKIAASVANFSVVSLDEDQIAANLVKNGPLAIAL 274

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT---------WIVRNSWGD 312
           N   +++Y G       + C+  +LDH V +VGYG  +G  +         WI++NSWG+
Sbjct: 275 NAVYMQTYVGG--VSCPYICS-KRLDHGVLLVGYG--SGAYSPIRLKEKPYWIIKNSWGE 329

Query: 313 IGPDHGYFQIERGANACGIES 333
              ++GY++I RG N CG++S
Sbjct: 330 TWGENGYYKICRGRNICGVDS 350


>gi|7381221|gb|AAF61441.1|AF138265_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
          Length = 366

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 102/324 (31%), Positives = 154/324 (47%), Gaps = 53/324 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ + Y  D E   R   FK + +   ++        +G +  SD +P E  +R
Sbjct: 49  FTVFKRRFGKVYASDEEHDYRLSVFKANMRRAKQHQELDPAAVHGVTQFSDLTPTE-FRR 107

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGP------LPKSLDWRQSKVKVLNPVESQGR 148
             L L             R  KF  + K  P      LP   DWR      + PV++QG 
Sbjct: 108 KFLGL------------NRRLKFPADAKTAPILPTDELPSDFDWRDHGA--VTPVKNQGT 153

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAF 199
           CGSCW+F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AF
Sbjct: 154 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAF 213

Query: 200 EY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGP 256
           EY +K  GL  + DYPY    N    C ++K K    V +  V S  +  +  +L+++GP
Sbjct: 214 EYTLKAGGLMREEDYPYTG--NDLQVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGP 271

Query: 257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNS 309
           + V +N   +++Y G       + C+  +LDH V +VGYG              WI++NS
Sbjct: 272 LAVAINAVFVQTYIGG--VSCPYICS-KRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNS 328

Query: 310 WGDIGPDHGYFQIERGANACGIES 333
           WG+   ++GY++I RG N CG++S
Sbjct: 329 WGESWGENGYYKICRGRNVCGVDS 352


>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 436

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 104/334 (31%), Positives = 165/334 (49%), Gaps = 30/334 (8%)

Query: 16  VTYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD 75
           V+     D +I  + + + + ++++  +  ++ +   TY    E + RFE F+ + +  D
Sbjct: 18  VSLAAAADMSIVSYGERSEEEVRRM--YAEWMAEHGSTYNAIGEEERRFEAFRDNLRYID 75

Query: 76  EYYGTSGSSDRSPQEILQRTGLRLTGKE------KERLEADRER-VKKFLNERKKGPLPK 128
           ++   + +   S +  L R    LT +E        R + DRER +           LP+
Sbjct: 76  QHNAAADAGVHSFRLGLNRFA-DLTNEEYRSTYLGARTKPDRERKLSARYQAADNDELPE 134

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-N 187
           S+DWR  K   +  V+ QG CGSCWAF+  A +E    ++   + PLS+ +LV+CD   N
Sbjct: 135 SVDWR--KKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYN 192

Query: 188 LNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD 246
             CNGG +D AFE++    G++S+ DYPY+ ++N   RC   K+ AKV   D +    V+
Sbjct: 193 QGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDN---RCDANKKNAKVVTIDGYEDVPVN 249

Query: 247 H---MMHLLQSGPIGVYLNH--RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGI 301
               +   + + PI V +    R  + Y           C    LDH VA VGYG +NG 
Sbjct: 250 SEKSLQKAVANQPISVAIEAGGRAFQLYKSGIF---TGTCG-TALDHGVAAVGYGTENGK 305

Query: 302 LTWIVRNSWGDIGPDHGYFQIERGANA----CGI 331
             W+VRNSWG +  + GY ++ER   A    CGI
Sbjct: 306 DYWLVRNSWGSVWGEDGYIRMERNIKASSGKCGI 339


>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
          Length = 462

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 104/328 (31%), Positives = 165/328 (50%), Gaps = 30/328 (9%)

Query: 22  TDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTS 81
            D +I  + + + + ++++  +  ++ + + TY    E + RFE F+ + +  D++   +
Sbjct: 23  ADMSIVFYGERSEEEVRRM--YAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAA 80

Query: 82  GSSDRSPQEILQRTGLRLTGKE------KERLEADRER-VKKFLNERKKGPLPKSLDWRQ 134
            +   S +  L R    LT +E        R + DRER +           LP+S+DWR 
Sbjct: 81  DAGVHSFRLGLNRFA-DLTNEEYRSTYLGARTKPDRERKLSARYQAADNDELPESVDWR- 138

Query: 135 SKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGG 193
            K   +  V+ QG CGSCWAF+  A +E    ++   + PLS+ +LV+CD   N  CNGG
Sbjct: 139 -KKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGG 197

Query: 194 NIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL- 251
            +D AFE++    G++S+ DYPY+ ++N   RC   K+ AKV   D +    V+    L 
Sbjct: 198 LMDYAFEFIINNGGIDSEEDYPYKERDN---RCDANKKNAKVVTIDGYEDVPVNSEKSLQ 254

Query: 252 --LQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVR 307
             + + PI V +    R  + Y           C    LDH VA VGYG +NG   W+VR
Sbjct: 255 KAVANQPISVAIEAGGRAFQLYKSGIFTGT---CGT-ALDHGVAAVGYGTENGKDYWLVR 310

Query: 308 NSWGDIGPDHGYFQIERGANA----CGI 331
           NSWG +  ++GY ++ER   A    CGI
Sbjct: 311 NSWGSVWGENGYIRMERNIKASSGKCGI 338


>gi|209170907|ref|YP_002268053.1| agip23 [Agrotis ipsilon multiple nucleopolyhedrovirus]
 gi|208436498|gb|ACI28725.1| viral cathepsin [Agrotis ipsilon multiple nucleopolyhedrovirus]
          Length = 364

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 97/304 (31%), Positives = 156/304 (51%), Gaps = 32/304 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK--------QDGKETDEYYGTSGSSDRSPQEILQR 94
           F+ +I ++N+ Y +++E K R+  F+        ++ +     Y  +  +D +  E++  
Sbjct: 67  FEKFISQYNKHYKNEDEKKYRYNIFRHNIESINHKNSRNDSAVYKINRFADMTKNEVV-- 124

Query: 95  TGLRLTGKEKERLEAD--RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
             +R TG     L  +     V     +R++   P S DWR   +  +  V+ QG CG+C
Sbjct: 125 --IRHTGLASGELGVNFCETIVVDGPGQRQR---PTSFDWR--TLNKVTSVKDQGMCGAC 177

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQA 211
           WAFA    LESQ A+    L  LS+ QLV+CDH ++ C+GG I  A+E + +  G+E   
Sbjct: 178 WAFAGLGALESQYAIKYDRLIDLSEQQLVDCDHVDMGCDGGLIHTAYEEIMRMGGVEQDF 237

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQS-GPIGVYLNHRLIES 268
           DYPYR +      C  +  K    V+    +V    + +  LL+  GPI + ++   I  
Sbjct: 238 DYPYRAERQ---PCALKPHKFAAGVRSCYRYVLLNEERLEDLLRHVGPIAIAVDAVDITD 294

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG-DIGPDHGYFQIERGAN 327
           Y G  +      C  + L+HAV +VGYG +N +  WI++NSWG D G D GY ++ RG N
Sbjct: 295 YYGGIVS----FCENNGLNHAVLLVGYGVENNVPYWILKNSWGSDYGED-GYVRVRRGVN 349

Query: 328 ACGI 331
           +CG+
Sbjct: 350 SCGM 353


>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 105/334 (31%), Positives = 165/334 (49%), Gaps = 30/334 (8%)

Query: 16  VTYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD 75
           V+     D +I  + + + + ++++  +  ++ +   TY    E + RFE F+ + +  D
Sbjct: 18  VSLAAAADMSIVSYGERSEEEVRRM--YAEWMAEHGSTYNAIGEEERRFEAFRDNLRYID 75

Query: 76  EYYGTSGSSDRSPQEILQRTGLRLTGKE------KERLEADRER-VKKFLNERKKGPLPK 128
           ++   + +   S +  L R    LT +E        R + DRER +           LP+
Sbjct: 76  QHNAAADAGVHSFRLGLNRFA-DLTNEEYRSTYLGARTKPDRERKLSARYQAADNDELPE 134

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-N 187
           S+DWR  K   +  V+ QG CGSCWAF+  A +E    ++   + PLS+ +LV+CD   N
Sbjct: 135 SVDWR--KKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYN 192

Query: 188 LNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD 246
             CNGG +D AFE++    G++S+ DYPY+ ++N   RC   K+ AKV   D +    V+
Sbjct: 193 QGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDN---RCDANKKNAKVVTIDGYEDVPVN 249

Query: 247 HMMHL---LQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGI 301
               L   + + PI V +    R  + Y           C    LDH VA VGYG +NG 
Sbjct: 250 SEKSLQKAVANQPISVAIEAGGRAFQLYKSGIFTGT---CGT-ALDHGVAAVGYGTENGK 305

Query: 302 LTWIVRNSWGDIGPDHGYFQIERGANA----CGI 331
             W+VRNSWG +  + GY ++ER   A    CGI
Sbjct: 306 DYWLVRNSWGSVWGEDGYIRMERNIKASSGKCGI 339


>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
          Length = 324

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 89/300 (29%), Positives = 150/300 (50%), Gaps = 19/300 (6%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           F+++ +K  +TY +  E   RF  F+++ ++ + +         S  + + +    +T  
Sbjct: 26  FQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFA-DMTRA 84

Query: 103 EKERLEADRERVKKFLNERKKGPL------PKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
           E + + A + + K  +   K   L      P+S+DWR   V  + P++ Q +CGSCW+FA
Sbjct: 85  EFKAMLATQVKTKPSIVATKTFQLADGVSVPESIDWRSRNV--VTPIKDQAQCGSCWSFA 142

Query: 157 TTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQYGLESQADYPY 215
                E   AL    L   S+ QLV+C    N  C+GG +D  F Y++  GLE ++DYPY
Sbjct: 143 VVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYGCDGGYLDDTFPYIQTNGLELESDYPY 202

Query: 216 RNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQ---SGPIGVYLNHRLIESYDGN 272
              +     C+Y+  K    V  ++V+   +    L     +GP+ + +N   ++ Y   
Sbjct: 203 TGYDG---SCSYDSSKVVTKVS-SYVSVPANEQALLEAVGTAGPVAIAINADDLQFYFSG 258

Query: 273 PIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIE 332
            I  +D  C+P  LDH V  VGY  +NG+  W+++NSWG    + GYF+  RG N CG++
Sbjct: 259 II--DDKYCDPEWLDHGVLAVGYNSENGLDYWLIKNSWGADWGESGYFRFLRGQNICGVK 316


>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
          Length = 361

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 104/332 (31%), Positives = 158/332 (47%), Gaps = 42/332 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK---------ETDEYYGTSGSSDRSPQEILQ 93
           F  +   +N+TY D  E + RF  FK + K         E   +YG +  SD SP E  +
Sbjct: 34  FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSE-FE 92

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQ----SKVK----------- 138
           R  L L    K+ L   +  VK         PLP   DWR     ++VK           
Sbjct: 93  RHYLGL----KKDLAEHKAEVKPIKVGPVNEPLPDLFDWRTKGAVTEVKNQGMCGSCWAF 148

Query: 139 -VLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDV 197
                V++QG CGSCWAF+ T  +E Q  L +  L  LS+ +LV+CDHG+  C GG +  
Sbjct: 149 SXXTEVKNQGMCGSCWAFSVTGNVEGQWFLSRSKLLSLSEQELVDCDHGDHGCKGGYMGQ 208

Query: 198 AFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQD--TWVTSGVDHMMHLLQS 254
           A + V +  GLE++++YPY+  +     C + K ++K  VQ       +  +    L++ 
Sbjct: 209 AMKAVIEMGGLETESEYPYKGVDGT---CEFNKTESKARVQSFVGLPQNETELAYWLMKH 265

Query: 255 GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG------EKNGILTWIVRN 308
           GP+ + +N   ++ Y G       + C+P  LDH V +VG+G       +  +  WIV+N
Sbjct: 266 GPVSIGINANAMQFYFGGISHPWKFLCSPTDLDHGVLLVGFGVDKRSFRRKPVPYWIVKN 325

Query: 309 SWGDIGPDHGYFQIERGANACGIESYAYLASV 340
           SWG    + GY+++ RG   CG+   A  A V
Sbjct: 326 SWGKYWGEKGYYRVYRGDGTCGVNQMALSAVV 357


>gi|1594287|gb|AAC48340.1| cathepsin L-like cysteine proteinase [Toxocara canis]
          Length = 360

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 101/318 (31%), Positives = 159/318 (50%), Gaps = 31/318 (9%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYF----------KQDGKETDEYYGTSGSSDR 86
           ++ +D F+ +I K+++ Y  + E   RF  +           Q  ++    YG +  +D 
Sbjct: 44  LRLLDRFEEFIRKYDKVYDSNEEFAERFRIYVNNMLEAQKLNQRNRDYGTIYGENEFADW 103

Query: 87  SPQE----ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +  E    +L +   +   K+   +++  +  +  L  R++  +P   DWR     V+ P
Sbjct: 104 NVNEFREILLPKDFFKNLRKKSTFIDSFIDPPETVLARREE--IPDHFDWR--PYNVVTP 159

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV 202
           V+SQ +CGSCWAFAT   +ES  AL    L  LS+ QL++C+  N  C+GG++D A  YV
Sbjct: 160 VKSQFKCGSCWAFATVGTVESAYALGTGELRSLSEQQLLDCNLENNACDGGDVDKALRYV 219

Query: 203 KQYGLESQADYPY--RNKENITFRCTYEKEKAKVFV-QDTWVTSGVDHMMHLLQSGPIGV 259
              GL  + DYPY    ++    R    + KA VF+ QD    S +D ++H    GP+ V
Sbjct: 220 YDEGLMREYDYPYVAHRQDTCQLRGETTRIKAAVFLHQDE--ASIIDWLLHY---GPVNV 274

Query: 260 YLNHRL-IESYDGNPIRRNDWACNPHKL-DHAVAIVGYGEKNGI--LTWIVRNSWG-DIG 314
            +N    +++Y G     + W C    +  H++ IVGYG  N      WIV+NSWG   G
Sbjct: 275 GINVTADMKAYKGGVYTPDKWECENKIIGTHSINIVGYGTWNATNQKYWIVKNSWGQSYG 334

Query: 315 PDHGYFQIERGANACGIE 332
            + GY    RG N+CGIE
Sbjct: 335 IEDGYVYFARGINSCGIE 352


>gi|17569349|ref|NP_509408.1| Protein R09F10.1 [Caenorhabditis elegans]
 gi|351061560|emb|CCD69414.1| Protein R09F10.1 [Caenorhabditis elegans]
          Length = 383

 Score =  140 bits (353), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 167/316 (52%), Gaps = 13/316 (4%)

Query: 25  AIYVWRDLAY--DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSG 82
           + +V++ L +  +++K    F  +I+K++R YT   E + R++ F ++  E +     + 
Sbjct: 62  SFFVFQRLNHKMENLKHEQMFNDFILKFDRKYTSVEEFEYRYQIFLRNVIEFEAEEERNL 121

Query: 83  SSDRSPQEILQRTG--LRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKV 139
             D    E    T   L+   +E +  + D +  K   +  + G + P S+DWR+     
Sbjct: 122 GLDLDVNEFTDWTDEELQKMVQENKYTKYDFDTPKFEGSYLETGVIRPASIDWREQGK-- 179

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF 199
           L P+++QG+CGSCWAFAT A +E+Q A+ K  L  LS+ ++V+CD  N  C+GG    A 
Sbjct: 180 LTPIKNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQEMVDCDGRNNGCSGGYRPYAM 239

Query: 200 EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPI 257
           ++VK+ GLES+ +YPY   ++   +C  ++   +VF+ D  + S  +  +   +   GP+
Sbjct: 240 KFVKENGLESEKEYPYSALKHD--QCFLKENDTRVFIDDFRMLSNNEEDIANWVGTKGPV 297

Query: 258 GVYLN-HRLIESYDGNPIRRNDWACNPHKLD-HAVAIVGYGEKNGILTWIVRNSWGDIGP 315
              +N  + + SY       +   C    +  HA+ I+GYG +     WIV+NSWG    
Sbjct: 298 TFGMNVVKAMYSYRSGIFNPSVEDCTEKSMGAHALTIIGYGGEGESAYWIVKNSWGTSWG 357

Query: 316 DHGYFQIERGANACGI 331
             GYF++ RG N+CG+
Sbjct: 358 ASGYFRLARGVNSCGL 373


>gi|165969032|ref|YP_001650932.1| peptidase [Orgyia leucostigma NPV]
 gi|164663528|gb|ABY65748.1| peptidase [Orgyia leucostigma NPV]
          Length = 328

 Score =  140 bits (353), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 95/314 (30%), Positives = 156/314 (49%), Gaps = 32/314 (10%)

Query: 34  YDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD--------EYYGTSGSSD 85
           YD +K  D F++++  + + Y DD E   R+  FK + +E +          Y  +  SD
Sbjct: 20  YDLLKAPDYFESFVANYQKNYNDDLEKSKRYTIFKDNLEEINVKNRLNDTAVYRINKFSD 79

Query: 86  RSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVE 144
            S  EI+ + TGL    +            K  + ++  G  P + DWRQ     +  ++
Sbjct: 80  LSKTEIISKYTGLNAPSETTNF-------CKTIVLDQPPGKGPLNFDWRQQNK--VTSIK 130

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ 204
           +QG CG+CWAFAT A +ESQ A+       LS+ QL++CD+ ++ C GG +  AFE + Q
Sbjct: 131 NQGSCGACWAFATLASIESQYAIRNDRHINLSEQQLIDCDYVDMGCYGGLLHTAFEQMIQ 190

Query: 205 Y-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDT-----WVTSGVDHMMHLLQS-GPI 257
             G++ + +YPY     +  +C         FV        +V    + +  LL++ GPI
Sbjct: 191 MGGVKQEHEYPY---AGVNKQCELNDITDDSFVVRIKGCYRYVVVREEKLKDLLRAVGPI 247

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            + ++   I +Y    I      C  + L+HAV +VGYG  NG+  W  +N+WG    ++
Sbjct: 248 PIAIDASGIVNYYKGVINY----CENYGLNHAVLLVGYGVDNGVPYWTFKNTWGVDWGEN 303

Query: 318 GYFQIERGANACGI 331
           GYF++ +  NACG+
Sbjct: 304 GYFRLRQNINACGM 317


>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
          Length = 394

 Score =  140 bits (353), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 99/319 (31%), Positives = 155/319 (48%), Gaps = 42/319 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET--------DEYYGTSGSSDRSPQEILQR 94
           F  ++ K+N+ Y+   E   RF  FK++  +         D  +G +  SD + +E  ++
Sbjct: 75  FAHFVKKFNKEYSGAEEHARRFSIFKKNLHKALRHQKLDRDAIHGINKFSDLTEEEFHEQ 134

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
             L LT     R  + R +    L       LP   DWR+  +  + PV++QG CGSCW 
Sbjct: 135 Y-LGLT--TPPRSLSQRTQPAPILPTDD---LPPDFDWRE--LGAVTPVKNQGACGSCWT 186

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  +E    +    L  LS+ QLV+CDH          +  CNGG +  A++Y +K 
Sbjct: 187 FSTTGAMEGANFMKTGKLISLSEQQLVDCDHECDSSEPDVCDSGCNGGLMTTAYQYALKA 246

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GL+ + DYPY     I   C ++  K    V + + T  +D      +L+++GP+ V +
Sbjct: 247 GGLQREEDYPYT---GIDGSCKFDNTKVAAMVAN-FSTVSIDEDQIAANLVKNGPLAVGI 302

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN---GILT----WIVRNSWGDIG 314
           N   +++Y G       + CN   LDH V +VGYG      G L     WI++NSWG   
Sbjct: 303 NAAFMQTYVGG--VSCPYVCNKQNLDHGVLLVGYGAAGYAPGRLKNKPFWIIKNSWGPDW 360

Query: 315 PDHGYFQIERGANACGIES 333
            + GY+++ RG N CGI +
Sbjct: 361 GEDGYYKLCRGHNVCGINT 379


>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
          Length = 471

 Score =  140 bits (353), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 97/310 (31%), Positives = 151/310 (48%), Gaps = 37/310 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           F  + +K+ R Y    E + RF  FKQ+ +  +E          YG +  +D +  E  Q
Sbjct: 166 FAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYGITEFADMTSPEYKQ 225

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGRCG 150
           RTGL             R+  K   N + + P   LPK  DWR+     ++ V++QG CG
Sbjct: 226 RTGL-----------WQRDPQKAASNPKAEIPNIDLPKEFDWREKGA--ISAVKNQGNCG 272

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLES 209
           SCWAF+ T  +E   A+    L   S+ +L++CD  +  CNGG  D A+E +++  GLE 
Sbjct: 273 SCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDCDTSDSACNGGLPDNAYEAIEKIGGLEL 332

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIE 267
           ++DYPY  +++   +C +   K  V V+        +  +   L+ +GPI + +N   ++
Sbjct: 333 ESDYPYHARKD---QCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPISIGINANAMQ 389

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGE------KNGILTWIVRNSWGDIGPDHGYFQ 321
            Y G         C+   LDH V IVGY        K  +  WIV+NSWG    + GY++
Sbjct: 390 FYRGGVSHPPHILCSRKNLDHGVLIVGYRVSDYPMFKKTLPYWIVKNSWGKKWGEQGYYR 449

Query: 322 IERGANACGI 331
           + RG N CG+
Sbjct: 450 VYRGDNTCGV 459


>gi|209732040|gb|ACI66889.1| Cathepsin H precursor [Salmo salar]
          Length = 330

 Score =  140 bits (353), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 106/303 (34%), Positives = 146/303 (48%), Gaps = 23/303 (7%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           FK +++++N+ Y D  E   R + F +  +  D  Y  +G    S   + Q + +     
Sbjct: 30  FKQWMLQYNKVY-DLEEYYHRLDIFTRHKRRID--YHNAGKHTFS-MGLNQFSDMSFAEF 85

Query: 103 EKERL--EADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
            K  L  E       K  +    GP P S+DWR+ K   ++PV+ QG CGSCW F+TT  
Sbjct: 86  RKTFLLTEPQNCSATKGSHISSHGPYPGSVDWRE-KGNYVSPVKYQGHCGSCWTFSTTGC 144

Query: 161 LESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRN 217
           LES  A+    L  LS+ QLV+C  D  N  C GG    AFEYVK   GL ++ DYPY  
Sbjct: 145 LESVTAIATGKLPLLSEQQLVDCAQDFNNHGCMGGLPSQAFEYVKYNNGLMTEDDYPYTG 204

Query: 218 KENITFRCTYEKEKAKVFVQDTW-VTS----GVDHMMHLLQSGPIGVYLNHRLIESYDGN 272
            +     C ++ E A  FV+D   +TS    G+   +  L     G  +    +   DG 
Sbjct: 205 HDG---SCNFKPELAAAFVKDVVNITSYDEKGMVDAVARLNPVSFGYEVTDDFLHYKDG- 260

Query: 273 PIRRNDWAC--NPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACG 330
               +   C      ++HAV  VGYGEKN    WIV+NSWG      GYF IERG N CG
Sbjct: 261 --VYSSTTCKNTTDNVNHAVLAVGYGEKNSTPYWIVKNSWGTNWGMDGYFLIERGRNMCG 318

Query: 331 IES 333
           + +
Sbjct: 319 LAA 321


>gi|358334193|dbj|GAA43174.2| cysteine proteinase 3, partial [Clonorchis sinensis]
          Length = 374

 Score =  140 bits (353), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 98/317 (30%), Positives = 147/317 (46%), Gaps = 36/317 (11%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------------YGTSGSSDRSPQ 89
           A++ + V++NR YTD  E   R   F Q      E+             G +  SDR P 
Sbjct: 66  AWEKFRVEFNRKYTDSQEQINRLNVFCQSFMRVREHNKAYEEGRVTFKRGINEFSDRFPD 125

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E     G         R+   +     F   +   P P+S+DWR++    + PV  QG C
Sbjct: 126 ERQHACG--------GRINISKHSGSTF--RKVAAPAPQSIDWRRNGA--VTPVRRQGDC 173

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLN--CNGGNIDVAFEYVKQYG- 206
           G+CWAFA T  +E +  + +K L   S  QLV+C  G+    CNGG    AFEYV+  G 
Sbjct: 174 GACWAFAATGAIEGRYFIFEKRLETFSPQQLVDCIQGDTTNGCNGGYPSEAFEYVENVGG 233

Query: 207 LESQADYPYRNKEN--ITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS----GPIGVY 260
           LE + DYPY +         C Y++ K +V +    +    D    LLQ+    GPI + 
Sbjct: 234 LELERDYPYVSVATGLPNPFCGYDQTKQQVKLTSHVILPSGDEEA-LLQAVSIYGPIAIL 292

Query: 261 LN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHG 318
            +  H   + Y+ +     +       + HA+ +VGYGE+ G   W+V+NSWGD   + G
Sbjct: 293 FDASHPSFKDYESDIYSEENCGTTLDDVTHAMLVVGYGEELGEPYWLVKNSWGDKWGEKG 352

Query: 319 YFQIERGANACGIESYA 335
           Y ++ RG N C +  ++
Sbjct: 353 YMRVRRGVNMCAVAGFS 369


>gi|47224192|emb|CAG13112.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 327

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 149/321 (46%), Gaps = 58/321 (18%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL----R 98
           FK+++   N+ Y+   E   R + F ++ +  +++ G + S      +    T      R
Sbjct: 29  FKSWMALHNKAYSVQ-EFHQRLQIFTENKRRIEKHNGGNHSFTMGLNQFSDMTFAEFRKR 87

Query: 99  LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
               E +   A +    K        P P+S+DWR +K   + PV++QG CGSCW F+TT
Sbjct: 88  FLWSEPQNCSATKGSYMK-----TNSPQPESIDWR-TKGNYVTPVKNQGACGSCWTFSTT 141

Query: 159 AILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPY 215
             LES  A+    L PLS+ QLV+C  D  N  CNGG    AFEY+K   GL +++ YPY
Sbjct: 142 GCLESVTAINTGKLVPLSEQQLVDCAWDFNNHGCNGGLPSQAFEYIKYNKGLMTESGYPY 201

Query: 216 RNKENITFRCTYEKEKAKVFVQD----------------------TWVTSGVDHMMHLLQ 253
              E    +C Y+ E A  FV++                      ++     D  MH   
Sbjct: 202 TAFEG---KCKYKPELAAAFVKNVVNITAYDEKGMEDAVATHNPVSFAFEVTDDFMHYKG 258

Query: 254 SGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILTWIVRNSWGD 312
               GVY + R  ++ D              K++HAV  VGYG  N  +  WIV+NSWG 
Sbjct: 259 ----GVYSSSRCHKTTD--------------KVNHAVLAVGYGNNNSSVPYWIVKNSWGP 300

Query: 313 IGPDHGYFQIERGANACGIES 333
              ++GYF IERG N CG+ +
Sbjct: 301 YWGENGYFLIERGKNMCGLAA 321


>gi|7242888|dbj|BAA92495.1| cysteine protease [Vigna mungo]
          Length = 364

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 104/319 (32%), Positives = 157/319 (49%), Gaps = 45/319 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEI-LQ 93
           F  +  K+ +TY    E   RF  FK + +        +    +G +  SD +  E   Q
Sbjct: 50  FSNFKAKFGKTYATKEEHDHRFGVFKSNLRRARLHAQLDPSAVHGVTKFSDLTAAEFQRQ 109

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GL+  G     L A+ ++            LPK  DWR  K  V N V+ QG CGSCW
Sbjct: 110 FLGLKPLG-----LPANAQKAPILPTNN----LPKDFDWRD-KGAVTN-VKDQGACGSCW 158

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-K 203
           +F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+  
Sbjct: 159 SFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILG 218

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYL 261
             G++ + DYPY  +++    C ++K K    V +  V S  +  +  +L+++GP+ V +
Sbjct: 219 AGGVQREEDYPYAGRDS---SCKFDKSKIAASVANYSVISLDEDQIAANLVKNGPLAVGI 275

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G       + C   +LDH V IVGYGE             WI++NSWG+  
Sbjct: 276 NAVYMQTYIGG--VSCPYIC-AKRLDHGVQIVGYGESGYAPIRFKEKPYWIIKNSWGESW 332

Query: 315 PDHGYFQIERGANACGIES 333
            ++GY++I RG NACG++S
Sbjct: 333 GENGYYKICRGQNACGVDS 351


>gi|395851695|ref|XP_003798388.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Otolemur garnettii]
          Length = 491

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 107/318 (33%), Positives = 163/318 (51%), Gaps = 25/318 (7%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYF------KQDGKETDE---YYGTSGSSDR 86
           S++ +  FK ++  +NRTY    E + R   F       Q  +  D+    YG +  SD 
Sbjct: 187 SMQMLSVFKNFLTTYNRTYESKEETQWRLSIFINNMVRAQKIQALDQGTARYGITKFSDL 246

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E   RT + L    +E     + RV K + +    P P   DWR +K  V N V++Q
Sbjct: 247 TEEEF--RT-IYLNPLLRED-PGKKMRVAKPVGD----PAPPEWDWR-NKGAVTN-VKNQ 296

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 297 GMCGSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCDKMDKACLGGLPSNAYSAIKNLG 356

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y+ +      C +  EKAKV++ D+   S  +  +   L + GPI V +N 
Sbjct: 357 GLETEEDYSYQGQMQA---CNFSAEKAKVYINDSVELSHNEQKLAAWLAKKGPISVAINA 413

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
             ++ Y     R     C P  +DHAV IVGYG ++ I  W ++NSWG    + GY+ + 
Sbjct: 414 FGMQFYRHGISRPLRPLCTPWLIDHAVLIVGYGNRSDIPFWAIKNSWGTDWGEQGYYYLH 473

Query: 324 RGANACGIESYAYLASVK 341
           RG+ ACG+ + A  A V+
Sbjct: 474 RGSGACGVNTMASSAVVE 491


>gi|41019551|tpe|CAD66657.1| TPA: putative cysteine proteinase precursor [Hordeum vulgare subsp.
           vulgare]
 gi|326489967|dbj|BAJ94057.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525847|dbj|BAJ93100.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 377

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 108/343 (31%), Positives = 161/343 (46%), Gaps = 51/343 (14%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSG 82
           DL  DS      F  ++ ++ +TY D  E   R   FK + +    +        +G + 
Sbjct: 46  DLELDS-----QFVGFVQRFGKTYRDAEEHAHRLSVFKANLRRARRHQLLDPSAEHGVTK 100

Query: 83  SSDRSPQEILQRT--GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
            SD +P E  +RT  GL+ T +   R  A        L       LP+  DWR      +
Sbjct: 101 FSDLTPAE-FRRTYLGLKTTRRSFLREMAGSAHDAPVLPTDG---LPEDFDWRDHGA--V 154

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCN 191
            PV++QG CGSCW+F+ +  LE    L    +  LS+ QLV+CDH          +  CN
Sbjct: 155 GPVKNQGSCGSCWSFSASGALEGANYLASGKMEVLSEQQLVDCDHECDPSEPDSCDAGCN 214

Query: 192 GGNIDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH--- 247
           GG +  AF Y +K  GLE + DYPY  K+     C ++K K    VQ+  V + VD    
Sbjct: 215 GGLMTSAFSYLLKSGGLEREKDYPYTGKDGT---CKFDKSKIAASVQNYSVVA-VDEEQI 270

Query: 248 MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT---- 303
             +L++ GP+ + +N   +++Y G       + C  H LDH V +VGYG      +    
Sbjct: 271 AANLVKYGPLAIGINAAYMQTYIGG--VSCPYICGRH-LDHGVLLVGYGASGFAPSRFKE 327

Query: 304 ---WIVRNSWGDIGPDHGYFQIERGANA---CGIESYAYLASV 340
              WI++NSWG+   D GY++I RG+N    CG++S     S 
Sbjct: 328 KPYWIIKNSWGENWGDKGYYKICRGSNVRNKCGVDSMVSTVSA 370


>gi|7381219|gb|AAF61440.1|AF138264_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
          Length = 368

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 102/324 (31%), Positives = 153/324 (47%), Gaps = 53/324 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ + Y  D E   R   FK + +    +        +G +  SD +P E  +R
Sbjct: 51  FTVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDLTPTE-FRR 109

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGP------LPKSLDWRQSKVKVLNPVESQGR 148
             L L             R  KF  + K  P      LP   DWR      + PV++QG 
Sbjct: 110 KFLGL------------NRRLKFPADAKTAPILPTDELPSDFDWRDHGA--VTPVKNQGT 155

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAF 199
           CGSCW+F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AF
Sbjct: 156 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAF 215

Query: 200 EY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGP 256
           EY +K  GL  + DYPY    N    C ++K K    V +  V S  +  +  +L+++GP
Sbjct: 216 EYTLKAGGLMREEDYPYTG--NDLQVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGP 273

Query: 257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNS 309
           + V +N   +++Y G       + C+  +LDH V +VGYG              WI++NS
Sbjct: 274 LAVAINAVFMQTYIGG--VSCPYICS-KRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNS 330

Query: 310 WGDIGPDHGYFQIERGANACGIES 333
           WG+   ++GY++I RG N CG++S
Sbjct: 331 WGESWGENGYYKICRGRNVCGVDS 354


>gi|7211745|gb|AAF40416.1|AF216785_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
 gi|7381223|gb|AAF61442.1|AF138266_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
          Length = 366

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 102/324 (31%), Positives = 153/324 (47%), Gaps = 53/324 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ + Y  D E   R   FK + +    +        +G +  SD +P E  +R
Sbjct: 49  FAVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQQLDPAAVHGVTQFSDLTPTE-FRR 107

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGP------LPKSLDWRQSKVKVLNPVESQGR 148
             L L             R  KF  + K  P      LP   DWR      + PV++QG 
Sbjct: 108 KFLGLN------------RRLKFPADAKTAPILPTDELPSDFDWRDRGA--VTPVKNQGT 153

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAF 199
           CGSCW+F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AF
Sbjct: 154 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAF 213

Query: 200 EY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGP 256
           EY +K  GL  + DYPY    N    C ++K K    V +  V S  +  +  +L+++GP
Sbjct: 214 EYTLKAGGLMREEDYPYTG--NDLQVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGP 271

Query: 257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNS 309
           + V +N   +++Y G       + C+  +LDH V +VGYG              WI++NS
Sbjct: 272 LAVAINAVFMQTYIGG--VSCPYICS-KRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNS 328

Query: 310 WGDIGPDHGYFQIERGANACGIES 333
           WG+   ++GY++I RG N CG++S
Sbjct: 329 WGESWGENGYYKICRGRNVCGVDS 352


>gi|31981819|ref|NP_034115.2| cathepsin W preproprotein [Mus musculus]
 gi|341940311|sp|P56203.2|CATW_MOUSE RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
           Precursor
 gi|26353368|dbj|BAC40314.1| unnamed protein product [Mus musculus]
 gi|44890089|gb|AAS48498.1| cathepsin W precursor [Mus musculus]
 gi|148701190|gb|EDL33137.1| cathepsin W, isoform CRA_b [Mus musculus]
 gi|162317774|gb|AAI56226.1| Cathepsin W [synthetic construct]
 gi|162318342|gb|AAI56999.1| Cathepsin W [synthetic construct]
          Length = 371

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 92/335 (27%), Positives = 164/335 (48%), Gaps = 27/335 (8%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK----QDGKETDEYYGTSGSSD 85
           +D     ++  + FK + +++NR+Y +  E   R   F     Q  +   E  GT+   +
Sbjct: 27  KDAGPRPLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGE 86

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
               ++ +    +L G+E+   E      KK  +      +P++ DWR++K  +++ V++
Sbjct: 87  TPFSDLTEEEFGQLYGQERSP-ERTPNMTKKVESNTWGESVPRTCDWRKAK-NIISSVKN 144

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAFEYVKQ 204
           QG C  CWA A    +++   +  +    +S  +L++C+     CNGG + D     +  
Sbjct: 145 QGSCKCCWAMAAADNIQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLNN 204

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMHLLQ-SGPIGVYLN 262
            GL S+ DYP++       RC  +K K   ++QD T +++    + H L   GPI V +N
Sbjct: 205 SGLASEKDYPFQGDRK-PHRCLAKKYKKVAWIQDFTMLSNNEQAIAHYLAVHGPITVTIN 263

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILT----------------WI 305
            +L++ Y    I+    +C+P ++DH+V +VG+G EK G+ T                WI
Sbjct: 264 MKLLQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKEKEGMQTGTVLSHSRKRRHSSPYWI 323

Query: 306 VRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
           ++NSWG    + GYF++ RG N CG+  Y + A V
Sbjct: 324 LKNSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQV 358


>gi|403333364|gb|EJY65772.1| Cathepsin L [Oxytricha trifallax]
          Length = 338

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 98/308 (31%), Positives = 153/308 (49%), Gaps = 32/308 (10%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTG 101
           AF  ++ K+ ++Y    E   R + FKQ+  +         S + +  ++  R GL    
Sbjct: 42  AFTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKV--------SMNNARNDVTYRLGLN--- 90

Query: 102 KEKERLEADRERVKKFLNERKKGP-------LPKS--LDWRQSKVKVLNPVESQGRCGSC 152
           K  +  EA+ +R+  F  ++ K P        PK+  ++W +     + PV+ QG+CGSC
Sbjct: 91  KFADYTEAEYKRLLGFGGQKNKNPRNIKVLGAPKNDGVNWVEQGA--VTPVKDQGQCGSC 148

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVKQYGLESQ 210
           W+F+ T  +E    +   TLY LS+ QLV+C    GN  C GG +D AF+YV+Q  LE++
Sbjct: 149 WSFSATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVEQTALETE 208

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYL--NHRLIES 268
             YPY   ++     +    K   FV  T   + V+ +   L  GP+ V +  +  + + 
Sbjct: 209 DQYPYEAVDDTCRASSAGVVKVDSFVDVT--PNNVNELKAALDKGPVSVAIEADQMVFQF 266

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-N 327
           Y G  I  ND +C    LDH V  VGYG ++G   ++V+NSWG    + GY +I     N
Sbjct: 267 YSGGVI--NDASCGT-TLDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGYVKIAASPDN 323

Query: 328 ACGIESYA 335
            CGI S A
Sbjct: 324 ICGILSQA 331


>gi|7211741|gb|AAF40414.1|AF216783_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
          Length = 368

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 102/324 (31%), Positives = 153/324 (47%), Gaps = 53/324 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ + Y  D E   R   FK + +    +        +G +  SD +P E  +R
Sbjct: 51  FTVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDLTPTE-FRR 109

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGP------LPKSLDWRQSKVKVLNPVESQGR 148
             L L             R  KF  + K  P      LP   DWR      + PV++QG 
Sbjct: 110 KFLGL------------NRRLKFPADAKTAPILPTDELPSDFDWRDHGA--VTPVKNQGT 155

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAF 199
           CGSCW+F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AF
Sbjct: 156 CGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAF 215

Query: 200 EY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGP 256
           EY +K  GL  + DYPY    N    C ++K K    V +  V S  +  +  +L+++GP
Sbjct: 216 EYTLKAGGLMREEDYPYTG--NDLQVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGP 273

Query: 257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNS 309
           + V +N   +++Y G       + C+  +LDH V +VGYG              WI++NS
Sbjct: 274 LAVAINAVFMQTYIGG--VSCPYICS-KRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNS 330

Query: 310 WGDIGPDHGYFQIERGANACGIES 333
           WG+   ++GY++I RG N CG++S
Sbjct: 331 WGESWGENGYYKICRGRNVCGVDS 354


>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
          Length = 364

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 95/310 (30%), Positives = 155/310 (50%), Gaps = 22/310 (7%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQD------GKETDE---YYGTSGSSDRSPQEILQ 93
           F ++I + ++ Y +++E   RF  FK++       +E D+    YG +  +D SP+E  +
Sbjct: 64  FTSFIERHDKVYRNESEALKRFGIFKRNLEIIRSAQENDKGTAIYGINQFADLSPEE-FK 122

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           +T L  T K+ +      +   + ++ ++  PLP+S DWR+     +  V+++G C +CW
Sbjct: 123 KTHLPHTWKQPDHPNRIVDLAAEGVDPKE--PLPESFDWREHGA--VTKVKTEGHCAACW 178

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG-NIDVAFEYVKQYGLESQAD 212
           AF+ T  +E Q  L KK L  LS  QL++CD  +  CNGG  +D   E V+  GLE +  
Sbjct: 179 AFSVTGNIEGQWFLAKKKLVSLSAQQLLDCDVVDEGCNGGFPLDAYKEIVRMGGLEPEDK 238

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIESYD 270
           YPY  K     +C        V++  +      +  M   L++ GPI + +    I+ Y 
Sbjct: 239 YPYEAKAE---QCRLVPSDIAVYINGSVELPHDEEKMRAWLVKKGPISIGITVDDIQFYK 295

Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACG 330
           G   R     C    + H   +VGYG +  I  WI++NSWG    + GY+++ RG NAC 
Sbjct: 296 GGVSRPT--TCRLSSMIHGALLVGYGVEKNIPYWIIKNSWGPNWGEDGYYRMVRGENACR 353

Query: 331 IESYAYLASV 340
           I  +   A V
Sbjct: 354 INRFPTSAVV 363


>gi|13625989|gb|AAK35220.1|AF362769_1 pre-procathepsin L [Paragonimus westermani]
          Length = 235

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 86/225 (38%), Positives = 122/225 (54%), Gaps = 11/225 (4%)

Query: 112 ERVKKF-LNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKK 170
           ERV +  LN+ +  P   S+DWR  K   + PVE QG CGSCWAF+ TA +E Q  L   
Sbjct: 9   ERVDRVQLNDLQTAP--ASVDWR--KKGAVGPVEHQGSCGSCWAFSVTANVEGQWFLKTG 64

Query: 171 TLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEK 229
            L  LSK QLV+CD  +  C+GG     ++ +K+ G LE Q+ YPY   E     C  ++
Sbjct: 65  RLVSLSKQQLVDCDRLDHGCSGGYPPYTYKEIKRMGGLELQSAYPYTGWEQA---CRLDR 121

Query: 230 EKAKVFVQDTWVTSGVD--HMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLD 287
            K    + D+ V    +      L + GP+   LN   ++ Y    +  +++AC+P  L+
Sbjct: 122 SKLFAKIDDSIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQFYRYGILHPSEYACSPEGLN 181

Query: 288 HAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIE 332
           HAV  VGY  + G+  W VRNSWG    ++GYF+I RG   CGI+
Sbjct: 182 HAVLTVGYDTERGVPYWTVRNSWGTRWGENGYFRIYRGDGTCGID 226


>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
          Length = 326

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 101/331 (30%), Positives = 159/331 (48%), Gaps = 34/331 (10%)

Query: 28  VWRDLAYDSIKQVD----AFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKET 74
           +W  LA  +  + D     ++ + +K+ +TY++D++ + RFE FK         Q+ ++ 
Sbjct: 13  IWSALARTTQVEPDNARALYEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQG 71

Query: 75  DEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLP-KSLDWR 133
              YG +  SD + +E   R           R+  D   V + L   +   +  +  DWR
Sbjct: 72  TAQYGVTQFSDLTSEEFKTRY---------LRMRFDGPIVSEDLTPEEDVTMDNEKFDWR 122

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
           +     + PV  QG+CGSCWAF+    +E Q       L  LS+ QLV+CD+ +  C+GG
Sbjct: 123 EHGA--VGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDYLDGGCDGG 180

Query: 194 NIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--H 250
                +  + K  GLE  +DYPY     I   C  +K K   ++  + +    + +    
Sbjct: 181 YPPQTYTAIQKMGGLELASDYPYTGVGGI---CYMDKSKFVAYINGSTILPLSEKVQAQK 237

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L   GP+   LN   ++ Y G  +R     C+P  ++HAV  VGYG +NG   WIV+NSW
Sbjct: 238 LRAIGPLSSALNADTLQLYKGGIMRPR--LCDPAGVNHAVLTVGYGVQNGKPYWIVKNSW 295

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASVK 341
           G+   + GYF+I RG   CGI S    A +K
Sbjct: 296 GEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326


>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
 gi|255639509|gb|ACU20049.1| unknown [Glycine max]
          Length = 366

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 148/318 (46%), Gaps = 42/318 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ +TY    E   RF  FK +      +        +G +  SD +P E  ++
Sbjct: 51  FSAFKTKFGKTYATQEEHDHRFRIFKNNLLRAKSHQKLDPSAVHGVTRFSDLTPAEFRRQ 110

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G +  RL +D ++            LP   DWR+     +  V++QG CGSCW+
Sbjct: 111 ----FLGLKPLRLPSDAQKAPIL----PTNDLPTDFDWREHGA--VTGVKNQGSCGSCWS 160

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYVKQY 205
           F+    LE    L    L  LS+ QLV+CDH          +  CNGG +  AFEY  Q 
Sbjct: 161 FSAVGALEGAHFLSTGELVSLSEQQLVDCDHECDPEERGACDSGCNGGLMTTAFEYTLQA 220

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            GL  + DYPY  ++     C ++K K    V +  V S  +  +  +L+Q+GP+ V +N
Sbjct: 221 GGLMREKDYPYTGRDRGP--CKFDKSKVAASVANFSVVSLDEEQIAANLVQNGPLAVGIN 278

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y G       + C  H LDH V +VGYG              WI++NSWG+   
Sbjct: 279 AVFMQTYIGG--VSCPYICGKH-LDHGVLLVGYGSGAYAPIRFKEKPYWIIKNSWGESWG 335

Query: 316 DHGYFQIERGANACGIES 333
           + GY++I RG N CG++S
Sbjct: 336 EEGYYKICRGRNVCGVDS 353


>gi|194705198|gb|ACF86683.1| unknown [Zea mays]
 gi|413936851|gb|AFW71402.1| cysteine protease1 [Zea mays]
          Length = 371

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 102/329 (31%), Positives = 157/329 (47%), Gaps = 42/329 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F +++ ++ ++Y D +E   R   FK + +    +        +G +  SD +P E  +R
Sbjct: 48  FLSFVQRFGKSYKDADEHAYRLSVFKANLRRARRHQLLDPSAEHGVTKFSDLTPAE-FRR 106

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           T L L    +  L    E   +       G LP   DWR      + PV++QG CGSCW+
Sbjct: 107 TYLGLRKSRRALLRELGESAHEAPVLPTDG-LPDDFDWRDHGA--VGPVKNQGSCGSCWS 163

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-KQ 204
           F+ +  LE    L    L  LS+ Q V+CDH          +  CNGG +  AF Y+ K 
Sbjct: 164 FSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQKA 223

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHM---MHLLQSGPIGVYL 261
            GLES+ DYPY   +    +C ++K K    VQ+  V S VD      +L++ GP+ + +
Sbjct: 224 GGLESEKDYPYTGSDG---KCKFDKSKIVASVQNFSVVS-VDEAQISANLIKHGPLAIGI 279

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G       + C  H LDH V +VGYG              WI++NSWG+  
Sbjct: 280 NAAYMQTYIGG--VSCPYICGRH-LDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENW 336

Query: 315 PDHGYFQIERGANA---CGIESYAYLASV 340
            ++GY++I RG+N    CG++S     S 
Sbjct: 337 GENGYYKICRGSNVRNKCGVDSMVSTVSA 365


>gi|162459555|ref|NP_001105685.1| cysteine proteinase 1 precursor [Zea mays]
 gi|1706260|sp|Q10716.1|CYSP1_MAIZE RecName: Full=Cysteine proteinase 1; Flags: Precursor
 gi|643597|dbj|BAA08244.1| cysteine proteinase [Zea mays]
          Length = 371

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 102/329 (31%), Positives = 157/329 (47%), Gaps = 42/329 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F +++ ++ ++Y D +E   R   FK + +    +        +G +  SD +P E  +R
Sbjct: 48  FLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQLLDPSAEHGVTKFSDLTPAE-FRR 106

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           T L L    +  L    E   +       G LP   DWR      + PV++QG CGSCW+
Sbjct: 107 TYLGLRKSRRALLRELGESAHEAPVLPTDG-LPDDFDWRDHGA--VGPVKNQGSCGSCWS 163

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-KQ 204
           F+ +  LE    L    L  LS+ Q V+CDH          +  CNGG +  AF Y+ K 
Sbjct: 164 FSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQKA 223

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHM---MHLLQSGPIGVYL 261
            GLES+ DYPY   +    +C ++K K    VQ+  V S VD      +L++ GP+ + +
Sbjct: 224 GGLESEKDYPYTGSDG---KCKFDKSKIVASVQNFSVVS-VDEAQISANLIKHGPLAIGI 279

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G       + C  H LDH V +VGYG              WI++NSWG+  
Sbjct: 280 NAAYMQTYIGG--VSCPYICGRH-LDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENW 336

Query: 315 PDHGYFQIERGANA---CGIESYAYLASV 340
            ++GY++I RG+N    CG++S     S 
Sbjct: 337 GENGYYKICRGSNVRNKCGVDSMVSTVSA 365


>gi|157862755|gb|ABV90500.1| cathepsin L, partial [Fasciola gigantica]
          Length = 251

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 86/249 (34%), Positives = 135/249 (54%), Gaps = 17/249 (6%)

Query: 104 KERLEADRERVKKFLNE-----RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
           K +  ++  R   FL+       K   +P S+DWR+S    +  V+ QG CGSCWAF+TT
Sbjct: 6   KAKYLSEMPRASAFLSHGMPYRAKNRAVPTSIDWRESGY--VTEVKDQGGCGSCWAFSTT 63

Query: 159 AILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYR 216
             +E Q    ++     S+ QLV+C  D GN  C+GG ++ A+EY++ +GLE+++ YPYR
Sbjct: 64  GAMEGQYMKSQRINISFSEQQLVDCSGDFGNHGCSGGLMEKAYEYLRHFGLETESSYPYR 123

Query: 217 NKENITFRCTYEKEKAKVFVQDTWVTSGVDH--MMHLLQ-SGPIGVYLNHRLIESYDGNP 273
             E     C Y+K+     + D ++    D   + +L+   GP  V L+  +      + 
Sbjct: 124 ADEG---PCQYDKQLGVAQLSDYYIVHSQDEVALKNLIGVEGPAAVALDVNIDFMMYKSG 180

Query: 274 IRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIE 332
           I +++  C+   L+HA+  VGYG ++G   WIV+NSWG    +HGY ++ R   N CGI 
Sbjct: 181 IYQDE-ICSSRYLNHALLAVGYGTEDGTEYWIVKNSWGSRWGEHGYIRLARNRDNMCGIA 239

Query: 333 SYAYLASVK 341
           + A L  VK
Sbjct: 240 TLASLPIVK 248


>gi|357162946|ref|XP_003579573.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
          Length = 376

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 100/328 (30%), Positives = 154/328 (46%), Gaps = 54/328 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F +++ ++N++Y D +E   R   F  + +    +        +G +  SD +P E   R
Sbjct: 52  FASFVQRFNKSYRDADEHAHRLSVFTANLRRARRHQRLDPSAVHGVTKFSDLTPDEFRDR 111

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGP------LPKSLDWRQSKVKVLNPVESQGR 148
                 G  K R    R  +K         P      LP   DWR+     + PV+ QG 
Sbjct: 112 ----FLGLRKYR----RSFLKGLSGSAHDAPALPTDGLPTEFDWREHGA--VGPVKDQGS 161

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAF 199
           CGSCW+F+T+  LE    L    L  LS+ Q+V+CDH          +  CNGG +  AF
Sbjct: 162 CGSCWSFSTSGALEGAHYLATGKLEVLSEQQMVDCDHECDPSEPRACDAGCNGGLMTTAF 221

Query: 200 EYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSG 255
            Y+ K  GLE++ DYPY  +      C ++K K    V++ + T  VD      +L++ G
Sbjct: 222 SYLAKAGGLETEKDYPYTGRGGA---CKFDKSKIAAQVKN-FSTVAVDEDQIAANLVKHG 277

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRN 308
           P+ + +N   +++Y G       + C  H LDH V +VGYG              WI++N
Sbjct: 278 PLAIGINAVFMQTYIGG--VSCPFICGRH-LDHGVLLVGYGSAGYAPLRFKEKPYWIIKN 334

Query: 309 SWGDIGPDHGYFQIERGA---NACGIES 333
           SWG+   + GY++I RGA   N CG++S
Sbjct: 335 SWGENWGESGYYKICRGAHVKNKCGVDS 362


>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
          Length = 326

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 103/331 (31%), Positives = 159/331 (48%), Gaps = 34/331 (10%)

Query: 28  VWRDLAYDSIKQVD----AFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKET 74
           +W  LA  +  + D     ++ + +K+ +TY++D++ + RFE FK         Q+ ++ 
Sbjct: 13  IWSALARTTQVEPDNARALYEEFTLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQG 71

Query: 75  DEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLP-KSLDWR 133
              YG +  SD + +E   R           R+  D   V + L   +   +  +  DWR
Sbjct: 72  TAQYGVTQFSDLTSEEFKTRY---------LRMRFDGPIVSEDLTPEEDVTMDNEKFDWR 122

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
           +     + PV  QG+CGSCWAF+    +  Q       L  LS+ QLV+CD+ +  C+GG
Sbjct: 123 EHGA--VGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGG 180

Query: 194 NIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--H 250
                +  + K  GLE  +DYPY     I   C  +K K   +V  + +    + +    
Sbjct: 181 YPPQTYTAIQKMGGLELASDYPYTGVGGI---CHMDKSKFVAYVNGSTILPLSEKVQAQK 237

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L   GP+   LN   ++ Y G  I R  W C+P  ++HAV  VGYG +NG   WIV+NSW
Sbjct: 238 LRAIGPLSSALNADTLQLYKGG-IMRPKW-CDPAGVNHAVLTVGYGVQNGKPYWIVKNSW 295

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASVK 341
           G+   + GYF+I RG   CGI S    A +K
Sbjct: 296 GEDFGEKGYFRIYRGDGTCGINSIVTTAIIK 326


>gi|38045864|gb|AAR08900.1| cathepsin L [Fasciola gigantica]
          Length = 326

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 125/224 (55%), Gaps = 16/224 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           +P+S+DWR+     +  V+ QG CGSCWAF+ T  +E Q    +K     S+ QLV+C  
Sbjct: 108 VPESIDWRE--FGYVTEVKDQGDCGSCWAFSATGAMEGQYMKNQKANISFSEQQLVDCSG 165

Query: 184 DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKE----KAKVFVQDT 239
           D+GN  C+GG ++ A+EY+ + GLE+++ YPY+ +E     C Y+      K   F  D 
Sbjct: 166 DYGNRGCSGGFMEHAYEYLYEVGLETESSYPYKAEEG---PCKYDSRLGVAKVNGFYFDH 222

Query: 240 W-VTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
           + V S + H++       + V +    +    G    RN   C+  KL+HA+ +VGYG +
Sbjct: 223 FGVESKLAHLVGDKGPAAVAVDVESDFLMYRGGIYASRN---CSSEKLNHAMLVVGYGTQ 279

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASVK 341
           +G   WIV+NSWG +  DHGY ++ R   N CGI S+A L  V+
Sbjct: 280 DGTDYWIVKNSWGSLWGDHGYIRMARNRDNMCGIASFASLPVVE 323


>gi|340504799|gb|EGR31212.1| papain family cysteine protease, putative [Ichthyophthirius
           multifiliis]
          Length = 250

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 80/220 (36%), Positives = 122/220 (55%), Gaps = 12/220 (5%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP   DWR+  +  + PV+ Q  CG CW FATT ++ESQ AL    L   S+ QL++CD 
Sbjct: 39  LPSYFDWREQGI--ITPVKYQDTCGGCWTFATTGVIESQYALKYNKLVNFSEQQLIDCDS 96

Query: 186 GNLNCNGGNIDVAFEYVKQY-GLESQADY-PYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
            N  C GG +  A++ +++  GLE+  DY  Y N +    +C  +  K    V + +  S
Sbjct: 97  INDGCRGGLMTDAYKAIQEMGGLETSEDYGEYLNSKG---QCKIDSNKVSAKVINWYQIS 153

Query: 244 GVDHMM--HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGI 301
             +  +   L+Q+GPI V +N R ++ Y G  +   D       ++HAV IVGYGE+NG 
Sbjct: 154 EDEEAIRRELVQNGPIAVGVNARFLQFYQGGIL---DPKLCDDSINHAVLIVGYGEENGK 210

Query: 302 LTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
             WI++N WG     +GYF++ RG   CG+ +YA +A ++
Sbjct: 211 KYWIIKNQWGKSWGINGYFKLVRGKKQCGVHTYASIAFIE 250


>gi|363737841|ref|XP_001232765.2| PREDICTED: pro-cathepsin H [Gallus gallus]
          Length = 327

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 87/236 (36%), Positives = 120/236 (50%), Gaps = 39/236 (16%)

Query: 121 RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQL 180
           R  GP P+++DWR+ K   + PV++QG CGSCW F+TT  LES +A+    L  L++ QL
Sbjct: 103 RSDGPCPEAVDWRK-KGNFVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQQL 161

Query: 181 VECDHG--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           V+C     N  C+GG    AFEY+    GL  +  YPYR +      C ++ +KA  FV+
Sbjct: 162 VDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAYPYRAQNGT---CKFQPDKAIAFVK 218

Query: 238 DTWVTSGVDHMMHLLQSG---PI---------------GVYLNHRLIESYDGNPIRRNDW 279
           D    +  D    +   G   P+               GVY N R   +           
Sbjct: 219 DVINITQYDEAGMVEAVGKHNPVSFAFEVTSDFMHYRKGVYSNPRCEHT----------- 267

Query: 280 ACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
              P K++HAV  VGYGE++G   WIV+NSWG +    GYF IERG N CG+ + A
Sbjct: 268 ---PDKVNHAVLAVGYGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGLAACA 320


>gi|115446097|ref|NP_001046828.1| Os02g0469600 [Oryza sativa Japonica Group]
 gi|47497527|dbj|BAD19579.1| putative cysteine proteinase 1 precursor [Oryza sativa Japonica
           Group]
 gi|113536359|dbj|BAF08742.1| Os02g0469600 [Oryza sativa Japonica Group]
 gi|215701326|dbj|BAG92750.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215704370|dbj|BAG93804.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215708762|dbj|BAG94031.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218200777|gb|EEC83204.1| hypothetical protein OsI_28465 [Oryza sativa Indica Group]
 gi|222622835|gb|EEE56967.1| hypothetical protein OsJ_06681 [Oryza sativa Japonica Group]
          Length = 373

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 100/330 (30%), Positives = 156/330 (47%), Gaps = 44/330 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F +++ ++ ++Y D +E   R   FK + +    +        +G +  SD +P E  + 
Sbjct: 50  FASFVQRFGKSYRDADEHAYRLSVFKANLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRA 109

Query: 95  -TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GLR + +   R           L       LP   DWR      + PV++QG CGSCW
Sbjct: 110 YLGLRTSRRAFLRGLGGSAHEAPVLPTDG---LPDDFDWRDHGA--VGPVKNQGSCGSCW 164

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VK 203
           +F+ +  LE    L    +  LS+ Q+V+CDH          +  CNGG +  AF Y +K
Sbjct: 165 SFSASGALEGANYLATGKMDVLSEQQMVDCDHECDSSEPDSCDAGCNGGLMTNAFSYLLK 224

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVY 260
             GLES+ DYPY  ++     C ++K K    VQ+  V S VD      +L++ GP+ + 
Sbjct: 225 SGGLESEKDYPYTGRDGT---CKFDKSKIVTSVQNFSVVS-VDEDQIAANLVKHGPLAIG 280

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDI 313
           +N   +++Y G       + C  H LDH V +VGYG              WI++NSWG+ 
Sbjct: 281 INAAYMQTYIGG--VSCPYICGRH-LDHGVLLVGYGASGFAPIRLKDKAYWIIKNSWGEN 337

Query: 314 GPDHGYFQIERGANA---CGIESYAYLASV 340
             +HGY++I RG+N    CG++S     S 
Sbjct: 338 WGEHGYYKICRGSNVRNKCGVDSMVSTVSA 367


>gi|633096|dbj|BAA04664.1| prepro NTP [Paragonimus westermani]
          Length = 245

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 81/218 (37%), Positives = 114/218 (52%), Gaps = 8/218 (3%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P+ +DWR      + PVE+QG CGSCWAF+T   +E Q  +    L  LSK QLV+CD  
Sbjct: 32  PERMDWRAKGA--VTPVENQGECGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDMA 89

Query: 187 NLNCNGGNIDVAF-EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
              CNGG    ++ E +   GLES++DYPY   E     C   KEK    + D+ V    
Sbjct: 90  AEGCNGGWPASSYLEIMYMGGLESESDYPYVGVEQT---CALNKEKLVAKIDDSIVLGPE 146

Query: 246 --DHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT 303
             DH  +L + GP+   LN   ++ Y    ++     C   +L+HAV  VGY ++  +  
Sbjct: 147 EEDHAAYLAEHGPLSTLLNAVALQYYQSGVLKPTFEECPDTELNHAVLTVGYDKEGDMPY 206

Query: 304 WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
           WI++NSWG    + GYF++ RG   CGI   A  A +K
Sbjct: 207 WIIKNSWGTDWGEKGYFRLFRGDCTCGINRMATSAIIK 244


>gi|121531590|gb|ABM55480.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
          Length = 321

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 100/302 (33%), Positives = 150/302 (49%), Gaps = 35/302 (11%)

Query: 52  RTYTDDNEIKTRFEYFKQDGKETDE------------YYGTSGSSDRSPQEILQRTGLRL 99
           +TY    E +TRF  F+ + +  +E            Y G +  +D + +E     GL+ 
Sbjct: 32  KTYKSLLEERTRFGIFQNNLRTIEEHNAKYDKGEETYYMGVNQFADMTAEEFRHMLGLQ- 90

Query: 100 TGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
               +  L A      + L        P+S+DW Q    +   V++QG+CGSCWAF++T 
Sbjct: 91  -NGARPNLNATLHVFSENLQA------PESIDWTQKGADL--GVKNQGKCGSCWAFSSTG 141

Query: 160 ILESQVALLKKTLYPLSKSQLVECD--HGNLNCN-GGNIDVAFEYVKQYGLESQADYPYR 216
            LE Q A+  K   PLS+ QL++C   +GN +C+ GG +  AF+Y+K  G+E+ + YPY+
Sbjct: 142 SLEGQNAIHHKVKTPLSERQLLDCSSSYGNGDCDEGGLMTNAFKYIKAKGIEAGSSYPYQ 201

Query: 217 NKENITFRCTYEKEKAKVFVQD--TWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPI 274
            +      C Y  +K  + ++       S V+    +   GPI V ++   +  Y G  I
Sbjct: 202 GRVG---SCRYNAQKTILRIKGFKELRASEVELKKAVGTIGPISVAVSSEHLRLYGGGVI 258

Query: 275 RRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIES 333
                 C    LDHAV  VGYG +NG   W +RNSWG    DHGYF++ R A N CG+ S
Sbjct: 259 TTR---C-IKDLDHAVLAVGYGSENGRKYWKIRNSWGKTWGDHGYFKLARDAGNLCGVAS 314

Query: 334 YA 335
            A
Sbjct: 315 MA 316


>gi|7211743|gb|AAF40415.1|AF216784_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
          Length = 368

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 101/324 (31%), Positives = 152/324 (46%), Gaps = 53/324 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ + Y  D E   R   FK + +    +        +G +  SD +P E  +R
Sbjct: 51  FTVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDSTPTE-FRR 109

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGP------LPKSLDWRQSKVKVLNPVESQGR 148
             L L             R  KF  + K  P      LP   DWR      + PV++QG 
Sbjct: 110 KFLGL------------NRRLKFPADAKTAPILPTDELPSDFDWRDRGA--VTPVKNQGT 155

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAF 199
           CG CW+F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AF
Sbjct: 156 CGLCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDFGCNGGLMNSAF 215

Query: 200 EY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGP 256
           EY +K  GL  + DYPY    N    C ++K K    V +  V S  +  +  +L+++GP
Sbjct: 216 EYTLKAGGLMREEDYPYTG--NDLQVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGP 273

Query: 257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNS 309
           + V +N   +++Y G       + C+  +LDH V +VGYG              WI++NS
Sbjct: 274 LAVAINAVFMQTYIGG--VSCPYICS-KRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNS 330

Query: 310 WGDIGPDHGYFQIERGANACGIES 333
           WG+   ++GY++I RG N CG++S
Sbjct: 331 WGESWGENGYYKICRGRNVCGVDS 354


>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like
           [Strongylocentrotus purpuratus]
          Length = 453

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 94/308 (30%), Positives = 145/308 (47%), Gaps = 35/308 (11%)

Query: 41  DAFKTYIVKWNRTYTDD---NEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSP 88
           D F  +++ + R Y  +   NE + R+  F Q+    + +         YG +  +D + 
Sbjct: 154 DLFDKFLMTFKREYRQNDGTNEYEYRYSVFVQNMLTVEMFNQFEQGTAKYGPTKFADMTE 213

Query: 89  QEI--LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
            E   LQ   L+ TG +K+                 +GP+P+  DWR      + PV++Q
Sbjct: 214 AEFRKLQSGPLKKTGIKKQA-------------AIPQGPVPEEYDWRTHGA--VTPVKNQ 258

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFE-YVKQY 205
           G CGSCWAF+    +E Q  + K  L  LS+ +LV+CD  +  C GG +  A+E  +K  
Sbjct: 259 GMCGSCWAFSAIGNMEGQWQIKKGELISLSEQELVDCDKVDGGCEGGEMSDAYEAIIKLG 318

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
           G  S+  YPYR +     +C +     +V +      S  +  M   L   GPI + +N 
Sbjct: 319 GAMSEEKYPYRGENE---KCKFNMTDVRVKINGYVNISKNETEMAGWLAAHGPISIGINA 375

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
            +++ Y G         C+P  LDH V IVGY  K+G   WIV+NSWG    + GY+ + 
Sbjct: 376 LMMQFYFGGIAHPWKIFCSPDSLDHGVLIVGYSVKDGEPYWIVKNSWGKDWGEEGYYLVY 435

Query: 324 RGANACGI 331
           RG   CG+
Sbjct: 436 RGDGTCGL 443


>gi|302754322|ref|XP_002960585.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
 gi|300171524|gb|EFJ38124.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
          Length = 330

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 97/323 (30%), Positives = 155/323 (47%), Gaps = 51/323 (15%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTSGSSDRSPQEILQR 94
           FK++I ++ + Y        R + F+ +          +    +G +  SD + +E  Q+
Sbjct: 21  FKSFIARFGKAYATAEAYAHRLKVFEANLVRAVSHQALDPSAVHGITQFSDLTEEEFKQQ 80

Query: 95  -TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GLR+  + +E   A++  V           LP+  DWR+     +  V++QG CGSCW
Sbjct: 81  FLGLRVPSRLRE---ANKAPV------LPTNDLPEDFDWREHGA--VTEVKNQGACGSCW 129

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG---------NLNCNGGNIDVAFEYV-K 203
           AF+TT  +E    L    L  LS+ QLV+CDH          +  CNGG +  A++YV K
Sbjct: 130 AFSTTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLMTNAYDYVMK 189

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVY 260
             GLE++ DYPY    N   +C +   K    V + + T  +D      +L++ GP+ + 
Sbjct: 190 SGGLETETDYPYTGNSN--GKCQFNANKIVASVAN-FSTVSLDEDQIAANLVKHGPLAIG 246

Query: 261 LNHRLIESYDGN---PIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSW 310
           +N   +++Y G    PI      C+ H +DH V +VGYG K            WI++NSW
Sbjct: 247 INAVFMQTYIGGVSCPI-----ICSKHHIDHGVLLVGYGAKGYAPIRFTEKPYWIIKNSW 301

Query: 311 GDIGPDHGYFQIERGANACGIES 333
           G    + GY++I RG   CG+ +
Sbjct: 302 GATWGEQGYYKICRGHGMCGMNT 324


>gi|302771610|ref|XP_002969223.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
 gi|300162699|gb|EFJ29311.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
          Length = 367

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 97/323 (30%), Positives = 155/323 (47%), Gaps = 51/323 (15%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTSGSSDRSPQEILQR 94
           FK++I ++ + Y        R + F+ +          +    +G +  SD + +E  Q+
Sbjct: 58  FKSFIARFGKAYATAEAYAHRLKVFEANLVRAVSHQALDPSAVHGITQFSDLTEEEFKQQ 117

Query: 95  -TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GLR+  + +E   A++  V           LP+  DWR+     +  V++QG CGSCW
Sbjct: 118 FLGLRVPSRLRE---ANKAPV------LPTNDLPEDFDWREHGA--VTEVKNQGACGSCW 166

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG---------NLNCNGGNIDVAFEYV-K 203
           AF+TT  +E    L    L  LS+ QLV+CDH          +  CNGG +  A++YV K
Sbjct: 167 AFSTTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLMTNAYDYVMK 226

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVY 260
             GLE++ DYPY    N   +C +   K    V + + T  +D      +L++ GP+ + 
Sbjct: 227 SGGLETETDYPYTGNSN--GKCQFNANKIVASVAN-FSTVSLDEDQIAANLVKHGPLAIG 283

Query: 261 LNHRLIESYDGN---PIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSW 310
           +N   +++Y G    PI      C+ H +DH V +VGYG K            WI++NSW
Sbjct: 284 INAVFMQTYIGGVSCPI-----ICSKHHIDHGVLLVGYGAKGYAPIRFTEKPYWIIKNSW 338

Query: 311 GDIGPDHGYFQIERGANACGIES 333
           G    + GY++I RG   CG+ +
Sbjct: 339 GATWGEQGYYKICRGHGMCGMNT 361


>gi|121531592|gb|ABM55481.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
          Length = 318

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 100/302 (33%), Positives = 149/302 (49%), Gaps = 35/302 (11%)

Query: 52  RTYTDDNEIKTRFEYFKQDGKETDE------------YYGTSGSSDRSPQEILQRTGLRL 99
           +TY    E +TRF  F+ + +  +E            Y G +  +D + +E     GL+ 
Sbjct: 29  KTYKSLLEERTRFGIFQNNLRTIEEHNAKYDKGEETYYMGVNQFADMTAEEFRHMLGLQ- 87

Query: 100 TGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
               +  L A      + L        P+S+DW Q    +   V+ QG+CGSCWAF++T 
Sbjct: 88  -NGARPNLNATLHVFSENLQA------PESIDWTQKGADL--GVKDQGKCGSCWAFSSTG 138

Query: 160 ILESQVALLKKTLYPLSKSQLVECD--HGNLNCN-GGNIDVAFEYVKQYGLESQADYPYR 216
            LE Q A+  K   PLS+ QL++C   +GN +C+ GG +  AF+Y+K  G+E+ + YPY+
Sbjct: 139 SLEGQNAIHHKVKTPLSEQQLLDCSSSYGNGDCDEGGLMTNAFKYIKAKGIEAGSSYPYQ 198

Query: 217 NKENITFRCTYEKEKAKVFVQD--TWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPI 274
            +      C Y  +K  + ++       S V+    +   GPI V ++   +  Y G  I
Sbjct: 199 GRVG---SCRYNAQKTILRIKGFKELRASEVELKKAVGTIGPISVAVSSEHLRLYGGGVI 255

Query: 275 RRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIES 333
                 C    LDHAV  VGYG +NG   W +RNSWG    DHGYF++ R A N CG+ S
Sbjct: 256 TTR---C-IKDLDHAVLAVGYGSENGRKYWKIRNSWGKTWGDHGYFKLARDAGNLCGVAS 311

Query: 334 YA 335
            A
Sbjct: 312 MA 313


>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 103/331 (31%), Positives = 159/331 (48%), Gaps = 34/331 (10%)

Query: 28  VWRDLAYDSIKQVD----AFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKET 74
           +W  LA  +  + D     ++ + +K+ +TY++D++ + RFE FK         Q+ ++ 
Sbjct: 13  IWSALARTTQVEPDNARALYEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQG 71

Query: 75  DEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLP-KSLDWR 133
              YG +  SD + +E   R           R+  D   V + L   +   +  +  DWR
Sbjct: 72  TAQYGVTQFSDLTSEEFETRY---------LRMRFDGPIVSEDLTPEEDVTMDNEKFDWR 122

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
           +     + PV  QG+CGSCWAF+    +  Q       L  LS+ QLV+CD+ +  C+GG
Sbjct: 123 EHGA--VGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGG 180

Query: 194 NIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--H 250
                +  + K  GLE  +DYPY     I   C  +K K   +V  + +    + +    
Sbjct: 181 YPPQTYTAIQKMGGLELASDYPYTGVGGI---CHMDKSKFVAYVNGSTILPLSEKVQAQK 237

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L   GP+   LN   ++ Y G  I R  W C+P  ++HAV  VGYG +NG   WIV+NSW
Sbjct: 238 LRAIGPLSSALNADTLQLYKGG-IMRPKW-CDPAGVNHAVLTVGYGVQNGKPYWIVKNSW 295

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASVK 341
           G+   + GYF+I RG   CGI S    A +K
Sbjct: 296 GEDFGEEGYFRIYRGDGTCGINSIVTTARIK 326


>gi|341878255|gb|EGT34190.1| hypothetical protein CAEBREN_02333 [Caenorhabditis brenneri]
          Length = 410

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 78/235 (33%), Positives = 124/235 (52%), Gaps = 22/235 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P S DWR SK  ++ PV++QG CGSCWAFA  A +E+Q A+ K  L  LS+ +LV+CD 
Sbjct: 177 VPDSFDWRSSKSPMVTPVKNQGDCGSCWAFAVVAAIETQYAMKKGALLSLSEQELVDCDV 236

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSG 244
            +  CNGG ++ A  +  + GLE++ADYPY   ++   +C+ + +K +V + D + + + 
Sbjct: 237 LSYGCNGGYLNTALLFAIEKGLETEADYPYVAIQHK--QCSIQTQKIRVKIDDGYHLKAN 294

Query: 245 VDHMMH-LLQSGPI-----------------GVYLNHRLIESYDGNPIRRNDWACNPHKL 286
            D +   + + GP+                  V    + I  Y G     +   C    +
Sbjct: 295 EDQIADWVAREGPVSFCKLLLFLFFFKFFKCSVMPVPKSIMFYRGGIFNPSMAECRGQAV 354

Query: 287 -DHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
            +H +AIVGYG +     WIV+NSWG    + GY ++ RG N CG  +Y +   +
Sbjct: 355 GNHVMAIVGYGREGNQKYWIVKNSWGTSWGEQGYLKMARGVNICGFTNYVFAPHI 409


>gi|294883322|ref|XP_002770704.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239873993|gb|EER02713.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 333

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 89/251 (35%), Positives = 127/251 (50%), Gaps = 27/251 (10%)

Query: 107 LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVA 166
           L+    R  KF+ E     LP S+DWR   V  L PV+ QG CGSCWAF+TT  LE+Q A
Sbjct: 92  LKMSTRRDDKFVIEADTTQLPTSVDWRNKNV--LTPVKDQGSCGSCWAFSTTGALEAQYA 149

Query: 167 LLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFR 224
           +    L  LS+ QLV+C   +GN  C GG +D A+EY+K  GL+ ++ Y Y   +++  +
Sbjct: 150 IATGKLLSLSEQQLVDCSSGYGNNGCEGGLMDDAYEYIKSAGLDQESTYSYNGTDDVC-Q 208

Query: 225 CTYEKEKAKVFVQDTWVTSGVD----HMMHLLQSGPIGVYLNHRLIESYDGNPIRR---- 276
            +  K    +   +      +D     +M  L   P+ V +       Y  +P  R    
Sbjct: 209 GSLAKRSDGIPAGEVTGFHMLDKTEQSLMKALADAPVSVAM-------YAADPDFRFYKS 261

Query: 277 ---NDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA---CG 330
              +   CN  KLDH V  VGYG +NG   +I+RNSWG      GYF ++RG +    C 
Sbjct: 262 GVYSSATCNG-KLDHGVVAVGYGTENGSDYFIIRNSWGSSWGQAGYFYLKRGVSGYGECN 320

Query: 331 IESYAYLASVK 341
           I  Y  +A++K
Sbjct: 321 ILEYMCVATLK 331


>gi|1619903|gb|AAB16996.1| thiol protease isoform B, partial [Glycine max]
          Length = 319

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 102/309 (33%), Positives = 151/309 (48%), Gaps = 44/309 (14%)

Query: 52  RTYTDDNEIKTRFEYFKQDGK-------ETDEYYGTSGSSDRSPQEILQRTGLRLTGKEK 104
           R Y    E   RF  FK + +        T   +G +  SD +P E  ++      G + 
Sbjct: 15  RPYATKEEHDHRFGVFKSNLRRASCTPSSTPRVHGVTKFSDLTPAEFRRQ----FLGLKA 70

Query: 105 ERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQ 164
            R  A  ++      +     LPK  DWR  K  V N V+ QG CGSCW+F+TT  LE  
Sbjct: 71  VRFPAHAQKAPILPTKD----LPKDFDWRD-KGAVTN-VKDQGGCGSCWSFSTTGALEGA 124

Query: 165 VALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYVKQY-GLESQADYP 214
             L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+ Q  G++ + DYP
Sbjct: 125 YYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQSGGVQKEKDYP 184

Query: 215 YRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIESYDGN 272
           Y  ++     C ++K K    V +  V    +  +  +L+++GP+ V +N   +++Y G 
Sbjct: 185 YTGRDGT---CKFDKTKVAATVSNYSVVCLDEEQIAANLVKNGPLAVAINAVFMQTYVGG 241

Query: 273 PIRRNDWACNPHKLDHAVAIVGYGE--------KNGILTWIVRNSWGDIGPDHGYFQIER 324
                 + C  H LDH V +VGYGE        KN    WI++NSWG+   ++GY +I R
Sbjct: 242 --VSCPYICGKH-LDHGVLLVGYGEGAYAPIRFKNKPY-WIIKNSWGESWGENGYDEICR 297

Query: 325 GANACGIES 333
           G N CG++S
Sbjct: 298 GRNVCGVDS 306


>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
          Length = 327

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 101/336 (30%), Positives = 155/336 (46%), Gaps = 30/336 (8%)

Query: 18  YNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK--------- 68
           + +    AI V      DS +++  ++ +   + + Y ++++ K RF  FK         
Sbjct: 9   FALIVSCAIAVSAGRVPDSAREL--YEQFKRGYGKVYANEDDQK-RFAIFKDNLVRAQKL 65

Query: 69  QDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPK 128
           Q   +    YG +  SD +P+E   +               + ++VK+      K   P+
Sbjct: 66  QLKDQGTARYGVTQFSDLTPEEFAAKY---------LSAPVNDDQVKRMRPTGLKAA-PE 115

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNL 188
            +DWR      +  VE+QG CGSCWAF+T   +E Q  +    L  LSK QLV+CD    
Sbjct: 116 RIDWRAKGA--VTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAQ 173

Query: 189 NCNGG-NIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV-- 245
            CNGG       E +   GLES++DYPY   E     C   KEK    + D+ V      
Sbjct: 174 GCNGGWPASSYLEIMYMGGLESESDYPYVGVEQT---CALNKEKLVAKIDDSIVLGPEEE 230

Query: 246 DHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWI 305
           DH  +L + GP+   LN   ++ Y    ++     C   +L+HAV  VGY ++  +  WI
Sbjct: 231 DHAAYLAEHGPLSTLLNAVALQHYQSGVLKPTFDECPDTELNHAVLTVGYDKEGDMPYWI 290

Query: 306 VRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
           ++NSWG    + GYF++ RG   CGI   A  A +K
Sbjct: 291 IKNSWGTDWGEKGYFRLFRGDCTCGINRMATSAIIK 326


>gi|14602252|ref|NP_148795.1| ORF11 cathepsin [Cydia pomonella granulovirus]
 gi|13124000|sp|O91466.1|CATV_GVCPM RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|14591773|gb|AAK70678.1| ORF11 cathepsin [Cydia pomonella granulovirus]
          Length = 333

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 97/311 (31%), Positives = 152/311 (48%), Gaps = 19/311 (6%)

Query: 32  LAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGS 83
           L YD     + FK + +K+N+TY  D E   + E FK + K  +E         +  +  
Sbjct: 21  LTYDLNNSDELFKNFAIKYNKTYVSDEERAIKLENFKNNLKMINEKNMASKYAVFDINEY 80

Query: 84  SDRSPQEILQRT-GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           SD +   +L+RT G RL  K+        E     + +  +  LP++LDWR      + P
Sbjct: 81  SDLNKNALLRRTTGFRLGLKKNPSAFTMTECSVVVIKDEPQALLPETLDWRDKHG--VTP 138

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV 202
           V++Q  CGSCWAF+T A +ES   +       LS+  LV CD+ N  C GG +  A E +
Sbjct: 139 VKNQMECGSCWAFSTIANIESLYNIKYDKALNLSEQHLVNCDNINNGCAGGLMHWALESI 198

Query: 203 KQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQ-SGPIGVY 260
            Q G + S  + PY   + +  +  +E     +     +V    + +  LL  +GPI V 
Sbjct: 199 LQEGGVVSAENEPYYGFDGVCKKSPFE---LSISGSRRYVLQNENKLRELLVVNGPISVA 255

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           ++   + +Y        D   N   L+HAV +VGYG KN +  WI++NSWG    + GYF
Sbjct: 256 IDVSDLINYKAGIA---DICENNEGLNHAVLLVGYGVKNDVPYWILKNSWGAEWGEEGYF 312

Query: 321 QIERGANACGI 331
           +++R  N+CG+
Sbjct: 313 RVQRDKNSCGM 323


>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 368

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 100/321 (31%), Positives = 155/321 (48%), Gaps = 43/321 (13%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEIL 92
           D F  +  K+ + Y    E   RF  FK + +    +        +G +  SD +  E  
Sbjct: 49  DHFSLFKKKFGKVYASREEHDYRFSVFKSNLRRARRHQKLDPSARHGVTQFSDLTRSE-F 107

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           +R  L + G  K   +A++  +    N      LP+  DWR+     + PV++QG CGSC
Sbjct: 108 KRKHLGVKGGFKLPKDANKAPILPTEN------LPEEFDWRERGA--VTPVKNQGSCGSC 159

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-V 202
           W+F+ T  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +
Sbjct: 160 WSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTL 219

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGV 259
           K  GL  + DYPY  K+  T  C  +K K    V +  V S +D      +L+++GP+ V
Sbjct: 220 KTGGLMREEDYPYTGKDGAT--CKLDKSKIVASVSNFSVIS-IDEEQIAANLVKNGPLAV 276

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGD 312
            +N   +++Y G       + C   +L+H V +VGYG              WI++NSWG+
Sbjct: 277 AINAAYMQTYIGG--VSCPYICM-RRLNHGVLLVGYGSAGYAPARFKEKPYWIIKNSWGE 333

Query: 313 IGPDHGYFQIERGANACGIES 333
              + G+++I RG N CG++S
Sbjct: 334 TWGEDGFYKICRGRNVCGVDS 354


>gi|229596403|ref|XP_001009843.3| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|225565321|gb|EAR89598.3| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 324

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 92/303 (30%), Positives = 148/303 (48%), Gaps = 29/303 (9%)

Query: 39  QVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLR 98
           +VD F+ ++ K+   + D+ +++ R   F Q+    ++      S +      L    + 
Sbjct: 32  EVDEFQAWMHKYGFKFADEVQLQYRRSIFYQNKDLVEQL----NSENNGTFHTLNAFAIY 87

Query: 99  LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
              +  +  +  ++R K  L    KG +  S+DWRQ     + PV++QG+CGSCWAF+T 
Sbjct: 88  TKDEFNQLFKGYQKRQKSHLIYSLKGDVAPSIDWRQKNA--VTPVKNQGQCGSCWAFSTV 145

Query: 159 AILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNK 218
             LE   A+    L   S+ Q+V+C   N  CNGG++  A++YV Q G+E++ADYPY+  
Sbjct: 146 GGLEGAYAIATGNLTSFSEQQIVDCSKANAGCNGGDLPPAYKYVVQNGIETEADYPYK-- 203

Query: 219 ENITFRCTYEKEKA----KVFVQDTWVTSGVDHMMHLLQSGPIGVYL--NHRLIESYDGN 272
             +  +C Y+  K     K FVQ T   +  D +   L   P+ + +  + +  + Y   
Sbjct: 204 -GVNQKCAYDASKVVFKPKSFVQVT--PNSPDQLAIALNKEPVPICIEADQKAFQFYTSG 260

Query: 273 PIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER----GANA 328
            I      C  + LDH V  VGY       +WIV+NSWG    ++GY +I R    G   
Sbjct: 261 IISS---GCGTN-LDHCVLAVGYDAD----SWIVKNSWGASWGENGYVRIARTTAKGPGV 312

Query: 329 CGI 331
           CGI
Sbjct: 313 CGI 315


>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
          Length = 368

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 96/313 (30%), Positives = 153/313 (48%), Gaps = 32/313 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           F  +  K+ ++Y    E   RF  FK + +    +     ++      + Q + L     
Sbjct: 53  FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDPTASHG---VTQFSDLTSAEF 109

Query: 103 EKERLEADRERVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
            K+ L   + R+ K  N     P   LP+  DWR+     + PV++QG CGSCW+F+TT 
Sbjct: 110 RKQVLGLRKLRLPKDANTAPILPTNDLPEDFDWREKGA--VGPVKNQGSCGSCWSFSTTG 167

Query: 160 ILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQYGLES 209
            LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +K  GL  
Sbjct: 168 ALEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 227

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIE 267
           + DYPY   +     C ++K K    V +  V S  +  +  +L+++GP+ V +N   ++
Sbjct: 228 EEDYPYTGMDRGA--CKFDKNKVAAGVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQ 285

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGPDHGYF 320
           +Y G       + C+  +LDH V +VGYG              WI++NSWG+   ++G++
Sbjct: 286 TYIGG--VSCPYICS-RRLDHGVLLVGYGSAAYAPVRMKEKPYWIIKNSWGESWGENGFY 342

Query: 321 QIERGANACGIES 333
           +I RG N CG++S
Sbjct: 343 KICRGRNICGVDS 355


>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
          Length = 326

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 103/331 (31%), Positives = 159/331 (48%), Gaps = 34/331 (10%)

Query: 28  VWRDLAYDSIKQVD----AFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKET 74
           +W  LA  +  + D     ++ + +K+ +TY++D++ + RFE FK         Q+ ++ 
Sbjct: 13  IWSALARTTQVEPDNARALYEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQG 71

Query: 75  DEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLP-KSLDWR 133
              YG +  SD + +E   R           R+  D   V + L   +   +  +  DWR
Sbjct: 72  TAQYGVTQFSDLTSEEFKTRY---------LRMRFDGPIVSEDLTPEEDVTMDNEKFDWR 122

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
           +     + PV  QG+CGSCWAF+    +  Q       L  LS+ QLV+CD+ +  C+GG
Sbjct: 123 EHGA--VGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGG 180

Query: 194 NIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--H 250
                +  + K  GLE  +DYPY     I   C  +K K   +V  + +    + +    
Sbjct: 181 YPPQTYTAIQKMGGLELASDYPYTGVGGI---CHMDKSKFVAYVNGSTILPLSEKVQAQK 237

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L   GP+   LN   ++ Y G  I R  W C+P  ++HAV  VGYG +NG   WIV+NSW
Sbjct: 238 LRAIGPLSSALNADTLQLYKGG-IMRPKW-CDPAGVNHAVLTVGYGVQNGKPYWIVKNSW 295

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASVK 341
           G+   + GYF+I RG   CGI S    A +K
Sbjct: 296 GEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326


>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
 gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 377

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 100/318 (31%), Positives = 154/318 (48%), Gaps = 40/318 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ ++Y    E   RF  FK + K    +        +G +  SD +P E  +R
Sbjct: 60  FSVFKQKFGKSYASKEEHDHRFRVFKANLKRAQRHQALDPSATHGVTQFSDLTPSE-FRR 118

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           + L L  + +  L AD  +      +     LP   DWR      ++ V++QG CGSCW+
Sbjct: 119 SFLGLRSR-RLGLPADANKAPILPTDG----LPTDFDWRDKGA--VSEVKNQGSCGSCWS 171

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+ T  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +K 
Sbjct: 172 FSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKS 231

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            GL  + DYPY   +  T  C ++K K    V +  V S  +  +  +L+++GP+ V +N
Sbjct: 232 GGLMKEQDYPYTGTDRGT--CKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAIN 289

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y         + C+ H LDH V +VGYG              WI++NSWG    
Sbjct: 290 AVFMQTYIKG--VSCPYICSKH-LDHGVLLVGYGSDGYAPIRLKDKPYWIIKNSWGANWG 346

Query: 316 DHGYFQIERGANACGIES 333
           ++GY++I RG N CG++S
Sbjct: 347 ENGYYKICRGRNICGVDS 364


>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
          Length = 322

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 98/319 (30%), Positives = 150/319 (47%), Gaps = 32/319 (10%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSD 85
           DS +++  ++ +   + + Y ++++ K RF  FK         Q   +    YG +  SD
Sbjct: 21  DSAREL--YEQFKRDYGKVYANEDDQK-RFAIFKDNLVRAQKLQLRDQGTARYGVTQFSD 77

Query: 86  RSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVE 144
            +P+E   +     L   + ER++    +             P+ +DWR      + PVE
Sbjct: 78  LTPEEFAAKYLSPPLNSDQVERVQPTGLKAA-----------PERMDWRAKGA--VTPVE 124

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVK 203
           +QG CGSCWAF+T   +E Q  +    L  LSK QLV+CD     CNGG    ++ E + 
Sbjct: 125 NQGECGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDMAAEGCNGGWPSSSYLEIMD 184

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWV--TSGVDHMMHLLQSGPIGVYL 261
             GLES+ DYPY   E     C   KEK    + D  V   S  +H+ +L + GP+   L
Sbjct: 185 MGGLESENDYPYVGVEQT---CALNKEKLVAKIDDAVVLGASENEHVDYLAEHGPLSTLL 241

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           N   ++ Y    +  +   C    L+HAV  VGY  +  +  WI++NSWG    + GYF+
Sbjct: 242 NAVALQHYQSGILHPSHKDCPDDDLNHAVLTVGYDREGDMPYWIIKNSWGTDWGEKGYFR 301

Query: 322 IERGANACGIESYAYLASV 340
           + RG   CGI   A  A +
Sbjct: 302 LFRGDCVCGINRMATSAVI 320


>gi|390994427|gb|AFM37363.1| cathepsin F1 [Dictyocaulus viviparus]
          Length = 459

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 154/316 (48%), Gaps = 28/316 (8%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSPQEI 91
           + F  ++ +  + Y   ++   RF  FK         Q+ +E    YG +  SD +P+E 
Sbjct: 155 NQFVDFMGRHEKVYNSKHDTLKRFRVFKRNLKAIRSWQEKEEGTAVYGITQFSDLTPEE- 213

Query: 92  LQRTGLRLTGKE----KERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
            ++  L     E       ++   E V   LNE     LP+S DWR      +  V++QG
Sbjct: 214 FKKIYLPYIWDEPIVPNRMVDLTAEGVH--LNET----LPESFDWRDHGA--VTDVKNQG 265

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYG 206
            CGSCWAF+TT  +E Q  L KK L  LS+ +LV+CD  +  C GG    A+ E ++  G
Sbjct: 266 FCGSCWAFSTTGNIEGQWFLAKKKLVSLSEQELVDCDKVDDGCEGGLPSQAYKEIMRMGG 325

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHR 264
           LE+++ YPY  +      C   + +  V++ D+      +  M   L++ GPI + +N  
Sbjct: 326 LETESAYPYDGRGE---ECHINRTEFAVYINDSVELPHDEESMKAWLVKKGPISIGINAN 382

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
            ++ Y         + C P+ L+H V +VGYG +     WI++NSWG    ++GY+++ R
Sbjct: 383 PLQFYRHGISHPWKFFCEPYMLNHGVLLVGYGSEKNKPYWIIKNSWGPKWGENGYYRLYR 442

Query: 325 GANACGIESYAYLASV 340
           G N CG+      A V
Sbjct: 443 GKNVCGVHEMPTSAVV 458


>gi|255538808|ref|XP_002510469.1| cysteine protease, putative [Ricinus communis]
 gi|223551170|gb|EEF52656.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 98/319 (30%), Positives = 153/319 (47%), Gaps = 44/319 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ + Y    E   RF+ FK + +   ++        +G +  SD +P+E  ++
Sbjct: 51  FTAFKAKFGKNYATQEEHDYRFKVFKANLRRAQKHQLMDPSAVHGVTKFSDLTPREFRRQ 110

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G +K RL AD         +     +P+  DWR      +  V++QG CGSCW+
Sbjct: 111 ----YLGLKKLRLPADAHEAPILPTDG----IPEDFDWRDHGA--VTNVKNQGSCGSCWS 160

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-KQ 204
           F+    LE    L    L  LS+ QLV+CDH          +  CNGG +  AFEY+ K 
Sbjct: 161 FSAAGALEGAHFLATGELVSLSEQQLVDCDHECDPTEYGACDSGCNGGLMTNAFEYILKA 220

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GLE + DYPY   +     C +E+ K    V +  V S VD      +L+Q+GP+ V +
Sbjct: 221 GGLEREEDYPYTGSDRGP--CKFERAKIAASVNNFSVVS-VDEDQIAANLVQNGPLAVGI 277

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G       + C+  + DH V +VGYG              WI++NSWG+  
Sbjct: 278 NAVFMQTYIGG--VSCPYICSKRQ-DHGVVLVGYGSAGYAPVRLKDKPFWIIKNSWGENW 334

Query: 315 PDHGYFQIERGANACGIES 333
            ++GY++I RG N CG+++
Sbjct: 335 GENGYYKICRGRNVCGVDA 353


>gi|403293601|ref|XP_003937801.1| PREDICTED: cathepsin F [Saimiri boliviensis boliviensis]
          Length = 379

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 156/318 (49%), Gaps = 27/318 (8%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           ++K    F+ +++ +NRTY    E + R   F  +     +          YG +  SD 
Sbjct: 75  TVKMASIFRNFVITYNRTYESKEEAQWRLSIFAHNMVRAQKIQALDRGTAQYGVTKFSDL 134

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPVES 145
           + +E   RT + L    +E      E  KK    +  G L P   DWR      +  V+ 
Sbjct: 135 TEEEF--RT-IYLNPLLRE------EPGKKMKQAKSVGDLAPPEWDWRSKGA--VTKVKD 183

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  
Sbjct: 184 QGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKIDKACMGGLPSSAYSAIKNL 243

Query: 206 G-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN 262
           G LE++ DY YR        C++  EKAKV++ D+   S  +  +   L + GPI V +N
Sbjct: 244 GGLETEDDYSYRGHMQA---CSFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAIN 300

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
              ++ Y     R     C+P  +DHAV +VGYG ++ I  W ++NSWG    + GY+ +
Sbjct: 301 AFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDIPFWAIKNSWGTDWGEKGYYYL 360

Query: 323 ERGANACGIESYAYLASV 340
            RG+ ACG+ + A  A V
Sbjct: 361 HRGSGACGVNTMASSAVV 378


>gi|195453400|ref|XP_002073772.1| GK14287 [Drosophila willistoni]
 gi|194169857|gb|EDW84758.1| GK14287 [Drosophila willistoni]
          Length = 610

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 151/315 (47%), Gaps = 29/315 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           F  + +K+ R Y +  E + R   F+Q+ +  ++          YG +  +D +  E  +
Sbjct: 303 FHKFQIKFERRYVNSVERQMRLRIFRQNLRIIEQLNANEMGSAKYGITEFADMTSTEYKE 362

Query: 94  RTGL--RLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
           RTGL  R  G+     +A        +     G LPK  DWRQ     ++ V++QG CGS
Sbjct: 363 RTGLWQRTEGQPTGGQKA-------VVPSYPGGELPKEFDWRQKGA--VSSVKNQGSCGS 413

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQ 210
           CWAF+T   +E   A+    L   S+ +L++CD  +  CNGG  D A++ +++  GLE +
Sbjct: 414 CWAFSTIGNIEGLNAVKTGQLKEFSEQELLDCDTKDSACNGGLPDNAYKAIQEIGGLEYE 473

Query: 211 ADYPYR-NKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESY 269
           ++YPY+  KE   F  T    +   FV D    +       L+ +GPI + +N   ++ Y
Sbjct: 474 SEYPYKARKEQCHFNKTLAHVQVTGFV-DLPKNNETAMQEWLIANGPISIGINANAMQFY 532

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKN------GILTWIVRNSWGDIGPDHGYFQIE 323
            G         C    LDH V IVGYG  +       +  WIV+NSWG    + GY+++ 
Sbjct: 533 RGGVSHPWKILCEKSNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVY 592

Query: 324 RGANACGIESYAYLA 338
           RG N CG+   A  A
Sbjct: 593 RGDNTCGVSEMASSA 607


>gi|215401412|ref|YP_002332715.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
 gi|209483953|gb|ACI47386.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
          Length = 337

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 92/303 (30%), Positives = 153/303 (50%), Gaps = 30/303 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK--------QDGKETDEYYGTSGSSDRSPQEILQR 94
           F+ +I ++N+ Y  ++E K R+  F+        ++ +     Y  +  +D +  E++  
Sbjct: 40  FEKFIAQYNKKYKTEDEKKYRYNIFRHNMESINHKNSRNDSAIYKINRFADMTKNEVV-- 97

Query: 95  TGLRLTGKEKERLEAD--RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
             +R TG     L A+     V     +R++   P S DWR   +  +  V+ QG CG+C
Sbjct: 98  --IRHTGLASGELGANFCETIVVDGPAQRQR---PTSFDWR--TLNKVTSVKDQGMCGAC 150

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQA 211
           WAFA    LESQ A+    L  L++ QLV+CD  ++ C+GG I  A+E +    G+E + 
Sbjct: 151 WAFAGLGALESQYAIKYDRLIDLAEQQLVDCDSVDMGCDGGLIHTAYEQIMHMGGVEQEF 210

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SGPIGVYLNHRLIES 268
           DYPYR +      C  +  K    V+    +V    + +  LL+  GPI + ++   +  
Sbjct: 211 DYPYRAERQ---PCALKPHKFAAGVRSCYRYVLLNEERLEDLLRYVGPIAIAVDAVDLTD 267

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y G  +      C  + L+HAV +VGYG +N +  WI++NSWG    + GY ++ RG N+
Sbjct: 268 YYGGIVS----FCENNGLNHAVLLVGYGVENNVPFWIIKNSWGSDYGEDGYVRVRRGVNS 323

Query: 329 CGI 331
           CG+
Sbjct: 324 CGM 326


>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
          Length = 322

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 97/304 (31%), Positives = 144/304 (47%), Gaps = 28/304 (9%)

Query: 50  WNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSPQEILQRTGLRLT 100
           + + Y ++++ K RF  FK         Q   +    YG +  SD +P+E   +    L+
Sbjct: 34  YGKVYANEDDQK-RFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFAAK---YLS 89

Query: 101 GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
                    + ++VK+      K   P+ +DWR      +  VE+QG CGSCWAF+T   
Sbjct: 90  AP------VNNDQVKRVRPTGLKAA-PERIDWRAKGA--VTAVENQGSCGSCWAFSTAGN 140

Query: 161 LESQVALLKKTLYPLSKSQLVECDHGNLNCNGG-NIDVAFEYVKQYGLESQADYPYRNKE 219
           +E Q  +    L  LSK QLV+CD     CNGG       E +   GLES++DYPY   E
Sbjct: 141 VEGQWFIKTGQLVSLSKQQLVDCDRAAQGCNGGWPASSYLEIMYMGGLESESDYPYVGVE 200

Query: 220 NITFRCTYEKEKAKVFVQDTWVTSGV--DHMMHLLQSGPIGVYLNHRLIESYDGNPIRRN 277
                C   KEK    + D+ V      DH  +L + GP+   LN   ++ Y    ++  
Sbjct: 201 QT---CALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVALQYYQSGVLKPT 257

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYL 337
              C   +L+HAV  VGY ++  +  WI++NSWG    + GYF++ RG   CGI   A  
Sbjct: 258 FEECPDTELNHAVLTVGYDKEGDMPYWIIKNSWGTDWGEKGYFRLFRGDCTCGINRMATS 317

Query: 338 ASVK 341
           A +K
Sbjct: 318 AIIK 321


>gi|403368476|gb|EJY84073.1| Cathepsin L [Oxytricha trifallax]
          Length = 338

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 98/308 (31%), Positives = 152/308 (49%), Gaps = 32/308 (10%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTG 101
           AF  ++ K+ ++Y    E   R + FKQ+  +         S +    ++  R GL    
Sbjct: 42  AFTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKV--------SMNNVRNDVTYRLGLN--- 90

Query: 102 KEKERLEADRERVKKFLNERKKGP-------LPKS--LDWRQSKVKVLNPVESQGRCGSC 152
           K  +  EA+ +R+  F  ++ K P        PK+  ++W +     + PV+ QG+CGSC
Sbjct: 91  KFADYTEAEYKRLLGFGGQKNKNPRNIKVLGAPKNDGVNWVEQGA--VTPVKDQGQCGSC 148

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVKQYGLESQ 210
           W+F+ T  +E    +   TLY LS+ QLV+C    GN  C GG +D AF+YV+Q  LE++
Sbjct: 149 WSFSATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVEQTALETE 208

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYL--NHRLIES 268
             YPY   ++     +    K   FV  T   + V+ +   L  GP+ V +  +  + + 
Sbjct: 209 DQYPYEAVDDTCRASSAGVVKVDSFVDVT--PNNVNELKAALDKGPVSVAIEADQMVFQF 266

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-N 327
           Y G  I  ND +C    LDH V  VGYG ++G   ++V+NSWG    + GY +I     N
Sbjct: 267 YSGGVI--NDASCGT-TLDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGYVKIAASPDN 323

Query: 328 ACGIESYA 335
            CGI S A
Sbjct: 324 ICGILSQA 331


>gi|4972585|gb|AAD34707.1|AF071801_1 cysteine proteinase [Paragonimus westermani]
          Length = 229

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 78/208 (37%), Positives = 111/208 (53%), Gaps = 8/208 (3%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P+ +DWR+     + PVE+QG CGSCWAF+    +E Q  L    L  LSK QLV+CD  
Sbjct: 17  PERMDWRE--WGAVGPVENQGSCGSCWAFSVAGNVEGQWFLKTGQLVSLSKQQLVDCDVM 74

Query: 187 NLNCNGG-NIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
           +  C GG   +   E ++  GLE Q+DYPY   +    +C   KEK    + D  V    
Sbjct: 75  DYGCGGGWPTNAYMEIMRMGGLELQSDYPYVGVQQ---QCYLNKEKLLAKIDDLIVLGAY 131

Query: 246 D--HMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT 303
           +  H  +L + GP+   LN   ++ Y       +   C+P  L+HAV  VGY  +NG+  
Sbjct: 132 EEEHAAYLAEHGPLSSALNAGYLQFYQSGISHPSYEECSPASLNHAVLTVGYDTENGVPY 191

Query: 304 WIVRNSWGDIGPDHGYFQIERGANACGI 331
           WI++NSWG    ++GYF++ RG   CGI
Sbjct: 192 WIIKNSWGTGWGENGYFRLYRGDGTCGI 219


>gi|28192375|gb|AAK07731.1| CPR2-like cysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 101/322 (31%), Positives = 156/322 (48%), Gaps = 51/322 (15%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ + Y  + E   RF+ FK + +    +        +G +  SD +P E  +R
Sbjct: 47  FSLFKSKFGKIYASEEEHDHRFKVFKANRRRARRHQLLDPSAEHGITKFSDLTPSE-FRR 105

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           T L L  K K +L A++  +           LP   DWR      +  V++QG CGSCW+
Sbjct: 106 TYLGLH-KPKPKLNAEKAPI------LPTSDLPADFDWRDHGA--VTGVKNQGSCGSCWS 156

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  +E    L    L  LS+ QLV+CDH          +  C GG +  AFEY +K 
Sbjct: 157 FSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLKA 216

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GL+ + DYPY  K+    +C ++K K    V +  V  G+D      +L++ GP+ V +
Sbjct: 217 GGLQLEKDYPYTGKDG---KCHFDKSKIAAAVTNFSVI-GLDEDQIAANLVKHGPLAVGI 272

Query: 262 NHRLIESYDGN---PIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWG 311
           N   +++Y G    P+      C   + DH V +VGYG              WI++NSWG
Sbjct: 273 NAAWMQTYVGGVSCPL-----ICFKRQ-DHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWG 326

Query: 312 DIGPDHGYFQIERGANACGIES 333
           +   +HGY++I RG N CG+++
Sbjct: 327 ENWGEHGYYKICRGHNICGVDA 348


>gi|2582055|gb|AAB82455.1| lymphopain [Mus musculus]
          Length = 371

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 91/335 (27%), Positives = 164/335 (48%), Gaps = 27/335 (8%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK----QDGKETDEYYGTSGSSD 85
           +D     ++  + FK + +++NR+Y +  E   R   F     Q  +   E  GT+   +
Sbjct: 27  KDAGPRPLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGE 86

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
               ++ +    +L G+E+   E      KK  +      +P++ DWR++K  +++ V++
Sbjct: 87  TPFSDLTEEEFGQLYGQERSP-ERTPNMTKKVESNTWGESVPRTCDWRKAK-NIISSVKN 144

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAFEYVKQ 204
           QG C  CWA A    +++   +  +    +S  +L++C+     CNGG + D     +  
Sbjct: 145 QGSCKCCWAMAAADNIQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLNN 204

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMHLLQ-SGPIGVYLN 262
            GL S+ DYP++       RC  +K K   ++QD T +++    + H L   GPI V +N
Sbjct: 205 SGLASEKDYPFQGDRK-PHRCLAKKYKKVAWIQDFTMLSNNEQAIAHYLAVHGPITVTIN 263

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILT----------------WI 305
            +L++ Y    I+    +C+P ++DH+V +VG+G+K  G+ T                WI
Sbjct: 264 MKLLQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKKKEGMQTGTVLSHSRKRRHSSPYWI 323

Query: 306 VRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
           ++NSWG    + GYF++ RG N CG+  Y + A V
Sbjct: 324 LKNSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQV 358


>gi|449469923|ref|XP_004152668.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
 gi|449520697|ref|XP_004167370.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 371

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 148/318 (46%), Gaps = 41/318 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEILQR 94
           F+ + +K+ +TYT D E   RF  FK + +        + D  +G +  SD +  E  + 
Sbjct: 58  FQDFKLKFGKTYTTDEEHDYRFRVFKANLRKAKRHQKLDPDAVHGVTRFSDLTESEFREN 117

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G  + RL AD  +      +     L    DWR      + PV+ QG CGSCW+
Sbjct: 118 ----FVGLNRLRLPADAHQAPILPTDN----LASDFDWRDQGA--VTPVKDQGSCGSCWS 167

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+    LE    L    L  LS+ QLV+CDH          +  CNGG +  AFEY VK 
Sbjct: 168 FSAVGALEGANFLSTGKLISLSEQQLVDCDHECDPEEAGACDAGCNGGLMTSAFEYIVKA 227

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG-VDHM-MHLLQSGPIGVYLN 262
            GLE + DYPY   +  +  C ++  K      +  V S   D +  +L+++GP+ + +N
Sbjct: 228 GGLEREEDYPYTGTDRGS--CKFQNGKIAASAANFSVISNDADQIAANLVKNGPLAIGIN 285

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y         + C+   LDH V +VGYG              WI++NSWG+   
Sbjct: 286 AVFMQTYMKG--ISCPYICSKRNLDHGVLLVGYGAAGFAPIRLKEKPYWIIKNSWGENWG 343

Query: 316 DHGYFQIERGANACGIES 333
           ++GY+ I +G N CG ES
Sbjct: 344 ENGYYFICKGKNICGSES 361


>gi|354496134|ref|XP_003510182.1| PREDICTED: cathepsin F [Cricetulus griseus]
 gi|344250261|gb|EGW06365.1| Cathepsin F [Cricetulus griseus]
          Length = 462

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 101/320 (31%), Positives = 158/320 (49%), Gaps = 31/320 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK +++ +NRTY    E + R   F ++  +  +          YG +  SD 
Sbjct: 158 SVKMTTVFKDFMITYNRTYESREETQWRLTVFTRNMVKAQKIEALDRGTAQYGITKFSDL 217

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E        L  K+       +  + K +N+    P P   DWR  K   +  V+ Q
Sbjct: 218 TEEEFYTIYLNPLLQKKP----GSKMSLAKSIND----PAPPEWDWR--KKGAVTKVKDQ 267

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 268 GMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACLGGMPSNAYTAIKSLG 327

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y+        C +  +KAKV++ D+   S  +  M   L Q GPI V +N 
Sbjct: 328 GLETEDDYSYKGYVQA---CNFSAQKAKVYINDSVELSKNESKMAAWLAQKGPISVAINA 384

Query: 264 RLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
             ++ Y     +P+R     C+P  +DHAV +VGYG ++    W ++NSWG    + GY+
Sbjct: 385 FGMQFYRHGIAHPLRP---LCSPWLIDHAVLLVGYGNRSNTPYWAIKNSWGSNWGEEGYY 441

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 442 YLYRGSGACGVNTMASSAVV 461


>gi|5051468|emb|CAB44983.1| putative preprocysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 100/319 (31%), Positives = 154/319 (48%), Gaps = 45/319 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ + Y  + E   RF+ FK + +    +        +G +  SD +P E  +R
Sbjct: 47  FSLFKSKFGKIYASEEEHDHRFKVFKANLRRARRHQLLDPSAEHGITKFSDLTPSE-FRR 105

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           T L L  K K +L A++  +           LP   DWR      +  V++QG CGSCW+
Sbjct: 106 TYLGLH-KPKPKLNAEKAPI------LPTSDLPADFDWRDHGA--VTGVKNQGSCGSCWS 156

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  +E    L    L  LS+ QLV+CDH          +  C GG +  AFEY +K 
Sbjct: 157 FSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLKA 216

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GL+ + DYPY  K+    +C ++K K    V +  V  G+D      +L++ GP+ V +
Sbjct: 217 GGLQLEKDYPYTGKDG---KCHFDKSKIAAAVTNFSVI-GLDEDQIAANLVKHGPLAVGI 272

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G         C   + DH V +VGYG              WI++NSWG+  
Sbjct: 273 NAAWMQTYVGG--VSCPLICFKRQ-DHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENW 329

Query: 315 PDHGYFQIERGANACGIES 333
            +HGY++I RG N CG+++
Sbjct: 330 GEHGYYKICRGHNICGVDA 348


>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
          Length = 316

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 86/220 (39%), Positives = 121/220 (55%), Gaps = 18/220 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LPKS+DWRQ     + PV+ QG CGSCW+F+ T  LE Q+ L    L  LS+  LV+C  
Sbjct: 100 LPKSVDWRQRGA--VTPVKDQGHCGSCWSFSATGSLEGQLFLKTGRLVSLSEQNLVDCSK 157

Query: 185 -HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKA----KVFVQD 238
            +GN  C GG ++ AF+YV+   G++++A YPY  +EN    C ++++K     K +V D
Sbjct: 158 TYGNSGCEGGLMNQAFQYVRDNKGIDTEASYPYEAREN---NCRFKEDKVGGTDKGYV-D 213

Query: 239 TWVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
               S  D    +   GPI V ++  H   + Y     +     C+P +LDH V  VGYG
Sbjct: 214 ILEASEKDLQSAVATVGPISVRIDASHESFQFYSEGVYKEQ--YCSPSQLDHGVLTVGYG 271

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
            +NG   W+V+NSWG    + GY +I R   N CGI S A
Sbjct: 272 TENGQDYWLVKNSWGPSWGESGYIKIARNHKNHCGIASMA 311


>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 447

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 96/317 (30%), Positives = 156/317 (49%), Gaps = 30/317 (9%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           +Q  AFK    K++R+Y D  E   RF  FKQ+ +   E         +G +  SD SP+
Sbjct: 39  QQFAAFKQ---KYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPE 95

Query: 90  EILQRTGLRLT-GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E       R T     E   A  +R +K +     G  P+++DWR  K   + PV+ QG+
Sbjct: 96  E------FRATYHNGAEYYAAALKRPRKVVT-VSTGKAPEAVDWR--KKGAVTPVKDQGQ 146

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQY 205
           CGSCWAF+    +E Q  +    L  LS+  LV CD  +L C GG +D AF+++    ++
Sbjct: 147 CGSCWAFSAIGNIEGQWKVTGHNLTSLSEQMLVSCDTEDLGCAGGLMDNAFKWIVSSNRH 206

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            + ++  YPY +K      C    +     ++D       ++ +   L ++GP+ + ++ 
Sbjct: 207 NVFTEESYPYASKGGNVPPCRMSGKVVGAKIRDHVDLPKDENAIAEWLAKNGPVAIAVDS 266

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
              +SY G  +     +C   +LDH V +VGY + +    WI++NSW     + GY +IE
Sbjct: 267 TSFQSYTGGVLT----SCISKQLDHGVLLVGYDDTSKPPYWIIKNSWSKGWGEEGYIRIE 322

Query: 324 RGANACGIESYAYLASV 340
           +G N C +++YA  A V
Sbjct: 323 KGTNQCLVKNYATSAVV 339


>gi|356545108|ref|XP_003540987.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
          Length = 365

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 101/330 (30%), Positives = 166/330 (50%), Gaps = 51/330 (15%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQE---- 90
           F  +  ++ + Y  ++E   R++ FK + +    +        +G +  SD +P E    
Sbjct: 50  FLEFKRRFGKAYDSEDEHDYRYKVFKANMRRARRHQSLDPSAAHGVTRFSDLTPSEFRNK 109

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
           +L   G+RL       L+A++  +    N      LP   DWR      + PV++QG CG
Sbjct: 110 VLGLRGVRLP------LDANKAPILPTDN------LPSDFDWRDHGA--VTPVKNQGSCG 155

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY 201
           SCW+F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY
Sbjct: 156 SCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEY 215

Query: 202 V-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIG 258
           + K  G+  + DYPY   ++ T  C ++K K    V +  V S  +  +  +L+++GP+ 
Sbjct: 216 ILKSGGVMREEDYPYSGADSGT--CKFDKTKIAASVANFSVVSLDEDQIAANLVKNGPLA 273

Query: 259 VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWG 311
           V +N   +++Y G       + C+  +L+H V +VGYG              WI++NSWG
Sbjct: 274 VAINAAYMQTYIGG--VSCPYVCS-RRLNHGVLLVGYGSGAYAPIRMKEKPFWIIKNSWG 330

Query: 312 DIGPDHGYFQIERGANACGIESY-AYLASV 340
           +   ++GY++I RG N CG++S  + +ASV
Sbjct: 331 ENWGENGYYKICRGRNICGVDSMVSTVASV 360


>gi|42564163|gb|AAS20593.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 324

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 100/325 (30%), Positives = 156/325 (48%), Gaps = 40/325 (12%)

Query: 33  AYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------------YGT 80
           A +++   + ++ + + ++++Y +  E K RF  F  +    +E+             G 
Sbjct: 13  ATEALSDKEKWQNFKINFSKSYQNVVEEKRRFNIFLSNLLRIEEHNQNFSRGLSTYEMGV 72

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKK----GPLPKSLDWRQSK 136
           +  +D +P+E +            ER    R+   KFL+E+ K    G LP  +DW  +K
Sbjct: 73  NKFADLTPEEFM------------ERFRPLRKTKPKFLSEQAKFNFDGDLPAEVDW--TK 118

Query: 137 VKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNID 196
              +  V+SQG CGSCWAF+TT  +ES   +    L  LS+ QLV+C   N  C GG +D
Sbjct: 119 QGAVTEVKSQGSCGSCWAFSTTGSVESHNFIKTGKLISLSEQQLVDCVKNNSGCAGGWMD 178

Query: 197 VAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG---VDHMMHLLQ 253
           +A EY++  G+ S+ DYPY  + N T  C +   KA V ++          +D    +  
Sbjct: 179 IALEYIEADGIMSEDDYPYEER-NTT--CRFNNSKAAVQIKSYKAIKKNDEIDLQKAVAL 235

Query: 254 SGPIGVYLNHRLIESYDGNPIRRNDWACNPHK--LDHAVAIVGYGEKNGILTWIVRNSWG 311
            GP+ V +   +        I  ND  C   +  L HAV + GYG ++G   WIV+NSWG
Sbjct: 236 EGPVSVAIEVTIAFQLYARGI-LNDPQCKNTEGDLTHAVLVTGYGSQDGKDYWIVKNSWG 294

Query: 312 DIGPDHGYFQIERGA-NACGIESYA 335
                 GY ++ R A N CGI + A
Sbjct: 295 AEYGMDGYLRMSRNADNQCGIATRA 319


>gi|22549430|ref|NP_689203.1| cath gene product [Mamestra configurata NPV-B]
 gi|215401259|ref|YP_002332563.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
 gi|22476609|gb|AAM95015.1| putative cysteine proteinase [Mamestra configurata NPV-B]
 gi|198448759|gb|ACH88549.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
 gi|390165231|gb|AFL64878.1| cathepsin [Mamestra brassicae MNPV]
 gi|401665635|gb|AFP95747.1| putative cysteine proteinase [Mamestra brassicae MNPV]
          Length = 341

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 92/302 (30%), Positives = 153/302 (50%), Gaps = 28/302 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK--------QDGKETDEYYGTSGSSDRSPQEILQR 94
           F+ +I ++N+ Y+ ++E K R+  F+        ++ +     Y  +  +D +  E++ R
Sbjct: 44  FEKFITQYNKQYSSEDEKKYRYNIFRHNIESINAKNSRNDSAVYKINRFADMTKNEVVNR 103

Query: 95  -TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
            TGL          E     V     +R++   P + DWR      +  V+ QG CG+CW
Sbjct: 104 HTGLASGDTGANFCET---IVVDGPGQRQR---PANFDWRN--YNKVTSVKDQGMCGACW 155

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQAD 212
           AFA    LESQ A+    L  L++ QLV+CD  ++ C+GG I  A+E +    G+E + D
Sbjct: 156 AFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMHIGGVEQEYD 215

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQS-GPIGVYLNHRLIESY 269
           YPY+    +   C  +  K  V V++   +V    + +  LL+  GPI + ++   +  Y
Sbjct: 216 YPYK---AVRLPCAVKPHKFAVGVRNCYRYVLLSEERLEDLLRHVGPIAIAVDAVDLTDY 272

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
            G  I      C  + L+HAV +VGYG +N +  W ++NSWG    ++GY +I RG N+C
Sbjct: 273 YGGVIS----FCENNGLNHAVLLVGYGVENNVPYWTIKNSWGPDYGENGYVRIRRGVNSC 328

Query: 330 GI 331
           G+
Sbjct: 329 GM 330


>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
          Length = 360

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 111/322 (34%), Positives = 162/322 (50%), Gaps = 49/322 (15%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE------------YYGTSGSSDRSPQ 89
           A+K + +  ++TY    E   RFE F+++ ++ +E            Y G +  SD   +
Sbjct: 55  AWKEFKILHDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHE 114

Query: 90  EILQRTGLRLT----GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
           E ++  GL+ T    G     L A+       L E      P S+DWR  K   +  V++
Sbjct: 115 EFVKYNGLKKTSLKDGGCSSYLAANN------LVE------PDSVDWR--KKGYVTDVKN 160

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVK 203
           QG+CGSCW+F+TT  LE Q       L  LS+SQLV+C    GN  CNGG +D AF+Y+K
Sbjct: 161 QGQCGSCWSFSTTGSLEGQHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIK 220

Query: 204 QY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMH--LLQSGPI 257
              GLES+ DYPY+ K+     C +  +  KV   DT    V SG +  +   + + GP+
Sbjct: 221 SVGGLESEEDYPYKPKQGT---CKF--DDTKVAATDTGCVDVESGSESALKKAVSEVGPV 275

Query: 258 GVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIG 314
            V ++  H   +SY G      +  C+  +LDH V  VGYG +  G   WIV+NSWG   
Sbjct: 276 SVAIDASHSSFQSYAGGVYDEPE--CSSEQLDHGVLCVGYGTDDQGQDYWIVKNSWGAEW 333

Query: 315 PDHGYFQIERG-ANACGIESYA 335
            + GY ++ R   N CGI + A
Sbjct: 334 GEDGYVKMSRNKKNQCGIATQA 355


>gi|85068706|gb|ABC69433.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 103/331 (31%), Positives = 158/331 (47%), Gaps = 34/331 (10%)

Query: 28  VWRDLAYDSIKQVD----AFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKET 74
           +W  LA  +  + D     ++ + +K+ +TY++D++ + RFE FK         Q+ ++ 
Sbjct: 13  IWSALARTTQVEPDNARALYEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQG 71

Query: 75  DEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLP-KSLDWR 133
              YG +  SD + +E   R           R+  D   V + L   +   +  +  DWR
Sbjct: 72  TAQYGVTQFSDLTSEEFKTRY---------LRMRFDGPIVSEDLTPEEDVTMDNEKFDWR 122

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
           +     + PV  QG+CGSCWAF+    +  Q       L  LS  QLV+CD+ +  C+GG
Sbjct: 123 EHGA--VGPVLDQGKCGSCWAFSVIGNVVGQWFRETGHLLALSGQQLVDCDYLDDGCDGG 180

Query: 194 NIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--H 250
                +  + K  GLE  +DYPY     I   C  +K K   +V  + +    + +    
Sbjct: 181 YPPQTYTAIQKMGGLELASDYPYTGVGGI---CHMDKSKFVAYVNGSTILPLSEKVQAQK 237

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L   GP+   LN   ++ Y G  I R  W C+P  ++HAV  VGYG +NG   WIV+NSW
Sbjct: 238 LRAIGPLSSALNADTLQLYKGG-IMRPKW-CDPAGVNHAVLTVGYGVQNGKPYWIVKNSW 295

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASVK 341
           G+   + GYF+I RG   CGI S    A +K
Sbjct: 296 GEDFGEEGYFRIYRGDGTCGINSIVTTARIK 326


>gi|167833701|gb|ACA02577.1| cathepsin [Spodoptera frugiperda MNPV]
          Length = 340

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 91/303 (30%), Positives = 156/303 (51%), Gaps = 30/303 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQEILQR 94
           F+ +I ++N+ Y  ++E K R+  F+ + +  ++         Y  +  +D +  EI+  
Sbjct: 43  FEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNSRNDSAVYKINRFADMTKNEIV-- 100

Query: 95  TGLRLTGKEKERLEAD--RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
             +R TG     L A+     V     +R++   P + DWR   +  +  V+ QG CG+C
Sbjct: 101 --IRHTGLASGELGANFCETVVVDGPAQRQR---PANFDWR--TLNKVTSVKDQGMCGAC 153

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQA 211
           WAFA    LESQ A+    L  L++ QLV+CD  ++ C+GG I  A+E + +  G+E + 
Sbjct: 154 WAFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMRMGGVEQEF 213

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SGPIGVYLNHRLIES 268
           DYPY+ +      C  +  K    V++   +V    + +  LL+  GPI + ++   +  
Sbjct: 214 DYPYKAERQ---PCALKPHKFAAGVRNCYRYVLMNEERLEDLLRYVGPIAIAVDAVDLTD 270

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y G  +      C  + L+HAV +VGYG +N +  WI++NSWG    + GY ++ RG N+
Sbjct: 271 YYGGIVS----FCKNNGLNHAVLLVGYGVENNVPYWIIKNSWGSDYGEDGYVRVRRGVNS 326

Query: 329 CGI 331
           CG+
Sbjct: 327 CGM 329


>gi|125860143|ref|YP_001036312.1| viral cathepsin [Spodoptera frugiperda MNPV]
 gi|120969288|gb|ABM45731.1| viral cathepsin [Spodoptera frugiperda MNPV]
 gi|319997353|gb|ADV91251.1| V-CATH [Spodoptera frugiperda MNPV]
 gi|384087478|gb|AFH58958.1| v-cath [Spodoptera frugiperda MNPV]
          Length = 339

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 91/303 (30%), Positives = 156/303 (51%), Gaps = 30/303 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQEILQR 94
           F+ +I ++N+ Y  ++E K R+  F+ + +  ++         Y  +  +D +  EI+  
Sbjct: 42  FEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNSRNDSAVYKINRFADMTKNEIV-- 99

Query: 95  TGLRLTGKEKERLEAD--RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
             +R TG     L A+     V     +R++   P + DWR   +  +  V+ QG CG+C
Sbjct: 100 --IRHTGLASGELGANFCETVVVDGPAQRQR---PANFDWR--TLNKVTSVKDQGMCGAC 152

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQA 211
           WAFA    LESQ A+    L  L++ QLV+CD  ++ C+GG I  A+E + +  G+E + 
Sbjct: 153 WAFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMRMGGVEQEF 212

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SGPIGVYLNHRLIES 268
           DYPY+ +      C  +  K    V++   +V    + +  LL+  GPI + ++   +  
Sbjct: 213 DYPYKAERQ---PCALKPHKFAAGVRNCYRYVLMNEERLEDLLRYVGPIAIAVDAVDLTD 269

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y G  +      C  + L+HAV +VGYG +N +  WI++NSWG    + GY ++ RG N+
Sbjct: 270 YYGGIVS----FCKNNGLNHAVLLVGYGVENNVPYWIIKNSWGSDYGEDGYVRVRRGVNS 325

Query: 329 CGI 331
           CG+
Sbjct: 326 CGM 328


>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
 gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
          Length = 471

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 102/310 (32%), Positives = 155/310 (50%), Gaps = 35/310 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           ++ ++V+  + Y    E + RFE FK + +  DE+     S DRS +  L R    LT +
Sbjct: 51  YEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEH----NSVDRSYKVGLNRFA-DLTNE 105

Query: 103 EKER--LEADRERVKKFLNERKK-------GPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           E +   L    ER  +FL  R +         LP+++DWR+    V  PV+ QG+CGSCW
Sbjct: 106 EYKAMFLGTKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVV--PVKDQGQCGSCW 163

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQA 211
           AF+T   +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++    G++++ 
Sbjct: 164 AFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDTEE 223

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--NHRLI 266
           DYPY+  +NI   C   ++ AKV   D +     +  + +   +   P+ V +    R  
Sbjct: 224 DYPYKASDNI---CDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAF 280

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
           + Y           C   +LDH V  VGYG +NG+  WIVRNSWG    + GY ++ER  
Sbjct: 281 QLYKSGVFTGR---CGT-ELDHGVVAVGYGTENGVNYWIVRNSWGSAWGESGYIRMERNV 336

Query: 327 -----NACGI 331
                  CGI
Sbjct: 337 ANTKTGKCGI 346


>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
          Length = 374

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 96/312 (30%), Positives = 152/312 (48%), Gaps = 42/312 (13%)

Query: 49  KWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQRTGLRLT 100
           K+ ++Y    E   RF  FK + +    +        +G +  SD +  E  ++    + 
Sbjct: 65  KFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQ----VL 120

Query: 101 GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
           G  K RL  D  +            LP+  DWR+     + PV++QG CGSCW+F+TT  
Sbjct: 121 GLRKLRLPKDANKAPIL----PTNDLPEDFDWREKGA--VGPVKNQGSCGSCWSFSTTGA 174

Query: 161 LESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQYGLESQ 210
           LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +K  GL  +
Sbjct: 175 LEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMRE 234

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIES 268
            DYPY   +     C ++K+K    V +  V S  +  +  +L+++GP+ V  N   +++
Sbjct: 235 EDYPYTGMDRGA--CKFDKDKVAAGVANFSVVSLDEDQIAANLVKNGPLAVATNAVFMQT 292

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGPDHGYFQ 321
           Y G       + C+  +LDH V +VGYG              WI++NSWG+   ++G+++
Sbjct: 293 YIGG--VSCPYICS-RRLDHGVLLVGYGSAGYAPVRMKEKPYWIIKNSWGESWGENGFYK 349

Query: 322 IERGANACGIES 333
           I RG N CG++S
Sbjct: 350 ICRGRNICGVDS 361


>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1454

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 163/319 (51%), Gaps = 29/319 (9%)

Query: 41   DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
            D FKT   + NRTY    E + RF  FK +  + ++          YG +  +D +  E 
Sbjct: 1147 DKFKT---RHNRTYQSSLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGITHFADMTSAEY 1203

Query: 92   LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
              RTGL +  +E + +   R  + + ++E  +  LP + DWR+  +  ++ V++QG CGS
Sbjct: 1204 RARTGL-VVPREGDEVNHIRNPMAE-IDEHME--LPDAFDWRE--LGAVSEVKNQGNCGS 1257

Query: 152  CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQ 210
            CWAF+    +E    +  K L   S+ +L++CD  +  CNGG +D A++ +++  GLE +
Sbjct: 1258 CWAFSVVGNIEGLHQVKTKKLEEYSEQELLDCDTVDSACNGGFMDDAYKAIEKIGGLELE 1317

Query: 211  ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIES 268
            ++YPY  K+  T  C + K  A V V+        +  +   L+ +GP+ + LN   ++ 
Sbjct: 1318 SEYPYLAKKQKT--CHFNKTMAHVRVKGAVDLPKNETAIAQFLVANGPVSIGLNANAMQF 1375

Query: 269  YDGNPIRRNDWACNPHKLDHAVAIVGYGEK-----NGILT-WIVRNSWGDIGPDHGYFQI 322
            Y G         C+   LDH V IVGYG K     N  L  WIV+NSWG    + GY+++
Sbjct: 1376 YRGGISHPWKPLCSKKNLDHGVLIVGYGVKEYPMFNKTLPYWIVKNSWGPKWGEQGYYRV 1435

Query: 323  ERGANACGIESYAYLASVK 341
             RG N CG+   A  A ++
Sbjct: 1436 FRGDNTCGVSEMATSAVLE 1454


>gi|339244639|ref|XP_003378245.1| cathepsin F [Trichinella spiralis]
 gi|316972864|gb|EFV56510.1| cathepsin F [Trichinella spiralis]
          Length = 366

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 92/314 (29%), Positives = 149/314 (47%), Gaps = 40/314 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSPQEILQ 93
           FK ++V++N+ Y  +     ++  FK         Q+ ++    YG +  +D +P+E   
Sbjct: 66  FKQFMVEFNKWYETEKLTAEKYNIFKSNMVIAKRLQEEEQGTAIYGPTIFADMTPEEF-- 123

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKS-----LDWRQSKVKVLNPVESQGR 148
                     K  L  +   VKK    ++   +PKS     +DWR  K   +  V+ QG 
Sbjct: 124 ---------RKTHLNFNPNNVKK---PKRMANIPKSNISERMDWR--KFNAVTSVKDQGN 169

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG-NIDVAFEYVKQYGL 207
           CGSCWAF T A +E   A+    L  LS+ QLV+CD  +  C GG  ++   E ++  GL
Sbjct: 170 CGSCWAFCTVANIEGAWAVKTAQLISLSEQQLVDCDRLDDGCEGGLPVNAYLEIIRLGGL 229

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRL 265
           E + DY Y  +     +C +   K+ V++ DT V    +  +  ++ ++GP+ V LN   
Sbjct: 230 EKEEDYKYTARSG---KCKFNHTKSAVYINDTVVLPEDEDAIARYVSENGPVAVGLNADA 286

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGIL----TWIVRNSWGDIGPDHGYFQ 321
           +  Y       +   C+P  ++H V IVGY  K  +      WI++NSWG    + GY+ 
Sbjct: 287 MMFYRSGIAHPSRLMCSPDGINHGVTIVGYDVKESLFWSTPYWIIKNSWGPNWGEKGYYY 346

Query: 322 IERGANACGIESYA 335
           + RG   CGI+  A
Sbjct: 347 LYRGKGVCGIDQMA 360


>gi|449461649|ref|XP_004148554.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD19a-like
           [Cucumis sativus]
          Length = 381

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 98/331 (29%), Positives = 159/331 (48%), Gaps = 47/331 (14%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSG 82
           D  + ++     F  +  ++ ++Y  + E   RF+ FK + +  + +        +G + 
Sbjct: 47  DFNHHALGAEHHFSLFKRRFGKSYATEEEHDRRFKIFKANMRRAERHQSFDPSAIHGVTQ 106

Query: 83  SSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
            SD +P E  ++  L L G  + RL  D         E     LP   DWRQ     +  
Sbjct: 107 FSDLTPFE-FRKAFLGLRG-HRLRLPVDTNAAPILPTEN----LPIDFDWRQHGG--VTR 158

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGG 193
           V++QG CGSCW+F+TT  LE            LS+ QLV+CDH          +  CNGG
Sbjct: 159 VKNQGSCGSCWSFSTTGALEG------ANFLXLSEQQLVDCDHECDPEEEDACDSGCNGG 212

Query: 194 NIDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MM 249
            ++ AFEY +K  GL  + DYPY   +  T  C ++K K    +    V + +D      
Sbjct: 213 LMNSAFEYTLKAGGLMKEQDYPYAGIDRNT--CNFDKSKIAASIASFSVVNSIDEDQIAA 270

Query: 250 HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT------ 303
           +L+++GP+ + +N   +++Y G       + C+  +LDH V +VGYG             
Sbjct: 271 NLVKNGPLAIAINAVFMQTYIGGV--SCPFICSK-RLDHGVLLVGYGSAGYAPIRMRDKD 327

Query: 304 -WIVRNSWGDIGPDHGYFQIERGANACGIES 333
            WI++NSWG+   ++GY++I RG N CG++S
Sbjct: 328 YWIIKNSWGESWGENGYYKICRGRNICGVDS 358


>gi|403352840|gb|EJY75943.1| Oryzain gamma chain [Oxytricha trifallax]
          Length = 338

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 102/319 (31%), Positives = 150/319 (47%), Gaps = 44/319 (13%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTG 96
           +   + F  YI ++ ++Y    E + R + F +   E  +    + SS+  P     R G
Sbjct: 30  VSSTEEFLNYIARFGKSYATKAEFQKRAKLFLKTKMEIMQ----AASSNSVPTF---RLG 82

Query: 97  L-RLTGKEKERLEA---------DRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
             + +   +E  +A         + +   + L   +   LP S DWR   V  +NPV+ Q
Sbjct: 83  FNQFSDWTEEEFQAILGNKPSEEEHDVYHEHLKILEDAILPASKDWRDDGV--VNPVKDQ 140

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQ 204
           GRCGSCWAF+T A +ES  A+    LY LS+ QLV+C   + N  CNGG     ++YVK 
Sbjct: 141 GRCGSCWAFSTAAGVESHFAIQFGKLYSLSEQQLVDCSTAYDNAGCNGGLATQGYDYVKS 200

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD---TWVTSGVDHMMHLLQSGPIGVYL 261
           YGLE +ADYPY   +     C  +K K   +V+D       S       L   GP  V  
Sbjct: 201 YGLEQEADYPYLAADGT---CHRDKSKIVAYVEDFHTVQTLSPSQLKAALATQGPASV-- 255

Query: 262 NHRLIESYDGNPIRRN------DWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
                 S D + + +N      +  C    L+HA+  VGYG +NG   +IVRNSWG    
Sbjct: 256 ------SVDASGVFKNYQSGILNAGCGT-SLNHAILAVGYGVENGQEYYIVRNSWGPSWG 308

Query: 316 DHGYFQ--IERGANACGIE 332
           ++GY +  I  G   CG++
Sbjct: 309 ENGYIRLAIVEGQGTCGVQ 327


>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
 gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
          Length = 461

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 103/327 (31%), Positives = 164/327 (50%), Gaps = 30/327 (9%)

Query: 23  DSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSG 82
           D +I  + + + + ++++  +  ++ +  RTY    E + RFE F+ + +  D++   + 
Sbjct: 23  DMSIVSYGERSEEEVRRM--YAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAAD 80

Query: 83  SSDRSPQEILQRTGLRLTGKE------KERLEADRER-VKKFLNERKKGPLPKSLDWRQS 135
           +   S +  L R    LT +E        R + DRER +           LP+++DWR  
Sbjct: 81  AGLHSFRLGLNRFA-DLTNEEYRSTYLGARTKPDRERKLSARYQADDNEELPETVDWR-- 137

Query: 136 KVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGN 194
           K   +  ++ QG CGSCWAF+  A +E    ++   + PLS+ +LV+CD   N  CNGG 
Sbjct: 138 KKGAVAAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGL 197

Query: 195 IDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL-- 251
           +D AFE++    G++S+ DYPY+ ++N   RC   K+ AKV   D +    V+    L  
Sbjct: 198 MDYAFEFIINNGGIDSEEDYPYKERDN---RCDANKKNAKVVTIDGYEDVPVNSEKSLQK 254

Query: 252 -LQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRN 308
            + + PI V +    R  + Y           C    LDH VA VGYG +NG   W+VRN
Sbjct: 255 AVANQPISVAIEAGGRAFQLYKSGIFTGT---CGT-ALDHGVAAVGYGTENGKDYWLVRN 310

Query: 309 SWGDIGPDHGYFQIERGANA----CGI 331
           SWG +  + GY ++ER   A    CGI
Sbjct: 311 SWGTVWGEDGYIRMERNIKASSGKCGI 337


>gi|332024588|gb|EGI64786.1| Cathepsin O [Acromyrmex echinatior]
          Length = 356

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 97/324 (29%), Positives = 156/324 (48%), Gaps = 38/324 (11%)

Query: 38  KQVDAFKTYIVKWNRTYTDD-NEIKTRFEYFKQDGKETDE-----------YYGTSGSSD 85
           K  + F  YI ++N++Y +D  + + RFE+F++  +  ++           YYG +  SD
Sbjct: 31  KDAELFANYIARYNKSYRNDPAKYEERFEHFQKSLRHIEKLNSLRSSQESAYYGLTEFSD 90

Query: 86  RSPQEILQRT---GLRLTGKEKERLEADRERVKKFLNERKKG----PLPKSLDWRQSKVK 138
            S  E +Q+     L L G++        +     +N  K+      +P   DWR   V 
Sbjct: 91  LSDDEFIQQALIPDLPLRGQKHTTASYYHQHFMGSVNRMKRMIPIIGIPSKFDWRDKGV- 149

Query: 139 VLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVA 198
            + PV SQ  CG+CWAF+T  + ES  A+   TL+  S  ++++C  GN  C GG+I   
Sbjct: 150 -VGPVMSQENCGACWAFSTVGVAESMYAIENGTLHSFSVQEMIDCMPGNFGCQGGDICSL 208

Query: 199 FEYV--KQYGLESQADYPYRNKENITFRCTYEKEKAKV-------FVQDTWVTSGVDHMM 249
             ++   +  + S+ DYP   +   T  C   K  AK        F  D++V +  + + 
Sbjct: 209 LSWLLASKTRIISEIDYPLTLQ---TDTCRLHKISAKTSGVRITDFTCDSFVDAETELLT 265

Query: 250 HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNP--HKLDHAVAIVGYGEKNGILTWIVR 307
            L+  GP+ V +N    ++Y G  I+ N   C+   + L+HAV IVGY  +  I  +I++
Sbjct: 266 LLVTHGPVAVAVNAISWQNYLGGIIQYN---CDSSFNSLNHAVQIVGYDTEARIPHYIIK 322

Query: 308 NSWGDIGPDHGYFQIERGANACGI 331
           NSWG    + GY  I  G N CGI
Sbjct: 323 NSWGPSFGNKGYIYIAVGKNLCGI 346


>gi|40806502|gb|AAR92156.1| putative cysteine protease 3 [Iris x hollandica]
          Length = 292

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 92/276 (33%), Positives = 144/276 (52%), Gaps = 34/276 (12%)

Query: 78  YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKV 137
           +G +  SD +P E  +RT L L   +K  + +  E      N+     LP+  DWR    
Sbjct: 16  HGVTQFSDLTPGE-FKRTYLGLRKGKKHLVGSAHEAPLLPTND-----LPEDFDWRDKGA 69

Query: 138 KVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNL 188
             +  V++QG CGSCW+F+T+  LE    L    L  LS+ Q+V+CDH          + 
Sbjct: 70  --VTGVKNQGSCGSCWSFSTSGALEGANFLATGKLETLSEQQMVDCDHECDAEEPDDCDQ 127

Query: 189 NCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
            CNGG ++ AF+Y+++  GLES+ DYPY   +  T  C +++ K K  V +  V S +D 
Sbjct: 128 GCNGGLMNTAFQYLQKVGGLESEKDYPYTGTDRGT--CKFDESKIKASVHNFSVVS-IDE 184

Query: 248 ---MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT- 303
                +L++ GP+ + +N   +++Y G       + C  H LDH V +VGYG        
Sbjct: 185 EQIAANLVKHGPLAIAINAVFMQTYIGG--VSCPYICGKH-LDHGVLLVGYGSAGYAPIR 241

Query: 304 ------WIVRNSWGDIGPDHGYFQIERGANACGIES 333
                 WI++NSWG+   ++GY++I RG N CG++S
Sbjct: 242 LKEKPYWIIKNSWGETWGENGYYKICRGRNVCGVDS 277


>gi|90592736|ref|YP_529689.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
 gi|71559186|gb|AAZ38185.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
          Length = 343

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 92/302 (30%), Positives = 152/302 (50%), Gaps = 27/302 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQEILQR 94
           F+ +I ++N+ Y ++ E + RF  F  + +E ++         Y  +  +D +  E++ R
Sbjct: 45  FEQFISQYNKQYKNEAEKRHRFNIFMHNIEEINQKNSRNDSAVYKINRFADMTKNEVVIR 104

Query: 95  -TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
            TGL   G+           V     +R++   P S DWR      +  V+ Q  CG+CW
Sbjct: 105 HTGLASIGELNSNF--CETVVVDGPGQRQR---PSSFDWR--TYNKVTSVKDQSMCGACW 157

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQAD 212
           AFA+   LESQ A+    L  L++ QLV+CD  ++ C+GG I  A+E + Q  G+E + D
Sbjct: 158 AFASLGALESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMQMGGVEQEFD 217

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQS-GPIGVYLNHRLIESY 269
           YPYR +      C  +  K    V+    +V    + +  LL+  GPI + ++   +  Y
Sbjct: 218 YPYRAERQ---PCALKPHKFAAGVRKCFRYVLRNEERLEDLLRHVGPIAIAVDAVDLTDY 274

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
            G  +      C  + L+HAV +VGYG +N +  W ++NSWG    + GY ++ RG N+C
Sbjct: 275 YGGIVS----FCENNGLNHAVLLVGYGVENNVPFWTLKNSWGSDYGEDGYVRVRRGVNSC 330

Query: 330 GI 331
           G+
Sbjct: 331 GL 332


>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 466

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 97/321 (30%), Positives = 157/321 (48%), Gaps = 50/321 (15%)

Query: 32  LAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEI 91
           + YD       ++ ++VK  + Y    E + RF+ FK + +  +E+   +G+ D+S +  
Sbjct: 37  IDYDESHTRHVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEH---NGAGDKSYKLG 93

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGP---------------------LPKSL 130
           L +    LT +E   +         FL  R +GP                     LP  +
Sbjct: 94  LNKFA-DLTNEEYRAM---------FLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMV 143

Query: 131 DWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLN 189
           DWR+     + P++ QG+CGSCWAF+T   +E    ++   L  LS+ +LV+CD G N+ 
Sbjct: 144 DWREKGA--VTPIKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMG 201

Query: 190 CNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGV 245
           CNGG +D AFE++ Q G ++++ DYPY  K+N    C   ++ A+V   D +    T+  
Sbjct: 202 CNGGLMDYAFEFIVQNGGIDTEEDYPYHAKDNT---CDPNRKNARVVTIDGYEDVPTNDE 258

Query: 246 DHMMHLLQSGPIGVYLNHRLIES--YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT 303
             +M  + + P+ V +    +E   Y           C  + LDH V  VGYG +NG   
Sbjct: 259 KSLMKAVANQPVSVAIEAGGMEFQLYQSGVFTGR---CGTN-LDHGVVAVGYGTENGTDY 314

Query: 304 WIVRNSWGDIGPDHGYFQIER 324
           W+VRNSWG    ++GY ++ER
Sbjct: 315 WLVRNSWGSAWGENGYIKLER 335


>gi|77379397|gb|ABA71355.1| cysteine protease [Brassica napus]
          Length = 359

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 101/314 (32%), Positives = 152/314 (48%), Gaps = 31/314 (9%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD-------GKETDEY-YGTSGSSDRSPQ 89
           + V +F  +  ++ + Y +  E+K RF  FK++        K+   Y  G +  +D + Q
Sbjct: 55  RHVLSFARFTHRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFTDMTWQ 114

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E  QRT L       +   A  +   K   E     LP++ DWR+  +  ++PV+ QG C
Sbjct: 115 E-FQRTKL----GAAQNCSATLKGTHKLTGE----ALPETKDWREDGI--VSPVKDQGGC 163

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQYG- 206
           GSCW F+TT  LE+           LS+ QLV+C     N  CNGG    AFEY+K  G 
Sbjct: 164 GSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGG 223

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGVYL 261
           L+++  YPY  ++     C Y  E   V V D+  +T G +    H + LL+   I   +
Sbjct: 224 LDTEEAYPYTGEDGT---CKYSAENVGVQVLDSVNITLGAEDELKHAVGLLRPVSIAFEV 280

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
            H     Y       +     P  ++HAV  VGYG ++G+  W+++NSWG    D GYF+
Sbjct: 281 IHSF-RLYKSGVYSDSHCGQTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWGDKGYFK 339

Query: 322 IERGANACGIESYA 335
           +E G N CGI + A
Sbjct: 340 MEMGKNMCGIATCA 353


>gi|20069912|ref|NP_613116.1| cathepsin [Mamestra configurata NPV-A]
 gi|37077373|sp|Q8QLK1.1|CATV_NPVMC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|20043306|gb|AAM09141.1| cathepsin [Mamestra configurata NPV-A]
 gi|33331744|gb|AAQ11052.1| putative cysteine proteinase [Mamestra configurata NPV-A]
          Length = 337

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 92/305 (30%), Positives = 155/305 (50%), Gaps = 34/305 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK--------QDGKETDEYYGTSGSSDRSPQEILQR 94
           F+ +I ++N+ Y+ ++E K R+  F+        ++ +     Y  +  +D +  E++ R
Sbjct: 40  FEKFISQYNKQYSSEDEKKYRYNIFRHNIESINAKNSRNDSAVYKINRFADMTKNEVVNR 99

Query: 95  -TGLR---LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
            TGL    +     E +  D         +R++   P + DWR      +  V+ QG CG
Sbjct: 100 HTGLASGDIGANFCETIVVDGP------GQRQR---PANFDWRN--YNKVTSVKDQGMCG 148

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLES 209
           +CWAFA    LESQ A+    L  L++ QLV+CD  ++ C+GG I  A+E +    G+E 
Sbjct: 149 ACWAFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMHIGGVEQ 208

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQS-GPIGVYLNHRLI 266
           + DYPY+    +   C  +  K  V V++   +V    + +  LL+  GPI + ++   +
Sbjct: 209 EYDYPYK---AVRLPCAVKPHKFAVGVRNCYRYVLLSEERLEDLLRHVGPIAIAVDAVDL 265

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
             Y G  I      C  + L+HAV +VGYG +N +  W ++NSWG    ++GY +I RG 
Sbjct: 266 TDYYGGVIS----FCENNGLNHAVLLVGYGIENNVPYWTIKNSWGSDYGENGYVRIRRGV 321

Query: 327 NACGI 331
           N+CG+
Sbjct: 322 NSCGM 326


>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 102/331 (30%), Positives = 158/331 (47%), Gaps = 34/331 (10%)

Query: 28  VWRDLAYDSIKQVD----AFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKET 74
           +W  LA  +  + D     ++ + +K+ +TY++D++ + RFE FK         Q+ ++ 
Sbjct: 13  IWSALARTTQVEPDNARALYEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQG 71

Query: 75  DEYYGTSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWR 133
              YG +  SD + +E   R   +R  G       +  E V    NE+         DWR
Sbjct: 72  TAQYGVTQFSDLTSEEFKTRYLRMRFDGPIVSEDPSPEEDVT-MDNEK--------FDWR 122

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
           +     + PV  QG+CGSCWAF+    +  Q       L  LS+ QLV+CD+ +  C+GG
Sbjct: 123 EHGA--VGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDGGCDGG 180

Query: 194 NIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--H 250
                +  + K  GLE  +DYPY     I   C  +K K   ++  + +    + +    
Sbjct: 181 YPPQTYTAIQKMGGLELASDYPYTGVGGI---CYMDKSKFVAYINGSTILPLSEKVQAQK 237

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L   GP+   LN   ++ Y G  +R     C+P  ++HAV  VGYG +NG   WIV+NSW
Sbjct: 238 LRAIGPLSSALNADTLQLYKGGIMRPR--LCDPAGVNHAVLTVGYGVQNGKPYWIVKNSW 295

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASVK 341
           G+   + GYF+I RG   CGI S    A +K
Sbjct: 296 GEDFGEEGYFRIYRGDGTCGINSIVTTARIK 326


>gi|33945877|emb|CAE45588.1| papain-like cysteine proteinase-like protein 1 [Lotus japonicus]
          Length = 359

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 98/320 (30%), Positives = 154/320 (48%), Gaps = 45/320 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ + Y  + E   RF  FK +      +        +G +  SD +P E    
Sbjct: 45  FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDPSAVHGVTQFSDLTPMEFQHS 104

Query: 95  T-GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GLR  G     L +D +       +     LPK  DWR+     + PV++QG CGSCW
Sbjct: 105 VLGLRGVG-----LPSDADSAPILPTDN----LPKDFDWREHGA--VTPVKNQGSCGSCW 153

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLN----------CNGGNIDVAFEYV- 202
           +F+ T  LE    L    L  LS+ QLV+CDH   +          CNGG ++ AFEY+ 
Sbjct: 154 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHQQCDPEEAGSCDSGCNGGLMNSAFEYIL 213

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVY 260
              G+  + DYPY      T  C ++K K    V +  V S  +  +  +L+++GP+ V 
Sbjct: 214 NNGGVMREEDYPYSGTNGGT--CKFDKAKIAASVANFSVVSRDEDQIAANLVKNGPLAVA 271

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDI 313
           +N   +++Y G       + C+  KL+H V +VGYG ++           WI++NSWG+ 
Sbjct: 272 INAVYMQTYVGG--VSCPYVCS-KKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGEN 328

Query: 314 GPDHGYFQIERGANACGIES 333
             ++GY++I RG N CG++S
Sbjct: 329 WGENGYYKICRGRNICGVDS 348


>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
 gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
 gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
          Length = 366

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 106/346 (30%), Positives = 162/346 (46%), Gaps = 50/346 (14%)

Query: 15  QVTYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET 74
           QVT  V +D  I   R   +++      F+ +I ++ + Y+   E + RF  FK +    
Sbjct: 32  QVTDEVVSDPQILDARSALFNAEVH---FRHFIRRYGKKYSGPEEHEHRFGVFKSNLLRA 88

Query: 75  DEY--------YGTSGSSDRSPQEIL-QRTGLRLTGKEKERLEADRERVKKFLNERKKGP 125
            E+        +G +  SD + +E   Q  GLR          A   R            
Sbjct: 89  LEHQKLDPRASHGVTKFSDLTQEEFRHQYLGLR----------APPLRDAHDAPILPTND 138

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP+  DWR+     +  V++QG CGSCWAF+TT  LE    L    L  LS+ QLV+CDH
Sbjct: 139 LPEDFDWREKGA--VTEVKNQGSCGSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDH 196

Query: 186 ---------GNLNCNGGNIDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVF 235
                     +  CNGG +  A++Y +K  GLE + DYPY  K+     C++ K K    
Sbjct: 197 ECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGKDGT---CSFNKNKIVAH 253

Query: 236 VQDTWVTSGVDH---MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAI 292
           V +  V S +D      +L+++GP+ V +N   +++Y G       + C+   LDH V +
Sbjct: 254 VSNFSVVS-IDEGQIAANLVKNGPLSVGINAAFMQTYVGG--VSCPYVCSKRNLDHGVLL 310

Query: 293 VGYGEKNGILT-------WIVRNSWGDIGPDHGYFQIERGANACGI 331
           VGYG              W+++NSWG    ++GY+++ RG N CGI
Sbjct: 311 VGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLCRGHNVCGI 356


>gi|449139100|gb|AGE89905.1| cathepsin-like cysteine proteinase [Spodoptera littoralis NPV]
          Length = 336

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 101/305 (33%), Positives = 152/305 (49%), Gaps = 28/305 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQ- 93
           F+ +I + N+ YT  ++    F  FK++    +          YG +  SD         
Sbjct: 33  FENFIKQHNKEYTTPDQRDDAFVNFKRNLVNMNAMNNISNHAVYGINKFSDIDKITFANV 92

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGRCG 150
             GL LT    +    D  R+ +F+     GP    P+S DWR  K+  +  V+ QG CG
Sbjct: 93  HAGLVLTLNATDS-NFDPYRLCEFVT--VAGPSARTPESFDWR--KLHKVTKVKEQGVCG 147

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLES 209
           SCWAFA    +ESQ A+L  +L  LS+ QL++CD  +  C+GG + +AF E ++  G+E 
Sbjct: 148 SCWAFAAIGNIESQYAILHDSLIDLSEQQLLDCDRIDQGCDGGLMHLAFQEIMRIGGVEH 207

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH--MMHLL-QSGPIGVYLNHRLI 266
           + DYPY   + I + C     K  V +   +     D   ++ LL ++GPI V ++ R I
Sbjct: 208 EIDYPY---QGIEYACRSAPSKFAVRLSHCYQYDLRDERKLLELLYKNGPIAVAIDCRDI 264

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
             Y           CN + L+HAV +VGYG +N    WI +NSWG    ++GYF+  R  
Sbjct: 265 IDYRSGIAT----VCNDNGLNHAVLLVGYGIENDTPYWIFKNSWGSNWGENGYFRARRNI 320

Query: 327 NACGI 331
           NACG+
Sbjct: 321 NACGM 325


>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
 gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
 gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
 gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
 gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
          Length = 366

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 106/346 (30%), Positives = 162/346 (46%), Gaps = 50/346 (14%)

Query: 15  QVTYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET 74
           QVT  V +D  I   R   +++      F+ +I ++ + Y+   E + RF  FK +    
Sbjct: 32  QVTDEVVSDPQILDARSALFNAEVH---FRHFIRRYGKKYSGPEEHEHRFGVFKSNLLRA 88

Query: 75  DEY--------YGTSGSSDRSPQEIL-QRTGLRLTGKEKERLEADRERVKKFLNERKKGP 125
            E+        +G +  SD + +E   Q  GLR          A   R            
Sbjct: 89  LEHQKLDPRASHGVTKFSDLTQEEFRHQYLGLR----------APPLRDAHDAPILPTND 138

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP+  DWR+     +  V++QG CGSCWAF+TT  LE    L    L  LS+ QLV+CDH
Sbjct: 139 LPEDFDWREKGA--VTEVKNQGSCGSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDH 196

Query: 186 ---------GNLNCNGGNIDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVF 235
                     +  CNGG +  A++Y +K  GLE + DYPY  K+     C++ K K    
Sbjct: 197 ECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGKDGT---CSFNKNKIVAH 253

Query: 236 VQDTWVTSGVDH---MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAI 292
           V +  V S +D      +L+++GP+ V +N   +++Y G       + C+   LDH V +
Sbjct: 254 VSNFSVVS-IDEGQIAANLVKNGPLSVGINAAFMQTYVGG--VSCPYVCSKRNLDHGVLL 310

Query: 293 VGYGEKNGILT-------WIVRNSWGDIGPDHGYFQIERGANACGI 331
           VGYG              W+++NSWG    ++GY+++ RG N CGI
Sbjct: 311 VGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLCRGHNVCGI 356


>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
 gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
          Length = 368

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 96/313 (30%), Positives = 154/313 (49%), Gaps = 32/313 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           F  +  K+ ++Y    E   RF  FK + +    +     ++      + Q + L     
Sbjct: 53  FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDPTASHG---VTQFSDLTSAEF 109

Query: 103 EKERLEADRERVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
            K+ L   + R+ K  N     P   LP+  DWR+     + PV++QG CGSCW+F+TT 
Sbjct: 110 RKQVLGLRKLRLPKDANTAPILPTNDLPEDFDWREKGA--VGPVKNQGSCGSCWSFSTTG 167

Query: 160 ILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQYGLES 209
            LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +K  GL  
Sbjct: 168 ALEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 227

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRLIE 267
           + DYPY   +     C ++K K    V + + V+   D +  +L+++GP+ V +N   ++
Sbjct: 228 EEDYPYTGMDRGA--CKFDKNKVAAGVANFSAVSLDEDQIAANLVKNGPLAVAINAVFMQ 285

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGPDHGYF 320
           +Y G       + C+  +LDH V +VGYG              WI++NSWG+   ++G++
Sbjct: 286 TYIGG--VSCPYICS-RRLDHGVLLVGYGSAAYAPVRMKEKPYWIIKNSWGESWGENGFY 342

Query: 321 QIERGANACGIES 333
           +I RG N CG++S
Sbjct: 343 KICRGRNICGVDS 355


>gi|157862759|gb|ABV90502.1| cathepsin L, partial [Fasciola gigantica]
          Length = 280

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 87/224 (38%), Positives = 125/224 (55%), Gaps = 18/224 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 62  VPDKIDWRESGY--VTGVKDQGNCGSCWAFSTTGTMEGQYMKNQRTSISFSEQQLVDCSG 119

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN+ C+GG ++ A+EY+KQ+GLE+++ YPYR  E    +C Y ++   V V   + V 
Sbjct: 120 PWGNMGCSGGLMENAYEYLKQFGLETESSYPYRAVEG---QCRYNRQLGVVKVTGYYTVH 176

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVAIVGYGE 297
           SG +  +  L    GP  V ++   +ES D    R   +    C+P  L+HAV  VGYG 
Sbjct: 177 SGSEVGLKNLVGAEGPAAVAVD---VES-DFMMYRSGIYQSQTCSPFGLNHAVLAVGYGT 232

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 233 QGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIASMASLPMV 276


>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
          Length = 326

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 102/331 (30%), Positives = 158/331 (47%), Gaps = 34/331 (10%)

Query: 28  VWRDLAYDSIKQVD----AFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKET 74
           +W  LA  +  + D     ++ + +K+ +TY++D++ + RFE FK         Q+ ++ 
Sbjct: 13  IWSALARTTQVEPDNARALYEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQG 71

Query: 75  DEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLP-KSLDWR 133
              YG +  SD + +E   R           R+  D   V + L   +   +  +  DWR
Sbjct: 72  TAQYGVTQFSDLTSEEFKTRY---------LRMRFDGPIVSEDLTPEEDVTMDNEKFDWR 122

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
           +     + PV  QG+CGSCWAF+    +  Q       L  LS+ QLV+CD+ +  C+GG
Sbjct: 123 EHGA--VGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGG 180

Query: 194 NIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--H 250
                +  + K  GLE  +DYPY     I   C  +K K   +V  + +    + +    
Sbjct: 181 YPPQTYTAIQKMGGLELASDYPYTGVGGI---CHMDKSKFVAYVNGSTILPLSEKVQAQK 237

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L   GP+   LN   ++ Y G  I R  W C+P  ++H V  VGYG +NG   WIV+NSW
Sbjct: 238 LRAIGPLSSALNADTLQLYKGG-IMRPKW-CDPAGVNHGVLTVGYGVQNGKPYWIVKNSW 295

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASVK 341
           G+   + GYF+I RG   CGI S    A +K
Sbjct: 296 GEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326


>gi|164605518|dbj|BAF98584.1| CM0216.500.nc [Lotus japonicus]
          Length = 360

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 98/319 (30%), Positives = 154/319 (48%), Gaps = 44/319 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ + Y  + E   RF  FK +      +        +G +  SD +P E    
Sbjct: 45  FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDPSAVHGVTRFSDLTPMEFRHS 104

Query: 95  T-GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GLR  G     L +D +       +     LPK  DWR+     + PV++QG CGSCW
Sbjct: 105 VLGLRGVG-----LPSDADSAPILPTDN----LPKDFDWREHGA--VTPVKNQGSCGSCW 153

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-K 203
           +F+ T  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+  
Sbjct: 154 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCDSGCNGGLMNSAFEYILN 213

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYL 261
             G+  + DYPY      T  C ++K K    V +  V S  +  +  +L+++GP+ V +
Sbjct: 214 NGGVMREEDYPYSGTNGGT--CKFDKAKIAASVANFSVVSRDEDQIAANLVKNGPLAVAI 271

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G       + C+  KL+H V +VGYG ++           WI++NSWG+  
Sbjct: 272 NAVYMQTYVGG--VSCPYVCS-KKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENW 328

Query: 315 PDHGYFQIERGANACGIES 333
            ++GY++I RG N CG++S
Sbjct: 329 GENGYYKICRGRNICGVDS 347


>gi|395852405|ref|XP_003798729.1| PREDICTED: cathepsin W [Otolemur garnettii]
          Length = 367

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 88/329 (26%), Positives = 162/329 (49%), Gaps = 21/329 (6%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET----DEYYGTSGSSD 85
           +DL    ++  + FK + V++NR+Y++  E   R + F  +  +     +E  GT+    
Sbjct: 29  QDLDPRPLELKEVFKLFQVQFNRSYSNPAEHSRRLDIFAHNLAKAQQLQEEDLGTAEFGM 88

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            S  ++ +    ++ G +K   E  R   +K  +E++   LP++ DWR +K  +++ +++
Sbjct: 89  TSLSDLTEEEFGKIFGHQKAVGEVPRMG-RKVGSEQQGETLPRTCDWR-NKAGIISRIKN 146

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAFEYVKQ 204
           Q  C  CWA A    +E+   +       +S  +L++C+     C GG + D     +  
Sbjct: 147 QENCKCCWAMAAADNIEALWGIKYHQSVEVSVQELLDCNRCGDGCQGGFVWDAFITVLNN 206

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            GL S+ DYP++     T RC   K +   ++QD  +    +H +  +L   GPI V +N
Sbjct: 207 SGLASEKDYPFKASVK-THRCLANKYRKVAWIQDFIMLEDNEHKIAQYLATHGPITVTIN 265

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN-----------GILTWIVRNSWG 311
            +L++ Y    I+     C+P  ++H+V +VG+G +                WI++NSWG
Sbjct: 266 MKLLQHYKKGVIKAKPTTCDPQLVNHSVLLVGFGAETVSSQSHLRPHRSTPYWILKNSWG 325

Query: 312 DIGPDHGYFQIERGANACGIESYAYLASV 340
               + GYF++ RG+N+CGI  Y + A V
Sbjct: 326 AHWGEEGYFRLHRGSNSCGITKYPFTARV 354


>gi|432091112|gb|ELK24324.1| Cathepsin W [Myotis davidii]
          Length = 370

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 89/330 (26%), Positives = 152/330 (46%), Gaps = 40/330 (12%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
           + F  + +++NR+Y++  E   R + F ++                +G +  SD + +E 
Sbjct: 40  EVFTLFQIQYNRSYSNPAEYAHRLDIFARNLAHAQRLQEEDLGTAEFGVTAFSDLTEEEF 99

Query: 92  LQRTG-LRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
            Q  G  R  G+       DRE       E     +P + DWR++   V++PV+ Q  C 
Sbjct: 100 DQLYGNQRAAGRAPN---VDREVGSDEWQES----VPSTCDWRKAP-GVMSPVKDQKTCS 151

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAFEYVKQYGLES 209
            CWA A    +E+Q  +  +    +S  +L++C      C+GG + D     +   GL S
Sbjct: 152 CCWAMAAAGNIEAQWGIKTRQSVEVSVQELLDCGRCGDGCSGGFVWDAFITVLNNSGLAS 211

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIE 267
           + DYP++    +  +C  +K K   ++QD  + S  +  +  +L   GPI V +N +L++
Sbjct: 212 EKDYPFQGA--VRAKCQAKKHKKVAWIQDFIMLSDNEQRIAWYLATEGPITVTINKKLLQ 269

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-----------------WIVRNSW 310
            Y    I+     C+P  +DH V +VG+G+   +                   WI++NSW
Sbjct: 270 QYQNGVIKATQTTCDPQNVDHVVLLVGFGKTKSVEGRQAKGVPGHSRRRSTPYWILKNSW 329

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASV 340
           G    + GYF++ RG+NACGI  Y   A V
Sbjct: 330 GANWGEKGYFRLHRGSNACGITKYPITARV 359


>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
 gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  137 bits (344), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 106/317 (33%), Positives = 154/317 (48%), Gaps = 34/317 (10%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K +D F+++I K  + Y    E   RFE FK +    DE        + G +  SD S +
Sbjct: 28  KIIDLFESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKKVVNYWLGLNEFSDLSHE 87

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E   +  L L     ER E  +E      N +    +PKS+DWR  K   +  V++QG C
Sbjct: 88  EFKNKY-LGLKVDMSERRECSQE-----FNYKDVMSIPKSVDWR--KKGAVTDVKNQGSC 139

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-HGNLNCNGGNIDVAFEY-VKQYGL 207
           GSCWAF+T A +E    ++   L  LS+ +LV+CD   N  CNGG +D AF Y +   GL
Sbjct: 140 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLMDYAFSYIISNGGL 199

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--N 262
             + DYPY  +E     C   KE+++V     +     +  + ++  L + P+ V +  +
Sbjct: 200 HKEVDYPYIMEEGT---CEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEAS 256

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
            R  + Y G      D  C   +LDH VA VGYG  NG+   IV+NSWG    + GY ++
Sbjct: 257 GRDFQFYSGGVF---DGHCGT-QLDHGVAAVGYGSTNGLDYIIVKNSWGSKWGEKGYIRM 312

Query: 323 ERG----ANACGIESYA 335
           +R     A  CGI   A
Sbjct: 313 KRNTGKPAGLCGINKMA 329


>gi|195455847|ref|XP_002074892.1| GK22908 [Drosophila willistoni]
 gi|194170977|gb|EDW85878.1| GK22908 [Drosophila willistoni]
          Length = 381

 Score =  137 bits (344), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 101/318 (31%), Positives = 158/318 (49%), Gaps = 36/318 (11%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY-----YGTS-------GSS 84
           +  V  F  ++ +  +TY    E   R   F+      D        GTS         S
Sbjct: 68  LNNVQDFGDFLQQTGKTYASAAEQALRQGVFEGSQNLVDSANAAFAAGTSTFTSAVNAFS 127

Query: 85  DRSPQEIL-QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           D +  E L Q TG + + + + R+ A R+ V     E    P+P S DWR+     + PV
Sbjct: 128 DLTHLEFLKQLTGFKKSAEGESRVAAARQAV-----EVPAEPIPDSFDWREKGG--VTPV 180

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLN---CNGGNIDVAFE 200
           + QG CGSCW FA T  +E  +      L  LS+  LV+C   N     C+GG  + AF 
Sbjct: 181 KHQGTCGSCWTFAATGAIEGHLFRKTNQLPNLSEQNLVDCGPLNFGLNGCDGGCQEYAFA 240

Query: 201 YVK--QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMHLLQS--G 255
           ++K  Q G+ S+A Y Y +K ++   C+Y +++A+ +V     VT   + ++  + +  G
Sbjct: 241 FLKEAQRGIASEAKYTYVDKRDV---CSYTEKQAEAYVHGLATVTPNDEDLLKKVVATLG 297

Query: 256 PIG--VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDI 313
           P+G  ++ +  L+    G  I  N+  CN  +L+HAV +VGYG +NG   W ++NSWG+ 
Sbjct: 298 PVGCSLFADEALLHYEKG--IFSNE-TCNGQELNHAVLVVGYGSENGQDYWTIKNSWGEN 354

Query: 314 GPDHGYFQIERGANACGI 331
             + GYF++ RG N CGI
Sbjct: 355 WGESGYFRLIRGQNFCGI 372


>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 100/331 (30%), Positives = 158/331 (47%), Gaps = 34/331 (10%)

Query: 28  VWRDLAYDSIKQVD----AFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKET 74
           +W  LA  +  + D     ++ + +K+ +TY++D++ + RFE FK         Q+ ++ 
Sbjct: 13  IWSALARTTQVEPDNARALYEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQG 71

Query: 75  DEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLP-KSLDWR 133
              YG +  SD + +E   R           R+  D   V + L   +   +  +  DWR
Sbjct: 72  TAQYGVTQFSDLTSEEFKTRY---------LRMRFDGPIVSEDLTPEEDVTMDNEKFDWR 122

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
           +     + PV  QG+CGSCWAF+    +  Q       L  LS+ QLV+CD+ +  C+GG
Sbjct: 123 EHGA--VGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDGGCDGG 180

Query: 194 NIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--H 250
                +  + K  GLE  +DYPY     I   C  +K K   ++  + +    + +    
Sbjct: 181 YPPQTYTAIQKMGGLELASDYPYTGVGGI---CYMDKSKFVAYINGSTILPLSEKVQAQK 237

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           L   GP+   LN   ++ Y G  +R     C+P  ++HAV  VGYG +NG   WIV+NSW
Sbjct: 238 LRAIGPLSSALNADTLQLYKGGIMRPR--LCDPAGVNHAVLTVGYGVQNGKPYWIVKNSW 295

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASVK 341
           G+   + GYF+I RG   CGI S    A +K
Sbjct: 296 GEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326


>gi|296218871|ref|XP_002755611.1| PREDICTED: cathepsin F [Callithrix jacchus]
          Length = 489

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 103/324 (31%), Positives = 156/324 (48%), Gaps = 31/324 (9%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGT 80
           +DLA   +K    F+ +++ +NRTY    E + R   F  +     +          YG 
Sbjct: 183 QDLA---VKMASIFRNFVITYNRTYESKEEAQWRLSVFVHNMVRAQKIQALDRGTAQYGV 239

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKV 139
           +  SD + +E  + T L             RE  KK    +  G L P   DWR      
Sbjct: 240 TKFSDLTEEE-FRTTYLN---------PLLREPGKKMKQAKSVGDLAPPEWDWRSKGA-- 287

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF 199
           +  V+ QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+
Sbjct: 288 VTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKIDKACMGGLPSSAY 347

Query: 200 EYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGP 256
             +K  G LE++ DY YR        C +  EKAKV++ D+   S  +  +   L + GP
Sbjct: 348 SAIKNLGGLETEDDYSYRGHMQA---CNFSPEKAKVYINDSVELSQNEQKLAAWLAKRGP 404

Query: 257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPD 316
           I V +N   ++ Y     R     C+P  +DHAV +VGYG ++ +  W ++NSWG    +
Sbjct: 405 ISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGE 464

Query: 317 HGYFQIERGANACGIESYAYLASV 340
            GY+ + RG+ ACG+ + A  A V
Sbjct: 465 KGYYYLHRGSGACGVNTMASSAVV 488


>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
          Length = 330

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 88/222 (39%), Positives = 123/222 (55%), Gaps = 22/222 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LPK++DWRQ     + PV+ QG+CGSCW+F+ T  LE QV L    L  LS+  LV+C  
Sbjct: 114 LPKTVDWRQKGA--VTPVKDQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVDCST 171

Query: 185 -HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKA----KVFVQD 238
            +GN  C GG +D AF+YV    G++++A YPY  +EN    C ++K K     K  V  
Sbjct: 172 SYGNNGCEGGLMDQAFQYVSDNKGIDTEASYPYEARENT---CRFKKNKVGGTDKGHVD- 227

Query: 239 TWVTSGVDHMMH--LLQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVG 294
             + +G +  +   L   GPI V +  NH   + Y       N+  C+ + LDH V  VG
Sbjct: 228 --IPAGDEKALQNALATVGPISVAIDANHGSFQFYSKG--VYNEPNCSSYDLDHGVLAVG 283

Query: 295 YGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
           YG +NG   W+V+NSWG    ++GY +I R  +N CGI S A
Sbjct: 284 YGTENGQDYWLVKNSWGPSWGENGYIKIARNHSNHCGIASMA 325


>gi|355681666|gb|AER96819.1| cathepsin W [Mustela putorius furo]
          Length = 373

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 94/341 (27%), Positives = 156/341 (45%), Gaps = 37/341 (10%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGT 80
           +DL  + +K    F+ +  ++NR+Y++  E   R E F  +  +  +          +G 
Sbjct: 29  KDLDPNMLKLEQVFELFRAQYNRSYSNPKEYAHRLEIFAHNLAQAQKMEVEDLATAEFGM 88

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           +  SD + +E  Q     L G +K          +K  +E     +P S DWR+ K  V 
Sbjct: 89  TPFSDLTEEEFEQ-----LHGHQKITPGETPAVGRKVGSEVVMESVPASCDWRKLK-GVK 142

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFE 200
           +P++ QG C  CWA A    +E+  ++       +S  +L++C+     C GG +  AF 
Sbjct: 143 SPIKEQGNCNCCWAMAAAGNIEALWSIRYNQSVQVSVQELLDCNRCGDGCKGGFVWDAFV 202

Query: 201 YV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPI 257
            V    GL S+ DYP+R       +C     K   ++QD  +    +  M  +L   GPI
Sbjct: 203 TVLNNSGLASEKDYPFRGSLK-RHKCLASNYKKVAWIQDFIMLQNNEQTMANYLATHGPI 261

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG----------------- 300
            V +N +L++ Y    I+     C+P+ ++H+V +VG+G+ N                  
Sbjct: 262 TVTINMKLLQQYKKGVIKATPATCDPYLVNHSVLLVGFGKTNSSERRRAKGGHFWPHPHR 321

Query: 301 -ILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
            I  WI++NSWG    + GYF++ RG+N CGI  Y   A V
Sbjct: 322 PIPYWILKNSWGAEWGEEGYFRLHRGSNTCGITKYPLTARV 362


>gi|225427714|ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
          Length = 377

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 99/324 (30%), Positives = 157/324 (48%), Gaps = 54/324 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ ++Y    E   RF+ FK + +    +        +G +  SD +P E  + 
Sbjct: 62  FSIFKRRFGKSYASQEEHDYRFKVFKANLRRARRHQQLDPSATHGVTQFSDLTPAE-FRG 120

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGP------LPKSLDWRQSKVKVLNPVESQGR 148
           T L L             R  K  ++ +K P      LP+  DWR      +  V++QG 
Sbjct: 121 TYLGL-------------RPLKLPHDAQKAPILPTNDLPEDFDWRDHGA--VTAVKNQGS 165

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAF 199
           CGSCW+F+TT  LE    L    L  LS+ QLVECDH          +  CNGG ++ AF
Sbjct: 166 CGSCWSFSTTGALEGANFLATGNLVSLSEQQLVECDHECDPEEMGSCDSGCNGGLMNTAF 225

Query: 200 EY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGP 256
           EY +K  GL  + DYPY   +  +  C ++K K    V +  V S  +  +  +L+++GP
Sbjct: 226 EYTLKAGGLMKEEDYPYTGTDRGS--CKFDKTKIAASVSNFSVISLDEDQIAANLVKNGP 283

Query: 257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNS 309
           + V +N   +++Y G       + C+  +LDH V +VGYG              WI++NS
Sbjct: 284 LAVAINAVFMQTYVGG--VSCPYICS-KRLDHGVLLVGYGSAGYAPIRMKDKPYWIIKNS 340

Query: 310 WGDIGPDHGYFQIERGANACGIES 333
           WG+   ++G+++I RG N CG++S
Sbjct: 341 WGENWGENGFYKICRGRNVCGVDS 364


>gi|38683931|gb|AAR27011.1| cysteine protease [Periserrula leucophryna]
          Length = 283

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 93/295 (31%), Positives = 142/295 (48%), Gaps = 29/295 (9%)

Query: 63  RFEYFKQDGKET---------DEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRER 113
           RF+ F+++ K+          D  YG +  SD + +E  +R  L        R +  R +
Sbjct: 2   RFKIFRENMKKINTLNDNELGDAEYGVTQFSDLAEEE-FRRYYLTPKWDLSHRPDLVRAK 60

Query: 114 VKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLY 173
           +            P S DWR      + PV++QG CGSCWAF+TT  +E Q A+ +  L 
Sbjct: 61  IPDV-------DPPASFDWRDHNA--VTPVKNQGMCGSCWAFSTTENIEGQWAIHRNKLV 111

Query: 174 PLSKSQLVECDHGNLNCNGG-NIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKA 232
            LS+ +LV+CD  +  C GG  ++   E ++  GLES+  YPY  ++    +C +     
Sbjct: 112 SLSEQELVDCDKLDDGCEGGLPVNAYEEIIRLGGLESEKKYPYDAEDE---KCKFTVGDV 168

Query: 233 KVFVQDTWVTSG--VDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAV 290
            V++  +   S    D    L ++GPI + +N   ++ Y G       + C+P +LDH V
Sbjct: 169 AVYINSSVNISSNEADMAAWLYKNGPISIGINAFAMQFYMGGVSHPFSFLCSPDELDHGV 228

Query: 291 AIVGYGEKNGILT----WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
            IVGYG K G  +    WIV+NSWG      GY+ + RG   CG+      A VK
Sbjct: 229 LIVGYGTKKGWFSDSPYWIVKNSWGASWGVQGYYLVYRGDGVCGLNKMPTSAIVK 283


>gi|71482944|gb|AAZ32411.1| cysteine proteinase glycinain type [Nicotiana benthamiana]
          Length = 355

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 101/322 (31%), Positives = 156/322 (48%), Gaps = 51/322 (15%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ + Y  + E   RF+ FK + +    +        +G +  SD +P E  +R
Sbjct: 49  FSLFKSKFGKIYASEEEHDHRFKVFKANLRRARRHQLLDPSAEHGITKFSDLTPSE-FRR 107

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           T L L  K K +L A++  +           LP   DWR      +  V++QG CGSCW+
Sbjct: 108 TYLGLH-KPKPKLNAEKAPI------LPTSDLPADYDWRDHGA--VTGVKNQGSCGSCWS 158

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  +E    L    L  LS+ QLV+CDH          +  C+GG +  AFEY +K 
Sbjct: 159 FSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDSCDAGCSGGLMTTAFEYTLKA 218

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GL+ + DYPY  K     +C ++K K    V +  V  G+D      +L++ GP+ V +
Sbjct: 219 GGLQREKDYPYTGKXG---KCHFDKSKIAAAVTNFSVI-GLDEDQIAANLVKHGPLAVGI 274

Query: 262 NHRLIESYDGN---PIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWG 311
           N   +++Y G    P+      C   + DH V +VGYG              WI++NSWG
Sbjct: 275 NAAWMQTYVGGVSCPL-----ICFKRQ-DHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWG 328

Query: 312 DIGPDHGYFQIERGANACGIES 333
           +   +HGY++I RG N CG+++
Sbjct: 329 ENWGEHGYYKICRGHNICGVDA 350


>gi|9634237|ref|NP_037776.1| ORF16 cathepsin [Spodoptera exigua MNPV]
 gi|37077857|sp|Q9J8B9.1|CATV_NPVSE RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|6960476|gb|AAF33546.1|AF169823_16 ORF16 cathepsin [Spodoptera exigua MNPV]
          Length = 337

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 99/347 (28%), Positives = 168/347 (48%), Gaps = 54/347 (15%)

Query: 3   SSQCDHQETNTEQVTYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKT 62
           + Q +H   N + + YN+N+ + +Y               F+ +I ++N+ Y  ++E K 
Sbjct: 16  TRQDNHASANNKPMLYNINS-APLY---------------FEKFITQYNKQYKSEDEKKY 59

Query: 63  RFEYFKQDGKETDE--------YYGTSGSSDRSPQEILQR-TGLR-----LTGKEKERLE 108
           R+  F+ + +  ++         Y  +  +D    EI+ R TGL      L   E   ++
Sbjct: 60  RYNIFRHNIESINQKNSRNDSAVYKINRFADMPKNEIVIRHTGLASGELGLNFCETIVVD 119

Query: 109 ADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALL 168
              +R +           P S DWR   +  +  V+ QG CG+CW FA+   LESQ A+ 
Sbjct: 120 GPAQRQR-----------PVSFDWR--SMNKITSVKDQGMCGACWRFASLGALESQYAIK 166

Query: 169 KKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTY 227
              L  LS+ QLV+CD  ++ C+GG I  A+E + K  G+E + DY Y+ +      C  
Sbjct: 167 YDRLIDLSEQQLVDCDFVDMGCDGGLIHTAYEQIMKMGGVEQEFDYSYKAERQ---PCAL 223

Query: 228 EKEKAKVFVQDT--WVTSGVDHMMHLLQ-SGPIGVYLNHRLIESYDGNPIRRNDWACNPH 284
           +  K    V++   +V    + +  LL+  GPI + ++   +  Y G  +      C  +
Sbjct: 224 KPHKFATGVRNCYRYVILNEERLEDLLRYVGPIAIAVDAVDLTDYYGGIVS----FCENN 279

Query: 285 KLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGI 331
            L+HAV +VGYG +N +  WI++NSWG    + GY ++ RG N+CG+
Sbjct: 280 GLNHAVLLVGYGVENNVPYWIIKNSWGSDYGEDGYVRVRRGVNSCGM 326


>gi|118489556|gb|ABK96580.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 367

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 105/336 (31%), Positives = 158/336 (47%), Gaps = 43/336 (12%)

Query: 26  IYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY-------- 77
           I V  D   D +     F ++  K+ +TY    E   RF  FK + +   ++        
Sbjct: 34  IQVVSDGEDDLLNAEHHFTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKKHQMIDPTAA 93

Query: 78  YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKV 137
           +G +  SD +P+E  +R  L L  K + RL  D  +            LP   DWR    
Sbjct: 94  HGVTKFSDLTPKE-FRRQFLGL--KRRLRLPTDANKAPILPTTD----LPTDYDWRDHGA 146

Query: 138 KVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNL 188
             +  V+ QG CGSCW+F+ T  LE    L    L  LS+ QLV+CDH          + 
Sbjct: 147 --VTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDS 204

Query: 189 NCNGGNIDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
            C+GG ++ AFEY +K  GLE + DYPY   +  T  C ++K K    V +  V S +D 
Sbjct: 205 GCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGGT--CKFDKSKVVASVSNFSVVS-IDE 261

Query: 248 ---MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT- 303
                +L++ GP+ V +N   +++Y G       + C+  + DH V +VGYG        
Sbjct: 262 DQIAANLVKHGPLSVAINAAFMQTYVGG--VSCPYICSKRQ-DHGVLLVGYGSAGYAPIR 318

Query: 304 ------WIVRNSWGDIGPDHGYFQIERGANACGIES 333
                 WI++NSWG    ++GY++I RG N CG++S
Sbjct: 319 FKEKPFWIIKNSWGQNWGENGYYKICRGRNICGVDS 354


>gi|19849|emb|CAA78361.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 100/319 (31%), Positives = 154/319 (48%), Gaps = 45/319 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEILQR 94
           F  +  K+ + Y  + E   RF+ FK + +        +    +G +  SD +P E  +R
Sbjct: 47  FSLFKSKFGKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSE-FRR 105

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           T L L  K K +L A++  +           LP   DWR      +  V++QG CGSCW+
Sbjct: 106 TYLGLH-KPKPKLNAEKAPI------LPTSDLPADFDWRDHGA--VTGVKNQGSCGSCWS 156

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  +E    L    L  LS+ QLV+CDH          +  C GG+   AFEY +K 
Sbjct: 157 FSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGHYATAFEYTLKA 216

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GL+ + DYPY  K+    +C ++K K    V +  V  G+D      +L++ GP+ V +
Sbjct: 217 GGLQLEKDYPYTGKDG---KCHFDKSKICAAVTNFSVI-GLDEDQIAANLVKHGPLAVGI 272

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G         C   + DH V +VGYG              WI++NSWG+  
Sbjct: 273 NAAWMQTYVGG--VSCPLICFKRQ-DHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENW 329

Query: 315 PDHGYFQIERGANACGIES 333
            +HGY++I RG N CG+++
Sbjct: 330 GEHGYYKICRGHNICGVDA 348


>gi|268578473|ref|XP_002644219.1| Hypothetical protein CBG17217 [Caenorhabditis briggsae]
          Length = 413

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 154/313 (49%), Gaps = 29/313 (9%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFE-YFKQD------GKETDEYYGTSGSSDRSPQEILQR 94
           ++ T   ++N++Y+   E   R   Y+  D       K+ +      G +D S     + 
Sbjct: 100 SYSTVTHRYNKSYSTSKESLKRLNAYYTTDENVANWNKQKEHGSAVYGHNDLSDWTDEEF 159

Query: 95  TGLRLTGKEKERLEADRERVKKF------LNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           T   L     +RL  D E +K        +   + GPLP   DWR   V  + PV++QG+
Sbjct: 160 TKTLLPKSFYQRLHKDAEFIKPIPESLAAMKGERNGPLPDFFDWRDRNV--VTPVKAQGQ 217

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLE 208
           CGSCWAFA+TA +E+  A+       LS+  L++CD  +  C+GG+ D AF Y+ + GL 
Sbjct: 218 CGSCWAFASTATVEAAYAIAHGEKRNLSEQTLLDCDLDDNACDGGDEDKAFRYIHRQGLA 277

Query: 209 SQADYPY----RNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH-LLQSGPIGVYLN- 262
              D PY    +N  ++       K KA  F+         D M++ L+  GP+ + ++ 
Sbjct: 278 YAVDLPYVAHRQNTCSVDGHYNTTKIKAAYFLH-----HDEDSMINWLVNFGPVNIGMSV 332

Query: 263 HRLIESYDGNPIRRNDWACNPHKLD-HAVAIVGYG-EKNGILTWIVRNSWGDI-GPDHGY 319
            + + +Y G     +++AC    +  HA+ I GYG  + G   WIV+NSWG+  G ++GY
Sbjct: 333 IQPMRAYKGGVFTPSEYACKNEVIGLHALLITGYGTSEKGEKYWIVKNSWGNTWGVENGY 392

Query: 320 FQIERGANACGIE 332
               RG NACGIE
Sbjct: 393 IYFARGINACGIE 405


>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
 gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
 gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
          Length = 381

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 103/328 (31%), Positives = 154/328 (46%), Gaps = 45/328 (13%)

Query: 39  QVDA---FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRS 87
           Q+DA   F ++  ++ RTY D  E   R   F  + +    +        +G +  SD +
Sbjct: 51  QLDAEAHFASFERRFGRTYRDAGERAYRMSVFAANLRRARRHQRLDPTATHGVTKFSDLT 110

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
           P E   R  L L     E L          L       LP   DWR+     + PV+ QG
Sbjct: 111 PGEFRDRF-LGLRRPSLEGLVGGEPHEAPILPTDG---LPDDFDWREHGA--VGPVKDQG 164

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVA 198
            CGSCW+F+T+  LE    L    L  LS+ Q+V+CDH          +  CNGG +  A
Sbjct: 165 SCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTA 224

Query: 199 FEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSG 255
           F Y+ K  GL+S+ DYPY  +EN    C ++K K    V++  V S  +  +  +L++ G
Sbjct: 225 FSYLMKSGGLQSEKDYPYAGRENT---CKFDKSKIVAQVKNFSVISVNEDQIAANLVKHG 281

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRN 308
           P+ + +N   +++Y G       + C  H LDH V +VGYG              WI++N
Sbjct: 282 PLAIAINAAYMQTYIGG--VSCPFICGRH-LDHGVLLVGYGSAGYAPIRFKEKPYWIIKN 338

Query: 309 SWGDIGPDHGYFQIERGA---NACGIES 333
           SWG+   + GY++I RG    N CG++S
Sbjct: 339 SWGENWGEKGYYKICRGPHDKNKCGVDS 366


>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 369

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 96/333 (28%), Positives = 156/333 (46%), Gaps = 50/333 (15%)

Query: 32  LAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTSGS 83
            A+  I     F++++  + + Y    E + RF  FK +          +    +G +  
Sbjct: 45  FAHALIGAEKRFESFMKDFGKVYHSVEEYEHRFGVFKSNLLKALKHQALDPTASHGVTMF 104

Query: 84  SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           SD + +E   +      G ++  + +   +      E     LP + DWR+     + PV
Sbjct: 105 SDLTEEEFTSK----YLGLKRPSVLSSAPQAPPLPTED----LPPNFDWREKGA--VGPV 154

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGN 194
           + QG CGSCWAF+TT  +E    L    L  LS+ QLV+CDH          +  CNGG 
Sbjct: 155 KDQGGCGSCWAFSTTGAVEGAHFLNSGKLVSLSEQQLVDCDHQCDREEADACDAGCNGGF 214

Query: 195 IDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMH 250
           +  A++YV+   GLE ++DYPY  ++    +C ++  K  V V + +    VD      +
Sbjct: 215 MTNAYQYVEAAGGLELESDYPYEGRDG---KCKFDSNKVAVKVSN-FTNIPVDEDQVAAY 270

Query: 251 LLQSGPIGVYLNHRLIESYDGN---PIRRNDWACNPHKLDHAVAIVGYGEKNGILT---- 303
           L++SGP+ + +N   +++Y      PI      CN   LDH V +VGY E+         
Sbjct: 271 LIKSGPLAIGINAEFMQTYIAGVSCPIF-----CNKRNLDHGVLLVGYAERGFAPARLAY 325

Query: 304 ---WIVRNSWGDIGPDHGYFQIERGANACGIES 333
              WI++NSWG    D+GY++I RG   CG+ +
Sbjct: 326 KPYWIIKNSWGPNWGDNGYYKICRGHGECGLNT 358


>gi|308495037|ref|XP_003109707.1| hypothetical protein CRE_07390 [Caenorhabditis remanei]
 gi|308245897|gb|EFO89849.1| hypothetical protein CRE_07390 [Caenorhabditis remanei]
          Length = 405

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 100/320 (31%), Positives = 156/320 (48%), Gaps = 53/320 (16%)

Query: 35  DSIKQVDAFKTY---IVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQE- 90
           +S+K+++A+ T    IV WN+               K+ G      YG +  SD +  E 
Sbjct: 109 ESLKRLNAYYTTEENIVNWNKQ--------------KEHGSTV---YGHNDMSDWTDAEF 151

Query: 91  ---ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
              +L ++  +   K+ E +    E +   + ER  GPLP   DWR   V  + PV++QG
Sbjct: 152 EKTLLPKSFYQRLHKDAEYIVPVPESLAGMIGERA-GPLPDFFDWRDRNV--VTPVKAQG 208

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGL 207
           +CGSCWAFA+TA +E+  A+       LS+  L++CD  +  C+GG+ D AF Y+ + GL
Sbjct: 209 QCGSCWAFASTATVEAAYAIAHGERRNLSEQTLLDCDLVDNACDGGDEDKAFRYIHRQGL 268

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD--HMMH---------LLQSGP 256
               D PY               +    V D W T+ +   + +H         L+  GP
Sbjct: 269 AYSVDLPY-----------VAHRQNNCVVNDHWNTTRIKAAYFLHHDEDSIINWLVNFGP 317

Query: 257 IGVYLN-HRLIESYDGNPIRRNDWACNPHKLD-HAVAIVGYGEKN-GILTWIVRNSWGDI 313
           + + ++  + + +Y G     +++AC    +  HA+ I GYG  + G   WIV+NSWG+ 
Sbjct: 318 VNIGMSVIQPMRAYKGGVFTPSEYACKNEVIGLHALLITGYGTSDKGEKYWIVKNSWGNT 377

Query: 314 -GPDHGYFQIERGANACGIE 332
            G +HGY    RG NACGIE
Sbjct: 378 WGVEHGYIYFARGINACGIE 397


>gi|330376140|gb|AEC13302.1| cathepsin H [Gallus gallus]
          Length = 329

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 86/236 (36%), Positives = 119/236 (50%), Gaps = 39/236 (16%)

Query: 121 RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQL 180
           R  GP P+++DWR+ K   + PV++QG CGSCW F+TT  LES +A+    L  L++  L
Sbjct: 105 RSDGPCPEAVDWRK-KGNFVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLL 163

Query: 181 VECDHG--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           V+C     N  C+GG    AFEY+    GL  +  YPYR +      C ++ +KA  FV+
Sbjct: 164 VDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAYPYRAQNGT---CKFQPDKAIAFVK 220

Query: 238 DTWVTSGVDHMMHLLQSG---PI---------------GVYLNHRLIESYDGNPIRRNDW 279
           D    +  D    +   G   P+               GVY N R   +           
Sbjct: 221 DVINITQYDEAGMVEAVGKHNPVSFAFEVTSDFMHYRKGVYSNPRCEHT----------- 269

Query: 280 ACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
              P K++HAV  VGYGE++G   WIV+NSWG +    GYF IERG N CG+ + A
Sbjct: 270 ---PDKVNHAVLAVGYGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGLAACA 322


>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
          Length = 366

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 97/318 (30%), Positives = 147/318 (46%), Gaps = 42/318 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ +TY    E   RF  FK +      +        +G +  SD +P E   +
Sbjct: 51  FSAFKTKFAKTYATQEEHDHRFRIFKNNLLRAKSHQKLDPSAVHGVTRFSDLTPSEFRGQ 110

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G +  RL +D ++            LP   DWR      +  V++QG CGSCW+
Sbjct: 111 ----FLGLKPLRLPSDAQKAPIL----PTSDLPTDFDWRDHGA--VTGVKNQGSCGSCWS 160

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+    LE    L    L  LS+ QLV+CDH          +  CNGG +  AFEY +K 
Sbjct: 161 FSAVGALEGAHFLSTGGLVSLSEQQLVDCDHECDPEERGACDSGCNGGLMTTAFEYTLKA 220

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            GL  + DYPY  ++     C ++K K    V +  V S  +  +  +L+++GP+ V +N
Sbjct: 221 GGLMREEDYPYTGRDRGP--CKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVGIN 278

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y G       + C  H LDH V +VGYG              WI++NSWG+   
Sbjct: 279 AVFMQTYIGG--VSCPYICGKH-LDHGVLLVGYGSGAYAPIRFKEKPYWIIKNSWGESWG 335

Query: 316 DHGYFQIERGANACGIES 333
           + GY++I RG N CG++S
Sbjct: 336 EEGYYKICRGRNVCGVDS 353


>gi|7271891|gb|AAF44676.1|AF239265_1 cathepsin L [Fasciola gigantica]
          Length = 326

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 127/230 (55%), Gaps = 18/230 (7%)

Query: 120 ERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQ 179
           E  K  +P S+DWR+S    +  V+ QG+CGSCWAF+TT  +E Q    ++T    S+ Q
Sbjct: 102 EANKRAVPASIDWRESGY--VTEVKDQGQCGSCWAFSTTGAMEGQYMKNQRTSISFSEQQ 159

Query: 180 LVEC--DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           LV+C  D GN  CNGG ++ A EY+K++GLE+++ YPYR  E     C Y K+     V 
Sbjct: 160 LVDCSDDFGNFGCNGGLMENACEYLKRFGLETESSYPYRAVEG---PCRYNKQLGVAKVT 216

Query: 238 DTWVTSGVD--HMMHLLQ-SGPIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVA 291
             ++    D   + +L+   GP  V L+   ++S D    R   +    C+P  L+H V 
Sbjct: 217 GYYMVHSGDEVELQNLVGIEGPAAVALD---VDS-DFMMYRSGIYQSQTCSPEFLNHGVL 272

Query: 292 IVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
            VGYG ++G   WIV+NSWG    ++GY ++ R   N CGI S A +  V
Sbjct: 273 AVGYGTQSGTDYWIVKNSWGPWWGENGYIRMVRNRGNMCGIASLASVPMV 322


>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
          Length = 322

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 94/304 (30%), Positives = 141/304 (46%), Gaps = 28/304 (9%)

Query: 50  WNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSPQEILQRTGLRLT 100
           + + Y ++++ K RF  FK         Q   +    YG +  SD +P+E   +      
Sbjct: 34  YGKVYANEDDQK-RFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTPEEFAAKY----- 87

Query: 101 GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
                R   + ++V++      K   P+ +DWR+     +  VE+QG CGSCWAF+    
Sbjct: 88  ----LRAAVNNDQVERVRPTGLKAA-PERMDWREKGA--VTAVENQGSCGSCWAFSAAGN 140

Query: 161 LESQVALLKKTLYPLSKSQLVECDHGNLNCNGG-NIDVAFEYVKQYGLESQADYPYRNKE 219
           +E Q  +    L  LSK QLV+CD     CNGG  +    E     GLES++DYPY   E
Sbjct: 141 VEGQWFIKTGQLVSLSKQQLVDCDRVAEGCNGGWPVSSYLEIKHMGGLESESDYPYVGAE 200

Query: 220 NITFRCTYEKEKAKVFVQDTWVTSGVD--HMMHLLQSGPIGVYLNHRLIESYDGNPIRRN 277
                C   KEK    + D  V    +  H  +L + GP+   LN   ++ Y    +   
Sbjct: 201 QT---CALNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSTLLNAVALQHYQSGVLNPT 257

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYL 337
              C   +L+HAV  VGY ++  +  WI++NSWG    + GYF++ RG   CGI   A  
Sbjct: 258 YEECPDTELNHAVLTVGYDKEGDMPYWIIKNSWGTDWGEKGYFRLFRGDYTCGINRMATS 317

Query: 338 ASVK 341
           A +K
Sbjct: 318 AIIK 321


>gi|255543801|ref|XP_002512963.1| cysteine protease, putative [Ricinus communis]
 gi|223547974|gb|EEF49466.1| cysteine protease, putative [Ricinus communis]
          Length = 373

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 96/303 (31%), Positives = 149/303 (49%), Gaps = 32/303 (10%)

Query: 53  TYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRE 112
           TY    E   RF+ FK + +  + +     ++      + Q + L  +   ++ L   R 
Sbjct: 68  TYASQEEHDYRFKIFKSNLRRAERHQKLDPTATHG---VTQFSDLTHSEFRRQFLGLRRL 124

Query: 113 RVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLK 169
           R+ K  NE    P   LP   DWR+     +  V++QG CGSCW+F+TT  LE    L  
Sbjct: 125 RLPKDANEAPMLPTNDLPADFDWREKGA--VTAVKNQGSCGSCWSFSTTGALEGANYLAT 182

Query: 170 KTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQYGLESQADYPYRNKE 219
             L  LS+ QLV+CDH          +  CNGG ++ AFEY +K  GL  + DYPY   +
Sbjct: 183 GKLVSLSEQQLVDCDHECDPAEEGACDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTD 242

Query: 220 NITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIESYDGNPIRRN 277
                C ++K K    V +  V S  +  +  +L+++GP+ V +N   +++Y G      
Sbjct: 243 RGA--CQFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGG--VSC 298

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGPDHGYFQIERGANACG 330
            + C+  +LDH V +VGYG              WI++NSWG+   + GY++I RG N CG
Sbjct: 299 PYICS-KRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGENWGESGYYKICRGRNICG 357

Query: 331 IES 333
           ++S
Sbjct: 358 VDS 360


>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 359

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 100/314 (31%), Positives = 152/314 (48%), Gaps = 31/314 (9%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD-------GKETDEY-YGTSGSSDRSPQ 89
           + V +F  +  ++ + Y +  E+K RF  FK++        K+   Y  G +  +D + Q
Sbjct: 55  RHVISFARFAHRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADMTWQ 114

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E  QRT L       +   A  +   K   E     LP++ DWR+  +  ++PV+ QG C
Sbjct: 115 E-FQRTKL----GAAQNCSATLKGTHKLTGE----ALPETKDWREDGI--VSPVKDQGGC 163

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQYG- 206
           GSCW F+TT  LE+           LS+ QLV+C     N  CNGG    AFEY+K  G 
Sbjct: 164 GSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGG 223

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGVYL 261
           L+++  YPY  ++     C Y  E   V V D+  +T G +    H + L++   I   +
Sbjct: 224 LDTEEAYPYTGEDGT---CKYSAENVGVEVLDSVNITLGAEDELKHAVGLVRPVSIAFEV 280

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
            H     Y       +     P  ++HAV  VGYG ++G+  W+++NSWG    D GYF+
Sbjct: 281 IHSF-RLYKSGVYSDSHCGQTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWGDKGYFK 339

Query: 322 IERGANACGIESYA 335
           +E G N CGI + A
Sbjct: 340 MEMGKNMCGIATCA 353


>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
          Length = 363

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 156/318 (49%), Gaps = 41/318 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F ++  K++++Y+   E   RF  FK +  +   +        +G +  SD +  E  +R
Sbjct: 48  FTSFKSKFSKSYSTKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASE-FRR 106

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
             L L  K++ RL A  ++            LP+  DWR+     + PV+ QG CGSCWA
Sbjct: 107 QFLGL--KKRLRLPAHAQKAPILPTTN----LPEDFDWREKGA--VTPVKDQGSCGSCWA 158

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYVKQY 205
           F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+ Q 
Sbjct: 159 FSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQS 218

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            G+  + DY Y  ++     C ++K K    V +  V S  +  +  +L+++GP+ V +N
Sbjct: 219 GGVVQEKDYAYTGRDG---SCKFDKSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGIN 275

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y         + C   +LDH V +VG+G+             WIV+NSWG    
Sbjct: 276 AAWMQTYMSGV--SCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWG 333

Query: 316 DHGYFQIERGANACGIES 333
           + GY++I RG N CG++S
Sbjct: 334 EQGYYKICRGRNVCGVDS 351


>gi|297801998|ref|XP_002868883.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314719|gb|EFH45142.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 368

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 97/321 (30%), Positives = 153/321 (47%), Gaps = 43/321 (13%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEIL 92
           D F  +  K+ + Y  + E   RF  FK + +    +        +G +  SD +  E  
Sbjct: 49  DHFSLFKSKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSARHGVTQFSDLTRSEFR 108

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           ++    L  +   +L  D  +      E     LP+  DWR      + PV++QG CGSC
Sbjct: 109 KK---HLGVRAGFKLPKDANKAPILPTEN----LPEDFDWRDRGA--VTPVKNQGSCGSC 159

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-V 202
           W+F+ T  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +
Sbjct: 160 WSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTL 219

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGV 259
           K  GL  + DYPY  K+  T  C  +K K    V +  V S +D      +L+++GP+ V
Sbjct: 220 KTGGLMKEEDYPYTGKDGKT--CKLDKSKIVASVSNFSVIS-IDEEQIAANLVKNGPLAV 276

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGD 312
            +N   +++Y G       + C   +L+H V +VGYG              WI++NSWG+
Sbjct: 277 AINAGYMQTYIGG--VSCPYICT-RRLNHGVLLVGYGSAGYAPARFKEKPYWIIKNSWGE 333

Query: 313 IGPDHGYFQIERGANACGIES 333
              ++G+++I +G N CG++S
Sbjct: 334 TWGENGFYKICKGRNICGVDS 354


>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
          Length = 363

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 156/318 (49%), Gaps = 41/318 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F ++  K++++Y+   E   RF  FK +  +   +        +G +  SD +  E  +R
Sbjct: 48  FTSFKSKFSKSYSTKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASE-FRR 106

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
             L L  K++ RL A  ++            LP+  DWR+     + PV+ QG CGSCWA
Sbjct: 107 QFLGL--KKRLRLPAHAQKAPILPTTN----LPEDFDWREKGA--VTPVKDQGSCGSCWA 158

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYVKQY 205
           F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+ Q 
Sbjct: 159 FSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQS 218

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            G+  + DY Y  ++     C ++K K    V +  V S  +  +  +L+++GP+ V +N
Sbjct: 219 GGVVQEKDYAYTGRDG---SCKFDKSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGIN 275

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y         + C   +LDH V +VG+G+             WIV+NSWG    
Sbjct: 276 AAWMQTYMSGV--SCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWG 333

Query: 316 DHGYFQIERGANACGIES 333
           + GY++I RG N CG++S
Sbjct: 334 EQGYYKICRGRNVCGVDS 351


>gi|118485910|gb|ABK94801.1| unknown [Populus trichocarpa]
          Length = 367

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 105/336 (31%), Positives = 158/336 (47%), Gaps = 43/336 (12%)

Query: 26  IYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY-------- 77
           I V  D   D +     F ++  K+ +TY    E   RF  FK + +   ++        
Sbjct: 34  IQVVSDGEDDLLNAEHHFTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKKHQMIDPTAA 93

Query: 78  YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKV 137
           +G +  SD +P+E  +R  L L  K + RL  D  +            LP   DWR    
Sbjct: 94  HGVTKFSDLTPKE-FRRQFLGL--KRRLRLPTDANKAPILPTTD----LPTDYDWRDHGA 146

Query: 138 KVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNL 188
             +  V+ QG CGSCW+F+ T  LE    L    L  LS+ QLV+CDH          + 
Sbjct: 147 --VTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDS 204

Query: 189 NCNGGNIDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
            C+GG ++ AFEY +K  GLE + DYPY   +  T  C ++K K    V +  V S +D 
Sbjct: 205 GCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGGT--CKFDKSKVVASVSNFSVVS-IDE 261

Query: 248 ---MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT- 303
                +L++ GP+ V +N   +++Y G       + C+  + DH V +VGYG        
Sbjct: 262 DQIAANLVKHGPLSVAINAAFMQTYVGG--VSCPYICSKRQ-DHGVLLVGYGSAGYAPIR 318

Query: 304 ------WIVRNSWGDIGPDHGYFQIERGANACGIES 333
                 WI++NSWG    ++GY++I RG N CG++S
Sbjct: 319 FKEKPFWIIKNSWGQNWGENGYYKICRGRNICGVDS 354


>gi|317106675|dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas]
          Length = 368

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 100/320 (31%), Positives = 152/320 (47%), Gaps = 46/320 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEIL-Q 93
           F T+  K+ +TY    E   RF+ FK + +   ++        +G +  SD +P+E   Q
Sbjct: 52  FTTFKAKFGKTYATQEEHDYRFKLFKANLRRARKHQMMDPTAVHGVTMFSDLTPREFRRQ 111

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GLR     + RL AD               LP   DWR      +  V++QG CGSCW
Sbjct: 112 YLGLR-----RLRLPADAHEAPIL----PTNDLPTDFDWRDHGA--VTNVKNQGSCGSCW 160

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VK 203
           +F+    LE    L    L  LS+ QLV+CDH          +  CNGG +  AFEY +K
Sbjct: 161 SFSAAGALEGAHFLATGELVSLSEQQLVDCDHECDPEEYGACDSGCNGGLMTTAFEYTLK 220

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVY 260
             GLE + DYPY   +     C +++ K    V +  V S +D      +L++ GP+ V 
Sbjct: 221 AGGLEREEDYPYTGNDRGP--CKFDRNKIVASVSNFSVVS-IDEDQIAANLVKHGPLAVG 277

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDI 313
           +N   +++Y G       + C+  + DH V +VGYG              WI++NSWG+ 
Sbjct: 278 INAVFMQTYMGG--VSCPYICSKRQ-DHGVLLVGYGSAGYAPIRLKDKPFWIIKNSWGES 334

Query: 314 GPDHGYFQIERGANACGIES 333
             ++GY++I RG N CG+++
Sbjct: 335 WGENGYYRICRGRNICGVDA 354


>gi|444510192|gb|ELV09527.1| Cathepsin F [Tupaia chinensis]
          Length = 597

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 101/320 (31%), Positives = 159/320 (49%), Gaps = 31/320 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E + R   F  +     +          YG +  SD 
Sbjct: 293 SVKMASIFKNFVTTYNRTYQTKEEAQWRLSVFASNMVRAQKIQALDHGTAQYGVTKFSDL 352

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E   RT + L    +E +   +  + K + +    P P   DWR  K   +  V+ Q
Sbjct: 353 TEEEF--RT-IYLNPLLRE-VPGKKMHLAKSIGD----PAPPEWDWR--KNGAVTKVKDQ 402

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 403 GMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLG 462

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y+        C +  EKAKV++ D+   S  +  +   L + GPI V +N 
Sbjct: 463 GLETEDDYSYQGHMQA---CNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISVAINA 519

Query: 264 RLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
             ++ Y     +P+R     C+P  +DHAV IVGYG ++ +  W ++NSWG    + GY+
Sbjct: 520 FGMQFYRHGIAHPLRP---LCSPWLIDHAVLIVGYGNRSEVPFWAIKNSWGTDWGEKGYY 576

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ +CG+ + A  A V
Sbjct: 577 YLHRGSGSCGVNTMASSAVV 596


>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
          Length = 324

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 96/307 (31%), Positives = 150/307 (48%), Gaps = 36/307 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------------YGTSGSSDRSPQE 90
           F+ + ++  +TY +  E   RF  F  + +  + +             G +  +D S +E
Sbjct: 26  FQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQEE 85

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
              +T L L+   K  LE     VK  +       +P S+DWR  K   +  V+ QG CG
Sbjct: 86  F--KTMLTLSASRKPTLET-TSYVKTGVE------IPSSVDWR--KEGRVTGVKDQGDCG 134

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVEC-DHGNLNCNGGNIDVAFEYVKQYGLES 209
           SCWAF+ T   E   A     L  LS+ QL++C    +  C+GG++D  F+YV + GL+S
Sbjct: 135 SCWAFSITGSTEGAYARKSGKLVSLSEQQLIDCCTDTSAGCDGGSLDDNFKYVMKDGLQS 194

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS----GVDHMMHLLQS-GPIGVYLNHR 264
           +  Y Y+ ++     C Y    A V  + +  TS      D ++  + + GP+ V ++  
Sbjct: 195 EESYTYKGEDG---ACKYNV--ASVVTKVSKYTSIPAEDEDALLEAVATVGPVSVGMDAS 249

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
            + SYD       D  C+P  L+HA+  VGYG +NG   WI++NSWG    + GYF++ R
Sbjct: 250 YLSSYDSGIYEDQD--CSPAGLNHAILAVGYGTENGKDYWIIKNSWGASWGEQGYFRLAR 307

Query: 325 GANACGI 331
           G N CGI
Sbjct: 308 GKNQCGI 314


>gi|307206026|gb|EFN84119.1| Cathepsin O [Harpegnathos saltator]
          Length = 353

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 98/322 (30%), Positives = 157/322 (48%), Gaps = 37/322 (11%)

Query: 38  KQVDAFKTYIVKWNRTYT-DDNEIKTRFEYFKQDGKETDE-----------YYGTSGSSD 85
           + +  F  Y+ ++N++Y  D  E   RF+ F++  +  +            YYG +  SD
Sbjct: 31  EDIKLFVDYVARYNKSYRHDPPEYNERFDRFQRSLRHIERMNGFRSSQESAYYGLTEFSD 90

Query: 86  RSPQEILQRTGLRLTGKEKERLEADR--ERVKKFLNER--KKGPLPKSLDWRQSKVKVLN 141
            S  E +QRT L       +  +A     R  K  N R  ++  +P  +DWR   V  + 
Sbjct: 91  LSEDEFVQRTLLPDLSSRGQMHKAASYYHRHTKNTNNRSERETNVPPKIDWRDKGV--VG 148

Query: 142 PVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEY 201
           P++SQ  CG+CWAF+T  + ES  A+   TLYP S  ++++C  G+  C GG+I     +
Sbjct: 149 PIQSQEICGACWAFSTIGVAESMYAMKNGTLYPFSVQEMIDCMPGDFGCQGGDICSLLSW 208

Query: 202 V--KQYGLESQADYPYRNKENITFRCTYEKEKAKV-------FVQDTWVTSGVDHMMHLL 252
           +   +  +  ++ YP   +++   +C   K  AK        F  D++  +  D ++ LL
Sbjct: 209 LLTSKTKIIPESAYPLTRRDD---QCKLLKLSAKTSGVGITDFTCDSFADAE-DELLALL 264

Query: 253 QS-GPIGVYLNHRLIESYDGNPIRRNDWACNP--HKLDHAVAIVGYGEKNGILTWIVRNS 309
            S GP+   +N    ++Y G  I+   + C+     L+HAV IVGY    GI  +IV+NS
Sbjct: 265 ASHGPVAAAVNAISWQNYLGGVIQ---YHCDGSFSSLNHAVQIVGYDLSAGIPHYIVKNS 321

Query: 310 WGDIGPDHGYFQIERGANACGI 331
           WG    D GY  I  G+N CGI
Sbjct: 322 WGTAFGDKGYLYISIGSNLCGI 343


>gi|348565006|ref|XP_003468295.1| PREDICTED: cathepsin W-like [Cavia porcellus]
          Length = 375

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 92/332 (27%), Positives = 156/332 (46%), Gaps = 41/332 (12%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSPQEI 91
           + FK + +++NR+Y++  E   R + F          Q+ +     +G +  SD + +E 
Sbjct: 40  EVFKLFQIQFNRSYSNQAEYARRLDIFVHNLATAQRLQEEELGTAEFGVTPFSDLTEEEF 99

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
            Q  G R   ++  R+       +K   ++++  + +S DWR  K  +++PV++QG C  
Sbjct: 100 GQLYGNRRVARKDLRV------ARKVSFDKQEELMSQSCDWR--KAHIISPVKNQGNCRC 151

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQ 210
           CWA A    +E+   +  K    LS  +L++C      C GG I  AF  V  Y GL S+
Sbjct: 152 CWAIAAAGNIEAMWNIRYKVSVTLSVQELLDCARCEDGCAGGYIWDAFITVLNYSGLASE 211

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIES 268
            DYP+R   NI  +C     +   ++ D  +    +  +  ++   GPI V +N ++++ 
Sbjct: 212 KDYPFRGHANI-HKCLASNYRKVAWIYDYIMLPRDEQGIARYVATQGPITVIINSKILQH 270

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGE--------------------KNGILTWIVRN 308
           Y    I+     C+P  +DH V +VGYG                     ++ I  WI++N
Sbjct: 271 YKKGIIKGTSSKCDPWFVDHYVLLVGYGRSKAEEEKWTETDLSHSNRPPRHSIPYWILKN 330

Query: 309 SWGDIGPDHGYFQIERGANACGIESYAYLASV 340
           SWG    + GYF++ RG+N CGI  Y   A V
Sbjct: 331 SWGANWGEEGYFRLHRGSNTCGITKYPITARV 362


>gi|56682917|gb|AAW21813.1| cysteine protease [Triticum aestivum]
          Length = 377

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 103/328 (31%), Positives = 155/328 (47%), Gaps = 46/328 (14%)

Query: 46  YIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQRT-- 95
           ++ ++ +TY D  E   R   FK + +    +        +G +  SD +P E  +RT  
Sbjct: 56  FVQRFGKTYRDAEEHAHRLSVFKANLRRARRHQMLDPSAEHGVTKFSDLTPAE-FRRTFL 114

Query: 96  GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
           GL+ T +   R  A        L       LP+  DWR      + PV++QG C SCW+F
Sbjct: 115 GLKTTRRSFLREMAGSAHDAPVLPTDG---LPEDFDWRDHGA--VGPVKNQGSCWSCWSF 169

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQY 205
           + +  LE    L    +  LS+ QLV+CDH          +  CNGG +  AF Y +K  
Sbjct: 170 SASGALEGANYLATGKMEVLSEQQLVDCDHECDPAEPDSCDAGCNGGLMTSAFSYLLKSG 229

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYLN 262
           GLE + DYPY  K+     C +EK K    VQ+  V + VD      +L++ GP+ + +N
Sbjct: 230 GLEREKDYPYTGKDGT---CKFEKSKIAASVQNFSVVA-VDEEQIAANLVEYGPLAIGIN 285

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y G       + C  H LDH V +VGYG      +       WI++NSWG+   
Sbjct: 286 AAYMQTYIGG--VSCPYICGRH-LDHGVLLVGYGASGFAPSRFKEKPYWIIKNSWGENWG 342

Query: 316 DHGYFQIERGANA---CGIESYAYLASV 340
           D GY++I RG+N    CG++S     S 
Sbjct: 343 DKGYYKICRGSNVRNKCGVDSMVSTVSA 370


>gi|355751926|gb|EHH56046.1| Cathepsin F, partial [Macaca fascicularis]
          Length = 381

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/317 (32%), Positives = 153/317 (48%), Gaps = 27/317 (8%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +K    FK +++ +NRTY    E + R   F  +     +          YG +  SD +
Sbjct: 78  VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT 137

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPVESQ 146
            +E   RT + L    +E      E   K    +  G L P   DWR      +  V+ Q
Sbjct: 138 EEEF--RT-IYLNPLLRE------EPGNKMKQAKSVGDLAPPEWDWRSKGA--VTKVKDQ 186

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 187 GMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLG 246

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY YR        C +  EKAKV++ D+   S  +  +   L + GPI V +N 
Sbjct: 247 GLETEDDYSYRGHMQA---CNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISVAINA 303

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
             ++ Y     R     C+P  +DHAV +VGYG ++ I  W ++NSWG    + GY+ + 
Sbjct: 304 FGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDIPFWAIKNSWGTDWGEKGYYYLH 363

Query: 324 RGANACGIESYAYLASV 340
           RG+ ACG+ + A  A V
Sbjct: 364 RGSGACGVNTMASSAVV 380


>gi|397517049|ref|XP_003828732.1| PREDICTED: cathepsin F [Pan paniscus]
          Length = 379

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 100/320 (31%), Positives = 152/320 (47%), Gaps = 33/320 (10%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +K    FK +++ +NRTY    E + R   F  +     +          YG +  SD +
Sbjct: 76  VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT 135

Query: 88  PQE---ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPV 143
            +E   I     LR            +E   K    +  G L P   DWR      +  V
Sbjct: 136 EEEFRTIYLNPLLR------------KEPGNKMKQAKSVGDLAPPEWDWRSKGA--VTKV 181

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           + QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K
Sbjct: 182 KDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIK 241

Query: 204 QYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVY 260
             G LE++ DY Y+        C +  EKAKV++ D+ V S  +  +   L + GPI V 
Sbjct: 242 NLGGLETEDDYSYQGHMQ---SCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPISVA 298

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           +N   ++ Y     R     C+P  +DHAV +VGYG ++ +  W ++NSWG    + GY+
Sbjct: 299 INAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYY 358

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 359 YLHRGSGACGVNTMASSAVV 378


>gi|57282617|emb|CAE54306.1| putative papain-like cysteine proteinase [Gossypium hirsutum]
          Length = 373

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 152/315 (48%), Gaps = 48/315 (15%)

Query: 49  KWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQRTGLRLT 100
           ++ ++Y    E   RF+ F+ + +    +        +G +  SD +P E          
Sbjct: 64  RFKKSYGSQKEHDYRFKIFQVNLRRAARHQNLDPSATHGVTQFSDLTPGEF--------- 114

Query: 101 GKEKERLEADRERVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGRCGSCWAFAT 157
              K  L   R R+ K   E    P   LP+  DWR+     + PV++QG CGSCW+F+T
Sbjct: 115 --RKAYLGLRRLRLPKDATEAPILPTDNLPQDFDWREKGA--VTPVKNQGSCGSCWSFST 170

Query: 158 TAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQYGL 207
           T  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +K  GL
Sbjct: 171 TGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGL 230

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRL 265
             + DYPY   +  T  C ++  K    V +  V S  +  +  +L ++GP+ V +N   
Sbjct: 231 MREEDYPYTGTDRGT--CKFDNTKVAAKVANFSVVSLDEDQIAANLFKNGPLAVAINAVF 288

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGPDHG 318
           +++Y G       + C+  +LDH V +VGYG              WI++NSWG+   ++G
Sbjct: 289 MQTYIGG--VSCPYICS-KRLDHGVLLVGYGSAGYAPVRMKDKPYWIIKNSWGENWGENG 345

Query: 319 YFQIERGANACGIES 333
           +++I RG N CG++S
Sbjct: 346 FYRICRGRNICGVDS 360


>gi|301784869|ref|XP_002927853.1| PREDICTED: cathepsin F-like [Ailuropoda melanoleuca]
          Length = 394

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 103/321 (32%), Positives = 159/321 (49%), Gaps = 33/321 (10%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S++ V  FK ++  +NRTY    E + R   F  +     +          YG +  SD 
Sbjct: 90  SVRMVSIFKEFVTTYNRTYESKEEAEWRMSVFSNNVMRAQKIQALDRGTAQYGITKFSDL 149

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKG-PLPKSLDWRQSKVKVLNPVES 145
           + +E   RT + L    +E       R KK    +  G   P   DWR      +  V+ 
Sbjct: 150 TEEEF--RT-IYLNPLLRE------NRGKKMDLAKSIGDSAPPEWDWRNKGA--VTQVKD 198

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+ T  +E Q  L +  L  LS+ +L++CD  +  C GG    A+  +K  
Sbjct: 199 QGMCGSCWAFSVTGNVEGQWFLKRGALLSLSEQELLDCDKVDKACLGGLPSNAYSAIKTL 258

Query: 206 G-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
           G LE++ DY YR        C++  +KA+V++ D+   S  +  +   L Q+GPI V +N
Sbjct: 259 GGLETEDDYSYRGHVQT---CSFSSKKARVYINDSVELSQNEQKLVAWLAQNGPISVAIN 315

Query: 263 HRLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGY 319
              ++ Y     +P+R     C+P  +DHAV +VGYG ++GI  W ++NSWG    + GY
Sbjct: 316 AFGMQFYRRGISHPLRP---LCSPWLIDHAVLLVGYGNRSGIPFWAIKNSWGTDWGEEGY 372

Query: 320 FQIERGANACGIESYAYLASV 340
           + + RG+ ACG+ + A  A V
Sbjct: 373 YYLHRGSGACGVNTMASSAVV 393


>gi|118485796|gb|ABK94746.1| unknown [Populus trichocarpa]
          Length = 367

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/319 (31%), Positives = 153/319 (47%), Gaps = 43/319 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F ++  K+ +TY    E   RF  FK + +   ++        +G +  SD +P+E  +R
Sbjct: 51  FTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKKHQMIDPTAAHGITKFSDLTPKE-FRR 109

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
             L L  K   RL  D  +            LP   DWR      +  V+ QG CGSCW+
Sbjct: 110 QFLGL--KRWLRLPTDANKAPILPTTD----LPTDYDWRDHGA--VTEVKDQGSCGSCWS 161

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+ T  LE    L    L  LS+ QLV+CDH          +  C+GG ++ AFEY +K 
Sbjct: 162 FSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMNNAFEYALKA 221

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GLE +ADYPY   +  T  C ++K K    V +  V S +D      +L++ GP+ V +
Sbjct: 222 GGLEREADYPYTGTDGGT--CKFDKSKVVASVSNFSVVS-IDEDQIAANLVKHGPLSVAI 278

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G       + C+  + DH V +VGYG              WI++NSWG   
Sbjct: 279 NAAFMQTYVGG--VSCPYICSKRQ-DHGVLLVGYGSAGYAPIRFKEKPFWIIKNSWGQNW 335

Query: 315 PDHGYFQIERGANACGIES 333
            ++GY++I RG N CG++S
Sbjct: 336 GENGYYKICRGRNICGVDS 354


>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
          Length = 366

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 105/345 (30%), Positives = 161/345 (46%), Gaps = 48/345 (13%)

Query: 15  QVTYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET 74
           QVT  V +D  I   R   +++      F+ +I ++ + Y+   E + RF  FK +    
Sbjct: 32  QVTDEVVSDPQILDARSALFNAEVH---FRHFIRRYGKKYSGPEEHEHRFGVFKSNLLRA 88

Query: 75  DEY--------YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL 126
            E+        +G +  SD      L + G R    +   L A   R            L
Sbjct: 89  LEHQKLDPRASHGVTKFSD------LTQEGFR---HQYLGLRAPPLRDAHDAPILPTNDL 139

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH- 185
           P+  DWR+     +  V++QG CGSCWAF+TT  LE    L    L  LS+ QLV+CDH 
Sbjct: 140 PEDFDWREKGA--VTEVKNQGSCGSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDHE 197

Query: 186 --------GNLNCNGGNIDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFV 236
                    +  CNGG +  A++Y +K  GLE + DYPY  K+     C++ K K    V
Sbjct: 198 CDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGKDGT---CSFNKNKIVAHV 254

Query: 237 QDTWVTSGVDH---MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIV 293
            +  V S +D      +L+++GP+ V +N   +++Y G       + C+   LDH V +V
Sbjct: 255 SNFSVVS-IDEGQIAANLVKNGPLSVGINAAFMQTYVGG--VSCPYVCSKRNLDHGVLLV 311

Query: 294 GYGEKNGILT-------WIVRNSWGDIGPDHGYFQIERGANACGI 331
           GYG              W+++NSWG    ++GY+++ RG N CGI
Sbjct: 312 GYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLCRGHNVCGI 356


>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 104/317 (32%), Positives = 159/317 (50%), Gaps = 43/317 (13%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQR------- 94
           A+++++VK  ++Y    E + RF+ FK +    DE    + + DRS +  L R       
Sbjct: 43  AYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDE---QNAAKDRSFKLGLNRFADLTNE 99

Query: 95  ------TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
                 TG+R T   ++++    +R      E     LP+S+DWR+     +  V+ QG+
Sbjct: 100 EYRSKYTGIR-TKDSRKKVSGKSQRYASLAGES----LPESVDWREHGA--VASVKDQGQ 152

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYG 206
           CGSCWAF+T + +E    +    L  LS+ +LV+CD   N  CNGG +D AF+++    G
Sbjct: 153 CGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNGG 212

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSG----PIGVYL- 261
           ++S ADYPY  ++    +C   ++ AKV   D++     ++    LQ      PI V + 
Sbjct: 213 IDSDADYPYTGRDG---QCDQYRKNAKVVTIDSY-EDVPEYDEKALQKAAANQPISVAIE 268

Query: 262 -NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
            + R  + YD          C    LDH V +VGYG +NG   WIVRNSWG    + GY 
Sbjct: 269 ASGRDFQFYDSGIFTGK---CGT-DLDHGVVVVGYGTENGKDYWIVRNSWGADWGEKGYL 324

Query: 321 QIERG----ANACGIES 333
           ++ERG    A  CGI S
Sbjct: 325 RMERGISSKAGICGITS 341


>gi|324514421|gb|ADY45863.1| Viral cathepsin [Ascaris suum]
          Length = 399

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 92/321 (28%), Positives = 157/321 (48%), Gaps = 27/321 (8%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSG 82
           +L+ D    +D+F  ++ +++R Y+ ++E + RF  F ++ K          +  +G + 
Sbjct: 87  ELSNDYPIYIDSFVKFMQEYDRQYSSNDETRLRFRNFVRNMKFIKKAQKGRDNVVFGITR 146

Query: 83  SSDRSPQEILQRTGLRLTGKE---KERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKV 139
            +D S  E+   T       E   +  L+ D++   +  +       P + DWR   V  
Sbjct: 147 FTDWSEAEMKSMTCEDWAANEVGSEITLDDDQDESDEVFDR------PDAFDWRTKSV-- 198

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF 199
           +  ++ Q RCGSCWAFA   ++ES  A+ K  L  LS+ +L++CD  +  C+GG    AF
Sbjct: 199 VTDIKDQERCGSCWAFAAIGVVESMNAIAKNPLISLSEQELIDCDTDDNGCSGGYRPYAF 258

Query: 200 EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT-WVTSGVDHMM-HLLQSGPI 257
            YV+++G+ S+ DYPY+ KE    +C       +V+++   ++    D M   +   GPI
Sbjct: 259 RYVRRHGIVSEKDYPYKGKEQS--QCA--ANGTRVYIKSVKYIGRNEDAMADFVFYRGPI 314

Query: 258 GVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
            V +N          G    + +      +  HAVA+VGYG +NG   W+++NSWG    
Sbjct: 315 SVGINVTKEFFHYRSGVFTPKKEDCEEDSQGSHAVAVVGYGSQNGEDYWLIKNSWGKKWG 374

Query: 316 DHGYFQIERGANACGIESYAY 336
             GY   +RG N CGI +  +
Sbjct: 375 MDGYVLYKRGENCCGIANTPF 395


>gi|298713906|emb|CBJ33775.1| Cathepsin-like proteinase [Ectocarpus siliculosus]
          Length = 462

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 98/330 (29%), Positives = 153/330 (46%), Gaps = 49/330 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------------------------Y 78
           F+ + +K+ ++Y +D+E   RFE FK++ K  DE                         Y
Sbjct: 126 FQEFGIKFEKSYENDDEKAMRFEVFKRNLKRIDERNSKSLGVKYDVTMWTDLTHEEFKGY 185

Query: 79  GTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKF--LNERKKGPLPKSLDWRQSK 136
              G      +E+ +   +  + K+   +    +   +F  L +   G LP   DWR   
Sbjct: 186 QNYGKISDEAKEVARSKAM--STKDASDMYESCQSCTRFPELEQYITGDLPTEFDWRD-- 241

Query: 137 VKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNI 195
              + PV++Q  CGSCW F+TT  LE    L    L  LS+ QLV CD   N  CNGG  
Sbjct: 242 YGAVTPVKNQAYCGSCWTFSTTGCLEGAWYLSGHPLESLSEQQLVACDTSYNQGCNGGWP 301

Query: 196 DVAFEYV-KQYGLESQADYPYRN-------KENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
            ++ +Y+ K  G+  ++ YPYR         + +      E   A     +  V    D 
Sbjct: 302 SISMDYISKNGGIVPESIYPYRKVFMNGHLGDPVCSDVVKEGNYAATLAIE--VALAEDS 359

Query: 248 MMH------LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGI 301
           M        L+ +GP+ V L+   ++ Y    I   ++ C P ++DHAV IVGYGE++G+
Sbjct: 360 MTEEAMARWLILNGPLSVALDAMGMDYYS-EGIDMGEY-CEPLEIDHAVLIVGYGEEDGV 417

Query: 302 LTWIVRNSWGDIGPDHGYFQIERGANACGI 331
             WI++NSW  +  + GY+++ RG NACGI
Sbjct: 418 KYWIIKNSWKYLWGERGYYRLVRGVNACGI 447


>gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera]
          Length = 377

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 99/324 (30%), Positives = 156/324 (48%), Gaps = 54/324 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ ++Y    E   RF+ FK + +    +        +G +  SD +P E  + 
Sbjct: 62  FSIFKRRFGKSYASQEEHDYRFKVFKANLRRARRHQQLDPSATHGVTQFSDLTPAE-FRG 120

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGP------LPKSLDWRQSKVKVLNPVESQGR 148
           T L L             R  K  ++ +K P      LP+  DWR      +  V++QG 
Sbjct: 121 TYLGL-------------RPLKLPHDAQKAPILPTNDLPEDFDWRDHGA--VTAVKNQGS 165

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAF 199
           CGSCW+F+TT  LE    L    L  LS+ QLVECDH          +  CNGG ++ AF
Sbjct: 166 CGSCWSFSTTGALEGANFLATGNLVSLSEQQLVECDHECDPEEMGSCDSGCNGGLMNTAF 225

Query: 200 EY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGP 256
           EY +K  GL  + DYPY   +  +  C ++K K    V +  V S  +  +  +L++ GP
Sbjct: 226 EYTLKAGGLMKEEDYPYTGTDRGS--CKFDKTKIAASVSNFSVISLDEDQIAANLVKIGP 283

Query: 257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNS 309
           + V +N   +++Y G       + C+  +LDH V +VGYG              WI++NS
Sbjct: 284 LAVAINAVFMQTYVGG--VSCPYICS-KRLDHGVLLVGYGSAGYAPIRMKDKPYWIIKNS 340

Query: 310 WGDIGPDHGYFQIERGANACGIES 333
           WG+   ++G+++I RG N CG++S
Sbjct: 341 WGENWGENGFYKICRGRNVCGVDS 364


>gi|86279345|gb|ABC88768.1| putative cathepsin L-like proteinase [Tenebrio molitor]
          Length = 328

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 78/220 (35%), Positives = 127/220 (57%), Gaps = 15/220 (6%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
           K PL  S+DWR + V   + V+ QG+CGSCW+F+TT  +E Q+AL +  L  LS+  L++
Sbjct: 112 KKPLAASVDWRSNAV---SEVKDQGQCGSCWSFSTTGAVEGQLALQRGGLTSLSEQNLID 168

Query: 183 C--DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           C   +GN  C+GG +D AF Y+  YG+ S++ YPY  +++    C ++  ++   +   +
Sbjct: 169 CSSSYGNAGCDGGWMDSAFSYIHDYGIMSESAYPYEAQDDY---CRFDSSQSVTTLSGYY 225

Query: 241 -VTSGVDHMM--HLLQSGPIGVYLNHR-LIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
            + SG ++ +   + Q+GP+ V ++    ++ Y G      D  CN   L+H V +VGYG
Sbjct: 226 DLPSGDENSLADAVGQAGPVAVAIDATDELQFYSGGLFY--DQTCNQSDLNHGVFVVGYG 283

Query: 297 EKNGILTWIVRNSWGDIGPDHGYF-QIERGANACGIESYA 335
             NG   WI++NSWG    ++GY+ Q+    N CGI + A
Sbjct: 284 SDNGQDYWILKNSWGSGWGENGYWTQVRNYGNNCGIATAA 323


>gi|355566270|gb|EHH22649.1| Cathepsin F [Macaca mulatta]
          Length = 484

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 105/317 (33%), Positives = 155/317 (48%), Gaps = 27/317 (8%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +K    FK +++ +NRTY    E + R   F  +     +          YG +  SD +
Sbjct: 181 VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT 240

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPVESQ 146
            +E   RT + L    +E      E   K    +  G L P   DWR SK  V   V+ Q
Sbjct: 241 EEEF--RT-IYLNPLLRE------EPGNKMKQAKSVGDLAPPEWDWR-SKGAVTK-VKDQ 289

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 290 GMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLG 349

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY YR        C +  EKAKV++ D+   S  +  +   L + GPI V +N 
Sbjct: 350 GLETEDDYSYRGHMQA---CNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISVAINA 406

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
             ++ Y     R     C+P  +DHAV +VGYG ++ I  W ++NSWG    + GY+ + 
Sbjct: 407 FGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDIPFWAIKNSWGTDWGEKGYYYLH 466

Query: 324 RGANACGIESYAYLASV 340
           RG+ ACG+ + A  A V
Sbjct: 467 RGSGACGVNTMASSAVV 483


>gi|345798093|ref|XP_536212.3| PREDICTED: pro-cathepsin H [Canis lupus familiaris]
          Length = 350

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 83/224 (37%), Positives = 127/224 (56%), Gaps = 14/224 (6%)

Query: 121 RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQL 180
           R  GP P  +DWR+ K K ++PV++QG CGSCW F+TT  LES +A+    L  L++ QL
Sbjct: 125 RGTGPYPPFVDWRK-KGKFVSPVKNQGSCGSCWTFSTTGALESAIAIKSGKLLSLAEQQL 183

Query: 181 VEC--DHGNLNCNG-GNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFV 236
           V+C  +  N  C G G    AFEY++   G+  +  YPY+ ++     C Y+  KA  FV
Sbjct: 184 VDCAQNFNNHGCQGYGAPLQAFEYIRYNKGIMGEDSYPYKGQDG---DCKYQPSKAIAFV 240

Query: 237 QDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDW---ACN--PHKLDHAVA 291
           +D      ++    ++++  +   ++     + D    R+  +   +C+  P K++HAV 
Sbjct: 241 KDV-ANITINDEQAMVEAVALYNPVSFAFEVTSDFMMYRKGIYSSTSCHKTPDKVNHAVL 299

Query: 292 IVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
            VGYGE+NGI  WIV+NSWG     +GYF +ERG N CG+ + A
Sbjct: 300 AVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMERGKNMCGLAACA 343


>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
 gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
           Precursor
 gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
 gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
 gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
 gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
          Length = 368

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 97/321 (30%), Positives = 153/321 (47%), Gaps = 43/321 (13%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEIL 92
           D F  +  K+ + Y  + E   RF  FK + +    +        +G +  SD +  E  
Sbjct: 49  DHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFR 108

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           ++    L  +   +L  D  +      E     LP+  DWR      + PV++QG CGSC
Sbjct: 109 KK---HLGVRSGFKLPKDANKAPILPTEN----LPEDFDWRDHGA--VTPVKNQGSCGSC 159

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-V 202
           W+F+ T  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +
Sbjct: 160 WSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTL 219

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGV 259
           K  GL  + DYPY  K+  T  C  +K K    V +  V S +D      +L+++GP+ V
Sbjct: 220 KTGGLMKEEDYPYTGKDGKT--CKLDKSKIVASVSNFSVIS-IDEEQIAANLVKNGPLAV 276

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGD 312
            +N   +++Y G       + C   +L+H V +VGYG              WI++NSWG+
Sbjct: 277 AINAGYMQTYIGG--VSCPYICT-RRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGE 333

Query: 313 IGPDHGYFQIERGANACGIES 333
              ++G+++I +G N CG++S
Sbjct: 334 TWGENGFYKICKGRNICGVDS 354


>gi|402892718|ref|XP_003909556.1| PREDICTED: cathepsin F [Papio anubis]
          Length = 460

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/317 (32%), Positives = 153/317 (48%), Gaps = 27/317 (8%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +K    FK +++ +NRTY    E + R   F  +     +          YG +  SD +
Sbjct: 157 VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT 216

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPVESQ 146
            +E   RT + L    +E      E   K    +  G L P   DWR      +  V+ Q
Sbjct: 217 EEEF--RT-IYLNPLLRE------EPGNKMKQAKSVGDLAPPEWDWRSKGA--VTKVKDQ 265

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 266 GMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLG 325

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY YR        C +  EKAKV++ D+   S  +  +   L + GPI V +N 
Sbjct: 326 GLETEDDYSYRGHMQA---CNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINA 382

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
             ++ Y     R     C+P  +DHAV +VGYG ++ I  W ++NSWG    + GY+ + 
Sbjct: 383 FGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDIPFWAIKNSWGTDWGEKGYYYLH 442

Query: 324 RGANACGIESYAYLASV 340
           RG+ ACG+ + A  A V
Sbjct: 443 RGSGACGVNTMASSAVV 459


>gi|3916212|gb|AAC78838.1| cathepsin F [Homo sapiens]
          Length = 338

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 100/320 (31%), Positives = 152/320 (47%), Gaps = 33/320 (10%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +K    FK +++ +NRTY    E + R   F  +     +          YG +  SD +
Sbjct: 35  VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT 94

Query: 88  PQE---ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPV 143
            +E   I   T LR            +E   K    +  G L P   DWR      +  V
Sbjct: 95  EEEFRTIYLNTLLR------------KEPGNKMKQAKSVGDLAPPEWDWRSKGA--VTKV 140

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           + QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K
Sbjct: 141 KDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIK 200

Query: 204 QYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVY 260
             G LE++ DY Y+        C +  EKAKV++ D+   S  +  +   L + GPI V 
Sbjct: 201 NLGGLETEDDYSYQGHMQ---SCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVA 257

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           +N   ++ Y     R     C+P  +DHAV +VGYG ++ +  W ++NSWG    + GY+
Sbjct: 258 INAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYY 317

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 318 YLHRGSGACGVNTMASSAVV 337


>gi|113603|sp|P05167.1|ALEU_HORVU RecName: Full=Thiol protease aleurain; Flags: Precursor
 gi|19021|emb|CAA28804.1| aleurain [Hordeum vulgare]
          Length = 362

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 92/309 (29%), Positives = 150/309 (48%), Gaps = 30/309 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-RLTG 101
           F  + V++ ++Y    E++ RF  F +  +E         S++R  + +  R G+ R + 
Sbjct: 61  FARFAVRYGKSYESAAEVRRRFRIFSESLEEVR-------STNR--KGLPYRLGINRFSD 111

Query: 102 KEKERLEADRERVKKFLNE--------RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
              E  +A R    +  +         R    LP++ DWR+  +  ++PV++Q  CGSCW
Sbjct: 112 MSWEEFQATRLGAAQTCSATLAGNHLMRDAAALPETKDWREDGI--VSPVKNQAHCGSCW 169

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLESQ 210
            F+TT  LE+           LS+ QLV+C  G  N  CNGG    AFEY+K   G++++
Sbjct: 170 TFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIKYNGGIDTE 229

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQSGPIGVYLNHRLIES- 268
             YPY+    +   C Y+ E A V V D+  +T   +  +         V +  ++I+  
Sbjct: 230 ESYPYKGVNGV---CHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVIDGF 286

Query: 269 --YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
             Y       +     P  ++HAV  VGYG +NG+  W+++NSWG    D+GYF++E G 
Sbjct: 287 RQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGK 346

Query: 327 NACGIESYA 335
           N C I + A
Sbjct: 347 NMCAIATCA 355


>gi|119594953|gb|EAW74547.1| cathepsin F, isoform CRA_a [Homo sapiens]
 gi|119594954|gb|EAW74548.1| cathepsin F, isoform CRA_a [Homo sapiens]
          Length = 392

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 100/320 (31%), Positives = 152/320 (47%), Gaps = 33/320 (10%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +K    FK +++ +NRTY    E + R   F  +     +          YG +  SD +
Sbjct: 89  VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT 148

Query: 88  PQE---ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPV 143
            +E   I   T LR            +E   K    +  G L P   DWR      +  V
Sbjct: 149 EEEFRTIYLNTLLR------------KEPGNKMKQAKSVGDLAPPEWDWRSKGA--VTKV 194

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           + QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K
Sbjct: 195 KDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIK 254

Query: 204 QYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVY 260
             G LE++ DY Y+        C +  EKAKV++ D+   S  +  +   L + GPI V 
Sbjct: 255 NLGGLETEDDYSYQGHMQ---SCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVA 311

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           +N   ++ Y     R     C+P  +DHAV +VGYG ++ +  W ++NSWG    + GY+
Sbjct: 312 INAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYY 371

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 372 YLHRGSGACGVNTMASSAVV 391


>gi|197258084|gb|ACH56226.1| cathepsin L-like cysteine proteinase [Radopholus similis]
          Length = 417

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 100/310 (32%), Positives = 158/310 (50%), Gaps = 31/310 (10%)

Query: 50  WNRTYTDDNEIKTRFEYFKQDGKET------------DEYYGTSGSSDRSPQEILQRTGL 97
           ++R +  + E   RF+ F+++  E             +  YG +G +D + +E + R  L
Sbjct: 104 FSREWNSERERWERFKLFERNLAEIARLNAEAKRTGRNMTYGVNGMADWTEEE-MGRMLL 162

Query: 98  RLTGKEKERLEADRERV------KKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGR 148
            L   ++ R+EA   R       + F +   + P    P+  DWR   V  + PV++QG+
Sbjct: 163 PLDHFKRRRVEAKFIRKMNPILRRAFTDRSAEEPGSEYPRHFDWRPRGV--VTPVKAQGQ 220

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLE 208
           CGSCWAFA  A  ES  A+    L  LS+ +L++C+  N  CNGG+ D AF Y+ + GL 
Sbjct: 221 CGSCWAFAAVATTESAYAVAHGHLRSLSEQELLDCNLENNACNGGSEDKAFRYIHERGLV 280

Query: 209 SQADYPY-RNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH-LLQSGPIGVYLNHRL- 265
           ++ +YPY  +++N+       K   K+ V   ++      MM  L+  GP+ V +     
Sbjct: 281 TEDEYPYVAHRQNVCSVDFGSKNLTKIDVA-VFINPDEQSMMDWLINFGPVNVGIAVPPD 339

Query: 266 IESYDGNPIRRNDWACNPHKLD-HAVAIVGYGE-KNGILTWIVRNSWGDI-GPDHGYFQI 322
           ++ Y       +D+ C    L  HA+ +VGYGE + G+  WIV+NSW +  G +HGY   
Sbjct: 340 MKPYKSGIYHPSDYDCKFRVLGLHALLVVGYGESQEGVKYWIVKNSWNNTWGQEHGYVNF 399

Query: 323 ERGANACGIE 332
            RG NACGIE
Sbjct: 400 VRGINACGIE 409


>gi|195729975|gb|ACG50798.1| cathepsin L1 [Fascioloides magna]
          Length = 327

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 132/228 (57%), Gaps = 12/228 (5%)

Query: 120 ERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQ 179
           + K   +P+S+DWR      +  V+ QG+CGSCWAF++T  +E Q     +T    S+ Q
Sbjct: 103 QAKGNDVPESIDWRD--YGYVTEVKDQGQCGSCWAFSSTGAMEGQYIKKFRTTVSFSEQQ 160

Query: 180 LVEC--DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKE--KAKVF 235
           LV+C  ++GN  CNGG ++ AFEY+++ GLE+++ YPYR  ++    C YE +   AKV 
Sbjct: 161 LVDCTRNYGNSGCNGGWMERAFEYLRRNGLETESSYPYRAVDD---HCRYESQLGVAKVT 217

Query: 236 VQDTWVTSGVDHMMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVG 294
              T  +     +M+++   GP+ V ++ +   S   + I +++  C+ + ++HAV  VG
Sbjct: 218 GYYTEHSGNEVSLMNMVGGEGPVAVAVDVQSDFSMYKSGIYQSE-TCSTYYVNHAVLAVG 276

Query: 295 YGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASVK 341
           YG ++G   WI++NSWG    D GY +  R   N CGI SYA +  V+
Sbjct: 277 YGTESGTDYWILKNSWGSWWGDQGYIRFARNRNNMCGIASYASVPMVE 324


>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
           Full=Turgor-responsive protein 15A; Flags: Precursor
 gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
          Length = 363

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 97/318 (30%), Positives = 156/318 (49%), Gaps = 41/318 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTSGSSDRSPQEILQR 94
           F ++  K++++Y    E   RF  FK +         ++    +G +  SD +  E  +R
Sbjct: 48  FTSFKSKFSKSYATKEEHDYRFGVFKSNLIKAKLHQNRDPTAEHGITKFSDLTASE-FRR 106

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
             L L  K++ RL A  ++            LP+  DWR+     + PV+ QG CGSCWA
Sbjct: 107 QFLGL--KKRLRLPAHAQKAPILPTTN----LPEDFDWREKGA--VTPVKDQGSCGSCWA 158

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYVKQY 205
           F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+ + 
Sbjct: 159 FSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLES 218

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLN 262
            G+  + DY Y  ++     C ++K K    V + + VT   D +  +L+++GP+ V +N
Sbjct: 219 GGVVQEKDYAYTGRDG---SCKFDKSKVVASVSNFSVVTLDEDQIAANLVKNGPLAVAIN 275

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y         + C   +LDH V +VG+G+             WI++NSWG    
Sbjct: 276 AAWMQTYMSGV--SCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWG 333

Query: 316 DHGYFQIERGANACGIES 333
           + GY++I RG N CG++S
Sbjct: 334 EQGYYKICRGRNVCGVDS 351


>gi|91992514|gb|ABE72973.1| cathepsin L [Aedes aegypti]
          Length = 265

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 91/270 (33%), Positives = 140/270 (51%), Gaps = 24/270 (8%)

Query: 78  YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERV---KKFLNERKKGPLPKSLDWRQ 134
           YG +  +D +  E  QRTGL +   E      DR  V   K  ++E  +  LP+S DWR+
Sbjct: 2   YGITHFADMTSAEYRQRTGLVIPRDE------DRNHVGNPKAEIDENME--LPESFDWRE 53

Query: 135 SKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGN 194
             +  ++PV++QG CGSCWAF+    +E    +  K L   S+ +L++CD  +  C GG 
Sbjct: 54  --LGAVSPVKNQGNCGSCWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCDAVDSACQGGY 111

Query: 195 IDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HL 251
           +D A++ +++  GLE +++YPY  K+  T  C +   +  V V+        +  M  +L
Sbjct: 112 MDDAYKAIEKIGGLELESEYPYLAKKQKT--CHFNSTEVHVRVKGAVDLPKNETAMAQYL 169

Query: 252 LQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK------NGILTWI 305
           + +GPI + LN   ++ Y G         C+   LDH V IVGYG K        +  WI
Sbjct: 170 VANGPISIGLNANAMQFYRGGISHPWKPLCSKKNLDHGVLIVGYGVKEYPMFNKTMPYWI 229

Query: 306 VRNSWGDIGPDHGYFQIERGANACGIESYA 335
           V+NSWG    + GY++I RG N CG+   A
Sbjct: 230 VKNSWGPKWGEQGYYRIFRGDNTCGVSEMA 259


>gi|19851|emb|CAA78365.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
          Length = 365

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 99/319 (31%), Positives = 154/319 (48%), Gaps = 45/319 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEILQR 94
           F  +  K+ + Y  + E   RF+ FK + +        +    +G +  SD +P E  +R
Sbjct: 49  FSLFKSKFGKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSE-FRR 107

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           T L L  K K ++ A++  +           LP   DWR      +  V++QG CGSCW+
Sbjct: 108 TYLGLH-KPKPKVNAEKAPI------LPTSDLPADYDWRDHGA--VTGVKNQGSCGSCWS 158

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  +E    L    L  LS+ QLV+CDH          +  C GG +  AFEY +K 
Sbjct: 159 FSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLMTTAFEYTLKA 218

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GL+ + DYPY  K+    +C ++K K    V +  V  G+D      +L++ GP+ V +
Sbjct: 219 GGLQLEKDYPYTGKDG---KCHFDKSKIAAAVTNFSVI-GLDEDQIAANLVKHGPLAVGI 274

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G         C   + DH V +VGYG              WI++NSWG+  
Sbjct: 275 NAAWMQTYVGG--VSCPLICFKRQ-DHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGENW 331

Query: 315 PDHGYFQIERGANACGIES 333
            +HGY++I RG N CG+++
Sbjct: 332 GEHGYYKICRGHNICGVDA 350


>gi|335281454|ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]
 gi|350579927|ref|XP_003480717.1| PREDICTED: cathepsin F-like [Sus scrofa]
          Length = 490

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 101/320 (31%), Positives = 155/320 (48%), Gaps = 31/320 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E + R   F  +     +          YG +  SD 
Sbjct: 186 SVKMASIFKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDL 245

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E        L  +E  R    + R+ K ++       P   DWR  K   +  V+ Q
Sbjct: 246 TEEEFRTIYLNPLLQEEPGR----KMRLAKSVSSLP----PPEWDWR--KKGAVTKVKDQ 295

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 296 GMCGSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCDKVDKGCMGGLPSNAYSAIKTLG 355

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY YR        C++  EKAKV++ D+   S  +  +   L + GPI V +N 
Sbjct: 356 GLETEEDYSYRGHLQT---CSFNAEKAKVYINDSVELSQNEQKLAAWLAEKGPISVAINA 412

Query: 264 RLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
             ++ Y     +P+R     C+P  +DHAV +VGYG ++    W ++NSWG    + GY+
Sbjct: 413 FGMQFYRHGISHPLRP---LCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTDWGEEGYY 469

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+   A  A V
Sbjct: 470 YLYRGSGACGVNIMASSAVV 489


>gi|54696066|gb|AAV38405.1| cathepsin F [synthetic construct]
          Length = 485

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 100/320 (31%), Positives = 152/320 (47%), Gaps = 33/320 (10%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +K    FK +++ +NRTY    E + R   F  +     +          YG +  SD +
Sbjct: 181 VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT 240

Query: 88  PQE---ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPV 143
            +E   I   T LR            +E   K    +  G L P   DWR      +  V
Sbjct: 241 EEEFRTIYLNTLLR------------KEPGNKMKQAKSVGDLAPPEWDWRSKGA--VTKV 286

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           + QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K
Sbjct: 287 KDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIK 346

Query: 204 QYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVY 260
             G LE++ DY Y+        C +  EKAKV++ D+   S  +  +   L + GPI V 
Sbjct: 347 NLGGLETEDDYSYQGHMQ---SCNFSAEKAKVYINDSMELSQNEQKLAAWLAKRGPISVA 403

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           +N   ++ Y     R     C+P  +DHAV +VGYG ++ +  W ++NSWG    + GY+
Sbjct: 404 INAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYY 463

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 464 YLHRGSGACGVNTMASSAVV 483


>gi|55979119|gb|AAV69023.1| cysteine protease [Opisthorchis viverrini]
 gi|224923980|gb|ACN68966.1| cathepsin F-like cysteine protease [Opisthorchis viverrini]
          Length = 326

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 99/314 (31%), Positives = 151/314 (48%), Gaps = 34/314 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSPQEILQ 93
           ++ + +K+ +TY++D++ + RF  FK         Q  ++    YG +  SD + +E   
Sbjct: 32  YEEFKLKYKKTYSNDDD-ELRFRIFKDNLERAKRLQAMEQGTAEYGVTQFSDLTSEEFKT 90

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKS-LDWRQSKVKVLNPVESQGRCGSC 152
           R           R+  D   V +    ++   +  S  DWR      + PV  QG CGSC
Sbjct: 91  RY---------LRMRFDEPIVNEDPTPQEDVTMDNSNFDWRDHGA--VGPVLDQGDCGSC 139

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG-LESQA 211
           WAF+    +E Q       L  LS+ QL++CDH +  C+GG     +  +++ G LE ++
Sbjct: 140 WAFSVIGNVEGQWFRKTGDLLGLSEQQLIDCDHSDQGCDGGYPPQTYSAIEEMGGLELRS 199

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDT----WVTSGVDHMMHLLQSGPIGVYLNHRLIE 267
           DYPY  K+ I   C  ++ K   +V  +    W          L + GP+   LN  L++
Sbjct: 200 DYPYTGKDGI---CYMDQSKFVAYVNGSTRLPWCEK--TQAKSLKEIGPLSSGLNAVLLQ 254

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            Y    I R  W CNP +L+HAV  VGYG ++ +  WIV+NSWG    + GYF+I RG  
Sbjct: 255 LYK-RGIMRPRW-CNPAELNHAVLTVGYGMEHRMPYWIVKNSWGKRFGEKGYFRIYRGDG 312

Query: 328 ACGIESYAYLASVK 341
            CGI      A VK
Sbjct: 313 TCGINRAVTTAVVK 326


>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 596

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 100/292 (34%), Positives = 142/292 (48%), Gaps = 34/292 (11%)

Query: 43  FKTYIVKWNRTYTDD-NEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEIL 92
           F  ++ K+ RTY+   +E   RFE FK + +              YG +   D S +E  
Sbjct: 169 FDMFLEKYPRTYSSSSDEYNERFEIFKTNYQVVQHLNEIERGTAVYGITKFMDMSEEEYH 228

Query: 93  QRTGLRLTGKEKERLEADRERVK-KFLN--ERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           +      T          R  V  + LN  E     +P S+DWR  K   +  V++QG C
Sbjct: 229 RTLAPGFT----------RPLVPIQTLNSAELDTTNIPDSMDWR--KHGAVTEVKNQGSC 276

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLE 208
           GSCWAF+TT  +E Q  L  K L  LS+ +LV+CD  +  C GG    A++ +++  GLE
Sbjct: 277 GSCWAFSTTGNVEGQWFLKHKKLISLSEQELVDCDTLDSGCGGGLPSNAYKSIEKLGGLE 336

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG--VDHMMHLLQSGPIGVYLNHRLI 266
            + DYPY  +     +C  ++   KVFV ++       V     L Q+GPI + +N  L+
Sbjct: 337 PEKDYPYVGEGE---KCAIKQSDFKVFVNNSVALPKDEVKLAAWLAQNGPISIGINANLM 393

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHG 318
           + Y G         CNP  LDH V IVGYG +NG   WI++NSW   GPD G
Sbjct: 394 QFYWGGISHPWKIFCNPKSLDHGVLIVGYGTENGTPFWIIKNSW---GPDWG 442



 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 49/79 (62%), Gaps = 2/79 (2%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P S+DWR  K   +  V++QG CGSCWAF+TT  +E Q  L  K L  LS+ +LV+CD 
Sbjct: 475 IPDSMDWR--KHGAVTEVKNQGSCGSCWAFSTTGNVEGQWFLKHKKLISLSEQELVDCDT 532

Query: 186 GNLNCNGGNIDVAFEYVKQ 204
            +  C GG    A++ +++
Sbjct: 533 LDSGCGGGLPSNAYKSIEK 551



 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 16/38 (42%), Positives = 25/38 (65%)

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           +NG   WI++NSWG    + GY++I RG  +CG+ + A
Sbjct: 553 ENGTPFWIIKNSWGPDWGEEGYYRIYRGDGSCGLNNMA 590


>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
          Length = 360

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 97/308 (31%), Positives = 147/308 (47%), Gaps = 27/308 (8%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTG 101
           +F  +  ++ + Y    EIK RFE F  + K    +     S      E        LT 
Sbjct: 60  SFARFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTD-----LTW 114

Query: 102 KEKERLEADRERVKKFLNERKKG-------PLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
            E  R   DR    +  +   KG        LP++ DWR++ +  ++PV++QG+CGSCW 
Sbjct: 115 DEFRR---DRLGAAQNCSATTKGNLKVTNVVLPETKDWREAGI--VSPVKNQGKCGSCWT 169

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQYG-LESQA 211
           F+TT  LE+  +        LS+ QLV+C     N  CNGG    AFEY+K  G L+++ 
Sbjct: 170 FSTTGALEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEE 229

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQSGPIGVYLNHRLIES-- 268
            YPY  K  +   C +  E   V V D+  +T G +  +    +    V +   +I+   
Sbjct: 230 AYPYTGKNGL---CKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFK 286

Query: 269 -YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            Y        +    P  ++HAV  VGYG +NG+  W+++NSWG    D+GYF++E G N
Sbjct: 287 QYKSGVYTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKN 346

Query: 328 ACGIESYA 335
            CGI + A
Sbjct: 347 MCGIATCA 354


>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
 gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
          Length = 363

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 95/315 (30%), Positives = 143/315 (45%), Gaps = 41/315 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQ--------DGKETDEYYGTSGSSDRSPQEI--- 91
           F  + V++ ++Y    E++ RF  F +        + K      G +  SD S +E    
Sbjct: 62  FARFAVRYGKSYESAAEVQKRFRIFSESLQLVRSTNRKGLSYRLGINRFSDMSWEEFRAT 121

Query: 92  ----LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
                Q     L G  + R  A                LPK+ DWR+  +  ++PV++QG
Sbjct: 122 RLGAAQNCSATLAGNHRMRAAAV--------------ALPKTKDWREDGI--VSPVKNQG 165

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-Q 204
            CGSCW F+TT  LE+           LS+ QLV+C     N  CNGG    AFEY+K  
Sbjct: 166 HCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGKPFNNFGCNGGLPSQAFEYIKYN 225

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMH--LLQSGPIGVYL 261
            GL+++  YPY+    I   C ++ E   V V D+  +T G +  +   +    P+ V  
Sbjct: 226 GGLDTEESYPYKGVNGI---CDFKAENVGVKVLDSVNITLGAEDELKDAVALVRPVSVAF 282

Query: 262 NH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
                   Y       +     P  ++HAV  VGYG +NG+  W+++NSWG    D GYF
Sbjct: 283 QVVNGFRQYKSGVYTSDSCGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDKGYF 342

Query: 321 QIERGANACGIESYA 335
           ++E G N CG+ + A
Sbjct: 343 KMEMGKNMCGVATCA 357


>gi|73983670|ref|XP_540846.2| PREDICTED: cathepsin W [Canis lupus familiaris]
          Length = 374

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 88/335 (26%), Positives = 152/335 (45%), Gaps = 42/335 (12%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +KQV  F  + +++NR+Y++  E   R + F  +  +  +          +G +  SD +
Sbjct: 38  LKQV--FALFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEDEDLGTAEFGVTPFSDLT 95

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKG-PLPKSLDWRQSKVKVLNPVESQ 146
            +E  Q  G        +R+  +   V + +   + G P+P + DWR+    +++P++ Q
Sbjct: 96  EEEFGQFYG-------HQRMAGEAPSVGRKVESEEWGEPVPPTCDWRKLP-GIISPIKQQ 147

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG-NIDVAFEYVKQY 205
           G C  CWA A    +E+   +       +S  +L++C      C GG   D     +   
Sbjct: 148 GNCRCCWAMAAAGNIEALWGIRYHQPVEVSVQELLDCGRCGDGCKGGFTWDAFITVLNNS 207

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNH 263
           GL S  DYP+        RC  +K K   ++QD  +  G +  +  +L   GPI V +N 
Sbjct: 208 GLASAKDYPFLGNTK-PHRCLAKKYKKVAWIQDFIMLQGNEQAIAWYLATKGPITVTINM 266

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT------------------WI 305
           +L++ Y    I+     C+P ++DH+V +VG+G+   +                    WI
Sbjct: 267 KLLQHYQKGVIQATHTTCDPQRVDHSVLLVGFGKSKSVAGKQAEGGSSRPRPHHPIPYWI 326

Query: 306 VRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
           ++NSWG    + GYF++ RG N CGI  Y   A V
Sbjct: 327 LKNSWGAEWGEEGYFRLHRGNNTCGITKYPVTARV 361


>gi|359492179|ref|XP_002280808.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
 gi|302142580|emb|CBI19783.3| unnamed protein product [Vitis vinifera]
          Length = 365

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 96/319 (30%), Positives = 146/319 (45%), Gaps = 45/319 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEILQR 94
           F  +  ++ +TY    E   RF  FK + +        +    +G +  SD +P E  Q 
Sbjct: 52  FAAFKARFRKTYATAEEHDYRFSIFKANLRRAKRNQLLDPSAVHGVTRFSDLTPAEFRQN 111

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G +  R   D ++            LP   DWR      +  V+ QG CGSCW+
Sbjct: 112 ----YLGLKPLRFPIDTQQAPIL----PTNDLPTDFDWRDHGA--VTAVKDQGECGSCWS 161

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-KQ 204
           F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+ K 
Sbjct: 162 FSTTGALEGAHFLATGNLVSLSEQQLVDCDHECDPEEYGACDRGCNGGLMNTAFEYILKA 221

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            G+    DYPY   +     C ++K K    V + + T  +D      +L+++GP+ V +
Sbjct: 222 GGVVRGEDYPYTGTDG---HCKFDKTKIAASVSN-FSTVSIDEDQIAANLVKNGPLAVGI 277

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   ++SY G       + C+   L+H V +VGYG              W+++NSWG   
Sbjct: 278 NAIFMQSYAGG--VSCPFICST-SLNHGVLLVGYGSAGYSPIRFKEKPYWLLKNSWGQNW 334

Query: 315 PDHGYFQIERGANACGIES 333
            +HGY++I RG N CG++S
Sbjct: 335 GEHGYYKICRGHNICGVDS 353


>gi|356541074|ref|XP_003539008.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
          Length = 363

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 95/318 (29%), Positives = 152/318 (47%), Gaps = 42/318 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ + Y    E   RFE FK + +    +        +G +  SD +  E   +
Sbjct: 48  FLDFKRRFGKAYASQEEHNYRFEVFKANMRRARRHQSLDPSAAHGVTRFSDLTASEFRNK 107

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
               + G    RL ++  +      +     LP   DWR      + PV++QG CGSCW+
Sbjct: 108 ----VLGLRGVRLPSNANKAPILPTDN----LPSDFDWRDHGA--VTPVKNQGSCGSCWS 157

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-KQ 204
           F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+ K 
Sbjct: 158 FSTTGALEGAHFLSTGELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYILKS 217

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            G+  + DYPY   +     C ++K K    V +  V S  +  +  +L+++GP+ V +N
Sbjct: 218 GGVMREEDYPYSGTDR--GNCKFDKAKIAASVANFSVISLDEDQIAANLVKNGPLAVAIN 275

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y G       + C+  +LDH V +VGYG              WI++NSWG+   
Sbjct: 276 AAYMQTYIGG--VSCPYICS-RRLDHGVLLVGYGSGAYAPIRMKEKPFWIIKNSWGENWG 332

Query: 316 DHGYFQIERGANACGIES 333
           ++GY++I RG N CG++S
Sbjct: 333 ENGYYKICRGRNICGVDS 350


>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
          Length = 358

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 100/316 (31%), Positives = 151/316 (47%), Gaps = 35/316 (11%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTSGSSDRSPQ 89
           + V +F  +  ++ + Y +  EIK RF  FK++         K      G +  +D + Q
Sbjct: 54  RHVLSFARFTHRYGKKYQNAEEIKLRFSIFKENLDLIRSTNKKRLSYKLGVNQFADLTWQ 113

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E  QR  L         L+          ++  +  LP++ DWR+  +  ++PV+ QG C
Sbjct: 114 E-FQRNKLGAAQNCSATLKGS--------HKLTEAALPETKDWREDGI--VSPVKDQGGC 162

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQYG- 206
           GSCW F+TT  LE+           LS+ QLV+C     N  CNGG    AFEY+K  G 
Sbjct: 163 GSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGG 222

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIG--V 259
           L+++  YPY  K+     C Y  E   V V D+  +T G +    H + L++   I   V
Sbjct: 223 LDTEEAYPYTGKDGT---CKYSAENVGVQVLDSVNITLGAEDELKHAVGLVRPVSIAFEV 279

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGY 319
             + RL   Y       +     P  ++HAV  VGYG ++G+  W+++NSWG    D GY
Sbjct: 280 VKSFRL---YKSGVYTDSHCGNTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWGDKGY 336

Query: 320 FQIERGANACGIESYA 335
           F++E G N CGI + A
Sbjct: 337 FKMEMGKNMCGIATCA 352


>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
          Length = 471

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 100/311 (32%), Positives = 153/311 (49%), Gaps = 34/311 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSS---------DRSPQEI-L 92
           ++ ++VK  + Y    E + RFE FK + +  DE     G +         D + +E   
Sbjct: 52  YEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRA 111

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
              G ++  KEK R E  +  + K  N+     LP  +DWR+     +  V+ QG+CGSC
Sbjct: 112 MYLGAKMEKKEKLRTERSQRYLHKAGNDDD---LPSHVDWREKGA--VTEVKDQGQCGSC 166

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQ 210
           WAF+T   +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++ K  G++S+
Sbjct: 167 WAFSTVGSVEGINQIVTGDLISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKNGGIDSE 226

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYLNH--RL 265
           ADYPYR  +N+   C   ++ A V   D +     +  + +   + + P+ V +    R 
Sbjct: 227 ADYPYRASDNM---CDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGRE 283

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
            + Y           C  + LDH V  VGYG +NGI  WIVRNSWG    + GY ++ER 
Sbjct: 284 FQLYQSGVFTGR---CGTN-LDHGVVAVGYGTENGIDYWIVRNSWGPKWGESGYIRMERN 339

Query: 326 ANA-----CGI 331
             +     CGI
Sbjct: 340 VASTDTGKCGI 350


>gi|3377952|emb|CAA08906.1| cysteine proteinase [Cicer arietinum]
          Length = 362

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 157/319 (49%), Gaps = 44/319 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F T+  K++++Y    E   RF  FK + K+   +        +G +  SD +  E  +R
Sbjct: 47  FTTFKSKFSKSYATKEEHDYRFGVFKSNLKKAKLHQKLDPSAEHGVTKFSDLTASE-FRR 105

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
             L L  K++ RL A  ++            LP+  DWR+     + PV+ QG CGSCWA
Sbjct: 106 QFLGL--KKRLRLPAHAQKAPILPTNN----LPEDFDWREKGA--VTPVKDQGSCGSCWA 157

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYVKQY 205
           F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+ Q 
Sbjct: 158 FSTTGALEGANYLATGKLVSLSEQQLVDCDHVCDPDEYNSCDSGCNGGLMNNAFEYLLQS 217

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            G+  + DY Y  ++     C ++K K    V +  V S  +  +  +L+++GP+ V +N
Sbjct: 218 GGVVREQDYSYTGRDG---SCKFDKSKIAASVSNFSVVSVDEDQIAANLVKNGPLAVAIN 274

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT--------WIVRNSWGDIG 314
              +++Y         + C   +LDH V +VG+G  NG           WI++NSWG   
Sbjct: 275 AAWMQTYMSGV--SCPYICAKSRLDHGVLLVGFG--NGFAPIRLKEKPYWIIKNSWGQNW 330

Query: 315 PDHGYFQIERGANACGIES 333
            + GY++I RG N CG++S
Sbjct: 331 GEEGYYKICRGRNICGVDS 349


>gi|310751866|gb|ADP09371.1| cathepsin L-like proteinase [Fasciola hepatica]
          Length = 326

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 87/226 (38%), Positives = 122/226 (53%), Gaps = 20/226 (8%)

Query: 125 PLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD 184
            +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C 
Sbjct: 107 AVPDKIDWRESGY--VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCS 164

Query: 185 H--GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-V 241
              GN  C GG ++ A+EY+KQ+GLE+++ YPYR  E    +C Y K+     V   + V
Sbjct: 165 RPWGNNGCGGGLMENAYEYLKQFGLETESSYPYRAVEG---QCRYNKQLGVAKVTGYYTV 221

Query: 242 TSGVDHMMHLL--QSGPIGVYLNHRLIES----YDGNPIRRNDWACNPHKLDHAVAIVGY 295
            SG +  +  L    GP  V ++   +ES    Y G   +     C+P  L+HAV  VGY
Sbjct: 222 HSGSEVELKNLVGAEGPAAVAVD---VESDFMMYSGGIYQSQ--TCSPLGLNHAVLAVGY 276

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           G + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 277 GTQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIASLASLLMV 322


>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
          Length = 330

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 84/222 (37%), Positives = 124/222 (55%), Gaps = 22/222 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           LPKS+DWRQ     + PV+ QG+CGSCW+F+ T  LE Q+ L K  L  LS+  L++C  
Sbjct: 114 LPKSVDWRQKGA--VTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMDCSK 171

Query: 184 DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKA----KVFVQD 238
           ++GN  C GG +D AF+YV    G+++++ YPY  ++   + C ++K+K     K +V  
Sbjct: 172 EYGNNGCEGGLMDKAFQYVSDNKGIDTESSYPYEARD---YACRFKKDKVGGTDKGYVD- 227

Query: 239 TWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVG 294
             +  G +  +   L   GPI V ++  H     Y       N+  C+ + LDH V  VG
Sbjct: 228 --IPEGDEKALQNALATVGPISVAIDASHESFHFYSEGVY--NEPYCSSYDLDHGVLAVG 283

Query: 295 YGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
           YG +NG   W+V+NSWG    + GY +I R  +N CGI S A
Sbjct: 284 YGTENGQDYWLVKNSWGPSWGESGYIKIARNHSNHCGIASMA 325


>gi|308462787|ref|XP_003093674.1| hypothetical protein CRE_29187 [Caenorhabditis remanei]
 gi|308249538|gb|EFO93490.1| hypothetical protein CRE_29187 [Caenorhabditis remanei]
          Length = 392

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 79/218 (36%), Positives = 120/218 (55%), Gaps = 8/218 (3%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P+ +DWR S  KV++ V++QG CGSCWAFAT A +ESQ A+ K TL+ LS+ +LV+CD  
Sbjct: 178 PERIDWRDSG-KVMS-VKNQGACGSCWAFATVAAVESQYAIRKGTLWSLSEQELVDCDGE 235

Query: 187 NLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGV 245
           +  C GG +D A  +V   GLE++ DYPY   ++   +C     K +V V + W +    
Sbjct: 236 SYGCGGGFLDKALGWVLGNGLETEDDYPYECTQHD--QCYINGGKTRVTVDEGWSLGRDE 293

Query: 246 DHMMHLLQS-GPIGVYLN-HRLIESYDGNPIRRNDWACNPHKLD-HAVAIVGYGEKNGIL 302
           D +   + S GP+   ++      +Y       ++  C    L  HA+ ++GYG +    
Sbjct: 294 DSIADWVASVGPVAFAMSVPNSFTAYSNGVYNPSEHECRDESLGYHAMTLIGYGTEGNQP 353

Query: 303 TWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
            WIV+NSWG    D GY ++ RG NACG+  +     +
Sbjct: 354 YWIVKNSWGSSWGDQGYMRLARGNNACGMRDFVVAPKI 391


>gi|10441624|gb|AAG17127.1|AF190653_1 cathepsin L-like cysteine proteinase CAL1 [Diabrotica virgifera
           virgifera]
          Length = 322

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 94/310 (30%), Positives = 158/310 (50%), Gaps = 30/310 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------------YGTSGSSDRSPQE 90
           ++++ V+  + Y +  E + RF  F+ + K  +E+               +  +D +P+E
Sbjct: 21  WESFKVQHGKVYKNPIEERVRFSVFQANLKTINEHNAKYEQGLVGYTMAVNQFADMTPEE 80

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
              + G++     K +    + R  K +N      +P S+DWRQ K  VL  V+ QG+CG
Sbjct: 81  FKAKLGMQAKNMPKIK----KSRHVKNVNAE----VPDSVDWRQ-KGAVLG-VKDQGQCG 130

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCN-GGNIDVAFEYVKQYGL 207
           SCWAF+ T  LE Q  ++     PLS+ +L++C  ++GN +C+ GG + +AFE+V++ G+
Sbjct: 131 SCWAFSATGSLEGQNYIVNGKSEPLSEQELLDCSVEYGNGDCDEGGLMTLAFEFVEENGI 190

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMHLLQS-GPIGVYLNHRL 265
            S+A YPY   E I   C    +KA + +Q    V    + +   + + GPI   +    
Sbjct: 191 VSEASYPY---EAIQGDCRTTNDKAVLHIQGYNEVYPSEEALRQAVGTVGPISAAIWAEP 247

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
           I+ +        +       LDH + +VGYGE+NG   WIV+NSWG    + GYF+++R 
Sbjct: 248 IQFFSSGIYDDPNCLNYVEYLDHGILVVGYGEENGTPYWIVKNSWGATWGEEGYFRLKRN 307

Query: 326 ANACGIESYA 335
              CG+   A
Sbjct: 308 IALCGLAQMA 317


>gi|23397070|gb|AAN31820.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
          Length = 358

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 97/314 (30%), Positives = 151/314 (48%), Gaps = 31/314 (9%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD-------GKETDEY-YGTSGSSDRSPQ 89
           + V +F  +  ++ + Y +  E+K RF  FK++        K+   Y  G +  +D + Q
Sbjct: 54  RHVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQ 113

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E  QRT L         L+   +  +          LP++ DWR+  +  ++PV+ QG C
Sbjct: 114 E-FQRTKLGAAQNCSATLKGSHKVTE--------AALPETKDWREDGI--VSPVKDQGGC 162

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQYG- 206
           GSCW F+TT  LE+           LS+ QLV+C     N  CNGG    AFEY+K  G 
Sbjct: 163 GSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGG 222

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGVYL 261
           L+++  YPY  K+     C +  E   V V ++  +T G +    H + L++   I   +
Sbjct: 223 LDTEKAYPYTGKDET---CKFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIAFEV 279

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
            H     Y       +     P  ++HAV  VGYG ++G+  W+++NSWG    D GYF+
Sbjct: 280 IHSF-RLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFK 338

Query: 322 IERGANACGIESYA 335
           +E G N CGI + A
Sbjct: 339 MEMGKNMCGIATCA 352


>gi|326516056|dbj|BAJ88051.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 362

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 92/309 (29%), Positives = 149/309 (48%), Gaps = 30/309 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-RLTG 101
           F  + V + ++Y    E++ RF  F +  +E         S++R  + +  R G+ R + 
Sbjct: 61  FARFAVGYGKSYESAAEVRRRFRIFSESLEEVR-------STNR--KGLPYRLGINRFSD 111

Query: 102 KEKERLEADRERVKKFLNE--------RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
              E  +A R    +  +         R    LP++ DWR+  +  ++PV++Q  CGSCW
Sbjct: 112 MSWEEFQATRLGAAQTCSATLAGNHLMRDAAALPETKDWREDGI--VSPVKNQAHCGSCW 169

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLESQ 210
            F+TT  LE+           LS+ QLV+C  G  N  CNGG    AFEY+K   G++++
Sbjct: 170 TFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIKYNGGIDTE 229

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQSGPIGVYLNHRLIES- 268
             YPY+    +   C Y+ E A V V D+  +T   +  +         V +  ++I+  
Sbjct: 230 ESYPYKGVNGV---CHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVIDGF 286

Query: 269 --YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
             Y       +     P  ++HAV  VGYG +NG+  W+++NSWG    D+GYF++E G 
Sbjct: 287 RQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGK 346

Query: 327 NACGIESYA 335
           N C I + A
Sbjct: 347 NMCAIATCA 355


>gi|8547325|gb|AAF76330.1|AF271385_1 cathepsin L [Fasciola hepatica]
          Length = 326

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 85/227 (37%), Positives = 126/227 (55%), Gaps = 18/227 (7%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
           K  +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+
Sbjct: 105 KRAVPDRIDWRESGY--VTEVKDQGGCGSCWAFSTTGAMEGQYMKNQRTSISFSEQQLVD 162

Query: 183 C--DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           C  D GN  CNGG ++ A+EY+K++GLE+++ YPYR  E    +C Y ++     V   +
Sbjct: 163 CSRDFGNYGCNGGLMENAYEYLKRFGLETESSYPYRAVEG---QCRYNEQLGVAKVTGYY 219

Query: 241 VTSGVD--HMMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVAIVG 294
                D   + +L+ + GP  V L+   +ES D    R   +    C+P +L+H V  VG
Sbjct: 220 TVHSGDEVELQNLVGAEGPAAVALD---VES-DFMMYRSGIYQSQTCSPDRLNHGVLAVG 275

Query: 295 YGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           YG ++G   WIV+NSWG    + GY ++ R   N CGI S A +  V
Sbjct: 276 YGIQDGTDYWIVKNSWGTWWGEDGYIRMVRKRGNMCGIASLASVPMV 322


>gi|6042196|ref|NP_003784.2| cathepsin F precursor [Homo sapiens]
 gi|12643325|sp|Q9UBX1.1|CATF_HUMAN RecName: Full=Cathepsin F; Short=CATSF; Flags: Precursor
 gi|4731642|gb|AAD26616.2|AF088886_1 cathepsin F precursor [Homo sapiens]
 gi|5305722|gb|AAD41790.1|AF132894_1 cathepsin F [Homo sapiens]
 gi|4826528|emb|CAB42883.1| cysteine proteinase [Homo sapiens]
 gi|15079738|gb|AAH11682.1| Cathepsin F [Homo sapiens]
 gi|22209085|gb|AAH36451.1| Cathepsin F [Homo sapiens]
 gi|61363874|gb|AAX42458.1| cathepsin F [synthetic construct]
 gi|123993139|gb|ABM84171.1| cathepsin F [synthetic construct]
 gi|189053904|dbj|BAG36411.1| unnamed protein product [Homo sapiens]
          Length = 484

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 100/320 (31%), Positives = 152/320 (47%), Gaps = 33/320 (10%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +K    FK +++ +NRTY    E + R   F  +     +          YG +  SD +
Sbjct: 181 VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT 240

Query: 88  PQE---ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPV 143
            +E   I   T LR            +E   K    +  G L P   DWR      +  V
Sbjct: 241 EEEFRTIYLNTLLR------------KEPGNKMKQAKSVGDLAPPEWDWRSKGA--VTKV 286

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           + QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K
Sbjct: 287 KDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIK 346

Query: 204 QYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVY 260
             G LE++ DY Y+        C +  EKAKV++ D+   S  +  +   L + GPI V 
Sbjct: 347 NLGGLETEDDYSYQGHMQ---SCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVA 403

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           +N   ++ Y     R     C+P  +DHAV +VGYG ++ +  W ++NSWG    + GY+
Sbjct: 404 INAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYY 463

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 464 YLHRGSGACGVNTMASSAVV 483


>gi|297793593|ref|XP_002864681.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310516|gb|EFH40940.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 99/319 (31%), Positives = 153/319 (47%), Gaps = 33/319 (10%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD-------GKETDEY-YGTSGSSDRSPQ 89
           + V  F  +  ++ + Y +  E+K RF  FK++        K+   Y  G +  +D + Q
Sbjct: 54  RHVLTFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQ 113

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E  QRT L         L+          ++  +  LP++ DWR+  +  ++PV+ QG C
Sbjct: 114 E-FQRTKLGAAQNCSATLKGS--------HKLTEAALPETKDWREDGI--VSPVKDQGGC 162

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQYG- 206
           GSCW F+TT  LE+           LS+ QLV+C   + N  CNGG    AFEY+K  G 
Sbjct: 163 GSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAYNNYGCNGGLPSQAFEYIKSNGG 222

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGVYL 261
           L+++  YPY  K+     C +  E   V V D+  +T G +    H + L++   I   +
Sbjct: 223 LDTEEAYPYIGKDGT---CKFSAENVGVQVLDSVNITLGAEDELKHAVGLVRPVSIAFEV 279

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
            H     Y       +     P  ++HAV  VGYG ++G+  W+++NSWG    D GYF+
Sbjct: 280 IHSF-RLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFK 338

Query: 322 IERGANACGIESYAYLASV 340
           +E G N CG   Y Y+  V
Sbjct: 339 MEMGKNMCG--KYCYMCIV 355


>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 96/319 (30%), Positives = 155/319 (48%), Gaps = 38/319 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  ++ K+ ++Y    E   RF  F ++     E+        +G +  SD S +E  +R
Sbjct: 89  FVMFMEKYGKSYPTRKEYLHRFGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEE-FER 147

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
             + + G        +  +  +   E  KG LP+  DWR      +  V+ QG CGSCWA
Sbjct: 148 MFMGVRGGAGGEGLPEMNQAVEVTAEEVKG-LPERFDWRDKGA--VTEVKMQGTCGSCWA 204

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG---------NLNCNGGNIDVAFEYVKQY 205
           F+T   +E    +    L  LS+ QLV+CDH          N  CNGG +  A++Y+ Q 
Sbjct: 205 FSTCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQS 264

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GLE ++ YPY  +     +C ++ +K  V V + + T  +D      HL++SGP+ V L
Sbjct: 265 GGLEEESSYPYTGRSG---QCNFQSDKIAVKVSN-FTTIPIDENQIAAHLVRSGPLAVGL 320

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILT------WIVRNSWGDIG 314
           N   +++Y G         C    ++H V +VGYG++   IL       W+++NSWG+  
Sbjct: 321 NAVFMQTYIGG--VSCPLICGKRFVNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERW 378

Query: 315 PDHGYFQIERGANACGIES 333
            +HGY+++ RG   CGI +
Sbjct: 379 GEHGYYRLCRGHGMCGINT 397


>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
          Length = 384

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 107/321 (33%), Positives = 157/321 (48%), Gaps = 47/321 (14%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQD-------------GKETDEYYGTSGSSDRSP 88
           A+K + +  +++Y D  E   RFE F+++             GK++  Y G +  +D   
Sbjct: 78  AWKEFKILHDKSYEDHEEESRRFEIFRENVLRIEKHNKLFHLGKKS-YYLGVNQFTDLEY 136

Query: 89  QEILQRTGLRLTG----KEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVE 144
            E +   GL++T     K    L A+   V            P S+DWR      +  V+
Sbjct: 137 AEFVNFNGLKMTNLNNTKCSSHLSANNIVV------------PDSVDWRSKGY--VTKVK 182

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYV 202
           +QG CGSCWAF+ T  LE Q       L PLS+SQLV+C    GN  CNGG ++ AF+YV
Sbjct: 183 NQGACGSCWAFSATGSLEGQYFRKNGKLVPLSESQLVDCSGSFGNEGCNGGFMENAFKYV 242

Query: 203 KQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQS--GPIG 258
           K   G+ES++DYPY+ ++     C ++K K    V     V SG +  +  + S  GP+ 
Sbjct: 243 KSVGGIESESDYPYKARQRT---CAFDKTKVIATVSGCVDVESGSESSLKEVVSEVGPVS 299

Query: 259 VYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK-NGILTWIVRNSWGDIGP 315
           V ++  H   + Y G     ++  C+  +L+H V  VGYG    G   WIV+NSWG    
Sbjct: 300 VAIDAGHSSFQLYAGGVY--DEPLCSTSRLNHGVLCVGYGTSLQGKDYWIVKNSWGVRWG 357

Query: 316 DHGYFQIERGA-NACGIESYA 335
             GY ++ R   N CGI S A
Sbjct: 358 VEGYIKMSRNKNNQCGIASEA 378


>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
 gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
          Length = 364

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 100/331 (30%), Positives = 158/331 (47%), Gaps = 42/331 (12%)

Query: 31  DLAYDSIKQVDA-FKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTS 81
           D A D I   +  F ++  K+++ Y    E   RF  FK +          +    +G +
Sbjct: 38  DTAEDHILNAEHHFTSFKSKFSKNYATKEEHDYRFGVFKSNLIKAKLHQKLDPSAQHGIT 97

Query: 82  GSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLN 141
             SD +  E  +R  L L   ++ RL A  ++            LP+  DWR+     + 
Sbjct: 98  KFSDLTASE-FRRQFLGLN--KRLRLPAHAQKAPILPTNN----LPEDFDWREKGA--VT 148

Query: 142 PVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNG 192
           PV+ QG CGSCWAF+TT  LE    L    L  LS+ QLV+CDH          +  CNG
Sbjct: 149 PVKDQGSCGSCWAFSTTGALEGANYLATGKLTSLSEQQLVDCDHVCDPEERGSCDSGCNG 208

Query: 193 GNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM-- 249
           G ++ AFEY+ Q  G+ S+ DY Y  ++     C ++K K    V +  V S  +  +  
Sbjct: 209 GLMNNAFEYILQSGGVVSEKDYAYTGRDG---SCKFDKSKVVASVSNFSVVSLDEDQIAA 265

Query: 250 HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT------ 303
           +L+++GP+ V +N   +++Y         + C   +LDH V ++G+G+            
Sbjct: 266 NLVKNGPLAVAINAAWMQTYMSGV--SCPYICAKARLDHGVLLLGFGQGGYAPIRLKEKP 323

Query: 304 -WIVRNSWGDIGPDHGYFQIERGANACGIES 333
            WI++NSWG    + GY++I RG N CG++S
Sbjct: 324 YWIIKNSWGQNWGEEGYYKICRGRNVCGVDS 354


>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 96/319 (30%), Positives = 155/319 (48%), Gaps = 38/319 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  ++ K+ ++Y    E   RF  F ++     E+        +G +  SD S +E  +R
Sbjct: 89  FVMFMEKYGKSYPTRKEYLHRFGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEE-FER 147

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
             + + G        +  +  +   E  KG LP+  DWR      +  V+ QG CGSCWA
Sbjct: 148 MFMGVRGGAGGEGLPEMNQAVEVTAEEVKG-LPERFDWRDKGA--VTEVKMQGTCGSCWA 204

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG---------NLNCNGGNIDVAFEYVKQY 205
           F+T   +E    +    L  LS+ QLV+CDH          N  CNGG +  A++Y+ Q 
Sbjct: 205 FSTCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQS 264

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GLE ++ YPY  +     +C ++ +K  V V + + T  +D      HL++SGP+ V L
Sbjct: 265 GGLEEESSYPYTGRSG---QCNFQSDKIAVKVSN-FTTIPIDENQIAAHLVRSGPLAVGL 320

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILT------WIVRNSWGDIG 314
           N   +++Y G         C    ++H V +VGYG++   IL       W+++NSWG+  
Sbjct: 321 NAVFMQTYIGG--VSCPLICGKRFVNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERW 378

Query: 315 PDHGYFQIERGANACGIES 333
            +HGY+++ RG   CGI +
Sbjct: 379 GEHGYYRLCRGHGMCGINT 397


>gi|146215998|gb|ABQ10201.1| cysteine protease Cp3 [Actinidia deliciosa]
          Length = 365

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 95/318 (29%), Positives = 156/318 (49%), Gaps = 42/318 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F+ +  ++ ++Y    +   RF  FK + +    +        +G +  SD +P E  +R
Sbjct: 50  FRLFKRRFGKSYATQEDHDYRFSVFKTNLRRARHHQRLDPSAVHGVTQFSDLTPAE-FRR 108

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
             L   G ++ R  AD  +      E     LP   DWR      +  V++QG CGSCW+
Sbjct: 109 NHL---GLKRLRFPADANKAPILPTED----LPADFDWRDHGA--VASVKNQGSCGSCWS 159

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ A EY +K 
Sbjct: 160 FSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSALEYTLKA 219

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            GL  + DYPY   +  T  C +++ K    V +  V S  ++ +  +L+++GP+ V +N
Sbjct: 220 GGLMREEDYPYSGTDRGT--CKFDETKIAASVANFSVVSLDENQIAANLVKNGPLAVAIN 277

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y G       + C+  +LDH V +VGYG              WI++NSWG+   
Sbjct: 278 AVFMQTYVGG--VSCPYICS-KRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWG 334

Query: 316 DHGYFQIERGANACGIES 333
           ++G+++I +G N CG++S
Sbjct: 335 ENGFYKICQGRNVCGVDS 352


>gi|49456321|emb|CAG46481.1| CTSF [Homo sapiens]
          Length = 338

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 100/320 (31%), Positives = 151/320 (47%), Gaps = 33/320 (10%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +K    FK +++ +NRTY    E + R   F  +     +          YG +  SD +
Sbjct: 35  VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT 94

Query: 88  PQE---ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPV 143
            +E   I   T LR            +E   K    +  G L P   DWR      +  V
Sbjct: 95  EEEFRTIYLNTLLR------------KEPGNKMKQAKSVGDLAPPEWDWRSKGA--VTKV 140

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           + QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K
Sbjct: 141 KDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIK 200

Query: 204 QYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVY 260
             G LE+  DY Y+        C +  EKAKV++ D+   S  +  +   L + GPI V 
Sbjct: 201 NLGGLETVDDYSYQGHMQ---SCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVA 257

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           +N   ++ Y     R     C+P  +DHAV +VGYG ++ +  W ++NSWG    + GY+
Sbjct: 258 INAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYY 317

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 318 YLHRGSGACGVNTMASSAVV 337


>gi|86279347|gb|ABC88769.1| putative cathepsin L-like proteinase [Tenebrio molitor]
          Length = 328

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 78/220 (35%), Positives = 125/220 (56%), Gaps = 15/220 (6%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
           K PL  S+DWR + V   + V+ QG+CGSCW+F+TT  +E Q+AL +  L  LS+  L++
Sbjct: 112 KKPLAASVDWRSNAV---SEVKDQGQCGSCWSFSTTGAVEGQLALQRGRLTSLSEQNLID 168

Query: 183 C--DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           C   +GN  C+GG +D AF Y+  YG+ S++ YPY  + +    C ++  ++   +   +
Sbjct: 169 CSSSYGNAGCDGGWMDSAFSYIHDYGIMSESAYPYEAQGDY---CRFDSSQSVTTLSGYY 225

Query: 241 -VTSGVDHMM--HLLQSGPIGVYLNHR-LIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
            + SG ++ +   + Q+GP+ V ++    ++ Y G      D  CN   L+H V +VGYG
Sbjct: 226 DLPSGDENSLADAVGQAGPVAVAIDATDELQFYSGGLFY--DQTCNQSDLNHGVLVVGYG 283

Query: 297 EKNGILTWIVRNSWGDIGPDHGYF-QIERGANACGIESYA 335
             NG   WI++NSWG    + GY+ Q+    N CGI + A
Sbjct: 284 SDNGQDYWILKNSWGSGWGESGYWRQVRNYGNNCGIATAA 323


>gi|37963625|gb|AAP94048.2| cathepsin-L-like midgut cysteine proteinase [Tenebrio molitor]
          Length = 330

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 78/220 (35%), Positives = 125/220 (56%), Gaps = 15/220 (6%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
           K PL  S+DWR + V   + V+ QG+CGSCW+F+TT  +E Q+AL +  L  LS+  L++
Sbjct: 114 KKPLAASVDWRSNAV---SEVKDQGQCGSCWSFSTTGAVEGQLALQRGRLTSLSEQNLID 170

Query: 183 C--DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           C   +GN  C+GG +D AF Y+  YG+ S++ YPY  + +    C ++  ++   +   +
Sbjct: 171 CSSSYGNAGCDGGWMDSAFSYIHDYGIMSESAYPYEAQGDY---CRFDSSQSVTTLSGYY 227

Query: 241 -VTSGVDHMM--HLLQSGPIGVYLNHR-LIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
            + SG ++ +   + Q+GP+ V ++    ++ Y G      D  CN   L+H V +VGYG
Sbjct: 228 DLPSGDENSLADAVGQAGPVAVAIDATDELQFYSGGLFY--DQTCNQSDLNHGVLVVGYG 285

Query: 297 EKNGILTWIVRNSWGDIGPDHGYF-QIERGANACGIESYA 335
             NG   WI++NSWG    + GY+ Q+    N CGI + A
Sbjct: 286 SDNGQDYWILKNSWGSGWGESGYWRQVRNYGNNCGIATAA 325


>gi|386364440|emb|CCH03781.1| Clan CA, family C1, cathepsin L-like cysteine peptidase, partial
           [Trichomonas gallinae]
          Length = 261

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 78/205 (38%), Positives = 114/205 (55%), Gaps = 12/205 (5%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           PKS+DWR+  V  +NPV+ QG+CGSCWAF+    +ES  A+   TLY LS+  +V+C + 
Sbjct: 62  PKSIDWREKGV--VNPVKDQGQCGSCWAFSAIQAMESVWAINHNTLYSLSEQNMVDCCYL 119

Query: 187 NLNCNGGNIDVAFEYVK--QYG-LESQADYPYRNKENITFRCTYEKEKAKVFV----QDT 239
            + C GG +D+A++Y K  Q G   ++ADYPY     I   C ++K KA   +     D 
Sbjct: 120 CMGCAGGIMDLAYKYAKNEQGGKFMTEADYPY---HAIREECKFDKSKAVDAIVTGFMDI 176

Query: 240 WVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN 299
            VTS  D    + Q GP  + ++      +  +    +D  C P  +DH V  VGYG +N
Sbjct: 177 AVTSERDLAAKVAQYGPAAIGIDASQTSFHLYSSGIYDDPHCTPMNIDHGVGCVGYGSEN 236

Query: 300 GILTWIVRNSWGDIGPDHGYFQIER 324
           G+  WIVRNSWG    + GY ++ +
Sbjct: 237 GVNYWIVRNSWGPTWGEKGYIRMVK 261


>gi|354494740|ref|XP_003509493.1| PREDICTED: cathepsin W-like [Cricetulus griseus]
 gi|344243260|gb|EGV99363.1| Cathepsin W [Cricetulus griseus]
          Length = 376

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 87/349 (24%), Positives = 161/349 (46%), Gaps = 44/349 (12%)

Query: 27  YVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------- 77
           ++ +D     ++ ++ FK + +K+NR+Y +  E   R   F  +  +             
Sbjct: 24  FLTKDTGPRPLELIEVFKLFQIKYNRSYANPAEYARRLNIFAHNLAQAQRLQEEDLGTAE 83

Query: 78  YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKV 137
           +G +  SD + +E  Q  G +   K    +      VKK  +E+   P+P + DWR++  
Sbjct: 84  FGETPFSDLTEEEFGQLYGQQKAPKRIPNM------VKKAGSEKWGQPVPSTCDWRKA-T 136

Query: 138 KVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-D 196
            +++ +++Q  C  CWA A    +E+   +  +    +S  +L++C+     C+GG + D
Sbjct: 137 NIISSIKNQKTCRCCWAIAAADNIEALWRIKTQHFVEVSVQELLDCERCGNGCDGGFVWD 196

Query: 197 VAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM---HLLQ 253
                +   GL S+ DYP++   N    C   + K   ++QD +   G D  +   +L  
Sbjct: 197 AYMTVLNNSGLASEKDYPFKGYPN-PHGCLANRYKKVAWIQD-FTMLGRDEQVIAGYLAT 254

Query: 254 SGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE---------------- 297
            GPI V +N +L++ Y    I+     C+P ++DH+V +VG+G+                
Sbjct: 255 HGPITVTINMKLLQGYQKGVIKATPTTCDPQQVDHSVLLVGFGKGKEKEDIQSGTILSQT 314

Query: 298 ------KNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
                 +  +  WI++NSWG    + GYF++ RG N+CGI  Y   A +
Sbjct: 315 RKPRKPRRSVPYWILKNSWGAEWGEKGYFRLYRGNNSCGITKYPITACL 363


>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
          Length = 324

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 105/306 (34%), Positives = 152/306 (49%), Gaps = 27/306 (8%)

Query: 46  YIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKE 105
           +  K N+TY+ D +I  R  Y  Q   +  E +    +   S   + +     +T +E  
Sbjct: 25  FKAKHNKTYSGDEDIIRR--YIWQTNLQKIEAHNELYAKGLSTYFLGENKYADMTNEEFR 82

Query: 106 R----LEADRERVK-KFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
           R    L  D+E     F++   K  LP ++DWR  K   +  V+ QG+CGSCWAF+TT  
Sbjct: 83  RTLSGLRVDKELTPGDFVSGMFKDSLPTAVDWR--KEGYVTEVKDQGQCGSCWAFSTTGS 140

Query: 161 LESQVALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVK-QYGLESQADYPYRN 217
           LE Q     K L  LS+S LV+C    GN  CNGG +D AF+Y+    G++++  YPY+ 
Sbjct: 141 LEGQHFKATKQLVSLSESNLVDCSKKWGNQGCNGGLMDNAFKYIADNKGIDTEKSYPYKP 200

Query: 218 KENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMH--LLQSGPIGVYLN--HRLIESYD 270
           ++    +C +  +KA V   D     +TSG +  +   +   GPI V ++  H   + Y 
Sbjct: 201 EDR---KCNF--KKANVGATDKLYKDITSGSEDALQEAVATIGPISVAIDASHDSFQLYS 255

Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANAC 329
           G     N+ AC+   LDH V  VGY  KNG   WIV+NSWG      GY  + R   N C
Sbjct: 256 GGVY--NEKACSTKTLDHGVLAVGYDSKNGDDYWIVKNSWGKSWGIDGYIWMSRNKKNQC 313

Query: 330 GIESYA 335
           GI + A
Sbjct: 314 GIATMA 319


>gi|91085671|ref|XP_971698.1| PREDICTED: similar to cathepsin L-like protein; cysteine proteinase
           [Tribolium castaneum]
 gi|270011034|gb|EFA07482.1| cathepsin L precursor [Tribolium castaneum]
          Length = 337

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 92/261 (35%), Positives = 133/261 (50%), Gaps = 18/261 (6%)

Query: 84  SDRSPQEILQRT-GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +D +P+E+   T GL       + L   + R    LN   +   P S DWR   +  +  
Sbjct: 80  TDMTPEEMRPYTHGLIEPAVVPKPLVEIKSRADLGLNHSVQ--YPASFDWRDKGM--VTG 135

Query: 143 VESQGRCGSCWAFATTAILESQVALLK--KTLYPLSKSQLVECDHGNLNCNGGNIDVAFE 200
           V++QG CGSCWAF++T  +ESQV + K   T   +S+ QLV+CD     C GG +  AF 
Sbjct: 136 VKNQGGCGSCWAFSSTGAIESQVKIAKGANTDISVSEQQLVDCDTAADGCGGGWMTDAFT 195

Query: 201 YVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM---HLLQSGP 256
           Y+ Q G ++S++ YPY+  +     C +  +K    ++     +G D  M    +   GP
Sbjct: 196 YIAQTGGIDSESSYPYKGVDE---SCHFMSDKVAAKLKGYAYLTGPDENMLADMVSSKGP 252

Query: 257 IGVYLNHRL-IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
           + V  +      SY G      +  C  +K  HAV IVGYG +NG   W+V+NSWGD   
Sbjct: 253 VSVAFDAEGDFGSYSGGVYYNPN--CATNKFTHAVLIVGYGNENGQDYWLVKNSWGDGWG 310

Query: 316 DHGYFQIERG-ANACGIESYA 335
           +HGYF+I R   N CGI S A
Sbjct: 311 EHGYFKIARNKGNHCGIASKA 331


>gi|341903430|gb|EGT59365.1| hypothetical protein CAEBREN_22193 [Caenorhabditis brenneri]
          Length = 410

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 154/313 (49%), Gaps = 31/313 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYF----------KQDGKETDEYYGTSGSSDRSPQE-- 90
           +  Y  K+N++Y   +E   R   +           +  +     YG +  SD +  E  
Sbjct: 98  YIAYTEKYNKSYATSHESLKRLNAYYTTEENIANWNKQSEHGSAVYGHNDLSDWTDAEFI 157

Query: 91  --ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
             +L +T  +   ++ E +    E +     ER  GPLP   DWR   V  + PV++QG+
Sbjct: 158 KTLLPKTFYQRLHEDAEFITPIPESLAAMKGERN-GPLPDFFDWRDRNV--VTPVKAQGQ 214

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLE 208
           CGSCWAFA+TA +E+  A+       LS+  L++CD  +  C+GG+ D AF Y+ + GL 
Sbjct: 215 CGSCWAFASTATVEAAYAIAHGERRNLSEQTLLDCDLVDNACDGGDEDKAFRYIHRNGLA 274

Query: 209 SQADYPY----RNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH-LLQSGPIGVYLN- 262
              D PY    +N   +T      + KA  F+         D +++ L+  GP+ + ++ 
Sbjct: 275 YAVDLPYVAHRQNGCAVTDNWNTTRIKAAYFLHHD-----EDSIINWLVNFGPVNIGMSV 329

Query: 263 HRLIESYDGNPIRRNDWACNPHKLD-HAVAIVGYG-EKNGILTWIVRNSWGDI-GPDHGY 319
            + + +Y G     +++AC    +  HA+ I GYG  + G   WIV+NSWG+  G +HGY
Sbjct: 330 IQPMRAYKGGVFTPSEYACKNEVIGLHALLITGYGTSEKGEKYWIVKNSWGNTWGVEHGY 389

Query: 320 FQIERGANACGIE 332
               RG NACGIE
Sbjct: 390 IYFARGINACGIE 402


>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
          Length = 473

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 92/323 (28%), Positives = 152/323 (47%), Gaps = 34/323 (10%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSD 85
           +S++ +  FK +++ +NRTY+   E + R   F+Q+ K              YG +  SD
Sbjct: 167 ESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSD 226

Query: 86  RSPQE--ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
            +  E  ++    +      K+ ++          +         + DWR      ++PV
Sbjct: 227 LTEDEFRMMYLNPMLSQWSLKKEMKPAIPASAPAPD---------TWDWRDHGA--VSPV 275

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKT--LYPLSKSQLVECDHGNLNCNGGNIDVAFEY 201
           ++QG CGSCWAF+ T  +E Q    KKT  L  LS+ +LV+CD  +  C GG    A+E 
Sbjct: 276 KNQGMCGSCWAFSVTGNIEGQ--WFKKTGQLLSLSEQELVDCDKLDQACGGGLPSNAYEA 333

Query: 202 VKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPI 257
           ++  G LE++ DY Y   +     C +   K   ++  + V    D       L ++GP+
Sbjct: 334 IENLGGLETETDYSYTGHKQ---SCDFSTGKVAAYINSS-VELPKDEKEIAAFLAENGPV 389

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
              LN   ++ Y           CNP  +DHAV +VG+G++NG+  W ++NSWG+   + 
Sbjct: 390 SAALNAFAMQFYRKGVSHPLKIFCNPWMIDHAVLLVGFGQRNGVPFWAIKNSWGEDYGEQ 449

Query: 318 GYFQIERGANACGIESYAYLASV 340
           GY+ + RG+  CGI      A V
Sbjct: 450 GYYYLYRGSGLCGIHKMCSSAIV 472


>gi|290980288|ref|XP_002672864.1| predicted protein [Naegleria gruberi]
 gi|284086444|gb|EFC40120.1| predicted protein [Naegleria gruberi]
          Length = 356

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 98/303 (32%), Positives = 147/303 (48%), Gaps = 42/303 (13%)

Query: 63  RFEYFKQDGKETD--------EYYGTSGSSDRSPQEILQRTGLR--LTGKEKERLEADRE 112
           R++ FKQ+ +           +  G +  SD +P E      ++     + +E L   R+
Sbjct: 57  RYQIFKQNVERARFENYLTERDNMGVTRFSDLTPDEFKSMFLMKSYTPKQARELLSGMRQ 116

Query: 113 -RVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKT 171
                 L  ++    PK  DWR+     + PV+ QG CGSCW F+TT  +E   A     
Sbjct: 117 YPANAKLTMKQVSDAPKEFDWREHNA--VTPVKDQGNCGSCWTFSTTGNVEGMYAAKTGK 174

Query: 172 LYPLSKSQLVECDHG----------NLNCNGGNIDVAFEY-VKQYGLESQADYPYRNKEN 220
           L  LS+ QLV+CDH           N  CNGG +  +FE+ +K  GL ++  YPY   +N
Sbjct: 175 LISLSEQQLVDCDHNCVVWEGEKTCNAGCNGGLMWSSFEHIIKTGGLVTEESYPYEAVDN 234

Query: 221 ITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH-LLQSGPIGVYLNHRLIESYDG---NPIR 275
              RC +    A V + + T+V+S  D M   L  +GPI + +N   ++ Y     NP R
Sbjct: 235 ---RCRFNVSNAVVKISNWTFVSSNEDEMAAWLANNGPIAIAINADYLQYYRKGILNPSR 291

Query: 276 RNDWACNPHKLDHAVAIVGYGEK---NGILT--WIVRNSWGDIGPDHGYFQIERGANACG 330
                C+P +L+H V IVGYGE+   NG +   WIV+NSW     + GY ++ RG   CG
Sbjct: 292 -----CDPEELNHGVLIVGYGEEKAANGKVEKYWIVKNSWSASWGEKGYVRVLRGKGVCG 346

Query: 331 IES 333
           + +
Sbjct: 347 LNA 349


>gi|18424347|ref|NP_568921.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|71152227|sp|Q8H166.2|ALEU_ARATH RecName: Full=Thiol protease aleurain; Short=AtALEU; AltName:
           Full=Senescence-associated gene product 2; Flags:
           Precursor
 gi|7230640|gb|AAF43041.1|AF233883_1 AALP protein [Arabidopsis thaliana]
 gi|13430722|gb|AAK25983.1|AF360273_1 putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|9757740|dbj|BAB08221.1| AALP protein [Arabidopsis thaliana]
 gi|21617934|gb|AAM66984.1| cysteine proteinase AALP [Arabidopsis thaliana]
 gi|23397068|gb|AAN31819.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|23397074|gb|AAN31822.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|24417304|gb|AAN60262.1| unknown [Arabidopsis thaliana]
 gi|222423506|dbj|BAH19723.1| AT5G60360 [Arabidopsis thaliana]
 gi|222424411|dbj|BAH20161.1| AT5G60360 [Arabidopsis thaliana]
 gi|332009930|gb|AED97313.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 358

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 97/314 (30%), Positives = 151/314 (48%), Gaps = 31/314 (9%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD-------GKETDEY-YGTSGSSDRSPQ 89
           + V +F  +  ++ + Y +  E+K RF  FK++        K+   Y  G +  +D + Q
Sbjct: 54  RHVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQ 113

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E  QRT L         L+   +  +          LP++ DWR+  +  ++PV+ QG C
Sbjct: 114 E-FQRTKLGAAQNCSATLKGSHKVTE--------AALPETKDWREDGI--VSPVKDQGGC 162

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQYG- 206
           GSCW F+TT  LE+           LS+ QLV+C     N  CNGG    AFEY+K  G 
Sbjct: 163 GSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGG 222

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGVYL 261
           L+++  YPY  K+     C +  E   V V ++  +T G +    H + L++   I   +
Sbjct: 223 LDTEKAYPYTGKDET---CKFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIAFEV 279

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
            H     Y       +     P  ++HAV  VGYG ++G+  W+++NSWG    D GYF+
Sbjct: 280 IHSF-RLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFK 338

Query: 322 IERGANACGIESYA 335
           +E G N CGI + A
Sbjct: 339 MEMGKNMCGIATCA 352


>gi|66803148|ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
 gi|166201987|sp|P04988.2|CYSP1_DICDI RecName: Full=Cysteine proteinase 1; Flags: Precursor
 gi|60463731|gb|EAL61909.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
          Length = 343

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 92/317 (29%), Positives = 150/317 (47%), Gaps = 43/317 (13%)

Query: 49  KWNRTYTDDNEIKTRFEYFKQD-GK-----------ETDEYYGTSGSSDRSPQEILQRTG 96
           K+N+ Y+ + E   RFE FK + GK           + D  +G +  +D S  E      
Sbjct: 35  KFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFK---- 89

Query: 97  LRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
                  KE +  D   V  +L++     +P + DWR      + PV++QG+CGSCW+F+
Sbjct: 90  -NYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGA--VTPVKNQGQCGSCWSFS 146

Query: 157 TTAILESQVALLKKTLYPLSKSQLVECDHGNL----------NCNGGNIDVAFEY-VKQY 205
           TT  +E Q  + +  L  LS+  LV+CDH  +           CNGG    A+ Y +K  
Sbjct: 147 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKNG 206

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNH 263
           G+++++ YPY  +     +C +        + +  +    + +M  +++ +GP+ +  + 
Sbjct: 207 GIQTESSYPYTAETGT--QCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA 264

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-----WIVRNSWGDIGPDHG 318
              + Y G      D  CNP+ LDH + IVGY  KN I       WIV+NSWG    + G
Sbjct: 265 VEWQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQG 321

Query: 319 YFQIERGANACGIESYA 335
           Y  + RG N CG+ ++ 
Sbjct: 322 YIYLRRGKNTCGVSNFV 338


>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
          Length = 358

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 98/319 (30%), Positives = 153/319 (47%), Gaps = 43/319 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSPQEILQ 93
           F ++  K++++Y    E   RF  FK         Q    T E+ G +  SD +  E  +
Sbjct: 43  FTSFKSKFSKSYATKEEHDYRFGVFKANLIKAKLHQKLDPTAEH-GITKFSDLTASE-FR 100

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           R  L L  + +    A +  +    N      LP+  DWR+     + PV+ QG CGSCW
Sbjct: 101 RQFLGLNKRLRLPAHAQKAPILPTTN------LPEDFDWREKGA--VTPVKDQGSCGSCW 152

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYVKQ 204
           AF+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY+ Q
Sbjct: 153 AFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEEAGSCDSGCNGGLMNNAFEYLLQ 212

Query: 205 Y-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYL 261
             G+  + DY Y  ++     C ++K K    V +  V S  +  +  +L+++GP+ V +
Sbjct: 213 SGGVVQEKDYAYTGRDG---SCKFDKSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVAI 269

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y         + C   +LDH V +VG+G+             WI++NSWG   
Sbjct: 270 NAAWMQAYMSGV--SCPYVCAKARLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNW 327

Query: 315 PDHGYFQIERGANACGIES 333
            + GY++I RG N CG++S
Sbjct: 328 GEQGYYKICRGRNVCGVDS 346


>gi|42744610|gb|AAH66625.1| Ctssa protein [Danio rerio]
          Length = 321

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 94/323 (29%), Positives = 152/323 (47%), Gaps = 36/323 (11%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------------YGTSGSSD 85
           +  + + T+  + N+TY +  E + R   ++Q+ ++   +             G +  SD
Sbjct: 15  RLTNQWTTWKSQHNKTYRNTREERLRRSVWEQNLQDILLHNEAAAVGLHSYTLGLNQLSD 74

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            +  E+    GL         LE D   V    +      LP+ ++W +  +  ++PV++
Sbjct: 75  MTADEVNDMNGL---------LEEDFPDVNATFSPPSLQTLPQRVNWTEHGM--VSPVQN 123

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK 203
           QG CGSCWAF+    LE+Q+      L PLS   L++C    GN  C GG +  AF YV 
Sbjct: 124 QGPCGSCWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVI 183

Query: 204 Q-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS-----GPI 257
           Q  G++S   YPY +KE +   C Y       +     +     H    LQS     GP+
Sbjct: 184 QNRGIDSSTFYPYEHKEGV---CRYSVSGRAGYCTGFRIVP--RHNEAALQSAVANIGPV 238

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            V +N +L+  +       ND  C+   ++HAV +VGYG +NG   W+V+NSWG    ++
Sbjct: 239 SVGINAKLLSFHRYRSGIYNDPKCSSALINHAVLVVGYGSENGQDYWLVKNSWGTAWGEN 298

Query: 318 GYFQIERGANACGIESYAYLASV 340
           GY ++ R  N CGI S+    ++
Sbjct: 299 GYIRMARNKNMCGISSFGIYPTI 321


>gi|356530431|ref|XP_003533785.1| PREDICTED: cysteine proteinase [Glycine max]
          Length = 354

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 96/305 (31%), Positives = 150/305 (49%), Gaps = 23/305 (7%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           F  ++ ++ ++Y  + E+K R+E F Q+ +     +  S +  R P  +        T +
Sbjct: 55  FARFVSRFGKSYQSEEEMKERYEIFSQNLR-----FIRSHNKKRLPYTLSVNHFADWTWE 109

Query: 103 E--KERLEADRERVKKFLNERKK---GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFAT 157
           E  + RL A  +     LN   K     LP + DWR  K  +++ V+ QG CGSCW F+T
Sbjct: 110 EFKRHRLGA-AQNCSATLNGNHKLTDAVLPPTKDWR--KEGIVSSVKDQGSCGSCWTFST 166

Query: 158 TAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADYP 214
           T  LE+  A        LS+ QLV+C     N  C+GG    AFEY+K   GLE++  YP
Sbjct: 167 TGALEAAYAQAFGKSISLSEQQLVDCAGPFNNFGCHGGLPSQAFEYIKYNGGLETEEAYP 226

Query: 215 YRNKENITFRCTYEKEKAKVFVQDTW-VTSGV-DHMMHLLQ-SGPIGVYLNH-RLIESYD 270
           Y  K+ +   C +  E   V V D+  +T G  D + H +    P+ V          Y+
Sbjct: 227 YTGKDGV---CKFSAENVAVQVLDSVNITLGAEDELKHAVAFVRPVSVAFQVVNGFHFYE 283

Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACG 330
                 +        ++HAV  VGYG +NG+  W+++NSWG+   ++GYF++E G N CG
Sbjct: 284 NGVFTSDTCGSTSQDVNHAVLAVGYGVENGVPYWLIKNSWGESWGENGYFKMELGKNMCG 343

Query: 331 IESYA 335
           + + A
Sbjct: 344 VATCA 348


>gi|94480716|emb|CAI91577.1| cathepsin L [Aphrocallistes vastus]
          Length = 329

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 98/305 (32%), Positives = 151/305 (49%), Gaps = 28/305 (9%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTR-----FEYFKQDGKETDEY-YGTSGSSDRSPQEILQRT 95
           A++ + +K+NR+Y  D E++ +       Y K+   E   Y    +  +D +  E  Q  
Sbjct: 29  AWEGWKLKYNRSYGLDEELRKKIWANNMLYVKEFNAEGHSYKLAANQFADLTNLEYRQ-- 86

Query: 96  GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
            + L    + RL   RE  K F  + K   LP ++DWR   V  + PV++QG+CGSCW+F
Sbjct: 87  -IYLGYDNEARLSRKREG-KVFQRKMKDEDLPTTVDWRSKGV--VTPVKNQGQCGSCWSF 142

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVKQYGLESQADY 213
           + T  LE Q A+    L   S+ +LV+C    GN  C GG +D AF+Y +    E ++DY
Sbjct: 143 SATGSLEGQYAIKSGKLVSFSEQELVDCSTSLGNHGCQGGLMDYAFKYWETNLAEKESDY 202

Query: 214 PYRNKENITFRCTYEKEKAKVFVQDTWVT----SGVDHMMHLLQS-GPIGVYLN--HRLI 266
            Y  K     +C Y  +      +D+  T       D +   + + GPI V ++  H   
Sbjct: 203 TYTAKNG---KCKYNAQLG--VTKDSSFTDIPSENCDALKEAVANKGPIAVAMDASHTSF 257

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
           + Y         + C+  KLDH V +VGYG  NG+  W+++NSWG      GYF+IE  +
Sbjct: 258 QMYHSGIY--TPFLCSKTKLDHGVLVVGYGTDNGVDYWLIKNSWGMAWGMDGYFKIEMKS 315

Query: 327 NACGI 331
           + CGI
Sbjct: 316 DKCGI 320


>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
 gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
          Length = 473

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 92/323 (28%), Positives = 152/323 (47%), Gaps = 34/323 (10%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSD 85
           +S++ +  FK +++ +NRTY+   E + R   F+Q+ K              YG +  SD
Sbjct: 167 ESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSD 226

Query: 86  RSPQE--ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
            +  E  ++    +      K+ ++          +         + DWR      ++PV
Sbjct: 227 LTEDEFRMMYLNPMLSQWSLKKEMKPAIPASAPAPD---------TWDWRDHGA--VSPV 275

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKT--LYPLSKSQLVECDHGNLNCNGGNIDVAFEY 201
           ++QG CGSCWAF+ T  +E Q    KKT  L  LS+ +LV+CD  +  C GG    A+E 
Sbjct: 276 KNQGMCGSCWAFSVTGNIEGQ--WFKKTGQLLSLSEQELVDCDKLDQACGGGLPSNAYEA 333

Query: 202 VKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPI 257
           ++  G LE++ DY Y   +     C +   K   ++  + V    D       L ++GP+
Sbjct: 334 IENLGGLETETDYSYTGHKQ---SCDFSTGKVAAYINSS-VELPKDEKEIAAFLAENGPV 389

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
              LN   ++ Y           CNP  +DHAV +VG+G++NG+  W ++NSWG+   + 
Sbjct: 390 SAALNAFAMQFYRKGVSHPLKIFCNPWMIDHAVLLVGFGQRNGVPFWAIKNSWGEDYGEQ 449

Query: 318 GYFQIERGANACGIESYAYLASV 340
           GY+ + RG+  CGI      A V
Sbjct: 450 GYYYLYRGSGLCGIHKMCSSAIV 472


>gi|5853329|gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
          Length = 501

 Score =  134 bits (336), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 88/281 (31%), Positives = 140/281 (49%), Gaps = 13/281 (4%)

Query: 60  IKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLN 119
           +K   E   +   E D   G +  +D S +E  +    ++ G     L+    +    ++
Sbjct: 78  VKFVMEKNSERKSELDHTVGLNKFADLSNEEFKEMYMSKVKGSRSNELKMGGVKRNMSVS 137

Query: 120 ERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQ 179
            R     P SLDWR   V  + P++ QG+CGSCWAF+ +  +ES  A+    L  LS+ +
Sbjct: 138 SRT-CDAPTSLDWRDKGV--VTPMKDQGQCGSCWAFSVSGSIESANAIATGDLIRLSEQE 194

Query: 180 LVECDHGNLNCNGGNIDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQD 238
           LV+CD  +  C+GGN+D A+ + +K  GL+S+ DYPY +      +C   K    V   D
Sbjct: 195 LVDCDTYDYGCDGGNMDTAYRWIIKNGGLDSEDDYPYTSSNGRDGKCDKTKSAKSVVSLD 254

Query: 239 TW--VTSGVDHMMHLLQSGP--IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVG 294
           ++  V S  D ++  + + P  IG+  +    + Y G  +     +  P+ +DHAV IVG
Sbjct: 255 SYVEVESNEDAVLCAVATTPVTIGIVGSAYDFQLYTGG-VYNGQCSSKPYDIDHAVLIVG 313

Query: 295 YGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN----ACGI 331
           YG ++G   WIV+NSWG      GY  +ER  +     CG+
Sbjct: 314 YGSQDGKDYWIVKNSWGTYWGLEGYILMERNTDIKNGVCGM 354


>gi|1617037|emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium discoideum]
          Length = 343

 Score =  134 bits (336), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 92/317 (29%), Positives = 150/317 (47%), Gaps = 43/317 (13%)

Query: 49  KWNRTYTDDNEIKTRFEYFKQD-GK-----------ETDEYYGTSGSSDRSPQEILQRTG 96
           K+N+ Y+ + E   RFE FK + GK           + D  +G +  +D S  E      
Sbjct: 35  KFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFK---- 89

Query: 97  LRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
                  KE +  D   V  +L++     +P + DWR      + PV++QG+CGSCW+F+
Sbjct: 90  -NYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGA--VTPVKNQGQCGSCWSFS 146

Query: 157 TTAILESQVALLKKTLYPLSKSQLVECDHGNL----------NCNGGNIDVAFEY-VKQY 205
           TT  +E Q  + +  L  LS+  LV+CDH  +           CNGG    A+ Y +K  
Sbjct: 147 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 206

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNH 263
           G+++++ YPY  +     +C +        + +  +    + +M  +++ +GP+ +  + 
Sbjct: 207 GIQTESSYPYTAETGT--QCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADA 264

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-----WIVRNSWGDIGPDHG 318
              + Y G      D  CNP+ LDH + IVGY  KN I       WIV+NSWG    + G
Sbjct: 265 VEWQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQG 321

Query: 319 YFQIERGANACGIESYA 335
           Y  + RG N CG+ ++ 
Sbjct: 322 YIYLRRGKNTCGVSNFV 338


>gi|345783063|ref|XP_533219.3| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Canis lupus
           familiaris]
          Length = 490

 Score =  134 bits (336), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 159/320 (49%), Gaps = 30/320 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E + R   F  +     +          YG +  SD 
Sbjct: 185 SVKMASVFKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDL 244

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E   RT + L    +E     + R+ K +++      P   DWR      +  V+ Q
Sbjct: 245 TEEEF--RT-IYLNPLLREN-RGKKMRLAKSISDHAP---PPEWDWRSKGA--VTKVKDQ 295

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +   G
Sbjct: 296 GMCGSCWAFSVTGNVEGQWFLKEGTLLSLSEQELLDCDKVDKACLGGLPSNAYSAIMTLG 355

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y+        C++  +KA+V++ D+   S  +  +   L + GPI V +N 
Sbjct: 356 GLETEDDYSYQGHLQA---CSFSAKKARVYINDSMELSQNEQKLAAWLAKKGPISVAINA 412

Query: 264 RLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
             ++ Y     +P+R     C+P  +DHAV +VGYG ++GI  W ++NSWG    + GY+
Sbjct: 413 FGMQFYRHGISHPLRP---LCSPWLIDHAVLLVGYGNRSGIPFWAIKNSWGTDWGEEGYY 469

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 470 YLHRGSGACGVNTMASSAVV 489


>gi|33945878|emb|CAE45589.1| papain-like cysteine proteinase-like protein 2 [Lotus japonicus]
          Length = 361

 Score =  134 bits (336), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 96/320 (30%), Positives = 153/320 (47%), Gaps = 45/320 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ + Y  + E   RF  FK +      +        +G +  SD +P E    
Sbjct: 45  FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDPSAVHGVTRFSDLTPMEFRHS 104

Query: 95  T-GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GLR  G     L +D +       +     LPK  DWR+     + PV++QG CGSCW
Sbjct: 105 VLGLRGVG-----LPSDADSAPILPTDN----LPKDFDWREHGA--VTPVKNQGSCGSCW 153

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLN----------CNGGNIDVAFEYV- 202
           +F+ T  LE    L    L  LS+ QLV+CDH   +          C GG ++ AFEY+ 
Sbjct: 154 SFSATGALEGAHFLSTGKLVSLSEQQLVDCDHEQCDPEEAGSCDSGCKGGLMNSAFEYIL 213

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVY 260
              G+  + DYPY      T  C +++ K    V +  V S  +  +  +L+++GP+ V 
Sbjct: 214 NNGGVMREEDYPYSGTAGGT--CKFDQTKIAASVANFSVVSRDEDQIAANLVKNGPLAVA 271

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDI 313
           +N   +++Y G       + C+  KL+H V +VGYG ++           WI++NSWG+ 
Sbjct: 272 INAVYMQTYVGG--VSCPYVCS-KKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGEN 328

Query: 314 GPDHGYFQIERGANACGIES 333
             ++GY++I RG N CG++S
Sbjct: 329 WGENGYYKICRGRNVCGVDS 348


>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
          Length = 368

 Score =  134 bits (336), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 96/321 (29%), Positives = 153/321 (47%), Gaps = 43/321 (13%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEIL 92
           D F  +  K+ + Y  + E   RF  FK + +    +        +G +  SD +  E  
Sbjct: 49  DHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFR 108

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           ++    L  +   +L  D  +      E     LP+  DWR      + PV++QG CGSC
Sbjct: 109 KK---HLGVRSGFKLPKDANKAPILPTEN----LPEDFDWRDHGA--VTPVKNQGSCGSC 159

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-V 202
           W+F+ T  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFE+ +
Sbjct: 160 WSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEHTL 219

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGV 259
           K  GL  + DYPY  K+  T  C  +K K    V +  V S +D      +L+++GP+ V
Sbjct: 220 KTGGLMKEEDYPYTGKDGKT--CKLDKSKIVASVSNFSVIS-IDEEQIAANLVKNGPLAV 276

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGD 312
            +N   +++Y G       + C   +L+H V +VGYG              WI++NSWG+
Sbjct: 277 AINAGYMQTYIGG--VSCPYICT-RRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGE 333

Query: 313 IGPDHGYFQIERGANACGIES 333
              ++G+++I +G N CG++S
Sbjct: 334 TWGENGFYKICKGRNICGVDS 354


>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 381

 Score =  134 bits (336), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 98/328 (29%), Positives = 156/328 (47%), Gaps = 55/328 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F +++ ++ ++Y D +E + R   F+ + +    +        +G +  SD +P E  +R
Sbjct: 58  FASFVRRFGKSYRDADEHEHRLSVFRANLRRARRHQRLDPSAVHGITKFSDLTPDEFRER 117

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGP------LPKSLDWRQSKVKVLNPVESQGR 148
                 G  K R    R  +K         P      LP   DWR+     + PV+ QG 
Sbjct: 118 ----FLGLRKSR----RSFLKGISGSAHDAPALPTDGLPTEFDWREHGA--VGPVKDQGS 167

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAF 199
           CGSCW+F+T+  LE    L    L  LS+ QLV+CDH          +  CNGG +  AF
Sbjct: 168 CGSCWSFSTSGALEGANYLATGKLEVLSEQQLVDCDHECDPSEPRACDAGCNGGLMTTAF 227

Query: 200 EYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSG 255
            Y+ K  GLE++ DYPY  + +    C ++K K    V++ + T  +D      +L++ G
Sbjct: 228 SYLAKAGGLETEKDYPYTGRNSA---CKFDKSKIAAQVKN-FSTVAIDEDQIAANLVKHG 283

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRN 308
           P+ + +N   +++Y G       + C  H LDH V +VGYG              WI++N
Sbjct: 284 PLAIGINAVFMQTYIGG--VSCPYICGRH-LDH-VFLVGYGSAGYAPLRFKEKPYWIIKN 339

Query: 309 SWGDIGPDHGYFQIERGA---NACGIES 333
           SWG+   + GY++I RG    N CG++S
Sbjct: 340 SWGENWGESGYYKICRGPHVKNKCGVDS 367


>gi|224066056|ref|XP_002302004.1| predicted protein [Populus trichocarpa]
 gi|222843730|gb|EEE81277.1| predicted protein [Populus trichocarpa]
          Length = 367

 Score =  134 bits (336), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 152/319 (47%), Gaps = 43/319 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F ++  K+ +TY    E   RF  FK + +   ++        +G +  SD +P+E  +R
Sbjct: 51  FTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKKHQMIDPTAAHGITKFSDLTPKE-FRR 109

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
             L L  K   RL  D  +            LP   DWR      +  V+ QG CGSCW+
Sbjct: 110 QFLGL--KRWLRLPTDANKAPILPTTD----LPTDYDWRDHGA--VTEVKDQGSCGSCWS 161

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+ T  LE    L    L  LS+ QLV+CDH          +  C+GG ++ AFEY +K 
Sbjct: 162 FSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMNNAFEYALKA 221

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GLE + DYPY   +  T  C ++K K    V +  V S +D      +L++ GP+ V +
Sbjct: 222 GGLEREEDYPYTGTDGGT--CKFDKSKVVASVSNFSVVS-IDEDQIAANLVKHGPLSVAI 278

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G       + C+  + DH V +VGYG              WI++NSWG   
Sbjct: 279 NAAFMQTYVGG--VSCPYICSKRQ-DHGVLLVGYGSAGYAPIRFKEKPFWIIKNSWGQNW 335

Query: 315 PDHGYFQIERGANACGIES 333
            ++GY++I RG N CG++S
Sbjct: 336 GENGYYKICRGRNICGVDS 354


>gi|225706914|gb|ACO09303.1| Cathepsin H precursor [Osmerus mordax]
          Length = 328

 Score =  134 bits (336), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 101/306 (33%), Positives = 144/306 (47%), Gaps = 29/306 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYF--------KQDGKETDEYYGTSGSSDRSPQEILQR 94
           FK+++++ N+ Y D  E   R + F        + +G       G +  SD +  E   R
Sbjct: 30  FKSWMMQHNKQY-DIEEYYHRLQIFIENKMKIERHNGGNHKYRMGLNTFSDMTFDEF--R 86

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           +   LT       E       K  +   KG  P S+DWR+    V N V++QG CGSCW 
Sbjct: 87  SSFLLT-------EPQNCSATKGTHVSSKGLYPDSVDWRKKGNYVTN-VKNQGPCGSCWT 138

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLESQA 211
           F+TT  LES  A+    L  LS+ QLV+C     N  CNGG    AFEY+K   GL ++ 
Sbjct: 139 FSTTGCLESVTAISTGKLLQLSEQQLVDCAQAFNNHGCNGGLPSQAFEYIKYNKGLMTED 198

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGV-YLNHRLIE 267
           DYPY  ++     C ++ E+A  FV+D    +  D M  +    +  P+ + Y       
Sbjct: 199 DYPYTAQDGT---CKFKPERAAAFVKDVVNITMYDEMGMVDAVARLNPVSMAYEVTSDFM 255

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            Y       ++       ++HAV  VGY E+N    WIV+NSWG      GYF IERG N
Sbjct: 256 HYHSGVYSSSECHNTTDTVNHAVLAVGYDEENVTPYWIVKNSWGPFWGMKGYFFIERGKN 315

Query: 328 ACGIES 333
            CG+ +
Sbjct: 316 MCGLSA 321


>gi|13491752|gb|AAK27969.1|AF242373_1 cysteine protease [Ipomoea batatas]
          Length = 366

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 100/324 (30%), Positives = 153/324 (47%), Gaps = 53/324 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ + Y  D E   R   FK + +   ++        +G +  SD +P E  +R
Sbjct: 49  FTVFKRRFGKVYASDEEHDYRLSEFKANMRRAKQHQELDPAAVHGVTQFSDLTPTE-FRR 107

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGP------LPKSLDWRQSKVKVLNPVESQGR 148
             L L             R  KF  + K  P      LP   DWR      + PV++QG 
Sbjct: 108 KFLGL------------NRRLKFPADAKTAPILPTDELPSDFDWRDHGA--VTPVKNQGT 153

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAF 199
           CGSC +F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AF
Sbjct: 154 CGSCCSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAF 213

Query: 200 EY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGP 256
           EY +K  GL  + D+PY    N    C ++K K    V +  V S  +  +  +L+++GP
Sbjct: 214 EYTLKAGGLMREEDHPYTG--NDLQVCRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGP 271

Query: 257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNS 309
           + V +N   +++Y G       + C+  +LDH V +VGYG              WI++NS
Sbjct: 272 LAVAINAVFMQTYIGG--VSCPYICS-KRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNS 328

Query: 310 WGDIGPDHGYFQIERGANACGIES 333
           WG+   ++GY++I RG N CG++S
Sbjct: 329 WGESWGENGYYKICRGRNVCGVDS 352


>gi|355681647|gb|AER96812.1| cathepsin F [Mustela putorius furo]
          Length = 408

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 100/321 (31%), Positives = 158/321 (49%), Gaps = 33/321 (10%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E + R   F  +     +          YG +  SD 
Sbjct: 105 SVKMASIFKEFVTTYNRTYESKEETQWRMSVFSNNMMRAQKIQALDRGTAQYGVTKFSDL 164

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKG-PLPKSLDWRQSKVKVLNPVES 145
           + +E   RT + L    +E       R K    ++  G   P   DWR+     +  V++
Sbjct: 165 TEEEF--RT-IYLNPLLREY------RGKNMRLDKSTGDSAPSEWDWRRKGA--VTKVKN 213

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
           QG CGSCWAF+ T  +E Q  L +  L  LS+ +L++CD  +  C GG    A+  +K  
Sbjct: 214 QGMCGSCWAFSVTGNVEGQWFLKQGALLSLSEQELLDCDKVDKACLGGLPSNAYSAIKTL 273

Query: 206 G-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN 262
           G LE++ DY YR +      C +  +KA+V++ D+   S  +  +   L + GPI V +N
Sbjct: 274 GGLETEDDYSYRGRMQT---CGFSPKKARVYINDSVELSQNEETLAAWLAEKGPISVAIN 330

Query: 263 HRLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGY 319
              ++ Y     +P+R     C+P  +DHAV +VGYG ++G   W ++NSWG    + GY
Sbjct: 331 AFGMQFYRHGISHPLRP---LCSPWLIDHAVLLVGYGNRSGTPFWAIKNSWGSDWGEEGY 387

Query: 320 FQIERGANACGIESYAYLASV 340
           + + RG+ ACG+ + A  A V
Sbjct: 388 YYLHRGSGACGVNTMASSAVV 408


>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
 gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
          Length = 953

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 96/311 (30%), Positives = 149/311 (47%), Gaps = 36/311 (11%)

Query: 52  RTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQRTGLRLTGK 102
           R Y    E + RF  F+ +  + ++          YG +  +D +  E    TGL +   
Sbjct: 652 RQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADMTVAEYRAHTGLVVP-- 709

Query: 103 EKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILE 162
           + +R      RV    +    G LP+S DWR      +  V++QG CGSCWAF+    +E
Sbjct: 710 KHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGA--VTEVKNQGSCGSCWAFSAVGNVE 767

Query: 163 SQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENI 221
               +  K L   S+ +L++CD  +  C GG +D AF+ ++Q  GLE + DYPY  K   
Sbjct: 768 GLHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQK 827

Query: 222 TFRCTYEKEKAKVFVQ--------DTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNP 273
           +  C + +  + V V+        +T++        +L+++GPI + LN   ++ Y G  
Sbjct: 828 S--CHFNRSLSHVQVKGAVDMPKNETYIAK------YLIKNGPIAIGLNANAMQFYRGGI 879

Query: 274 IRRNDWACNPHKLDHAVAIVGYGEK-----NGILT-WIVRNSWGDIGPDHGYFQIERGAN 327
                  CN   +DH V IVGYG K     N  L  WI++NSWG    + GY++I RG N
Sbjct: 880 SHPWHPLCNHKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGPRWGEQGYYRIYRGDN 939

Query: 328 ACGIESYAYLA 338
           +CG+   A  A
Sbjct: 940 SCGVSEMASSA 950


>gi|15617524|ref|NP_258322.1| cathepsin-like cysteine proteinase [Spodoptera litura NPV]
 gi|37077642|sp|Q91BH1.1|CATV_NPVST RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|15553260|gb|AAL01738.1|AF325155_50 cathepsin-like cysteine proteinase [Spodoptera litura NPV]
          Length = 337

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 95/305 (31%), Positives = 151/305 (49%), Gaps = 27/305 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEIL-Q 93
           ++ +I + N+ YT  ++    F  FK++  + +          YG +  SD      + +
Sbjct: 33  YENFIKQHNKEYTTPDQRDAAFVNFKRNLADMNAMNNVSNQAVYGINKFSDIDKITFVNE 92

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGRCG 150
             GL            D  R+ +++     GP    P+S DWR  K+  +  V+ QG CG
Sbjct: 93  HAGLVSNLINSTDSNFDPYRLCEYVT--VAGPSARTPESFDWR--KLNKVTKVKEQGVCG 148

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLES 209
           SCWAFA    +ESQ A++  +L  LS+ QL++CD  +  C+GG + +AF E ++  G+E 
Sbjct: 149 SCWAFAAIGNIESQYAIMHDSLIDLSEQQLLDCDRVDQGCDGGLMHLAFQEIIRIGGVEH 208

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH--MMHLL-QSGPIGVYLNHRLI 266
           + DYPY   + I + C     K  V +   +     D   ++ LL ++GPI V ++   I
Sbjct: 209 EIDYPY---QGIEYACRLAPSKLAVRLSHCYQYDLRDERKLLELLYKNGPIAVAIDCVDI 265

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
             Y           CN + L+HAV +VGYG +N    WI +NSWG    ++GYF+  R  
Sbjct: 266 IDYRSGIAT----VCNDNGLNHAVLLVGYGIENDTPYWIFKNSWGSNWGENGYFRARRNI 321

Query: 327 NACGI 331
           NACG+
Sbjct: 322 NACGM 326


>gi|351710879|gb|EHB13798.1| Cathepsin F [Heterocephalus glaber]
          Length = 482

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 154/320 (48%), Gaps = 31/320 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDR 86
           S+K +  FK ++  +NRTY    E + R   F          Q        YG +  SD 
Sbjct: 178 SMKMISIFKNFVATYNRTYESKKEAQWRLSVFTRNMVLAQRIQALDHGTAQYGVTKFSDL 237

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E        L  +E  +        K  L +  + P P   DWR  K   +  V++Q
Sbjct: 238 TEEEFRTIYLNPLLREEPGK--------KMHLAKAVRDPAPLEWDWR--KKGAVTEVKNQ 287

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY- 205
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K   
Sbjct: 288 GMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKMDKACMGGFPSNAYLAIKSLG 347

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
           GLE++ DY Y+        C +  +KAKV++ D+   S  +  +   L   GPI V +N 
Sbjct: 348 GLETEDDYSYQGHMKA---CNFSAKKAKVYINDSVELSKNEQKLAAWLAVKGPISVAINA 404

Query: 264 RLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
             ++ Y     +P+R     C+P  +DHA+ +VGYG ++ +  W ++NSWG    + GY+
Sbjct: 405 FGMQFYRHGIAHPLRP---LCSPWFIDHAMLVVGYGNRSNVPFWAIKNSWGTDWGEEGYY 461

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+   A  A V
Sbjct: 462 YLHRGSGACGVNIMASSAVV 481


>gi|115495381|ref|NP_001068884.1| cathepsin F precursor [Bos taurus]
 gi|111304901|gb|AAI20004.1| Cathepsin F [Bos taurus]
 gi|296471599|tpg|DAA13714.1| TPA: cathepsin F [Bos taurus]
          Length = 460

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 100/324 (30%), Positives = 155/324 (47%), Gaps = 39/324 (12%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E   R   F  +     +          YG +  SD 
Sbjct: 156 SVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDL 215

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL----PKSLDWRQSKVKVLNP 142
           + +E        L      R            N R   P+    P   DWR +K  V N 
Sbjct: 216 TEEEFRTIYLNPLLKDAPGR------------NMRPAQPVTDVPPPQWDWR-NKGAVTN- 261

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV 202
           V+ QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +
Sbjct: 262 VKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNAYSAI 321

Query: 203 KQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGV 259
           +  G LE++ DY YR +      C++  EKAKV++ D+   S  +  +   L ++GP+ +
Sbjct: 322 RTLGGLETEDDYSYRGRLQT---CSFSAEKAKVYINDSVELSKNEQKLAAWLAKNGPVSI 378

Query: 260 YLNHRLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPD 316
            +N   ++ Y     +P+R     C+P  +DHAV +VGYG ++ I  W ++NSWG    +
Sbjct: 379 AINAFGMQFYRHGISHPLRP---LCSPWLIDHAVLLVGYGNRSAIPFWAIKNSWGTDWGE 435

Query: 317 HGYFQIERGANACGIESYAYLASV 340
            GY+ + RG+ ACG+   A  A +
Sbjct: 436 EGYYYLHRGSGACGVNIMASSAVI 459


>gi|225458119|ref|XP_002279862.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
 gi|302142581|emb|CBI19784.3| unnamed protein product [Vitis vinifera]
          Length = 368

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 97/319 (30%), Positives = 153/319 (47%), Gaps = 44/319 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F+ +  ++ +TY    E   RF  FK + +    +        +G +  SD +P E  +R
Sbjct: 52  FEKFKARFQKTYATPEEHDYRFNVFKANLRRAKRHQLLDPSAVHGVTQFSDLTPAE-FRR 110

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
             L   G    R  AD ++      +     LP   DWR++    + PV++QG CGSCW+
Sbjct: 111 DYL---GLNPLRFPADAQQAPILPTDN----LPTDFDWRENGA--VTPVKNQGNCGSCWS 161

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-KQ 204
           F+T   LE    L    L  LS+ QLV+CD           +  CNGG ++ AFEY+ K 
Sbjct: 162 FSTIGALEGAHFLATGNLESLSEQQLVDCDRECDPEEYDACDDGCNGGLMNNAFEYILKT 221

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            G+E + DYPY  ++     C + + K    V +  V S +D      +L+++GP+ V +
Sbjct: 222 GGVEREKDYPYTGRDRSP--CKFNESKIVASVSNFSVVS-IDEDQIAANLVKNGPLAVGI 278

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y         + C+  +LDH V +VGYG              WI++NSW    
Sbjct: 279 NAVFMQTYTAG--VSCPFLCS-GELDHGVLLVGYGSAGYSPIRFKEKPYWILKNSWSKYW 335

Query: 315 PDHGYFQIERGANACGIES 333
            +HGY++I RG N CG++S
Sbjct: 336 GEHGYYRICRGQNMCGVDS 354


>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
          Length = 363

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 90/309 (29%), Positives = 145/309 (46%), Gaps = 30/309 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-RLTG 101
           F  + V++ ++Y    E++ RF  F +  +E             + + +  R G+ R + 
Sbjct: 62  FARFAVRYGKSYESAAEVQRRFRIFSESLEEV---------RSTNQKGLSYRLGINRYSD 112

Query: 102 KEKERLEADRERVKKFLNERKKG--------PLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
              E  +A R    +  +   +G         LP++ DWR+  +  ++PV+ Q  CGSCW
Sbjct: 113 MSWEEFQASRLGAAQTCSATLRGNHRMQDANALPETKDWREDGI--VSPVKDQSHCGSCW 170

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQ 210
            F+TT  LE+           LS+ QLV+C   + N  CNGG    AFEY+K   GL+++
Sbjct: 171 TFSTTGALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGLDTE 230

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQ-SGPIGVYLNH-RLI 266
             YPY+    +   C Y+ E A V V D+     +  D + + +    P+ V        
Sbjct: 231 ESYPYKGVNGV---CHYKPENAAVQVLDSVNITLNAEDELQNAVGLVRPVSVAFEVINGF 287

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
             Y       +     P  ++HAV  VGYG +NG   W+++NSWG+   D GYF++ERG 
Sbjct: 288 RQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGTPYWLIKNSWGESWGDKGYFKMERGK 347

Query: 327 NACGIESYA 335
           N C + + A
Sbjct: 348 NMCAVATCA 356


>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
          Length = 360

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 95/303 (31%), Positives = 145/303 (47%), Gaps = 17/303 (5%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTG 101
           +F  +  ++ + Y    EIK RFE F  + K    +     S      E    T L    
Sbjct: 60  SFVRFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEF---TDLTWDE 116

Query: 102 KEKERLEADR--ERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
             ++RL A +      K   +     LP++ DWR+  +  ++PV++QG+CGSCW F+TT 
Sbjct: 117 FRRDRLGAAQNCSATTKGNVKLTNAVLPETKDWREDGI--VSPVKNQGKCGSCWTFSTTG 174

Query: 160 ILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQYG-LESQADYPYR 216
            LE+  +        LS+ QLV+C     N  CNGG    AFEY+K  G L+++  YPY 
Sbjct: 175 ALEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYT 234

Query: 217 NKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQSGPIGVYLNHRLIES---YDGN 272
            K  +   C +  E   V V D+  +T G +  +    +    V +   +I+    Y   
Sbjct: 235 GKNGL---CKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSG 291

Query: 273 PIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIE 332
                +    P  ++HAV  VGYG +NG+  W+++NSWG    D GYF++E G N CGI 
Sbjct: 292 VYSSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDDGYFKMEMGKNMCGIA 351

Query: 333 SYA 335
           + A
Sbjct: 352 TCA 354


>gi|164605519|dbj|BAF98585.1| CM0216.510.nc [Lotus japonicus]
          Length = 360

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 96/319 (30%), Positives = 154/319 (48%), Gaps = 44/319 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ + Y  + E   RF  FK +      +        +G +  SD +P E    
Sbjct: 45  FLEFKRRFGKVYVSEEEHGYRFNVFKSNMHRARRHQLLDPSAVHGVTRFSDLTPMEFRHS 104

Query: 95  T-GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GLR  G   +   AD   + +  N      LPK  DWR+     + PV++QG CG+CW
Sbjct: 105 VLGLRGVGLPSD---ADSAPILRTDN------LPKDFDWREHGA--VTPVKNQGSCGACW 153

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-K 203
           +F+ T  LE    L    L  LS+ QLV+CDH          +  C GG ++ AFEY+  
Sbjct: 154 SFSATGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGGLMNSAFEYILN 213

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYL 261
             G+  + DYPY      T  C +++ K    V +  V S  +  +  +L+++GP+ V +
Sbjct: 214 NGGVMREEDYPYSGTAGGT--CKFDQTKIAASVANFSVVSRDEDQIAANLVKNGPLAVAI 271

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G       + C+  KL+H V +VGYG ++           WI++NSWG+  
Sbjct: 272 NAVYMQTYVGG--VSCPYVCS-KKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENW 328

Query: 315 PDHGYFQIERGANACGIES 333
            ++GY++I RG N CG++S
Sbjct: 329 GENGYYKICRGRNVCGVDS 347


>gi|440792185|gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
          Length = 331

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 98/304 (32%), Positives = 138/304 (45%), Gaps = 30/304 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           F  ++ ++ ++Y    E + RF  F Q+  ET            +G +  +D S +E   
Sbjct: 34  FNAFVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEGKTQFGITKFADMSQEEFQS 93

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           R    L             R  KF    +    P + DWR +K  V+ PV  QG+CGSCW
Sbjct: 94  RV---LMSNPPPPPTEKPYRGPKF----EGFTAPSTFDWR-NKPGVVTPVYDQGQCGSCW 145

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV-KQYGLESQAD 212
           AF+ T  +ESQ AL    L  LS  Q+V+C   +  C GG    A++YV    GL++ A+
Sbjct: 146 AFSATENIESQWALAGHKLTGLSMQQIVDCSWWDDGCGGGFPSYAYDYVIDAPGLDALAN 205

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD---HMM--HLLQSGPIGVYLNHRLIE 267
           YPY     +   C + KE   V    +W  +  D   H M  +L Q GPI V ++     
Sbjct: 206 YPYT---AVGGSCAF-KESQVVAKISSWTYTTTDSNEHQMANYLAQHGPISVCVDAESWP 261

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
           SY G   R +  AC    +DH V  VGY        WI+RNSWG      GY  +E G +
Sbjct: 262 SYTGGVYRAS--ACGT-SIDHCVLAVGYNLTANPPYWIIRNSWGTSWGLEGYMHLEFGTD 318

Query: 328 ACGI 331
           AC +
Sbjct: 319 ACAV 322


>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
          Length = 342

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 98/298 (32%), Positives = 147/298 (49%), Gaps = 22/298 (7%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K +  F++ +VK ++ Y   +E   RFE F  + K  DE        + G +  +D + +
Sbjct: 44  KVIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHE 103

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E   +      G + E  E   E +++F   R    LPKS+DWR  K   ++PV++QG+C
Sbjct: 104 EFKNK----FLGFKGELAERKDESIEQF-RYRDFVDLPKSVDWR--KKGAVSPVKNQGQC 156

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQYGLE 208
           GSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AF YV + GL 
Sbjct: 157 GSCWAFSTVAAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGGLMDYAFAYVTRNGLH 216

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYL--NHRLI 266
            + +YPY   E          EK  +        +  D  +  L + PI V +  + R  
Sbjct: 217 KEEEYPYIMSEGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDF 276

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
           + Y G      D  C   +LDH VA VGYG   G+   IVRNSWG    + GY +++R
Sbjct: 277 QFYSGGVF---DGHCGT-ELDHGVAAVGYGTSKGLDYVIVRNSWGPKWGEKGYIRMKR 330


>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
          Length = 321

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 93/294 (31%), Positives = 139/294 (47%), Gaps = 28/294 (9%)

Query: 50  WNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSPQEILQRTGLRLT 100
           + + Y ++++ K RF  FK         Q   +    YG +  SD +P+E   +      
Sbjct: 34  YGKVYANEDDQK-RFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFAAKY----- 87

Query: 101 GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
                    + ++VK+      K   P+ +DWR      +  VE+QG CGSCWAF+T   
Sbjct: 88  ----LSAPVNNDQVKRVRPTGLKAA-PERIDWRAKGA--VTAVENQGSCGSCWAFSTAGN 140

Query: 161 LESQVALLKKTLYPLSKSQLVECDHGNLNCNGG-NIDVAFEYVKQYGLESQADYPYRNKE 219
           +E Q  +    L  LSK QLV+CD     CNGG       E +   GLESQ DYPY    
Sbjct: 141 VEGQWFIKTGQLVSLSKQQLVDCDRAADGCNGGWPASSYLEIMHMGGLESQDDYPY---A 197

Query: 220 NITFRCTYEKEKAKVFVQDTWV--TSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRN 277
            +  +C  EKE+    + D+     S  D+  +L + GP+   LN   ++ Y    I  +
Sbjct: 198 GVKEQCFMEKERLLAKIDDSIALGPSEDDNAAYLAEHGPLSTLLNAITLQYYQSGIIHPS 257

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGI 331
              C+P  L+HAV  VGY ++  +  WI++NSW     + GYF++ RG   CGI
Sbjct: 258 YEECSPVDLNHAVLTVGYDKEGDMPYWIIKNSWNVEWGEKGYFRLYRGDGTCGI 311


>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 100/333 (30%), Positives = 158/333 (47%), Gaps = 38/333 (11%)

Query: 28  VWRDLAYDSIKQVD----AFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKET 74
           +W  LA  +  + D     ++ + +K+ +TY++D++ + RFE FK         Q+ ++ 
Sbjct: 13  IWSALARTTQVEPDNARALYEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQG 71

Query: 75  DEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLP-KSLDWR 133
              YG +  SD + +E   R           R+  D   V + L   +   +  +  DWR
Sbjct: 72  TAQYGVTQFSDLTSEEFKTRY---------LRMRFDGPIVSEDLTPEEDVTMDNEKFDWR 122

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
           +     + PV  QG+CGSCWAF+    +  Q       L  LS+  LV+CD+ +  C+GG
Sbjct: 123 EHGA--VGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQPLVDCDYLDGGCDGG 180

Query: 194 ---NIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM- 249
                + A +  K  GLE  +DYPY     I   C  +K K   ++  + +    + +  
Sbjct: 181 YPPQTNTAIQ--KMGGLELASDYPYTGVGGI---CYMDKSKFVAYINGSTILPLSEKVQA 235

Query: 250 -HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRN 308
             L   GP+   LN   ++ Y G  +R     C+P  ++HAV  VGYG +NG   WIV+N
Sbjct: 236 QKLRAIGPLSSALNADTLQLYKGGIMRPR--LCDPAGVNHAVLTVGYGVQNGKPYWIVKN 293

Query: 309 SWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
           SWG+   + GYF+I RG   CGI S    A +K
Sbjct: 294 SWGEDFGEEGYFRIYRGDGTCGINSIVTTARIK 326


>gi|387765908|gb|AFJ95133.1| cathepsin-L [Toxocara canis]
          Length = 360

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 99/318 (31%), Positives = 158/318 (49%), Gaps = 31/318 (9%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYF----------KQDGKETDEYYGTSGSSDR 86
           ++ +D F+ +I K+++ Y  + E   RF  +           Q  ++    YG +  +D 
Sbjct: 44  LRLLDRFEDFIRKYDKVYDSNEEFAERFRIYVNNMLEAQKLNQRNRDYGTIYGENEFADW 103

Query: 87  SPQE----ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +  E    +L +   +   K+   +++  +  +  +  R++  +P   DWR     V+ P
Sbjct: 104 NVNEFREILLPKDFFKNLRKKATFIDSFIDPPETVMARREE--IPDHFDWR--PYNVVTP 159

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV 202
           V+SQ +CGSC AFAT   +ES  AL    L  LS+ QL++C+  N  C+GG++D A  YV
Sbjct: 160 VKSQFKCGSCRAFATGGTVESAYALGTGELRSLSEHQLLDCNLENNACDGGDVDKALRYV 219

Query: 203 KQYGLESQADYPY--RNKENITFRCTYEKEKAKVFV-QDTWVTSGVDHMMHLLQSGPIGV 259
              GL  + DYPY    ++    R    + KA VF+ QD    S +D ++H    GP+ V
Sbjct: 220 YDEGLMREYDYPYVAHRQDTCQLRGETTRIKAAVFLHQDE--ASIIDWLLHY---GPVNV 274

Query: 260 YLNHRL-IESYDGNPIRRNDWACNPHKL-DHAVAIVGYGEKNGI--LTWIVRNSWG-DIG 314
            +N    +++Y G     + W C    +  H++ IVGYG  N      WIV+NSWG   G
Sbjct: 275 GINVTADMKAYKGGVYTPDRWECENKIIGTHSINIVGYGTWNATNQKYWIVKNSWGQSYG 334

Query: 315 PDHGYFQIERGANACGIE 332
            + GY    RG N+CGIE
Sbjct: 335 IEDGYVYFARGINSCGIE 352


>gi|383852175|ref|XP_003701604.1| PREDICTED: cathepsin O-like [Megachile rotundata]
          Length = 370

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 94/324 (29%), Positives = 157/324 (48%), Gaps = 36/324 (11%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDD-NEIKTRFEYFKQDGKETD-----------EYYGTSGS 83
           S + +  FK Y+ ++N+TY +D  E + RF+ F++  +  +            +YG +  
Sbjct: 45  SSEDIKLFKNYVTRYNKTYRNDPTEYEERFQRFQRSLRHIETMNSLRSSPESAFYGLTEF 104

Query: 84  SDRSPQEIL-QRTGLRLTGKEKERLEADRERVKKFLNE---RKKGPLPKSLDWRQSKVKV 139
           SD +  E   Q     L  + ++   A   R+ +  +    R+   +P   DWR   V  
Sbjct: 105 SDMTEDEFRSQALSPDLAARGEKHATAPYHRLHRLKHSNRVRRATVVPLRFDWRDKGV-- 162

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-DHGNLNCNGGNI--D 196
           + PV SQG CG+CWAF+T  + ES  A+   TLYPLS  ++++C  + N  C GG+I   
Sbjct: 163 ITPVRSQGACGACWAFSTVEVAESMFAIQNGTLYPLSVQEMIDCAKNSNFGCEGGDICSL 222

Query: 197 VAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKV-------FVQDTWVTSGVDHMM 249
           +++  + +  +  +  YP   K +    C  EK   K+       F  D++V +  + + 
Sbjct: 223 LSWLLLSKVQIFQEHAYPLTRKTDT---CKLEKTAGKISGVRIKDFTCDSFVDAEDELVS 279

Query: 250 HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPH--KLDHAVAIVGYGEKNGILTWIVR 307
            L   GP+   +N    ++Y G  I+   + C+     L+HAV IVGY +   I  +I++
Sbjct: 280 TLATHGPVAAAVNALSWQNYLGGVIQ---FHCDGSFDSLNHAVQIVGYDKSAKIPHYIIK 336

Query: 308 NSWGDIGPDHGYFQIERGANACGI 331
           NSWG    D+G+  I  G N CGI
Sbjct: 337 NSWGSNFGDNGFMYIAIGNNLCGI 360


>gi|357619726|gb|EHJ72185.1| cathepsin [Danaus plexippus]
          Length = 1118

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 95/299 (31%), Positives = 150/299 (50%), Gaps = 27/299 (9%)

Query: 43   FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKE--------TDEYYGTSGSSDRSPQEILQR 94
            F+ +I  +N+ Y D++E + RF+ F  + K+        ++  YG +  SD S  E ++ 
Sbjct: 819  FEQFIKDYNKEY-DESEKEERFKIFVNNLKDINAMNERSSNAVYGINKFSDLSKDEFVKF 877

Query: 95   TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 TG ++E   ++ +  K  L +      P   DWR  K  V++ V+ QG C SCWA
Sbjct: 878  ----YTGLKREESPSNEDHKKTDLPKSFNVTAPDQFDWR--KKGVVSSVKFQGHCVSCWA 931

Query: 155  FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI--DVAFEYVKQYGLESQAD 212
            F+    +ES  A+    L  +S+ QLV+CD  N  C+GG       F Y  + G  S   
Sbjct: 932  FSVAGNVESINAIKTGKLIDVSEQQLVDCDEWNFGCSGGIACSKSHFSYFHKKGAMSLES 991

Query: 213  YPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMM-HLLQSGPIGVYLNHRLIESY 269
            YPY  KE    +C Y   K  + ++D   ++    D +  +L   GP+ + ++   I  Y
Sbjct: 992  YPYVGKEG---QCRYNSSKVVIRLKDYQYFIALSEDEIKEYLYNIGPLSIDIDSSQIHHY 1048

Query: 270  DGNPIRRNDWACNP-HKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
             G  + +    C    K +HAV +VGYG++NG+  WIV+NSWG    + GYF+I+RG N
Sbjct: 1049 KGGIVIKE---CQEVKKTNHAVLLVGYGKENGVEYWIVKNSWGQNWGEKGYFRIQRGVN 1104



 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 80/260 (30%), Positives = 134/260 (51%), Gaps = 16/260 (6%)

Query: 72  KETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLD 131
           + ++  YG +  SD S +E ++      TG ++E   ++ +  K  L E      P   D
Sbjct: 4   RSSNAVYGINKFSDLSKEEFVKY----YTGLKREESPSNEDHKKTDLPESFNVTAPDQFD 59

Query: 132 WRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCN 191
           WR  K  V++ +++Q  CGSCWAF+  A +ES  A+    L  +S+ QL++CD  +  C+
Sbjct: 60  WR--KKGVVSSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDCDKYDSGCS 117

Query: 192 GGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---M 248
           GG    A  Y    G  S   YPY  KE    +C Y+  K ++ +++      +      
Sbjct: 118 GGLPWDALRYFVANGAMSLKSYPYVAKEG---KCRYDSSKVEIRLKEYKHKEKLSEDQIK 174

Query: 249 MHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACN-PHKLDHAVAIVGYGEKNGILTWIVR 307
            HL   GP+ + +    + SY+G  +      C+  + ++HAV +VGYG++NG+  WIV+
Sbjct: 175 EHLYNIGPLSIAITSSPLASYNGGILIEE---CHRSYLINHAVLLVGYGKENGVKYWIVK 231

Query: 308 NSWGDIGPDHGYFQIERGAN 327
           NSWG    ++GYF+++ G N
Sbjct: 232 NSWGQNWGENGYFRMKMGVN 251



 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 86/294 (29%), Positives = 143/294 (48%), Gaps = 31/294 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKE--------TDEYYGTSGSSDRSPQEILQR 94
           F+ +I  +N+ Y D++E + RF+ F  + K+        ++  YG +  SD S +E ++ 
Sbjct: 519 FEQFIKDYNKEY-DESEKEERFKIFVNNLKDINAMNERSSNAVYGINKFSDLSKEEFIKY 577

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                TG ++E   ++ +  K  L E      P   DWR  K  V++ +++Q  CGSCWA
Sbjct: 578 ----YTGLKREESPSNEDHKKTDLPESFNVTAPDQFDWR--KKGVVSSIKNQKHCGSCWA 631

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYP 214
           F+    +ES  A+    L  +S+ QLV+CD  +  C+GG    A  Y +  G  S   YP
Sbjct: 632 FSAAGNVESIHAIKTGKLVHVSEQQLVDCDSQDSGCSGGLTWNAMRYFRTNGAVSLKSYP 691

Query: 215 YRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMM--HLLQSGPIGVYLNHRLIESYDG 271
           Y  +      C Y+  K  + ++D   +T   +  +  HL   G + + +    +  Y+G
Sbjct: 692 YVAQNE---NCRYDSNKVVIRLKDYKHITQLSEDQIKEHLYNIGLLSIDITSTQLTWYEG 748

Query: 272 NPI----RRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
             +    RR+D       +DHAV +V YG++N +  WIV+NSWG  G +    Q
Sbjct: 749 GILIEECRRSDL------VDHAVLLVEYGKENSVEYWIVKNSWGQNGGEKVALQ 796



 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 89/180 (49%), Gaps = 15/180 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKE--------TDEYYGTSGSSDRSPQEILQR 94
           F+ +I  +N+ Y D++E + RF+ F  + K+        ++  YG +  SD S +E ++ 
Sbjct: 302 FEQFIKDYNKEY-DESEKEERFKIFVNNLKDINAMNERSSNAVYGINKFSDLSKEEFIKY 360

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                TG +++R           L +      P   DWR  K  V++ V++Q  CGSCWA
Sbjct: 361 ----YTGLKRDRCTTTEHHKSTDLPKSFNITAPDQFDWR--KKGVVSSVKNQRHCGSCWA 414

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYP 214
           F+  A +ES  A+    L  +S+ QL++CD  +  C+GG   +A   + Q  L S  + P
Sbjct: 415 FSAAANVESIHAIKTGKLIDVSEQQLLDCDKYDSGCSGGLEWIAMRELGQRRLYSLEEAP 474


>gi|395742406|ref|XP_003777749.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pongo abelii]
          Length = 490

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 103/317 (32%), Positives = 155/317 (48%), Gaps = 27/317 (8%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +K    FK +++ +NRTY    E + R   F  +     +          YG +  SD +
Sbjct: 187 VKMASIFKNFVITYNRTYESKEEARWRLSIFVNNMVRAQKIQALDRGTAQYGVTKFSDLT 246

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPVESQ 146
            +E   RT + L    +E      E   K    +  G L P   DWR SK  V   V+ Q
Sbjct: 247 EEEF--RT-IYLNPLLRE------EPSNKMKQAKSVGDLAPPEWDWR-SKGAVTK-VKDQ 295

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 296 GMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLG 355

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y+        C +  EKAKV++ D+   S  +  +   L + GPI V +N 
Sbjct: 356 GLETEDDYSYQGHMQ---SCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINA 412

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
             ++ Y     R     C+P  +DHAV +VGYG ++ +  W ++NSWG    + GY+ + 
Sbjct: 413 FGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLH 472

Query: 324 RGANACGIESYAYLASV 340
           RG+ ACG+ + A  A V
Sbjct: 473 RGSGACGVNTMASSAVV 489


>gi|34761156|gb|AAQ81938.1| cysteine proteinase precursor [Ipomoea batatas]
          Length = 371

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 152/320 (47%), Gaps = 37/320 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ ++Y    E   R   FK + +    +        +G +  SD +P+E  +R
Sbjct: 47  FTLFKSKYGKSYATQEEHDYRLSVFKANLRRAKRHQMLDPSAVHGVTKFSDLTPKE-FRR 105

Query: 95  T--GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           T  G+R +   K++L+                 LP   +WR      +  V+ QG CGSC
Sbjct: 106 TYLGIRKSSSSKQKLKLKLPADAHAAEILPTSDLPFDFEWRD--YGAVTGVKDQGLCGSC 163

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYVK 203
           W+F+TT  LE    L    L  L++ +LV+CDH          +  CNGG +  A+EYV 
Sbjct: 164 WSFSTTGTLEGTNFLATGELLSLNEQELVDCDHLCDPKKAGACDAGCNGGLMTTAYEYVL 223

Query: 204 QY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVY 260
           Q  GLE + DYPY  ++     C ++K K    V +  V S  +  +  +L++ GP+ V 
Sbjct: 224 QSGGLEKEKDYPYTGRDGT---CKFDKSKIAAAVANFSVVSLDEDQIAANLVKHGPLSVG 280

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDI 313
           +N   +++Y G       + C+   LDH V IVGYG              WI++NSWG+ 
Sbjct: 281 INSIFMQTYIGG--VSCPYICSKKNLDHGVLIVGYGAAGYAPIRFKDKPYWIIKNSWGEN 338

Query: 314 GPDHGYFQIERGANACGIES 333
             + GY++I RG N CG++S
Sbjct: 339 WGEEGYYKICRGNNICGVDS 358


>gi|4757570|gb|AAD29084.1|AF082181_1 cysteine proteinase precursor [Solanum melongena]
          Length = 363

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 97/322 (30%), Positives = 154/322 (47%), Gaps = 51/322 (15%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ + Y    E   R + FK + +    +        +G +  SD +P E  +R
Sbjct: 47  FSLFKSKYGKIYASQEEHDHRLKVFKANLRRARRHQLLDPTAEHGITQFSDLTPSE-FRR 105

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           T L L  K + +L A +  +           LP+  DWR+     +  V++QG CGSCW+
Sbjct: 106 TYLGLH-KPRPKLNAQKAPI------LPTSDLPEDFDWREKGA--VTGVKNQGSCGSCWS 156

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  +E    L    L  LS+ QLV+CDH          +  CNGG +  AFEY +K 
Sbjct: 157 FSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEEKSECDAGCNGGLMTTAFEYTLKA 216

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GL+ + DYPY  ++    +C ++K K    V +  V  G+D      +L++ GP+ V +
Sbjct: 217 GGLQREKDYPYTGRDG---KCHFDKSKIAASVANFSVI-GLDEDQIAANLVKHGPLAVGI 272

Query: 262 NHRLIESYDGNPIRRNDWACNP---HKLDHAVAIVGYGEKNGILT-------WIVRNSWG 311
           N   +++Y          +C      + DH V +VGYG              WI++NSWG
Sbjct: 273 NAAWMQTY------MRGVSCPLICFKRQDHGVLLVGYGSAGFAPIRLKEKPYWIIKNSWG 326

Query: 312 DIGPDHGYFQIERGANACGIES 333
           +   +HGY++I RG N CG+++
Sbjct: 327 ENWGEHGYYKICRGHNICGVDA 348


>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 101/309 (32%), Positives = 150/309 (48%), Gaps = 26/309 (8%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K +  F++++VK ++ Y   +E   RFE F  + K  DE        + G +  +D + +
Sbjct: 44  KVIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHE 103

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E   +      G + E  E   E  K+F   R    LPKS+DWR  K   + PV++QG+C
Sbjct: 104 EFKHK----FLGFKGELAERKDESSKEF-GYRDFVDLPKSVDWR--KKGAVAPVKNQGQC 156

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQYGLE 208
           GSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AF YV + GL 
Sbjct: 157 GSCWAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSGLH 216

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYL--NHRLI 266
            + +YPY   E          EK  +        +     +  L + PI V +  + R  
Sbjct: 217 KEEEYPYIMSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDF 276

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
           + Y G      D  C   +LDH VA VGYG   G+   IVRNSWG    + GY +++RG+
Sbjct: 277 QFYSGGVF---DGHCGT-ELDHGVAAVGYGTTKGLDYVIVRNSWGPKWGEKGYIRMKRGS 332

Query: 327 ----NACGI 331
                 CG+
Sbjct: 333 GKPHGMCGL 341


>gi|291385469|ref|XP_002709277.1| PREDICTED: cathepsin F [Oryctolagus cuniculus]
          Length = 460

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 103/323 (31%), Positives = 155/323 (47%), Gaps = 37/323 (11%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E + R   F  +     +          YG +  SD 
Sbjct: 156 SVKMASIFKKFVRTYNRTYESKEEAQWRLSVFASNMVRAQKIQSLDRGTAQYGITKFSDL 215

Query: 87  SPQE---ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           + +E   I     LR    +K +L    E            P P   DWR SK  V N V
Sbjct: 216 TEEEFRTIYLNPLLRSEPGKKMQLAKPVE-----------DPAPPQWDWR-SKGAVTN-V 262

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           + QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K
Sbjct: 263 KDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKLDKACLGGLPSNAYSAIK 322

Query: 204 QYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVY 260
             G LE++ DY Y+        C +  +KAKV++ D+   S  +  +   L + GPI V 
Sbjct: 323 NLGGLETEEDYTYQGHMQA---CNFSAQKAKVYINDSVELSQNEQKLAAWLAKRGPISVA 379

Query: 261 LNHRLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
           +N   ++ Y     +P+R     C+P  +DHAV +VGYG ++    W ++NSWG    + 
Sbjct: 380 INAFGMQFYRRGIAHPLRP---LCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGADWGEE 436

Query: 318 GYFQIERGANACGIESYAYLASV 340
           GY+ + RG+  CG+ + A  A V
Sbjct: 437 GYYYLYRGSGVCGVNTMASSAVV 459


>gi|357148994|ref|XP_003574963.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
          Length = 377

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 97/328 (29%), Positives = 154/328 (46%), Gaps = 54/328 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F +++ ++ +TY D  E   R   FK + +    +        +G +  SD +P E  +R
Sbjct: 53  FTSFVQRFGKTYKDAEEHAHRLSVFKANLRRARRHQLLDPSAEHGITKFSDLTPAE-FRR 111

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGP------LPKSLDWRQSKVKVLNPVESQGR 148
           T L L         + R  +++        P      LP   DWR      + PV++QG 
Sbjct: 112 TFLGLK-------TSRRSFLREIGGSAHDAPVLPTDGLPDDFDWRDHGA--VGPVKNQGS 162

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAF 199
           CGSCW+F+ +  LE    L    +  LS+ Q V+CDH          +  CNGG +  AF
Sbjct: 163 CGSCWSFSASGALEGANYLATGKMEVLSEQQFVDCDHECDPEEPDSCDAGCNGGLMTSAF 222

Query: 200 EY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSG 255
            Y +K  GLE + DYPY  ++     C ++K K    VQ+  V S VD      +L++ G
Sbjct: 223 SYLLKSGGLEREKDYPYTGRDGT---CKFDKSKIVASVQNFSVVS-VDEEQIAANLVKHG 278

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRN 308
           P+ + +N   +++Y G       + C    LDH V +VGYG      +       W+++N
Sbjct: 279 PLAIGINAAYMQTYIGG--VSCPYICG-RSLDHGVLLVGYGASGFAPSRLKNKPYWVIKN 335

Query: 309 SWGDIGPDHGYFQIERGANA---CGIES 333
           SWG+   + GY++I RG+N    CG++S
Sbjct: 336 SWGENWGEKGYYKICRGSNVRNKCGVDS 363


>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
          Length = 314

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 78/207 (37%), Positives = 116/207 (56%), Gaps = 15/207 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LP S+DWR      + P+++QG+CGSCWAF+TT  LE Q AL K  L  LS+ +LV+C  
Sbjct: 113 LPASVDWRTKGA--VTPIKNQGQCGSCWAFSTTGSLEGQHALKKGKLVSLSEQELVDCSA 170

Query: 185 -HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWV 241
             GN  C+GG +D AF Y+K+  G++++  YPY  ++     C+++K      V     V
Sbjct: 171 AEGNDGCDGGLMDDAFTYIKKNNGIDTEQSYPYTGEDGT---CSFKKSDVAATVTGFVDV 227

Query: 242 TSGVDHMMHLLQS--GPIGVYLNHRL--IESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
           TSG +  +    +  GPI V ++      + Y+      +D  C+  +LDH V +VGYG 
Sbjct: 228 TSGSESGLQDASATIGPISVAIDASSWDFQLYESGVYDVSD--CSTTELDHGVLVVGYGT 285

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIER 324
            +G   W+V+NSWG     HGY Q+ R
Sbjct: 286 DDGTAYWLVKNSWGTDWGHHGYIQMSR 312


>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
          Length = 360

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 96/307 (31%), Positives = 145/307 (47%), Gaps = 27/307 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           F  +  ++ + Y    EIK RFE F  + K    +     S      E        +T  
Sbjct: 61  FARFAHRYGKRYETVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTD-----ITWD 115

Query: 103 EKERLEADRERVKKFLNERKKG-------PLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
           E  R   DR    +  +   KG        LP++ DWR++ +  ++PV++QG+CGSCW F
Sbjct: 116 EFRR---DRLGAAQNCSATTKGNLKLTNVVLPETKDWREAGI--VSPVKNQGKCGSCWTF 170

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQYG-LESQAD 212
           +TT  LE+           LS+ QLV+C     N  CNGG    AFEY+K  G L+++  
Sbjct: 171 STTGALEAAYGQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEA 230

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQSGPIGVYLNHRLIES--- 268
           YPY  K  +   C +  E   V V D+  +T G +  +    +    V +   +I+    
Sbjct: 231 YPYTGKNGL---CKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQ 287

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y        +    P  ++HAV  VGYG +NG+  W+++NSWG    D+GYF++E G N 
Sbjct: 288 YKSGVYTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNM 347

Query: 329 CGIESYA 335
           CGI + A
Sbjct: 348 CGIATCA 354


>gi|53748485|emb|CAH59428.1| cysteine protease 2 [Plantago major]
          Length = 245

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 87/245 (35%), Positives = 125/245 (51%), Gaps = 33/245 (13%)

Query: 109 ADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALL 168
           AD  +  K         LP+  DWR+     +  V++QG CGSCW+F+TT  LE    L 
Sbjct: 1   ADENKAPKLPTSN----LPEEFDWREKGA--VTAVKNQGSCGSCWSFSTTGALEGANYLA 54

Query: 169 KKTLYPLSKSQLVECDH----------GNLNCNGGNIDVAFEY-VKQYGLESQADYPYRN 217
              L  LS+ QLV+CDH           +  CNGG ++ AFEY +K  GL+ + DYPY  
Sbjct: 55  TGELISLSEQQLVDCDHECDPEEGADSCDAGCNGGLMNNAFEYALKAGGLQKEKDYPYTG 114

Query: 218 KENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYLNHRLIESYDGNPI 274
           K+     C ++K K    V +  V S +D      +L++ GP+ V +N   +++Y G   
Sbjct: 115 KDG---TCKFDKTKIAASVHNFSVVS-IDEDQIAANLVKYGPLAVGINAAWMQTYIGG-- 168

Query: 275 RRNDWACNPHKLDHAVAIVGYGEKNGILT------WIVRNSWGDIGPDHGYFQIERGANA 328
               + C    LDH V IVGYG     +       WI++NSWG+   + GY++I RG N 
Sbjct: 169 VSCPYICG-KSLDHGVLIVGYGTGYAPVRLKNKPYWIIKNSWGESWGESGYYKICRGRNV 227

Query: 329 CGIES 333
           CG+ES
Sbjct: 228 CGVES 232


>gi|426369382|ref|XP_004051670.1| PREDICTED: cathepsin F [Gorilla gorilla gorilla]
          Length = 517

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 103/317 (32%), Positives = 155/317 (48%), Gaps = 27/317 (8%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +K    FK +++ +NRTY    E + R   F  +     +          YG +  SD +
Sbjct: 214 VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT 273

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPVESQ 146
            +E   RT + L    +E      E   K    +  G L P   DWR SK  V   V+ Q
Sbjct: 274 EEEF--RT-IYLNSLLRE------EPGNKMKQAKSVGDLAPPEWDWR-SKGAVTK-VKDQ 322

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 323 GMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLG 382

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y+        C +  EKAKV++ D+   S  +  +   L + GPI V +N 
Sbjct: 383 GLETEDDYSYQGHMQ---SCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINA 439

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
             ++ Y     R     C+P  +DHAV +VGYG ++ +  W ++NSWG    + GY+ + 
Sbjct: 440 FGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLH 499

Query: 324 RGANACGIESYAYLASV 340
           RG+ ACG+ + A  A V
Sbjct: 500 RGSGACGVNTMASSAVV 516


>gi|3916214|gb|AAC78839.1| cathepsin F [Homo sapiens]
          Length = 302

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 99/314 (31%), Positives = 150/314 (47%), Gaps = 33/314 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQE--- 90
           FK +++ +NRTY    E + R   F  +     +          YG +  SD + +E   
Sbjct: 5   FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 64

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPVESQGRC 149
           I   T LR            +E   K    +  G L P   DWR      +  V+ QG C
Sbjct: 65  IYLNTLLR------------KEPGNKMKQAKSVGDLAPPEWDWRSKGA--VTKVKDQGMC 110

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG-LE 208
           GSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G LE
Sbjct: 111 GSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLE 170

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLI 266
           ++ DY Y+        C +  EKAKV++ D+   S  +  +   L + GPI V +N   +
Sbjct: 171 TEDDYSYQGHMQ---SCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGM 227

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
           + Y     R     C+P  +DHAV +VGYG ++ +  W ++NSWG    + GY+ + RG+
Sbjct: 228 QFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLHRGS 287

Query: 327 NACGIESYAYLASV 340
            ACG+ + A  A V
Sbjct: 288 GACGVNTMASSAVV 301


>gi|149725427|ref|XP_001494683.1| PREDICTED: cathepsin W-like [Equus caballus]
          Length = 373

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 150/331 (45%), Gaps = 41/331 (12%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
           + F  + +++NR+Y+   E   R + F ++  +             +G S  SD + +E 
Sbjct: 40  EVFTLFQIQYNRSYSSPAEYAHRLDIFARNLAQAQRLQEDDLGTAEFGVSPFSDLTEEEF 99

Query: 92  LQRTGLRLTGKEKERLEADRERV-KKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
            Q  G R       R  A    V +K  +E+ +  +P++ DW Q    V++ V++Q  C 
Sbjct: 100 GQLYGHR-------RAAAGAPHVGRKVESEKWEKTVPQTCDW-QKAAGVISSVKNQEMCN 151

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAFEYVKQYGLES 209
            CWA A    +E+  A+       +S  QL++CD     C GG + D     +   GL S
Sbjct: 152 CCWAMAAAGNIEALWAITYHQSVEVSIQQLLDCDRCGNGCKGGFVWDAFLTVLNNSGLAS 211

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIE 267
           + DYP+R       RC  +K K   ++QD       +  +  +L   GPI V +N +L++
Sbjct: 212 EKDYPFRGDAK-PHRCQAKKPKV-AWIQDFIRLPEDEQKIAEYLATHGPITVTINMKLLQ 269

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGI------------------LTWIVRNS 309
            Y    I+     C+P  LDH+V +VG+G    +                    WI++NS
Sbjct: 270 QYQKGVIKATPTTCDPQHLDHSVLLVGFGGGKSVEGRRPGAVSSQSRPRRSSSYWILKNS 329

Query: 310 WGDIGPDHGYFQIERGANACGIESYAYLASV 340
           WG    + GYF++ RG+N CGI  YA  A V
Sbjct: 330 WGAKWGEEGYFRLHRGSNTCGITKYALTALV 360


>gi|358339356|dbj|GAA47436.1| cathepsin L [Clonorchis sinensis]
          Length = 236

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 85/216 (39%), Positives = 112/216 (51%), Gaps = 12/216 (5%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP S DWRQ  V  +  V+ QG CGSCWAFA T  +E Q     K L  LS+ QL++CD 
Sbjct: 21  LPGSFDWRQHGV--VTEVKDQGMCGSCWAFAVTGNIEGQWYKKTKKLVSLSEQQLLDCDK 78

Query: 186 GNLNCNGGNIDVAFE-YVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG 244
            +  CNGG  + A+E  VK  GL S+ DYPY   +     C  +      ++ D+ VT  
Sbjct: 79  KDEACNGGFPEWAYESIVKMGGLMSEKDYPYEAHKET---CNLKPNNISAYINDS-VTLS 134

Query: 245 VDH---MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGI 301
            D       L ++GPI V +N   ++ Y G         C+   LDHAV +VGYG  +  
Sbjct: 135 KDEKELAAWLTENGPISVGMNANFLQFYFGGVSHPPHMLCSEQGLDHAVLLVGYGVTSFW 194

Query: 302 LT--WIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
               WIV+NSWG    + GYF+I RG   CGI + A
Sbjct: 195 QRPYWIVKNSWGRSWGEKGYFRIYRGDGTCGINADA 230


>gi|380026170|ref|XP_003696831.1| PREDICTED: cathepsin O-like [Apis florea]
          Length = 368

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 98/324 (30%), Positives = 159/324 (49%), Gaps = 35/324 (10%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDD-NEIKTRFEYFKQD-----------GKETDEYYGTSG 82
           D+ + +  F+ Y+V++N++Y +D +E + RF+ F++              +   YYG + 
Sbjct: 43  DNNEDIKLFQNYVVRYNKSYKNDPSEYEERFKRFQRSLQHIERMNGLRSSQESAYYGLTE 102

Query: 83  SSDRSPQEILQRTGLR-LTGKEKERLEADRERVKKFLNERKKGPL--PKSLDWRQSKVKV 139
            SD S  E L  T L  L  + ++   A   R  +   +R K  +  P   DWR   V  
Sbjct: 103 FSDMSEDEFLLHTLLPDLPIRGEKHKNAPYHRKHQVSTDRMKRSISIPSRFDWRDKGV-- 160

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-DHGNLNCNGGNID-- 196
           + PV SQG CG+CWAF+T  ++ES  A+   TL+ LS  ++++C  + N  C GG+I   
Sbjct: 161 ITPVRSQGSCGACWAFSTIEVIESMFAIKNGTLHSLSVQEMIDCAKNSNFGCEGGDICSL 220

Query: 197 VAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVF---VQDTWVTSGVDH----MM 249
           +++  V +  +  ++ YP      +T  C   K   K F   +QD    S VD     ++
Sbjct: 221 LSWLLVSKVQILQESIYPLV---GMTGTCKLGKMTDKAFGIKIQDFTCDSFVDAEDELLI 277

Query: 250 HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPH--KLDHAVAIVGYGEKNGILTWIVR 307
            L   GP+   +N    ++Y G  I+   + C+     L+HAV I+GY +   +  +I++
Sbjct: 278 ALATHGPVAAAVNALSWQNYLGGVIQ---YHCDGSFDNLNHAVQIIGYDKSVAVPHYIIK 334

Query: 308 NSWGDIGPDHGYFQIERGANACGI 331
           NSWG    D GY  I  G N CGI
Sbjct: 335 NSWGSNFGDKGYMYIGIGNNLCGI 358


>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
          Length = 330

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 84/226 (37%), Positives = 126/226 (55%), Gaps = 16/226 (7%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G +PKSLDWR+     + PV++QG+CGSCWAF+    LE Q+      L  LS+  LV+C
Sbjct: 112 GDIPKSLDWREHGY--VTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDC 169

Query: 184 --DHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENIT-FRCTYEKEKAKVFVQDT 239
              +GNL CNGG ++ AF+YVK+  GL++   Y Y  ++ +  +   Y       FV+  
Sbjct: 170 SWSYGNLGCNGGLMEFAFQYVKENRGLDTGESYAYEAQDGLCRYNPKYSAANVTGFVK-- 227

Query: 240 WVTSGVDHMMHLLQS-GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
            V    D +M  + S GP+ V ++  H+    Y G      D  C+  ++DHAV +VGYG
Sbjct: 228 -VPLSEDDLMSAVASVGPVSVGIDSHHQSFRFYSGGMYYEPD--CSSTEMDHAVLVVGYG 284

Query: 297 EK-NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           E+ +G   W+V+NSWG+     GY ++ +   N CGI +YA   +V
Sbjct: 285 EESDGGKYWLVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIYPTV 330


>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
          Length = 467

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 99/308 (32%), Positives = 152/308 (49%), Gaps = 32/308 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           ++ ++VK  + Y    E + RF+ FK + +  DE+         G +G +D + +E    
Sbjct: 52  YEEWLVKQGKVYNALGEREKRFQVFKDNLRFIDEHNSENRTYKLGLNGFADLTNEEYRST 111

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G ++ RL    +R    + E     LP S+DWR  K   +  V+ QG CGSCWA
Sbjct: 112 YLGARGGMKRNRLRKTSDRYAPRVGES----LPDSVDWR--KEGAVAEVKDQGSCGSCWA 165

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQAD 212
           F+T A +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++    G++++ D
Sbjct: 166 FSTIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEED 225

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHRLIE 267
           YPY  ++    RC   ++ AKV   D +    V+    L   + + P+ V +    R  +
Sbjct: 226 YPYLARDG---RCDTYRKNAKVVTIDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQ 282

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            Y           C   +LDH VA VGYG +NG   WIVRNSWG    ++GY ++ R  N
Sbjct: 283 FYASGIFSGR---CGT-QLDHGVAAVGYGTENGKDYWIVRNSWGKSWGENGYLRMARSIN 338

Query: 328 A----CGI 331
           +    CGI
Sbjct: 339 SPTGICGI 346


>gi|6467382|gb|AAF13146.1|AF136279_1 cathepsin F precursor [Homo sapiens]
          Length = 484

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 152/320 (47%), Gaps = 33/320 (10%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +K    FK +++ +NRTY    E + R   F  +     +          YG +  SD +
Sbjct: 181 VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT 240

Query: 88  PQE---ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPV 143
            +E   I   T LR            +E   K    +  G L P   DWR      +  V
Sbjct: 241 EEEFRTIYLNTLLR------------KEPGNKMKQAKSVGDLAPPEWDWRSKGA--VTKV 286

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           + QG CGSCWAF+ T  ++ Q  L + TL  LS+ +L++CD  +  C GG    A+  +K
Sbjct: 287 KDQGMCGSCWAFSVTGNVKGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIK 346

Query: 204 QYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVY 260
             G LE++ DY Y+        C +  EKAKV++ D+   S  +  +   L + GPI V 
Sbjct: 347 NLGGLETEDDYSYQGHMQ---SCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVA 403

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           +N   ++ Y     R     C+P  +DHAV +VGYG ++ +  W ++NSWG    + GY+
Sbjct: 404 INAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYY 463

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 464 YLHRGSGACGVNTMASSAVV 483


>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
          Length = 1785

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 94/314 (29%), Positives = 151/314 (48%), Gaps = 27/314 (8%)

Query: 43   FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
            F+ + +   R Y    E + R+  F+ +  + D+          YG +  +D +  E   
Sbjct: 1478 FEKFKLHHQRQYASSFEHEMRYNIFRNNLYKIDQLNRHERGTGKYGVTKFADMTTAEYRA 1537

Query: 94   RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             TGL +  +    +   R  +     ER    LP S DWR      +  V++QG CGSCW
Sbjct: 1538 HTGLIVPKQHSNHI---RNPIATVSTERTS--LPTSFDWRDHGA--VTGVKNQGNCGSCW 1590

Query: 154  AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQAD 212
            AF+    +E    +  K L   S+ +L++CD  +  CNGG +D AF+ +++  GLE + +
Sbjct: 1591 AFSAIGNIEGLHQIKTKKLEAYSEQELIDCDTVDNGCNGGYMDDAFKAIEKLGGLELEDE 1650

Query: 213  YPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIESYD 270
            YPY+ K   T  C + K  + V V+        +  +  +L+++GPI + LN   ++ Y 
Sbjct: 1651 YPYQAKAQKT--CHFNKTLSHVRVKGAVDMPKNETFIAQYLIENGPIAIGLNANAMQFYR 1708

Query: 271  GNPIRRNDWACNPHKLDHAVAIVGYGEK-----NGILT-WIVRNSWGDIGPDHGYFQIER 324
            G         C+  ++DH V IVGYG K     N  L  W ++NSWG    + GY++I R
Sbjct: 1709 GGISHPWHLLCSHKQIDHGVLIVGYGVKEYPLFNKTLPYWTIKNSWGPKWGEQGYYRIYR 1768

Query: 325  GANACGIESYAYLA 338
            G N+CG+   A  A
Sbjct: 1769 GDNSCGVSEMASSA 1782


>gi|440907378|gb|ELR57532.1| Cathepsin W [Bos grunniens mutus]
          Length = 382

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 91/352 (25%), Positives = 158/352 (44%), Gaps = 50/352 (14%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGT 80
           +D     ++  + F+ + +++NR+Y +  E   R + F Q+  +             +G 
Sbjct: 29  QDPGPQPLELKEVFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGV 88

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           +  SD + +E +Q  G ++ G   E L   R+   +   E +    P++ DWR  KV  +
Sbjct: 89  TQFSDLTEEEFVQLYGSQVAG---EALGVSRKVGSEEWGESE----PRTCDWR--KVGPI 139

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKS--------QLVECDHGNLNCNG 192
           + V  Q  C  CWA A    +E+  A+  +    +S          +L++CD     C G
Sbjct: 140 SLVRDQRNCNCCWAMAAAGNIEALWAIKFRHFVEVSVQRMAGGRGWELLDCDRCGNGCRG 199

Query: 193 GNI-DVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM-- 249
           G + D     +   GL S+ DYP+ +    T RC  +K K   ++QD  +    +  M  
Sbjct: 200 GFVWDAFLTVLNNSGLASEKDYPF-DGSGKTHRCLAKKYKKVAWIQDFIILQACEQSMAR 258

Query: 250 HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE------------ 297
           HL   GPI V +N  L++ Y    I+     C+P ++DH+V +VG+G+            
Sbjct: 259 HLATEGPITVTINMTLLQQYQKGVIKATPTTCDPTQVDHSVLLVGFGKTKSGEGRQGKAA 318

Query: 298 --------KNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
                   +  +  W ++NSWG    + GYF++ RG+N CGI  +   A V+
Sbjct: 319 SFGSYARPRRSMAYWTLKNSWGPQWGEEGYFRLHRGSNTCGITKFPVTARVE 370


>gi|312281839|dbj|BAJ33785.1| unnamed protein product [Thellungiella halophila]
          Length = 373

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 98/328 (29%), Positives = 155/328 (47%), Gaps = 43/328 (13%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEIL 92
           D F  +  K+ + Y    E   R   FK + +    +        +G +  SD +  E  
Sbjct: 54  DHFSLFKRKFGKVYASSEEHDYRLSVFKANLRRARRHQKLDPSARHGVTQFSDLTRSE-F 112

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           ++  L + G  K   +A++  +    N      LP+  DWR      + PV++QG CGSC
Sbjct: 113 RKKHLGVRGGFKLPKDANKAPILPTEN------LPEDFDWRDRGA--VTPVKNQGSCGSC 164

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-V 202
           W+F+ T  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +
Sbjct: 165 WSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTL 224

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGV 259
           K  GL  + DYPY  K+  T  C  +K K    V +  V S +D      +L+++GP+ V
Sbjct: 225 KTGGLMREEDYPYTGKDGPT--CKLDKSKIVASVSNFSVIS-IDEDQIAANLVKNGPLAV 281

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGD 312
            +N   +++Y G       + C   +L+H V +VGYG              WI++NSWG+
Sbjct: 282 AINAAYMQTYIGG--VSCPYIC-ARRLNHGVLLVGYGSAGYAPARFKEKPYWIIKNSWGE 338

Query: 313 IGPDHGYFQIERGANACGIESYAYLASV 340
              ++G+++I +G N CG++S     S 
Sbjct: 339 SWGENGFYKICKGRNICGVDSLVSTVSA 366


>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
          Length = 474

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 103/318 (32%), Positives = 159/318 (50%), Gaps = 40/318 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           +++++VK ++ Y    E +TRF  FK +    D +          G +  +D +  E   
Sbjct: 60  YESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRS 119

Query: 94  RTGLRLTGK--EKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
              L L+GK  ++ER   D  R  +F+ E     LP+S+DWR      + PV+ QG+CGS
Sbjct: 120 ---LYLSGKMMKRERKNEDGFRSDRFVFEDGD-HLPESVDWRDRGA--VAPVKDQGQCGS 173

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYGLES 209
           CWAF+T   +E    ++   L  LS+ +LV+CD+G N  CNGG +D AFE+ VK  G+++
Sbjct: 174 CWAFSTVGAVEGINKIVTGELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKNGGIDT 233

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH-----MMHLLQSGPIGVYL--N 262
           + DYPY+  + +   C   ++ AKV   + +    V H     +   +   P+ V +   
Sbjct: 234 EDDYPYKGVDGL---CDQNRKNAKVVTINGY--EDVPHNDEKSLKKAVAHQPVSVAIEAG 288

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
            R  + Y+          C   +LDH V  VGYG +NG   WIVRNSWG    + GY ++
Sbjct: 289 GRAFQLYESGVFTGQ---CGT-ELDHGVVAVGYGSENGKDYWIVRNSWGPDWGESGYIRL 344

Query: 323 ERGANA-----CGIESYA 335
           ER   +     CGI   A
Sbjct: 345 ERNVASTSTGKCGIAMQA 362


>gi|297804580|ref|XP_002870174.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316010|gb|EFH46433.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 373

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 155/320 (48%), Gaps = 43/320 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEILQR 94
           F  +  K+ +TY    E   RF  FK + +        +    +G +  SD +P+E  ++
Sbjct: 55  FSLFKSKYEKTYATQEEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRK 114

Query: 95  -TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GL+  G    RL  D +             LP   DWR+     + PV++QG CGSCW
Sbjct: 115 FLGLKRRGF---RLPTDTQTAPIL----PTSDLPTEFDWREQGA--VTPVKNQGMCGSCW 165

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VK 203
           +F+    LE    L  K L  LS+ QLV+CDH          +  C+GG ++ AFEY +K
Sbjct: 166 SFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALK 225

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYL 261
             GL  + DYPY  ++N    C ++K K    V +  V S  +  +  +L++ GP+ + +
Sbjct: 226 AGGLMKEEDYPYTGRDNTA--CKFDKSKIAASVSNFSVVSSDEDQIAANLVKHGPLAIAI 283

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G       + C+  + DH V +VG+G              WI++NSWG + 
Sbjct: 284 NAMWMQTYIGG--VSCPYVCSKSQ-DHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMW 340

Query: 315 PDHGYFQIERGA-NACGIES 333
            +HGY++I RG  N CG+++
Sbjct: 341 GEHGYYKICRGPHNMCGMDT 360


>gi|410045434|ref|XP_003313198.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pan troglodytes]
          Length = 548

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 155/321 (48%), Gaps = 33/321 (10%)

Query: 37  IKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRS 87
           +K    FK +++ +NRTY    E + R   F  +     +          YG +  SD +
Sbjct: 245 VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT 304

Query: 88  PQE---ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPL-PKSLDWRQSKVKVLNPV 143
            +E   I     LR            +E   K    +  G L P   DWR SK  V   V
Sbjct: 305 EEEFRTIYLNPLLR------------KEPGNKMKQAKSVGDLAPPEWDWR-SKGAVTK-V 350

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           + QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K
Sbjct: 351 KDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIK 410

Query: 204 QYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVY 260
             G LE++ DY Y+        C +  EKAKV++ D+ V S  +  +   L + GPI V 
Sbjct: 411 NLGGLETEDDYSYQGHMQ---SCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPISVA 467

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           +N   ++ Y     R     C+P  +DHAV +VGYG ++ +  W ++NSWG    + GY+
Sbjct: 468 INAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYY 527

Query: 321 QIERGANACGIESYAYLASVK 341
            +  G+ ACG+ + A L+ V+
Sbjct: 528 YLHCGSEACGVNTMASLSVVE 548


>gi|410493601|ref|YP_006908539.1| V-CATH [Epinotia aporema granulovirus]
 gi|354805035|gb|AER41457.1| V-CATH [Epinotia aporema granulovirus]
          Length = 329

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 100/326 (30%), Positives = 157/326 (48%), Gaps = 38/326 (11%)

Query: 34  YDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK--------QDGKETDEYYGTSGSSD 85
           YD       F  +++K+N+ Y  D E   ++E F+        ++ K T+  Y  +  SD
Sbjct: 19  YDLNNSQALFDDFVIKYNKVYATDEERAAKYEIFRNNLVVINEKNSKTTNALYDINRLSD 78

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            +  E+L+ TG  +    K+ L   +E     + +     LP S DWR +    + PV++
Sbjct: 79  LNKNELLRSTGFSV--NLKKNLNPSKECEYVLVADAPSRSLPASFDWRANNA--VTPVKN 134

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV--- 202
           Q  CGSCWAF+T A +ES  A+       L++  L+ CD+ N NCNGG +  A E +   
Sbjct: 135 QLDCGSCWAFSTIANIESLYAIKYGVEVDLAEQYLLNCDYTNNNCNGGLMHWALENILIN 194

Query: 203 KQYGLESQADYPYR------NKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQS 254
              G+  +   PY       +KE   F  T  K    V           +H +   L+++
Sbjct: 195 DNGGVVEERHAPYVGEVTACDKEEYLFTITNCKRFNLV----------NEHTLQQLLIEN 244

Query: 255 GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK-NGILTWIVRNSWGDI 313
           GPI V ++   I  Y       +D   + + L+HAV +VGYG   NGI  W+ +NSWGD 
Sbjct: 245 GPISVAIDVFDILDYKQGI---SDNCRSDNGLNHAVLLVGYGVSINGIPYWVFKNSWGDD 301

Query: 314 GPDHGYFQIERGANACGIESYAYLAS 339
             + G+F++ R  N+CG+ + AY AS
Sbjct: 302 WGEQGFFRVRRDINSCGMMN-AYAAS 326


>gi|146147376|gb|ABQ01982.1| cathepsin [Fasciola gigantica]
          Length = 326

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 84/225 (37%), Positives = 124/225 (55%), Gaps = 20/225 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 108 VPDKIDWRESGY--VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 165

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN+ C GG ++ A+EY+KQ+GLE+++ YPY   E    +C Y ++     V D + V 
Sbjct: 166 PWGNMGCMGGLMENAYEYLKQFGLETESSYPYTAVEG---QCRYNRQLGVAKVTDYYTVH 222

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIES----YDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           SG +  +  L    GP  V ++   +ES    Y G   +     C+  +++HAV  VGYG
Sbjct: 223 SGSEVELKNLVGAEGPAAVAVD---VESDFMMYSGGIYQSR--TCSSLRVNHAVLAVGYG 277

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
            ++G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 278 TQSGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIASLASLPMV 322


>gi|227018328|gb|ACP18830.1| cysteine proteinase 1 [Chrysomela tremula]
          Length = 323

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 108/340 (31%), Positives = 153/340 (45%), Gaps = 55/340 (16%)

Query: 37  IKQVDAFKTYIVKWN----------------RTYTDDNEIKTRFEYF------------K 68
           +K + AF  ++V  N                +TY    E K RF  F            K
Sbjct: 1   MKLIIAFAAFVVAINAASDQELWADFKKAHGKTYKSLREEKLRFNIFQDTLREIAAHNAK 60

Query: 69  QDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPK 128
            +  E+  Y   +  SD + +E   R  L    + +  LE D E     +     G  P+
Sbjct: 61  YESGESTYYLAINQFSDITDEEF--RAMLMKNVESRPSLE-DME-----IANLTVGAAPE 112

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHG 186
           S+DWR     +  P+ +Q  CGSCWAF+  A +E Q A+   +  PLS  QLV+C  + G
Sbjct: 113 SIDWRTEGAVL--PIRNQEDCGSCWAFSAVAAVEGQAAIKSGSKTPLSVQQLVDCSTEGG 170

Query: 187 NLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD 246
           N  CNGG ++ AF+Y+K  GLES A YPY   ++     + + +K+   V+ T       
Sbjct: 171 NSGCNGGLMNGAFDYIKANGLESDAKYPYTGTDD-----SCKADKSSSLVKLTGYKKVAS 225

Query: 247 HMMHLLQS----GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGIL 302
               L ++    GPI V +   L  SY G     N+  C    LDH V  VGYG  NG  
Sbjct: 226 SEASLKEAVGTVGPISVAVYADLWRSYGGGIF--NNILCLGFGLDHGVTAVGYGTDNGKK 283

Query: 303 TWIVRNSWGDIGPDHGYFQIERGA-NACGI---ESYAYLA 338
            W V+NSWG+   + GY ++ R   + CGI    SY  LA
Sbjct: 284 YWPVKNSWGESWGEEGYIRMARDTLHNCGINQQASYPILA 323


>gi|41055337|ref|NP_956720.1| cathepsin S, a [Danio rerio]
 gi|32451845|gb|AAH54668.1| Cathepsin S, a [Danio rerio]
          Length = 239

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 81/242 (33%), Positives = 122/242 (50%), Gaps = 15/242 (6%)

Query: 107 LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVA 166
           LE D   V    +      LP+ ++W +  +  ++PV++QG CGSCWAF+    LE+Q+ 
Sbjct: 5   LEEDFPDVNATFSPPSLQTLPQRVNWTEHGM--VSPVQNQGPCGSCWAFSAVGSLEAQMK 62

Query: 167 LLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITF 223
                L PLS   L++C    GN  C GG +  AF YV Q  G++S   YPY +KE +  
Sbjct: 63  RRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSSTFYPYEHKEGV-- 120

Query: 224 RCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS-----GPIGVYLNHRLIESYDGNPIRRND 278
            C Y       +     +     H    LQS     GP+ V +N +L+  +       ND
Sbjct: 121 -CRYSVSGRAGYCTGFRIVP--RHNEAALQSAVANIGPVSVGINAKLLSFHRYRSGIYND 177

Query: 279 WACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLA 338
             C+   ++HAV +VGYG +NG   W+V+NSWG    ++GY ++ R  N CGI S+    
Sbjct: 178 PKCSSALINHAVLVVGYGSENGQDYWLVKNSWGTAWGENGYIRMARNKNMCGISSFGIYP 237

Query: 339 SV 340
           ++
Sbjct: 238 TI 239


>gi|328789602|ref|XP_623690.2| PREDICTED: cathepsin O-like [Apis mellifera]
          Length = 368

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 95/324 (29%), Positives = 161/324 (49%), Gaps = 35/324 (10%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDD-NEIKTRFEYFKQD-----------GKETDEYYGTSG 82
           D+ + +  F+ Y++++N++Y ++ +E + RF+ F++              +   YYG + 
Sbjct: 43  DNNEDIKLFQNYVIRYNKSYRNNPSEYEERFKRFQRSLQHIERMNGLRSSQESAYYGLTE 102

Query: 83  SSDRSPQEILQRTGLR-LTGKEKERLEADRERVKKFLNERKKGPL--PKSLDWRQSKVKV 139
            SD S  E L  T L  L  + ++ + A   R  +   +R K  +  P   DWR   V  
Sbjct: 103 FSDMSENEFLLHTLLPDLPIRGEKHMNASYHRKHQISIDRMKRSISIPLRFDWRDKGV-- 160

Query: 140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-DHGNLNCNGGNI--D 196
           + PV SQG CG+CWAF+T  ++ES  A+   TL+ LS  ++++C  + N  C GG+I   
Sbjct: 161 ITPVRSQGSCGACWAFSTIEVIESMFAIKNGTLHSLSVQEMIDCAKNSNFGCEGGDICSL 220

Query: 197 VAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVF---VQDTWVTSGVDH----MM 249
           +++  + +  +  ++ YP      +T  C   K   K F   +QD    S VD     ++
Sbjct: 221 LSWLLISKVQILQESIYPLV---GMTGTCKLGKMTDKTFNIKIQDFTCDSFVDAEDELLI 277

Query: 250 HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNP--HKLDHAVAIVGYGEKNGILTWIVR 307
            L   GP+   +N    ++Y G  I+   + C+   + L+HAV I+GY +   +  +I++
Sbjct: 278 ALATHGPVAAAVNALSWQNYLGGVIQ---YHCDGSFNNLNHAVQIIGYDKSVAVPHYIIK 334

Query: 308 NSWGDIGPDHGYFQIERGANACGI 331
           NSWG    D GY  I  G N CGI
Sbjct: 335 NSWGSNFGDKGYMYIGIGNNLCGI 358


>gi|145334857|ref|NP_001078774.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|332009932|gb|AED97315.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 361

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/319 (30%), Positives = 152/319 (47%), Gaps = 33/319 (10%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD-------GKETDEY-YGTSGSSDRSPQ 89
           + V +F  +  ++ + Y +  E+K RF  FK++        K+   Y  G +  +D + Q
Sbjct: 54  RHVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQ 113

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E  QRT L         L+   +  +          LP++ DWR+  +  ++PV+ QG C
Sbjct: 114 E-FQRTKLGAAQNCSATLKGSHKVTE--------AALPETKDWREDGI--VSPVKDQGGC 162

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQYG- 206
           GSCW F+TT  LE+           LS+ QLV+C     N  CNGG    AFEY+K  G 
Sbjct: 163 GSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGG 222

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGVYL 261
           L+++  YPY  K+     C +  E   V V ++  +T G +    H + L++   I   +
Sbjct: 223 LDTEKAYPYTGKDET---CKFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIAFEV 279

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
            H     Y       +     P  ++HAV  VGYG ++G+  W+++NSWG    D GYF+
Sbjct: 280 IHSF-RLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFK 338

Query: 322 IERGANACGIESYAYLASV 340
           +E G N CG   Y Y+  +
Sbjct: 339 MEMGKNMCG--KYCYMCII 355


>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
 gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 95/311 (30%), Positives = 157/311 (50%), Gaps = 35/311 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI-L 92
           ++ ++VK+ + Y    E + RFE FK + K  D++          G +  +D S +E   
Sbjct: 49  YEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRA 108

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
              G R+ GK   RL    +  +    +     LP+S+DWR+     + PV+ QG+CGSC
Sbjct: 109 AYLGTRMDGKR--RLLGGPKSARYLFKDGDD--LPESVDWREKGA--VAPVKDQGQCGSC 162

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQ 210
           WAF+T   +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++ K  G++++
Sbjct: 163 WAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGIDTE 222

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--NHRL 265
            DYPY+  +++   C   ++ A+V   D +     +    +   + + P+ V +    R 
Sbjct: 223 EDYPYKAVDSM---CDPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRA 279

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
            + Y          +C   +LDH V  VGYG +NG+  W+VRNSWG    ++GY ++ER 
Sbjct: 280 FQLYQSGVFTG---SCG-TQLDHGVVAVGYGTENGVDYWVVRNSWGPAWGENGYIRMERN 335

Query: 326 ANA-----CGI 331
             +     CGI
Sbjct: 336 VASTETGKCGI 346


>gi|357619725|gb|EHJ72184.1| hypothetical protein KGM_03271 [Danaus plexippus]
          Length = 338

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 93/310 (30%), Positives = 157/310 (50%), Gaps = 25/310 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKE--------TDEYYGTSGSSDRSPQEILQR 94
           F+ +I  +N+ Y D++E + RF+ F  + K+        ++  YG +  SD S +E ++ 
Sbjct: 41  FEQFIKDYNKEY-DESEKEERFKIFVNNLKDINAMNERSSNAVYGINKFSDLSKEEFIKY 99

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                TG ++E   ++ +  K  L E      P   DWR  K  V++ +++Q  CGSCWA
Sbjct: 100 ----YTGLKREESPSNEDHKKTDLPESFNVTAPDQFDWR--KKGVVSSIKNQKHCGSCWA 153

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYP 214
           F+  A +ES  A+    L  +S+ QL++CD  +  C+GG    A  Y    G  S   YP
Sbjct: 154 FSAAANVESIHAIKTGKLIDVSEQQLLDCDKYDSGCSGGLPWDALRYFVANGAMSLKSYP 213

Query: 215 YRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYLNHRLIESYDG 271
           Y  KE    +C Y+  K ++ ++   + S +       HL   GP+ + ++   I+ Y G
Sbjct: 214 YVAKEG---KCRYDSSKVEIRLKGYKIFSKISEDQIKEHLYNIGPLSIAIDVSPIKPYVG 270

Query: 272 NPIRRNDWACNP-HKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACG 330
             +      C+   +++HAV +VGYG++  +  WIV+NSWG    ++GYF++ERG N   
Sbjct: 271 GIVMEE---CHEVCQVNHAVLLVGYGKEYSVEYWIVKNSWGPNWGENGYFRMERGVNCLL 327

Query: 331 IESYAYLASV 340
           + S     +V
Sbjct: 328 LTSTGITTAV 337


>gi|379991182|emb|CCA61803.1| cathepsin protein CatL1-MM3p, partial [Fasciola hepatica]
          Length = 326

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 85/225 (37%), Positives = 122/225 (54%), Gaps = 20/225 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 108 VPDKIDWRESGY--VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 165

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN  C+GG ++ A++Y+KQ+GLE+++ YPY   E    +C Y K+     V   + V 
Sbjct: 166 PWGNNGCSGGLMENAYQYLKQFGLETESSYPYTAVEG---QCRYNKQLGVAKVTGYYTVH 222

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIES----YDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           SG +  +  L    GP  V ++   +ES    Y G   +     C+P  L+HAV  VGYG
Sbjct: 223 SGSEVELKNLVGAEGPAAVAVD---VESDFMMYSGGIYQSQ--TCSPLGLNHAVLAVGYG 277

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
            + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 278 TQGGTDYWIVKNSWGSYWGERGYIRMARNRGNMCGIASLASLPMV 322


>gi|348513249|ref|XP_003444155.1| PREDICTED: cathepsin K-like [Oreochromis niloticus]
          Length = 330

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 85/225 (37%), Positives = 121/225 (53%), Gaps = 16/225 (7%)

Query: 125 PLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD 184
           PLP ++DWR  K  ++ PV +QG CGSCWAF++   LE Q+     TL  LS   LV+C 
Sbjct: 113 PLPVNVDWR--KEGLVGPVRNQGLCGSCWAFSSLGALEGQLKKRTGTLVSLSPQNLVDCS 170

Query: 185 --HGNLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKE-KAKVFVQDTW 240
              GNL C GG I  A+ YV +  G++S++ YPY +K     +C Y  + +A    + + 
Sbjct: 171 TQDGNLGCRGGYITKAYSYVIRNGGVDSESFYPYEHKNG---KCRYSVQGRAGYCSKFSI 227

Query: 241 VTSGVDHMMH--LLQSGPIGVYLNHRL--IESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           +  G + M+   L   GPI V +N  L     Y G     N  +CNP  ++HAV +VGYG
Sbjct: 228 LPEGDEKMLQKVLASVGPISVAVNAMLESFHMYSGG--LYNVPSCNPKLINHAVLLVGYG 285

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
              G   W+V+NSWG    + GY ++ R   N CGI S+    +V
Sbjct: 286 TDAGQDYWLVKNSWGTAWGEGGYIRLARNKNNLCGIASFPVYPTV 330


>gi|377823949|gb|AFB77219.1| cathepsin L1 [Fasciola gigantica]
          Length = 326

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 85/225 (37%), Positives = 122/225 (54%), Gaps = 20/225 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 108 VPDKIDWRESGY--VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 165

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN  C GG ++ A+EY+KQ+GLE+++ YPY   E    +C Y ++     V D + V 
Sbjct: 166 PWGNYGCMGGLMENAYEYLKQFGLETESSYPYTAVEG---QCRYNRQLGVAKVTDYYTVH 222

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIES----YDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           SG +  +  L    GP  V ++   +ES    Y G   +     C+P  ++HAV  VGYG
Sbjct: 223 SGSEVELKNLVGAEGPAAVAVD---VESDFMMYRGGIYQSQ--TCSPLGVNHAVLAVGYG 277

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
            + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 278 TQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIASLASLPMV 322


>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
          Length = 459

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 99/316 (31%), Positives = 154/316 (48%), Gaps = 46/316 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLR---- 98
           ++T++VK  + Y    E + RF  FK + +  DE         R+ + +  + GL     
Sbjct: 43  YETWLVKHGKNYNGLGEKQLRFNIFKDNLRFVDE---------RNSENLSFKLGLNRFAD 93

Query: 99  LTGKEKERL------------EADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           LT +E   +             + R +  ++   R    LP+S+DWR  K   +  ++ Q
Sbjct: 94  LTNEEYRSVYLGTRPRSVAVARSGRSKSDRYA-FRAGDTLPESVDWR--KKGAVAGIKDQ 150

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQ 204
           G CGSCWAF+  A +E    ++   L  LS+ +LVECD   N  C+GG +D AFE++ K 
Sbjct: 151 GSCGSCWAFSAIAAVEGVNQIVTGDLISLSEQELVECDTSYNDGCDGGLMDYAFEFIIKN 210

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV---DHMMHLLQSGPIGVYL 261
            G++S  DYPY  ++    RC   ++ AKV   D +  S V     +   + + P+ V +
Sbjct: 211 EGIDSDEDYPYTGRDG---RCDTNRKNAKVVTIDDYEDSPVYDEKSLQKAVANQPVSVAI 267

Query: 262 --NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGY 319
               R  + YD          C    LDH VA+VGYG ++G+  WIVRNSWGD   + GY
Sbjct: 268 EGGGRDFQLYDSGVFTGK---CGT-ALDHGVAVVGYGTEDGLDYWIVRNSWGDTWGEGGY 323

Query: 320 FQIERG----ANACGI 331
            +++R     +  CGI
Sbjct: 324 IRMQRNTKLPSGICGI 339


>gi|338712411|ref|XP_001491536.3| PREDICTED: cathepsin F [Equus caballus]
          Length = 459

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 153/320 (47%), Gaps = 31/320 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E + R   F  +     +          YG +  SD 
Sbjct: 155 SVKMASIFKHFVTTYNRTYETKEEAQWRMSIFASNMVRAQKIQALDRGTAQYGVTKFSDL 214

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E   RT + L    KE       R K   +       P   DWR      +  V+ Q
Sbjct: 215 TEEEF--RT-IYLNPLLKEEPGVKMRRAKSVGDSA-----PPEWDWRSKGA--VTEVKDQ 264

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L +  L  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 265 GMCGSCWAFSVTGNVEGQWFLNRGALLSLSEQELLDCDKVDKACMGGLPSNAYSAIKTLG 324

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y         C++  EKAKV++ D+   +  +  +   L + GPI V +N 
Sbjct: 325 GLETEDDYSYHGHLQA---CSFSAEKAKVYINDSVELTKNEQKLAAWLAKKGPISVAINA 381

Query: 264 RLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
             ++ Y     +P+R     C+P  +DHAV +VGYG ++ +  W ++NSWG    + GY+
Sbjct: 382 FGMQFYRHGISHPLRP---LCSPWLIDHAVLLVGYGNRSAVPFWAIKNSWGTDWGEEGYY 438

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 439 YLYRGSGACGVNTMASSAVV 458


>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
 gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
          Length = 1810

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 96/311 (30%), Positives = 149/311 (47%), Gaps = 36/311 (11%)

Query: 52   RTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQRTGLRLTGK 102
            R Y    E + RF  F+ +  + ++          YG +  +D +  E    TGL +   
Sbjct: 1509 RQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADMTVAEYRAHTGLVV--P 1566

Query: 103  EKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILE 162
            + +R      RV    +    G LP+S DWR      +  V++QG CGSCWAF+    +E
Sbjct: 1567 KHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGA--VTEVKNQGSCGSCWAFSAVGNVE 1624

Query: 163  SQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENI 221
                +  K L   S+ +L++CD  +  C GG +D AF+ ++Q  GLE + DYPY  K   
Sbjct: 1625 GLHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQK 1684

Query: 222  TFRCTYEKEKAKVFVQ--------DTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNP 273
            +  C + +  + V V+        +T++        +L+++GPI + LN   ++ Y G  
Sbjct: 1685 S--CHFNRSLSHVQVKGAVDMPKNETYIAK------YLIKNGPIAIGLNANAMQFYRGGI 1736

Query: 274  IRRNDWACNPHKLDHAVAIVGYGEK-----NGILT-WIVRNSWGDIGPDHGYFQIERGAN 327
                   CN   +DH V IVGYG K     N  L  WI++NSWG    + GY++I RG N
Sbjct: 1737 SHPWHPLCNHKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGPRWGEQGYYRIYRGDN 1796

Query: 328  ACGIESYAYLA 338
            +CG+   A  A
Sbjct: 1797 SCGVSEMASSA 1807


>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
 gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
          Length = 356

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 102/313 (32%), Positives = 147/313 (46%), Gaps = 39/313 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRF-------EYFKQDGKETDEY-YGTSGSSDRSPQEILQR 94
           F  +  ++ + Y    E+K RF       E  K   K+   Y  G +  +D + +E    
Sbjct: 57  FARFAHRYGKKYETAEEMKLRFGIFLESLELIKSTNKQGLSYKLGVNQFADWTWEEF--- 113

Query: 95  TGLRLTGKEKERLEADR--ERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
                    K RL A +      K  ++     LP+S DWR  K  +++PV+ QG CGSC
Sbjct: 114 --------RKHRLGAAQNCSATTKGSHKLTDTALPESKDWR--KDGIVSPVKDQGHCGSC 163

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLES 209
           W F+TT  LE+  A        LS+ QLV+C  G  N  CNGG    AFEY+K   GL++
Sbjct: 164 WTFSTTGALEAAYAQAHGKGISLSEQQLVDCGRGFNNFGCNGGLPSQAFEYIKYNGGLDT 223

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGV-DHMMHLLQ-SGPIGVYL----N 262
           +  YPY   +     C +  E   V V D+  +T G  D + H +    P+ V       
Sbjct: 224 EEAYPYTGVDG---SCKFVPENVGVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVSG 280

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
            RL   Y       N     P  ++HAV  VGYG ++GI  W+++NSWG    D+GYF++
Sbjct: 281 FRL---YSKGVYTSNSCGSTPMDVNHAVLAVGYGVEDGIPYWLIKNSWGGNWGDNGYFKM 337

Query: 323 ERGANACGIESYA 335
           E G N CG+ + A
Sbjct: 338 EMGKNMCGVATCA 350


>gi|402585860|gb|EJW79799.1| cysteine protease 6 [Wuchereria bancrofti]
          Length = 242

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 74/218 (33%), Positives = 115/218 (52%), Gaps = 8/218 (3%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP   DW    V  + PV++QG CGSCWAF+ T  +ES  A+    L  LS+ +L++CD 
Sbjct: 29  LPNKFDWNTKGV--VTPVKNQGSCGSCWAFSVTGNIESLWAIKTGNLISLSEQELIDCDV 86

Query: 186 GNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG 244
            +  CNGG    AF  +K+ G LE +  YPY+ K      C   + +  V + D      
Sbjct: 87  IDNGCNGGLPINAFREIKRMGGLEPEDQYPYKAKNG---TCHLVRAQIAVTIDDAIEIPR 143

Query: 245 VDHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGIL 302
            + +M   + Q GP+ V ++  L+  Y    +  +   C P K++H V I GYG +NG+ 
Sbjct: 144 NETVMKAWIAQRGPLSVGIDAELLAYYKSGILHPSKSRCPPSKINHGVLITGYGIENGLP 203

Query: 303 TWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
            W ++NSWG+   ++GYF++ RG + CG+      A +
Sbjct: 204 YWTIKNSWGEEWGENGYFRLMRGKDICGVSDLVSSAII 241


>gi|323451241|gb|EGB07119.1| hypothetical protein AURANDRAFT_54023 [Aureococcus anophagefferens]
          Length = 377

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 91/315 (28%), Positives = 147/315 (46%), Gaps = 26/315 (8%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTS-GSSDRSPQEILQRTGLRL 99
           +AF T++ K+ +TY    E   R   F Q+ K   E+   + G +     +    T    
Sbjct: 63  EAFMTFMTKFEKTYETVEEWAHRLTVFAQNAKIVLEHDAKAEGFALGLDNQFADWTAEEF 122

Query: 100 TGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
              +K        +     +E      P ++DWR   V  +  +++QG CGSCW F+T  
Sbjct: 123 ASYQKLHSRPKPSQAGA-THEVSDKAAPTAVDWRTEGV--VADIKNQGSCGSCWTFSTVV 179

Query: 160 ILESQVALLKKTLYPLSKSQLVEC---------DHGNLNCNGGNIDVAFEYV---KQYGL 207
            +E   A     L  LS+  LV+C         D   + C+GG +D AF+Y+   +  G+
Sbjct: 180 SIEGAAARKTGKLVTLSEQNLVDCVKKDQIDGGDECCMGCSGGLMDNAFDYIIKNQDGGI 239

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQ---DTWVTSGVDHMMHLLQSGPIGVYLN-H 263
           +++A Y Y  K+     C ++K      +    D  V   V     L  +GP+ + L+  
Sbjct: 240 DTEASYGYTGKDGT---CAFDKANVGATISNWTDVAVGDEVALADALANAGPVSIALDAS 296

Query: 264 RLIESYDGNPIR-RNDWAC--NPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           +  + Y G  ++ R+   C  +P   DH VAIVGYG  +G+  W +RNSWG    + GY 
Sbjct: 297 KQWQLYSGGILKPRSILGCSSDPTHADHGVAIVGYGTDDGVDYWWIRNSWGTTWGESGYM 356

Query: 321 QIERGANACGIESYA 335
           ++ERG NACG+ ++A
Sbjct: 357 RLERGVNACGVANFA 371


>gi|4574304|gb|AAD23996.1|AF112566_1 cathepsin [Fasciola gigantica]
          Length = 326

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 83/225 (36%), Positives = 124/225 (55%), Gaps = 20/225 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  ++ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 108 VPDKIDWRESGY--VTELKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 165

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN+ C+GG ++ A+EY+KQ+GLE+++ YPY   E    +C Y ++     V D + V 
Sbjct: 166 PWGNMGCSGGLMENAYEYLKQFGLETESSYPYTAVEG---QCRYNRQLGVAKVTDYYTVH 222

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIES----YDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           SG +  +  L    GP  V ++   +ES    Y G   +     C+  +++HAV  VGYG
Sbjct: 223 SGSEVELKNLVGAEGPAAVAVD---VESDFMMYSGGIYQSR--TCSSLRVNHAVLAVGYG 277

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
            + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 278 TQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIASLASLPMV 322


>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
          Length = 352

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 102/306 (33%), Positives = 155/306 (50%), Gaps = 38/306 (12%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K +  F++++ K ++ Y   +E   RFE F  + K  D+        + G +  +D + +
Sbjct: 44  KVIHLFESWLAKHSKIYESLDEKLHRFEIFMDNLKHIDDTNKKVSNYWLGLNEFADLTHE 103

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E   +  L L G+  ER +   E +++F + R    LPKS+DWR  K   + PV++QG+C
Sbjct: 104 EFKNKF-LGLKGELPERKD---ESIEEF-SYRDFVDLPKSVDWR--KKGAVAPVKNQGQC 156

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQYGLE 208
           GSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AF YV + GL 
Sbjct: 157 GSCWAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSGLH 216

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV--------DHMMHLLQSGPIGVY 260
            + +YPY   E     C  +K+     V +T   SG         D  +  L + PI V 
Sbjct: 217 KEEEYPYIMSEGT---CDEKKD-----VSETVTISGYHDVPRNNEDSFLKALANQPISVA 268

Query: 261 L--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHG 318
           +  + R  + Y G      D  C   +LDH VA VGYG   G+   IVRNSWG    + G
Sbjct: 269 IEASGRDFQFYSGGVF---DGHCGT-ELDHGVAAVGYGTTKGLDYVIVRNSWGPKWGEKG 324

Query: 319 YFQIER 324
           Y +++R
Sbjct: 325 YIRMKR 330


>gi|113819972|gb|AAH04054.2| Ctsf protein [Mus musculus]
          Length = 332

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 154/320 (48%), Gaps = 31/320 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E + R   F ++     +          YG +  SD 
Sbjct: 28  SVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDL 87

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E        L  KE  R  +  + +            P   DWR  K   +  V++Q
Sbjct: 88  TEEEFHTIYLNPLLQKESGRKMSPAKSINDLA--------PPEWDWR--KKGAVTEVKNQ 137

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 138 GMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSNAYAAIKNLG 197

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y+        C +  + AKV++ D+   S  ++ +   L Q GPI V +N 
Sbjct: 198 GLETEDDYGYQGHVQT---CNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINA 254

Query: 264 RLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
             ++ Y     +P R     C+P  +DHAV +VGYG ++ I  W ++NSWG    + GY+
Sbjct: 255 FGMQFYRHGIAHPFRP---LCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGSDWGEEGYY 311

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 312 YLYRGSGACGVNTMASSAVV 331


>gi|37911662|gb|AAR05023.1| cathepsin L-like protein [Tenebrio molitor]
          Length = 336

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 103/323 (31%), Positives = 155/323 (47%), Gaps = 41/323 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY------------YGTSGSSDRSPQE 90
           FKT    + R+Y +  E   R + F++  +  +E+             G +  +D +P+E
Sbjct: 30  FKT---TYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTPEE 86

Query: 91  ILQRT-GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           +   T GL +     +     + R    LN   +   P S DWR   +  ++PV++QG C
Sbjct: 87  MKAYTHGLIMPADLHKNGIPIKTREDLGLNASVR--YPASFDWRDQGM--VSPVKNQGSC 142

Query: 150 GSCWAFATTAILESQVALLKKTLY--PLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG- 206
           GSCWAF++T  +ESQ+ +     Y   +S+ QLV+C    L C+GG ++ AF YV Q G 
Sbjct: 143 GSCWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVPNALGCSGGWMNDAFTYVAQNGG 202

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM---HLLQSGPIGVYLNH 263
           ++S+  YPY   +     C Y+  +    +      SG D  M    +   GP+ V  + 
Sbjct: 203 IDSEGAYPYEMADG---NCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAFDA 259

Query: 264 R-LIESYDG----NPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHG 318
                SY G    NP       C  +K  HAV IVGYG +NG   W+V+NSWGD     G
Sbjct: 260 DDPFGSYSGGVYYNP------TCETNKFTHAVLIVGYGNENGQDYWLVKNSWGDGWGLDG 313

Query: 319 YFQIERGA-NACGIESYAYLASV 340
           YF+I R A N CGI   A + ++
Sbjct: 314 YFKIARNANNHCGIAGVASVPTL 336


>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 431

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 100/310 (32%), Positives = 147/310 (47%), Gaps = 39/310 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLR---- 98
           ++ ++VK  +      E   RFE FK + +  DE+ G         + +  R GL     
Sbjct: 42  YEEWVVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNG---------KNLSYRLGLTKFAD 92

Query: 99  LTGKE------KERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           LT  E        RL+    +       R    +P+S+DWR  K   +  V+ QG CGSC
Sbjct: 93  LTNDEYRSMYLGSRLKRKATKTSLRYEARVGDAIPESVDWR--KEGAVAEVKDQGSCGSC 150

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQ 210
           WAF+T   +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++ K  G++++
Sbjct: 151 WAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTE 210

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--NHRL 265
            DYPY+  +    RC   ++ AKV   D++     +  + +   L   PI V +    R 
Sbjct: 211 EDYPYKGVDG---RCDQTRKNAKVVTIDSYEDVPANSEESLKKALSHQPISVAIEGGGRA 267

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER- 324
            + YD       D  C    LDH V  VGYG +NG   WIV+NSWG    + GY ++ER 
Sbjct: 268 FQLYDSGIF---DGICGT-DLDHGVVAVGYGTENGKDYWIVKNSWGTSWGESGYIRMERN 323

Query: 325 ---GANACGI 331
               A  CGI
Sbjct: 324 IASSAGKCGI 333


>gi|28932706|gb|AAO60047.1| midgut cysteine proteinase 4 [Rhipicephalus appendiculatus]
          Length = 345

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 82/218 (37%), Positives = 124/218 (56%), Gaps = 15/218 (6%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G  P+ ++WR++    + PV++QG+CGSCWAF++T  LE QV    + L  LS+  L++C
Sbjct: 124 GDTPEFIEWRENGF--VTPVKNQGQCGSCWAFSSTGALEGQVFKRTRRLISLSEQNLMDC 181

Query: 184 D---HGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEK--EKAKVFVQ 237
               +GN  CNGG +  AF+YV+  G L+++A YPYR   N  F+C +    E  +V V 
Sbjct: 182 AGQRYGNNGCNGGQMPGAFQYVQDAGGLDTEARYPYRQGTN--FQCQFSNSFEARRVSVN 239

Query: 238 D-TWVTSGVDHMMH--LLQSGPIGVYLNHRL-IESYDGNPIRRNDWACNPHKLDHAVAIV 293
             T V    + ++   +   GPI + +N       +  N I   +  C+P  L+HAV +V
Sbjct: 240 GHTRVPPRNERVLQDAVANVGPISIAINASPQTFMFYKNGIY-GEPNCDPRGLNHAVLLV 298

Query: 294 GYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGI 331
           GYGE+ G+  WIV+NSWG    + GY +I R  N CG+
Sbjct: 299 GYGEERGVPYWIVKNSWGPGWGEGGYIKILRNRNVCGM 336


>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
 gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
          Length = 1834

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 96/311 (30%), Positives = 149/311 (47%), Gaps = 36/311 (11%)

Query: 52   RTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQRTGLRLTGK 102
            R Y    E + RF  F+ +  + ++          YG +  +D +  E    TGL +   
Sbjct: 1533 RQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADMTVAEYRAHTGLVV--P 1590

Query: 103  EKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILE 162
            + +R      RV    +    G LP+S DWR      +  V++QG CGSCWAF+    +E
Sbjct: 1591 KHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGA--VTEVKNQGSCGSCWAFSAVGNVE 1648

Query: 163  SQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENI 221
                +  K L   S+ +L++CD  +  C GG +D AF+ ++Q  GLE + DYPY  K   
Sbjct: 1649 GLHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQK 1708

Query: 222  TFRCTYEKEKAKVFVQ--------DTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNP 273
            +  C + +  + V V+        +T++        +L+++GPI + LN   ++ Y G  
Sbjct: 1709 S--CHFNRSLSHVQVKGAVDMPKNETYIAK------YLIKNGPIAIGLNANAMQFYRGGI 1760

Query: 274  IRRNDWACNPHKLDHAVAIVGYGEK-----NGILT-WIVRNSWGDIGPDHGYFQIERGAN 327
                   CN   +DH V IVGYG K     N  L  WI++NSWG    + GY++I RG N
Sbjct: 1761 SHPWHPLCNHKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGPRWGEQGYYRIYRGDN 1820

Query: 328  ACGIESYAYLA 338
            +CG+   A  A
Sbjct: 1821 SCGVSEMASSA 1831


>gi|218202220|gb|EEC84647.1| hypothetical protein OsI_31538 [Oryza sativa Indica Group]
          Length = 363

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 94/309 (30%), Positives = 147/309 (47%), Gaps = 30/309 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-RLTG 101
           F  + V+  + Y D  E++ RF  F +  +          S++R  + +  R G+ R   
Sbjct: 63  FARFAVRHGKRYGDAAEVQRRFRIFSESLELVR-------STNR--RGLPYRLGINRFAD 113

Query: 102 KEKERLEADRERVKKFLNE--------RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
              E  +A R    +  +         R    LP++ DWR+  +  ++PV+ QG CGSCW
Sbjct: 114 MSWEEFQASRLGAAQNCSATLAGNHRMRDAAALPETKDWREDGI--VSPVKDQGHCGSCW 171

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQ 210
            F+TT  LE+           LS+ QLV+C   + N  C+GG    AFEY+K   GL+++
Sbjct: 172 TFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGGLDTE 231

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQSGPIGVYLNHRLIES- 268
             YPY     I   C Y+ E   V V D+  +T G +  +         V +  ++I   
Sbjct: 232 EAYPYTGVNGI---CHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVAFQVINGF 288

Query: 269 --YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
             Y       +    +P  ++HAV  VGYG +NG+  W+++NSWG    D+GYF++E G 
Sbjct: 289 RMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGK 348

Query: 327 NACGIESYA 335
           N CGI + A
Sbjct: 349 NMCGIATCA 357


>gi|20136379|gb|AAM11647.1|AF490984_1 cathepsin L, partial [Fasciola hepatica]
          Length = 311

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 125/224 (55%), Gaps = 18/224 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 93  VPDKIDWRESGY--VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 150

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN  C+GG ++ A++Y+KQ+GLE+++ YPY   E    +C Y K+     V   + V 
Sbjct: 151 PWGNNGCSGGLMENAYQYLKQFGLETESSYPYTAVEG---QCRYNKQLGVAKVTGYYTVH 207

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVAIVGYGE 297
           SG +  +  L    GP  V ++   +ES D    R   +    C+P +++HAV  VGYG 
Sbjct: 208 SGSEVELKNLVGAEGPAAVAVD---VES-DFMMYRSGIYQSQTCSPLRVNHAVLAVGYGT 263

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           ++G   WIV+NSWG    + GY ++ R   N CGI S A +A V
Sbjct: 264 QDGTDYWIVKNSWGSYWGERGYIRMARNRGNMCGIASLASVAMV 307


>gi|26245875|gb|AAN77413.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
          Length = 287

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 92/267 (34%), Positives = 132/267 (49%), Gaps = 28/267 (10%)

Query: 79  GTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKK----GPLPKSLDWRQ 134
           G +  +D +P+E +            ER    R+   KFL+E+ K    G LP  +DW  
Sbjct: 34  GVNKFADLTPEEFM------------ERFRPLRKTKPKFLSEQAKFNFDGDLPAEVDW-- 79

Query: 135 SKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGN 194
           +K   +  V+SQG CGSCWAF+TT  +ES   +    L  LS+ QLV+C   N  C GG 
Sbjct: 80  TKQGAVTEVKSQGSCGSCWAFSTTGSVESHNFIKTGKLISLSEQQLVDCVKNNSGCAGGW 139

Query: 195 IDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG---VDHMMHL 251
           +D+A EY++  G+ S+ DYPY  + N T  C +   KA V ++          +D    +
Sbjct: 140 MDIALEYIEADGIMSEDDYPYEER-NTT--CRFNNSKAAVQIKSYKAIKKNDEIDLQKAV 196

Query: 252 LQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHK--LDHAVAIVGYGEKNGILTWIVRNS 309
              GP+ V +   +        I  ND  C   +  L HAV + GYG ++G   WIV+NS
Sbjct: 197 ALEGPVPVAIEVTIAFQLYARGI-LNDPQCKNTEGDLTHAVLVTGYGSQDGKDYWIVKNS 255

Query: 310 WGDIGPDHGYFQIERGA-NACGIESYA 335
           WG      GY ++ R A N CGI + A
Sbjct: 256 WGAEYGMDGYLRMSRNADNQCGIATRA 282


>gi|115479391|ref|NP_001063289.1| Os09g0442300 [Oryza sativa Japonica Group]
 gi|115510968|sp|P25778.2|ORYC_ORYSJ RecName: Full=Oryzain gamma chain; Flags: Precursor
 gi|51535997|dbj|BAD38077.1| putative oryzain gamma chain precursor [Oryza sativa Japonica
           Group]
 gi|113631522|dbj|BAF25203.1| Os09g0442300 [Oryza sativa Japonica Group]
 gi|215694919|dbj|BAG90110.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 362

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 94/309 (30%), Positives = 147/309 (47%), Gaps = 30/309 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-RLTG 101
           F  + V+  + Y D  E++ RF  F +  +          S++R  + +  R G+ R   
Sbjct: 62  FARFAVRHGKRYGDAAEVQRRFRIFSESLELVR-------STNR--RGLPYRLGINRFAD 112

Query: 102 KEKERLEADRERVKKFLNE--------RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
              E  +A R    +  +         R    LP++ DWR+  +  ++PV+ QG CGSCW
Sbjct: 113 MSWEEFQASRLGAAQNCSATLAGNHRMRDAAALPETKDWREDGI--VSPVKDQGHCGSCW 170

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQ 210
            F+TT  LE+           LS+ QLV+C   + N  C+GG    AFEY+K   GL+++
Sbjct: 171 TFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGGLDTE 230

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQSGPIGVYLNHRLIES- 268
             YPY     I   C Y+ E   V V D+  +T G +  +         V +  ++I   
Sbjct: 231 EAYPYTGVNGI---CHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVAFQVINGF 287

Query: 269 --YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
             Y       +    +P  ++HAV  VGYG +NG+  W+++NSWG    D+GYF++E G 
Sbjct: 288 RMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGK 347

Query: 327 NACGIESYA 335
           N CGI + A
Sbjct: 348 NMCGIATCA 356


>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 340

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 103/343 (30%), Positives = 163/343 (47%), Gaps = 36/343 (10%)

Query: 6   CDHQETNTEQVTYNVNTDSAIYVWRDL---AYDSIKQVDA-FKTYIVKWNRTYTDDNEIK 61
           C   + N + V       S    W++L    Y ++++ +    T+   WN+    + +  
Sbjct: 16  CSAMQLNQQHV-------SLFQTWKNLWKKVYQTVEEEEQKMATWFNNWNKISEHNMQYS 68

Query: 62  TRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNER 121
            + + ++ +  E    YG   S + S      R  +RL     +R           L+  
Sbjct: 69  LKQKSYRLEMNE----YGDLTSEEFSSMMNGYRNDIRL-----KRKSTGGSTYLNLLSFG 119

Query: 122 KKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLV 181
            +  LP  +DWR  K  ++ PV++QG+CGSCW+F+ T  LE Q       L  LS+  L+
Sbjct: 120 SQIQLPTLVDWR--KHGLVTPVKNQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLI 177

Query: 182 ECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENIT-FRCTYEKEKAKVFVQ 237
           +C    GN  CNGG +D AF+Y+K Q G++++A YPY  K++   F  T        FV 
Sbjct: 178 DCSTPEGNDGCNGGLMDQAFKYIKIQGGIDTEAYYPYEAKDDTCRFNITDSGATDTGFVD 237

Query: 238 DTWVTSGVDHMMHLLQS--GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIV 293
              + SG + M+    +  GPI V ++  H   + Y          AC+   LDH V +V
Sbjct: 238 ---IKSGDEEMLKEAAATVGPISVAIDASHTSFQFYSNGVYSET--ACSSTMLDHGVLVV 292

Query: 294 GYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           GYG +NG   W+V+NSWG+   + GY ++ R A N CGI + A
Sbjct: 293 GYGTENGKDYWLVKNSWGEGWGEAGYIKMSRNADNQCGIATQA 335


>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
          Length = 361

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 100/319 (31%), Positives = 152/319 (47%), Gaps = 49/319 (15%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTSGSSDRSPQEILQ 93
           +F  +  ++ + Y    E+K RF  F ++         K      G +  +D S +E  Q
Sbjct: 61  SFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNKFADWSWEE-FQ 119

Query: 94  RTGL------RLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
           R  L        T K   +L AD               LP++ DWR+S +  ++PV+ QG
Sbjct: 120 RHRLGAAQNCSATTKGNHKLTAD--------------VLPETKDWRESGI--VSPVKDQG 163

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-Q 204
            CGSCW F+TT  LE+           LS+ QLV+C     N  CNGG    AFEY+K  
Sbjct: 164 HCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYIKYN 223

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGV 259
            GL+++  YPY  K+ +   C +  E   V V D+  +T G +    H + L++  P+ V
Sbjct: 224 GGLDTEEAYPYTGKDGV---CKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR--PVSV 278

Query: 260 YLNHRLIES---YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPD 316
                +++    Y             P  ++HAV  VGYG ++G+  W+++NSWG+   D
Sbjct: 279 AF--EVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGYGVEDGVPYWLIKNSWGENWGD 336

Query: 317 HGYFQIERGANACGIESYA 335
           HGYF+I+ G N CGI + A
Sbjct: 337 HGYFKIKMGKNMCGIATCA 355


>gi|4826565|emb|CAB42884.1| cathepsin F [Mus musculus]
          Length = 462

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 154/320 (48%), Gaps = 31/320 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E + R   F ++     +          YG +  SD 
Sbjct: 158 SVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDL 217

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E        L  KE  R  +  + +            P   DWR  K   +  V++Q
Sbjct: 218 TEEEFHTIYLNPLLQKESGRKMSPAKSINDLA--------PPEWDWR--KKGAVTEVKNQ 267

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 268 GMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSNAYAAIKNLG 327

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y+        C +  + AKV++ D+   S  ++ +   L Q GPI V +N 
Sbjct: 328 GLETEDDYGYQGHVQT---CNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINA 384

Query: 264 RLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
             ++ Y     +P R     C+P  +DHAV +VGYG ++ I  W ++NSWG    + GY+
Sbjct: 385 FGMQFYRHGIAHPFRP---LCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGSDWGEEGYY 441

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 442 YLYRGSGACGVNTMASSAVV 461


>gi|11066228|gb|AAG28508.1|AF197480_1 cathepsin F [Mus musculus]
          Length = 462

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 154/320 (48%), Gaps = 31/320 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E + R   F ++     +          YG +  SD 
Sbjct: 158 SVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDL 217

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E        L  KE  R  +  + +            P   DWR  K   +  V++Q
Sbjct: 218 TEEEFHTIYLNPLLQKESGRKMSPAKSINDLA--------PPEWDWR--KKGAVTEVKNQ 267

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 268 GMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSNAYAAIKNLG 327

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y+        C +  + AKV++ D+   S  ++ +   L Q GPI V +N 
Sbjct: 328 GLETEDDYGYQGHVQT---CNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINA 384

Query: 264 RLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
             ++ Y     +P R     C+P  +DHAV +VGYG ++ I  W ++NSWG    + GY+
Sbjct: 385 FGMQFYRHGIAHPFRP---LCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGSDWGEEGYY 441

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 442 YLYRGSGACGVNTMASSAVV 461


>gi|258618831|gb|ACV84238.1| cysteine proteinase L [Anisakis simplex]
          Length = 411

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 101/325 (31%), Positives = 153/325 (47%), Gaps = 35/325 (10%)

Query: 27  YVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEY 77
           Y   D AY     +D F  ++  + R Y   +E + RF+ F          Q GK+  ++
Sbjct: 93  YPREDFAY-----IDQFIDFMNVYGRKYHGYHETRERFQNFVNNMKYIKKIQQGKQNVQF 147

Query: 78  YGTSGSSDRSPQEILQRTGLRLTGKEKE-RLEADRERVK-----KFLNERKKGPLPKSLD 131
            G +  +D S +E+   T     G+E    +  DRE        +F      G  P+S D
Sbjct: 148 -GITRFADWSEEEMKSMT----CGEEPNMEMRYDREYYDGSYEDEFTLYDGFGGRPESFD 202

Query: 132 WRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCN 191
           WR   V  +  ++ Q RCGSCWAF    ++ES  A+ K  L  LS+ QLV+CD  +  C+
Sbjct: 203 WRSKNV--VTDIKDQQRCGSCWAFGAVGVVESMNAIAKNPLVSLSEQQLVDCDMNDNGCD 260

Query: 192 GGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM-- 249
           GG    A +Y++  G+  +  YPY  KE  +  C       +V+V+        +  M  
Sbjct: 261 GGYRPYALQYIRHNGIVPEELYPYAGKELDS--CKLNTTVQRVYVKTVKYIRRNESAMAD 318

Query: 250 HLLQSGPIGVYLN-HRLIESYDGNPIRRNDWAC--NPHKLDHAVAIVGYGEKNGILTWIV 306
            +   GP+ V +N  + +  Y       +   C  NP    HA+A+VGYG +NG   WI+
Sbjct: 319 FVFYKGPLSVGINVTKDLFHYQSGVFTPSKEDCEQNPQGT-HALAVVGYGSQNGEDYWII 377

Query: 307 RNSWGDIGPDHGYFQIERGANACGI 331
           +NSWG      G+F  +RGAN+CGI
Sbjct: 378 KNSWGKRWGMDGFFLYKRGANSCGI 402


>gi|1706261|sp|Q10717.1|CYSP2_MAIZE RecName: Full=Cysteine proteinase 2; Flags: Precursor
 gi|644490|dbj|BAA08245.1| cysteine proteinase [Zea mays]
          Length = 360

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 95/318 (29%), Positives = 146/318 (45%), Gaps = 47/318 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQ--------DGKETDEYYGTSGSSDRSPQEI--- 91
           F  + V++ ++Y    E+  RF  F +        + K      G +  +D S +E    
Sbjct: 59  FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYRLGINRFADMSWEEFRAT 118

Query: 92  ----LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
                Q     LTG  + R  A                LP++ DWR+  +  ++PV++QG
Sbjct: 119 RLGAAQNCSATLTGNHRMRAAAV--------------ALPETKDWREDGI--VSPVKNQG 162

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-Q 204
            CGSCW F+TT  LE+           LS+ QLV+C     N  CNGG    AFEY+K  
Sbjct: 163 HCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYN 222

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMH----LLQSGPIG- 258
            GL+++  YPY+    I   C ++ E   V V D+  +T G +  +     L++   +  
Sbjct: 223 GGLDTEESYPYQGVNGI---CKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAF 279

Query: 259 -VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            V    RL   Y       +     P  ++HAV  VGYG ++G+  W+++NSWG    D 
Sbjct: 280 EVITGFRL---YKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDE 336

Query: 318 GYFQIERGANACGIESYA 335
           GYF++E G N CG+ + A
Sbjct: 337 GYFKMEMGKNMCGVATCA 354


>gi|9845246|ref|NP_063914.1| cathepsin F precursor [Mus musculus]
 gi|12643321|sp|Q9R013.1|CATF_MOUSE RecName: Full=Cathepsin F; Flags: Precursor
 gi|6467384|gb|AAF13147.1|AF136280_1 cathepsin F precursor [Mus musculus]
 gi|7141165|gb|AAF37228.1|AF217224_1 cathepsin F [Mus musculus]
 gi|26344728|dbj|BAC36013.1| unnamed protein product [Mus musculus]
 gi|37589148|gb|AAH58758.1| Cathepsin F [Mus musculus]
 gi|148701127|gb|EDL33074.1| cathepsin F, isoform CRA_b [Mus musculus]
          Length = 462

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 154/320 (48%), Gaps = 31/320 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E + R   F ++     +          YG +  SD 
Sbjct: 158 SVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDL 217

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E        L  KE  R  +  + +            P   DWR  K   +  V++Q
Sbjct: 218 TEEEFHTIYLNPLLQKESGRKMSPAKSINDLA--------PPEWDWR--KKGAVTEVKNQ 267

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 268 GMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSNAYAAIKNLG 327

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y+        C +  + AKV++ D+   S  ++ +   L Q GPI V +N 
Sbjct: 328 GLETEDDYGYQGHVQT---CNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINA 384

Query: 264 RLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
             ++ Y     +P R     C+P  +DHAV +VGYG ++ I  W ++NSWG    + GY+
Sbjct: 385 FGMQFYRHGIAHPFRP---LCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGSDWGEEGYY 441

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 442 YLYRGSGACGVNTMASSAVV 461


>gi|281427380|ref|NP_001163996.1| cathepsin L-like proteinase precursor [Tribolium castaneum]
 gi|281427798|ref|NP_001164001.1| cathepsin L-like proteinase precursor [Tribolium castaneum]
 gi|270001241|gb|EEZ97688.1| cathepsin L precursor [Tribolium castaneum]
 gi|270016928|gb|EFA13374.1| hypothetical protein TcasGA2_TC001950 [Tribolium castaneum]
          Length = 328

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 81/235 (34%), Positives = 128/235 (54%), Gaps = 22/235 (9%)

Query: 115 KKFLNERKKGPLPKS-------LDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVAL 167
           K  +NE+ + P  KS       +DWR    K +  V+ QG+CGSCW+F+TT  +E Q+A+
Sbjct: 97  KPKMNEKLRIPFVKSGKPAAAEVDWRS---KAVTEVKDQGQCGSCWSFSTTGAVEGQLAI 153

Query: 168 LKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRC 225
             K L  LS+  LV+C   +GN  CNGG +D AF+Y+   G+ S++ YPY   +     C
Sbjct: 154 SGKGLTSLSEQNLVDCSSQYGNAGCNGGWMDSAFDYIHDNGIMSESAYPYTAMDG---NC 210

Query: 226 TYEKEKAKVFVQDTW-VTSGVDHMMH--LLQSGPIGVYLNH-RLIESYDGNPIRRNDWAC 281
            ++  ++   +Q  + + SG +  +   +  +GP+ V L+    ++ Y G  +   D  C
Sbjct: 211 RFDASQSVTSLQGYYDIPSGDESALQDAVANNGPVAVALDATEELQLYSGGVLY--DTTC 268

Query: 282 NPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           +   L+H V +VGYG + G   WIV+NSWG    + GY++  R   N CGI + A
Sbjct: 269 SAQALNHGVLVVGYGSEGGQDYWIVKNSWGSGWGEQGYWRQARNRNNNCGIATAA 323


>gi|171854651|dbj|BAG16515.1| putative cysteine proteinase [Capsicum chinense]
          Length = 367

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 99/323 (30%), Positives = 154/323 (47%), Gaps = 52/323 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ + Y    E   R + FK + +    +        +G +  SD +P E  +R
Sbjct: 50  FSLFKSKFGKIYATQEEHDHRLKVFKANLRRARRHQLLDPTAEHGITKFSDLTPSE-FRR 108

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           T L L  K K +L   +  +           LP+  DWR+     +  V++QG CGSCW+
Sbjct: 109 TYLGLH-KPKPKLSTTKAPI------LPTSDLPEDFDWREKGA--VTGVKNQGSCGSCWS 159

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  +E    L    L  LS+ QLV+CDH          +  C GG +  AFEY +K 
Sbjct: 160 FSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEQKSECDAGCGGGLMTTAFEYTLKA 219

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GL+ + DYPY  +     +C ++K K    V +  V  G+D      +L++ GP+ V +
Sbjct: 220 GGLQREKDYPYTGRNG---QCHFDKSKIAASVTNYSVV-GLDEDQIAANLVKHGPLAVGI 275

Query: 262 NHRLIESYDGN---PIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWG 311
           N   +++Y G    P+      C  H+ DH V +VGYG              WI++NSWG
Sbjct: 276 NSAWMQTYIGGVSCPL-----VCFKHQ-DHGVLLVGYGSAGFAPIRLKAKPYWIIKNSWG 329

Query: 312 DIGPDHGYFQIERGA-NACGIES 333
           +   +HGY++I RG  N CG+++
Sbjct: 330 EHWGEHGYYKICRGQHNICGVDA 352


>gi|19195|emb|CAA78403.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
          Length = 361

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 152/320 (47%), Gaps = 47/320 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ + Y    E   R + FK +      +        +G +  SD +P E  +R
Sbjct: 45  FSLFKAKFGKIYASQEEHDHRLKVFKANLHRAKRHQLLDPSAEHGITQFSDLTPSE-FRR 103

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           T L L  K +  L A++  +    +      LP   DWR+     +  V++QG CGSCW+
Sbjct: 104 TYLGLN-KPRPNLNAEKAPILPTKD------LPSDFDWREKGA--VTDVKNQGSCGSCWS 154

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  +E    L    L  LS+ QLV+CDH          +  CNGG +  AFEY +K 
Sbjct: 155 FSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPVEKNDCDAGCNGGLMTTAFEYTLKA 214

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL 261
            GL+ + DYPY  +     +C ++K +    V +  V  G+D      +LL+ GP+ V +
Sbjct: 215 GGLQLEKDYPYTGRNG---KCHFDKSRIAASVSNFSVV-GLDEDQIAANLLKHGPLAVGI 270

Query: 262 NHRLIESYDGNPIRRNDWACNPHKL-DHAVAIVGYGEKNGILT-------WIVRNSWGDI 313
           N   +++Y    +R         K  DH V +VGYG +            WI++NSWG  
Sbjct: 271 NAAWMQTY----VRGVSCPLICFKRQDHGVLLVGYGSEGFAPIRLKNKPYWIIKNSWGKT 326

Query: 314 GPDHGYFQIERGANACGIES 333
             +HGY++I RG + CG+++
Sbjct: 327 WGEHGYYKICRGHHICGVDA 346


>gi|115533516|ref|NP_001041281.1| Protein R07E3.1, isoform b [Caenorhabditis elegans]
 gi|85539716|emb|CAJ58500.1| Protein R07E3.1, isoform b [Caenorhabditis elegans]
          Length = 348

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 92/275 (33%), Positives = 140/275 (50%), Gaps = 35/275 (12%)

Query: 78  YGTSGSSDRSPQE----ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWR 133
           YG +  SD + +E    +L ++  +   KE E +E   E +     E    P P   DWR
Sbjct: 81  YGHNDMSDWTDEEFEKTLLPKSFYKRLHKEAEFIEPIPESLTAKKGE-SSSPFPDFFDWR 139

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
              V  + PV++QG+CGSCWAFA+TA +E+  A+       LS+  L++CD  +  C+GG
Sbjct: 140 DKNV--ITPVKAQGQCGSCWAFASTATVEAAWAIAHGEKRNLSEQTLLDCDLVDNACDGG 197

Query: 194 NIDVAFEYVKQYGLESQADYPY-RNKENITFRCTYEKEKAKVFVQDTWVTSGVD--HMMH 250
           + D AF Y+ + GL +  D PY  +++N    C          V D W T+ +   + +H
Sbjct: 198 DEDKAFRYIHRNGLANAVDLPYVAHRQN---GCA---------VNDHWNTTRIKAAYFLH 245

Query: 251 ---------LLQSGPIGVYLN-HRLIESYDGNPIRRNDWACNPHKLD-HAVAIVGYG-EK 298
                    L+  GP+ + +   + + +Y G     +++AC    +  HA+ I GYG  K
Sbjct: 246 HDEDSIINWLVNFGPVNIGMAVIQPMRAYKGGVFTPSEYACKNEVIGLHALLITGYGTSK 305

Query: 299 NGILTWIVRNSWGDI-GPDHGYFQIERGANACGIE 332
            G   WIV+NSWG+  G +HGY    RG NACGIE
Sbjct: 306 TGEKYWIVKNSWGNTWGVEHGYIYFARGINACGIE 340


>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 100/309 (32%), Positives = 150/309 (48%), Gaps = 26/309 (8%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K +  F++++VK ++ Y   +E   RFE F  + K  DE        + G +  +D + +
Sbjct: 44  KVIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHE 103

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E   +      G + E  E   E  K+F   R    LPKS+DWR  K   + PV++QG+C
Sbjct: 104 EFKHK----FLGFKGELAERKDESSKEF-GYRDFVDLPKSVDWR--KKGAVAPVKNQGQC 156

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQYGLE 208
           G+CWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AF YV + GL 
Sbjct: 157 GNCWAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSGLH 216

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYL--NHRLI 266
            + +YPY   E          EK  +        +     +  L + PI V +  + R  
Sbjct: 217 KEEEYPYIMSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDF 276

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
           + Y G      D  C   +LDH VA VGYG   G+   IVRNSWG    + GY +++RG+
Sbjct: 277 QFYSGGVF---DGHCGT-ELDHGVAAVGYGTTKGLDYVIVRNSWGPKWGEKGYIRMKRGS 332

Query: 327 ----NACGI 331
                 CG+
Sbjct: 333 GKPHGMCGL 341


>gi|148701126|gb|EDL33073.1| cathepsin F, isoform CRA_a [Mus musculus]
          Length = 417

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 154/320 (48%), Gaps = 31/320 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E + R   F ++     +          YG +  SD 
Sbjct: 113 SVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDL 172

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E        L  KE  R  +  + +            P   DWR  K   +  V++Q
Sbjct: 173 TEEEFHTIYLNPLLQKESGRKMSPAKSINDLA--------PPEWDWR--KKGAVTEVKNQ 222

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 223 GMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSNAYAAIKNLG 282

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y+        C +  + AKV++ D+   S  ++ +   L Q GPI V +N 
Sbjct: 283 GLETEDDYGYQGHVQT---CNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINA 339

Query: 264 RLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
             ++ Y     +P R     C+P  +DHAV +VGYG ++ I  W ++NSWG    + GY+
Sbjct: 340 FGMQFYRHGIAHPFRP---LCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGSDWGEEGYY 396

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 397 YLYRGSGACGVNTMASSAVV 416


>gi|149392541|gb|ABR26073.1| oryzain gamma chain precursor [Oryza sativa Indica Group]
          Length = 367

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 94/309 (30%), Positives = 147/309 (47%), Gaps = 30/309 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-RLTG 101
           F  + V+  + Y D  E++ RF  F +  +          S++R  + +  R G+ R   
Sbjct: 67  FARFAVRHGKRYGDAAEVQRRFRIFSESLELVR-------STNR--RGLPYRLGINRFAD 117

Query: 102 KEKERLEADRERVKKFLNE--------RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
              E  +A R    +  +         R    LP++ DWR+  +  ++PV+ QG CGSCW
Sbjct: 118 MSWEEFQASRLGAAQNCSATLAGNHRMRDAAALPETKDWREDGI--VSPVKDQGHCGSCW 175

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQ 210
            F+TT  LE+           LS+ QLV+C   + N  C+GG    AFEY+K   GL+++
Sbjct: 176 TFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGGLDTE 235

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQSGPIGVYLNHRLIES- 268
             YPY     I   C Y+ E   V V D+  +T G +  +         V +  ++I   
Sbjct: 236 EAYPYTGVNGI---CHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVAFQVINGF 292

Query: 269 --YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
             Y       +    +P  ++HAV  VGYG +NG+  W+++NSWG    D+GYF++E G 
Sbjct: 293 RMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGK 352

Query: 327 NACGIESYA 335
           N CGI + A
Sbjct: 353 NMCGIATCA 361


>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
          Length = 461

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 96/308 (31%), Positives = 154/308 (50%), Gaps = 32/308 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           +++++VK  ++Y    E + RF+ FK + +  DE+   S    R+ +  L R       +
Sbjct: 46  YESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAES----RTYKVGLNRFADLTNDE 101

Query: 103 EKERLEADRERVKKFLNERKKG---------PLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
            +      R   ++ L+ +K+           LP S+DWR+    V   V+ QG CGSCW
Sbjct: 102 YRSMYLGARTGSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVV--GVKDQGSCGSCW 159

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQA 211
           AF+T A +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++ K  G++++ 
Sbjct: 160 AFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTEE 219

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHRLI 266
           DYPY  ++    RC   ++ AKV   D +    V++   L   + + P+ V +  +    
Sbjct: 220 DYPYNARDG---RCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSVAIEASGMAF 276

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
           + Y+      N   C    LDH V  VGYG +N +  WIV+NSWG    + GY ++ER  
Sbjct: 277 QFYESGVFTGN---CGT-ALDHGVTAVGYGTENSVDYWIVKNSWGSSWGESGYIRMERNT 332

Query: 327 NA---CGI 331
            A   CGI
Sbjct: 333 GATGKCGI 340


>gi|302794759|ref|XP_002979143.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
 gi|300152911|gb|EFJ19551.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
          Length = 227

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 75/215 (34%), Positives = 118/215 (54%), Gaps = 10/215 (4%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPKS DWR+     + PV++QG CGSCW F++T  +E    L  + L  L + QLV+CD 
Sbjct: 9   LPKSFDWREHGA--MTPVKNQGSCGSCWTFSSTGAVEGAHFLKSRELISLREEQLVDCDR 66

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYR--NKENITF---RCTYEKEKAKVFVQD-T 239
            +  C GG++  A+EY+K  GLE++ DYPY+  N +   F   RC +   K    + + +
Sbjct: 67  MDGGCKGGDMLNAYEYIKAKGLEAEEDYPYQEENYKEYMFPHHRCHFRPSKVAATIANYS 126

Query: 240 WVTSGVDHM-MHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
            V+   D +  +L+++GP+ + LN   I  Y G  +           ++HAV +VGYG  
Sbjct: 127 TVSEDEDQIAANLVKNGPLSIALNANYIMDYMGG-VACPRICPGGDNMNHAVLLVGYGMD 185

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGANACGIES 333
                WI++NSW +   + GYF++ RG   CG+ +
Sbjct: 186 GDKPYWILKNSWSENYGEDGYFRLCRGFGVCGMNT 220


>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
 gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
           mansoni]
          Length = 370

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 99/321 (30%), Positives = 157/321 (48%), Gaps = 37/321 (11%)

Query: 40  VDAFKTYIVKWNRTYTDDNEIKTRFEYFK-------------QDGKETDEYYGTSGSSDR 86
           + A+K + +++ R Y   +E   RF  F              Q+GK T +  G +  +D+
Sbjct: 57  IAAWKFFKIQFKRAYNGIHEETRRFFIFSANFVKMMEHNHAFQEGKVTYKM-GVNEFTDK 115

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           +  E+ +  G ++T        A R +   F+       LP  +DWR+     +  V++Q
Sbjct: 116 TDYELKKLRGYKVTSG------AIRHKGSTFIRSEHT-KLPSKVDWRREGA--VTDVKNQ 166

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK- 203
           G+CGSCWAF+TT  +E Q       L  LS+ QLV+C   +GN  C+GG ++ AFEYV+ 
Sbjct: 167 GQCGSCWAFSTTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYVRD 226

Query: 204 QYGLESQADYPYRNKENI-TFRCTYEKEKAKVFVQDTW---VTSGVDH--MMHLLQSGPI 257
             G++S+  YPY + +     RC +    + +  Q T    +  G +   M  +   GP+
Sbjct: 227 NEGIDSEISYPYVSGDGTENNRCLFNA--SNILAQVTGYVNIHEGDERALMDAVATKGPV 284

Query: 258 GVYLNHRL--IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
            V +N  L     Y        D       LDH V +VGYGE+NG   W+++NSWG+   
Sbjct: 285 SVAINAGLPSFSMYKSGIYSDTDCEGTLDALDHGVLVVGYGEENGRSYWLIKNSWGEEWG 344

Query: 316 DHGYFQIERGA-NACGIESYA 335
           + GY +I +G+ N CG+ S A
Sbjct: 345 EKGYIKISKGSHNMCGVASAA 365


>gi|115533514|ref|NP_001041280.1| Protein R07E3.1, isoform a [Caenorhabditis elegans]
 gi|3878958|emb|CAA89070.1| Protein R07E3.1, isoform a [Caenorhabditis elegans]
          Length = 402

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 92/275 (33%), Positives = 140/275 (50%), Gaps = 35/275 (12%)

Query: 78  YGTSGSSDRSPQE----ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWR 133
           YG +  SD + +E    +L ++  +   KE E +E   E +     E    P P   DWR
Sbjct: 135 YGHNDMSDWTDEEFEKTLLPKSFYKRLHKEAEFIEPIPESLTAKKGE-SSSPFPDFFDWR 193

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGG 193
              V  + PV++QG+CGSCWAFA+TA +E+  A+       LS+  L++CD  +  C+GG
Sbjct: 194 DKNV--ITPVKAQGQCGSCWAFASTATVEAAWAIAHGEKRNLSEQTLLDCDLVDNACDGG 251

Query: 194 NIDVAFEYVKQYGLESQADYPY-RNKENITFRCTYEKEKAKVFVQDTWVTSGVD--HMMH 250
           + D AF Y+ + GL +  D PY  +++N    C          V D W T+ +   + +H
Sbjct: 252 DEDKAFRYIHRNGLANAVDLPYVAHRQN---GCA---------VNDHWNTTRIKAAYFLH 299

Query: 251 ---------LLQSGPIGVYLN-HRLIESYDGNPIRRNDWACNPHKLD-HAVAIVGYG-EK 298
                    L+  GP+ + +   + + +Y G     +++AC    +  HA+ I GYG  K
Sbjct: 300 HDEDSIINWLVNFGPVNIGMAVIQPMRAYKGGVFTPSEYACKNEVIGLHALLITGYGTSK 359

Query: 299 NGILTWIVRNSWGDI-GPDHGYFQIERGANACGIE 332
            G   WIV+NSWG+  G +HGY    RG NACGIE
Sbjct: 360 TGEKYWIVKNSWGNTWGVEHGYIYFARGINACGIE 394


>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
          Length = 333

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 96/257 (37%), Positives = 130/257 (50%), Gaps = 33/257 (12%)

Query: 108 EADRERVKKFLNER-KKG-----PL----PKSLDWRQSKVKVLNPVESQGRCGSCWAFAT 157
           E  R+ +  F N++ KKG     PL    PKS+DWR+     + PV++QG+CGSCWAF+ 
Sbjct: 86  EEFRQVMNGFQNQKHKKGKMFRDPLLLQYPKSVDWREKGY--VTPVKNQGQCGSCWAFSA 143

Query: 158 TAILESQVALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVK-QYGLESQADYP 214
           T  LE Q+      L  LS+  LV+C H  GN  CNGG +D AF+YVK   GL+S+  YP
Sbjct: 144 TGALEGQMFQKTGKLISLSEQNLVDCSHPQGNQGCNGGLMDYAFQYVKDNSGLDSEESYP 203

Query: 215 YRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS----GPIGVYLN--HRLIES 268
           Y   E +   C Y+ E +     DT       H   LL++    GPI   ++  H   + 
Sbjct: 204 Y---EGMDGTCKYKPECS--VANDTGFVDIPGHEKALLRAVATVGPISAAIDAGHMSFQF 258

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYG----EKNGILTWIVRNSWGDIGPDHGYFQIER 324
           Y        D  C+   LDH + +VGYG      N    W+V+NSWG    D GY +I R
Sbjct: 259 YKSGIYYDPD--CSSKDLDHGILVVGYGFEGTNSNATKYWLVKNSWGTTWGDEGYVKIIR 316

Query: 325 GA-NACGIESYAYLASV 340
              N CGI + A   +V
Sbjct: 317 DKDNHCGIATAASYPTV 333


>gi|340380715|ref|XP_003388867.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
          Length = 347

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 99/318 (31%), Positives = 151/318 (47%), Gaps = 27/318 (8%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQR 94
           +S+++   F+ + +K  +TY    E   R   +       + YY    +    P    + 
Sbjct: 35  ESVQRAAEFERWTIKHKKTYATAEEYNWRLRVYT-----ANHYYVKRLNEGHGPATEFEL 89

Query: 95  TGLR-LTGKEKERL------EADRERVKKFLNERKKGPL--PKSLDWRQSKVKVLNPVES 145
                LT  E +R+      +  R     F    KK  +  P ++DWR  K  V+ PV  
Sbjct: 90  NQFADLTFAEFKRIYLSSSSQHCRATTGNFQMPVKKNNVEDPVAIDWR--KRNVITPVRD 147

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK 203
           QG CGSCWAF+ T+ L + +AL    L  LSK QL++C     N  C GG    AFEY++
Sbjct: 148 QGSCGSCWAFSATSCLSAHLALKTGQLISLSKQQLLDCSRSFNNRGCKGGLPSQAFEYIR 207

Query: 204 -QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGV--DHMMHLLQSGPIGV 259
              G+ES+ DYPY+++E    +C ++       V      T G   D  + L   GP+ +
Sbjct: 208 YNGGIESERDYPYKDREE---KCHFKPSLVAATVTGVVNFTQGAEDDIAVALANIGPVSI 264

Query: 260 YLNH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE-KNGILTWIVRNSWGDIGPDH 317
            ++  +   +Y     +    + NP K++HAV IVGY +  +G   WI +NSWG     +
Sbjct: 265 GIHSTKSFATYKKGIYQGKLCSKNPRKINHAVLIVGYDQTASGEKYWIGKNSWGTNWGMN 324

Query: 318 GYFQIERGANACGIESYA 335
           GYF I RG NACG+ + A
Sbjct: 325 GYFWIRRGHNACGLATCA 342


>gi|224082940|ref|XP_002306900.1| predicted protein [Populus trichocarpa]
 gi|118481986|gb|ABK92924.1| unknown [Populus trichocarpa]
 gi|222856349|gb|EEE93896.1| predicted protein [Populus trichocarpa]
          Length = 367

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 96/318 (30%), Positives = 151/318 (47%), Gaps = 41/318 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F T+  K+ + Y    E   RF  FK +     ++        +G +  SD +P+E  ++
Sbjct: 51  FTTFKSKFGKNYATQEEHDYRFSVFKANLLRAKKHQIMDPTAAHGVTKFSDLTPKEFRRQ 110

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                  + +   +A++  +         G LP   DWR      +  V+ QG CGSCW+
Sbjct: 111 LLGLKR-RLRLPTDANKAPI------LPTGDLPTDFDWRDHGA--VTSVKDQGSCGSCWS 161

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+ T  LE    L    L  LS+ QLV+CDH          +  C+GG ++ AFEY +K 
Sbjct: 162 FSATGALEGAHYLATGELVSLSEQQLVDCDHECDPEEYGACDSGCSGGLMNNAFEYALKA 221

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            GLE + DYPY    N    C +EK K    V +  V S  +  +  +L++ GP+ V +N
Sbjct: 222 GGLEREKDYPYTG--NDRGACKFEKSKVAASVSNFSVVSLDEDQIAANLVKHGPLSVAIN 279

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y G       + C+ H+ DH V +VGYG              WI++NSWG+   
Sbjct: 280 AVFMQTYIGG--VSCPYICSKHQ-DHGVLLVGYGAAGYAPIRFKEKPFWIIKNSWGENWG 336

Query: 316 DHGYFQIERGANACGIES 333
           ++GY++I R  N CG++S
Sbjct: 337 ENGYYKICRARNICGVDS 354


>gi|111073719|dbj|BAF02548.1| triticain gamma [Triticum aestivum]
          Length = 365

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 90/311 (28%), Positives = 148/311 (47%), Gaps = 34/311 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-RLTG 101
           F  + V++ ++Y    E++ RF  F +  +E         S++R  + +  R G+ R + 
Sbjct: 64  FARFAVRYGKSYESAAEVRRRFRIFSESLEEVR-------STNR--KGLSYRLGINRFSD 114

Query: 102 KEKERLEADRERVKKFLNE--------RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
              E  +A R    +  +         R    LP++ DWR+  +  ++PV+ Q  CGSCW
Sbjct: 115 MSWEEFQATRLGAAQTCSATLAGNHLMRDAAALPETKDWREDGI--VSPVKDQSHCGSCW 172

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLESQ 210
            F+TT  LE+           LS+ QLV+C  G  N  C+GG    AFEY+K   G++++
Sbjct: 173 TFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCSGGLPSQAFEYIKYNGGIDTE 232

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW-----VTSGVDHMMHLLQSGPIGVYLNH-R 264
             YPY+    +   C Y+ E A V V D+          + + + L++  P+ V      
Sbjct: 233 ESYPYKGVNGV---CHYKAENAVVQVLDSVNITLNAEDELKNAVGLVR--PVSVAFEVIN 287

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
               Y       +     P  ++HAV  VGYG +NG+  W+++NSWG    D+GYF++E 
Sbjct: 288 GFRQYKSGVYSSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEM 347

Query: 325 GANACGIESYA 335
           G N C + + A
Sbjct: 348 GKNMCAVATCA 358


>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
          Length = 330

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 92/239 (38%), Positives = 127/239 (53%), Gaps = 18/239 (7%)

Query: 107 LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVA 166
            +  + ++ K   E   G +PK++DWR  K   + PV++QG CGSCWAF+    LE QV 
Sbjct: 95  FQGQKTKMMKVFPEPFLGDVPKTVDWR--KHGYVTPVKNQGPCGSCWAFSAVGSLEGQVF 152

Query: 167 LLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITF 223
                L PLS+  LV+C   HGN  C+GG  D AF+YVK   GL++   YPY   E +  
Sbjct: 153 RKTGKLVPLSEQNLVDCSWSHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPY---EALNG 209

Query: 224 RCTYEKE--KAKVFVQDTWVTSGVDHMMHLLQSGPI--GVYLNHRLIESYDGNPIRRNDW 279
            C Y  +   AKV    +   S    M  +   GPI  G+ + H+  + Y G      D 
Sbjct: 210 TCRYNPKYSAAKVVGFMSIPPSENALMKAVATVGPISVGIDIKHKSFQFYKGGMYYEPD- 268

Query: 280 ACNPHKLDHAVAIVGYGEK-NGILTWIVRNSWG-DIGPDHGYFQIERG-ANACGIESYA 335
            C+   L+HAV +VGYGE+ +G   W+V+NSWG D G D GY ++ +   N CGI S A
Sbjct: 269 -CSSTNLNHAVLVVGYGEESDGRKYWLVKNSWGRDWGMD-GYIKMAKDWNNNCGIASDA 325


>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 456

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 101/333 (30%), Positives = 162/333 (48%), Gaps = 42/333 (12%)

Query: 23  DSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSG 82
           D +I  + + + + ++++  +  ++ +  RTY    E + RFE F+ + +  D++   + 
Sbjct: 24  DMSIVSYGERSEEEVRRM--YVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAAD 81

Query: 83  SSDRSPQEILQR-------------TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKS 129
           +   S +  L R              G+R     + RL     R +   NE     LP+S
Sbjct: 82  AGLHSFRLGLNRFADLTNEEYRDTYLGVRTKPVRERRLSG---RYQAADNEE----LPES 134

Query: 130 LDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NL 188
           +DWR+     +  V+ QG CGSCWAF+  A +E    ++   +  LS+ +LV+CD   N 
Sbjct: 135 VDWREKGA--VAKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQ 192

Query: 189 NCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
            CNGG +D AFE++    G++S+ DYPY+ ++N   RC   K+ AKV   D +    V+ 
Sbjct: 193 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERDN---RCDANKKNAKVVTIDGYEDVPVNS 249

Query: 248 MMHL---LQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGIL 302
            + L   + + PI V +    R  + Y           C    LDH V  VGYG +NG  
Sbjct: 250 ELSLKKAVANQPISVAIEAGGRAFQLYKSGIFTGR---CGT-ALDHGVTAVGYGSENGKD 305

Query: 303 TWIVRNSWGDIGPDHGYFQIERGANA----CGI 331
            WIV+NSWG +  + GY ++ER   A    CGI
Sbjct: 306 YWIVKNSWGTVWGEDGYVRLERNIKATSGKCGI 338


>gi|211909240|gb|ACJ12893.1| cathepsin L1D [Fasciola hepatica]
          Length = 326

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 87/224 (38%), Positives = 123/224 (54%), Gaps = 18/224 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 108 VPDKIDWRESGY--VTGVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 165

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKE--KAKVFVQDTWV 241
             GN  C GG ++ A+EY+KQ+GLE+++ YPYR  E    +C Y ++   AKV    T  
Sbjct: 166 PWGNNGCGGGLMENAYEYLKQFGLETESSYPYRAVEG---QCRYNRQLGVAKVTGYYTLH 222

Query: 242 TSGVDHMMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVAIVGYGE 297
           +     +  L+ S GP  V ++   +ES D    R   +    C+P  L+HAV  VGYG 
Sbjct: 223 SGNEAGLKSLVGSEGPAAVAVD---VES-DFMMYRSGIYQSQTCSPLGLNHAVLAVGYGT 278

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 279 QGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIASLASLPMV 322


>gi|228244|prf||1801240B Cys protease 2
          Length = 323

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 93/300 (31%), Positives = 147/300 (49%), Gaps = 20/300 (6%)

Query: 49  KWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLE 108
           K+ R Y D  E   R   F+Q+ K  +E+     + + +    + + G     +    ++
Sbjct: 26  KYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMK 85

Query: 109 ADRER----VKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQ 164
            +  R    V  F  +++ GP    +DWR      + PV+ QG+CGSCWAF+TT  LE Q
Sbjct: 86  GNIPRRSAPVSVFYPKKETGPQATEVDWRTKGA--VTPVKDQGQCGSCWAFSTTGSLEGQ 143

Query: 165 VALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENI 221
             L   +L  L++ QLV+C   +G   CNGG ++ AF+Y+K   G++++A YPY  ++  
Sbjct: 144 HFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEASYPYEARDG- 202

Query: 222 TFRCTYEKEK-AKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRR 276
              C ++    A      T + SG +  +   +   GPI V ++  H   + Y       
Sbjct: 203 --SCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYE 260

Query: 277 NDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
              +C+P  LDHAV  VGYG + G   W+V+NSW     D GY ++ R   N CGI + A
Sbjct: 261 P--SCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVA 318


>gi|297824991|ref|XP_002880378.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326217|gb|EFH56637.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 360

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 96/320 (30%), Positives = 157/320 (49%), Gaps = 41/320 (12%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEIL 92
           D F  +  K+ + Y    E   RF  FK + +    +        +G +  SD +  E  
Sbjct: 45  DHFTLFKKKFGKDYGSIEEHYYRFSVFKANLRRAMRHQKMDPSARHGVTQFSDLTGSE-F 103

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           +R  L +TG  K   +A++  +    N      LP+  DWR      + PV++QG CGSC
Sbjct: 104 RRKHLGVTGGFKLPKDANQAPILPTHN------LPEEFDWRDRGA--VTPVKNQGSCGSC 155

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-V 202
           W+F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +
Sbjct: 156 WSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTL 215

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVY 260
           K  GL  + DYPY   +  +  C  ++ K    V +  V S  +  +  +L+++GP+ V 
Sbjct: 216 KTGGLMREEDYPYTGTDGGS--CKLDRSKIVASVSNFSVVSINEDQIAANLVKNGPLAVA 273

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDI 313
           +N   +++Y G       + C+  +L+H V ++GYG              WI++NSWG+ 
Sbjct: 274 INAAYMQTYIGGV--SCPYICS-RRLNHGVLLMGYGSSGYSQARLKEKPYWIIKNSWGES 330

Query: 314 GPDHGYFQIERGANACGIES 333
             ++G+++I +G N CG++S
Sbjct: 331 WGENGFYKICKGRNICGVDS 350


>gi|211909242|gb|ACJ12894.1| cathepsin L1D [Fasciola hepatica]
          Length = 326

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 87/224 (38%), Positives = 123/224 (54%), Gaps = 18/224 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 108 VPDKIDWRESGY--VTGVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 165

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKE--KAKVFVQDTWV 241
             GN  C GG ++ A+EY+KQ+GLE+++ YPYR  E    +C Y ++   AKV    T  
Sbjct: 166 PWGNNGCGGGLMENAYEYLKQFGLETESSYPYRAVEG---QCRYNRQLGVAKVTGYYTLH 222

Query: 242 TSGVDHMMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVAIVGYGE 297
           +     +  L+ S GP  V ++   +ES D    R   +    C+P  L+HAV  VGYG 
Sbjct: 223 SGNEAGLKSLVGSEGPAAVAVD---VES-DFMMYRSGIYQSQTCSPLGLNHAVLAVGYGT 278

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 279 QGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIASLASLPMV 322


>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 95/311 (30%), Positives = 153/311 (49%), Gaps = 30/311 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRT-----GL 97
           ++ ++    + Y    E + RFE FK + +  DE+   +GS           T      +
Sbjct: 47  YEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHNAVAGSYRVGLNRFADLTNEEYRSM 106

Query: 98  RLTG--KEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
            L G  + KER  + +     F   R    LP S+DWR+     ++PV+ QG+CGSCWAF
Sbjct: 107 FLGGNMEMKERSASTKSDRYAF---RAGDKLPGSVDWREKGA--VSPVKDQGQCGSCWAF 161

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQADY 213
           +T + +E    ++   L  LS+ +LV+CD   N+ CNGG +D  F+++    G++++ DY
Sbjct: 162 STISAVEGINQIVTGELISLSEQELVDCDKSYNMGCNGGLMDYGFQFIINNGGIDTEEDY 221

Query: 214 PYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD---HMMHLLQSGPIGVYL--NHRLIES 268
           PYR  +     C   ++ A+V   + +     D    +   + + P+ V +    R  + 
Sbjct: 222 PYRAVDGT---CDQFRKNARVVSINGYEDVPEDDENSLKKAVANQPVSVAIEAGGRAFQL 278

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y+      +   C  + LDH V  VGYG +NG+  W VRNSWG    ++GY ++ER  NA
Sbjct: 279 YESGVFTGH---CGTN-LDHGVVAVGYGTENGVDYWTVRNSWGPKWGENGYIKLERNINA 334

Query: 329 ----CGIESYA 335
               CGI S A
Sbjct: 335 TSGKCGIASMA 345


>gi|21263041|gb|AAM44832.1|AF510856_1 cathepsin L2 [Fasciola gigantica]
          Length = 326

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 84/225 (37%), Positives = 122/225 (54%), Gaps = 20/225 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 108 VPDKIDWRESGY--VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 165

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN+ C GG ++ A+EY+KQ+GLE+++ YPY   E    +C Y ++     V D + V 
Sbjct: 166 PWGNMGCMGGLMENAYEYLKQFGLETESSYPYTAVEG---QCRYNRQLGVAKVTDYYTVH 222

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIES----YDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           SG +  +  L    GP  V ++   +ES    Y G   +     C+   ++HAV  VGYG
Sbjct: 223 SGSEVELKNLVGAEGPAAVAVD---VESDFMMYSGGIYQSR--TCSSLHVNHAVLAVGYG 277

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
            + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 278 TQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIASLASLPMV 322


>gi|255550445|ref|XP_002516273.1| cysteine protease, putative [Ricinus communis]
 gi|223544759|gb|EEF46275.1| cysteine protease, putative [Ricinus communis]
          Length = 358

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 91/303 (30%), Positives = 145/303 (47%), Gaps = 17/303 (5%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTG 101
           +F  ++ +  + Y  ++E+K RF  F ++    D    T+         +     L    
Sbjct: 58  SFSRFVYRHGKRYQSEDEMKMRFAIFSEN---LDFIRSTNRKGLSYTLAVNDFADLTWQE 114

Query: 102 KEKERLEADRE-RVKKFLNERKKG-PLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
            +K RL A +        N +  G  LP + DWR+  V +++PV++QG CGSCW F+TT 
Sbjct: 115 FQKHRLGAAQNCSATTKGNHKLTGVALPDTKDWRE--VGIVSPVKNQGHCGSCWTFSTTG 172

Query: 160 ILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLESQADYPYR 216
            LE+           LS+ QLV+C     N  C+GG    AFEY+K   GLE++  YPY 
Sbjct: 173 ALEAAYHQAFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYIKYNGGLETEEAYPYT 232

Query: 217 NKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQSGPIGVYLNHRLIES---YDGN 272
            ++     C +  E   + V D+  +T G +  +         V +   ++     Y   
Sbjct: 233 GEDGA---CKFSSENVGIQVLDSVNITLGAEDELKEAVGLVRPVSVAFEVVSGFRFYKSG 289

Query: 273 PIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIE 332
               +     P  ++HAV  VGYG ++G+  W+V+NSWG+   DHGYF++E G N CG+ 
Sbjct: 290 VYTSDTCGSTPMDVNHAVLAVGYGVEDGVPYWLVKNSWGENWGDHGYFKMEMGKNMCGVA 349

Query: 333 SYA 335
           + A
Sbjct: 350 TCA 352


>gi|242045644|ref|XP_002460693.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
 gi|241924070|gb|EER97214.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
          Length = 373

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 98/331 (29%), Positives = 150/331 (45%), Gaps = 43/331 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK--------QDGKETDEYYGTSGSSDRSPQEILQR 94
           F  ++ +  R Y+   E   R   F             +    +G +  SD + +E   R
Sbjct: 49  FAAFVRRHGRRYSGPEEYARRLRVFAANLARAAAHQALDPTARHGVTPFSDLTREEFEAR 108

Query: 95  -TGLRL-TGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
            TG+R   G + +RL           ++ +   LP S DWR      +  V+ QG CGSC
Sbjct: 109 LTGVRAGAGGDVQRLVMSGAPAAPPASQEEVSRLPASFDWRDKGA--VTGVKMQGACGSC 166

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHG---------NLNCNGGNIDVAFEYV- 202
           WAF+TT  +E    L    L  LS+ QLV+CDH          N  C GG +  A+ Y+ 
Sbjct: 167 WAFSTTGAVEGANFLATGKLLELSEQQLVDCDHTCSAVAQNECNNGCAGGLMTNAYAYLM 226

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGV 259
           K  GL  Q  YPY         C ++  KA V V + T V +G +  +   L++ GP+ V
Sbjct: 227 KSGGLMEQRAYPYTGAPGP---CRFDPAKAAVRVANFTAVPAGDEAQIRAALVRRGPLAV 283

Query: 260 YLNHRLIESYDGN---PIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNS 309
            LN   +++Y G    P+      C    ++H V +VGYG +            WI++NS
Sbjct: 284 GLNAAFMQTYVGGVSCPL-----LCPRAWVNHGVLLVGYGARGFAALRLGYRPYWIIKNS 338

Query: 310 WGDIGPDHGYFQIERGANACGIESYAYLASV 340
           WG+   + GY+++ RG+N CG++S     +V
Sbjct: 339 WGERWGEQGYYRLCRGSNVCGVDSMVSAVAV 369


>gi|114796866|gb|ABI79445.1| cysteine proteinase 5 [Entamoeba histolytica]
          Length = 289

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 81/219 (36%), Positives = 119/219 (54%), Gaps = 15/219 (6%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYP-----LSK 177
           +G +P+S+DWR +K KV   +  Q  CGSC++FA+ A +E ++ +     +      LS+
Sbjct: 76  RGDVPESVDWR-AKGKV-PAIRDQASCGSCYSFASVAAIEGRLLVAGSKKFTVDDLDLSE 133

Query: 178 SQLVECDH--GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVF 235
            QLV+C    GN  CNGG++ ++F YVK  G+  + DYPY   E     CTY+K+K  V 
Sbjct: 134 QQLVDCSVSVGNKGCNGGSLLLSFRYVKLNGIMQEKDYPYVAAEET---CTYDKKKVAVK 190

Query: 236 VQ-DTWVTSGVDH-MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIV 293
           +     V  G +  +M     GP+G  ++   ++         N   C+  +L+H VA+V
Sbjct: 191 ITGQKLVRPGSEKALMRAAAEGPVGAAIDASGVKFQLYKSGIYNSKECSSTQLNHGVAVV 250

Query: 294 GYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGI 331
           GYG +NG   WIVRNSWG I  D GY  + R   N CGI
Sbjct: 251 GYGTQNGTEYWIVRNSWGTIWGDQGYVLMSRNKNNQCGI 289


>gi|7271895|gb|AAF44678.1|AF239267_1 cathepsin L, partial [Fasciola gigantica]
          Length = 219

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 84/225 (37%), Positives = 123/225 (54%), Gaps = 20/225 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 1   VPDKIDWRESGY--VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 58

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN  C GG ++ A+EY+KQ+GLE+++ YPY   E+   +C Y ++     V D + V 
Sbjct: 59  PWGNYGCMGGLMENAYEYLKQFGLETESSYPYTAVED---QCRYNRQLGVAKVTDYYTVH 115

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIES----YDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           SG +  +  L    GP  V ++   +ES    Y G   +     C+  +++HAV  VGYG
Sbjct: 116 SGSEVELKNLVGAEGPAAVAVD---VESDFMMYSGGIYQSR--TCSSLRVNHAVLAVGYG 170

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
            + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 171 TQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIASLASLPMV 215


>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 359

 Score =  130 bits (328), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 99/308 (32%), Positives = 147/308 (47%), Gaps = 27/308 (8%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTG 101
           AF  +  ++ ++Y    E+K RF  F    K    +     S      E    T      
Sbjct: 59  AFARFAHRYGKSYETAEEMKRRFSIFVDSLKMIRSHNKKGLSYTLGVNEFADLTWEEF-- 116

Query: 102 KEKERLEADRERVKKFLNERK--KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
             K RL A +          K   G LP   DWR+  V ++ PV++QG CGSCW F+TT 
Sbjct: 117 -RKHRLGAAQNCSATLKGNHKLTNGLLPLKKDWRE--VGIVTPVKNQGHCGSCWTFSTTG 173

Query: 160 ILESQ-VALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQYG-LESQADYPY 215
            LE+  V    K ++ LS+ QLV+C   + N  CNGG    AFEY+K  G L+++  YPY
Sbjct: 174 ALEAAYVQAFGKAIF-LSEQQLVDCARAYNNFGCNGGLPSQAFEYIKANGGLDTEEAYPY 232

Query: 216 RNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMH--LLQSGPIGVYL----NHRLIES 268
              + +   C +  E   V V D+  +T G +  +   +    P+ V        RL +S
Sbjct: 233 TGVDGV---CKFSSENIGVQVLDSVNITLGAEDELKDAVAFVRPVSVAFEVVSGFRLYKS 289

Query: 269 YDGNPIRRNDWACN-PHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
                +  +D   N P  ++HAV  VGYG +N +  W+++NSWG    D+GYF++E G N
Sbjct: 290 ----GVYTSDTCGNTPMDVNHAVVAVGYGVENDVPYWLIKNSWGADWGDNGYFKMEMGKN 345

Query: 328 ACGIESYA 335
            CG+ + A
Sbjct: 346 MCGVATCA 353


>gi|410974700|ref|XP_003993781.1| PREDICTED: cathepsin F [Felis catus]
          Length = 459

 Score =  130 bits (328), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 101/320 (31%), Positives = 155/320 (48%), Gaps = 31/320 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E + R   F  +     +          YG +  SD 
Sbjct: 155 SVKMASIFKEFVTTYNRTYGTQEEAQWRLSVFSNNMVRAQKIQALDRGTAQYGITKFSDL 214

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E      + L    KE         K   +       P   DWR +K  V N V++Q
Sbjct: 215 TEEEF---RAIYLNPLLKENRNKMMHLAKSIGDHA-----PPEWDWR-TKGAVTN-VKNQ 264

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L +  L  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 265 GMCGSCWAFSVTGNVEGQWFLKQGDLLSLSEQELLDCDKVDKACLGGLPSNAYLAIKNLG 324

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y         C++  +KAKV++ D+   S  +  +   L + GPI V +N 
Sbjct: 325 GLETEDDYSYSGHLQT---CSFSAKKAKVYINDSVELSQNEQKLAAWLAKKGPISVAINA 381

Query: 264 RLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
             ++ Y     +P+R     C+P  +DHAV +VGYG ++GI  W ++NSWG    + GY+
Sbjct: 382 FGMQFYRRGISHPLRP---LCSPWLIDHAVLLVGYGNRSGIPFWAIKNSWGTDWGEEGYY 438

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 439 YLYRGSGACGVNAMASSAVV 458


>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
          Length = 360

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 95/301 (31%), Positives = 143/301 (47%), Gaps = 27/301 (8%)

Query: 49  KWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLE 108
           ++ + Y    EIK RFE F  + K    +     S      E        LT  E  R  
Sbjct: 67  RYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTD-----LTWDEFRR-- 119

Query: 109 ADRERVKKFLNERKKG-------PLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAIL 161
            DR    +  +   KG        LP++  WR++ +  ++PV++QG+CGSCW F+TT  L
Sbjct: 120 -DRLGAAQNCSATTKGNLKVTNVVLPETKGWREAGI--VSPVKNQGKCGSCWTFSTTGAL 176

Query: 162 ESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQYG-LESQADYPYRNK 218
           E+  +        LS+ QLV+C     N  CNGG    AFEY+K  G L+++  YPY  K
Sbjct: 177 EAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGK 236

Query: 219 ENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQSGPIGVYLNHRLIES---YDGNPI 274
             +   C +  E   V V D+  +T G +  +    +    V +   +I+    Y     
Sbjct: 237 NGL---CKFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGVY 293

Query: 275 RRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESY 334
              +    P  ++HAV  VGYG +NG+  W+++NSWG    D+GYF++E G N CGI + 
Sbjct: 294 TSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATC 353

Query: 335 A 335
           A
Sbjct: 354 A 354


>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 474

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 104/318 (32%), Positives = 164/318 (51%), Gaps = 36/318 (11%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQR 94
           D +K++  F++++VK  ++Y   +E   RF+ F+ + K  DE    +   +RS +  L R
Sbjct: 44  DEVKEM--FESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDE---KNSLENRSYKLGLNR 98

Query: 95  TGLRLTGKEKER--LEADRERVKKFLNER--KKGP-----LPKSLDWRQSKVKVLNPVES 145
               +T +E     L A R+  +  +  +  +  P     LP S+DWR+     +  V+ 
Sbjct: 99  FA-DITNEEYRTGYLGAKRDASRNMVKSKSDRYAPVAGDSLPDSIDWREKGA--VTGVKD 155

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-K 203
           QG CGSCWAF+T A +E    L    L  LS+ +LV+CD   N  CNGG++  AF+++ K
Sbjct: 156 QGSCGSCWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFIIK 215

Query: 204 QYGLESQADYPYRNKENITFRC-TYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGV 259
             G++S+ DYPY  K+    +C +Y +  AKV   D +    V++   L   + + P+ V
Sbjct: 216 NGGIDSEEDYPYTGKDG---KCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSV 272

Query: 260 YLNHRLIESYDGNPIRRNDW--ACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            +       YD        +  +C    LDH VA VGYG +NG+  WIV+NSWGD   + 
Sbjct: 273 AIE---AGGYDFQLYSSGIFTGSCGTD-LDHGVAAVGYGTENGVDYWIVKNSWGDYWGEK 328

Query: 318 GYFQIERGANA----CGI 331
           GY +++R   A    CGI
Sbjct: 329 GYVRMQRNVKAKTGLCGI 346


>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
 gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 89/307 (28%), Positives = 147/307 (47%), Gaps = 30/307 (9%)

Query: 46  YIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-RLTGKEK 104
           ++ ++ R Y D NE + R++ FK++ +  + +   SG S         + G+ +      
Sbjct: 42  WMSRFGRVYNDGNEKEIRYKIFKENVQRIESFNKASGKS--------YKLGINQFADLTN 93

Query: 105 ERLEADRERVKKFLNERKKGPL--------PKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
           E  +  R R K  +   + GP         P S+DWR  K   +  ++ QG+CGSCWAF+
Sbjct: 94  EEFKTSRNRFKGHMCSSQAGPFRYENLTAAPSSMDWR--KKGAVTAIKDQGQCGSCWAFS 151

Query: 157 TTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQ-YGLESQADY 213
             A +E    L    L  LS+ +LV+CD    +  C GG +D AF++++Q  GL ++A+Y
Sbjct: 152 AVAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQGLTTEANY 211

Query: 214 PYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIE-SYDGN 272
           PY   +            AK+   +    +    +M  +   P+ V ++       +  +
Sbjct: 212 PYEGSDGTCNTKQEANHAAKINGFEDVPANNEGALMKAVAKQPVSVAIDAGGFGFQFYSS 271

Query: 273 PIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA---- 328
            I   D  C   +LDH VA VGYGE NG+  W+V+NSWG    + GY ++++  +A    
Sbjct: 272 GIFTGD--CGT-ELDHGVAAVGYGESNGMNYWLVKNSWGTQWGEEGYIRMQKDIDAKEGL 328

Query: 329 CGIESYA 335
           CGI   A
Sbjct: 329 CGIAMQA 335


>gi|194689248|gb|ACF78708.1| unknown [Zea mays]
 gi|414885653|tpg|DAA61667.1| TPA: cysteine protease2 [Zea mays]
          Length = 360

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 94/318 (29%), Positives = 146/318 (45%), Gaps = 47/318 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQ--------DGKETDEYYGTSGSSDRSPQEI--- 91
           F  + V++ ++Y    E+  RF  F +        + K      G +  +D S +E    
Sbjct: 59  FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYRLGINRFADMSWEEFRAT 118

Query: 92  ----LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
                Q     LTG  + R  A                LP++ DWR+  +  ++PV++QG
Sbjct: 119 RLGAAQNCSATLTGNHRMRAAAV--------------ALPETKDWREDGI--VSPVKNQG 162

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-Q 204
            CGSCW F+TT  LE+           LS+ QL++C     N  CNGG    AFEY+K  
Sbjct: 163 HCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYIKYN 222

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMH----LLQSGPIG- 258
            GL+++  YPY+    I   C ++ E   V V D+  +T G +  +     L++   +  
Sbjct: 223 GGLDTEESYPYQGVNGI---CKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAF 279

Query: 259 -VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            V    RL   Y       +     P  ++HAV  VGYG ++G+  W+++NSWG    D 
Sbjct: 280 EVITGFRL---YKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDE 336

Query: 318 GYFQIERGANACGIESYA 335
           GYF++E G N CG+ + A
Sbjct: 337 GYFKMEMGKNMCGVATCA 354


>gi|283046734|ref|NP_001164314.1| cathepsin L precursor [Tribolium castaneum]
 gi|270001247|gb|EEZ97694.1| cathepsin L precursor [Tribolium castaneum]
          Length = 328

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 85/247 (34%), Positives = 132/247 (53%), Gaps = 23/247 (9%)

Query: 96  GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
           GL    K+ E+L         F+   K  P    +DWR S V   + V++QG+CGSCW+F
Sbjct: 93  GLATKPKKNEKLRL------PFVQSDK--PAAAEVDWRNSAV---SEVKNQGQCGSCWSF 141

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQYGLESQADY 213
           +TT  +E Q+A+  + L  LS+  LV+C   +GN  CNGG +D AF+Y+   G+ S++ Y
Sbjct: 142 STTGAVEGQLAISGRGLTSLSEQNLVDCSSAYGNAGCNGGWMDSAFDYIHDNGIMSESAY 201

Query: 214 PYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMH--LLQSGPIGVYLNHR-LIESY 269
           PY   E     C +   ++   +Q  + + SG ++ +   +  +GPI V L+    ++ Y
Sbjct: 202 PYTASEG---SCRFNPSESVTSLQGYYDLPSGDENALKSAVANNGPIAVALDATDELQFY 258

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANA 328
            G  +   D  C+   L+H V +VGYG + G   WIV+NSWG    + GY++  R   N 
Sbjct: 259 SGGVLY--DTTCSAQALNHGVLVVGYGSEGGQDYWIVKNSWGSGWGEQGYWRQARNRNNN 316

Query: 329 CGIESYA 335
           CGI + A
Sbjct: 317 CGIATAA 323


>gi|18414611|ref|NP_567489.1| Papain family cysteine protease [Arabidopsis thaliana]
 gi|2244977|emb|CAB10398.1| cysteine proteinase like protein [Arabidopsis thaliana]
 gi|7268368|emb|CAB78661.1| cysteine proteinase like protein [Arabidopsis thaliana]
 gi|14517442|gb|AAK62611.1| AT4g16190/dl4135w [Arabidopsis thaliana]
 gi|22136546|gb|AAM91059.1| AT4g16190/dl4135w [Arabidopsis thaliana]
 gi|22530956|gb|AAM96982.1| cysteine proteinase [Arabidopsis thaliana]
 gi|23397184|gb|AAN31875.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|110740834|dbj|BAE98514.1| cysteine proteinase like protein [Arabidopsis thaliana]
 gi|332658313|gb|AEE83713.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 373

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 155/320 (48%), Gaps = 43/320 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEILQR 94
           F  +  K+ +TY    E   RF  FK + +        +    +G +  SD +P+E  ++
Sbjct: 55  FTLFKSKYEKTYATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRK 114

Query: 95  -TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GL+  G    RL  D +             LP   DWR+     + PV++QG CGSCW
Sbjct: 115 FLGLKRRGF---RLPTDTQTAPIL----PTSDLPTEFDWREQGA--VTPVKNQGMCGSCW 165

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VK 203
           +F+    LE    L  K L  LS+ QLV+CDH          +  C+GG ++ AFEY +K
Sbjct: 166 SFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALK 225

Query: 204 QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYL 261
             GL  + DYPY  +++    C ++K K    V +  V S  +  +  +L+Q GP+ + +
Sbjct: 226 AGGLMKEEDYPYTGRDHTA--CKFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAI 283

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIG 314
           N   +++Y G       + C+  + DH V +VG+G              WI++NSWG + 
Sbjct: 284 NAMWMQTYIGG--VSCPYVCSKSQ-DHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMW 340

Query: 315 PDHGYFQIERGA-NACGIES 333
            +HGY++I RG  N CG+++
Sbjct: 341 GEHGYYKICRGPHNMCGMDT 360


>gi|7271889|gb|AAF44675.1|AF239264_1 cathepsin L [Fasciola gigantica]
          Length = 326

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 84/225 (37%), Positives = 122/225 (54%), Gaps = 20/225 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 108 VPDKIDWRESGY--VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 165

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN  C GG ++ A+EY+KQ+GLE+++ YPY   E    +C Y ++     V D + V 
Sbjct: 166 PWGNYGCMGGLMENAYEYLKQFGLETESSYPYTAVEG---QCRYNRQLGVAKVTDYYTVH 222

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIES----YDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           SG +  +  L    GP  V ++   +ES    Y G   +     C+  +++HAV  VGYG
Sbjct: 223 SGSEVELKNLVGAEGPAAVAVD---VESDFTMYSGGIYQSR--TCSSLRVNHAVLAVGYG 277

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
            + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 278 TQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIASLASLPMV 322


>gi|2677828|gb|AAB97142.1| cysteine protease [Prunus armeniaca]
          Length = 358

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 93/308 (30%), Positives = 153/308 (49%), Gaps = 29/308 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           F  +  ++ + Y    E+K R+E F ++ K        S +    P  +        + +
Sbjct: 59  FARFAHRYGKKYESVEEMKLRYEIFSENKKLI-----RSTNKKGLPYTLAVNRFADWSWE 113

Query: 103 E--KERLEADR--ERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
           E  ++RL A +      K  +E     LP+S +WR+  +  + PV+ QG CGSCW F+TT
Sbjct: 114 EFRRQRLGAAQNCSATTKGSHELTDAVLPESKNWREEGI--VTPVKDQGHCGSCWTFSTT 171

Query: 159 AILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLESQADYPY 215
             LE+      +    LS+ QLV+C     N  C+GG    AFEY+K   GL+++A YPY
Sbjct: 172 GALEAAYVQAFRKQISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYIKYNGGLDTEAAYPY 231

Query: 216 RNKENITFRCTYEKEKAKVFVQDTW-VTSG----VDHMMHLLQSGPIGVYLNHRLIES-- 268
              +     C +  E   V V D+  +T G    + H +  ++  P+ V    ++++S  
Sbjct: 232 VGTDGA---CKFSAENVGVQVLDSVNITLGDEQELKHAVAFVR--PVSVAF--QVVKSFR 284

Query: 269 -YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            Y       +    +P  ++HAV  VGYGE+ G+  W+++NSWG+   D+GYF++E G N
Sbjct: 285 IYKSGVYTSDTCGSSPMDVNHAVLAVGYGEEGGVPFWLIKNSWGESWGDNGYFKMEFGKN 344

Query: 328 ACGIESYA 335
            CG+ + A
Sbjct: 345 MCGVATCA 352


>gi|377656292|pdb|3QT4|A Chain A, Structure Of Digestive Procathepsin L 3 Of Tenebrio
           Molitor Larval Midgut
          Length = 329

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 77/220 (35%), Positives = 124/220 (56%), Gaps = 15/220 (6%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
           K PL  S+DWR + V   + V+ QG+CGS W+F+TT  +E Q+AL +  L  LS+  L++
Sbjct: 113 KKPLAASVDWRSNAV---SEVKDQGQCGSSWSFSTTGAVEGQLALQRGRLTSLSEQNLID 169

Query: 183 CD--HGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           C   +GN  C+GG +D AF Y+  YG+ S++ YPY  + +    C ++  ++   +   +
Sbjct: 170 CSSSYGNAGCDGGWMDSAFSYIHDYGIMSESAYPYEAQGDY---CRFDSSQSVTTLSGYY 226

Query: 241 -VTSGVDHMMH--LLQSGPIGVYLNHR-LIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
            + SG ++ +   + Q+GP+ V ++    ++ Y G      D  CN   L+H V +VGYG
Sbjct: 227 DLPSGDENSLADAVGQAGPVAVAIDATDELQFYSGGLFY--DQTCNQSDLNHGVLVVGYG 284

Query: 297 EKNGILTWIVRNSWGDIGPDHGYF-QIERGANACGIESYA 335
             NG   WI++NSWG    + GY+ Q+    N CGI + A
Sbjct: 285 SDNGQDYWILKNSWGSGWGESGYWRQVRNYGNNCGIATAA 324


>gi|225431287|ref|XP_002275759.1| PREDICTED: cysteine proteinase RD19a isoform 1 [Vitis vinifera]
 gi|297735094|emb|CBI17456.3| unnamed protein product [Vitis vinifera]
          Length = 367

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 95/318 (29%), Positives = 150/318 (47%), Gaps = 42/318 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ +TY+   E   RF  F+ + +    +        +G +  SD +P E  +R
Sbjct: 52  FGLFKAKFGKTYSTVEEHDYRFSVFEANLRRARRHQLLDPSAVHGVTRFSDLTPDE-FRR 110

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
             L   G +  RL AD ++            LP   DWR      + PV+ QG CGSCW+
Sbjct: 111 DYL---GLKPLRLPADAQKAPIL----PTNDLPTDFDWRDHGA--VTPVKDQGSCGSCWS 161

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-KQ 204
           F+    LE    L    L  +S+ QLV+CDH          +  CNGG +  AFEY+ K 
Sbjct: 162 FSAIGALEGAHFLTTGNLISMSEQQLVDCDHECDPEEYGACDQGCNGGLMTSAFEYILKA 221

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            G+E +  YPY   +  +  C + K +    V +  V S  +  +  +++++GP+ V +N
Sbjct: 222 GGVEREETYPYIGSDRGS--CKFNKSQIVASVSNFSVVSLDEDQIAANMVKNGPLAVGIN 279

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y         + C+   LDH V +VGYG              WI++NSWG+   
Sbjct: 280 AVFMQTYMKG--VSCPYICS-RNLDHGVVLVGYGSAGYAPIRFKEKPYWIIKNSWGESWG 336

Query: 316 DHGYFQIERGANACGIES 333
           + GY++I RG NACG++S
Sbjct: 337 EDGYYKICRGHNACGVDS 354


>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
          Length = 335

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 97/319 (30%), Positives = 153/319 (47%), Gaps = 46/319 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEILQR 94
           F T+  K+++TY    E   RF  FK + +        +    +G +  SD +P E  ++
Sbjct: 22  FSTFKSKFSKTYATKEEHDYRFGVFKSNVRRAKLHAKLDPSAVHGVTKFSDLTPSEFRRQ 81

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G +  RL    ++            LP+  DWR      +  V++QG CGSCWA
Sbjct: 82  ----FLGLKPLRLPEHAQKAPILPTHD----LPEDFDWRDKGA--VTHVKNQGSCGSCWA 131

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYVKQY 205
           F+TT  LE    L    L  LS  QLV+CDH          +  CNGG ++ AFEY+ + 
Sbjct: 132 FSTTGALEGSHFLATGELVSLSDQQLVDCDHVCDPEQYGACDSGCNGGLMNNAFEYILES 191

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            G++ + DYPY  ++    R     E     V +  V S  +  +  +L+++GP+ + +N
Sbjct: 192 GGVQREEDYPYTGRD----RGPAIDEANAASVSNFSVVSLDEDQISANLVKNGPLAIGIN 247

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT--------WIVRNSWGDIG 314
              +++Y G       + C  + LDH V +VGYG K G           WI++NSWG+  
Sbjct: 248 AVFMQTYIGG--VSCPYICGKN-LDHGVLLVGYG-KAGYAPIRLKEKPYWIIKNSWGESW 303

Query: 315 PDHGYFQIERGANACGIES 333
            ++GY++I RG N CG++S
Sbjct: 304 GENGYYKICRGRNVCGVDS 322


>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
          Length = 389

 Score =  130 bits (328), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 153/318 (48%), Gaps = 33/318 (10%)

Query: 40  VDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRL 99
           ++ F+ +  K  + Y    E + RFE FK + K     Y    ++ R   +     GL  
Sbjct: 46  LEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLK-----YILERNAKRKANKWEHHVGLNK 100

Query: 100 TG--KEKERLEADRERVKKFLNE--------RKK---GPLPKSLDWRQSKVKVLNPVESQ 146
                 +E  +A   +VKK +N+        R+K      P SLDWR     V+  V+ Q
Sbjct: 101 FADMSNEEFRKAYLSKVKKPINKGITLSRNMRRKVQSCDAPSSLDWRN--YGVVTAVKDQ 158

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV-KQY 205
           G CGSCWAF++T  +E   AL+   L  LS+ +LVECD  N  C GG +D AFE+V    
Sbjct: 159 GSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTSNYGCEGGYMDYAFEWVINNG 218

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--VTSGVDHMMHLLQSGPIGVYLNH 263
           G++S++DYPY   +     C   KE+ KV   D +  V      ++  +   P+ V ++ 
Sbjct: 219 GIDSESDYPYTGVDGT---CNTTKEETKVVSIDGYQDVEQSDSALLCAVAQQPVSVGIDG 275

Query: 264 RLI--ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
             I  + Y G  I     + +P  +DHAV IVGYG ++    WIV+NSWG      GYF 
Sbjct: 276 SAIDFQLYTGG-IYDGSCSDDPDDIDHAVLIVGYGSEDSEEYWIVKNSWGTSWGIDGYFY 334

Query: 322 IERGAN----ACGIESYA 335
           ++R  +     C + + A
Sbjct: 335 LKRDTDLPYGVCAVNAMA 352


>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
          Length = 341

 Score =  130 bits (328), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 85/228 (37%), Positives = 119/228 (52%), Gaps = 23/228 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPK++DWR      + P++ QG+CGSCWAF+ T  LE Q       L  LS+  LV+C  
Sbjct: 124 LPKNVDWRTKGA--VTPIKDQGQCGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSR 181

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKA----KVFVQD 238
             GN  CNGG +D AFEYVK+  G++++  YPY  ++    +C Y    A    K FV  
Sbjct: 182 KFGNNGCNGGLMDNAFEYVKENGGIDTEESYPYDAEDE---KCHYNPRAAGAEDKGFVD- 237

Query: 239 TWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVG 294
             V  G +H +   +   GP+ V ++  H   + Y        +  C+P  LDH V +VG
Sbjct: 238 --VREGSEHALKKAVATVGPVSVAIDASHESFQFYSHGVYIEPE--CSPEMLDHGVLVVG 293

Query: 295 YG-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           YG + +G   W+V+NSWG    D GY ++ R   N CGI S A    V
Sbjct: 294 YGIDDDGTDYWLVKNSWGTTWGDQGYVKMARNRDNQCGIASSASFPLV 341


>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  130 bits (327), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 104/341 (30%), Positives = 163/341 (47%), Gaps = 38/341 (11%)

Query: 16  VTYNVNTDSAIYVWRDLAYDSI-KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET 74
           +TY +  D +I  +      S+ K ++ F++++ K ++TY    E   RFE F  + K  
Sbjct: 19  ITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYRSIEEKLHRFEIFLDNLKHI 78

Query: 75  DE--------YYGTSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGP 125
           DE        + G +  +D S +E   +  GLR+        E  R+R  +  +      
Sbjct: 79  DETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRV--------EFPRKRSSRGFSYGDVED 130

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP+S+DWR      + PV++QG CGSCWAF+T A +E    ++   L  LS+ +L++CD 
Sbjct: 131 LPESVDWRTKGA--VTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDR 188

Query: 186 G-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
             N  C GG +D AF+Y+    GL  + DYPY  +E    RC  EKE+ +V     +   
Sbjct: 189 SFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEG---RCIREKEQFEVVTISGYEDV 245

Query: 244 GVDHMMHLLQS---GPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
             +    LL++    P+ V +  + R  + Y G         C   ++DH V  VGYG  
Sbjct: 246 PANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGR---CGT-QMDHGVTAVGYGSS 301

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGA----NACGIESYA 335
            G    IV+NSWG    ++GY +++R        CGI   A
Sbjct: 302 EGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQMA 342


>gi|403367386|gb|EJY83513.1| Cathepsin L [Oxytricha trifallax]
          Length = 339

 Score =  130 bits (327), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 77/216 (35%), Positives = 119/216 (55%), Gaps = 17/216 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P S+DWR      + PV+ QG+CGSCWAF+TT  LE + A+   TL   S+ QLV+CD+
Sbjct: 124 VPDSIDWRTKGA--VTPVKDQGQCGSCWAFSTTGSLEGRDAIATGTLQSYSEQQLVDCDY 181

Query: 186 ---GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEK--AKVFVQDTW 240
              GN  CNGG++ +A +Y  +  LE ++DYPY+    I  +C+Y+ +K  +K       
Sbjct: 182 STDGNQGCNGGDMGLAMDYSAKNPLELESDYPYK---AIDGKCSYKADKGHSKNKGHTNV 238

Query: 241 VTSGVDHMMHLLQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
             + +  +   +  GP+ V +  +  + + Y+G  +       N   LDH V  VGYG +
Sbjct: 239 KQNSLPDLKAAIAQGPVSVAIEADTMVFQFYNGGILNSKSCGTN---LDHGVLAVGYGSE 295

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIER--GANACGIE 332
           N    +IV+NSWG    + GY +I +  GA  CGI+
Sbjct: 296 NNKPYYIVKNSWGPSWGEQGYLRIAQVDGAGICGIQ 331


>gi|321460289|gb|EFX71333.1| hypothetical protein DAPPUDRAFT_189155 [Daphnia pulex]
          Length = 266

 Score =  130 bits (327), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 80/224 (35%), Positives = 116/224 (51%), Gaps = 14/224 (6%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP   DWR   V  + PV+ QG CGSCWAF+ T  +E   A+    L  LS+ +LV+CD 
Sbjct: 47  LPDEFDWRNHSV--VTPVKDQGSCGSCWAFSVTGNVEGIYAVRNGDLLSLSEQELVDCDK 104

Query: 186 GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG 244
            +  CNGG  + A++ +    GLE+++DYPY   EN   +C +     +V V      S 
Sbjct: 105 LDSGCNGGLPENAYKAIHDIGGLETESDYPYNGHEN---KCKFNSNITRVQVTGGVEIST 161

Query: 245 VDHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-----E 297
            +  M   L+Q+GPI + +N   ++ Y G         C P  +DH V IVGYG     +
Sbjct: 162 NETEMAQWLIQNGPISIGINANAMQYYRGGVSHPWKVLCRPGGIDHGVLIVGYGVSQYPK 221

Query: 298 KNGILT-WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
            N  L  WIV+NSWG    + GY+++ RG   CG+      A++
Sbjct: 222 FNKTLPYWIVKNSWGTRWGEQGYYRVFRGDGTCGLNQMCTSATL 265


>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 102/317 (32%), Positives = 151/317 (47%), Gaps = 31/317 (9%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K ++ F+ +I  + + Y    E   RFE FK + K  DE        + G +  +D S +
Sbjct: 46  KLIELFENWISNFEKAYETVEEKLLRFEVFKDNLKHIDETNKKVKSYWLGLNEFADLSHE 105

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E  +       G + + +  D ER       R    +PKS+DWR  K   +  V++QG C
Sbjct: 106 EFKKM----YLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWR--KKGAVAEVKNQGSC 159

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYGL 207
           GSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AFEY VK  GL
Sbjct: 160 GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGL 219

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQD---TWVTSGVDHMMHLLQSGPIGVYLNH- 263
             + DYPY  +E     C  +K++++    D      T+    ++  L   P+ V ++  
Sbjct: 220 RKEEDYPYSMEEGT---CEMQKDESETVTIDGHQDVPTNDEKSLLKALAHQPLSVAIDAS 276

Query: 264 -RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
            R  + Y G  +   D  C    LDH VA VGYG   G    IV+NSWG    + GY ++
Sbjct: 277 GREFQFYSG--VSVFDGRCGV-DLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGYIRL 333

Query: 323 ERGANA----CGIESYA 335
           +R        CGI   A
Sbjct: 334 KRNTGKPEGLCGINKMA 350


>gi|118123|sp|P25782.1|CYSP2_HOMAM RecName: Full=Digestive cysteine proteinase 2; Flags: Precursor
 gi|11053|emb|CAA45128.1| cysteine proteinase preproenzyme [Homarus americanus]
          Length = 323

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 93/300 (31%), Positives = 147/300 (49%), Gaps = 20/300 (6%)

Query: 49  KWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLE 108
           K+ R Y D  E   R   F+Q+ K  +E+     + + +    + + G     +    ++
Sbjct: 26  KYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMK 85

Query: 109 ADRER----VKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQ 164
            +  R    V  F  +++ GP    +DWR      + PV+ QG+CGSCWAF+TT  LE Q
Sbjct: 86  GNIPRRSAPVSVFYPKKETGPQATEVDWRTKGA--VTPVKDQGQCGSCWAFSTTGSLEGQ 143

Query: 165 VALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENI 221
             L   +L  L++ QLV+C   +G   CNGG ++ AF+Y+K   G++++A YPY  ++  
Sbjct: 144 HFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDG- 202

Query: 222 TFRCTYEKEK-AKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRR 276
              C ++    A      T + SG +  +   +   GPI V ++  H   + Y       
Sbjct: 203 --SCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYE 260

Query: 277 NDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
              +C+P  LDHAV  VGYG + G   W+V+NSW     D GY ++ R   N CGI + A
Sbjct: 261 P--SCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVA 318


>gi|18399697|ref|NP_565512.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
 gi|12643282|sp|P43295.2|A494_ARATH RecName: Full=Probable cysteine proteinase A494; Flags: Precursor
 gi|4567274|gb|AAD23687.1| cysteine proteinase [Arabidopsis thaliana]
 gi|116325924|gb|ABJ98563.1| At2g21430 [Arabidopsis thaliana]
 gi|330252083|gb|AEC07177.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
          Length = 361

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 96/320 (30%), Positives = 155/320 (48%), Gaps = 41/320 (12%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTSGSSDRSPQEIL 92
           D F  +  K+ + Y    E   RF  FK +          +    +G +  SD +  E  
Sbjct: 46  DHFTLFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSE-F 104

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           +R  L + G  K   +A++  +    N      LP+  DWR      + PV++QG CGSC
Sbjct: 105 RRKHLGVKGGFKLPKDANQAPILPTQN------LPEEFDWRDRGA--VTPVKNQGSCGSC 156

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-V 202
           W+F+TT  LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +
Sbjct: 157 WSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTL 216

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVY 260
           K  GL  + DYPY   +  +  C  ++ K    V +  V S  +  +  +L+++GP+ V 
Sbjct: 217 KTGGLMREKDYPYTGTDGGS--CKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVA 274

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDI 313
           +N   +++Y G       + C+  +L+H V +VGYG              WI++NSWG+ 
Sbjct: 275 INAAYMQTYIGGV--SCPYICS-RRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGES 331

Query: 314 GPDHGYFQIERGANACGIES 333
             ++G+++I +G N CG++S
Sbjct: 332 WGENGFYKICKGRNICGVDS 351


>gi|19698257|dbj|BAB86771.1| cathepsin L-like [Engraulis japonicus]
          Length = 324

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 83/234 (35%), Positives = 126/234 (53%), Gaps = 13/234 (5%)

Query: 110 DRERVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVA 166
           D  +  +  +E  + P   L  S+DWR S    ++P+++QG+CGSCW+F+ T  LESQ  
Sbjct: 91  DPPKNNRGASEPFRAPNVGLAASVDWRTSGC--VSPIKNQGQCGSCWSFSATGALESQTC 148

Query: 167 LLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENIT- 222
           L +  L  LS+ QLV+C   +GN  CNGG  D AF+YV+   G++S++ YPY+ +     
Sbjct: 149 LRRGYLPSLSEQQLVDCSGPYGNYGCNGGWPDHAFQYVQANGGIDSESYYPYQARVGTCH 208

Query: 223 FRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACN 282
           +   Y       +   T V S      ++   GP+ + ++    +SY       ND +C+
Sbjct: 209 YNSAYSAATCSGYQDVTPVGSESALQYYVANVGPLSIAIDASGWQSYQSGVF--NDPSCS 266

Query: 283 PHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
               DHAV +VGYG  NG   W+V+NSWG    + GY  + R A N CGI ++A
Sbjct: 267 -QTADHAVLLVGYGTYNGQDYWLVKNSWGTWWGEQGYIMMARNANNQCGIANHA 319


>gi|290984408|ref|XP_002674919.1| predicted protein [Naegleria gruberi]
 gi|284088512|gb|EFC42175.1| predicted protein [Naegleria gruberi]
          Length = 353

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 102/327 (31%), Positives = 154/327 (47%), Gaps = 36/327 (11%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDG--------KETDEYYGTSGSSDRSPQEIL 92
           D F  +  K+ R Y    E + R + F+++         +E +  YG +  SD +  E  
Sbjct: 35  DHFLDFTRKFQRFYKGPEEYEYRLKVFRENIETSRRMNIREGNNNYGITKFSDLTSDE-F 93

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           ++  L      KE  +  R    K ++     P P   DWR      +  V+ QG+CGSC
Sbjct: 94  RKFYLMEKKTPKEIQKMMRMDSNKMVSNSYAKPAPDHYDWRNHGA--ITGVKDQGQCGSC 151

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLN----------CNGGNIDVAFEYV 202
           WAF+    +E   A+  K L   S+ QLV+CD+  +           CNGG    A++Y+
Sbjct: 152 WAFSAIGSIEGSYAIKHKQLVSFSEQQLVDCDNNCVTFENQQSCDDGCNGGLQWSAYQYL 211

Query: 203 -KQYGLESQADYPYRNKENITFRCTYEKEK--AKVFVQDTWVTSGVDHMMHLLQSGPIGV 259
            K  G+ ++ DYPY  +    ++C  +     AK+       T+  +    L ++GPI V
Sbjct: 212 MKAGGVVTEKDYPYYAER---YKCEVKPANFVAKLSNWTMLSTNETEMANWLAENGPIAV 268

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-----WIVRNSWG-DI 313
            LN   +++Y+ N I    W C+P +LDH V IVGYG +          WIV+NSWG D 
Sbjct: 269 ALNADFLQNYN-NGIADPAW-CDPTQLDHGVLIVGYGLETFWFGKPQPYWIVKNSWGYDF 326

Query: 314 GPDHGYFQIERGANACGIESYAYLASV 340
           G D GYF+I +G   CGI +    A V
Sbjct: 327 GED-GYFRIVKGVGRCGINTVPSAAFV 352


>gi|6649575|gb|AAF21461.1|U69120_1 cysteine proteinase PWCP1 [Paragonimus westermani]
          Length = 427

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 94/305 (30%), Positives = 145/305 (47%), Gaps = 26/305 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK---------QDGKETDEYYGTSGSSDRSPQEILQ 93
           F+ +  K+ ++Y+ D     R+  FK         Q  ++    YG +  SD S +E   
Sbjct: 127 FEEFQRKFRKSYSSDT--AKRYALFKYNLLKMQLIQRLEKGTANYGITKFSDLSAEE--- 181

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
               R +    +R ++   +++  +       LP S DWR +    +  V+ QG CGSCW
Sbjct: 182 ---FRHSLANMKRRKSKGSQMETAIFPTTIQSLPPSFDWRANGA--VTEVKDQGMCGSCW 236

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLESQAD 212
           AFATT  +E Q       L  LS+ QL++CD  +  CNGG  + A+ E VK  GL S+ D
Sbjct: 237 AFATTGNIEGQWFRKTNKLISLSEQQLLDCDTKDEACNGGLPEWAYDEIVKMGGLMSEKD 296

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIESYD 270
           YPY   +  +  C   +     ++  +      +  +   L+Q+GPI V +N   ++ Y 
Sbjct: 297 YPYEAMKEQS--CHLRRPNISAYINGSATLPSDEAKLAAWLVQNGPISVGVNANFLQFYL 354

Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT--WIVRNSWGDIGPDHGYFQIERGANA 328
           G         C+   LDHAV +VGYG    +    WIV+NSWG    + GYF++ RG   
Sbjct: 355 GGISHPPHMLCSEAGLDHAVLLVGYGVSTFLRRPYWIVKNSWGGGWGEKGYFRMYRGDGT 414

Query: 329 CGIES 333
           CGI +
Sbjct: 415 CGINA 419


>gi|291224892|ref|XP_002732436.1| PREDICTED: cathepsin H-like [Saccoglossus kowalevskii]
          Length = 302

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 83/219 (37%), Positives = 115/219 (52%), Gaps = 13/219 (5%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +PK+ DW +   KV+  V+SQG CGSCW+F+TT  LES  A+ K TL  LS+ QL++C  
Sbjct: 81  IPKAFDWTKYSRKVVTDVKSQGSCGSCWSFSTTGALESATAIAKSTLISLSEQQLIDCAQ 140

Query: 186 G--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-V 241
              N  CNGG    AFEY+    GL +  DY Y+ K+    +C Y+  KA  FV     +
Sbjct: 141 AFNNHGCNGGLPAQAFEYIHYNDGLMADIDYQYKAKDG---KCKYDPSKAAAFVSKIVNI 197

Query: 242 TSGVDH--MMHLLQSGPIGV-YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE- 297
           T G +   +  + + GP+ + Y        Y            +P  ++HAV   G+ E 
Sbjct: 198 TKGDEDGILNAVYKHGPVSIAYDVASDFHLYHSGVYSSTVCKIDPEHVNHAVLATGFNET 257

Query: 298 KNGILTWIVRNSWG-DIGPDHGYFQIERGANACGIESYA 335
             G+  W+V+NSWG D G D GYF IER  N CG+   A
Sbjct: 258 AEGLKYWMVKNSWGPDWGLD-GYFWIERNKNMCGLADCA 295


>gi|77628008|ref|NP_001029282.1| cathepsin F precursor [Rattus norvegicus]
 gi|71681040|gb|AAH99780.1| Cathepsin F [Rattus norvegicus]
 gi|149062007|gb|EDM12430.1| cathepsin F, isoform CRA_a [Rattus norvegicus]
 gi|159895422|gb|ABX09995.1| cathepsin F [Rattus norvegicus]
          Length = 462

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 100/320 (31%), Positives = 155/320 (48%), Gaps = 31/320 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E + R   F ++     +          YG +  SD 
Sbjct: 158 SVKMATLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDL 217

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E        L  KE       +  + K +N+      P   DWR  K   +  V+ Q
Sbjct: 218 TEEEFHTIYLNPLLQKE----SGGKMSLAKSINDLA----PPEWDWR--KKGAVTEVKDQ 267

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 268 GMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKMDKACMGGLPSNAYTAIKNLG 327

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y+        C +  + AKV++ D+   S  ++ +   L Q GPI V +N 
Sbjct: 328 GLETEDDYGYQGHVQA---CNFSTQMAKVYINDSVELSRDENKIAAWLAQKGPISVAINA 384

Query: 264 RLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
             ++ Y     +P R     C+P  +DHAV +VGYG ++ I  W ++NSWG    + GY+
Sbjct: 385 FGMQFYRHGIAHPFRP---LCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGRDWGEEGYY 441

Query: 321 QIERGANACGIESYAYLASV 340
            + RG+ ACG+ + A  A V
Sbjct: 442 YLYRGSGACGVNTMASSAVV 461


>gi|19698255|dbj|BAB86770.1| cathepsin L-like [Engraulis japonicus]
          Length = 324

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 79/215 (36%), Positives = 119/215 (55%), Gaps = 10/215 (4%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           L  S+DWR S    ++P+++QG+CGSCW+F+ T  LESQ  L +  L  LS+ QLV+C  
Sbjct: 110 LAASVDWRTSGC--VSPIKNQGQCGSCWSFSATGALESQTCLRRGYLPSLSEQQLVDCSG 167

Query: 184 DHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENIT-FRCTYEKEKAKVFVQDTWV 241
            +GN  CNGG  D AF+Y++   G++S++ YPY+ +     +   Y       +   T V
Sbjct: 168 SYGNYGCNGGWPDQAFQYIQANGGIDSESYYPYQARVGTCHYNSAYSAATCSGYQDVTPV 227

Query: 242 TSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGI 301
            S      ++   GP+ + ++    +SY       ND +C+    DHAV +VGYG  NG 
Sbjct: 228 GSESALQYYVANVGPLSIAIDASGWQSYQSGVF--NDPSCS-QTADHAVLLVGYGTYNGQ 284

Query: 302 LTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
             W+V+NSWG    + GY  + R A N CGI ++A
Sbjct: 285 DYWLVKNSWGTWWGEQGYIMMTRNANNQCGIANHA 319


>gi|42564149|gb|AAS20588.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 322

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 102/301 (33%), Positives = 156/301 (51%), Gaps = 37/301 (12%)

Query: 52  RTYTDDNEIKTRFEYFK-------------QDGKETDEYYGTSGSSDRSPQEILQRTGLR 98
           +TY    E +TRF  F+             ++GK T  Y   +  +D +  E  ++ GL+
Sbjct: 32  KTYKSLLEERTRFGIFQNNLRTIEKHNAKYEEGKVT-YYMAVTQFADMTRDEFRKKLGLQ 90

Query: 99  LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
                +  L A  +   + L       LP+ +DW + K  VL PV++QG C SCWAF+TT
Sbjct: 91  --NNRRPNLNATLQVFPEDLE------LPEQIDWTE-KGAVL-PVKNQGNCRSCWAFSTT 140

Query: 159 AILESQVALLKKTLYPLSKSQLVECD--HGNLNCN-GGNIDVAFEYVKQYGLESQADYPY 215
             LE Q A+  K   PLS+ QL++C   +GN +C+ GG +  AF+Y+   G+E+++ YPY
Sbjct: 141 GSLEGQNAIHNKVKTPLSEQQLLDCSASYGNGDCDDGGLMTEAFDYIIDNGIEAESSYPY 200

Query: 216 RNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMHLLQS-GPIGVYLNHRLIESYDGNP 273
              E +T  C Y+ +K  V ++    + +  D +   + + GPI V ++   +  Y G  
Sbjct: 201 V--EQMT-ECQYDAKKTIVQIKGYKKLLADEDELKKAVGTVGPISVGMSSENLHMYGGGV 257

Query: 274 IRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIE 332
           +    +      +DHAV +VGYGE NG   W V+NSWG    + GYF+IER A N C I 
Sbjct: 258 LGDQCY----FGMDHAVLVVGYGEANGKKFWKVKNSWGATWGEDGYFRIERDADNLCDIA 313

Query: 333 S 333
           S
Sbjct: 314 S 314


>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 97/310 (31%), Positives = 158/310 (50%), Gaps = 35/310 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           F++++V   ++Y    E + RF+ FK + +  DE           G +  +D + +E   
Sbjct: 45  FESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEYRS 104

Query: 94  R-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           + TG++ +   ++++ A   R      E     LP+S+DWR+S    +  V+ QG CGSC
Sbjct: 105 KYTGIK-SKDLRKKVSAKSGRYATLSGES----LPESVDWRESGA--VATVKDQGSCGSC 157

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQ 210
           WAF+T + +E    +    L  LS+ +LV+CD   N  CNGG +D AFE++    G+++ 
Sbjct: 158 WAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDYAFEFIINNGGIDTD 217

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSG---PIGVYL--NHRL 265
            DYPY  ++    +C   ++ AKV   D++        + L ++    PI V +  + R 
Sbjct: 218 VDYPYTGRDG---KCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRD 274

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
            + YD          C    LDH V +VGYG +NG   WIVRNSWG    ++GY ++ERG
Sbjct: 275 FQFYDSGIFTGK---CGI-ALDHGVVVVGYGTENGKDYWIVRNSWGADWGENGYLRMERG 330

Query: 326 ANA----CGI 331
            ++    CGI
Sbjct: 331 ISSKTGICGI 340


>gi|403376395|gb|EJY88173.1| Cysteine protease-5 [Oxytricha trifallax]
          Length = 401

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 99/310 (31%), Positives = 157/310 (50%), Gaps = 36/310 (11%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYF-------KQDGKETDEYY--GTSGSSDRSPQEIL 92
           AF  ++ ++ +TY   N + +RF+ F       K   +  +++Y  G +  SD + +E L
Sbjct: 71  AFIQFVAEYGKTYATKNHLNSRFDIFAKNFEMIKSHNENEEKHYEMGINKFSDMTHEEFL 130

Query: 93  Q---RTGLRLTGKEKERLEA---DRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           +   + G+ +  +EK RLEA   +R    + +        P+ +DWR++  KV  P + Q
Sbjct: 131 EHYHKQGVLIPSEEK-RLEAHHANRHPSLQAMASDDNQAAPEKVDWREAG-KVSVPGD-Q 187

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYP--LSKSQLVECDHGNLNCNGGNIDVAFEYVKQ 204
             CGSCWAF T   LES  A+ K    P   S   L++CD GN  C GG +  A+E+ K 
Sbjct: 188 SSCGSCWAFTTATTLESLHAI-KNDTKPERFSVQYLIDCDEGNFGCGGGWMLDAYEFTKT 246

Query: 205 YGLESQADYP--YRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH--MMHLLQSGPIGVY 260
            GL  + DYP  Y   +N    C   K+K + +  D      +D+  +  L+   P+GV 
Sbjct: 247 KGLLKEEDYPRKYTMSKN---SCVDVKDKQRFYNHDQKEEDNIDNDRLRKLVSIRPVGVA 303

Query: 261 L--NHRLIESYDGNPIRRNDWACNPHK--LDHAVAIVGYGE----KNGILTWIVRNSWGD 312
           +  N R + SY    +R  D  C+  K  ++HAV IVGYG+    K+ +  W+V+NSWG 
Sbjct: 304 MHSNPRCLMSYKNGILREEDCKCSDEKNQVNHAVTIVGYGKVDNSKDCVGYWLVKNSWGP 363

Query: 313 IGPDHGYFQI 322
              D G+F++
Sbjct: 364 RWGDQGFFKL 373


>gi|318844127|ref|NP_001187181.1| cathspsin H precursor [Ictalurus punctatus]
 gi|196475594|gb|ACG76366.1| cathspsin H [Ictalurus punctatus]
          Length = 326

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 98/304 (32%), Positives = 146/304 (48%), Gaps = 19/304 (6%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTG 101
            FKT++ + N+ Y  + E   R + F ++ K+ D +   +  + +    + Q + +    
Sbjct: 27  VFKTWMSEHNKQYGLE-EYYPRLQIFTENKKKIDTH---NAGNHKFRMGLNQFSDMTFAE 82

Query: 102 KEKERL--EADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
            +K  L  E       K  + R  G  P S+DWR+ K   +  V++QG CGSCW F+TT 
Sbjct: 83  FKKFYLLKEPQECNATKGNHVRGVGLYPDSIDWRK-KGNYVTEVKNQGACGSCWTFSTTG 141

Query: 160 ILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYR 216
            LES  A+    L  L++ QLV+C     N  CNGG    AFEY+    GL ++ DYPY 
Sbjct: 142 CLESVTAIATGKLPLLAEQQLVDCAGAFNNHGCNGGLPSQAFEYIMYNKGLMTEDDYPYV 201

Query: 217 NKENITFRCTYEKEKAKVFVQDTWVTSGVDHM-----MHLLQSGPIGVYLNHRLIESYDG 271
            ++     C ++ + A  FV+D    +  D M     +  L    I   +    +   DG
Sbjct: 202 GRDG---PCKFDPKLAAAFVKDVVNITKYDEMGIVDAVARLNPVSIAFEVLPEFMHYKDG 258

Query: 272 NPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGI 331
                N+       ++HAV  VGY E+NG   WIV+NSWG      GYF IERG N CG+
Sbjct: 259 -VYTSNECHNTTETVNHAVLAVGYAEENGTPYWIVKNSWGPQWGIDGYFYIERGQNMCGL 317

Query: 332 ESYA 335
            + A
Sbjct: 318 AACA 321


>gi|42564161|gb|AAS20592.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 326

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 103/333 (30%), Positives = 164/333 (49%), Gaps = 38/333 (11%)

Query: 25  AIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE-------- 76
           A++    LA +++   D +  +     +TY    E +TRF  F+ + ++ +E        
Sbjct: 5   AVFATVLLAVNALTDKDQWVAFKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKG 64

Query: 77  ----YYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDW 132
               + G +  +D +  E   +  LR   K K  +EA      + L       +P S+DW
Sbjct: 65  EESYFLGVTPFADLTHDEFKDK--LRRQIKTKPNVEATLAVFPEGLE------VPDSIDW 116

Query: 133 RQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNC 190
            Q K  VL+ V+ QG CGSCWAF+ T  LE Q A++     PLS+ QL++C   +GN +C
Sbjct: 117 TQ-KGAVLD-VKYQGGCGSCWAFSATGALEGQNAIVNNVKIPLSEQQLLDCSKPYGNDDC 174

Query: 191 -NGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM 249
            +GG +  AF+YV   G+E+ + YPY+    I   C Y+ +K  + ++     S  +  +
Sbjct: 175 EHGGLMSFAFDYVLDKGIEADSSYPYK---GIDTPCQYDAKKTVLKIKGYRNVSISEEEL 231

Query: 250 H--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT---- 303
              +   GP+ V ++   I+ Y G  +   D     H L+H V  VGYGE++ +      
Sbjct: 232 KKAVGTVGPVSVAIDADPIQLYSGGIL---DGLFCTHNLNHGVLAVGYGEEDHLFGKKKF 288

Query: 304 WIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           W V+NSWG    + GYF+I+R A N CGI   A
Sbjct: 289 WKVKNSWGKDWGEQGYFRIKRDANNLCGIADKA 321


>gi|340053968|emb|CCC48262.1| cysteine peptidase, Clan CA, family C1,Cathepsin L-like, fragment,
           partial [Trypanosoma vivax Y486]
          Length = 323

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 85/300 (28%), Positives = 138/300 (46%), Gaps = 25/300 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ R+Y    E   R   F+ + + +  Y        +G +  SD +P+E   R
Sbjct: 34  FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                    +   EA R RV+  + +   G  P ++DWR+     + PV+ QGRCGSCW+
Sbjct: 94  YH-----NGERHFEAARGRVRTLV-QVPPGKAPAAVDWRRKGA--VTPVKDQGRCGSCWS 145

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG---LESQA 211
           F+    +E Q A     L  LS+  LV CD  +  C GG +D AFE++ +     + ++ 
Sbjct: 146 FSAIGNIEGQWAAAGNPLTSLSEQMLVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTEK 205

Query: 212 DYPYRNKENITFRCT-YEKEKAKVFVQDTWVTSGVDHMM-HLLQSGPIGVYLNHRLIESY 269
            YPY +++     C  Y  E          +    D +  +L  +GP+ V ++     SY
Sbjct: 206 SYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSY 265

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
            G  +     +C    L+H V +VGY + +    WI++NSW     + GY +IE+G N C
Sbjct: 266 SGGVVT----SCTSEALNHGVLLVGYNDSSKPPYWIIKNSWSSSWGEKGYIRIEKGTNQC 321


>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
 gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
          Length = 341

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 94/307 (30%), Positives = 150/307 (48%), Gaps = 32/307 (10%)

Query: 44  KTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKE 103
           + ++VK+ R Y D++E + RFE F+ +  E  E +   G+    P ++       LT   
Sbjct: 39  EMWMVKYGRVYKDNSEKERRFEIFRNN-VEFIESFNKPGNR---PYKLDINEFADLT--- 91

Query: 104 KERLEADRERVKKF----LNERKK------GPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
            E  +A R   K+     L+E+          +P S+DWRQ     + P++ QG+CG CW
Sbjct: 92  NEEFKASRNGYKRSSNVGLSEKSSFRYGNVTAVPTSMDWRQKGA--VTPIKDQGQCGCCW 149

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQYG-LESQ 210
           AF+  A +E    L    L  LS+ +LV+CD    +  C GG +D AFE++KQ G L ++
Sbjct: 150 AFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTE 209

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHR--LIES 268
           A+YPY+  +          + AK+   +    +  D ++  + S P+ V ++      + 
Sbjct: 210 ANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQF 269

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y G     +   C   +LDH V  VGYG  +G   W+V+NSWG    + GY ++ER   A
Sbjct: 270 YSGGVFTGD---CGT-ELDHGVTAVGYGTSDGTKYWLVKNSWGTSWGEDGYIRMERDIEA 325

Query: 329 ----CGI 331
               CGI
Sbjct: 326 KEGLCGI 332


>gi|426252096|ref|XP_004019754.1| PREDICTED: cathepsin F isoform 2 [Ovis aries]
          Length = 477

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 98/323 (30%), Positives = 149/323 (46%), Gaps = 37/323 (11%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E   R   F  +     +          YG +  SD 
Sbjct: 173 SVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKFSDL 232

Query: 87  SPQE---ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           + +E   I     L+       RL      V            P   DWR      +  V
Sbjct: 233 TEEEFRTIYLNPLLKDAPGRNMRLAQPVTDVP-----------PPQWDWRNKGA--VTDV 279

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           + QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  ++
Sbjct: 280 KDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNAYSAIR 339

Query: 204 QYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVY 260
             G LE++ DY YR        C++  EKAKV++ D+   S  +  +   L + GPI V 
Sbjct: 340 TLGGLETEDDYSYRGHLQT---CSFSAEKAKVYINDSVELSKNEQKLAAWLAKKGPISVA 396

Query: 261 LNHRLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
           +N   ++ Y     +P+R     C+P  +DHAV +VGYG ++    W ++NSWG    + 
Sbjct: 397 INAFGMQFYRHGISHPLRP---LCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTNWGEE 453

Query: 318 GYFQIERGANACGIESYAYLASV 340
           GY+ + RG+ ACG+   A  A +
Sbjct: 454 GYYYLHRGSGACGVNIMASSAVI 476


>gi|426252094|ref|XP_004019753.1| PREDICTED: cathepsin F isoform 1 [Ovis aries]
          Length = 460

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 98/323 (30%), Positives = 149/323 (46%), Gaps = 37/323 (11%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           S+K    FK ++  +NRTY    E   R   F  +     +          YG +  SD 
Sbjct: 156 SVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKFSDL 215

Query: 87  SPQE---ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           + +E   I     L+       RL      V            P   DWR      +  V
Sbjct: 216 TEEEFRTIYLNPLLKDAPGRNMRLAQPVTDVP-----------PPQWDWRNKGA--VTDV 262

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
           + QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  +  C GG    A+  ++
Sbjct: 263 KDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNAYSAIR 322

Query: 204 QYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVY 260
             G LE++ DY YR        C++  EKAKV++ D+   S  +  +   L + GPI V 
Sbjct: 323 TLGGLETEDDYSYRGHLQT---CSFSAEKAKVYINDSVELSKNEQKLAAWLAKKGPISVA 379

Query: 261 LNHRLIESYD---GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
           +N   ++ Y     +P+R     C+P  +DHAV +VGYG ++    W ++NSWG    + 
Sbjct: 380 INAFGMQFYRHGISHPLRP---LCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTNWGEE 436

Query: 318 GYFQIERGANACGIESYAYLASV 340
           GY+ + RG+ ACG+   A  A +
Sbjct: 437 GYYYLHRGSGACGVNIMASSAVI 459


>gi|30388235|gb|AAH51665.1| CDNA sequence BC051665 [Mus musculus]
          Length = 330

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 87/236 (36%), Positives = 128/236 (54%), Gaps = 22/236 (9%)

Query: 117 FLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLS 176
              E   G +PKS+DWR      + PV+ QG CGSCWAF+    LE Q+      L PLS
Sbjct: 105 IFQEPLLGDVPKSVDWRDHGY--VTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLS 162

Query: 177 KSQLVEC--DHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAK 233
           +  L++C   +GN+ CNGG +++AF+YVK+  GL+++  Y Y   E     C Y+ + + 
Sbjct: 163 EQNLMDCSWSYGNVGCNGGLMELAFQYVKENRGLDTRESYAY---EAWDGPCRYDPKYSA 219

Query: 234 V----FVQDTWVTSGVDHMMHLLQS-GP--IGVYLNHRLIESYDGNPIRRNDWACNPHKL 286
           V    FV+   V    D +M+ + S GP  +G+  +H     Y G      D  C+   L
Sbjct: 220 VNITGFVK---VPLSEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPD--CSSTNL 274

Query: 287 DHAVAIVGYGEK-NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           DHAV +VGYGE+ +G   W+V+NSWG+     GY ++ +   N CGI +YA   +V
Sbjct: 275 DHAVLVVGYGEESDGRKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYPTV 330


>gi|308322047|gb|ADO28161.1| cathepsin H [Ictalurus furcatus]
          Length = 326

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 98/304 (32%), Positives = 146/304 (48%), Gaps = 19/304 (6%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTG 101
            FKT++ + N+ Y  + E   R + F ++ K+ D +   +  + +    + Q + +    
Sbjct: 27  VFKTWMSEHNKQYGLE-EYYQRLQIFTENKKKIDTH---NAGNHKFRMGLNQFSDMTFAE 82

Query: 102 KEKERL--EADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
            +K  L  E       K  + R  G  P S+DWR+ K   +  V++QG CGSCW F+TT 
Sbjct: 83  FKKFYLLKEPQECNATKGNHVRGVGLYPDSIDWRK-KGNYVTEVKNQGACGSCWTFSTTG 141

Query: 160 ILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYR 216
            LES  A+    L  L++ QLV+C     N  CNGG    AFEY+    GL ++ DYPY 
Sbjct: 142 CLESVTAIATGKLPLLAEQQLVDCAGAFNNHGCNGGLPSQAFEYIMYNKGLMTEDDYPYV 201

Query: 217 NKENITFRCTYEKEKAKVFVQDTWVTSGVDHM-----MHLLQSGPIGVYLNHRLIESYDG 271
            ++     C ++ + A  FV+D    +  D M     +  L    I   +    +   DG
Sbjct: 202 GRDG---PCKFDPKLAAAFVKDVVNITKYDEMGIVDAVARLNPVSIAFEVLPEFMHYKDG 258

Query: 272 NPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGI 331
                N+       ++HAV  VGY E+NG   WIV+NSWG      GYF IERG N CG+
Sbjct: 259 -VYTSNECHNTTETVNHAVLAVGYAEENGTPYWIVKNSWGPQWGIDGYFYIERGQNMCGL 317

Query: 332 ESYA 335
            + A
Sbjct: 318 AACA 321


>gi|343472324|emb|CCD15484.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 95/317 (29%), Positives = 150/317 (47%), Gaps = 30/317 (9%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           +Q  AFK    K++R+Y D  E   RF  FKQ+ +   E         +G +  SD SP+
Sbjct: 39  QQFAAFKQ---KYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPE 95

Query: 90  EILQRTGLRLT-GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E       R T     E   A  +R +K +     G  P+++DWR  K   + PV+ QG+
Sbjct: 96  E------FRATYHNGAEYYAAALKRPRKVVT-VSTGKAPEAVDWR--KKGAVTPVKDQGQ 146

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQY 205
           CGSCWAF+    +E Q  +    L  LS+  LV CD     C GG +D AF ++    + 
Sbjct: 147 CGSCWAFSAIGNIEGQWKVAGHELTSLSEQTLVSCDPTEYACEGGFMDNAFRWIISSNKG 206

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            + ++  YPY +       C    +     + D       ++ +   L ++GP+ V ++ 
Sbjct: 207 KVFTEQSYPYSSGGRNVPACNMSGKVVGANISDYVDLPQDENAIAEWLAKNGPVSVIVDA 266

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
              +SY G  +     +C    L+HAV +VGY + +    WI++NSW +   + GY +IE
Sbjct: 267 TSFQSYTGGVLT----SCLSKILNHAVLLVGYDDTSKPPYWIIKNSWSEKWGEKGYIRIE 322

Query: 324 RGANACGIESYAYLASV 340
           +G N C ++ YA  A V
Sbjct: 323 KGTNQCLVQEYASSALV 339


>gi|121531600|gb|ABM55485.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
          Length = 326

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 107/329 (32%), Positives = 162/329 (49%), Gaps = 38/329 (11%)

Query: 25  AIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQ------------DGK 72
           AI+    +A  +    D +  +     +TY +  E KTRF  F++            D  
Sbjct: 5   AIFATVLIAVTASTNEDQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIKEHNARYDKG 64

Query: 73  ETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDW 132
           E     G +  +D + +E   +  L+   K K RL A      + L       +P S+DW
Sbjct: 65  EETYLLGVTRFADLTHEEF--KDILKGQIKNKPRLNATPTVFPEDLE------VPDSIDW 116

Query: 133 RQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNC 190
            + K  VL  V+ Q  CGSCWAF+ T  LE Q A+L      LS+ QL++C   +GN NC
Sbjct: 117 TE-KGAVLE-VKDQNPCGSCWAFSATGALEGQNAILNNVKISLSEQQLLDCSAAYGNGNC 174

Query: 191 N-GGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM 248
             GG++  AFEYV+ YG++S+  YPY  K+     C Y+  K  + ++    VT+  + +
Sbjct: 175 KEGGDMSAAFEYVRDYGIQSEKSYPYIRKQT---ECQYDASKTILKIKGYKNVTTSEEGL 231

Query: 249 MHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG----ILT 303
              + + GPI + +N   ++ Y    I      C+ H LDH V +VGYG+ +        
Sbjct: 232 RKAVGAIGPISIAMNSDPLQLYYSGIISGK--GCS-HDLDHGVLVVGYGKASQWSGETKF 288

Query: 304 WIVRNSWGDIGPDHGYFQIERGA-NACGI 331
           W V+NSWG I  ++GYF+I+R A N CGI
Sbjct: 289 WRVKNSWGKIWGENGYFRIKRDANNLCGI 317


>gi|42564153|gb|AAS20589.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 322

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 104/301 (34%), Positives = 156/301 (51%), Gaps = 37/301 (12%)

Query: 52  RTYTDDNEIKTRFEYFK-------------QDGKETDEYYGTSGSSDRSPQEILQRTGLR 98
           +TY    E +TRF  F+             ++GK T  Y   +  +D +  E  ++ GL+
Sbjct: 32  KTYKSLLEERTRFGIFQNNLRTIEKHNAKYEEGKVT-YYMAVTQFADMTRDEFRKKLGLQ 90

Query: 99  LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
                +  L A      + L       LP+ +DW + K  VL PV++QG C SCWAF+TT
Sbjct: 91  --NNRRPNLNATLRVFPEDLE------LPEQIDWTE-KGAVL-PVKNQGNCRSCWAFSTT 140

Query: 159 AILESQVALLKKTLYPLSKSQLVECD--HGNLNCN-GGNIDVAFEYVKQYGLESQADYPY 215
             LE Q A+  K   PLS+ QL++C   +GN +C+ GG +  AF+Y+   G+E+++ YPY
Sbjct: 141 GSLEGQNAIHNKVKTPLSEQQLLDCSASYGNGDCDDGGLMTEAFDYIIDNGIEAESSYPY 200

Query: 216 RNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMHLLQS-GPIGVYLNHRLIESYDGNP 273
              E +T  C Y+ +K  V ++    + +  D +   + + GPI V ++   +  Y G  
Sbjct: 201 V--EQMT-ECQYDAKKTIVQIKGYKKLLADEDELKKAVGTVGPISVGMSSENLHMYGGGV 257

Query: 274 IRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIE 332
           +   D  C    +DHAV +VGYGE NG   W V+NSWG    + GYF+IER A N C I 
Sbjct: 258 L---DDQCY-FGMDHAVLVVGYGEANGKKFWKVKNSWGTTWGEDGYFRIERDADNLCDIA 313

Query: 333 S 333
           S
Sbjct: 314 S 314


>gi|343477445|emb|CCD11724.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
          Length = 380

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 88/301 (29%), Positives = 143/301 (47%), Gaps = 27/301 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQEILQR 94
           F  +  K++R+Y D  E   RF  FKQ+ +   E         +G +  SD SP+E    
Sbjct: 41  FAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEE---- 96

Query: 95  TGLRLT-GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
              R T     E   A  +R +K +N    G  P+++DWR  K   + PV+ QG+CGSCW
Sbjct: 97  --FRATYHNGAEYYAAALKRPRKVVN-VSTGKAPEAVDWR--KKGAVTPVKDQGQCGSCW 151

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQYGLESQ 210
           AF+    +E Q  +    L  LS+  LV CD  +  C GG +D AF+++    +  + ++
Sbjct: 152 AFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTNDFGCEGGLMDDAFKWIVSSNKGNVFTE 211

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIES 268
             YPY +       C    +     ++D       ++ +   L ++GP+ + ++    +S
Sbjct: 212 QSYPYASGGGNVPACDKSGKVVGAKIRDHVDLPEDENAIAEWLAKNGPVAIAVDATSFQS 271

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y G  +     +C    LDH V +VGY + +    WI++NSW     + GY +IE+G N 
Sbjct: 272 YTGGVLT----SCISEHLDHGVLLVGYDDTSKPPYWIIKNSWSKGWGEEGYIRIEKGTNQ 327

Query: 329 C 329
           C
Sbjct: 328 C 328


>gi|269954686|ref|NP_954599.2| uncharacterized protein LOC218275 precursor [Mus musculus]
          Length = 330

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 87/236 (36%), Positives = 128/236 (54%), Gaps = 22/236 (9%)

Query: 117 FLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLS 176
              E   G +PKS+DWR      + PV+ QG CGSCWAF+    LE Q+      L PLS
Sbjct: 105 IFQEPLLGDVPKSVDWRDHGY--VTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLS 162

Query: 177 KSQLVEC--DHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAK 233
           +  L++C   +GN+ CNGG +++AF+YVK+  GL+++  Y Y   E     C Y+ + + 
Sbjct: 163 EQNLMDCSWSYGNVGCNGGLMELAFQYVKENRGLDTRESYAY---EAWDGPCRYDPKYSA 219

Query: 234 V----FVQDTWVTSGVDHMMHLLQS-GP--IGVYLNHRLIESYDGNPIRRNDWACNPHKL 286
           V    FV+   V    D +M+ + S GP  +G+  +H     Y G      D  C+   L
Sbjct: 220 VNITGFVK---VPLSEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPD--CSSTNL 274

Query: 287 DHAVAIVGYGEK-NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           DHAV +VGYGE+ +G   W+V+NSWG+     GY ++ +   N CGI +YA   +V
Sbjct: 275 DHAVLVVGYGEESDGRKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYPTV 330


>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
 gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
          Length = 496

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 95/331 (28%), Positives = 160/331 (48%), Gaps = 46/331 (13%)

Query: 32  LAYDSIKQVD--------------AFKTYIVKWNRTYTDDNEIKTRFEYFK--------- 68
           LAY+S+K +                FK ++  + + Y  + E+  R++ FK         
Sbjct: 171 LAYNSVKLLKFIRSQSEEERTLWMQFKEFLKTFKKWYLSEKELLKRYDIFKVNMKTVEML 230

Query: 69  QDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPK 128
           Q  ++    YG +  +D +P+E  +     L+ + K      R+++ +      KG +  
Sbjct: 231 QKNEQGTAVYGVTFFADLTPEEFRK---FYLSPQWK------RDQLPQRKASIPKGKIED 281

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNL 188
             DWR+     +  V++QG CGSCWAFAT A +E   A+ K  L  LS+ +LV+CD  + 
Sbjct: 282 RWDWREHNA--VTEVKNQGMCGSCWAFATIANVEGVWAVKKGELVSLSEQELVDCDTLDQ 339

Query: 189 NCNGGNIDVAF-EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
            C+GG    A+ E ++  GL ++ +Y Y   +     C ++ + AKV++ D+ V+   D 
Sbjct: 340 GCSGGYPSNAYKEIIRLGGLTTETNYSYDGNQGT---CRFKTQNAKVYINDS-VSLPEDE 395

Query: 248 M---MHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG----EKNG 300
                ++ ++GP+ V +N   +  Y         + C+P  LDH VAIVGY      K  
Sbjct: 396 TEIAAYIRENGPVAVGINAFAMMFYRHGIAHPWRFLCSPDALDHGVAIVGYDVEKQSKKP 455

Query: 301 ILTWIVRNSWGDIGPDHGYFQIERGANACGI 331
              WI++NSWG    + GY+ + RGA  CG+
Sbjct: 456 KPYWIIKNSWGTHWGEGGYYMLYRGAGVCGV 486


>gi|224069140|ref|XP_002326284.1| predicted protein [Populus trichocarpa]
 gi|118482340|gb|ABK93094.1| unknown [Populus trichocarpa]
 gi|222833477|gb|EEE71954.1| predicted protein [Populus trichocarpa]
          Length = 358

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 81/220 (36%), Positives = 118/220 (53%), Gaps = 18/220 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP++ DWR+  +  ++PV++QG CGSCW F+TT  LE+           LS+ QLV+C  
Sbjct: 140 LPETKDWREEGI--VSPVKNQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAR 197

Query: 186 G--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKV-FVQDTWV 241
              N  CNGG    AFEY+K   GL+++  YPY  K++    C +  E   V  V+   +
Sbjct: 198 AFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKDDA---CKFSSENVGVRVVESVNI 254

Query: 242 TSGV-DHMMHLLQ-SGPIGVYL----NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
           T G  D + H +    P+ V      + RL   Y       +     P  ++HAV  VGY
Sbjct: 255 TLGAEDELKHAVAFVRPVSVAFEVVGSFRL---YKEGVYTTSTCGSTPMDVNHAVLAVGY 311

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           G +NGI  W+++NSWG+   D+GYF++E G N CGI + A
Sbjct: 312 GVENGIPYWLIKNSWGEDWGDNGYFKMEMGKNMCGIATCA 351


>gi|340375899|ref|XP_003386471.1| PREDICTED: probable cysteine proteinase A494-like [Amphimedon
           queenslandica]
          Length = 373

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 88/252 (34%), Positives = 132/252 (52%), Gaps = 53/252 (21%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--- 183
           P + DWR   V  +  V++QG  G+CWAF+T   +E Q AL    L  LS  QLV+C   
Sbjct: 120 PNTFDWRDKHV--VTSVKNQGSAGTCWAFSTVGNVEGQWALGGHNLTSLSTEQLVDCDDT 177

Query: 184 -DHGNLNCN----GGNIDVAFEYVK-QYGLESQADYPYRNKE------------------ 219
            DH NL+ +    GG   +A+EY+K + G+E + DYPY + +                  
Sbjct: 178 YDHNNLHMDCGVFGGWPYLAYEYIKNEGGIEREEDYPYCSGQGTCFPCVPSGWNKTRCGP 237

Query: 220 -----NITFRCTYEKEKAKVFVQ----DTWVT---SGVDHMMHLLQSGPIGVYLNHRLIE 267
                N TF CT++ +K+K FVQ     +W+      V+    L++ GP+ V +N  L++
Sbjct: 238 PPLYCNDTFSCTHKLDKSK-FVQGLSIKSWIAIQKDEVEMQAALIKQGPLSVLINALLLQ 296

Query: 268 SYDG---NPIRRNDWACNPHKLDHAVAIVGYGEKNGILT----WIVRNSWGDIGPDHGYF 320
            Y     +PI +    CNP +LDHAV +VGYG + G+L     W+++NSWG      GYF
Sbjct: 297 FYRSGVWDPILK----CNPQELDHAVLLVGYGTEKGLLEDKPYWLIKNSWGIKWGMDGYF 352

Query: 321 QIERGANACGIE 332
           ++ RG   CG++
Sbjct: 353 KMIRGKGKCGVD 364


>gi|10798511|emb|CAC12806.1| cathepsin L1 [Fasciola hepatica]
          Length = 311

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 83/221 (37%), Positives = 123/221 (55%), Gaps = 12/221 (5%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    +KT    S+ QLV+C  
Sbjct: 93  VPDKIDWRESGY--VTGVKDQGNCGSCWAFSTTGTMEGQYMKNEKTSISFSEQQLVDCSG 150

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN  C+GG ++ A+EY+K++GLE+++ YPYR  E    +C Y ++     V   + V 
Sbjct: 151 PWGNNGCSGGLMENAYEYLKRFGLETESSYPYRAVEG---QCRYNEQLGVAKVTGYYTVH 207

Query: 243 SGVD-HMMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
           SG +  + +L+ S GP  + +          + I ++   C P  L+HAV  VGYG ++G
Sbjct: 208 SGSEVELKNLVGSEGPAAIAVEAESDFMMYRSGIYQSQ-TCLPFALNHAVLAVGYGTQDG 266

Query: 301 ILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
              WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 267 TDYWIVKNSWGLSWGERGYIRMARNRGNMCGIASLASLPMV 307


>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 86/253 (33%), Positives = 127/253 (50%), Gaps = 21/253 (8%)

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
           +   TG R++G  K        +   FL     G LPK++DWR      + PV+ QG+CG
Sbjct: 89  VAMMTGFRVSGTSKA------AKGSTFLPPNNVGELPKTVDWRTKGY--VTPVKDQGQCG 140

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEY-VKQYGLES 209
           SCWAF+TT  +E Q       L  LS+  LV+C   +  C+GG +D AF+Y +   G+++
Sbjct: 141 SCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDCSGRDAGCDGGFMDRAFQYIIDAGGIDT 200

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGVYLN--HR 264
           +A YPY+  +    +C ++K      V   T VTSG +  +   +   GPI V ++  H 
Sbjct: 201 EASYPYKAVDG---KCHFKKANVGATVTGYTDVTSGSEKALQKAVAHVGPISVAIDASHM 257

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIE 323
             + Y       N+  C+   LDH V  VGYG   +G   WIV+NSW +    +GY  + 
Sbjct: 258 SFQHYKSGVY--NEPGCDSTVLDHGVLAVGYGTSSDGTDYWIVKNSWAETWGMNGYVWMS 315

Query: 324 RGA-NACGIESYA 335
           R   N CGI + A
Sbjct: 316 RNKDNQCGIATNA 328


>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
          Length = 442

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 85/308 (27%), Positives = 142/308 (46%), Gaps = 25/308 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F+ +     R Y   +E + RFE F  + K+  E         +G +  +D S +E   R
Sbjct: 25  FRDFKTTHARNYASADEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEFQTR 84

Query: 95  -TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
               R       R     +  K F  E     + + +DWR      + PV++QG CGSCW
Sbjct: 85  HNAARHYAAVMAR---PPKNTKTFTEEEINAAVGQKVDWRLKGA--VTPVKNQGSCGSCW 139

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY---GLESQ 210
           +F+TT  +E Q A+    L  LS+ +LV CD  +  C+GG +D AF ++       + ++
Sbjct: 140 SFSTTGNIEGQHAIATGQLVSLSEQELVSCDTVDDGCSGGLMDNAFGWLLSAHNGQITTE 199

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWV----TSGVDHMMHLLQSGPIGVYLNHRLI 266
           A YPY +   I   CT+      V    T       +  D    + + GP+ + ++    
Sbjct: 200 ASYPYVSGNGIVPACTFNSNSNPVGATITSFHDIPKTERDMAAFVFKYGPLSIGVDASSW 259

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
           +SY G  +      C+  ++DH V IVG+ +      WI++NSW  +  + GY ++ +G+
Sbjct: 260 QSYIGGILSH----CSDVQIDHGVLIVGFDDTASTPYWIIKNSWSSMWGEQGYIRVAKGS 315

Query: 327 NACGIESY 334
           N CG+ S+
Sbjct: 316 NQCGLTSF 323


>gi|443685370|gb|ELT89004.1| hypothetical protein CAPTEDRAFT_95613, partial [Capitella teleta]
          Length = 295

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 84/241 (34%), Positives = 125/241 (51%), Gaps = 23/241 (9%)

Query: 110 DRERVKKFLNERKKGP-----LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQ 164
           +R +V+  L+     P     +P  +DWR  K   + PV++QG+CGSCWAF+    LE Q
Sbjct: 58  NRTKVRDHLHSHYISPAIPVSVPAEVDWR--KKGYVTPVKNQGQCGSCWAFSAIGALEGQ 115

Query: 165 VALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENI 221
                  L  LS+  LV+C   +GN  CNGG +D AF+Y+K   G +++A YPY   E +
Sbjct: 116 HFRKTGKLVSLSEQNLVDCSKSYGNNGCNGGVMDYAFKYIKDNDGDDTEACYPY---EAV 172

Query: 222 TFRCTYEKE----KAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIR 275
              C +++E      + +    W    V     +   GP+ V ++  H    SY G    
Sbjct: 173 DGMCRFKRECVGATCRGYTDLPWGNE-VKMKEAVALVGPVSVAIDASHSSFMSYKGGVYV 231

Query: 276 RNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESY 334
             +  C+P++LDH V +VGYG + G+  W+V+NSWG    D GY ++ R   N CGI S 
Sbjct: 232 EKE--CSPYQLDHGVLVVGYGTEQGLDYWLVKNSWGTTWGDQGYIKMARNMHNHCGIASM 289

Query: 335 A 335
           A
Sbjct: 290 A 290


>gi|268562090|ref|XP_002638496.1| Hypothetical protein CBG12926 [Caenorhabditis briggsae]
          Length = 382

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 100/338 (29%), Positives = 155/338 (45%), Gaps = 42/338 (12%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQD----------GKET--DEYYGTSGSSDRSPQ 89
           AFK + +K+NR Y D +E + RF  F +            KE+  D  +G +  +D +  
Sbjct: 41  AFKEFKIKYNRKYKDASETQMRFNQFVKSYNKVNDLNAKAKESGYDTKFGINKFADLTEG 100

Query: 90  EILQR--------TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKV---K 138
           E   R        TG+ +   EK      +  V K  ++R+    P   D R++ V    
Sbjct: 101 EFSGRLSHVVPNNTGVPVLDLEKPFFR--QAVVNKTRHKRRSTKYPDYFDLRKTLVNGES 158

Query: 139 VLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-HGNLNCNGGNIDV 197
           ++ P++ QG C  CW FA   ++ES  AL       LS  +L +C  +G   C GG++  
Sbjct: 159 IIGPIKDQGNCACCWGFAIAGLVESVNALHSNRFRSLSDQELCDCGTNGTPGCKGGSLQN 218

Query: 198 AFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD------HMMHL 251
             +YV +YGL +  DYPY +    T +    +E  +V  + T+  + V+       +M +
Sbjct: 219 GVDYVNRYGLSADDDYPYDDTRAFTSKRCRVRETRRVVKERTFTYAAVNARKAEQQIMEV 278

Query: 252 LQ--SGPIGVYLN-HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG-----ILT 303
           L   + P+ VY       +SY+   I  +D  C   K  HA  IVGY   +         
Sbjct: 279 LTKWNVPVAVYFKVGDRFKSYEQGVIVEDD--CRGAKDWHAGLIVGYDSISNSRGREYPY 336

Query: 304 WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
           WIV+NSWG+   + GYF++ RG N C IES  Y   +K
Sbjct: 337 WIVKNSWGNWAEEDGYFRVIRGENWCSIESNGYAGDMK 374


>gi|225444726|ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
 gi|147826441|emb|CAN62278.1| hypothetical protein VITISV_031382 [Vitis vinifera]
 gi|297738562|emb|CBI27807.3| unnamed protein product [Vitis vinifera]
          Length = 362

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 91/310 (29%), Positives = 149/310 (48%), Gaps = 27/310 (8%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRS--PQEILQRT 95
           +   +F ++  ++ ++Y   +EIK RFE F ++ K          S++R   P  +    
Sbjct: 58  RHAHSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIR-------STNRKGLPYTLAVNQ 110

Query: 96  GLRLTGKE--KERLEADRERVKKFLNERKKGP--LPKSLDWRQSKVKVLNPVESQGRCGS 151
               T +E  + RL A +          K     LP++ DWR+  +  ++P++ QG CGS
Sbjct: 111 FADWTWEEFRRHRLGAAQNCSATLKGNHKLTDVILPETKDWREDGI--VSPIKDQGHCGS 168

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLE 208
           CW F+TT  LE+  A        LS+ QLV+C     N  C+GG    AFEY+K   GL+
Sbjct: 169 CWTFSTTGALEAAYAQAFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYIKYNGGLD 228

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGVYLNH 263
           ++  YPY   +     C +  E   V V D+  +T G +    H +  ++   +   + H
Sbjct: 229 TEEAYPYTGLDGT---CKFSSENIGVQVLDSVNITLGAEDELKHAVAFVRPVSVAFEVVH 285

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
                Y             P  ++HAV  VGYG ++G+  W+++NSWG+   D+GYF++E
Sbjct: 286 DF-RFYKKGVYTSGTCGSTPMDVNHAVLAVGYGVEDGVAYWLIKNSWGENWGDNGYFKME 344

Query: 324 RGANACGIES 333
            G N CG+ +
Sbjct: 345 LGKNMCGVAT 354


>gi|294890024|ref|XP_002773045.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239877748|gb|EER04861.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 329

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 85/247 (34%), Positives = 124/247 (50%), Gaps = 23/247 (9%)

Query: 107 LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVA 166
           L+    R  +F+ E     LP S+DWR   V  L PV++QG CGS WAF+TT  L +Q A
Sbjct: 92  LKMSTRRDDEFVVEADTTQLPTSVDWRNKSV--LTPVKNQGSCGSSWAFSTTGALGAQYA 149

Query: 167 LLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFR 224
           +    L  LS+ +LV+C   +GN  C GG +  A+EY+ Q GL+ ++ YPY+  +   FR
Sbjct: 150 IATGKLLSLSEQELVDCSLKYGNDGCIGGYMGAAYEYINQAGLDQESTYPYKGWDEPCFR 209

Query: 225 CTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRR-------N 277
            + EK+   + V+    T     +M  L   P+ V +       Y  +P  R       +
Sbjct: 210 SS-EKKADGIPVRFVLNTKTEQSLMKALADAPVSVGM-------YASDPNFRFYRSGVYS 261

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA---CGIESY 334
              CN  + DHAV  VGYG   G   +I++NSWG      GYF ++RG      C I  Y
Sbjct: 262 STTCN-GETDHAVVAVGYGADKGSDYFILKNSWGSKWGIGGYFFLKRGVGGHGECNILEY 320

Query: 335 AYLASVK 341
             + ++K
Sbjct: 321 MLVPTLK 327


>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
 gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 83/218 (38%), Positives = 118/218 (54%), Gaps = 15/218 (6%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P SLDWR  K  V+  V+ QG CGSCW+F+TT  +E   A++   L  LS+ +LV+CD  
Sbjct: 142 PSSLDWR--KKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTT 199

Query: 187 NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--VTS 243
           N  C GG +D AFE+V    G++++A+YPY   +     C   KE+ KV   D +  V  
Sbjct: 200 NYGCEGGYMDYAFEWVINNGGIDTEANYPYTGVDGT---CNTTKEEIKVVSIDGYTDVDE 256

Query: 244 GVDHMMHLLQSGPIGVYLNHRLI--ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGI 301
               ++      PI V ++   +  + Y G  I   D + +P+ +DHAV IVGYG +NG 
Sbjct: 257 TDSALLCATVQQPISVGMDGSALDFQLYTGG-IYDGDCSDDPNDIDHAVLIVGYGSENGE 315

Query: 302 LTWIVRNSWGDIGPDHGYFQIERGAN----ACGIESYA 335
             WIV+NSWG      GYF I+R  +     C I + A
Sbjct: 316 DYWIVKNSWGTEWGMEGYFYIKRNTDLPYGVCAINAEA 353


>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
 gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
           Precursor
 gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
 gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
 gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
          Length = 356

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 101/314 (32%), Positives = 145/314 (46%), Gaps = 26/314 (8%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K ++ F+ +I  + + Y    E   RFE FK + K  DE        + G +  +D S +
Sbjct: 46  KLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLGLNEFADLSHE 105

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E  +       G + + +  D ER       R    +PKS+DWR  K   +  V++QG C
Sbjct: 106 EFKKM----YLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWR--KKGAVAEVKNQGSC 159

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYGL 207
           GSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AFEY VK  GL
Sbjct: 160 GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGL 219

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNH--RL 265
             + DYPY  +E        E E   +       T+    ++  L   P+ V ++   R 
Sbjct: 220 RKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGRE 279

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
            + Y G      D  C    LDH VA VGYG   G    IV+NSWG    + GY +++R 
Sbjct: 280 FQFYSGGVF---DGRCGV-DLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGYIRLKRN 335

Query: 326 ANA----CGIESYA 335
                  CGI   A
Sbjct: 336 TGKPEGLCGINKMA 349


>gi|123498602|ref|XP_001327438.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|452292|emb|CAA54435.1| cysteine proteinase, putative [Trichomonas vaginalis]
 gi|121910367|gb|EAY15215.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 309

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 117/224 (52%), Gaps = 17/224 (7%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P S+DWR+  V  +NP++ QG+CGSCW F+T   +ESQ A+    LY LS+  LV+C   
Sbjct: 91  PASIDWREKGV--VNPIKDQGQCGSCWTFSTIQAMESQWAVKHTKLYSLSEQNLVDCVTT 148

Query: 187 NLNCNGGNIDVAFEYVKQY---GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
              CNGG +++A++YVK Y      ++ADYPY+    I   C +   K        ++T 
Sbjct: 149 CYGCNGGLMELAYDYVKTYQKGKFMTEADYPYK---AIDQSCKFNAAKVAEPTVTGYITV 205

Query: 244 G----VDHMMHLLQSGP--IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
                 D M  + Q GP  I +  +H   + Y       +  +C+P  LDHAV  VGYG 
Sbjct: 206 TEGDEKDLMNKVAQYGPAAIAIDASHYSFQLYSSGIYDES--SCSPEGLDHAVGCVGYGS 263

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQ-IERGANACGIESYAYLASV 340
           +     WIVRNSWG    + GY + I+   N CG  S A + +V
Sbjct: 264 EGSKNYWIVRNSWGVSWGEKGYIRMIKDKNNQCGEASAACIPTV 307


>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
 gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 92/315 (29%), Positives = 152/315 (48%), Gaps = 36/315 (11%)

Query: 46  YIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-RLTGKEK 104
           ++ ++ R Y DD E +TR+  FK++    D +   +G S         + G+ +      
Sbjct: 42  WMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKS--------YKLGVNQFADLSN 93

Query: 105 ERLEADRERVKKFLNERKKGP--------LPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
           E  +A R R K  +   + GP        +P ++DWR  K   + PV+ QG+CG CWAF+
Sbjct: 94  EEFKASRNRFKGHMCSPQAGPFRYENVSAVPATMDWR--KKGAVTPVKDQGQCGCCWAFS 151

Query: 157 TTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQ-YGLESQADY 213
             A +E    L    L  LS+ ++V+CD    +  CNGG +D AF++++Q  GL ++A+Y
Sbjct: 152 AVAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANY 211

Query: 214 PYRNKENITFRCTYEKE---KAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIE-SY 269
           PY   +     C  +KE    AK+   +    +    +M  +   P+ V ++    E  +
Sbjct: 212 PYTGTDGT---CNTQKEATHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQF 268

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA- 328
             + I     +C   +LDH V  VGYG  +G   W+V+NSWG    + GY ++++  +A 
Sbjct: 269 YSSGIFTG--SCGT-QLDHGVTAVGYGISDGTKYWLVKNSWGAQWGEEGYIRMQKDISAK 325

Query: 329 ---CGIESYAYLASV 340
              CGI   A   S 
Sbjct: 326 EGLCGIAMQASYPSA 340


>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
           sativus]
          Length = 235

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 80/221 (36%), Positives = 120/221 (54%), Gaps = 21/221 (9%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
           K  LP+++DWRQ     +N +++QG CGSCWAF+T A++E    ++   L  LS+ +LV+
Sbjct: 1   KEALPETVDWRQKGA--VNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVD 58

Query: 183 CDHG-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           CD   N  CNGG +D AF+++ K  GL ++ DYPYR  +    +C    + +KV   D +
Sbjct: 59  CDKSYNQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRGSDG---KCNSLLKNSKVVTIDGY 115

Query: 241 ---VTSGVDHMMHLLQSGPIGVYLNH--RLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
               T+    +   +   P+ V ++   R+ + Y           C   K+DHAV  VGY
Sbjct: 116 EDVPTNDETALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGE---CGT-KMDHAVVAVGY 171

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERG-----ANACGI 331
           G +NG+  WIVRNSWG    + GY +IER      +  CGI
Sbjct: 172 GSENGVDYWIVRNSWGQKWGEDGYIRIERNLASSKSGKCGI 212


>gi|42564159|gb|AAS20591.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 326

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 103/333 (30%), Positives = 164/333 (49%), Gaps = 38/333 (11%)

Query: 25  AIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE-------- 76
           A++    LA +++   D +  +     +TY    E +TRF  F+ + ++ +E        
Sbjct: 5   AVFASVLLAVNALTDKDQWVAFKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKG 64

Query: 77  ----YYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDW 132
               + G +  +D +  E   +  LR   K K  +EA      + L       +P S+DW
Sbjct: 65  EESYFLGVTPFADLTHDEF--KDELRRQIKTKPNVEATLAVFPEGLE------VPDSIDW 116

Query: 133 RQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNC 190
            Q K  VL+ V+ QG CGSCWAF+ T  LE Q A++     PLS+ QL++C   +GN +C
Sbjct: 117 TQ-KGAVLD-VKYQGGCGSCWAFSATGALEGQNAIVNNVKIPLSEQQLLDCSKPYGNDDC 174

Query: 191 -NGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM 249
            +GG +  AF+YV   G+E+ + YPY+    I   C Y+ +K  + ++     S  +  +
Sbjct: 175 EHGGLMSFAFDYVLDKGIEADSSYPYK---GIDTPCQYDAKKTVLKIKGYKNVSNSEEEL 231

Query: 250 H--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT---- 303
              +   GP+ V ++   I+ Y G  +   D     H L+H V  VGYGE++ +      
Sbjct: 232 KKAVGTVGPVSVAIDADPIQLYFGGIL---DGLFCTHNLNHGVLAVGYGEEDHLFGKKKF 288

Query: 304 WIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           W V+NSWG    + GYF+I+R A N CGI   A
Sbjct: 289 WKVKNSWGKDWGEQGYFRIKRDANNLCGIADKA 321


>gi|443694581|gb|ELT95681.1| hypothetical protein CAPTEDRAFT_173171 [Capitella teleta]
          Length = 342

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 96/266 (36%), Positives = 132/266 (49%), Gaps = 34/266 (12%)

Query: 84  SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           SD +P E  QR GLR    E+          KKF+   +K  +P+ +DWR      + PV
Sbjct: 92  SDLTPSEYRQRLGLRPALGERTG--------KKFVYNGEK--VPEHVDWRDKGY--VTPV 139

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEY 201
           ++QG CGSCWAF++T  LE Q   L   L  LS+  LV+C   +GN  CNGG +D AF Y
Sbjct: 140 KNQGACGSCWAFSSTGSLEGQHFRLTGQLVSLSEQNLVDCTKKYGNAGCNGGWMDNAFNY 199

Query: 202 VK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHM----MHLLQS-- 254
           VK   G++++A YPY   ++    C Y+             T  VD      + L Q+  
Sbjct: 200 VKANNGIDTEAFYPYEGHDDW---CGYDGSPGHKGAN---CTGHVDVQQGDELALKQAVA 253

Query: 255 --GP--IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
             GP  +G+   HR  + Y       ++ AC+    DHAV +VGYG + G   W+V+NSW
Sbjct: 254 TVGPVSVGIDATHRSFQLYKSGIY--DEVACSNSSTDHAVLVVGYGSQGGHDYWLVKNSW 311

Query: 311 GDIGPDHGYFQIERG-ANACGIESYA 335
           G      GY  + R   N C I SYA
Sbjct: 312 GTSWGMDGYIMMSRNKGNQCAIASYA 337


>gi|222628593|gb|EEE60725.1| hypothetical protein OsJ_14236 [Oryza sativa Japonica Group]
          Length = 364

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 101/320 (31%), Positives = 151/320 (47%), Gaps = 46/320 (14%)

Query: 39  QVDA---FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRT 95
           Q+DA   F ++  ++ RTY      +      + D   T   +G +  SD +P E   R 
Sbjct: 51  QLDAEAHFASFERRFGRTYPGPRRAR------RLDPTAT---HGVTKFSDLTPGEFRDRF 101

Query: 96  GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
            L L     E L          L       LP   DWR+     + PV+ QG CGSCW+F
Sbjct: 102 -LGLRRPSLEGLVGGEPHEAPILPTDG---LPDDFDWREHGA--VGPVKDQGSCGSCWSF 155

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYV-KQY 205
           +T+  LE    L    L  LS+ Q+V+CDH          +  CNGG +  AF Y+ K  
Sbjct: 156 STSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFSYLMKSG 215

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNH 263
           GL+S+ DYPY  +EN    C ++K K    V++  V S  +  +  +L++ GP+ + +N 
Sbjct: 216 GLQSEKDYPYAGRENT---CKFDKSKIVAQVKNFSVISVNEDQIAANLVKHGPLAIAINA 272

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGPD 316
             +++Y G       + C  H LDH V +VGYG              WI++NSWG+   +
Sbjct: 273 AYMQTYIGG--VSCPFICGRH-LDHGVLLVGYGSAGYAPIRFKEKPYWIIKNSWGENWGE 329

Query: 317 HGYFQIERGA---NACGIES 333
            GY++I RG    N CG++S
Sbjct: 330 KGYYKICRGPHDKNKCGVDS 349


>gi|74211558|dbj|BAE26509.1| unnamed protein product [Mus musculus]
          Length = 338

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 86/236 (36%), Positives = 128/236 (54%), Gaps = 22/236 (9%)

Query: 117 FLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLS 176
              E   G +PKS+DWR      + PV+ QG CGSCWAF+    LE Q+      L PLS
Sbjct: 113 IFQEPLLGDVPKSVDWRDHGY--VTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLS 170

Query: 177 KSQLVEC--DHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAK 233
           +  L++C   +GN+ CNGG +++AF+YVK+  GL+++  Y Y   +     C Y+ + + 
Sbjct: 171 EQNLMDCSWSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDG---PCRYDPKYSA 227

Query: 234 V----FVQDTWVTSGVDHMMHLLQS-GP--IGVYLNHRLIESYDGNPIRRNDWACNPHKL 286
           V    FV+   V    D +M+ + S GP  +G+  +H     Y G      D  C+   L
Sbjct: 228 VNITGFVK---VPLSEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPD--CSSTNL 282

Query: 287 DHAVAIVGYGEK-NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           DHAV +VGYGE+ +G   W+V+NSWG+     GY ++ +   N CGI +YA   +V
Sbjct: 283 DHAVLVVGYGEESDGRKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYPTV 338


>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
          Length = 525

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 100/314 (31%), Positives = 151/314 (48%), Gaps = 40/314 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQR-------- 94
           ++ ++ K  R      E + RFE FK + +  D +   + S  RS +  L R        
Sbjct: 50  YEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSFRLGLNRFADMTNEE 109

Query: 95  -----TGLR-LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
                 G R  + + + RL +DR R            LP+S+DWR      +  V+ QG 
Sbjct: 110 YRTVYLGTRPASHRRRARLGSDRYRYNAGEE------LPESVDWRDKGA--VTTVKDQGS 161

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYG 206
           CGSCWAF+T A +E    ++   L  LS+ +LV+CD+G N  CNGG +D AFE++    G
Sbjct: 162 CGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQNQGCNGGLMDYAFEFIINNGG 221

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL-- 261
           ++++ DYPY+ ++    +C   ++ AKV   D +    V+    +   + + P+ V +  
Sbjct: 222 IDTEEDYPYKARDG---KCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEA 278

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
             R  + Y           C    LDH V  VGYG +NG   WIVRNSWG    + GY +
Sbjct: 279 GGREFQLYHSGIFTGR---CGT-DLDHGVVAVGYGTENGKDYWIVRNSWGGDWGESGYIR 334

Query: 322 IERGANA----CGI 331
           +ER  NA    CGI
Sbjct: 335 MERNVNASTGKCGI 348


>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 327

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 91/323 (28%), Positives = 156/323 (48%), Gaps = 23/323 (7%)

Query: 25  AIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSS 84
           A+ +   L   ++   + ++ +  K+ +TY    E   R + + Q+    +E+     S 
Sbjct: 11  AVLLLIGLVSAAVNDAEEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYVNEHNSMDSSF 70

Query: 85  DRSPQEILQRTGLRLT------GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVK 138
                E    T    +      GK + R   +   + ++      G +P S+DWR   + 
Sbjct: 71  QLEVNEFADLTAEEFSSIYNGYGKGRNRENHENTTIYRYTG----GAIPDSVDWRTKGL- 125

Query: 139 VLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVA 198
            + PV++Q +CGSCWAF+TT  LE   A     L  LS+  LV+CD  +  C GG +  A
Sbjct: 126 -VTPVKNQKQCGSCWAFSTTGSLEGAHAKKTGKLVSLSEQNLVDCDKKDHGCQGGLMTTA 184

Query: 199 FEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD--TWVTSGVDHMMH-LLQS 254
           F+Y+++  G++++  YPY+ K     RC ++K+     V+   + +T+  + +   + + 
Sbjct: 185 FKYIEENKGIDTEESYPYKAKNG---RCEFKKDDIGATVERHVSILTTDCEALKKAVAEI 241

Query: 255 GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGD 312
           GPI V ++  H   + Y           C+  KLDH V +VGYG+++G   W+V+NSWG 
Sbjct: 242 GPISVAMDASHSSFQLYKSGIYDPK--ICSSRKLDHGVLVVGYGKEDGEEYWLVKNSWGK 299

Query: 313 IGPDHGYFQIERGANACGIESYA 335
                GYF+I    N CGI + A
Sbjct: 300 NWGMEGYFKIASKKNLCGICTSA 322


>gi|27681979|ref|XP_225125.1| PREDICTED: cathepsin 7-like [Rattus norvegicus]
 gi|109505372|ref|XP_001065135.1| PREDICTED: cathepsin 7-like [Rattus norvegicus]
          Length = 331

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 89/224 (39%), Positives = 118/224 (52%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKT--LYPLSKSQLVEC 183
           +PK+LDWR  K   + PV  QG CG+CW FA    +E Q  L KKT  L PLS   LV+C
Sbjct: 112 IPKTLDWR--KDGYVTPVRRQGACGACWGFAVAGSIEGQ--LFKKTGKLSPLSVQNLVDC 167

Query: 184 DH--GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
               G + CNGG I  AF+YVK   GLE++A YPY  KE     C Y  EK+ V V    
Sbjct: 168 SRSFGTMGCNGGRIYNAFQYVKNNGGLEAEATYPYEAKEG---NCRYRPEKSVVKVTRFL 224

Query: 241 VTSGVDHMM--HLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           V    +  +   L+  GPI V ++  H   + Y G      +  C     +H++ +VG+G
Sbjct: 225 VVPRNEEALINALVNIGPIAVGIDAQHESFKKYAGGIYHEPN--CKRDSPNHSMLLVGFG 282

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGANA-CGIESYA 335
               E  G   W+V+NS+G+   + GY +I RG N  CGI SYA
Sbjct: 283 YEGQESEGRKYWLVKNSYGEQWGEKGYMKIPRGQNNYCGIASYA 326


>gi|148709355|gb|EDL41301.1| cDNA sequence BC051665 [Mus musculus]
          Length = 349

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 87/236 (36%), Positives = 128/236 (54%), Gaps = 22/236 (9%)

Query: 117 FLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLS 176
              E   G +PKS+DWR      + PV+ QG CGSCWAF+    LE Q+      L PLS
Sbjct: 124 IFQEPLLGDVPKSVDWRDHGY--VTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLS 181

Query: 177 KSQLVEC--DHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAK 233
           +  L++C   +GN+ CNGG +++AF+YVK+  GL+++  Y Y   E     C Y+ + + 
Sbjct: 182 EQNLMDCSWSYGNVGCNGGLMELAFQYVKENRGLDTRESYAY---EAWDGPCRYDPKYSA 238

Query: 234 V----FVQDTWVTSGVDHMMHLLQS-GP--IGVYLNHRLIESYDGNPIRRNDWACNPHKL 286
           V    FV+   V    D +M+ + S GP  +G+  +H     Y G      D  C+   L
Sbjct: 239 VNITGFVK---VPLSEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPD--CSSTNL 293

Query: 287 DHAVAIVGYGEK-NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           DHAV +VGYGE+ +G   W+V+NSWG+     GY ++ +   N CGI +YA   +V
Sbjct: 294 DHAVLVVGYGEESDGRKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYPTV 349


>gi|358334194|dbj|GAA34712.2| cathepsin L [Clonorchis sinensis]
          Length = 401

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 78/195 (40%), Positives = 111/195 (56%), Gaps = 14/195 (7%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-- 184
           P S+DWR +    + PV+ QG+CGSCWAF+ T  +E Q  +  K L  LS+ QLV+C   
Sbjct: 166 PASIDWRSTGA--VTPVKDQGQCGSCWAFSATGAIEGQHFMATKQLVSLSEQQLVDCSSH 223

Query: 185 HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENIT--FRCTYEKEKAKVFVQDTWV 241
            GN  C+GG +D AF+YVK  +G+ ++  YPY + E  T   RC +  +     V    V
Sbjct: 224 FGNFGCSGGWMDNAFKYVKHTHGITTETKYPYISGETGTPNPRCEFHGQAIAATVTGI-V 282

Query: 242 TSGVDHMMHLLQS----GPIGVYLNHRLIESYDG-NPIRRNDWACNPHKLDHAVAIVGYG 296
                +   L Q+    GPI V + H  +ES+ G      +D  C+  +LDHAV +VGYG
Sbjct: 283 DLPRSNEFALKQAVGLHGPISVAI-HASLESFMGYKSGVYSDEECSSDQLDHAVLVVGYG 341

Query: 297 EKNGILTWIVRNSWG 311
           E+NGI  W+++NSWG
Sbjct: 342 EENGIPYWLIKNSWG 356


>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
          Length = 326

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 85/220 (38%), Positives = 114/220 (51%), Gaps = 20/220 (9%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--D 184
           P S+DWR      +  V+ QG CGSCWAF++T  LE Q       L PLS+ QLV+C  D
Sbjct: 111 PASIDWRTQGY--VTDVKDQGACGSCWAFSSTGSLEGQTFKKTGKLVPLSEQQLVDCSGD 168

Query: 185 HGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG 244
           +GN+ C GG +D AF Y+K  G ES+  YPY   ++    C Y  + +KV   DT  T  
Sbjct: 169 YGNMGCGGGWMDQAFSYIKDKGEESEDGYPYTGTDDT---CVY--DASKVVATDTGYTDI 223

Query: 245 VDHMMHLLQS-----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG- 296
            +   + LQ      GPI V ++  H   + Y+       +  C+   LDHAV  VGYG 
Sbjct: 224 PEMDENALQQAVATVGPISVAIDATHSSFQFYESGVYDEPE--CSQTNLDHAVLAVGYGT 281

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            + G+  WIV+NSW       GY ++ R   N CGI S A
Sbjct: 282 SEEGLDYWIVKNSWSTGWGMQGYIEMSRNKDNQCGIASKA 321


>gi|79331505|ref|NP_001032106.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|332009931|gb|AED97314.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 357

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 94/308 (30%), Positives = 147/308 (47%), Gaps = 31/308 (10%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD-------GKETDEY-YGTSGSSDRSPQ 89
           + V +F  +  ++ + Y +  E+K RF  FK++        K+   Y  G +  +D + Q
Sbjct: 54  RHVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQ 113

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E  QRT L         L+   +  +          LP++ DWR+  +  ++PV+ QG C
Sbjct: 114 E-FQRTKLGAAQNCSATLKGSHKVTE--------AALPETKDWREDGI--VSPVKDQGGC 162

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQYG- 206
           GSCW F+TT  LE+           LS+ QLV+C     N  CNGG    AFEY+K  G 
Sbjct: 163 GSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGG 222

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGVYL 261
           L+++  YPY  K+     C +  E   V V ++  +T G +    H + L++   I   +
Sbjct: 223 LDTEKAYPYTGKDET---CKFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIAFEV 279

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
            H     Y       +     P  ++HAV  VGYG ++G+  W+++NSWG    D GYF+
Sbjct: 280 IHSF-RLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFK 338

Query: 322 IERGANAC 329
           +E G N C
Sbjct: 339 MEMGKNMC 346


>gi|194741252|ref|XP_001953103.1| GF17600 [Drosophila ananassae]
 gi|190626162|gb|EDV41686.1| GF17600 [Drosophila ananassae]
          Length = 333

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 81/217 (37%), Positives = 115/217 (52%), Gaps = 13/217 (5%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP ++DWR  K   + PV+ QG CGSCWAF+TT  LE Q       L  LS+  L++C  
Sbjct: 118 LPDAVDWR--KYGFVTPVKDQGSCGSCWAFSTTGSLEGQHFRKTGQLISLSEQNLIDCSP 175

Query: 186 GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG 244
           GN  C  G ++ AF Y++   G++++  YPY   +N   +C + ++         +V   
Sbjct: 176 GNNGCKNGAVEYAFRYIQSNKGIDTEISYPYEAAQN---QCRFRRDTIGA-TSTGFVKLN 231

Query: 245 VDHMMHLLQS----GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKN 299
               M L Q+    GPI V +N  L      +    ND +CNP+KL HAV +VGYG +  
Sbjct: 232 PGDEMELAQAVATVGPISVLINSSLDSFKFYHDGVYNDPSCNPNKLTHAVLVVGYGTDDR 291

Query: 300 GILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G   W+V+NSW     + GY +I+R A N CGI S A
Sbjct: 292 GGDFWLVKNSWSTHWGEQGYVKIKRNANNLCGIASNA 328


>gi|343471318|emb|CCD16236.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 93/311 (29%), Positives = 144/311 (46%), Gaps = 28/311 (9%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           +Q  AFK    K++R+Y D  E   RF  FKQ  +   E         +G +  SD SP+
Sbjct: 39  QQFAAFKQ---KYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPE 95

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E        L G +     A  ER +K +N    G  P ++DWR  K   + PV+ QG C
Sbjct: 96  EF---RATYLNGAK--YYAAALERPRKVVN-VSTGKAPPAVDWR--KKGAVTPVKDQGSC 147

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQYG 206
           GSCWAFA T  +E Q  +    L  LS+  LV CD    NC GG  D AF+++    +  
Sbjct: 148 GSCWAFAATGNIEGQWKIAGHELTSLSEQMLVSCDTTEDNCRGGFADRAFKWIVSSNKGN 207

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHR 264
           + ++  YPY + +     C    +     +         ++ +   L ++GP+ + ++  
Sbjct: 208 VFTEESYPYASTDGYVPPCNKSGKVVGAKISGHINLPKDENAIAEWLARNGPVAIAVDAS 267

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
               Y G  +     +C+   L H V +VGY + +    WI++NSW     + GY +IE+
Sbjct: 268 TFLDYKGGVLT----SCSSEGLSHDVLLVGYNDTSKPPYWIIKNSWDKEWGEEGYIRIEK 323

Query: 325 GANACGIESYA 335
           G N C ++ YA
Sbjct: 324 GTNLCLMKEYA 334


>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
 gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
 gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
          Length = 466

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 97/314 (30%), Positives = 159/314 (50%), Gaps = 42/314 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLR---- 98
           +++++++  ++Y    E   RF+ FK + +  DE       S         + GL     
Sbjct: 49  YESWLIEHGKSYNALGEKDKRFQIFKDNLRYIDEQNSVPNQS--------YKLGLTKFAD 100

Query: 99  LTGKEKERL------EADRERVKKFLNER---KKG-PLPKSLDWRQSKVKVLNPVESQGR 148
           LT +E   +        DR+++ K  ++R   K G  LP+S+DWR+  V V   V+ QG 
Sbjct: 101 LTNEEYRSIYLGTKSSGDRKKLSKNKSDRYLPKVGDSLPESIDWREKGVLV--GVKDQGS 158

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYG 206
           CGSCWAF+  A +ES  A++   L  LS+ +LV+CD   N  C+GG +D AFE+V K  G
Sbjct: 159 CGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGG 218

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL-- 261
           ++++ DYPY+ +  +   C   ++ AKV   D++    V++   +   +   P+ + L  
Sbjct: 219 IDTEEDYPYKERNGV---CDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEA 275

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
             R  + Y           C    +DH V I GYG +NG+  WIVRNSWG    ++GY +
Sbjct: 276 GGRDFQHYKSGIFTGK---CGT-AVDHGVVIAGYGTENGMDYWIVRNSWGANWGENGYLR 331

Query: 322 IER----GANACGI 331
           ++R     +  CG+
Sbjct: 332 VQRNVASSSGLCGL 345


>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
           mansoni]
          Length = 1471

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 99/321 (30%), Positives = 158/321 (49%), Gaps = 37/321 (11%)

Query: 40  VDAFKTYIVKWNRTYTDDNEIKTRFEYFK-------------QDGKETDEYYGTSGSSDR 86
           + A+K + +++ R Y   +E   RF  F              Q+GK T +  G +  +D+
Sbjct: 57  IAAWKFFKIQFKRAYNGIHEETRRFFIFSANFVKMMEHNHAFQEGKVTYKM-GVNEFTDK 115

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           +  E+ +  G ++T        A R +   F+   +   LP  +DWR+     +  V++Q
Sbjct: 116 TDYELKKLRGYKVTSG------AIRHKGSTFI-RSEHTKLPSKVDWRREGA--VTDVKNQ 166

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK- 203
           G+CGSCWAF+TT  +E Q       L  LS+ QLV+C   +GN  C+GG ++ AFEYV+ 
Sbjct: 167 GQCGSCWAFSTTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYVRD 226

Query: 204 QYGLESQADYPYRNKENI-TFRCTYEKEKAKVFVQDTW---VTSGVDH--MMHLLQSGPI 257
             G++S+  YPY + +     RC +    + +  Q T    +  G +   M  +   GP+
Sbjct: 227 NEGIDSEISYPYVSGDGTENNRCLFNA--SNILAQVTGYVNIHEGDERALMDAVATKGPV 284

Query: 258 GVYLNHRL--IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP 315
            V +N  L     Y        D       LDH V +VGYGE+NG   W+++NSWG+   
Sbjct: 285 SVAINAGLPSFSMYKSGIYSDTDCEGTLDALDHGVLVVGYGEENGRSYWLIKNSWGEEWG 344

Query: 316 DHGYFQIERGA-NACGIESYA 335
           + GY +I +G+ N CG+ S A
Sbjct: 345 EKGYIKISKGSHNMCGVASAA 365


>gi|197359120|gb|ACH69776.1| cathepsin L-like cysteine proteinase [Bursaphelenchus xylophilus]
          Length = 261

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 81/232 (34%), Positives = 125/232 (53%), Gaps = 9/232 (3%)

Query: 107 LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVA 166
           ++ ++ + K  +      P P   DWR+  V  + PV+SQ  CGSCWAFA    +E+  A
Sbjct: 25  IQHEKPKRKHQIKFDSASPYPPHFDWREKGV--VTPVKSQFNCGSCWAFAAIGTVETSYA 82

Query: 167 LLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPY-RNKENITFRC 225
           +    L  LS+ +L++CD  N  CNGG+ D AF ++ ++GL  + DYPY   ++N     
Sbjct: 83  IAHGELRNLSEQELLDCDLANNACNGGDDDKAFRFIHEHGLMREEDYPYVAQRQNSCLLN 142

Query: 226 TYEKEKAKVFVQDTWVTSGVDHMMH-LLQSGPIGVYLNHRL-IESYDGNPIRRNDWACNP 283
            Y     K+ +   ++ S  + M+  L+  GPI V +N    ++ Y G     + W C  
Sbjct: 143 EYSGPTTKLDLA-YFIASDENAMLEWLVNFGPINVGINVPPDMKLYKGGVYTPSPWDCKN 201

Query: 284 HKLD-HAVAIVGYGE-KNGILTWIVRNSWGD-IGPDHGYFQIERGANACGIE 332
           + L  HA+ I+GYG  ++G   WIV+NSWG   G + GY  + RG N+CGIE
Sbjct: 202 NILGTHALNIMGYGTWEDGQKYWIVKNSWGPKYGIEDGYVYMARGENSCGIE 253


>gi|391341656|ref|XP_003745143.1| PREDICTED: uncharacterized protein LOC100900885 [Metaseiulus
           occidentalis]
          Length = 1356

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 93/259 (35%), Positives = 132/259 (50%), Gaps = 27/259 (10%)

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
           L+R   R  G++    E   +++           LP  +DWR      + PV++QG CGS
Sbjct: 337 LRRASRRFFGRDFSPEECRNDQI-----------LPDHVDWRLEGA--VTPVKNQGTCGS 383

Query: 152 CWAFATTAILESQVAL--LKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQYGL 207
           CW+FA  A LESQ  L   K+ L   S+ QLV+C  D  N  C+GG+I+ AF YVK+YGL
Sbjct: 384 CWSFAVIAHLESQYFLNNGKENLTRFSEQQLVDCSWDFSNTGCSGGSIESAFSYVKEYGL 443

Query: 208 ESQADY-PYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHR 264
            +   Y PYR +E    R T    +  +   + +   G    +  ++   GPI V ++  
Sbjct: 444 FTDEQYGPYREEEG-KCRDTVTGTEPTISTLEGFNAIGGKECLRNYIALKGPIAVAIDAS 502

Query: 265 LIE-SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
                Y  + + +N  AC    L+HAV  +GYGE NG   W+++NSWGDI    G+  I 
Sbjct: 503 SPSFVYYSHGVYKNP-ACG-RDLNHAVLAIGYGELNGEPYWLIKNSWGDIWGSEGFMLIS 560

Query: 324 RGANACGIE---SYAYLAS 339
           +  N CGIE   SYA L S
Sbjct: 561 QENNTCGIEDELSYADLGS 579



 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 75/213 (35%), Positives = 110/213 (51%), Gaps = 9/213 (4%)

Query: 126  LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
            +P  +DWR      + PV+ Q  CGSCW+F T   +E Q  L    L   ++ QLV+C  
Sbjct: 1141 VPDYVDWRLEGA--VTPVKDQAICGSCWSFGTVGHIEGQYFLKHGELVRFAEQQLVDCSW 1198

Query: 185  -HGNLNCNGGNIDVAFEYVKQYGLESQADY-PYRNKENITFRCTYEKEKAKVFVQDTWVT 242
              GN  C+GG   VA++Y+K+YGL S A Y PYR  +        E  K    +Q  +  
Sbjct: 1199 TSGNDACDGGLDYVAYDYIKKYGLSSDAQYGPYRGIDGKCKDVEIEN-KPITTIQRYYNI 1257

Query: 243  SGVDHMMHLLQ-SGPIGVYLN-HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
            SGV+++   +   GPI V ++  R   S+  + +   D  C+  +LDHAV  VGYG  +G
Sbjct: 1258 SGVENLRKAIAFVGPISVAIDASRPSLSFYAHGVYE-DPDCSSTELDHAVLAVGYGVLHG 1316

Query: 301  ILTWIVRNSWGDIGPDHGYFQIERGANACGIES 333
               W+++NSW     + GY  I +  N CG+ S
Sbjct: 1317 KPYWLIKNSWSTYWGNDGYILISQKDNMCGVAS 1349


>gi|2239107|emb|CAA70693.1| cathepsin L-like cysteine proteinase [Heterodera glycines]
          Length = 374

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 85/253 (33%), Positives = 125/253 (49%), Gaps = 16/253 (6%)

Query: 98  RLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFAT 157
           +L G  +   ++ R     FL     G LP+S+DWR      +  V++QG CGSCWAF+ 
Sbjct: 128 KLNGFRRLMGDSLRRNASTFLAPMNVGDLPESVDWRDKGW--VTEVKNQGMCGSCWAFSA 185

Query: 158 TAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYP 214
           T  LE Q    K  L  LS+  L++C   +GN+ CNGG +D AF+Y+K   G++ +  YP
Sbjct: 186 TGALEGQHVRDKGHLVSLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNKGIDKETAYP 245

Query: 215 YRNKENITFRCTYEKEKAKVF---VQDTWVTSGVDHMMHLLQSGPIGVYLN--HRLIESY 269
           Y+ K     +C +++           D       D  M +   GP+ V ++  HR  + Y
Sbjct: 246 YKAKTGK--KCLFKRNDVGATDSGYNDIAEGDEEDLKMAVATQGPVSVAIDAGHRSFQLY 303

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-N 327
                   +  C+P  LDH V +VGYG +      WIV+NSWG    + GY ++ R   N
Sbjct: 304 TNGVYFEKE--CDPENLDHGVLVVGYGTDPTQGDYWIVKNSWGTRWGEQGYIRMARNRNN 361

Query: 328 ACGIESYAYLASV 340
            CGI S+A    V
Sbjct: 362 NCGIASHASFPLV 374


>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
          Length = 371

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 96/309 (31%), Positives = 154/309 (49%), Gaps = 31/309 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           +K+++++  + Y    E + RFE FK + +  DE+          G +  +D + QE   
Sbjct: 45  YKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYRA 104

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           +  L      + RL   +    ++ + R    LP S+DWR      ++PV+ QG CGSCW
Sbjct: 105 KF-LGTRTDPRRRLMKSKIPSSRYAH-RAGDNLPDSVDWRDHGA--VSPVKDQGSCGSCW 160

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQA 211
           AF+T A +E    ++   L  LS+ +LV+CD   +  CNGG +D AF+++    G++++ 
Sbjct: 161 AFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDAGCNGGLMDYAFQFIMDNGGIDTEK 220

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTW--VTSGVDHMMHLLQSGPIGVYL--NHRLIE 267
           DYPY    N   +C   K+ AKV   D +  V +  + +   +   P+ + +    R  +
Sbjct: 221 DYPYLGFNN---QCDPTKKNAKVVSIDGYEDVPNNENALKKAVAHQPVSIAIEAGGRAFQ 277

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
            Y+       +  C    LDH V  VGYG + NG   WIVRNSWG    ++GY ++ER  
Sbjct: 278 LYESGVF---NGECGL-ALDHGVVAVGYGTDDNGQDYWIVRNSWGSNWGENGYIRMERNI 333

Query: 327 NA----CGI 331
           NA    CGI
Sbjct: 334 NANTGKCGI 342


>gi|195624522|gb|ACG34091.1| thiol protease aleurain precursor [Zea mays]
          Length = 360

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 93/318 (29%), Positives = 145/318 (45%), Gaps = 47/318 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQ--------DGKETDEYYGTSGSSDRSPQEI--- 91
           F  + V++ ++Y    E+  RF  F +        + K      G +  +D S +E    
Sbjct: 59  FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYRLGINRFADMSWEEFRAT 118

Query: 92  ----LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
                Q     LTG  + R  A                LP++ DWR+  +  ++PV++QG
Sbjct: 119 RLGAAQNCSATLTGNHRMRAAAV--------------ALPETKDWREDGI--VSPVKNQG 162

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-Q 204
            CGSCW F+TT  LE+           LS+ QL++C     N  CNGG    AFEY+K  
Sbjct: 163 HCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYIKYN 222

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMH----LLQSGPIG- 258
            GL+++  YPY+    I   C ++ E     V D+  +T G +  +     L++   +  
Sbjct: 223 GGLDTEESYPYQGVNGI---CKFKNENVGFKVLDSVNITLGAEDELKDAVGLVRPVSVAF 279

Query: 259 -VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            V    RL   Y       +     P  ++HAV  VGYG ++G+  W+++NSWG    D 
Sbjct: 280 EVITGFRL---YKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDE 336

Query: 318 GYFQIERGANACGIESYA 335
           GYF++E G N CG+ + A
Sbjct: 337 GYFKMEMGKNMCGVATCA 354


>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 103/318 (32%), Positives = 153/318 (48%), Gaps = 37/318 (11%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K ++ F++++ K  + Y    E   RFE FK + K  DE        + G +  +D S Q
Sbjct: 42  KLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQ 101

Query: 90  EILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E   +  GL++    +      RE  ++F    K   LPKS+DWR  K   + PV++QG 
Sbjct: 102 EFKNKYLGLKVDYSRR------RESPEEF--TYKDVELPKSVDWR--KKGAVAPVKNQGS 151

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYG 206
           CGSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AF + V+  G
Sbjct: 152 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGG 211

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL-- 261
           L  + DYPY  +E     C   KE+ +V     +     +    ++  L + P+ V +  
Sbjct: 212 LHKEEDYPYIMEEGT---CEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEA 268

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           + R  + Y G      D  C    LDH VA VGYG   G+   IV+NSWG    + GY +
Sbjct: 269 SGRDFQFYSGGVF---DGHCGS-DLDHGVAAVGYGTAKGVDYIIVKNSWGSKWGEKGYIR 324

Query: 322 IERGA----NACGIESYA 335
           + R        CGI   A
Sbjct: 325 MRRNIGKPEGICGIYKMA 342


>gi|323451555|gb|EGB07432.1| hypothetical protein AURANDRAFT_2413 [Aureococcus anophagefferens]
          Length = 263

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 78/228 (34%), Positives = 120/228 (52%), Gaps = 12/228 (5%)

Query: 110 DRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLK 169
           +RER   +   ++   +   +DW  S    +  V++QG+CGSCW+F+TT  LE    +  
Sbjct: 39  ERERNYDYTLAKQVDAVASDVDWVASGA--VTGVKNQGQCGSCWSFSTTGALEGAFEIAG 96

Query: 170 KTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYE 228
            TL  LS+  LV+CD  +  CNGG +D AF++++   G+ S+ADY Y   +     C   
Sbjct: 97  NTLTSLSEQNLVDCDTTDSGCNGGLMDNAFKWIQSNGGICSEADYAYTAAKG---TCKTT 153

Query: 229 KEKAKVFVQDTWVTSG-VDHMMHLLQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHK 285
            +K       T V SG  D +   +  GP+ + +  +  + +SY    +  +  AC  + 
Sbjct: 154 CDKVATLSGHTDVPSGDEDALKTAVAIGPVSIAIEADKSVFQSYSSGILDSS--ACGTN- 210

Query: 286 LDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIES 333
           LDH V +VGYG  +G   W V+NSWG    + GY +I RG+N CGI S
Sbjct: 211 LDHGVLVVGYGTDDGSEYWKVKNSWGTTWGESGYVRIARGSNICGIAS 258


>gi|348564702|ref|XP_003468143.1| PREDICTED: cathepsin F-like [Cavia porcellus]
          Length = 462

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 97/318 (30%), Positives = 157/318 (49%), Gaps = 25/318 (7%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDR 86
           ++K    FK ++  +NRTY    E + R   F ++     +          YG +  SD 
Sbjct: 158 AMKIASLFKKFVATYNRTYESKEETQWRLSVFTRNMILAQKIQALDRGTAQYGVTKFSDL 217

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQ 146
           + +E   RT + L    +E   +   R  K +++      P   DWR  K   +  V++Q
Sbjct: 218 TEEEF--RT-IYLNPLLREH-PSKTMRQAKIVHDSA----PPEWDWR--KKGAVTEVKNQ 267

Query: 147 GRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG 206
           G CGSCWAF+ T  +E Q  L K TL  LS+ +L++CD  +  C GG    A+  +K  G
Sbjct: 268 GMCGSCWAFSVTGNVEGQWFLKKGTLLSLSEQELLDCDKVDKACMGGLPINAYSAIKSLG 327

Query: 207 -LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            LE++ DY Y+        C +  +KAKV++ D+   S  +  +   L   GPI + +N 
Sbjct: 328 GLETEDDYSYQGHMEA---CNFSAKKAKVYINDSVELSKNEQYLAAWLAVKGPISIAINA 384

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
             ++ Y           C+P  +DHA+ IVGYG+++G+  W ++NSWG    + GY+ + 
Sbjct: 385 FGMQFYRHGIAHPLQPLCSPWFIDHAMLIVGYGKRSGVPFWAIKNSWGTDWGEEGYYYLH 444

Query: 324 RGANACGIESYAYLASVK 341
           RG+ +CG+   A  A V+
Sbjct: 445 RGSRSCGVNVMASSAVVE 462


>gi|1460063|emb|CAA60672.1| cysteine protein [Entamoeba dispar]
          Length = 307

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/298 (30%), Positives = 145/298 (48%), Gaps = 27/298 (9%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGS--SDRSPQEILQRTGLRL 99
           AFK +    N+ + +  E   RF  F  + K  +    T  +  +D + +E +Q T L +
Sbjct: 16  AFKQWAAAHNKVFANRAEYLYRFAVFLDNKKFVEANANTELNVFADMTHEEFIQ-THLGM 74

Query: 100 TGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
           T +  E   + +  +K           P+S+DWR     ++NP + QG+CGSCW F TTA
Sbjct: 75  TYEIPETTPSVKAAIK---------AAPESVDWRS----IMNPAKDQGQCGSCWTFCTTA 121

Query: 160 ILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNK 218
           +LE +V      LY  S+ QLV+CD  +  C GG+   + +++++  GL  + DYPY+  
Sbjct: 122 VLEGRVNKDLGKLYSFSEQQLVDCDSSDNGCEGGHPSNSLKFIQENNGLGLETDYPYK-- 179

Query: 219 ENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHR--LIESYDGNPI 274
             +   C   K  A V      VT G +  +   + ++GP+ V ++      + Y    I
Sbjct: 180 -AVAGTCKKVKNVATV-TGSKRVTDGSETGLQTIIAENGPVAVGMDASRPSFQLYKKGTI 237

Query: 275 RRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGI 331
             +D  C    ++H V  VGYG  +    WI+RNSWG    D GYF + R + N CGI
Sbjct: 238 -YSDAKCRSRMMNHCVTAVGYGSNSNGKYWIIRNSWGTAWGDAGYFLLARDSNNMCGI 294


>gi|535600|gb|AAA29137.1| cathepsin [Fasciola hepatica]
          Length = 326

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 84/227 (37%), Positives = 121/227 (53%), Gaps = 18/227 (7%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
           K  +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    +KT    S+ QLV+
Sbjct: 105 KRAVPDRIDWRESGY--VTEVKDQGGCGSCWAFSTTGAMEGQYMKNEKTSISFSEQQLVD 162

Query: 183 CD--HGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           C    GN  CNGG ++ A+EY+K++GLE+++ YPYR  E    +C Y ++     V   +
Sbjct: 163 CSGPFGNYGCNGGLMENAYEYLKRFGLETESSYPYRAVEG---QCRYNEQLGVAKVTGYY 219

Query: 241 VTSGVDHMMHLLQSG---PIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVAIVG 294
                D +      G   P  V L+   +ES D    R   +    C+P +L+H V  VG
Sbjct: 220 TVHSGDEVELQNLVGCRRPAAVALD---VES-DFMMYRSGIYQSQTCSPDRLNHGVLAVG 275

Query: 295 YGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           YG ++G   WIV+NSWG    + GY ++ R   N CGI S A +  V
Sbjct: 276 YGIQDGTDYWIVKNSWGTWWGEDGYIRMVRKRGNMCGIASLASVPMV 322


>gi|125547724|gb|EAY93546.1| hypothetical protein OsI_15336 [Oryza sativa Indica Group]
          Length = 348

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 94/281 (33%), Positives = 139/281 (49%), Gaps = 40/281 (14%)

Query: 78  YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGP---LPKSLDWRQ 134
           +G +  SD +P E   R    L G  +  LE     V    +E    P   LP   DWR+
Sbjct: 68  HGVTKFSDLTPGEFRDR----LLGLRRPSLEG---LVGGEPHEAPILPTDGLPDDFDWRE 120

Query: 135 SKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH--------- 185
                + PV+ QG CGSCW+F+T+  LE    L    L  LS+ Q+V+CDH         
Sbjct: 121 HGA--VGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRA 178

Query: 186 GNLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG 244
            +  CNGG +  AF Y+ K  GL+S+ DYPY  +EN    C ++K K    V++  V S 
Sbjct: 179 CDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGRENT---CKFDKSKIVAQVKNFSVISV 235

Query: 245 VDHMM--HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGIL 302
            +  +  +L++ GP+ + +N   +++Y G       + C  H LDH V +VGYG      
Sbjct: 236 NEDQIAANLVKHGPLAIAINAAYMQTYIGG--VSCPFICGRH-LDHGVLLVGYGSAGYAP 292

Query: 303 T-------WIVRNSWGDIGPDHGYFQIERGA---NACGIES 333
                   WI++NSWG+   + GY++I RG    N CG++S
Sbjct: 293 IRFKEKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDS 333


>gi|167394751|ref|XP_001741082.1| cysteine proteinase ACP1 precursor [Entamoeba dispar SAW760]
 gi|165894470|gb|EDR22453.1| cysteine proteinase ACP1 precursor, putative [Entamoeba dispar
           SAW760]
          Length = 308

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/298 (30%), Positives = 145/298 (48%), Gaps = 27/298 (9%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGS--SDRSPQEILQRTGLRL 99
           AFK +    N+ + +  E   RF  F  + K  +    T  +  +D + +E +Q T L +
Sbjct: 17  AFKQWAAAHNKVFANRAEYLYRFAVFLDNKKFVEANANTELNVFADMTHEEFIQ-THLGM 75

Query: 100 TGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
           T +  E   + +  +K           P+S+DWR     ++NP + QG+CGSCW F TTA
Sbjct: 76  TYEIPETTPSVKAAIK---------AAPESVDWRS----IMNPAKDQGQCGSCWTFCTTA 122

Query: 160 ILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNK 218
           +LE +V      LY  S+ QLV+CD  +  C GG+   + +++++  GL  + DYPY+  
Sbjct: 123 VLEGRVNKDLGKLYSFSEQQLVDCDSSDNGCEGGHPSNSLKFIQENNGLGLETDYPYK-- 180

Query: 219 ENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHR--LIESYDGNPI 274
             +   C   K  A V      VT G +  +   + ++GP+ V ++      + Y    I
Sbjct: 181 -AVAGTCKKVKNVATV-TGSKRVTDGSETGLQTIIAENGPVAVGMDASRPSFQLYKKGTI 238

Query: 275 RRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGI 331
             +D  C    ++H V  VGYG  +    WI+RNSWG    D GYF + R + N CGI
Sbjct: 239 -YSDAKCRSRMMNHCVTAVGYGSNSNGKYWIIRNSWGTAWGDAGYFLLARDSNNMCGI 295


>gi|294885991|ref|XP_002771503.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
 gi|239875207|gb|EER03319.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
          Length = 337

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 80/227 (35%), Positives = 117/227 (51%), Gaps = 14/227 (6%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           LP S+DWR  K  VLNPV+ QG CGSCWAF+    LE + A+    L  LS+ QLV+C  
Sbjct: 112 LPTSVDWR--KKGVLNPVKDQGYCGSCWAFSAIGALEPRYAIATGKLLSLSEQQLVDCAG 169

Query: 184 DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
            +GN  CNGG +D AFEY+K  G++ ++ YPY   +  T + T E +   + V +     
Sbjct: 170 AYGNEGCNGGLMDKAFEYIKATGVDKESTYPYVGSDE-TCQATVENKTDGLPVGEVTGNQ 228

Query: 244 GVDH----MMHLLQSGP--IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
            +      +M  + + P  I +Y N +  + Y        +       +DH V  VGYG 
Sbjct: 229 MLHQTEKALMEGVAAAPVSIAMYANLQSFQHYKSGVYSDPNCNAKGGSIDHGVVAVGYGT 288

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERGANA---CGIESYAYLASVK 341
           +NG   +I+RNSWG      GY  ++RG  +   C I  Y  + ++K
Sbjct: 289 ENGQDYFIIRNSWGRSWGQDGYVYLKRGVGSFGQCNIYKYMCVPTLK 335


>gi|116488416|gb|AAB41670.2| secreted cathepsin L 1 [Fasciola hepatica]
          Length = 326

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 122/224 (54%), Gaps = 18/224 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 108 VPDKIDWRESGY--VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSR 165

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN  C GG ++ A++Y+KQ+GLE+++ YPY   E    +C Y K+     V   + V 
Sbjct: 166 PWGNNGCGGGLMENAYQYLKQFGLETESSYPYTAVEG---QCRYNKQLGVAKVTGFYTVH 222

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVAIVGYGE 297
           SG +  +  L    GP  V ++   +ES D    R   +    C+P +++HAV  VGYG 
Sbjct: 223 SGSEVELKNLVGAEGPAAVAVD---VES-DFMMYRSGIYQSQTCSPLRVNHAVLAVGYGT 278

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 279 QGGTDYWIVKNSWGLSWGERGYIRMVRNRGNMCGIASLASLPMV 322


>gi|340053966|emb|CCC48259.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
           Y486]
          Length = 447

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 89/312 (28%), Positives = 142/312 (45%), Gaps = 27/312 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ R+Y    E   R   F+ + + +  Y        +G +  SD +P+E   R
Sbjct: 26  FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 85

Query: 95  TGLRLTGKEKER-LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
                     ER  EA R RV+  + +   G  P ++DWR+     + PV+ QG CGSCW
Sbjct: 86  Y------HNGERHFEAARGRVRTLV-QVPPGKAPAAVDWRRKGA--VTPVKDQGSCGSCW 136

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG---LESQ 210
           +F+    +E Q A     L  LS+  LV CD  +  C GG +D AFE++ +     + ++
Sbjct: 137 SFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTE 196

Query: 211 ADYPYRNKENITFRCT-YEKEKAKVFVQDTWVTSGVDHMM-HLLQSGPIGVYLNHRLIES 268
             YPY +++     C  Y  E          +    D +  +L  +GP+ V ++     S
Sbjct: 197 KSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMS 256

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y G  +     +C    L+H V +VGY + +    WI++NSW     + GY +IE+G N 
Sbjct: 257 YSGGVVT----SCTSEALNHGVLLVGYNDSSKPPYWIIKNSWSSSWGEKGYIRIEKGTNQ 312

Query: 329 CGIESYAYLASV 340
           C +   A  A V
Sbjct: 313 CLVAQLASSAVV 324


>gi|21617827|sp|P09648.1|CATL1_CHICK RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
           heavy chain; Contains: RecName: Full=Cathepsin L1 light
           chain
          Length = 218

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 81/219 (36%), Positives = 116/219 (52%), Gaps = 17/219 (7%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-- 184
           P+S+DWR+     + PV+ QG+CGSCWAF+TT  LE Q    K  L  LS+  LV+C   
Sbjct: 2   PRSVDWREKGY--VTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRP 59

Query: 185 HGNLNCNGGNIDVAFEYVK-QYGLESQADYPY--RNKENITFRCTYEKEKAKVFVQDTWV 241
            GN  CNGG +D AF+YV+   G++S+  YPY  ++ E+  ++  Y       FV    +
Sbjct: 60  EGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVD---I 116

Query: 242 TSGVDH--MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
             G +   M  +   GP+ V ++  H   + Y        D  C+   LDH V +VGYG 
Sbjct: 117 PQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPD--CSSEDLDHGVLVVGYGF 174

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
           + G   WIV+NSWG+   D GY  + +   N CGI + A
Sbjct: 175 EGGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAA 213


>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 87/253 (34%), Positives = 125/253 (49%), Gaps = 21/253 (8%)

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
           +   TG R+ G  K        +   FL       LPK++DWR      + PV+ QG+CG
Sbjct: 89  VAMMTGFRVNGTSKA------AKGSTFLPSNNVDKLPKTVDWRTKGY--VTPVKDQGQCG 140

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEY-VKQYGLES 209
           SCWAF+ T  LE Q       L  LS+  LV+C + N  C+GG +D AF+Y +   G+++
Sbjct: 141 SCWAFSATGSLEGQQFKKTGKLVSLSEQNLVDCSYRNYGCHGGFMDRAFQYIIDAGGIDT 200

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGVYLN--HR 264
           +A Y YR  +     C ++K      V   T VTSG +  +   +   GPI V ++  H+
Sbjct: 201 EATYSYRAVDG---NCHFKKANVGATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHK 257

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIE 323
             + Y       N+  C+  +L HAV +VGYG   +G   WIV+NSW      +GY  + 
Sbjct: 258 FFKFYKSGVY--NEPGCSTTRLGHAVLVVGYGTTSDGTDYWIVKNSWAKTWGMNGYLWMS 315

Query: 324 RGA-NACGIESYA 335
           R   N CGI S A
Sbjct: 316 RNKDNQCGIASEA 328


>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
 gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 88/307 (28%), Positives = 152/307 (49%), Gaps = 30/307 (9%)

Query: 46  YIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-RLTGKEK 104
           ++ ++ R Y+D  E + R++ FK++ +  + +   + +S++S      + G+ +      
Sbjct: 42  WMTRFKRVYSDAKEKEIRYKIFKENVQRIESF---NKASEKS-----YKLGINQFADLTN 93

Query: 105 ERLEADRERVKKFLNERKKGP--------LPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
           E  +  R R K  +   + GP        +P S+DWR  K   +  ++ QG+CGSCWAF+
Sbjct: 94  EEFKTSRNRFKGHMCSSQAGPFRYENITAVPSSMDWR--KEGAVTAIKDQGQCGSCWAFS 151

Query: 157 TTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQ-YGLESQADY 213
             A +E    L    L  LS+ +LV+CD    +  C GG +D AF++++Q  GL ++A+Y
Sbjct: 152 AVAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQGLTTEANY 211

Query: 214 PYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIE-SYDGN 272
           PY   +            AK+   +    +    +M  +   P+ V ++    E  +  +
Sbjct: 212 PYEGSDGTCNTKQEANHAAKINGFEDVPANNEGALMKAVAKQPVSVAIDAGGFEFQFYSS 271

Query: 273 PIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA---- 328
            I   D  C   +LDH VA VGYGE NG+  W+V+NSWG    + GY ++++  +A    
Sbjct: 272 GIFTGD--CGT-ELDHGVAAVGYGESNGMNYWLVKNSWGTQWGEEGYIRMQKDIDAKEGL 328

Query: 329 CGIESYA 335
           CGI   A
Sbjct: 329 CGIAMQA 335


>gi|294885989|ref|XP_002771502.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
 gi|239875206|gb|EER03318.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
          Length = 337

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 80/227 (35%), Positives = 117/227 (51%), Gaps = 14/227 (6%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           LP S+DWR  K  VLNPV+ QG CGSCWAF+    LE + A+    L  LS+ QLV+C  
Sbjct: 112 LPTSVDWR--KKGVLNPVKDQGYCGSCWAFSAIGALEPRYAIATGKLLSLSEQQLVDCAG 169

Query: 184 DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
            +GN  CNGG +D AFEY+K  G++ ++ YPY   +  T + T E +   + V +     
Sbjct: 170 AYGNEGCNGGLMDKAFEYIKATGVDKESTYPYVGSDE-TCQATVENKTDGLPVGEVTGNQ 228

Query: 244 GVDH----MMHLLQSGP--IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
            +      +M  + + P  I +Y N +  + Y        +       +DH V  VGYG 
Sbjct: 229 MLHQTEKALMEGVAAAPVSIAMYANLQSFQHYKSGVYSDPNCNAKGGSIDHGVVAVGYGT 288

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERGANA---CGIESYAYLASVK 341
           +NG   +I+RNSWG      GY  ++RG  +   C I  Y  + ++K
Sbjct: 289 ENGQDYFIIRNSWGRSWGQDGYVYLKRGVGSFGQCNIYKYMCVPTLK 335


>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
           Precursor
 gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
 gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 79/218 (36%), Positives = 117/218 (53%), Gaps = 21/218 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P+++DWRQ     +NP++ QG CGSCWAF+TTA +E    ++   L  LS+ +LV+CD 
Sbjct: 145 VPETVDWRQKGA--VNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK 202

Query: 186 G-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--- 240
             N  CNGG +D AF+++ K  GL ++ DYPYR       +C    + ++V   D +   
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRG---FGGKCNSFLKNSRVVSIDGYEDV 259

Query: 241 VTSGVDHMMHLLQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
            T     +   +   P+ V +    R+ + Y          +C  + LDHAV  VGYG +
Sbjct: 260 PTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTG---SCGTN-LDHAVVAVGYGSE 315

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGANA-----CGI 331
           NG+  WIVRNSWG    + GY ++ER   A     CGI
Sbjct: 316 NGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGI 353


>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
          Length = 345

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 103/318 (32%), Positives = 152/318 (47%), Gaps = 33/318 (10%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K ++ F++++ K  + Y    E   RFE FK + K  DE        + G +  +D S Q
Sbjct: 42  KLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQ 101

Query: 90  EILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E   +  GL++    +      RE  ++F    K   LPKS+DWR  K   + PV++QG 
Sbjct: 102 EFKNKYLGLKVDYSRR------RESPEEF--TYKDVELPKSVDWR--KKGAVAPVKNQGS 151

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLN-CNGGNIDVAFEY-VKQYG 206
           CGSCWAF+T A +E    ++   L  LS+ +L++CD    N CNGG +D AF + V+  G
Sbjct: 152 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYSNGCNGGLMDYAFSFIVENGG 211

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLL-----QSGPIGVYL 261
           L  + DYPY  +E     C   KE+ +V     +     ++   LL     QS  + +  
Sbjct: 212 LHKEEDYPYIMEEGT---CEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEA 268

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           + R  + Y G      D  C    LDH VA VGYG   G+   IV+NSWG    + GY +
Sbjct: 269 SGRDFQFYSGGVF---DGHCGS-DLDHGVAAVGYGTAKGVDYIIVKNSWGSKWGEKGYIR 324

Query: 322 IERGANACGIESYAYLAS 339
           +       G   Y  +AS
Sbjct: 325 MRGTLETRGNLRYLQMAS 342


>gi|218137972|gb|ACK57563.1| cysteine protease-like protein [Arachis hypogaea]
          Length = 364

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 100/318 (31%), Positives = 151/318 (47%), Gaps = 42/318 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQD------GKETD--EYYGTSGSSDRSPQEILQR 94
           F  +  K+++TY    E   RF  FK +       +E D    +G +  SD +P E   +
Sbjct: 49  FSAFKTKFSKTYATKEEHDYRFGVFKSNLLRAKSHQELDPSAIHGVTKFSDLTPSEFRSQ 108

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G +   L +D         +     LPK  DWR      +  V++QG  GSCW+
Sbjct: 109 ----FLGLKPLSLPSDAHNAPILPTDN----LPKDFDWRDHGA--VTNVKNQGTGGSCWS 158

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---GNLN------CNGGNIDVAFEYVKQY 205
           F+TT  LE    L    L  LS+ QLV+CDH    +LN      CNGG +  AF Y K+ 
Sbjct: 159 FSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPDLNDACDSGCNGGLMTTAFGYTKKA 218

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            GL  + DY Y  ++     C ++K K    V +  V S  +  +  +L+++GP+ V +N
Sbjct: 219 GGLVREEDYLYTGRDRGP--CKFDKSKIAASVSNFSVVSLDEDQIAANLVKNGPLSVGIN 276

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y G       + C  H LDH V +VGYG              WI++NSWG+   
Sbjct: 277 AVYMQTYIGG--VSCPFICGKH-LDHGVLLVGYGAGGYAPIRFKEKPYWIIKNSWGENWG 333

Query: 316 DHGYFQIERGANACGIES 333
           ++GY++I RG N CG++S
Sbjct: 334 ENGYYKICRGPNMCGVDS 351


>gi|309752918|gb|ADO85436.1| cathepsin [Pieris rapae granulovirus]
          Length = 339

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 91/304 (29%), Positives = 152/304 (50%), Gaps = 21/304 (6%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGK--------ETDEYYGTSGSSDRSPQEIL 92
           + F+ +I K+N++Y  D E   ++E FK + K          D  +  +  SD +  ++L
Sbjct: 34  NIFEDFIKKYNKSYATDQERAIKYENFKNNLKMINDKNNGSKDAVFDINAFSDLNKNDLL 93

Query: 93  QRT-GLRLTGKEKERLEAD--RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           +RT G R+  K+      D  +E   + +    +  LP+S DWR      + PV++Q  C
Sbjct: 94  RRTTGFRMGLKKNSYYTPDVSKECNVQVIKSEPQIILPESFDWRDKHG--VTPVKNQLEC 151

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV-KQYGLE 208
           GSCWAF+  A +ES   +       LS+  L+ CD  N  C GG +  A E + +Q G+ 
Sbjct: 152 GSCWAFSAIANIESLYNIKHNKELDLSEQHLINCDSINNGCGGGLMHWALETILQQGGIV 211

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLL-QSGPIGVYLNHRLIE 267
           S+ D PY   + +   C  ++    +     +V    + +  LL  +GPI + ++  +I+
Sbjct: 212 SEKDEPYYGLDAV---CKPKQFNVSISGCTRYVLKNENKLRELLIANGPISMAVD--IID 266

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
             D       D   N + L+HAV +VGYG  N I  WI++NSWG+   + GY +++R  N
Sbjct: 267 VIDYKE-GITDICENMNGLNHAVLLVGYGVHNNIPYWIMKNSWGEEWGEKGYLRVQRNIN 325

Query: 328 ACGI 331
           +CG+
Sbjct: 326 SCGL 329


>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
 gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
          Length = 349

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 152/318 (47%), Gaps = 37/318 (11%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K V+ F+++I    + Y    E   RFE FK++ K  D+        + G +  +D S +
Sbjct: 42  KLVELFESWISGHGKAYNSLEEKLHRFEVFKENLKHIDQRNKEVTSYWLGLNEFADLSHE 101

Query: 90  EILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E   +  GL          E  R++  +  + R    LPKS+DWR  K   + PV++QG 
Sbjct: 102 EFKSKFLGLYP--------EFPRKKSSEDFSYRDVVDLPKSIDWR--KKGAVTPVKNQGS 151

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYG 206
           CGSCWAF+T A +E    ++   L  LS+ QL++CD   N  CNGG +D AFE+ V   G
Sbjct: 152 CGSCWAFSTVAAVEGINQIVAGNLTSLSEQQLIDCDTSFNNGCNGGLMDYAFEFIVNNGG 211

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYLNH 263
           L  + DYPY  +E     C  ++E+ +V     +     +    ++  L   P+ V ++ 
Sbjct: 212 LHKEEDYPYLMEEGT---CDEKREEMEVVTISGYHDVPRNDEQSLLKALAHQPLSVAIDA 268

Query: 264 --RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
             R  + Y G         C    LDH VA VGYG  +GI   IV+NSWG    + GY +
Sbjct: 269 SGRDFQFYSGGVFSG---PCGT-DLDHGVAAVGYGSSSGIDYIIVKNSWGPKWGERGYLR 324

Query: 322 IERGA----NACGIESYA 335
           ++R        CGI   A
Sbjct: 325 MKRNTGKPEGLCGINKMA 342


>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
 gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 96/311 (30%), Positives = 156/311 (50%), Gaps = 35/311 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           ++ ++VK  R Y    E + RFE FK + K  DE+          G +  +D S  E   
Sbjct: 25  YEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADLSNDEYRS 84

Query: 94  -RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
              G R+ GK   RL    +  +    E     LP+++DWR+     + PV+ QG+CGSC
Sbjct: 85  VYLGTRMDGKG--RLLGGPKSERYLFKEGDD--LPETVDWREKGA--VAPVKDQGQCGSC 138

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQ 210
           WAF+T   +E    ++   L  LS+ +LV+CD   NL CNGG +D AF+++ +  G++++
Sbjct: 139 WAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIENGGIDTE 198

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--NHRL 265
            DYPY+  +++   C   ++ A+V   D +     +    +   + + P+ V +    R 
Sbjct: 199 EDYPYKAIDSM---CDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRG 255

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
            + Y          +C   +LDH V  VGYG ++G+  WIVRNSWG    ++GY ++ER 
Sbjct: 256 FQLYQSGVFTG---SCGT-QLDHGVVTVGYGTEHGVDYWIVRNSWGPAWGENGYIRMERD 311

Query: 326 ANA-----CGI 331
             +     CGI
Sbjct: 312 VASTETGKCGI 322


>gi|351629613|gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
          Length = 397

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 91/330 (27%), Positives = 156/330 (47%), Gaps = 55/330 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEI--- 91
           FK+++ ++ +TY+   E   R   F ++  +  E+        +G +  SD + +E    
Sbjct: 74  FKSFVEEYEKTYSTHEEYVHRLGIFAKNLIKAAEHQAMDPSAIHGVTQFSDLTEEEFEAT 133

Query: 92  ---LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
              L+         +  + + D    +  ++      LP+S DWR+     +  V++QGR
Sbjct: 134 YMGLKGGAGVGGTTQLGKDDGDESAAEVMMDVSD---LPESFDWREKGA--VTEVKTQGR 188

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAF 199
           CGSCWAF+TT  +E    +    L  LS+ QLV+CDH          +  C+GG +  AF
Sbjct: 189 CGSCWAFSTTGAIEGANFIATGKLLSLSEQQLVDCDHMCDLKEKDDCDDGCSGGLMTTAF 248

Query: 200 EY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ--------DTWVTSGVDHMMH 250
            Y ++  G+E +  YPY  K      C +  EK  V V+        ++ + + V H   
Sbjct: 249 NYLIEAGGIEEEVTYPYTGKRG---ECKFNPEKVAVKVRNFAKIPEDESQIAANVVH--- 302

Query: 251 LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILT------ 303
              +GP+ + LN   +++Y G         C+  +++H V +VGYG +   IL       
Sbjct: 303 ---NGPLAIGLNAVFMQTYIGGV--SCPLICDKKRINHGVLLVGYGSRGFSILRLGYKPY 357

Query: 304 WIVRNSWGDIGPDHGYFQIERGANACGIES 333
           WI++NSWG    +HGY+++ RG N CG+ +
Sbjct: 358 WIIKNSWGKRWGEHGYYRLCRGHNMCGMST 387


>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 334

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 81/240 (33%), Positives = 117/240 (48%), Gaps = 15/240 (6%)

Query: 106 RLEADRERV---KKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILE 162
           +++ +R R      F+     G LP S+DWR + +  + PV+ QG+CGSCW+F+TT  +E
Sbjct: 95  KVDLNRPRSNFSSTFIPTANVGALPDSVDWRTAGI--VTPVKDQGQCGSCWSFSTTGSVE 152

Query: 163 SQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKE 219
            Q A     L  LS+  LV+C    GN  CNGG +D AF+Y+    G++++A YPY  K+
Sbjct: 153 GQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEASYPYTAKD 212

Query: 220 NITFRCTYEKEKAKVFV---QDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRR 276
                C +        +   QD    S  D    +   GP+ V ++              
Sbjct: 213 GT---CKFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQLYTSGVY 269

Query: 277 NDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           N+  C+   LDH V   GYG  NG   W+V+NSWG      GY  + R A N CGI + A
Sbjct: 270 NEKKCSSTSLDHGVLAAGYGTSNGTPYWLVKNSWGSSWGQAGYIWMSRNANNQCGIATSA 329


>gi|218185|dbj|BAA14404.1| oryzain gamma precursor [Oryza sativa Japonica Group]
          Length = 362

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 94/311 (30%), Positives = 147/311 (47%), Gaps = 34/311 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-RLTG 101
           F  + V+  + Y D  E++ RF  F +  +          S++R  + +  R G+ R   
Sbjct: 62  FARFAVRHGKRYGDAAEVQRRFRIFSESLELVR-------STNR--RGLPYRLGINRFAD 112

Query: 102 KEKERLEADRERVKKFLNERKKG--------PLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
              E  +A R    +  +    G         LP++ DWR+  +  ++PV+ QG CGSCW
Sbjct: 113 MSWEEFQASRLGAAQNCSATLAGNHRMRDAPALPETKDWREDGI--VSPVKDQGHCGSCW 170

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQ 210
            F+TT  LE++          LS+ QL +C   + N  C+GG    AFEY+K   GL+++
Sbjct: 171 PFSTTGSLEARYTQATGPPVSLSEQQLADCATRYNNFGCSGGLPSQAFEYIKYNGGLDTE 230

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV-----DHMMHLLQSGPIGVYLNH-R 264
             YPY     I   C Y+ E A V V D+   + V      + + L++  P+ V      
Sbjct: 231 EAYPYTGVNGI---CHYKPENAGVKVLDSVNITLVAEDELKNAVGLVR--PVSVAFQVIN 285

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
               Y       +    +P  ++HAV  VGYG +NG+  W+++NSWG    D+GYF +E 
Sbjct: 286 GFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFTMEM 345

Query: 325 GANACGIESYA 335
           G N CGI + A
Sbjct: 346 GKNMCGIATCA 356


>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
 gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
          Length = 376

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 79/218 (36%), Positives = 117/218 (53%), Gaps = 21/218 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P+++DWRQ     +NP++ QG CGSCWAF+TTA +E    ++   L  LS+ +LV+CD 
Sbjct: 145 VPETVDWRQKGA--VNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK 202

Query: 186 G-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--- 240
             N  CNGG +D AF+++ K  GL ++ DYPYR       +C    + ++V   D +   
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRG---FGGKCNSFLKNSRVVSIDGYEDV 259

Query: 241 VTSGVDHMMHLLQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
            T     +   +   P+ V +    R+ + Y          +C  + LDHAV  VGYG +
Sbjct: 260 PTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTG---SCGTN-LDHAVVAVGYGSE 315

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGANA-----CGI 331
           NG+  WIVRNSWG    + GY ++ER   A     CGI
Sbjct: 316 NGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGI 353


>gi|67475048|ref|XP_653254.1| cysteine protease [Entamoeba histolytica HM-1:IMSS]
 gi|2507251|sp|P36184.2|ACP1_ENTHI RecName: Full=Cysteine proteinase ACP1; Flags: Precursor
 gi|1460065|emb|CAA60673.1| cysteine proteinase [Entamoeba histolytica]
 gi|56470190|gb|EAL47868.1| cysteine protease, putative [Entamoeba histolytica HM-1:IMSS]
 gi|449707486|gb|EMD47138.1| cysteine protease, putative [Entamoeba histolytica KU27]
          Length = 308

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 93/298 (31%), Positives = 145/298 (48%), Gaps = 27/298 (9%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGS--SDRSPQEILQRTGLRL 99
           AFK +    N+ + +  E   RF  F  + K  +    T  +  +D + +E +Q T L +
Sbjct: 17  AFKQWAATHNKVFANRAEYLYRFAVFLDNKKFVEANANTELNVFADMTHEEFIQ-THLGM 75

Query: 100 TGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
           T +  E     +  VK           P+S+DWR     ++NP + QG+CGSCW F TTA
Sbjct: 76  TYEVPETTSNVKAAVK---------AAPESVDWRS----IMNPAKDQGQCGSCWTFCTTA 122

Query: 160 ILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNK 218
           +LE +V      LY  S+ QLV+CD  +  C GG+   + +++++  GL  ++DYPY+  
Sbjct: 123 VLEGRVNKDLGKLYSFSEQQLVDCDASDNGCEGGHPSNSLKFIQENNGLGLESDYPYK-- 180

Query: 219 ENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHR--LIESYDGNPI 274
             +   C   K  A V      VT G +  +   + ++GP+ V ++      + Y    I
Sbjct: 181 -AVAGTCKKVKNVATV-TGSRRVTDGSETGLQTIIAENGPVAVGMDASRPSFQLYKKGTI 238

Query: 275 RRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGI 331
             +D  C    ++H V  VGYG  +    WI+RNSWG    D GYF + R + N CGI
Sbjct: 239 -YSDTKCRSRMMNHCVTAVGYGSNSNGKYWIIRNSWGTSWGDAGYFLLARDSNNMCGI 295


>gi|343470378|emb|CCD16903.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 91/316 (28%), Positives = 149/316 (47%), Gaps = 28/316 (8%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           +Q  AFK    K++R+Y D  E   RF  FKQ  +   E         +G +  SD SP+
Sbjct: 39  QQFAAFKQ---KYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPE 95

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E        L G +     A  +R +K +N    G  P ++DWR  K   + PV+ QG+C
Sbjct: 96  EF---RATYLNGAK--YYAAALKRPRKVVN-VSTGKAPPAIDWR--KKGAVTPVKDQGKC 147

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQYG 206
           GSCWAF+    +E Q  +    L  LS+  LV CD+ +  C GG +D A +++    +  
Sbjct: 148 GSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDNMDYGCRGGFLDRALKWIVSSNKGN 207

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHR 264
           + ++  YPY + +     C    +     +         ++ +   L ++GPI + ++  
Sbjct: 208 VFTEESYPYDSTDGDVPPCNKSGKVVGAKISGLINLPKDENAIAEWLAKNGPIAIAVDAS 267

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
               Y G  +     +C+   L+H V +VGY + +    WI++NSWG    + GY ++E+
Sbjct: 268 SFLDYTGGVLT----SCSSDALNHGVLLVGYDDSSKPPYWIIKNSWGKKWGEEGYIRVEK 323

Query: 325 GANACGIESYAYLASV 340
           G N C ++ YA  A V
Sbjct: 324 GTNQCLMKEYARSAVV 339


>gi|33333714|gb|AAQ11975.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 323

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 73/213 (34%), Positives = 113/213 (53%), Gaps = 15/213 (7%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD---H 185
           ++DWR  K   + PV++QG CGSCWAF+    +E Q      TL  LS  +LV+C    +
Sbjct: 112 AVDWR--KEGAVTPVKNQGHCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEYY 169

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
           GN  CNGG +  AF++V+  G++++  YPY+ K +I   C    E          + +  
Sbjct: 170 GNEGCNGGLMGQAFDFVEDEGIQTEESYPYKAKRSI---CQMNGEYVTKVKTYHLLLNEQ 226

Query: 246 DHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHK----LDHAVAIVGYGEKNGI 301
           +    +   GP+ V ++   +  YD   +   D  C   K    L+H V +VGYG +NG+
Sbjct: 227 EIARAVSAKGPVAVAIDASQLSFYDQGIV---DEKCKCSKKREDLNHGVLVVGYGSENGV 283

Query: 302 LTWIVRNSWGDIGPDHGYFQIERGANACGIESY 334
             WIV+NSWG    + GYF++++   ACGI +Y
Sbjct: 284 DYWIVKNSWGADWGEKGYFRLKKDVKACGIGNY 316


>gi|42564157|gb|AAS20590.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 322

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 103/301 (34%), Positives = 155/301 (51%), Gaps = 37/301 (12%)

Query: 52  RTYTDDNEIKTRFEYFK-------------QDGKETDEYYGTSGSSDRSPQEILQRTGLR 98
           +TY    E +TRF  F+             ++GK T  Y   +  +D +  E  ++ GL+
Sbjct: 32  KTYKSLLEERTRFGIFQNNLRTIEKHNAEYEEGKVT-YYMAVTQFADMTRDEFRKKLGLQ 90

Query: 99  LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
                +  L A      + L       LP+ +DW + K  VL P ++QG C SCWAF+TT
Sbjct: 91  --NNRRPNLNATLRVFPEDLE------LPEQIDWTE-KGAVL-PAKNQGNCRSCWAFSTT 140

Query: 159 AILESQVALLKKTLYPLSKSQLVECD--HGNLNCN-GGNIDVAFEYVKQYGLESQADYPY 215
             LE Q A+  K   PLS+ QL++C   +GN +C+ GG +  AF+Y+   G+E+++ YPY
Sbjct: 141 GSLEGQNAIHNKVKTPLSEQQLLDCSASYGNGDCDDGGLMTEAFDYIIDNGIEAESSYPY 200

Query: 216 RNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMHLLQS-GPIGVYLNHRLIESYDGNP 273
              E +T  C Y+ +K  V ++    + +  D +   + + GPI V ++   +  Y G  
Sbjct: 201 V--EQMT-ECQYDAKKTIVQIKGYKKLLADEDELKKAVGTVGPISVGMSSENLHMYGGGV 257

Query: 274 IRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIE 332
           +   D  C    +DHAV +VGYGE NG   W V+NSWG    + GYF+IER A N C I 
Sbjct: 258 L---DDQCY-FGMDHAVLVVGYGEANGKKFWKVKNSWGTTWGEDGYFRIERDANNLCDIA 313

Query: 333 S 333
           S
Sbjct: 314 S 314


>gi|444724527|gb|ELW65130.1| Cathepsin W [Tupaia chinensis]
          Length = 491

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 83/324 (25%), Positives = 149/324 (45%), Gaps = 33/324 (10%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
           + F  + +++NR+Y+   E   R + F  +  +             +G +  SD + +E 
Sbjct: 166 EVFALFQIQYNRSYSSPAEHARRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTDEEF 225

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
            Q         E  R+      V+K  + ++  P+P + DWR  K ++++P+ +Q  C  
Sbjct: 226 SQVYKQPKVPGEVPRM------VRKVRSLKQGKPVPPTCDWR--KARIISPIRNQKNCSC 277

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAFEYVKQYGLESQ 210
           CWA A    +E+Q  +       +S  +L++C      C GG + D     +   GL S+
Sbjct: 278 CWAMAAADNIEAQWGIRYNQSVKVSVQELLDCGRCGDGCKGGWVWDAFITVLNNSGLASE 337

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIES 268
            DYPY++  +   RC  ++ K   ++QD  +    + ++  +L   GPI V +N + ++ 
Sbjct: 338 KDYPYQSNVDPQ-RCRVKRNKV-AWIQDFIMLQDNEQIIAQYLASHGPITVTINMKPLKQ 395

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-----------WIVRNSWGDIGPDH 317
           Y           C+P  +DH+V +VG+G    +             WI++NSWG    + 
Sbjct: 396 YRKGVFEATPATCDPWLVDHSVLLVGFGSSKSVKGMRAGTASSKPYWILKNSWGAKWGEK 455

Query: 318 GYFQIERGANACGIESYAYLASVK 341
           GYF++ RG+N CGI  Y   A V+
Sbjct: 456 GYFRLHRGSNTCGIAKYPLTARVE 479


>gi|115457680|ref|NP_001052440.1| Os04g0311400 [Oryza sativa Japonica Group]
 gi|113564011|dbj|BAF14354.1| Os04g0311400, partial [Oryza sativa Japonica Group]
          Length = 384

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 81/230 (35%), Positives = 121/230 (52%), Gaps = 30/230 (13%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP   DWR+     + PV+ QG CGSCW+F+T+  LE    L    L  LS+ Q+V+CDH
Sbjct: 148 LPDDFDWREHGA--VGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDH 205

Query: 186 ---------GNLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVF 235
                     +  CNGG +  AF Y+ K  GL+S+ DYPY  +EN    C ++K K    
Sbjct: 206 ECDASESRACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGRENT---CKFDKSKIVAQ 262

Query: 236 VQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIV 293
           V++  V S  +  +  +L++ GP+ + +N   +++Y G       + C  H LDH V +V
Sbjct: 263 VKNFSVISVNEDQIAANLVKHGPLAIAINAAYMQTYIGG--VSCPFICGRH-LDHGVLLV 319

Query: 294 GYGEKNGILT-------WIVRNSWGDIGPDHGYFQIERGA---NACGIES 333
           GYG              WI++NSWG+   + GY++I RG    N CG++S
Sbjct: 320 GYGSAGYAPIRFKEKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDS 369


>gi|31558997|gb|AAP49831.1| cathepsin L [Fasciola hepatica]
          Length = 326

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 84/227 (37%), Positives = 123/227 (54%), Gaps = 18/227 (7%)

Query: 120 ERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQ 179
           E     +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ Q
Sbjct: 102 ETNNRAVPDKIDWRESGY--VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQ 159

Query: 180 LVECDH--GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           LV+C    GN  C+GG ++ A++Y+KQ+GLE+++ YPY   E    +C Y K+     V 
Sbjct: 160 LVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSYPYTAVEG---QCRYNKQLGVAKVT 216

Query: 238 DTW-VTSGVDHMMHLL--QSGPIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVA 291
             + V SG +  +  L    GP  V ++   +ES D    R   +    C+P +++HAV 
Sbjct: 217 GYYTVPSGSEVELKNLVGAEGPAAVAVD---VES-DFMMYRSGIYQSQTCSPLRVNHAVL 272

Query: 292 IVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYL 337
            VGYG + G   WIV+NSWG    + GY ++ R   N CGI S A L
Sbjct: 273 AVGYGTQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIASLASL 319


>gi|195488703|ref|XP_002092426.1| GE11675 [Drosophila yakuba]
 gi|194178527|gb|EDW92138.1| GE11675 [Drosophila yakuba]
          Length = 384

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 89/259 (34%), Positives = 134/259 (51%), Gaps = 23/259 (8%)

Query: 84  SDRSPQEIL-QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +D +  E L Q TGL+ + + K R  A  + V+  L E+   P+P + DWR+     + P
Sbjct: 129 ADLTHSEFLSQLTGLKRSPEAKARAAASLKEVQ--LPEK---PIPDAFDWREHGG--VTP 181

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC----DHGNLNCNGGNIDVA 198
           V+ QG CGSCWAFATT  +E        +L  LS+  LV+C    D G   C+GG  + A
Sbjct: 182 VKFQGTCGSCWAFATTGAIEGHTFRKTGSLPILSEQNLVDCGPVADFGLNGCDGGFQEAA 241

Query: 199 FEYVK--QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD--HMMHLLQS 254
           F ++   Q G+     YPY + ++    C Y+  K+   +Q        D   M  ++ +
Sbjct: 242 FCFIDEVQKGVSQAGAYPYIDSKDT---CKYDGSKSGASLQGFAAIPPKDEEQMKKVVAT 298

Query: 255 -GPIGVYLNH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGD 312
            GPI   +N    +++Y G     ND  CN  + +H++ +VGYG +NG   WIV+NSW D
Sbjct: 299 LGPIACSVNGLETLKNYAGG--IYNDDECNQGEPNHSILVVGYGSENGQDYWIVKNSWDD 356

Query: 313 IGPDHGYFQIERGANACGI 331
              + GYF++ RG N C I
Sbjct: 357 TWGEQGYFRLPRGQNYCFI 375


>gi|340053971|emb|CCC48265.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
           Y486]
          Length = 389

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 89/312 (28%), Positives = 141/312 (45%), Gaps = 27/312 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ R+Y    E   R   F+ + + +  Y        +G +  SD +P+E   R
Sbjct: 34  FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93

Query: 95  TGLRLTGKEKER-LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
                     ER  EA R RV+  + +   G  P ++DWR+     + PV+ QG CGSCW
Sbjct: 94  Y------HNGERHFEAARGRVRTLV-QVPPGKAPAAVDWRRKGA--VTPVKDQGTCGSCW 144

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG---LESQ 210
           +F+    +E Q A     L  LS+  LV CD  +  C GG +D AFE++ +     + + 
Sbjct: 145 SFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTG 204

Query: 211 ADYPYRNKENITFRCT-YEKEKAKVFVQDTWVTSGVDHMM-HLLQSGPIGVYLNHRLIES 268
             YPY +++     C  Y  E          +    D +  +L  +GP+ V ++     S
Sbjct: 205 KSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMS 264

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y G  +     +C    L+H V +VGY + +    WI++NSW     + GY +IE+G N 
Sbjct: 265 YSGGVVT----SCTSEALNHGVLLVGYNDSSKPPYWIIKNSWSSSWGEKGYIRIEKGTNQ 320

Query: 329 CGIESYAYLASV 340
           C +   A  A V
Sbjct: 321 CLVAQLASSAVV 332


>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
          Length = 470

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 98/314 (31%), Positives = 149/314 (47%), Gaps = 40/314 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQR-------- 94
           ++ ++ K  R Y    E + RFE FK +    D +   + +  RS +  L R        
Sbjct: 50  YEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLGLNRFADMTNEE 109

Query: 95  -----TGLRLTG-KEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
                 G R  G + + R+ +DR R     +      LP+S+DWR      +  V+ QG 
Sbjct: 110 YRAVYLGTRPAGHRRRARVGSDRYRYNAGED------LPESVDWRAKGA--VAAVKDQGS 161

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYG 206
           CGSCWAF+T A +E    ++   L  LS+ +LV+CD+G N  CNGG +D  FE++    G
Sbjct: 162 CGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQGCNGGLMDYGFEFIINNGG 221

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL-- 261
           ++++ DYPY  ++    +C   ++ AKV   D +    V+    +   + + P+ V +  
Sbjct: 222 IDTEEDYPYTARDG---KCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEA 278

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
             R  + Y           C    LDH V  VGYG +NG   WIVRNSWG    + GY +
Sbjct: 279 GGREFQLYHSGIFTGR---CGT-DLDHGVVAVGYGTENGKDYWIVRNSWGGDWGESGYIR 334

Query: 322 IERGANA----CGI 331
           +ER  N     CGI
Sbjct: 335 MERNVNTSTGKCGI 348


>gi|344258279|gb|EGW14383.1| Cathepsin L1 [Cricetulus griseus]
          Length = 295

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 119/230 (51%), Gaps = 12/230 (5%)

Query: 118 LNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSK 177
             E + G +PKS+DWR  K   + PV+ QG CG+CWAF+    L  Q+      L PLS+
Sbjct: 71  FQEPRLGDVPKSVDWR--KHGYVTPVKDQGSCGACWAFSAVGSLVGQMFWKTGKLVPLSE 128

Query: 178 SQLVEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKV 234
             LV+C   HGN+ C+GG +  AF+YV    GL++   YPY ++ N T R   E   A V
Sbjct: 129 QNLVDCSWSHGNIGCHGGLMQNAFQYVMDNGGLDTSESYPYESR-NTTCRYNPENSAANV 187

Query: 235 FVQDTWVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAI 292
                   +    M  +   GPI   ++  H   + Y G      +  C+   LDHAV +
Sbjct: 188 TGFVKIPANEYSLMKAVAIVGPISAAIDTKHHSFQFYRGGMYYEPE--CSSSNLDHAVLV 245

Query: 293 VGYGEK-NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           VGYGE+ +G   W+V+NSWG     +GY ++ R   N CGI +YA   +V
Sbjct: 246 VGYGEESDGRKYWLVKNSWGTYWGMNGYIKMARDRNNNCGIATYAMYPTV 295


>gi|374414520|pdb|3QJ3|A Chain A, Structure Of Digestive Procathepsin L2 Proteinase From
           Tenebrio Molitor Larval Midgut
 gi|374414521|pdb|3QJ3|B Chain B, Structure Of Digestive Procathepsin L2 Proteinase From
           Tenebrio Molitor Larval Midgut
          Length = 331

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 92/270 (34%), Positives = 134/270 (49%), Gaps = 26/270 (9%)

Query: 84  SDRSPQEILQRT-GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +D +P+E+   T GL +     +     + R    LN   +   P S DWR   +  ++P
Sbjct: 75  TDMTPEEMKAYTHGLIMPADLHKNGIPIKTREDLGLNASVR--YPASFDWRDQGM--VSP 130

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLY--PLSKSQLVECDHGNLNCNGGNIDVAFE 200
           V++QG CGS WAF++T  +ESQ+ +     Y   +S+ QLV+C    L C+GG ++ AF 
Sbjct: 131 VKNQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVPNALGCSGGWMNDAFT 190

Query: 201 YVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM---HLLQSGP 256
           YV Q G ++S+  YPY   +     C Y+  +    +      SG D  M    +   GP
Sbjct: 191 YVAQNGGIDSEGAYPYEMADG---NCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGP 247

Query: 257 IGVYLNHR-LIESYDG----NPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG 311
           + V  +      SY G    NP       C  +K  HAV IVGYG +NG   W+V+NSWG
Sbjct: 248 VAVAFDADDPFGSYSGGVYYNPT------CETNKFTHAVLIVGYGNENGQDYWLVKNSWG 301

Query: 312 DIGPDHGYFQIERGA-NACGIESYAYLASV 340
           D     GYF+I R A N CGI   A + ++
Sbjct: 302 DGWGLDGYFKIARNANNHCGIAGVASVPTL 331


>gi|407036622|gb|EKE38272.1| cysteine protease, putative [Entamoeba nuttalli P19]
          Length = 308

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 93/298 (31%), Positives = 145/298 (48%), Gaps = 27/298 (9%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGS--SDRSPQEILQRTGLRL 99
           AFK +    N+ + +  E   RF  F  + K  +    T  +  +D + +E +Q T L +
Sbjct: 17  AFKQWAATHNKVFANRAEYLYRFAVFLDNKKFVEANANTELNVFADMTHEEFIQ-THLGM 75

Query: 100 TGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
           T +  E     +  VK           P+S+DWR     ++NP + QG+CGSCW F TTA
Sbjct: 76  TYEIPETTSNVKAAVK---------AAPESVDWRS----IMNPAKDQGQCGSCWTFCTTA 122

Query: 160 ILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNK 218
           +LE +V      LY  S+ QLV+CD  +  C GG+   + +++++  GL  ++DYPY+  
Sbjct: 123 VLEGRVNKDLGKLYSFSEQQLVDCDTSDNGCEGGHPTNSLKFIQENNGLGLESDYPYK-- 180

Query: 219 ENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLL--QSGPIGVYLNHR--LIESYDGNPI 274
             +   C   K  A V      VT G +  +  +  ++GP+ V ++      + Y    I
Sbjct: 181 -AVAGTCKKVKNVATV-TGSKRVTDGSETGLQTIIAENGPVAVGMDASRPTFQLYKKGTI 238

Query: 275 RRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGI 331
             +D  C    ++H V  VGYG  +    WI+RNSWG    D GYF + R + N CGI
Sbjct: 239 -YSDARCRSRMMNHCVTAVGYGSNSNGKYWIIRNSWGTSWGDAGYFLLARDSNNMCGI 295


>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
 gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
          Length = 375

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 79/218 (36%), Positives = 115/218 (52%), Gaps = 21/218 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P+++DWR      +NP++ QG CGSCWAF+T A +E    ++   L  LS+ +LV+CD+
Sbjct: 145 VPETVDWRLKGA--VNPIKDQGTCGSCWAFSTAAAVEGINKIVTGELISLSEQELVDCDN 202

Query: 186 G-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
             N  CNGG +D AF+++ K  GL+++ DYPYR       +C    + AKV   D +   
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLKTEKDYPYRG---FGGKCNSFLKNAKVVSIDGYEDV 259

Query: 244 GVDHMMHL-----LQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
                  L     LQ   + +    R+ + Y       N   C  + LDHAV  VGYG +
Sbjct: 260 PTKDETALKRAISLQPVSVAIEAGGRIFQHYQTGIFTGN---CGTN-LDHAVVAVGYGSE 315

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERG-----ANACGI 331
           NG+  WIVRNSWG    + GY ++ER      +  CGI
Sbjct: 316 NGVDYWIVRNSWGPRWGEEGYIRMERNLASSKSGKCGI 353


>gi|410914437|ref|XP_003970694.1| PREDICTED: cathepsin O-like [Takifugu rubripes]
          Length = 328

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 86/294 (29%), Positives = 141/294 (47%), Gaps = 11/294 (3%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGT-SGSSDRSPQEILQRTGLRLTG 101
           F+ +  ++ R Y  ++    R  +F Q+      Y  + S +S  +   I Q + L    
Sbjct: 32  FEWFRERFGRNYEVNSPQFDRRLFFFQESTTRHAYLNSFSAASQSAKYGINQFSDLSQRE 91

Query: 102 KEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAIL 161
            +   L A  +R   F  ++ +G LP   DWR   +  + PV++Q  CGSCWAF+    +
Sbjct: 92  FQDLYLRASADRAPAFSGQKAEG-LPAKFDWRDHAI--VAPVQNQQACGSCWAFSVVGAV 148

Query: 162 ESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ--YGLESQADYPYRNKE 219
           +S  A+    L  LS  Q+++C   N  CNGG    A +++ Q    L  Q++YPY+ + 
Sbjct: 149 QSVHAIGGSQLVELSVQQVLDCSFQNKGCNGGTPVAALKWLTQTRVKLVPQSEYPYKAQT 208

Query: 220 NIT--FRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRN 277
            +   F  ++     K F    +       M HL++ GP+ V ++    + Y G  I+ +
Sbjct: 209 RMCHFFSGSHGGVGVKNFTALDFSGQEEAMMGHLVKHGPLSVVVDALSWQDYLGGIIQYH 268

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGI 331
              C+  + +HAV +VGY     I  WIV+NSWG    D GY  ++ G+N CGI
Sbjct: 269 ---CSSKRSNHAVLVVGYDTTGDIPYWIVQNSWGTTWGDKGYVYMKVGSNICGI 319


>gi|359492709|ref|XP_002280798.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
 gi|147841854|emb|CAN73591.1| hypothetical protein VITISV_022889 [Vitis vinifera]
 gi|302142582|emb|CBI19785.3| unnamed protein product [Vitis vinifera]
          Length = 371

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 94/318 (29%), Positives = 147/318 (46%), Gaps = 42/318 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ +TY    E   RF  FK + +    +        +G +  SD +P+E  Q 
Sbjct: 56  FAEFKTKFGKTYATAEEHDHRFNVFKANLRRAKRHQLLDPSAEHGVTQFSDLTPREFRQN 115

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                 G ++ +L AD ++      +     LP   DWR      +  V+ QG CGSCW+
Sbjct: 116 ----YLGLKRLQLPADAQKAPILPTKD----LPTDFDWRDHGA--VTAVKDQGYCGSCWS 165

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVEC---------DHGNLNCNGGNIDVAFEYV-KQ 204
           F+T   LE    L    L  LS  QL++C         D  +  CNGG ++ AFEY+ K 
Sbjct: 166 FSTIGALEGAHFLATGNLVSLSTQQLLDCDTECDPEEYDACDDGCNGGLMNNAFEYILKA 225

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN 262
            G+  + DYPY   +     C + K K    V +  V S  +  +  +L+++GP+ V +N
Sbjct: 226 GGVAQEEDYPYTGTDRGL--CRFNKTKIAASVANFSVVSLDEDQIAANLVKNGPLAVGIN 283

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGP 315
              +++Y         + C+   LDH V +VGYG              WI++NSWG+   
Sbjct: 284 AVFMQTYKSG--VSCPYICSS-TLDHGVLLVGYGSAGYSPIRFKEKPYWIIKNSWGESWG 340

Query: 316 DHGYFQIERGANACGIES 333
           + GY++I RG N CG++S
Sbjct: 341 EQGYYKICRGHNICGVDS 358


>gi|66394764|gb|AAY46196.1| cathepsin L-like cysteine proteinase [Globodera pallida]
          Length = 379

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 87/244 (35%), Positives = 128/244 (52%), Gaps = 22/244 (9%)

Query: 106 RLEAD--RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILES 163
           RL  D  R     FL     G LP+S+DWR      +  V++QG CGSCWAF++T  LE+
Sbjct: 139 RLLGDNLRRNASTFLAPMNVGDLPESVDWRDKGW--VTEVKNQGMCGSCWAFSSTGALEA 196

Query: 164 QVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKEN 220
           Q A     L  LS+  L++C   +GN+ CNGG +D AF+Y+K   G++ + DYPY+ K  
Sbjct: 197 QHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNNGVDKELDYPYKAKTG 256

Query: 221 ITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQS--GPIGVYLN--HRLIESYDGNP 273
              +C +++    V   DT    +  G +  + +  +  GP  V ++  HR  + Y    
Sbjct: 257 K--KCLFKRN--DVGATDTGFFDIAEGDEEKLKIAVATQGPASVAIDAGHRSFQLYTHGV 312

Query: 274 IRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGI 331
               +  C+P  LDH V +VGYG +      WIV+NSWG    + GY ++ R   N CGI
Sbjct: 313 YFEKE--CSPENLDHGVLVVGYGTDAQQGDYWIVKNSWGAHWGEQGYIRMARNRKNNCGI 370

Query: 332 ESYA 335
            S+A
Sbjct: 371 ASHA 374


>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
          Length = 333

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 80/220 (36%), Positives = 119/220 (54%), Gaps = 18/220 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP+S+DWR+     + PV++QG+CGSCWAF+TT  LE Q  L    L  LS+  LV+C  
Sbjct: 117 LPQSMDWREKGA--VTPVKNQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSE 174

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKV----FVQD 238
             GN  C GG +D AF+Y+K   G++++  YPY  ++     C ++K+        FV D
Sbjct: 175 TFGNHGCEGGLMDNAFQYIKANGGIDTEKSYPYEAEDG---ECRFKKQNVGATDTGFV-D 230

Query: 239 TWVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
               S  D    +   GP+ V ++  H   + Y        +  C+  +LDH V +VGYG
Sbjct: 231 IEQGSEDDLKKAVATVGPVSVAIDASHSSFQLYSEGVYDETE--CSSEQLDHGVLVVGYG 288

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            ++G   W+V+NSW +   D+GY ++ R   N CGI S A
Sbjct: 289 VEDGKKYWLVKNSWAESWGDNGYIKMSRDKDNQCGIASAA 328


>gi|357619727|gb|EHJ72186.1| cathepsin [Danaus plexippus]
          Length = 336

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 91/308 (29%), Positives = 149/308 (48%), Gaps = 23/308 (7%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           F+ +I ++N+ Y D  E + RF+ F  + K  ++    S ++     +        L+ +
Sbjct: 41  FENFIREYNKKY-DSKEKEERFKIFVNNLKRINDLNHKSTNAVHGINKFTD-----LSKE 94

Query: 103 EKERLEADRERVKKFLNERKKGP--------LPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           E ++     +  K FL++  K P         P + DWR   V  +  V++QG CGSCWA
Sbjct: 95  EFKKFYTGFKPDKSFLDDNIKKPSQLSFNITAPPAFDWRDKGV--VTRVKNQGTCGSCWA 152

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYP 214
           F+T   +ES  A+    L  LS+ QLV+CD  +  C+ G  D A +Y+  +G  S+  YP
Sbjct: 153 FSTIGNVESVNAIKHGNLVELSEQQLVDCDSKDEACDSGLPDNAQQYLVSHGAISEQSYP 212

Query: 215 YRNKENITFRCTYEKEKAKVFVQ--DTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGN 272
           Y+        CTY+  +  V +   +  V S       L  + P+ + +   ++ +Y   
Sbjct: 213 YK---GYAANCTYDSSQVVVRLSNFEKVVLSECQMAEKLYSTAPLSIVIAAEVLGTYTKG 269

Query: 273 PIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIE 332
            I  N+       L+HAV +VGYG + G   WI++NSWG    + GYF+I+RG N   I 
Sbjct: 270 -ILVNECE-QSQDLNHAVLLVGYGNEGGTNFWILKNSWGTNWGEGGYFRIKRGVNCLMIT 327

Query: 333 SYAYLASV 340
            Y  L+ +
Sbjct: 328 DYGVLSGI 335


>gi|417399134|gb|JAA46597.1| Putative cathepsin l1 [Desmodus rotundus]
          Length = 335

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 88/248 (35%), Positives = 125/248 (50%), Gaps = 22/248 (8%)

Query: 107 LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVA 166
           L+  + R  +   E     +P S+DWRQ     + PV++QG+CGSCWAF+    LE Q+ 
Sbjct: 96  LKQKQHRNGRLFREPLFAEIPSSVDWRQKGY--VTPVKNQGQCGSCWAFSANGALEGQMF 153

Query: 167 LLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITF 223
                L  LS+  LV+C H  GN  CNGG +D AF+YVK   GL+S+  YPY  +E+ T 
Sbjct: 154 RKTGKLVSLSEQNLVDCSHSQGNQGCNGGLMDNAFQYVKDNKGLDSEESYPYLGRESNT- 212

Query: 224 RCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRN 277
            C Y  E +     DT       H   L+++    GPI V ++  H   + Y        
Sbjct: 213 -CNYRPEYSA--ANDTGFVDIPQHERGLMKAVATVGPISVAIDAGHSSFQFYSEGIYYEP 269

Query: 278 DWACNPHKLDHAVAIVGYG----EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIE 332
           +  C+   LDH V +VGYG    + +    WIV+NSWG      GY ++ R  +N CGI 
Sbjct: 270 N--CSSKDLDHGVLVVGYGSEGAQSDSNKFWIVKNSWGTGWGMSGYVKMARDQSNHCGIA 327

Query: 333 SYAYLASV 340
           + A   +V
Sbjct: 328 TAASYPTV 335


>gi|313235127|emb|CBY24999.1| unnamed protein product [Oikopleura dioica]
          Length = 326

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 77/219 (35%), Positives = 114/219 (52%), Gaps = 14/219 (6%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P S DWR +   V+ PV+ QG+CGSCWAF+T A LESQ AL    L  LS+ QLV+C  
Sbjct: 108 MPASADWRTANPPVVTPVKDQGQCGSCWAFSTIASLESQWALAGNALTSLSEQQLVDCSM 167

Query: 186 --GNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-V 241
             GN  C+GG +   F Y+    G++++A YPY  ++    +C +        +   + +
Sbjct: 168 NWGNYGCSGGLMTQGFTYIHDNNGVDTEASYPYTAQDG---KCVFNPANVGTSLTSCYNI 224

Query: 242 TSGVDHMM--HLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
            SG +  +   +   GP+ V ++  H   + Y        +  C+   LDH V  VGYG 
Sbjct: 225 ASGDEAALANAVQMVGPMSVAIDASHMSFQLYTSGVYYEPN--CSSQFLDHGVTAVGYGS 282

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
            NG   +IV+NSW     D+GY  + R  +N CGI + A
Sbjct: 283 SNGNDFFIVKNSWAATWGDNGYIMMSRNKSNNCGIATSA 321


>gi|74765984|sp|Q24940.1|CATLL_FASHE RecName: Full=Cathepsin L-like proteinase; Flags: Precursor
 gi|497700|gb|AAA29136.1| cathepsin [Fasciola hepatica]
          Length = 326

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 79/223 (35%), Positives = 122/223 (54%), Gaps = 16/223 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 108 VPDKIDWRESGY--VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 165

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN  C+GG ++ A++Y+KQ+GLE+++ YPY   E    +C Y K+     V   + V 
Sbjct: 166 PWGNNGCSGGLMENAYQYLKQFGLETESSYPYTAVEG---QCRYNKQLGVAKVTGYYTVH 222

Query: 243 SG----VDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
           SG    + +++   +   + V +    +    G  I ++   C+P +++HAV  VGYG +
Sbjct: 223 SGSEVELKNLVGARRPAAVAVDVESDFMMYRSG--IYQSQ-TCSPLRVNHAVLAVGYGTQ 279

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
            G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 280 GGTDYWIVKNSWGTYWGERGYIRMARNRGNMCGIASLASLPMV 322


>gi|197258082|gb|ACH56225.1| cathepsin L-like cysteine proteinase [Bursaphelenchus xylophilus]
          Length = 282

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 85/222 (38%), Positives = 119/222 (53%), Gaps = 21/222 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           +P ++DWR      + PV++QG CGSCWAF+ T  LE Q       L  LS+  LV+C  
Sbjct: 65  VPDAVDWRDEGY--VTPVKNQGMCGSCWAFSATGSLEGQHKRATGKLVSLSEQNLVDCSA 122

Query: 184 DHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
           D GN  CNGG +D AFEYVKQ +G++++  YPY+ K+    +C +  +KA V   DT   
Sbjct: 123 DFGNNGCNGGLMDFAFEYVKQNHGIDTEESYPYKAKQK---KCHF--QKANVGADDTGFV 177

Query: 243 SGVDHMMHLLQS-----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
              +     L++     GP+ V ++  HR    Y           C+P +LDH V +VGY
Sbjct: 178 DLPEADEEQLKAAVASQGPVSVAIDAGHRSFRLYKTGVYYEKH--CSPEQLDHGVLVVGY 235

Query: 296 G-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G +      WIV+NSWG+   + GY +I R   N CGI S A
Sbjct: 236 GTDPEHGDYWIVKNSWGEEWGEKGYVRIARNRNNHCGIASKA 277


>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
          Length = 332

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 117/221 (52%), Gaps = 18/221 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LP  +DWR+     + PV++QG+CGSCWAF++T  LE Q       L PLS+  LV+C  
Sbjct: 114 LPTHVDWREDGA--VTPVKNQGQCGSCWAFSSTGSLEGQTFRKTGKLIPLSEQNLVDCSR 171

Query: 185 -HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEK---AKVFVQDT 239
            +GN  C GG +D AF Y++   G++++  YPY   E +  RC Y+  K   + +   D 
Sbjct: 172 KYGNNGCEGGLMDFAFTYIRDNKGIDTEGSYPY---EGVGGRCHYDPSKKGSSDIGFVDV 228

Query: 240 WVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG- 296
              S  + +  +   GP+ V ++  H   + Y       +   C+P  LDH V +VGYG 
Sbjct: 229 KKGSEEELLKAVASVGPVSVAIDASHMSFQFYSHGVYFESK--CSPENLDHGVLVVGYGT 286

Query: 297 -EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
            E +G   W+V+NSW +   D GY ++ R   N CGI S A
Sbjct: 287 DENSGEDYWLVKNSWSENWGDQGYIKMARNKKNMCGIASSA 327


>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
          Length = 376

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 99/316 (31%), Positives = 149/316 (47%), Gaps = 37/316 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           +  ++ K  + Y    E + RFE FK + K  DE+     S +RS +  L R    LT +
Sbjct: 47  YAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEH----NSENRSYKVGLNRFA-DLTNE 101

Query: 103 EKER--LEADRERVKKFLNERKKG---------PLPKSLDWRQSKVKVLNPVESQGRCGS 151
           E     L    +  ++F+  +             LP+S+DWR+S    + P++ QG CGS
Sbjct: 102 EYRSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGA--VAPIKDQGSCGS 159

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLES 209
           CWAF+T A +E    +    +  LS+ +LV+CD   +  CNGG +D AFE++    G+++
Sbjct: 160 CWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAGCNGGLMDYAFEFIINNGGIDT 219

Query: 210 QADYPYRNKENITFRCTYEKEKAKVF-VQDTWVTSGVDHMM--HLLQSGPIGVYL--NHR 264
           + DYPYR    +   C  E++  KV  + D       D M     +   P+ V +  + R
Sbjct: 220 EEDYPYRG---VDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVSVAIEASGR 276

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
             + Y           C    LDH V +VGYG  NG   WIVRNSWG    ++GY ++ER
Sbjct: 277 AFQLYLSGVFTGE---CG-RALDHGVVVVGYGTDNGADHWIVRNSWGTSWGENGYIRMER 332

Query: 325 G-----ANACGIESYA 335
                    CGI   A
Sbjct: 333 NVVDNFGGKCGIAMQA 348


>gi|162460343|ref|NP_001105479.1| cysteine protease2 precursor [Zea mays]
 gi|1491774|emb|CAA68192.1| cysteine protease [Zea mays]
          Length = 360

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 94/318 (29%), Positives = 146/318 (45%), Gaps = 47/318 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQ--------DGKETDEYYGTSGSSDRSPQEI--- 91
           F  + V++ ++Y    E+  RF  F +        + K      G +  +D S +E    
Sbjct: 59  FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYRLGINRFADMSWEEFRAT 118

Query: 92  ----LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
                Q     LTG  + R  A                LP++ DWR+  +  ++PV++QG
Sbjct: 119 RLGAAQNCSATLTGNHRMRAAAV--------------ALPETKDWREDGI--VSPVKNQG 162

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-Q 204
            CGSCW F+TT  LE+           LS+ QLV+C     N  CNGG    AFEY+K  
Sbjct: 163 HCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGLAFNNFGCNGGLPSQAFEYIKYN 222

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMH----LLQSGPIG- 258
            GL+++  YPY+    I+    ++ E   V V D+  +T G +  +     L++   +  
Sbjct: 223 GGLDTEESYPYQGVNGIS---KFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAF 279

Query: 259 -VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            V    RL   Y       +     P  ++HAV  VGYG ++G+  W+++NSWG    D 
Sbjct: 280 EVITGFRL---YKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDE 336

Query: 318 GYFQIERGANACGIESYA 335
           GYF++E G N CG+ + A
Sbjct: 337 GYFKMEMGKNMCGVATCA 354


>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
          Length = 324

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 80/225 (35%), Positives = 120/225 (53%), Gaps = 28/225 (12%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           +PK++DWR+     + PV++QG+CGSCWAF++T  LE QV      L  +S+  LV+C  
Sbjct: 108 VPKTVDWREKGY--VTPVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSR 165

Query: 184 DHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
           D GN+ C+GG +D AF Y+K+  G++S+  YPY   E +   C Y+K  +          
Sbjct: 166 DEGNMGCSGGLMDNAFTYIKKNMGIDSEKSYPY---EAVDGECRYKKSDSVT------TD 216

Query: 243 SGVDHMMH---------LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVA 291
           SG   + H         +   GP+ V ++  H   + Y        +  C+  +LDH V 
Sbjct: 217 SGFVDIPHGDETALRTAVASVGPVSVAIDASHTSFQFYKTGVYTEAN--CSSTQLDHGVL 274

Query: 292 IVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
           +VGYG +NG   W+V+NSWG    + GY ++ R   N CGI S A
Sbjct: 275 VVGYGVENGQDYWLVKNSWGASWGEAGYIKLARNHGNQCGIASQA 319


>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
          Length = 502

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 80/218 (36%), Positives = 117/218 (53%), Gaps = 14/218 (6%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P SLDWR  K   +  V++QG CGSCWAF++T  +E   A+    L  LS+ +LV+CD  
Sbjct: 147 PASLDWR--KRGAVTAVKNQGDCGSCWAFSSTGAMEGINAITTGELISLSEQELVDCDTT 204

Query: 187 NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--VTS 243
           N  C+GG +D AFE+V    G++S+A+YPY  + +    C   KE+ KV   D +  V +
Sbjct: 205 NEGCDGGYMDYAFEWVINNGGIDSEANYPYTGQADSV--CNTTKEEIKVVSIDGYEDVAT 262

Query: 244 GVDHMMHLLQSGPIGVYLNHRLI--ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGI 301
               ++      P+ V ++   +  + Y G  I   D + NP  +DHAV +VGYG++ G 
Sbjct: 263 SESALLCAAVQQPVSVGIDGSSLDFQLYAGG-IYDGDCSGNPDDIDHAVLVVGYGQQGGT 321

Query: 302 LTWIVRNSWGDIGPDHGYFQIERGA----NACGIESYA 335
             WIV+NSWG      GY  I R        C I++ A
Sbjct: 322 DYWIVKNSWGTDWGMQGYIYIRRNTGLPYGVCAIDAMA 359


>gi|222641669|gb|EEE69801.1| hypothetical protein OsJ_29533 [Oryza sativa Japonica Group]
          Length = 314

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 85/269 (31%), Positives = 129/269 (47%), Gaps = 30/269 (11%)

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNE------------------RKKGPLPKSLDWR 133
           L+ T +   G+ ++ L   R  V+ F                     R    LP++ DWR
Sbjct: 45  LESTVIAALGRTRDALRFARFAVRSFRRAGSGAAQNCSATLAGNHRMRDAAALPETKDWR 104

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCN 191
           +  +  ++PV+ QG CGSCW F+TT  LE+           LS+ QLV+C   + N  C+
Sbjct: 105 EDGI--VSPVKDQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCS 162

Query: 192 GGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMM 249
           GG    AFEY+K   GL+++  YPY     I   C Y+ E   V V D+  +T G +  +
Sbjct: 163 GGLPSQAFEYIKYNGGLDTEEAYPYTGVNGI---CHYKPENVGVKVLDSVNITLGAEDEL 219

Query: 250 HLLQSGPIGVYLNHRLIES---YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIV 306
                    V +  ++I     Y       +    +P  ++HAV  VGYG +NG+  W++
Sbjct: 220 KNAVGLVRPVSVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLI 279

Query: 307 RNSWGDIGPDHGYFQIERGANACGIESYA 335
           +NSWG    D+GYF++E G N CGI + A
Sbjct: 280 KNSWGADWGDNGYFKMEMGKNMCGIATCA 308


>gi|410910990|ref|XP_003968973.1| PREDICTED: cathepsin K-like [Takifugu rubripes]
          Length = 329

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 78/221 (35%), Positives = 121/221 (54%), Gaps = 12/221 (5%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-- 184
           P+S+DWR  K  +++PV++QG CGSCWAF++   LE Q+      L PLS   L++C   
Sbjct: 114 PQSVDWR--KHGLVSPVQNQGYCGSCWAFSSLGALEGQMKRKTGFLVPLSPQNLLDCSTS 171

Query: 185 HGNLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYE-KEKAKVFVQDTWVT 242
            GNL C GG I  ++ Y+ +  G++S++ YPY +++    +C Y  K KA    +   + 
Sbjct: 172 DGNLGCRGGYISKSYSYIIRNGGVDSESFYPYEHQKG---KCRYSVKGKAGYCSRFHILP 228

Query: 243 SGVDHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
            G +  +   + + GP+ V +N  L   +       N   CNP  ++HAV +VGYG   G
Sbjct: 229 QGDEETLKATVARVGPVAVAVNAMLASFHLYRGGLYNVPNCNPKFINHAVLVVGYGSSEG 288

Query: 301 ILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
              W+V+NSWG    + GY ++ R   N CGI S+A   S+
Sbjct: 289 QDFWLVKNSWGSAWGEEGYIRLARNKKNLCGIASFAVYPSL 329


>gi|294883340|ref|XP_002770717.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239874002|gb|EER02722.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 333

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 80/231 (34%), Positives = 113/231 (48%), Gaps = 12/231 (5%)

Query: 107 LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVA 166
           LE    R  KF+ E     LP S+DWR   V  L+PV++QG CGSCWAF+    LE+Q A
Sbjct: 92  LEMSTRRDDKFVVEADTTQLPTSVDWRNKSV--LSPVKNQGSCGSCWAFSAAGALEAQYA 149

Query: 167 LLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFR 224
           +    L PLS  +LV+C   +GN  C GG +  A++Y+K  GL+ ++ YPY+      FR
Sbjct: 150 IATGKLRPLSVQELVDCSSSYGNKGCLGGLMTNAYKYIKSAGLDQESTYPYKGWNKHCFR 209

Query: 225 CTYEKE---KAKVFVQDTWVTSGVDHMMHLLQSGPI--GVYLNHRLIESYDGNPIRRNDW 279
            + +K     A        +      +M  L + P+   +Y   R    Y          
Sbjct: 210 SSEKKADGIPAGEVTGSHMLAQTEQSLMKALAAAPVSLAMYARDRNFRFYRSGVYSST-- 267

Query: 280 ACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACG 330
            CN  ++DH V  VGYG   G   +I++NSWG      GYF ++RG    G
Sbjct: 268 TCNG-EIDHGVVAVGYGADKGSDYFILKNSWGSSWGIGGYFYLKRGVGGFG 317


>gi|385298943|gb|AFI60244.1| cysteine protease/senescence-enhanced 1, partial [Panicum virgatum]
          Length = 282

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 75/222 (33%), Positives = 115/222 (51%), Gaps = 12/222 (5%)

Query: 121 RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQL 180
           R    LP++ DWR+  +  ++PV++QG CGSCW F+TT  LE+           LS+ QL
Sbjct: 60  RAAAALPETKDWREDGI--VSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPVSLSEQQL 117

Query: 181 VECD--HGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           V+C   + N  CNGG    AFEY+K  G L+++  YPY+    +   C ++     V V 
Sbjct: 118 VDCAGAYNNFGCNGGLPSQAFEYIKHNGGLDTEESYPYKGVNGL---CQFKASNVGVKVL 174

Query: 238 DTW-VTSGVDHMMHLLQSGPIGVYLNHRLIES---YDGNPIRRNDWACNPHKLDHAVAIV 293
           D+  +T G ++ +         V +   +I     Y       +     P  ++HAV  V
Sbjct: 175 DSVNITLGAENELKDAVGLVRPVSVAFEVINGFRLYKSGVYTSDHCGTTPMDVNHAVLAV 234

Query: 294 GYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           GYG +NG+  W+++NSWG    D GYF++E G N CG+ + A
Sbjct: 235 GYGVENGVPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCA 276


>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 469

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 97/306 (31%), Positives = 150/306 (49%), Gaps = 29/306 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           ++ ++VK  ++Y    E + RFE FK + +  +E+         G +  +D + +E   R
Sbjct: 54  YEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVGLNRFADLTNEEYRSR 113

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
               L  +++ R      RV    + R    LP+S+DWR+    V  PV+ QG CGSCWA
Sbjct: 114 Y---LGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVV--PVKDQGNCGSCWA 168

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQAD 212
           F+T A +E    +    L  LS+ +LV+CD   N  CNGG +D AFE++    G++S+ D
Sbjct: 169 FSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEED 228

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHRLIE 267
           YPYR  +     C   ++ A+V   D +     +    L   + + P+ V +    R  +
Sbjct: 229 YPYRAADTT---CDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQ 285

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            Y           C   +LDH V  VGYG +N +  WIVRNSWG    + GY ++ER  N
Sbjct: 286 LYQSGVFTGQ---CG-TQLDHGVVAVGYGTENSVDYWIVRNSWGPNWGESGYIKLER--N 339

Query: 328 ACGIES 333
             G E+
Sbjct: 340 LAGTET 345


>gi|297688135|ref|XP_002821545.1| PREDICTED: cathepsin W [Pongo abelii]
          Length = 376

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 90/344 (26%), Positives = 155/344 (45%), Gaps = 40/344 (11%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGT 80
           +DL    ++  +AFK + +++NR+Y    E   R + F  +  +             +G 
Sbjct: 29  QDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFANNLAQAQRLQEEDLGTAEFGV 88

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           +  SD + +E  Q  G R        +   RE   + L E     +P + DWR+     +
Sbjct: 89  TPFSDLTEEEFGQLYGYRRAAGGVPSM--GREIRSEELEES----VPFTCDWRKV-AGAI 141

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAF 199
           +P++ Q  C  CWA A    +E+   +       +S  +L++C      C+GG + D   
Sbjct: 142 SPIKDQKNCNCCWAMAAAGNIETLWRINFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFI 201

Query: 200 EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPI 257
             +   GL S+ DYP++ K     RC  +K +   ++QD  +    +H +  +L   GPI
Sbjct: 202 TVLNNSGLASEKDYPFQGKVR-AHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPI 260

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG---EKNGILT----------- 303
            V +N +L++ Y    I+     C+P  +DH+V +VG+G    + GI             
Sbjct: 261 TVTINMKLLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGNVKSEEGIWAETVLSQSQPQP 320

Query: 304 ------WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
                 WI++NSWG    + GYF++ RG+N CGI  +   A V+
Sbjct: 321 PHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLTARVQ 364


>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
 gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
          Length = 336

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 85/224 (37%), Positives = 118/224 (52%), Gaps = 18/224 (8%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G LPKS+DWR S +  ++ V+ QG CGSCWAF+TT  LE Q A     L  LS+ QLV+C
Sbjct: 114 GTLPKSVDWRNSAM--VSEVKDQGECGSCWAFSTTGSLEGQHANKTGKLVDLSEQQLVDC 171

Query: 184 --DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEK-AKVFVQDT 239
             D GN  C GG +D AF+Y+K   GL+++  YPY   ++    C ++        +   
Sbjct: 172 SKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKP--CKFDNSSVGATLIGYK 229

Query: 240 WVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            V SG +H +   +   GPI V ++  H   + Y       ++  C+  +LDH V +VGY
Sbjct: 230 DVKSGNEHALKRAVATVGPISVAIDAGHESFQFYSSGVY--DEPQCSSEQLDHGVLVVGY 287

Query: 296 GEKNG---ILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G  N       WIV+NSWG    D GY  + R   N CGI + A
Sbjct: 288 GAMNDNSHQAFWIVKNSWGPNWGDQGYIMMSRNKDNQCGIATSA 331


>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 79/218 (36%), Positives = 117/218 (53%), Gaps = 21/218 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P+++DWRQ     +NP++ QG CGSCWAF+TTA +E    ++   L  LS+ +LV+CD 
Sbjct: 145 VPETVDWRQKGA--VNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK 202

Query: 186 G-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--- 240
             N  CNGG +D AF+++ K  GL ++ DYPYR       +C    + ++V   D +   
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRG---FGGKCNSFLKNSRVVSIDGYEDV 259

Query: 241 VTSGVDHMMHLLQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
            T     +   +   P+ V +    R+ + Y          +C  + LDHAV  VGYG +
Sbjct: 260 PTKDETALKKAISYQPVRVAIEAGGRIFQHYQSGIFTG---SCGTN-LDHAVVAVGYGSE 315

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGANA-----CGI 331
           NG+  WIVRNSWG    + GY ++ER   A     CGI
Sbjct: 316 NGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGI 353


>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
          Length = 522

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 85/246 (34%), Positives = 126/246 (51%), Gaps = 25/246 (10%)

Query: 106 RLEADRERVKKFLNERKKG----PLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAIL 161
           R+ A R R        + G     LP+S+DWR+     + PV++QG+CGSCWAF+  + +
Sbjct: 175 RIPASRRRGTAVGERYRHGGGAEELPESVDWREKGA--VAPVKNQGQCGSCWAFSAVSSV 232

Query: 162 ESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEY-VKQYGLESQADYPYRNK 218
           ES   ++   +  LS+ +LVEC  D GN  CNGG +D AF++ +K  G++++ DYPY+  
Sbjct: 233 ESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDYPYK-- 290

Query: 219 ENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--NHRLIESYDGNP 273
             +  +C   +E AKV   D +     +    +   +   P+ V +    R  + Y    
Sbjct: 291 -AVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYKAGV 349

Query: 274 IRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----C 329
                  C  + LDH V  VGYG +NG   WIVRNSWG    + GY ++ER  NA    C
Sbjct: 350 F---TGTCTTN-LDHGVVAVGYGTENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKC 405

Query: 330 GIESYA 335
           GI   A
Sbjct: 406 GIAMMA 411


>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
          Length = 312

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 84/220 (38%), Positives = 115/220 (52%), Gaps = 18/220 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LPK++DWR  K   + PV+ QG+CGSCWAF+ T  LE Q  L    L  LS+  L++C  
Sbjct: 96  LPKTVDWR--KKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKSGKLVSLSEQNLIDCSG 153

Query: 185 -HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKV----FVQD 238
             GN  C GG +D AF+Y+K   G++++  YPY   E +   C ++KE        FV D
Sbjct: 154 SFGNEGCGGGLMDNAFKYIKANDGIDTEESYPY---EAMDGDCRFKKEDVGATDTGFV-D 209

Query: 239 TWVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
               S  D    +   GPI V ++  H   + Y        +  C+  +LDH V  VGYG
Sbjct: 210 IQQGSEDDLQKAVATVGPISVAIDASHSSFQLYSEGVYDEPN--CSSEELDHGVLAVGYG 267

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            KNG   W+V+NSW +   D+GY  + R   N CGI S A
Sbjct: 268 VKNGKKYWLVKNSWAETWGDNGYILMSRDKDNQCGIASSA 307


>gi|167427531|gb|ABZ80402.1| cathepsin L6, partial [Fasciola hepatica]
          Length = 306

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 81/222 (36%), Positives = 122/222 (54%), Gaps = 12/222 (5%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P S+DWRQ     +  V+ QG CGSCWAF+TT  +E Q     +T    S+ QLV+C  
Sbjct: 88  VPASIDWRQ--YGYVTEVKDQGGCGSCWAFSTTGAIEGQYVKKFQTRVSFSEQQLVDCST 145

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT-WVT 242
             GN  C GG +  A+EY+K+ GLE ++ YPY+  E    +C Y+ + A   V ++  V 
Sbjct: 146 IPGNHGCRGGGMRRAYEYLKKNGLEPESSYPYKAVEG---QCQYKSDLALAKVTNSQLVR 202

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
           SG +  +  L    GP  V ++ +   S   + I ++   C+  +++HAV  VGYG + G
Sbjct: 203 SGNETQLKNLIGAEGPASVAVDVKPDFSMYRSGIYQSQ-TCSSRRMNHAVLAVGYGTEGG 261

Query: 301 ILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASVK 341
           +  WIV+NSWG    + GY ++ R   N CGI S   L +V+
Sbjct: 262 MDYWIVKNSWGPRWGEAGYIRMARNRNNMCGIASAGSLPTVE 303


>gi|259016196|sp|P56202.2|CATW_HUMAN RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
           Precursor
          Length = 376

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 87/344 (25%), Positives = 154/344 (44%), Gaps = 40/344 (11%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGT 80
           +DL    ++  +AFK + +++NR+Y    E   R + F  +  +             +G 
Sbjct: 29  QDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGV 88

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           +  SD + +E  Q  G R        +       ++  +E  +  +P S DWR+     +
Sbjct: 89  TPFSDLTEEEFGQLYGYRRAAGGVPSMG------REIRSEEPEESVPFSCDWRKV-ASAI 141

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAF 199
           +P++ Q  C  CWA A    +E+   +       +S  +L++C      C+GG + D   
Sbjct: 142 SPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFI 201

Query: 200 EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPI 257
             +   GL S+ DYP++ K     RC  +K +   ++QD  +    +H +  +L   GPI
Sbjct: 202 TVLNNSGLASEKDYPFQGKVR-AHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPI 260

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE---KNGILT----------- 303
            V +N + ++ Y    I+     C+P  +DH+V +VG+G    + GI             
Sbjct: 261 TVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQP 320

Query: 304 ------WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
                 WI++NSWG    + GYF++ RG+N CGI  +   A V+
Sbjct: 321 PHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLTARVQ 364


>gi|288804650|ref|YP_003429335.1| cathepsin [Pieris rapae granulovirus]
 gi|270161225|gb|ACZ63497.1| cathepsin [Pieris rapae granulovirus]
          Length = 339

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 90/304 (29%), Positives = 153/304 (50%), Gaps = 21/304 (6%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQEIL 92
           + F+ +I K+N++Y  D E   ++E FK + K  ++         +  +  SD +  ++L
Sbjct: 34  NIFEDFIKKYNKSYATDQERAIKYENFKNNLKMINDKNNGSKYAVFDINAFSDLNKNDLL 93

Query: 93  QRT-GLRLTGKEKERLEAD--RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           +RT G R+  K+      D  +E   + +    +  LP+S DWR      + PV++Q  C
Sbjct: 94  RRTTGFRMGLKKNSYYTPDVSKECNVQVIKSEPQIILPESFDWRDKHG--VTPVKNQLEC 151

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV-KQYGLE 208
           GSCWAF+  A +ES   +       LS+  L+ CD  N  C GG +  A E + +Q G+ 
Sbjct: 152 GSCWAFSAIANIESLYNIKHNKELDLSEQHLINCDSINNGCGGGLMHWALETILQQGGIV 211

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLL-QSGPIGVYLNHRLIE 267
           S+ D PY   + +   C  ++    +     +V    + +  LL  +GPI + ++  +I+
Sbjct: 212 SEKDEPYYGLDAV---CKPKQFNVSISGCTRYVLKNENKLRELLIANGPISMAVD--IID 266

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
             D       D   N + L+HAV +VGYG  N I  WI++NSWG+   + GY +++R  N
Sbjct: 267 VIDYKE-GITDICENMNGLNHAVLLVGYGVHNNIPYWIMKNSWGEEWGEKGYLRVQRNIN 325

Query: 328 ACGI 331
           +CG+
Sbjct: 326 SCGL 329


>gi|18407961|ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
 gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol protease aleurain-like; Flags: Precursor
 gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70 [Arabidopsis thaliana]
 gi|332644500|gb|AEE78021.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
          Length = 358

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 88/306 (28%), Positives = 145/306 (47%), Gaps = 19/306 (6%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL 97
           + V +F  +  ++ + Y    E+K RF  FK++    D    T+         + Q   L
Sbjct: 54  RHVLSFSRFTHRYGKKYQSVEEMKLRFSVFKEN---LDLIRSTNKKGLSYKLSLNQFADL 110

Query: 98  RLTGKEKERLEADRERVKKFLNERK--KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
                ++ +L A +          K  +  +P + DWR+  +  ++PV+ QG CGSCW F
Sbjct: 111 TWQEFQRYKLGAAQNCSATLKGSHKITEATVPDTKDWREDGI--VSPVKEQGHCGSCWTF 168

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQAD 212
           +TT  LE+           LS+ QLV+C     N  C+GG    AFEY+K   GL+++  
Sbjct: 169 STTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEA 228

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGVYLNHRLIE 267
           YPY  K+     C +  +   V V+D+  +T G +    H + L++   +   + H    
Sbjct: 229 YPYTGKDG---GCKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEF-R 284

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            Y       N     P  ++HAV  VGYG ++ +  W+++NSWG    D+GYF++E G N
Sbjct: 285 FYKKGVFTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKN 344

Query: 328 ACGIES 333
            CG+ +
Sbjct: 345 MCGVAT 350


>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
 gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 103/341 (30%), Positives = 161/341 (47%), Gaps = 38/341 (11%)

Query: 16  VTYNVNTDSAIYVWRDLAYDSI-KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKET 74
           +TY    D +I  +      S+ K ++ F++++ K ++ Y    E   RFE F  + K  
Sbjct: 19  ITYATAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKAYRSIEEKLHRFEIFLDNLKHI 78

Query: 75  DE--------YYGTSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGP 125
           DE        + G +  +D S +E   +  GLR+        E  R+R  +  +      
Sbjct: 79  DETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRV--------EFPRKRSSRGFSYGDVED 130

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP+S+DWR      + PV++QG CGSCWAF+T A +E    ++   L  LS+ +L++CD 
Sbjct: 131 LPESVDWRTKGA--VTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDR 188

Query: 186 G-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
             N  C GG +D AF+Y+    GL  + DYPY  +E    RC  EKE+ +V     +   
Sbjct: 189 SFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEG---RCIREKEQFEVVTISGYEDV 245

Query: 244 GVDHMMHLLQS---GPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
             +    LL++    P+ V +  + R  + Y G         C   ++DH V  VGYG  
Sbjct: 246 PANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGR---CGT-QMDHGVTAVGYGSS 301

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGA----NACGIESYA 335
            G    IV+NSWG    ++GY +++R        CGI   A
Sbjct: 302 EGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQMA 342


>gi|442736236|gb|AGC65593.1| cathepsin [Achaea janata granulovirus]
          Length = 338

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 93/318 (29%), Positives = 152/318 (47%), Gaps = 35/318 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK--------QDGKETDEYYGTSGSSDRSPQEILQ- 93
           F  +++K+++ Y  + E   +FE FK        ++ K+ +  +  +  +DRS  E+L+ 
Sbjct: 34  FDLFMIKYHKVYRSELERAAKFEVFKRNLATLNDKNDKDENATFDINAYTDRSRNELLRT 93

Query: 94  RTGLRLTGKEKERLEADRER--VKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
           +TG +            ++   + + +       LP+S DWR   V  + PV+ Q  CGS
Sbjct: 94  QTGFQSNFARNASPFTQKKGMCITRVVAGTPPCLLPESFDWRDKNV--VTPVKDQLECGS 151

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAF-EYVKQYGLESQ 210
           CWAF   A  ESQ A+        S+  L++CD  N  C+GG +  AF E ++  G+  +
Sbjct: 152 CWAFTAIANFESQYAIKHGKHVDFSEQHLLDCDQLNYGCDGGLMHWAFEEIIRMGGVVLE 211

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--------LLQSGPIGVYLN 262
            DYPY   E+           A      T ++  V + +         L+ +GPI V L+
Sbjct: 212 YDYPYTGVESFC---------ANNVNMYTTISGCVQYDLRDEEKLRELLVTNGPIAVALD 262

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
              I  Y    +    +    + L+HAV +VGYG    I  W+++NSWG    + GYF+I
Sbjct: 263 IVDIVDYKSGVV---SFCGTNNGLNHAVLLVGYGVDKTIEYWLLKNSWGTDWGEEGYFRI 319

Query: 323 ERGANACGIESYAYLASV 340
           +R  N+CGI + +Y ASV
Sbjct: 320 KRNRNSCGILN-SYAASV 336


>gi|339244637|ref|XP_003378244.1| cathepsin F [Trichinella spiralis]
 gi|316972865|gb|EFV56511.1| cathepsin F [Trichinella spiralis]
          Length = 317

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 124/221 (56%), Gaps = 19/221 (8%)

Query: 127 PKSLDWRQ----SKVK--VLNPVESQ--GRCGSCWAFATTAILESQVALLKKTLYPLSKS 178
           P   DWR     +KVK  V + ++ +  G+CGS WAF+T A +ES  A+    L  LS+ 
Sbjct: 53  PNKFDWRNYNVVTKVKRQVWHKMQKKFLGKCGSSWAFSTIANIESAWAIKFGDLISLSEQ 112

Query: 179 QLVECDHGNLNCNGGNIDVAF-EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           Q+++CD  N  C GG    A+ E ++  G+++++DYPY     +   C   KEK KV++ 
Sbjct: 113 QIIDCDKINRGCRGGQPLKAYHEIIRMSGVQAESDYPY---TGLHGSCKLNKEKIKVYIN 169

Query: 238 DTWVTSGVDHMM--HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
           DT +    +  +  +L + GP+ V +N  ++  Y    I+    +CNP+ L+H   I+GY
Sbjct: 170 DTVLLHKNETTIANYLYEHGPVAVRMNADILMLYRKGIIKPTKSSCNPNFLNHGATIIGY 229

Query: 296 GEKNGI-----LTWIVRNSWGDIGPDHGYFQIERGANACGI 331
           G+++ +       WI++NSWG    ++GYF++ RG  ACG+
Sbjct: 230 GKESWLHWWSNPYWIIKNSWGVDWGENGYFRLYRGNEACGV 270


>gi|121531598|gb|ABM55484.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
          Length = 326

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 105/329 (31%), Positives = 162/329 (49%), Gaps = 38/329 (11%)

Query: 25  AIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQ------------DGK 72
           AI+    +A  +    D +  +     +TY +  E KTRF  F++            D  
Sbjct: 5   AIFATVLIAVTASTNEDQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIKEHNARYDKG 64

Query: 73  ETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDW 132
           E     G +  +D + +E   +  L+   K K RL A      + L       +P S+DW
Sbjct: 65  EETYLLGVTRFADLTHEEF--KDILKGQIKNKPRLNATPTVFPEDLE------VPDSIDW 116

Query: 133 RQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNC 190
            + K  VL  V+ Q  CGSCWAF+ T  L+ Q A+L      LS+ QL++C   +GN NC
Sbjct: 117 TE-KGAVLE-VKDQNPCGSCWAFSATGALKGQNAILNNVKISLSEQQLLDCSAAYGNGNC 174

Query: 191 N-GGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM 248
             GG++  AF+YV+ YG++S+  YPY  K+     C Y+  K  + ++    VT+  + +
Sbjct: 175 KEGGDMSAAFDYVRDYGIQSEKSYPYIRKQT---ECQYDASKTILKIKGYKNVTTSEEGL 231

Query: 249 MHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG----ILT 303
              + + GPI + +N   ++ Y    I      C+ H LDH V +VGYG+ +        
Sbjct: 232 RKAVGTIGPISIAMNSDPLQLYYSGTISGK--GCS-HDLDHGVLVVGYGKASQWSGETKF 288

Query: 304 WIVRNSWGDIGPDHGYFQIERGA-NACGI 331
           W V+NSWG I  ++GYF+I+R A N CGI
Sbjct: 289 WRVKNSWGKIWGENGYFRIKRDANNLCGI 317


>gi|516865|emb|CAA52403.1| putative thiol protease [Arabidopsis thaliana]
          Length = 313

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 94/312 (30%), Positives = 152/312 (48%), Gaps = 41/312 (13%)

Query: 49  KWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTSGSSDRSPQEILQRTGLRLT 100
           K+ + Y    E   RF  FK +          +    +G +  SD +  E  +R  L + 
Sbjct: 6   KFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSE-FRRKHLGVK 64

Query: 101 GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
           G  K   +A++  +    N      LP+  DWR      + PV++QG CGSCW+F+TT  
Sbjct: 65  GGFKLPKDANQAPILPTQN------LPEEFDWRDRGA--VTPVKNQGSCGSCWSFSTTGA 116

Query: 161 LESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQYGLESQ 210
           LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFEY +K  GL  +
Sbjct: 117 LEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMRE 176

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIES 268
            DYPY   +  +  C  ++ K    V +  V S  +  +  +L+++GP+ V +N   +++
Sbjct: 177 KDYPYTGTDGGS--CKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQT 234

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGPDHGYFQ 321
           Y G       + C+  +L+H V +VGYG              WI++NSWG+   ++G+++
Sbjct: 235 YIGG--VSCPYICS-RRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYK 291

Query: 322 IERGANACGIES 333
           I +G N CG++S
Sbjct: 292 ICKGRNICGVDS 303


>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
 gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
          Length = 475

 Score =  127 bits (319), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 103/336 (30%), Positives = 160/336 (47%), Gaps = 37/336 (11%)

Query: 18  YNVNTDSAIYVWRDLAYDSIKQV-DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE 76
           Y + ++ +I  +    + S +QV + F+ +  +  + Y    E   R E FK        
Sbjct: 25  YGIPSEYSILAFDLNKFPSEEQVVELFQQWKKEHQKFYIHPEEAALRLENFK-------- 76

Query: 77  YYGTSGSSDRSPQEILQRTGLRLT------GKEKERLEADRERVKKFLNERKK-GPLPKS 129
                    R+ + I++R  +R +      G  +    ++ E   KF+++ +     P S
Sbjct: 77  ---------RNLKYIVERNAMRNSPVGHHLGLNRFADMSNEEFKNKFISKVESCDDAPYS 127

Query: 130 LDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLN 189
           LDWR  K  V+  V+ QG CGSCW+F++T  +E   A++   L  LS+ +LV+CD  N  
Sbjct: 128 LDWR--KKGVVTGVKDQGNCGSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTTNDG 185

Query: 190 CNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--VTSGVD 246
           C GG +D AFE+V    G++++ADYPY     +   C   KE+ KV   D +  VT    
Sbjct: 186 CEGGYMDYAFEWVINNGGIDTEADYPYI---GVGGTCNVTKEETKVVTIDGYTDVTQSDS 242

Query: 247 HMMHLLQSGPIGVYLNHRLIES--YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTW 304
            +       PI V ++   ++   Y G  I   D + NP  +DHAV IVGYG       W
Sbjct: 243 ALFCATVKQPISVGIDGSTLDFQLYTGG-IYDGDCSSNPDDIDHAVLIVGYGSDGNQDYW 301

Query: 305 IVRNSWGDIGPDHGYFQIERGANA-CGIESYAYLAS 339
           IV+NSWG      G+  I R  N   G+ +  Y+AS
Sbjct: 302 IVKNSWGTSWGIEGFIYIRRNTNLKYGVCAINYMAS 337


>gi|6649593|gb|AAF21470.1|U85983_1 cysteine proteinase [Clonorchis sinensis]
          Length = 259

 Score =  127 bits (319), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 79/217 (36%), Positives = 111/217 (51%), Gaps = 10/217 (4%)

Query: 128 KSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGN 187
           +  DWR+     + PV  QG+CGSCWAF+    +  Q       L  LS+ QLV+CD+ +
Sbjct: 50  EKFDWREHGA--VGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDYLD 107

Query: 188 LNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD 246
             C+GG     +  + K  GLE  +DYPY     I   C  +K K   +V  + +    +
Sbjct: 108 DGCDGGYPPQTYTAIQKMGGLELASDYPYTGVGGI---CHMDKSKFVAYVNGSTILPLSE 164

Query: 247 HMM--HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTW 304
            +    L   GP+   LN   ++ Y G  I R  W C+P  ++HAV  VGYG +NG   W
Sbjct: 165 KVQAQKLRAIGPLSSALNADTLQLYKGG-IMRPKW-CDPAGVNHAVLTVGYGVQNGKPYW 222

Query: 305 IVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
           IV+NSWG+   + GYF+I RG   CGI S    A +K
Sbjct: 223 IVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 259


>gi|66377984|gb|AAY45869.1| cathepsin L-like cysteine proteinase [Globodera pallida]
          Length = 379

 Score =  127 bits (319), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 87/244 (35%), Positives = 128/244 (52%), Gaps = 22/244 (9%)

Query: 106 RLEAD--RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILES 163
           RL  D  R     FL     G LP+S+DWR      +  V++QG CGSCWAF++T  LE+
Sbjct: 139 RLLGDNLRRNASTFLAPINIGDLPESVDWRDKGW--VTEVKNQGMCGSCWAFSSTGALEA 196

Query: 164 QVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKEN 220
           Q A     L  LS+  L++C   +GN+ CNGG +D AF+Y+K   G++ + DYPY+ K  
Sbjct: 197 QHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNNGVDKELDYPYKAKTG 256

Query: 221 ITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQS--GPIGVYLN--HRLIESYDGNP 273
              +C +++    V   DT    +  G +  + +  +  GP  V ++  HR  + Y    
Sbjct: 257 K--KCLFKRN--DVGATDTGFFDIAEGDEEKLKIAVATQGPASVAIDAGHRSFQLYTHGV 312

Query: 274 IRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGI 331
               +  C+P  LDH V +VGYG +      WIV+NSWG    + GY ++ R   N CGI
Sbjct: 313 YFEKE--CSPENLDHGVLVVGYGTDAQQGDYWIVKNSWGAHWGEQGYIRMARNRKNNCGI 370

Query: 332 ESYA 335
            S+A
Sbjct: 371 ASHA 374


>gi|351694995|gb|EHA97913.1| Cathepsin L1 [Heterocephalus glaber]
          Length = 278

 Score =  127 bits (319), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 89/229 (38%), Positives = 117/229 (51%), Gaps = 23/229 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LPKS+DWR  K   + PV++QG+CGSCWAF+ T  LE Q+      L  LS+  LV+C  
Sbjct: 59  LPKSVDWR--KKGYVTPVKNQGQCGSCWAFSATGSLEGQMFRKTGQLVSLSEQNLVDCSQ 116

Query: 185 -HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKV----FVQD 238
             GN  CNGG +D AFEYVK+  GLES+  YPY  K+     C Y+ E +      FV  
Sbjct: 117 PQGNQGCNGGLMDFAFEYVKENKGLESEKSYPYEGKDG---SCRYKPELSAANDTGFVDI 173

Query: 239 TWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-- 296
                 +  M  + + GPI V ++  L+           D  C+   L+H V +VGYG  
Sbjct: 174 PQREKAL--MKAVAEKGPISVAVDAGLMSFQFYKDGIYFDPECSSKDLNHGVLVVGYGYE 231

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
               EKN    W+V+NSWG      GY +I R   N CGI + A   S 
Sbjct: 232 EVDTEKNEY--WLVKNSWGPEWGAEGYIKIARNRNNHCGIATAASYPST 278


>gi|41152540|gb|AAR99519.1| cathepsin L protein [Fasciola hepatica]
          Length = 239

 Score =  127 bits (319), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 124/224 (55%), Gaps = 18/224 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 21  VPDKIDWRESGY--VTGVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 78

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN  C+GG ++ A++Y+KQ+GLE+++ YPY   E    +C Y ++     V   + V 
Sbjct: 79  PWGNNGCSGGLMENAYQYLKQFGLETESSYPYTAVEG---QCRYNRQLGVAKVTGYYTVH 135

Query: 243 SGVD-HMMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVAIVGYGE 297
           SG +  + +L+ S GP  + ++   +ES D    R   +    C P  L+HAV  VGYG 
Sbjct: 136 SGSEVELKNLVGSEGPAAIAVD---VES-DFMMYRSGIYQSQTCLPFALNHAVLAVGYGT 191

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 192 QGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIASLASLPMV 235


>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
          Length = 465

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 85/246 (34%), Positives = 126/246 (51%), Gaps = 25/246 (10%)

Query: 106 RLEADRERVKKFLNERKKG----PLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAIL 161
           R+ A R R        + G     LP+S+DWR+     + PV++QG+CGSCWAF+  + +
Sbjct: 118 RIPASRRRGTAVGERYRHGGGAEELPESVDWREKGA--VAPVKNQGQCGSCWAFSAVSSV 175

Query: 162 ESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEY-VKQYGLESQADYPYRNK 218
           ES   ++   +  LS+ +LVEC  D GN  CNGG +D AF++ +K  G++++ DYPY+  
Sbjct: 176 ESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDYPYK-- 233

Query: 219 ENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--NHRLIESYDGNP 273
             +  +C   +E AKV   D +     +    +   +   P+ V +    R  + Y    
Sbjct: 234 -AVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYKAGV 292

Query: 274 IRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----C 329
                  C  + LDH V  VGYG +NG   WIVRNSWG    + GY ++ER  NA    C
Sbjct: 293 FTGT---CTTN-LDHGVVAVGYGTENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKC 348

Query: 330 GIESYA 335
           GI   A
Sbjct: 349 GIAMMA 354


>gi|66378018|gb|AAY45870.1| cathepsin L-like cysteine proteinase [Rotylenchulus reniformis]
          Length = 369

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 88/255 (34%), Positives = 131/255 (51%), Gaps = 21/255 (8%)

Query: 98  RLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFAT 157
           +L G  +   +  R     FL     G +P+S+DWR  +   +  V++QG+CGSCWAF+ 
Sbjct: 124 KLNGYRRALGDNLRRNASTFLAPMNIGDIPESVDWRDKQW--VTEVKNQGQCGSCWAFSA 181

Query: 158 TAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADYP 214
           T  LE Q A     L  LS+  LV+C   +GN+ CNGG +D AF+Y+K   G++ +  YP
Sbjct: 182 TGALEGQHARKTGQLVSLSEQNLVDCTKKYGNMGCNGGLMDNAFQYIKDNEGIDKEMTYP 241

Query: 215 YRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQS--GPIGVYLN--HRLIE 267
           Y+ K     RC +++    V   DT    V  G +  + L  +  GP+ V ++  HR  +
Sbjct: 242 YKAKAG---RCHFKRN--DVGATDTGFFDVAEGDEDKLKLAVATQGPVSVAIDAGHRSFQ 296

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
            Y        +  CNP +LDH V +VGYG +      WIV+NSW     + GY ++    
Sbjct: 297 LYKHGVYFEEE--CNPEELDHGVLVVGYGTDPEHGDYWIVKNSWSTHWGEQGYIRMAPNR 354

Query: 327 -NACGIESYAYLASV 340
            N CGI S+A   +V
Sbjct: 355 NNNCGIPSHASYPTV 369


>gi|294874404|ref|XP_002766939.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
 gi|239868314|gb|EEQ99656.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
          Length = 339

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 95/311 (30%), Positives = 145/311 (46%), Gaps = 15/311 (4%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTG 101
           AF  +  K+ + Y    E   R   F+ +    ++    + S      E    T      
Sbjct: 27  AFTDFQHKFGKKYESKEEEMKRNAIFQANLHHIEQVNAQNLSYTLGVNEYADLTHEEFVA 86

Query: 102 KEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAIL 161
           ++   L+ D  R  KF  E     LP  +DWR +   VL P+++QG CGSCWAF+TT  L
Sbjct: 87  QKVGILKMDARRDVKFDVEANATELPTDVDWRHATGDVLTPIKNQGACGSCWAFSTTGTL 146

Query: 162 ESQVALLKKTLYPLSKSQLVECDHGNLNCNGGN--IDVAFEYVKQYGLESQADYPYRNKE 219
           ES  A+    L  LS  QLV+C  G          +  AF+YVK  G++ ++ YPY   +
Sbjct: 147 ESLYAIGTGQLRSLSAQQLVDCSRGYGTGGCAGGWMYQAFDYVKDKGIDLESTYPYEGSD 206

Query: 220 NITFRCTYEKE----KAKVFVQDTWVTSGVDHMMHLLQSGPIGV--YLNHRLIESYDGNP 273
           N T + + EK     KA V    + +      +M  +   P+ V  Y +    + Y G  
Sbjct: 207 N-TCQNSLEKRSDGIKAGVVTGWSQLERTEQALMTKIVKSPVSVALYASDHDFQFYSGGV 265

Query: 274 IRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA---CG 330
              ++  CN H++DHAV ++GYG   G   +I RNSWG      GYF ++RG ++   C 
Sbjct: 266 YSSDN--CN-HQIDHAVVMIGYGSVFGRDYFIGRNSWGTSWGIAGYFYLKRGVSSYGQCN 322

Query: 331 IESYAYLASVK 341
           +  Y Y+ ++K
Sbjct: 323 VLEYMYVPTIK 333


>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
 gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
          Length = 462

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 85/246 (34%), Positives = 126/246 (51%), Gaps = 25/246 (10%)

Query: 106 RLEADRERVKKFLNERKKG----PLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAIL 161
           R+ A R R        + G     LP+S+DWR+     + PV++QG+CGSCWAF+  + +
Sbjct: 115 RIPAARRRGTAVGERYRHGGGAEELPESVDWREKGA--VAPVKNQGQCGSCWAFSAVSSV 172

Query: 162 ESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEY-VKQYGLESQADYPYRNK 218
           ES   ++   +  LS+ +LVEC  D GN  CNGG +D AF++ +K  G++++ DYPY+  
Sbjct: 173 ESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDYPYK-- 230

Query: 219 ENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--NHRLIESYDGNP 273
             +  +C   +E AKV   D +     +    +   +   P+ V +    R  + Y    
Sbjct: 231 -AVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYKAGV 289

Query: 274 IRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----C 329
                  C  + LDH V  VGYG +NG   WIVRNSWG    + GY ++ER  NA    C
Sbjct: 290 F---SGTCTTN-LDHGVVAVGYGTENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKC 345

Query: 330 GIESYA 335
           GI   A
Sbjct: 346 GIAMMA 351


>gi|23110964|ref|NP_001326.2| cathepsin W preproprotein [Homo sapiens]
 gi|29476894|gb|AAH48255.1| Cathepsin W [Homo sapiens]
 gi|119594870|gb|EAW74464.1| cathepsin W (lymphopain), isoform CRA_b [Homo sapiens]
          Length = 376

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 87/344 (25%), Positives = 154/344 (44%), Gaps = 40/344 (11%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGT 80
           +DL    ++  +AFK + +++NR+Y    E   R + F  +  +             +G 
Sbjct: 29  QDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGV 88

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           +  SD + +E  Q  G R        +       ++  +E  +  +P S DWR+     +
Sbjct: 89  TPFSDLTEEEFGQLYGYRRAAGGVPSMG------REIRSEEPEESVPFSCDWRKV-AGAI 141

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAF 199
           +P++ Q  C  CWA A    +E+   +       +S  +L++C      C+GG + D   
Sbjct: 142 SPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFI 201

Query: 200 EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPI 257
             +   GL S+ DYP++ K     RC  +K +   ++QD  +    +H +  +L   GPI
Sbjct: 202 TVLNNSGLASEKDYPFQGKVR-AHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPI 260

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE---KNGILT----------- 303
            V +N + ++ Y    I+     C+P  +DH+V +VG+G    + GI             
Sbjct: 261 TVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQP 320

Query: 304 ------WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
                 WI++NSWG    + GYF++ RG+N CGI  +   A V+
Sbjct: 321 PHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLTARVQ 364


>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 84/220 (38%), Positives = 120/220 (54%), Gaps = 20/220 (9%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-- 184
           P S+DWR      + PV+ QG+CGSCWAF+TT  LE Q       L  LS+  LV+C   
Sbjct: 109 PDSVDWRNEGY--VTPVKDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTA 166

Query: 185 HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--- 240
           +GN  CNGG +D AF Y+K+  G++S+A YPY  K+    +C + K    V   DT    
Sbjct: 167 YGNNGCNGGLMDNAFTYIKENNGIDSEASYPYTAKDG---KCAFTK--PNVAATDTGFVD 221

Query: 241 VTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           + SG ++ +   +   GPI V ++  H   + Y       N+  C+  +LDH V +VGYG
Sbjct: 222 IPSGDENKLKEAVASVGPISVAIDASHFSFQFYRKGVY--NERKCSSTELDHGVLVVGYG 279

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            ++G   W+V+NSW     D GY ++ R A N CGI + A
Sbjct: 280 TESGKDYWLVKNSWNTSWGDKGYIKMSRNAKNQCGIATNA 319


>gi|313213098|emb|CBY36961.1| unnamed protein product [Oikopleura dioica]
          Length = 326

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 81/237 (34%), Positives = 125/237 (52%), Gaps = 17/237 (7%)

Query: 115 KKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYP 174
           K++L +  + P+P S DWR  K   ++PV+ QG+CGSCW F+TT  +E+  A+     + 
Sbjct: 94  KEYLAKGVEQPMPTSWDWR--KDNKVSPVKDQGQCGSCWTFSTTGNVEAGEAIHLNEYHT 151

Query: 175 LSKSQLVECDHG--NLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEK 231
           LS+ QLV+C     N  CNGG    AFEY+    G+ ++ADYPY  K+     C ++++K
Sbjct: 152 LSEQQLVDCAGAFNNHGCNGGLPSQAFEYIAAAPGIMTEADYPYTAKDG---NCVFDQKK 208

Query: 232 AKVFVQDTW-VTSG--VDHMMHLLQSGPIGVYLNHRLIES---YDGNPIRRNDWACNPHK 285
           A V V  +  +T G  V+    ++   PI +     +++    Y        D   +P  
Sbjct: 209 AAVHVYGSVNITRGDEVEMAEAMVMYQPISIAF--EVVDDFMHYKSGTYSSKDCKGSPTD 266

Query: 286 LDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
           ++HAV  VG+G +  G   W V+NSW     + GYF I+RG N CG+      A +K
Sbjct: 267 VNHAVLAVGFGTDGAGTDFWTVKNSWSKDWGNQGYFNIQRGVNMCGLSQCTSFALIK 323


>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
          Length = 441

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 99/310 (31%), Positives = 145/310 (46%), Gaps = 39/310 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLR---- 98
           ++ ++VK  +      E   RFE FK + +  DE+ G         + +  R GL     
Sbjct: 42  YEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNG---------KNLSYRLGLTKFAD 92

Query: 99  LTGKE------KERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           LT  E        RL+    +       R    +P+S+DWR  K   +  V+ QG CGSC
Sbjct: 93  LTNDEYRSMYLGSRLKRKATKSSLRYEVRVGDAIPESVDWR--KEGAVAEVKDQGSCGSC 150

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQ 210
           WAF+T   +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++    G++++
Sbjct: 151 WAFSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTE 210

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--NHRL 265
            DYPY+  +    RC   ++ AKV   D +     +  + +   L   PI V +    R 
Sbjct: 211 EDYPYKGVDG---RCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPISVAIEGGGRA 267

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER- 324
            + YD       D  C    LDH V  VGYG +NG   WIV+NSWG    + GY ++ER 
Sbjct: 268 FQLYDSGIF---DGICGT-DLDHGVVAVGYGTENGKDYWIVKNSWGTSWGESGYIRMERN 323

Query: 325 ---GANACGI 331
               A  CGI
Sbjct: 324 IASSAGKCGI 333


>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 485

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 99/310 (31%), Positives = 145/310 (46%), Gaps = 39/310 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLR---- 98
           ++ ++VK  +      E   RFE FK + +  DE+ G         + +  R GL     
Sbjct: 48  YEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNG---------KNLSYRLGLTKFAD 98

Query: 99  LTGKE------KERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           LT  E        RL+    +       R    +P+S+DWR  K   +  V+ QG CGSC
Sbjct: 99  LTNDEYRSMYLGSRLKRKATKSSLRYEVRVGDAIPESVDWR--KEGAVAEVKDQGSCGSC 156

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQ 210
           WAF+T   +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++    G++++
Sbjct: 157 WAFSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTE 216

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--NHRL 265
            DYPY+  +    RC   ++ AKV   D +     +  + +   L   PI V +    R 
Sbjct: 217 EDYPYKGVDG---RCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPISVAIEGGGRA 273

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER- 324
            + YD       D  C    LDH V  VGYG +NG   WIV+NSWG    + GY ++ER 
Sbjct: 274 FQLYDSGIF---DGICGT-DLDHGVVAVGYGTENGKDYWIVKNSWGTSWGESGYIRMERN 329

Query: 325 ---GANACGI 331
               A  CGI
Sbjct: 330 IASSAGKCGI 339


>gi|313229615|emb|CBY18430.1| unnamed protein product [Oikopleura dioica]
          Length = 326

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 81/237 (34%), Positives = 125/237 (52%), Gaps = 17/237 (7%)

Query: 115 KKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYP 174
           K++L +  + P+P S DWR  K   ++PV+ QG+CGSCW F+TT  +E+  A+     + 
Sbjct: 94  KEYLAKGVEQPMPTSWDWR--KDNKVSPVKDQGQCGSCWTFSTTGNVEAGEAIHLNEYHT 151

Query: 175 LSKSQLVECDHG--NLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEK 231
           LS+ QLV+C     N  CNGG    AFEY+    G+ ++ADYPY  K+     C ++++K
Sbjct: 152 LSEQQLVDCAGAFNNHGCNGGLPSQAFEYIAAAPGIMTEADYPYTAKDG---NCVFDQKK 208

Query: 232 AKVFVQDTW-VTSG--VDHMMHLLQSGPIGVYLNHRLIES---YDGNPIRRNDWACNPHK 285
           A V V  +  +T G  V+    ++   PI +     +++    Y        D   +P  
Sbjct: 209 AAVHVYGSVNITRGDEVEMAEAMVMYQPISIAF--EVVDDFMHYKSGTYSSKDCKGSPTD 266

Query: 286 LDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
           ++HAV  VG+G +  G   W V+NSW     + GYF I+RG N CG+      A +K
Sbjct: 267 VNHAVLAVGFGTDGAGTDFWTVKNSWSKDWGNQGYFNIQRGVNMCGLSQCTSFALIK 323


>gi|2499879|sp|Q40143.1|CYSP3_SOLLC RecName: Full=Cysteine proteinase 3; Flags: Precursor
 gi|1235545|emb|CAA88629.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
          Length = 356

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 93/303 (30%), Positives = 142/303 (46%), Gaps = 17/303 (5%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTG 101
           +F  + ++  + Y    EIK RFE F  + K    +     S      E    T L    
Sbjct: 56  SFARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGINEF---TDLTWDE 112

Query: 102 KEKERLEADR--ERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
             K +L A +      K   +     LP++ DWR  K  +++PV++QG+CGSCW F+TT 
Sbjct: 113 FRKHKLGASQNCSATTKGNLKLTNVVLPETKDWR--KDGIVSPVKAQGKCGSCWTFSTTG 170

Query: 160 ILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLESQADYPYR 216
            LE+  A        LS+ QLV+C     N  CNGG    AFEY+K   GL+++  YPY 
Sbjct: 171 ALEAAYAQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYT 230

Query: 217 NKENITFRCTYEKEKAKV-FVQDTWVTSGVDHMMH--LLQSGPIGVYLNH-RLIESYDGN 272
            K  I   C + +    V  +    +T G ++ +   +    P+ V     +  + Y   
Sbjct: 231 GKNGI---CKFSQANIGVKVISSVNITLGAEYELKYAVALVRPVSVAFEVVKGFKQYKSG 287

Query: 273 PIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIE 332
                +    P  ++HAV  VGYG +NG   W+++NSWG    + GYF++E G N CG+ 
Sbjct: 288 VYASTECGDTPMDVNHAVLAVGYGVENGTPYWLIKNSWGADWGEDGYFKMEMGKNMCGVA 347

Query: 333 SYA 335
           + A
Sbjct: 348 TCA 350


>gi|51969854|dbj|BAD43619.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 95/320 (29%), Positives = 154/320 (48%), Gaps = 41/320 (12%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTSGSSDRSPQEIL 92
           D F  +  K+ + Y    E   RF  FK +          +    +G +  SD +  E  
Sbjct: 46  DHFTLFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSE-F 104

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           +R  L + G  K   +A++  +    N      LP+  DWR      + PV++QG CGSC
Sbjct: 105 RRKHLGVKGGFKLPKDANQAPILPTQN------LPEEFDWRDRGA--VTPVKNQGSCGSC 156

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-V 202
           W+F+TT  LE    L    L  LS+ QLV+CDH          +  CNG  ++ AFEY +
Sbjct: 157 WSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGRLMNSAFEYTL 216

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVY 260
           K  GL  + DYPY   +  +  C  ++ K    V +  V S  +  +  +L+++GP+ V 
Sbjct: 217 KTGGLMREKDYPYTGTDGGS--CKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVA 274

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDI 313
           +N   +++Y G       + C+  +L+H V +VGYG              WI++NSWG+ 
Sbjct: 275 INAAYMQTYIGGV--SCPYICS-RRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGES 331

Query: 314 GPDHGYFQIERGANACGIES 333
             ++G+++I +G N CG++S
Sbjct: 332 WGENGFYKICKGRNICGVDS 351


>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
          Length = 358

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 92/305 (30%), Positives = 151/305 (49%), Gaps = 21/305 (6%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTG 101
           +F  +  ++ + Y    EIK RF+ F  D  E    +   G S +    + + + L    
Sbjct: 58  SFARFARRYGKRYDSVEEIKQRFDIF-LDNLEMINSHNDKGLSYK--LGVNEFSDLTWDE 114

Query: 102 KEKERLEADR--ERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTA 159
             ++RL A +      K   + +   LP++ DWR++ +  ++PV++QG+CGSCW F+TT 
Sbjct: 115 FRRDRLGAAQNCSATTKGNLKLRDAVLPETKDWREAGI--VSPVKNQGKCGSCWTFSTTG 172

Query: 160 ILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQYG-LESQADYPYR 216
            LE+           LS+ QLV+C     N  CNGG    AFEY+K  G LE++  YPY 
Sbjct: 173 ALEAAYTQKFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLETEEAYPYT 232

Query: 217 NKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGVYLNH-RLIESYD 270
            K  +   C +  +   V V D+  +T G +    + + L++  P+ V     +  + Y 
Sbjct: 233 GKNGL---CKFSSQNVGVKVTDSVNITLGAEDELKYAVALVR--PVSVAFEVVKGFKQYK 287

Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACG 330
                  +    P  ++HAV  VGYG + G+  W+++NSWG    D+ YF++E G + CG
Sbjct: 288 SGVYTSTECGTTPMDVNHAVLAVGYGVEYGVPFWLIKNSWGADWGDNAYFKMEMGNDMCG 347

Query: 331 IESYA 335
           I + A
Sbjct: 348 IATCA 352


>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
           [Tribolium castaneum]
 gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
          Length = 337

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 81/223 (36%), Positives = 123/223 (55%), Gaps = 23/223 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LPK +DWR  K+  + PV+ QG+CGSCW+F+TT  LE Q     K L  LS+  L++C  
Sbjct: 120 LPKQIDWR--KLGAVTPVKDQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCSE 177

Query: 185 -HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKA----KVFVQD 238
            +GN  CNGG +D AF Y+K   G++++  YPY+ ++    +C Y+        + FV  
Sbjct: 178 KYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYKAEDE---KCHYKPRNKGATDRGFVD- 233

Query: 239 TWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVG 294
             + SG +  +   +   GPI V ++  H   + Y        +  C+  +LDH V +VG
Sbjct: 234 --IESGDEEKLKAAVATVGPISVAIDASHPTFQQYSEGVYYEPE--CSSEQLDHGVLVVG 289

Query: 295 YG-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           YG +++G   W+V+NSWGD   D GY ++ R   N CGI + A
Sbjct: 290 YGTDEDGNDYWLVKNSWGDSWGDQGYIKMARNRDNNCGIATQA 332


>gi|41152538|gb|AAR99518.1| cathepsin L protein [Fasciola hepatica]
          Length = 326

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 18/230 (7%)

Query: 120 ERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQ 179
           E     +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ Q
Sbjct: 102 ETNNRAVPDKIDWRESGY--VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQ 159

Query: 180 LVECDH--GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           LV+C    GN  C+GG ++ A++Y+KQ+GLE+++ YPY   E    +C Y ++     V 
Sbjct: 160 LVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSYPYTAVEG---QCRYNEQLGVAKVT 216

Query: 238 DTW-VTSGVD-HMMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVA 291
             + V SG +  + +L+ S GP  V ++   +ES D    R   +    C+P  ++HAV 
Sbjct: 217 GYYTVHSGSEVELKNLVGSEGPAAVAVD---VES-DFMMYRSGIYQSQTCSPLSVNHAVL 272

Query: 292 IVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
            VGYG + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 273 AVGYGTQGGTDYWIVKNSWGLSWGERGYIRMVRNRGNMCGIASLASLPMV 322


>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
          Length = 330

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 85/246 (34%), Positives = 124/246 (50%), Gaps = 20/246 (8%)

Query: 106 RLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQV 165
           R+         F+     G LP ++DWR      + P+++QG+CGSCW+F+ T  LE Q 
Sbjct: 94  RMRNKTSNAPVFMPPNNMGDLPDTVDWRPKGY--VTPIKNQGQCGSCWSFSATGSLEGQT 151

Query: 166 ALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENIT 222
                 L  LS+  LV+C    GN  C GG +D AF Y+K   G++++A YPY+ ++   
Sbjct: 152 FKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDDAFTYIKANNGIDTEASYPYKARDG-- 209

Query: 223 FRCTYEKEKAKVFVQDT-WVTSGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIR 275
            +C  E + A V   DT +V         L Q+    GPI V ++  H   + Y      
Sbjct: 210 -KC--EFKSADVGATDTGFVDIKTKDEEALKQAVATVGPISVAIDASHMSFQLYRTGVY- 265

Query: 276 RNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESY 334
            +DW C+  KLDH V  VGYG ++    W+V+NSWG+     GY Q+ R   N CGI + 
Sbjct: 266 -HDWFCSQTKLDHGVLAVGYGTEDSKDYWLVKNSWGESWGQKGYIQMSRNRRNNCGIATS 324

Query: 335 AYLASV 340
           A   +V
Sbjct: 325 ASYPTV 330


>gi|414590229|tpg|DAA40800.1| TPA: putative cysteine protease family protein [Zea mays]
          Length = 381

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 97/330 (29%), Positives = 150/330 (45%), Gaps = 44/330 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFK--------QDGKETDEYYGTSGSSDRSPQEILQR 94
           F  ++ +  R Y+   E   R   F             +    +G +  SD + +E   R
Sbjct: 60  FAAFVRRHGRRYSGPKEYARRLRVFAANLARAAAHQALDPTARHGVTPFSDLTREEFEAR 119

Query: 95  -TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
            TGLR  G + +RL +          E +   LP S DWR      +  V++QG CGSCW
Sbjct: 120 LTGLR-AGGDVQRLMSGVPAAPPASKE-EVARLPASFDWRDKGA--VTGVKTQGACGSCW 175

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG---------NLNCNGGNIDVAFEYVKQ 204
           AF+TT  +E    L    L  LS+ QLV+CDH          N  C GG +  A+ Y+ +
Sbjct: 176 AFSTTGAVEGANFLATGELVDLSEQQLVDCDHTCSAVAQNECNNGCAGGLMTNAYSYLME 235

Query: 205 Y-GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGVY 260
             GL  Q+ YPY         C ++  +  V V + T V +G +  +   L++ GP+ V 
Sbjct: 236 SGGLMEQSAYPYTGAAG---PCRFDPTQVAVRVANFTAVPAGDEAQIRAALVRRGPLAVG 292

Query: 261 LNHRLIESYDGN---PIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSW 310
           LN   +++Y G    P+      C    ++H V +VGYG +            WI++NSW
Sbjct: 293 LNAAFMQTYVGGVSCPL-----ICPRAWVNHGVLLVGYGARGFAALRLGYRPYWIIKNSW 347

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASV 340
           G    + GY+++ RG+N CG++S     +V
Sbjct: 348 GKQWGEQGYYRLCRGSNVCGVDSMVSAVAV 377


>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 463

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 97/288 (33%), Positives = 141/288 (48%), Gaps = 35/288 (12%)

Query: 63  RFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERV 114
           RFE FK + +  DE+         G +  +D + +E      + L  K  +R+    +R 
Sbjct: 74  RFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEYRS---MYLGAKPTKRVLKTSDRY 130

Query: 115 KKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYP 174
           +     R    LP S+DWR  K   +  V+ QG CGSCWAF+T   +E    ++   L  
Sbjct: 131 QA----RVGDALPDSVDWR--KEGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLIS 184

Query: 175 LSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKA 232
           LS+ +LV+CD   N  CNGG +D AFE++ K  G++++ADYPY+  +    RC   ++ A
Sbjct: 185 LSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADG---RCDQNRKNA 241

Query: 233 KVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLD 287
           KV   D++     +    L   L   PI V +    R  + Y        D  C   +LD
Sbjct: 242 KVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVF---DGLCGT-ELD 297

Query: 288 HAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGI 331
           H V  VGYG +NG   WIVRNSWG+   + GY ++ R   A    CGI
Sbjct: 298 HGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNIEAPTGKCGI 345


>gi|46251290|gb|AAS84611.1| cathepsin L-like cysteine proteinase I variant form precursor
           [Heterodera glycines]
          Length = 374

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 84/253 (33%), Positives = 124/253 (49%), Gaps = 16/253 (6%)

Query: 98  RLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFAT 157
           +L G  +   ++ R     FL     G LP+S+DWR      +  V++QG CGSCWAF+ 
Sbjct: 128 KLNGFRRLMGDSLRRNASTFLAPMNVGDLPESVDWRDKGW--VTEVKNQGMCGSCWAFSA 185

Query: 158 TAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYP 214
           T  LE Q    K  L  LS+  L++C   +GN+ CNGG +D AF+Y+K   G++ +  YP
Sbjct: 186 TGALEGQHVRDKGHLVSLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNKGIDKETAYP 245

Query: 215 YRNKENITFRCTYEKEKAKVF---VQDTWVTSGVDHMMHLLQSGPIGVYLN--HRLIESY 269
           Y+ K     +C +++           D       D  M +   GP+ V ++  HR  + Y
Sbjct: 246 YKAKTGK--KCLFKRNDVGATDSGYNDIAEGDEEDLRMAVATQGPVSVAIDAGHRSFQLY 303

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-N 327
                   +  C+P  LDH V + GYG +      WIV+NSWG    + GY ++ R   N
Sbjct: 304 TNGVYFEKE--CDPQNLDHGVLVEGYGTDPTQGDYWIVKNSWGTRWGEQGYIRMARNRNN 361

Query: 328 ACGIESYAYLASV 340
            CGI S+A    V
Sbjct: 362 NCGIASHASFPLV 374


>gi|403223173|dbj|BAM41304.1| cysteine protease precursor TacP [Theileria orientalis strain
           Shintoku]
          Length = 463

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 103/342 (30%), Positives = 160/342 (46%), Gaps = 44/342 (12%)

Query: 24  SAIYVWRDLAYDSIKQVDA---FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYG- 79
           S++Y  R+++YD +K+ +A   F+ +   +N+ +  D+E + RF  F+ +  ET  + G 
Sbjct: 123 SSLYERREISYDHVKEFEALRSFEKFKADYNKVHATDDERRERFLVFRNNYLETLTHKGH 182

Query: 80  ------TSGSSDRSPQEILQRTGLRLTGKEK------ERLEADRERVKKFLNERK----- 122
                  +  SD + +E+ +        KE       ERL + R     FL +       
Sbjct: 183 ETFTKSVNFFSDLTEEELNRLFPKIEVPKESSPSEHLERLMSSRSTDPNFLAKLALAKGF 242

Query: 123 -------KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPL 175
                   G   +S+DWR  K   +  V+ QG CGSCWAFA+   +ES   +    +  L
Sbjct: 243 QSPVKSLDGISGESIDWR--KANGVTKVKDQGMCGSCWAFASVGSVESLYKIHTDKVLDL 300

Query: 176 SKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVF 235
           S+ +LV C+  +  C GG  D A EYVK  G+ S AD PY   +      T++    KVF
Sbjct: 301 SEQELVNCETKSHGCEGGFGDTALEYVKNKGISSSADVPYHAMDQTCDIKTHD----KVF 356

Query: 236 VQDTWVTSGVDHMMHLLQSGPIGVYLNHRL-IESYDGNPIRRNDWACNPHKLDHAVAIV- 293
           +    VT G D M   L   P  VY+     +  Y        + AC   +L+HAV +V 
Sbjct: 357 INSFMVTKGKDVMNKSLVLSPTVVYIAASSELMMYKAGVF---NGAC-AKELNHAVLLVG 412

Query: 294 -GYGEKNGILTWIVRNSWGDIGPDHGYFQIER---GANACGI 331
            GY +  G   W+++NSWG    + GY ++ER   G + CG+
Sbjct: 413 EGYDDIVGKRYWVIKNSWGPHWGEDGYVRLERTDKGTDKCGV 454


>gi|291224868|ref|XP_002732424.1| PREDICTED: cathepsin L-like [Saccoglossus kowalevskii]
          Length = 823

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 78/202 (38%), Positives = 108/202 (53%), Gaps = 18/202 (8%)

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYV 202
           ++G+CGSCWAF+TT  LE Q       L  LS+ QLV+C    GN  CNGG +D+AFEY+
Sbjct: 624 AKGQCGSCWAFSTTGSLEGQTFKKTGKLPDLSEQQLVDCSTQFGNHGCNGGLMDLAFEYI 683

Query: 203 KQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS-----GP 256
           K   G+E + DYPY  K+    RC +++  +KV   DT          + L+      GP
Sbjct: 684 KAAPGIEGEMDYPYLAKDG---RCMFDQ--SKVVATDTGYVDIPSMDENALKEAVATIGP 738

Query: 257 IGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIG 314
           I V ++  H   + Y       N+  C+  +LDH V  VGYG ++G   W+V+NSWGD  
Sbjct: 739 ISVAIDAGHPSFQMYKSGVY--NEPGCSSERLDHGVLAVGYGTEDGQDYWLVKNSWGDSW 796

Query: 315 PDHGYFQIERGA-NACGIESYA 335
              GY  + R   N CGI + A
Sbjct: 797 GQAGYIMMSRNMNNQCGIATQA 818


>gi|2582045|gb|AAB82449.1| lymphopain [Homo sapiens]
 gi|2582181|gb|AAB82457.1| lymphopain [Homo sapiens]
 gi|3033547|gb|AAC32181.1| cathepsin W [Homo sapiens]
          Length = 376

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 87/344 (25%), Positives = 154/344 (44%), Gaps = 40/344 (11%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGT 80
           +DL    ++  +AFK + +++NR+Y    E   R + F  +  +             +G 
Sbjct: 29  QDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGV 88

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           +  SD + +E  Q  G R        +       ++  +E  +  +P S DWR+     +
Sbjct: 89  TPFSDLTEEEFGQLYGYRRAAGGVPSMG------REIRSEEPEESVPFSCDWRKV-AGAI 141

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAF 199
           +P++ Q  C  CWA A    +E+   +       +S  +L++C      C+GG + D   
Sbjct: 142 SPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVHELLDCGRCGDGCHGGFVWDAFI 201

Query: 200 EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPI 257
             +   GL S+ DYP++ K     RC  +K +   ++QD  +    +H +  +L   GPI
Sbjct: 202 TVLNNSGLASEKDYPFQGKVR-AHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPI 260

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE---KNGILT----------- 303
            V +N + ++ Y    I+     C+P  +DH+V +VG+G    + GI             
Sbjct: 261 TVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQP 320

Query: 304 ------WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
                 WI++NSWG    + GYF++ RG+N CGI  +   A V+
Sbjct: 321 PHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLTARVQ 364


>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
          Length = 358

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 83/221 (37%), Positives = 116/221 (52%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LPK++DWR  K   + PV++QG+CGSCWAF+TT  LE Q       +  LS+  LV+C  
Sbjct: 142 LPKTVDWR--KKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGRMVSLSEQNLVDCSG 199

Query: 185 -HGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  C GG +D AF+Y+K  G ++++  YPY   + I   C +EK  + V   DT   
Sbjct: 200 KFGNNGCEGGLMDNAFKYIKANGGIDTELSYPYNGTDGI---CHFEK--SDVGATDTGFV 254

Query: 243 SGVDHMMHLLQS-----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
              +    LL+      GP+ V ++  H   + Y        +  C+   LDH V +VGY
Sbjct: 255 DIPEGNEQLLKKAVATVGPVSVAIDASHESFQFYSQGVYDEPE--CSSESLDHGVLVVGY 312

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G K+G   W+V+NSWG    D GY  + R   N CGI S A
Sbjct: 313 GTKDGQDYWLVKNSWGTTWGDDGYIYMTRNKENQCGIASSA 353


>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 333

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 85/240 (35%), Positives = 122/240 (50%), Gaps = 20/240 (8%)

Query: 107 LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVA 166
           +   R + +   +   K  LP ++DWR      +  V+ QG+CGSCWAF+TT  LE Q  
Sbjct: 98  MRGQRTQDRHTFSFNSKIALPDTVDWRDKGY--VTDVKDQGQCGSCWAFSTTGALEGQHF 155

Query: 167 LLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITF 223
                L  LS+  LV+C    GN+ CNGG +D AFEY+K+  G++++  YPY   +N   
Sbjct: 156 KQTGKLVSLSEQNLVDCSGKQGNMGCNGGLMDQAFEYIKENNGIDTEDSYPYEAVDN--- 212

Query: 224 RCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS-----GPIGVYLN--HRLIESYDGNPIRR 276
           +C ++   A V   DT  T         LQ      GPI V ++  H   + Y       
Sbjct: 213 QCRFKA--ANVGATDTGFTDITSKDESALQQAVATVGPISVAIDAGHTSFQLYKHGVY-- 268

Query: 277 NDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           N+  C+  +LDH V  VGYG  +G   W+V+NSWG+   D GY ++ R   N CGI + A
Sbjct: 269 NEPFCSQTRLDHGVLAVGYGTDSGKDYWLVKNSWGEGWGDKGYIKMTRNKRNQCGIATAA 328


>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
          Length = 448

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 94/321 (29%), Positives = 147/321 (45%), Gaps = 31/321 (9%)

Query: 32  LAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEI 91
           L  +  +  DAFKT   K+N+ Y    E   RF  F Q+    + +   +     +   +
Sbjct: 22  LTVNKGRLFDAFKT---KFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHT-HTV 77

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKK----GPLPKSLDWRQSKVKVLNPVESQG 147
                  LT +E  +L       +    ER++    GP   S+DWRQ     + P+++QG
Sbjct: 78  DVNQFADLTNEEYRQLYLRPYPTELLGRERQEVWLDGPNAGSVDWRQKGA--VTPIKNQG 135

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEY-VKQ 204
           +CGSCW+F+TT  +E   A+    L  LS+ QLV+C    GN  CNGG +D AF+Y +  
Sbjct: 136 QCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISN 195

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYL--N 262
            GL+++ DYPY  ++ +  +    K    +        +  D +   ++ GP+ V +  +
Sbjct: 196 GGLDTEQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEAD 255

Query: 263 HRLIESYD----GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHG 318
            +  + Y       P   N        LDH V +VGY        WIV+NSWG    D G
Sbjct: 256 QQSFQMYSSGVFSGPCGTN--------LDHGVLVVGYTSD----YWIVKNSWGASWGDQG 303

Query: 319 YFQIERGANACGIESYAYLAS 339
           Y  ++RG ++ GI   A   S
Sbjct: 304 YIMMKRGVSSAGICGIAMQPS 324


>gi|33333704|gb|AAQ11970.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 71/212 (33%), Positives = 111/212 (52%), Gaps = 13/212 (6%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC---DH 185
           ++DWR+     + PV+ Q  CGSCWAF+    +E Q      TL  LS  +LV+C   D+
Sbjct: 115 AIDWREEGA--VTPVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDY 172

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
           GN  C GG +  AF++V+  G++++  YPY  +     R + +K    V    T+V    
Sbjct: 173 GNNGCKGGLMGQAFDFVQDEGIQTEESYPYEGR-----RSSCKKSGEYVTKVKTYVFPLD 227

Query: 246 DHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHK-LDHAVAIVGYGEKNGIL 302
           +  M   +   GP+ V +    +  YD   +       N  + L+H V +VGYG +NG+ 
Sbjct: 228 EQEMARTVAAKGPVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYGSENGVD 287

Query: 303 TWIVRNSWGDIGPDHGYFQIERGANACGIESY 334
            WIV+NSWG    + GYF++++   ACGI +Y
Sbjct: 288 YWIVKNSWGADWGEKGYFRLKKDVKACGIGTY 319


>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 325

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 86/248 (34%), Positives = 122/248 (49%), Gaps = 19/248 (7%)

Query: 96  GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
           G+R  G    +  A    + + ++      LP S+DWR + +  + PV++QG+CGSCW+F
Sbjct: 84  GVRFNGVNATKSFASSTYLPRMVS------LPDSVDWRTAGI--VTPVKNQGQCGSCWSF 135

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEY-VKQYGLESQAD 212
           +TT  +E Q A    TL  LS+  LV+C    GN  CNGG +D AFEY +K  G++++A 
Sbjct: 136 STTGSVEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEAS 195

Query: 213 YPYRNKENITFRCTYEKEKAKVFV---QDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESY 269
           YPY      T  C +        V   QD    S  D    +   GP+ V ++   I   
Sbjct: 196 YPYTAT---TGTCKFNAANIGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQ 252

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILTWIVRNSWGDIGPDHGYFQIERGA-N 327
                  N+  C+  +LDH V  VGYG    G   W+V+NSWG      GY  + R A N
Sbjct: 253 FYFTGVYNEKKCSTTQLDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKAGYIWMSRNADN 312

Query: 328 ACGIESYA 335
            CGI + A
Sbjct: 313 QCGIATSA 320


>gi|344257452|gb|EGW13556.1| Cathepsin L1 [Cricetulus griseus]
          Length = 290

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 83/226 (36%), Positives = 121/226 (53%), Gaps = 16/226 (7%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G +PKS+DWR   +  + PV+ QG+C SCWAF+    LE Q+      L  LS+  LV+C
Sbjct: 72  GDVPKSVDWRN--LSYVTPVKDQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSEQNLVDC 129

Query: 184 --DHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-T 239
              +GN+ C GG ++ AF YVK+  GL+++  YPY  +      C Y+ + +   V D  
Sbjct: 130 SWSYGNIGCFGGLMEYAFRYVKENRGLDTRVSYPYEARNG---PCRYDPKNSAANVTDFV 186

Query: 240 WVTSGVDHMMHLLQS-GPI--GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
            +    D +M  + + GPI  GV  +H     Y G         C+   LDHAV +VGYG
Sbjct: 187 KIPISEDALMKAVATVGPISVGVDSHHHSFRFYKGGMYYEPH--CSSSNLDHAVLVVGYG 244

Query: 297 EK-NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           E+ +G   W+V+NSWG     +GY ++ R   N CGI +YA   +V
Sbjct: 245 EESDGNKYWMVKNSWGQGWGMNGYIKMARDRNNNCGIATYAIYPTV 290


>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 84/221 (38%), Positives = 114/221 (51%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPK++DWR  K   + PV+ QG+CGSCWAF+ T  LE Q  L    L  LS+  LV+C  
Sbjct: 116 LPKAVDWR--KKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
             GN  C GG ++ AF+Y+K   G++++  YPY   E +   C ++KE   V   DT   
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPY---EAVDGECRFKKED--VGATDTGYV 228

Query: 241 ---VTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
                S VD    +   GPI V ++  H   + Y        +  C+   LDH V +VGY
Sbjct: 229 EIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPE--CSSEDLDHGVLVVGY 286

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIER-GANACGIESYA 335
           G K G   W+V+NSW +   D GY  + R   N CGI S A
Sbjct: 287 GVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327


>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
          Length = 329

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 86/236 (36%), Positives = 126/236 (53%), Gaps = 21/236 (8%)

Query: 112 ERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKT 171
           E      +   +  +P SLDWR  KV  +  V++QG+CGSCWAF+TT  LE   AL    
Sbjct: 97  ETSGSVFSSSLRNAMPSSLDWRDKKV--VTDVKNQGKCGSCWAFSTTGSLEGLHALKTGH 154

Query: 172 LYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQYGL-ESQADYPYRNKENITFRCTYE 228
           L  LS+ QL++C   +GN  C+GGN+  AF+Y+K  G  +++  YPY  K      C ++
Sbjct: 155 LVSLSEQQLMDCSVKYGNNGCDGGNMRSAFQYIKDAGGDDTEESYPYTAKNE---SCRFD 211

Query: 229 KEKAKV----FVQDTWVTSG--VDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACN 282
            +K       +V+   + SG  V  M  L + GPI V ++  L           +D+ C+
Sbjct: 212 PKKVGATDEGYVR---IPSGDEVSLMHALYEVGPISVAMDAGLKTFQFYKKGIYSDYLCS 268

Query: 283 PHKLDHAVAIVGYGE-KNGILTWIVRNSWG-DIGPDHGYFQIER-GANACGIESYA 335
              L+H V ++GYGE  +G   W+V+NSWG D G D GYF + R   N CG+ + A
Sbjct: 269 NTHLNHGVTLIGYGESSDGSPYWLVKNSWGKDWGID-GYFMLARYVGNMCGVATDA 323


>gi|351701945|gb|EHB04864.1| Cathepsin W [Heterocephalus glaber]
          Length = 373

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 92/330 (27%), Positives = 153/330 (46%), Gaps = 39/330 (11%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYF------KQDGKETD---EYYGTSGSSDRSPQEI 91
           + FK + +++N++Y++  E   R + F       Q  +E D     +G +  SD + +E 
Sbjct: 40  EVFKLFQIQFNKSYSNPAEHARRLDIFVHNLAMAQRLQEEDLGTAEFGVTPFSDLTEEEF 99

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
            Q  G         R      RV + +   K+  +P S DWR++   +++PV+ QG+C  
Sbjct: 100 GQLYG-------NWRAAKKDLRVGRKVRFEKQELIPPSCDWRKAP-NIISPVKYQGKCNC 151

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQ 210
           CWA A    +E+   +  K    +S  +L++C      C GG +  AF  V  Y GL S+
Sbjct: 152 CWAIAAAGNIEALWNIRFKQSVEVSVQELLDCGRCGDGCLGGYVWDAFITVLNYSGLASE 211

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLNHRLIES 268
            DY +R + NI  RC     K   ++QD  +    +H M  ++   GPI V +N  L++ 
Sbjct: 212 KDYRFRGRANI-HRCLAPFYKKVAWIQDYVMLPRNEHTMARYVATQGPITVLINQMLLQH 270

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYG------------------EKNGILTWIVRNSW 310
           Y    IR     C+P  ++H V +VG+G                   ++    WI++NSW
Sbjct: 271 YRQGIIRATPSTCDPWLVNHYVLLVGFGKEEEKKGSEKDLSQSNHLPRHSTPYWILKNSW 330

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASV 340
           G    + GYF++ +G+N CGI      A +
Sbjct: 331 GAHWGEQGYFRLHQGSNTCGITRSPLTACI 360


>gi|332249835|ref|XP_003274061.1| PREDICTED: cathepsin W [Nomascus leucogenys]
          Length = 403

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 86/344 (25%), Positives = 154/344 (44%), Gaps = 40/344 (11%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGT 80
           +DL    ++  +AFK + +++NR+Y    E   R + F  +  +             +G 
Sbjct: 56  QDLGPQPLELKEAFKLFQIQFNRSYLSPEEHARRLDIFAHNLAQAQRLQEEDLGTAEFGV 115

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           +  SD + +E  Q  G R        +       ++  +E  +  +P + DWR+     +
Sbjct: 116 TPFSDLTEEEFGQLYGYRRAAGGVPSMG------REIRSEEPEESVPFTCDWRKV-AGAI 168

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAF 199
           +P++ Q  C  CWA A    +E+   +       +S  +L++C      C+GG + D   
Sbjct: 169 SPIKDQKNCNCCWAMAAAGNIEALWRINFWDFVDVSVQELLDCSRCGDGCHGGFVWDAFI 228

Query: 200 EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPI 257
             +   GL S+ DYP++ K     RC  +K +   ++QD  +    +H +  +L   GPI
Sbjct: 229 TVLNNSGLASEKDYPFQGKVR-AHRCHPKKYQKVAWIQDFIMLQNSEHRIAQYLATYGPI 287

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE---KNGILT----------- 303
            V +N + ++ Y    I+     C+P  +DH+V +VG+G    + GI             
Sbjct: 288 TVTINMKPLQLYRKGVIKATSTTCDPQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQP 347

Query: 304 ------WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
                 WI++NSWG    + GYF++ RG+N CGI  +   A V+
Sbjct: 348 PHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLTARVQ 391


>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
 gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 101/318 (31%), Positives = 152/318 (47%), Gaps = 45/318 (14%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K +D F+++I ++ R Y    E   RFE FK +    D+        + G +  +D S +
Sbjct: 42  KLIDLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKKVRNYWLGLNEFADLSHE 101

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E   +  L L     +R +   E   K +       +PKS+DWR  K   + PV++QG C
Sbjct: 102 EFKNKY-LGLKPDLSKRAQCPEEFTYKDV------AIPKSVDWR--KKGAVTPVKNQGSC 152

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYGL 207
           GSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AF Y V   GL
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGL 212

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV--------DHMMHLLQSGPIGV 259
             + DYPY  +E     C   KE++     D    SG         + ++  L + P+ +
Sbjct: 213 HKEEDYPYIMEEGT---CDMRKEES-----DAVTISGYHDVPQNSEESLLKALANQPLSI 264

Query: 260 YL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            +  + R  + Y G      D  C   +LDH VA VGYG   G+   IV+NSWG    + 
Sbjct: 265 AIEASGRDFQFYSGGVF---DGHCGT-ELDHGVAAVGYGTSKGLDYIIVKNSWGPKWGEK 320

Query: 318 GYFQIERGAN----ACGI 331
           GY +++R  +     CGI
Sbjct: 321 GYIRMKRKTSKPEGICGI 338


>gi|397516975|ref|XP_003828695.1| PREDICTED: cathepsin W [Pan paniscus]
          Length = 376

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 87/344 (25%), Positives = 152/344 (44%), Gaps = 40/344 (11%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGT 80
           +DL    ++  +AFK + +++NR+Y    E   R + F  +  +             +G 
Sbjct: 29  QDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGV 88

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           +  SD + +E  Q  G R        +       ++  +E  +  +P S DWR+     +
Sbjct: 89  TPFSDLTEEEFGQLYGYRRAAGGVPSMG------REIRSEEPEESVPFSCDWRKV-AGAI 141

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAF 199
           +P++ Q  C  CWA A    +E+   +       +S  +L++C      C GG + D   
Sbjct: 142 SPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQELLDCSRCGDGCQGGFVWDAFI 201

Query: 200 EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPI 257
             +   GL S+ DYP++ K     RC  +K +   ++QD  +    +H +  +L   GPI
Sbjct: 202 TVLNNSGLASEKDYPFQGKVR-AHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPI 260

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE---KNGILT----------- 303
            V +N + +  Y    I+     C+P  +DH+V +VG+G    + GI             
Sbjct: 261 TVTINMKPLRLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQP 320

Query: 304 ------WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
                 WI++NSWG    + GYF++ RG+N CGI  +   A V+
Sbjct: 321 PHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLTARVQ 364


>gi|168047065|ref|XP_001775992.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672650|gb|EDQ59184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 336

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 89/308 (28%), Positives = 146/308 (47%), Gaps = 18/308 (5%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL 97
           + V  F  +  K+ + Y    E+K RF  F +  K  + +     S   +  E    T  
Sbjct: 24  RDVLHFAGFAAKYKKEYKTVEELKHRFVTFLESVKLVETHNKGQHSYSLAVNEFADMTFE 83

Query: 98  RLTGKEKERLEADRERVKKFLNERKKG-PLPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
               ++   ++ ++       N    G  LPK+ DWR+  +  ++ V++Q  CGSCW F+
Sbjct: 84  EF--RDSRLMKGEQNCSATVGNHVLTGESLPKTKDWREEGI--VSQVKNQASCGSCWTFS 139

Query: 157 TTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQADY 213
           TT  LE+  A     +  LS+ QLV+C  +  N  C GG    AFEY++   G++++  Y
Sbjct: 140 TTGALEAAHAQATGKMVLLSEQQLVDCAGEFNNFGCGGGLPSQAFEYIRYNGGIDTEDSY 199

Query: 214 PYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGVYLNHRLIES 268
           PY  K++   +C + K      V D   +T G +    H +  ++   +   + H     
Sbjct: 200 PYNAKDS---QCRFHKNTIGAQVWDVVNITEGAETQLKHAIATMRPVSVAFEVVHDF-RL 255

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGE-KNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
           Y+G      +    P  ++HAV  VGYGE +NG+  WI++NSWG     +GYF +E G N
Sbjct: 256 YNGGVYTSLNCHTGPQTVNHAVLAVGYGEDENGVPYWIIKNSWGADWGMNGYFNMEMGKN 315

Query: 328 ACGIESYA 335
            CG+ + A
Sbjct: 316 MCGVATCA 323


>gi|146335580|gb|ABQ23399.1| cathepsin L isotype 2 [Trypanoplasma borreli]
          Length = 443

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 89/309 (28%), Positives = 137/309 (44%), Gaps = 25/309 (8%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEIL 92
           D F  +     R Y    E + RFE F  + K+  E         +G +  +D S +E  
Sbjct: 23  DLFSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEFQ 82

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
            R                 +  K F  E  K    + +DWR      +  V++QG CGSC
Sbjct: 83  TRH--NAARHYAAAKARRAKHTKSFTKEEIKAADGQKIDWRLKGA--VTSVKNQGSCGSC 138

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQYGLES 209
           W+F+TT  +E Q A+    L  LS+ +LV CD  +  CNGG +D AF ++   +   + +
Sbjct: 139 WSFSTTGNIEGQNAIATGNLVSLSEQELVSCDTTDNGCNGGLMDNAFGWLISTRGGQIAT 198

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFV-----QDTWVTSGVDHMMHLLQSGPIGVYLNHR 264
           +A YPY +   I   C+Y  +   V       QD   T   D    +   GP+ + ++  
Sbjct: 199 EASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTE-EDMAAFVFNYGPLSIGVDAS 257

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
             +SY G  I      C   ++DH V IVGY +      WI++NSW     + GY ++ +
Sbjct: 258 TWQSYAGGIITY----CPDVQIDHGVLIVGYDDTAPTPYWIIKNSWTANWGEDGYIRVAK 313

Query: 325 GANACGIES 333
           G+N CG+ S
Sbjct: 314 GSNMCGLTS 322


>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
 gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
          Length = 469

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 102/312 (32%), Positives = 154/312 (49%), Gaps = 37/312 (11%)

Query: 43  FKTYIVKWNRTYTDDN---EIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRL 99
           ++ ++VK  + ++++N   E + RF+ FK + +  DE+     S +RS +  L R    L
Sbjct: 51  YEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDEH----NSENRSYKVGLNRFA-DL 105

Query: 100 TGKE------KERLEADRERVKKFLNE---RKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
           T +E        R  A R R+ +  N    R    LP S+DWR  K   +  V+ QG CG
Sbjct: 106 TNEEYRSMYLGARSGAKRNRLSRSSNRYLPRVGDSLPDSVDWR--KEGAVAEVKDQGSCG 163

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLE 208
           SCWAF+T A +E    ++   L  LS+ +LV+CD   N  CNGG +D AF+++    G++
Sbjct: 164 SCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDRSYNEGCNGGLMDYAFQFIINNGGID 223

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYL--NH 263
           S+ DYPY  ++     C   ++ AKV   D +    V+    +   + + P+ V +    
Sbjct: 224 SEEDYPYLARDGT---CDTYRKNAKVVTIDNYEDVPVNDEKALQKAVANQPVSVAIEAGG 280

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
           R  + Y           C    LDH VA VGYG +NG   WIVRNSWG    + GY ++E
Sbjct: 281 REFQFYQSGIFTGR---CGT-ALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYIRME 336

Query: 324 R----GANACGI 331
           R        CGI
Sbjct: 337 RNIATATGKCGI 348


>gi|350415610|ref|XP_003490694.1| PREDICTED: cathepsin O-like [Bombus impatiens]
          Length = 355

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 94/318 (29%), Positives = 152/318 (47%), Gaps = 37/318 (11%)

Query: 43  FKTYIVKWNRTYTDD-NEIKTRFEYF--------KQDG---KETDEYYGTSGSSDRSPQE 90
           F+ Y++++N++Y +D  E + RF+ F        K +G    +   YYG +  SD S  E
Sbjct: 36  FQNYVMRYNKSYRNDPTEYEERFKRFLKSLRHIEKMNGLRPSQESAYYGLTEFSDMSEDE 95

Query: 91  ILQRTGLR-LTGKEKERLEADRERVKKFLNE----RKKGPLPKSLDWRQSKVKVLNPVES 145
            L  T L  L  + ++ +     R    L      +K   +P   DWR   V  + PV +
Sbjct: 96  FLSLTLLPDLPARGEKHVNESYHRRHHLLQSTNRVKKSVSIPLRFDWRDKGV--ITPVRN 153

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-DHGNLNCNGGNIDVAFEYV-- 202
           QG CG+CWAF+T  ++ES  A+   TL+ LS  ++++C  + N  C GG+I     ++  
Sbjct: 154 QGSCGACWAFSTVEVVESMYAIKNGTLHMLSVQEMIDCAKNSNFGCEGGDICSLLSWLLA 213

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKE-------KAKVFVQDTWVTSGVDHMMHLLQSG 255
            +  +  ++ YP   K   T  C   K        K + F  D +V +  + ++ +   G
Sbjct: 214 SKVQIFQESTYPLVGK---TSMCKLGKMIDKASGVKIRDFNCDNFVDAEDELLITVATHG 270

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNPH--KLDHAVAIVGYGEKNGILTWIVRNSWGDI 313
           P+   +N    ++Y G  I+   + C+     L+HAV IVGY +   I  +I++NSWG  
Sbjct: 271 PVAAAVNALSWQNYLGGVIQ---YHCDSSFDNLNHAVQIVGYDKSAAIPHYIIKNSWGTN 327

Query: 314 GPDHGYFQIERGANACGI 331
             D GY  I  G N CGI
Sbjct: 328 FGDKGYMYIGIGNNLCGI 345


>gi|114638622|ref|XP_001170363.1| PREDICTED: cathepsin W [Pan troglodytes]
          Length = 376

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 87/344 (25%), Positives = 152/344 (44%), Gaps = 40/344 (11%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGT 80
           +DL    ++  +AFK + +++NR+Y    E   R + F  +  +             +G 
Sbjct: 29  QDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGV 88

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           +  SD + +E  Q  G R        +       ++  +E  +  +P S DWR+     +
Sbjct: 89  TPFSDLTEEEFGQLYGYRRAAGGVPSMG------REIRSEEPEESVPFSCDWRKV-AGAI 141

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAF 199
           +P++ Q  C  CWA A    +E+   +       +S  +L++C      C GG + D   
Sbjct: 142 SPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQELLDCSRCGDGCQGGFVWDAFI 201

Query: 200 EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPI 257
             +   GL S+ DYP++ K     RC  +K +   ++QD  +    +H +  +L   GPI
Sbjct: 202 TVLNNSGLASEKDYPFQGKVR-AHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPI 260

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE---KNGILT----------- 303
            V +N + +  Y    I+     C+P  +DH+V +VG+G    + GI             
Sbjct: 261 TVTINMKPLRLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAERVSSQSQPQP 320

Query: 304 ------WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
                 WI++NSWG    + GYF++ RG+N CGI  +   A V+
Sbjct: 321 PHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLTARVQ 364


>gi|323454466|gb|EGB10336.1| hypothetical protein AURANDRAFT_22962 [Aureococcus anophagefferens]
          Length = 416

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 94/337 (27%), Positives = 162/337 (48%), Gaps = 39/337 (11%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDR 86
           D+  Q   F  +I +++++Y   +E   RF  F ++    D          +G +  +D+
Sbjct: 81  DTRDQKSLFDQFIDEYSKSYDTTHEYNDRFTIFSKNLNYIDALNTQNPHALFGLNVFADQ 140

Query: 87  SPQEILQR--TGLRLTGKEKERLEADRE----RVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           + +E  +R  T   +T   +    +  +     +     E   G LP   DWR+  +  +
Sbjct: 141 TEEERSKRRMTDPSITNYTRVGWASGSDCAACNLYPAFGEYDMGNLPDDFDWRE--LGAV 198

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFE 200
             V++Q  CGSCW+F+T A LE    L    L   +  QLVEC+  NL C+GG    A +
Sbjct: 199 TRVKNQAYCGSCWSFSTAADLEGTHYLATGDLESYAPQQLVECNTMNLGCDGGYPFAAMQ 258

Query: 201 YVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--VTSGVDH----MMHLLQ 253
           Y+  + G+ +    PY+  E +  +     E   V     W  V  G D+     + L++
Sbjct: 259 YLSHFGGMVTWETMPYKKIELLNEKL----EDGDVAHISGWQMVAMGADYESLMRVTLVK 314

Query: 254 SGPIGVYLNHRLIESY----DGNPIRRNDWACNPHKLDHAVAIVGYG----EKNG-ILTW 304
           +GP+ +  N   ++ Y    DG+    + + C+P  LDHAV +VGYG    + NG +  W
Sbjct: 315 NGPLSIAFNANGMDYYVHGVDGD---GDMFTCDPTSLDHAVLVVGYGVQHTDGNGKVPYW 371

Query: 305 IVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
           +++NSW D+  + GY+++ RG+NACG+ +    + VK
Sbjct: 372 VIKNSWDDVWGEDGYYRLVRGSNACGVANMVVHSIVK 408


>gi|5881566|dbj|BAA84280.1| Cysteine proteinase [Clonorchis sinensis]
          Length = 232

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 78/215 (36%), Positives = 110/215 (51%), Gaps = 10/215 (4%)

Query: 130 LDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLN 189
            DWR+     + PV  QG+CGSCWAF+    +  Q       L  LS+ QLV+CD+ +  
Sbjct: 25  FDWREHGA--VGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDDG 82

Query: 190 CNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHM 248
           C+GG     +  + K  GLE  +DYPY     I   C  +K K   ++  + +    + +
Sbjct: 83  CDGGYPPQTYTAIQKMGGLELASDYPYTGVGGI---CHMDKSKFVAYINGSTILPLSEKV 139

Query: 249 M--HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIV 306
               L   GP+   LN   ++ Y G  I R  W C+P  ++HAV  VGYG +NG   WIV
Sbjct: 140 QAQKLRAIGPLSSALNADTLQLYKGG-IMRPKW-CDPAGVNHAVLTVGYGVQNGKPYWIV 197

Query: 307 RNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
           +NSWG+   + GYF+I RG   CGI S    A +K
Sbjct: 198 KNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 232


>gi|294938848|ref|XP_002782226.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239893730|gb|EER14021.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 334

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 99/321 (30%), Positives = 152/321 (47%), Gaps = 36/321 (11%)

Query: 42  AFKTYIVKWNRTY-TDDNEIK------TRFEYFKQ-DGKETDEYYGTSGSSDRSPQEILQ 93
           AF  +  K+ + Y + + EIK          Y +Q + K      G +  +D + +E   
Sbjct: 27  AFMGFQHKFGKNYESKEEEIKRNAIFRAHLHYIEQVNAKNLSYKLGVNEHADLTHEEF-- 84

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
              L+L    K  ++ D + V K         L  S+DWR   V  L P++ QG CGSCW
Sbjct: 85  -AALKLGTSSKMSMKRDDKLVVK----ADTTQLLTSVDWRSKGV--LTPIKDQGPCGSCW 137

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQYGLESQA 211
           AF+ T  LE+Q A+    L  LS+ QL++C   +GN  C+GG ++ A+ Y+K  GL+ ++
Sbjct: 138 AFSATGALEAQYAIATGKLLSLSEQQLIDCSSSYGNEGCSGGLMENAYTYIKSAGLDQES 197

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRL-IESYD 270
            YPY  K N    C    EK    +    VT    HM+   + G +    +  + I  Y 
Sbjct: 198 TYPYIAKNN---ACQVSLEKRSDGIPAGEVTG--FHMLDQTEQGLMKALADAPVSIAMYA 252

Query: 271 GNPIRR-------NDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
            +P  R       +   C+   +DH V  VGYG +NG   +++RNSWG      GYF ++
Sbjct: 253 SDPDFRFYQSGVYSSKTCHG-TIDHGVVAVGYGTENGEDYFVIRNSWGSSWGQDGYFYLK 311

Query: 324 RGANA---CGIESYAYLASVK 341
           RG +    C I  Y  +A++K
Sbjct: 312 RGVSGYGECNILEYMCVATLK 332


>gi|163658591|gb|ABY28387.1| cathepsin L [Gnathostoma spinigerum]
          Length = 398

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/222 (36%), Positives = 123/222 (55%), Gaps = 20/222 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +P ++DWR S    +  V+ QG+CGSCWAF+ T  LE Q       L  LS+  LV+C  
Sbjct: 180 IPDTVDWRNSSYVTV--VKDQGQCGSCWAFSATGALEGQHMRKTHQLVSLSEQNLVDCSR 237

Query: 185 -HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQD---T 239
            +GN  CNGG +D AFEY+K  +G++++  YPY+  E    +C + ++   V  +D   T
Sbjct: 238 KYGNNGCNGGLMDNAFEYIKDNHGIDTEESYPYKGVEGK--KCHFRRK--FVGAEDYGYT 293

Query: 240 WVTSGVDHMMHLLQS--GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            +  G +  + +  +  GPI V ++  H   ++Y       N+  C+P  LDH V +VGY
Sbjct: 294 DLPEGDEEALKVAVATIGPISVAIDAGHISFQNYRKGIYTENE--CSPEDLDHGVLVVGY 351

Query: 296 G-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G ++N    WIV+NSWG    +HGY ++ R   N CGI S A
Sbjct: 352 GTDENAGDYWIVKNSWGTRWGEHGYIRMARNKRNQCGIASKA 393


>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 325

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 92/296 (31%), Positives = 137/296 (46%), Gaps = 21/296 (7%)

Query: 51  NRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEAD 110
           N+ Y+ ++E   R+  +K +     EY   S +           T      K    L   
Sbjct: 35  NKAYSHESEENVRYAIWKDNMNRITEYNSKSKNVILRMNHFGDMTNTEFRAKMNGLLLHK 94

Query: 111 RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKK 170
            +    FL        P ++DWR      + PV++QG+CGSCWAF++T  LE Q      
Sbjct: 95  HQNGSTFLVPSHTAA-PDAVDWRSEGY--VTPVKNQGQCGSCWAFSSTGALEGQHFKKTG 151

Query: 171 TLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTY 227
            L  LS+  LV+C  D+GN  CNGG +D AF Y+K  G ++++  YPY  ++     C Y
Sbjct: 152 RLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTETGYPYEGQDGT---CRY 208

Query: 228 EKEKAKVFVQDTW---VTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWA 280
            K  + +   DT    +  G +  +   +   GP+ V ++  H   + Y       ++  
Sbjct: 209 SK--SSIGADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSGVY--DEPQ 264

Query: 281 CNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER-GANACGIESYA 335
           C+P  LDH V +VGYG  NG   W+V+NSWG      GY  + R   N CGI S A
Sbjct: 265 CSPSALDHGVLVVGYGTDNGKDYWLVKNSWGTGWGTEGYIYMSRNNQNQCGIASKA 320


>gi|121531602|gb|ABM55486.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
          Length = 326

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 104/329 (31%), Positives = 161/329 (48%), Gaps = 38/329 (11%)

Query: 25  AIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQ------------DGK 72
           AI+    +A  +    D +  +     +TY +  E +TRF  F++            D  
Sbjct: 5   AIFATVLIAVTASTNEDQWIAFKQTHGKTYKNLLEERTRFGIFQRNLIKIKEHNARCDKG 64

Query: 73  ETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDW 132
           E     G +  +D + +E   +  L+   K K RL A      + L       +P S+DW
Sbjct: 65  EETYLLGVTRFADLTHEEF--KDILKGQIKNKPRLNATPTVFPEDLE------VPDSIDW 116

Query: 133 RQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNC 190
            + K  VL  V+ Q  CGSCWAF+ T  LE Q A+L      LS+ QL++C   +GN NC
Sbjct: 117 TE-KGAVLE-VKGQNPCGSCWAFSATGALEGQNAILNNAKISLSEQQLLDCSAAYGNGNC 174

Query: 191 N-GGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM 248
             GG++  AFEYV+ YG++S+  YPY  K+     C Y+  K  + ++    VT+  + +
Sbjct: 175 KEGGDMSAAFEYVRDYGIQSEKSYPYIRKQT---ECQYDASKTILKIKGYKNVTTSEEGL 231

Query: 249 MHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG----ILT 303
              + + GP+ + +N   ++ Y           C+ H LDH V +VGYG+ +        
Sbjct: 232 RKAVGTIGPMSIAMNSGPLQLYYSGIFSGK--GCS-HDLDHGVLVVGYGKASQWSGETKF 288

Query: 304 WIVRNSWGDIGPDHGYFQIERGA-NACGI 331
           W V+NSWG I  ++GYF+I+R A N CGI
Sbjct: 289 WRVKNSWGKIWGENGYFRIKRDANNLCGI 317


>gi|74213650|dbj|BAE35627.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 118/224 (52%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +PKS+DWR+     + PV++QG+CGSCWAF+ +  LE Q+ L    L  LS+  LV+C H
Sbjct: 114 IPKSVDWREKGC--VTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH 171

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF+Y+K+  GL+S+  YPY  K+     C Y  E A     DT   
Sbjct: 172 AQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDG---SCKYRAEFA--VANDTGFV 226

Query: 243 SGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
                   L+++    GPI V ++  H  ++ Y        +  C+   LDH V +VGYG
Sbjct: 227 DIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN--CSSKNLDHGVLLVGYG 284

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
               + N    W+V+NSWG      GY +I +   N CG+ + A
Sbjct: 285 YEGTDSNKNKYWLVKNSWGSEWGMEGYIEIAKDRDNHCGLATAA 328


>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 87/225 (38%), Positives = 120/225 (53%), Gaps = 18/225 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LP  +DW Q     +  V++QG+CGSCWAF+TT  LE QV      L  LS+  LV+C  
Sbjct: 114 LPAEVDWTQKGY--VTEVKNQGQCGSCWAFSTTGSLEGQVFKKTGKLVSLSEQNLVDCST 171

Query: 185 -HGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKV--FVQDTW 240
             GN  CNGG +D AF Y+K+ G ++++A YPY   +  T R    K  A V  FV    
Sbjct: 172 SEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPYTGSDG-TCRFLENKVGATVSGFVD--- 227

Query: 241 VTSGVDHMMH--LLQSGPIGVYLNHRLI--ESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           V SG ++ +   +   GPI V ++   I  + Y G     N W C+  +LDH V +VGYG
Sbjct: 228 VKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVY--NPWFCSSTELDHGVLVVGYG 285

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
            + G   W+V+NSWG      GY ++ R   N CGI + A   +V
Sbjct: 286 TEGGKDYWLVKNSWGSSWGLKGYIKMVRNKKNRCGIATQASYPTV 330


>gi|388513209|gb|AFK44666.1| unknown [Lotus japonicus]
 gi|388514955|gb|AFK45539.1| unknown [Lotus japonicus]
          Length = 352

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 98/314 (31%), Positives = 142/314 (45%), Gaps = 39/314 (12%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYYGTSGSSDRSPQEILQ 93
           +F  +  K+ + Y    EI+ RF  F ++         K      G +  +D S  E   
Sbjct: 52  SFARFASKYGKRYDSVEEIQHRFRIFSENLELIKSTNKKRLSYKLGLNHFADLSWDEF-- 109

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERK--KGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
           RT         ++L A +      +   K     LP   DWR  K  +++ V+ Q  CGS
Sbjct: 110 RT---------QKLGAAQNCSATLIGNHKLTDAVLPAEKDWR--KESIVSEVKDQAHCGS 158

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLE 208
           CW F+TT  LE+  A        LS+ QLV+C     N  CNGG    AFEY+K   G+ 
Sbjct: 159 CWTFSTTGALEAAYAQAHGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKYNGGIA 218

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGV-DHMMHLLQ-SGPIGVYL---- 261
            + +YPY  K+     C +  E   V V D+  +T G  D + H +  + P+ V      
Sbjct: 219 LEKEYPYTAKDEA---CKFTAENVAVRVLDSVNITLGAEDELKHAVAFARPVSVAFQVVD 275

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
             RL   Y       +     P  ++HAV  VGYG +N +  WI++NSWG    DHGYF+
Sbjct: 276 GFRL---YKEGVYTSDTCGNTPMDVNHAVLAVGYGVENNVPYWIIKNSWGSTWGDHGYFK 332

Query: 322 IERGANACGIESYA 335
           +E G N CG+ + A
Sbjct: 333 MELGKNMCGVATCA 346


>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
 gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
          Length = 333

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 85/244 (34%), Positives = 121/244 (49%), Gaps = 23/244 (9%)

Query: 111 RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKK 170
           + R  K   E     LPKS+DWR+     + PV++QG+CGSCWAF+    LE Q+ L   
Sbjct: 99  KHRKGKVFQEPLMLQLPKSVDWREKGC--VTPVKNQGQCGSCWAFSACGALEGQMCLKTG 156

Query: 171 TLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTY 227
            L  LS+  LV+C    GN  CNGG +D AF+YV    GL+S+  YPY  K+     C Y
Sbjct: 157 VLVSLSEQNLVDCSQAEGNQGCNGGLMDFAFQYVLNNKGLDSEESYPYEAKDGT---CKY 213

Query: 228 EKEKAKV----FVQDTWVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWAC 281
           + E A      +V    +   +  M  +   GPI + ++  H   + Y        +  C
Sbjct: 214 KPEFAAANDTGYVDIPQLEKAL--MKAVATVGPIAIAIDASHPSFQFYSSGIYYEPN--C 269

Query: 282 NPHKLDHAVAIVGYG----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAY 336
           +  +LDH V +VGYG    + N    WIV+NSWG      G+F I +   N CG+ + A 
Sbjct: 270 SSKELDHGVLVVGYGFEGTDSNKKKYWIVKNSWGSSWGMGGFFHIAKDKNNHCGVATAAS 329

Query: 337 LASV 340
             +V
Sbjct: 330 YPTV 333


>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 335

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 88/260 (33%), Positives = 127/260 (48%), Gaps = 23/260 (8%)

Query: 86  RSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
           ++ + +   TG R+ G  K        +   FL     G LPK++DWR      + PV+ 
Sbjct: 84  KNEEFVAMMTGFRVNGTSKAA------KGSTFLPSNNIGELPKTVDWRTKGY--VTPVKD 135

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEY-V 202
           QG+CGSCWAF+TT  LE Q       L  LS+  LV+C    GN  C+GG +D AF+Y +
Sbjct: 136 QGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKEGNEGCDGGLMDQAFQYII 195

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGV 259
           K  G++++  YPY+  +     C ++K      V   T VTS  +  +   +   GPI V
Sbjct: 196 KAGGIDTEESYPYKAVDG---ECHFKKANIGATVTGYTDVTSDSETALQKAVAHIGPISV 252

Query: 260 YLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPD 316
            ++  H   + Y        D  C+   LDH V  VGYG   +G   WIV+NSW +    
Sbjct: 253 AIDASHMSFQLYKSGVYNEPD--CSSTLLDHGVLAVGYGTTSDGTDYWIVKNSWAETWGM 310

Query: 317 HGYFQIERGA-NACGIESYA 335
           +GY  + R   N CGI + A
Sbjct: 311 NGYLWMSRNKDNQCGIATQA 330


>gi|110625773|ref|NP_081620.2| cathepsin L-like 3 precursor [Mus musculus]
 gi|74208432|dbj|BAE26401.1| unnamed protein product [Mus musculus]
 gi|187955662|gb|AAI47425.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
 gi|187957686|gb|AAI47424.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
          Length = 331

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 88/233 (37%), Positives = 120/233 (51%), Gaps = 16/233 (6%)

Query: 112 ERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKT 171
           + + K   E   G +PKS+DWR      + PV+ QG CGSCWAF+    LE Q+      
Sbjct: 101 KMMMKVFQEPLLGDVPKSVDWRDHGY--VTPVKDQGSCGSCWAFSAVGSLEGQMFRKTGK 158

Query: 172 LYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYE 228
           L PLS   LV+C    GN  C+GG  D+AF+YVK   GL++   YPY   E +   C Y 
Sbjct: 159 LVPLSVQNLVDCSWSQGNQGCDGGLPDLAFQYVKDNGGLDTSVSYPY---EALNGTCRYN 215

Query: 229 -KEKAKVFVQDTWVTSGVDHMMHLLQS-GPIGVYLN--HRLIESYDGNPIRRNDWACNPH 284
            K  A        V S  D +M  + + GPI V ++  H+  + Y        D  C+  
Sbjct: 216 PKNSAATVTGFVNVQSSEDALMKAVATVGPISVGIDTKHKSFQFYKEGMYYEPD--CSST 273

Query: 285 KLDHAVAIVGYGEK-NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            LDHAV +VGYGE+ +G   W+V+NSWG     +GY ++ +   N CGI S A
Sbjct: 274 VLDHAVLVVGYGEESDGRKYWLVKNSWGRDWGMNGYIKMAKDRNNNCGIASDA 326


>gi|354502595|ref|XP_003513369.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
          Length = 330

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 84/232 (36%), Positives = 122/232 (52%), Gaps = 16/232 (6%)

Query: 118 LNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSK 177
             E   G +PKS+DWR   +  + PV+ QG+C SCWAF+    LE Q+      L  LS+
Sbjct: 106 FQEPLLGDVPKSVDWRN--LSYVTPVKDQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSE 163

Query: 178 SQLVEC--DHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKV 234
             LV+C   +GN+ C GG ++ AF YVK+  GL+++  YPY  +      C Y+ + +  
Sbjct: 164 QNLVDCSWSYGNIGCFGGLMEYAFRYVKENRGLDTRVSYPYEARNG---PCRYDPKNSAA 220

Query: 235 FVQD-TWVTSGVDHMMHLLQS-GPI--GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAV 290
            V D   +    D +M  + + GPI  GV  +H     Y G         C+   LDHAV
Sbjct: 221 NVTDFVKIPISEDALMKAVATVGPISVGVDSHHHSFRFYKGGMYYEPH--CSSSNLDHAV 278

Query: 291 AIVGYGEK-NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
            +VGYGE+ +G   W+V+NSWG     +GY ++ R   N CGI +YA   +V
Sbjct: 279 LVVGYGEESDGNKYWMVKNSWGQGWGMNGYIKMARDRNNNCGIATYAIYPTV 330


>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 325

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 80/219 (36%), Positives = 115/219 (52%), Gaps = 16/219 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +P  +DWR  K   + P+++QGRCGSCWAF+TT  LE Q       L  LS+  L++C  
Sbjct: 109 MPTEVDWR--KEGYVTPIKNQGRCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLIDCSA 166

Query: 185 -HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVF---VQDT 239
             GN  C GG +D AFEY+K   G++++A YPY  +++I   C Y+K           D 
Sbjct: 167 AEGNDGCGGGFMDDAFEYIKLNNGIDTEASYPYEGRDDI---CRYKKTNKGAIDTGYMDI 223

Query: 240 WVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
              S  D    +   GPI V ++  H+    Y        +  C+   LDH V +VGYG 
Sbjct: 224 KQYSEDDLKAAVATVGPISVAIDASHKSFHMYHTGVYHEPE--CSQTVLDHGVLVVGYGT 281

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
           +NG   W+V+NSWG     +GY ++ R  +N CGI + A
Sbjct: 282 ENGEDYWLVKNSWGTDWGMNGYIKMSRNRSNNCGIATNA 320


>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
 gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
 gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
          Length = 350

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 95/302 (31%), Positives = 152/302 (50%), Gaps = 30/302 (9%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K ++ F++++ +  + Y    E   RFE FK + K  D+        + G +  +D S Q
Sbjct: 42  KLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVVSNYWLGLNEFADLSHQ 101

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E   +  L L     +R E+  E         +   LPKS+DWR  K   + PV++QG+C
Sbjct: 102 EFKNKY-LGLKVDLSQRRESSEEEFT-----YRDVDLPKSVDWR--KKGAVTPVKNQGQC 153

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYGL 207
           GSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AF + VK  GL
Sbjct: 154 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVKNGGL 213

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--N 262
             + DYPY  +E+    C  +KE ++V   + +     +    ++  L + P+ V +  +
Sbjct: 214 HKEEDYPYIMEEST---CEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEAS 270

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
            R  + Y G      D  C   +LDH V+ VGYG   G+   IV+NSWG    + G+ ++
Sbjct: 271 GRDFQFYSGGVF---DGHCGS-ELDHGVSAVGYGTSKGLDYIIVKNSWGAKWGEKGFIRM 326

Query: 323 ER 324
           +R
Sbjct: 327 KR 328


>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 463

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 97/290 (33%), Positives = 139/290 (47%), Gaps = 39/290 (13%)

Query: 63  RFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLR----LTGKEKERLEADRERVKKFL 118
           RFE FK + +  DE+         + + +  + GL     LT  E   +    + VK+ L
Sbjct: 74  RFEIFKDNLRYIDEH---------NTKNLSYKLGLTRFADLTNDEYRSMYLGAKPVKRVL 124

Query: 119 N------ERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTL 172
                   R    LP S+DWR  K   +  V+ QG CGSCWAF+T   +E    ++   L
Sbjct: 125 KTSDRYEARVGDALPDSVDWR--KEGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDL 182

Query: 173 YPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKE 230
             LS+ +LV+CD   N  CNGG +D AFE++ K  G++++ADYPY+  +    RC   ++
Sbjct: 183 ISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADG---RCDQNRK 239

Query: 231 KAKVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHK 285
            AKV   D++     +    L   L   PI V +    R  + Y        D  C   +
Sbjct: 240 NAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVF---DGICGT-E 295

Query: 286 LDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG----ANACGI 331
           LDH V  VGYG +NG   WIVRNSWG+   + GY ++ R        CGI
Sbjct: 296 LDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNIAEPTGKCGI 345


>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
          Length = 336

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 84/238 (35%), Positives = 126/238 (52%), Gaps = 26/238 (10%)

Query: 119 NERKKGPL---------PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLK 169
           N+  +GPL         P+ +DWRQ     + PV+ Q +CGSCW+F++T  LE Q+    
Sbjct: 99  NQTSQGPLFMEPSFFAAPQQVDWRQRGY--VTPVKDQKQCGSCWSFSSTGALEGQLFRKT 156

Query: 170 KTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCT 226
             L  +S+  LV+C   HGN  CNGG +D AF+YVK+  GL+S+  YPY  ++++  R  
Sbjct: 157 GKLISMSEQNLVDCSRPHGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYD 216

Query: 227 YEKEKAKV--FVQDTWVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACN 282
                AK+  FV D    + +  M  +   GP+ V ++  H+ ++ Y          AC+
Sbjct: 217 PRFNVAKITGFV-DIPKGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYER--ACS 273

Query: 283 PHKLDHAVAIVGYGEKNGILT----WIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
             +LDHAV +VGYG +   +     WIV+NSW D   D GY  + +   N CGI + A
Sbjct: 274 SSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMA 331


>gi|10391|emb|CAA38238.1| unnamed protein product [Trypanosoma brucei]
          Length = 450

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/300 (27%), Positives = 142/300 (47%), Gaps = 25/300 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD--------EYYGTSGSSDRSPQEILQR 94
           F  +  K+ + Y D  E   RF  F+++ ++            +G +  SD + +E   R
Sbjct: 41  FAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRAR 100

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                         A ++RV+K +N    G  P ++DWR+     + PV+ QG+CGSCWA
Sbjct: 101 YR-----NGASYFAAAQKRVRKTVN-VTTGRAPAAVDWREKGA--VTPVKDQGQCGSCWA 152

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY---GLESQA 211
           F+T   +E Q  +    L  LS+  LV CD  +  C GG +D AF ++       + ++A
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEA 212

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRLIESY 269
            YPY +      +C     +    + D   +    D +  +L ++GP+ + ++      Y
Sbjct: 213 SYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDY 272

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
           +G  +     +C   +LDH V +VGY + +    WI++NSW ++  + GY +IE+G N C
Sbjct: 273 NGGILT----SCTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328


>gi|148709373|gb|EDL41319.1| cathepsin 7, isoform CRA_b [Mus musculus]
          Length = 358

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 93/258 (36%), Positives = 135/258 (52%), Gaps = 25/258 (9%)

Query: 99  LTGKEKERL-EADRERVKKFLNERKKGP-LPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
           +TG+E + L E+    ++   + +K+ P +P +LDWR  K   + PV  QG CG+CWAF+
Sbjct: 110 MTGEEMKMLTESSSYPLRNGKHIQKRNPKIPPTLDWR--KEGYVTPVRRQGSCGACWAFS 167

Query: 157 TTAILESQVALLKKT--LYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQY-GLESQA 211
            TA +E Q  L KKT  L PLS   L++C   +G   C+GG    AF+YVK   GLE++A
Sbjct: 168 VTACIEGQ--LFKKTGKLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEA 225

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH--MMHLLQSGPIGVYLN--HRLIE 267
            YPY  K      C Y  E++ V V   +V    +   +  L+  GPI V ++  H    
Sbjct: 226 TYPYEAKAK---HCRYRPERSVVKVNRFFVVPRNEEALLQALVTHGPIAVAIDGSHASFH 282

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYG----EKNGILTWIVRNSWGDIGPDHGYFQIE 323
           SY G         C    LDH + +VGYG    E      W+++NS G+   ++GY ++ 
Sbjct: 283 SYRGGIYHEPK--CRKDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGERWGENGYMKLP 340

Query: 324 RGANA-CGIESYAYLASV 340
           RG N  CGI SYA   ++
Sbjct: 341 RGQNNYCGIASYAMYPAL 358


>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
 gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
           Short=MEP; AltName: Full=p39 cysteine proteinase;
           Contains: RecName: Full=Cathepsin L1 heavy chain;
           Contains: RecName: Full=Cathepsin L1 light chain; Flags:
           Precursor
 gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
 gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
 gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
 gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
 gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
 gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
 gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
 gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
 gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
 gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
 gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
 gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
 gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
          Length = 334

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 118/224 (52%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +PKS+DWR+     + PV++QG+CGSCWAF+ +  LE Q+ L    L  LS+  LV+C H
Sbjct: 114 IPKSVDWREKGC--VTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH 171

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF+Y+K+  GL+S+  YPY  K+     C Y  E A     DT   
Sbjct: 172 AQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDG---SCKYRAEFA--VANDTGFV 226

Query: 243 SGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
                   L+++    GPI V ++  H  ++ Y        +  C+   LDH V +VGYG
Sbjct: 227 DIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN--CSSKNLDHGVLLVGYG 284

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
               + N    W+V+NSWG      GY +I +   N CG+ + A
Sbjct: 285 YEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAA 328


>gi|146335578|gb|ABQ23398.1| cathepsin L isotype 1 [Trypanoplasma borreli]
          Length = 443

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 89/309 (28%), Positives = 137/309 (44%), Gaps = 25/309 (8%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEIL 92
           D F  +     R Y    E + RFE F  + K+  E         +G +  +D S +E  
Sbjct: 23  DLFSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEFQ 82

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
            R                 +  K F  E  K    + +DWR      +  V++QG CGSC
Sbjct: 83  TRH--NAARHYAAAKARRAKHTKSFTKEEIKAADGQKIDWRLKGA--VTSVKNQGSCGSC 138

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQYGLES 209
           W+F+TT  +E Q A+    L  LS+ +LV CD  +  CNGG +D AF ++   +   + +
Sbjct: 139 WSFSTTGNIEGQNAIATGNLVSLSEQELVSCDTTDNGCNGGLMDNAFGWLISTRGGQIAT 198

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFV-----QDTWVTSGVDHMMHLLQSGPIGVYLNHR 264
           +A YPY +   I   C+Y  +   V       QD   T   D    +   GP+ + ++  
Sbjct: 199 EASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTE-EDMAAFVFNYGPLSIGVDAS 257

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
             +SY G  I      C   ++DH V IVGY +      WI++NSW     + GY ++ +
Sbjct: 258 TWQSYAGGIITY----CPDVQIDHGVLIVGYDDTAPTPYWIIKNSWTANWGEDGYIRVAK 313

Query: 325 GANACGIES 333
           G+N CG+ S
Sbjct: 314 GSNMCGLTS 322


>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
 gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
 gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
 gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
          Length = 334

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 118/224 (52%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +PKS+DWR+     + PV++QG+CGSCWAF+ +  LE Q+ L    L  LS+  LV+C H
Sbjct: 114 IPKSVDWREKGC--VTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH 171

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF+Y+K+  GL+S+  YPY  K+     C Y  E A     DT   
Sbjct: 172 AQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDG---SCKYRAEFA--VANDTGFV 226

Query: 243 SGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
                   L+++    GPI V ++  H  ++ Y        +  C+   LDH V +VGYG
Sbjct: 227 DIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN--CSSKNLDHGVLLVGYG 284

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
               + N    W+V+NSWG      GY +I +   N CG+ + A
Sbjct: 285 YEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAA 328


>gi|6851030|emb|CAB71032.1| cysteine protease [Lolium multiflorum]
          Length = 359

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 90/316 (28%), Positives = 148/316 (46%), Gaps = 33/316 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-RLTG 101
           F  + V+  ++Y    E++ RF  F +   E         S++R  + +  + G+ R + 
Sbjct: 58  FARFAVRHGKSYGSAAEVQRRFRIFSESLDEVR-------STNR--KGLSYKLGINRFSD 108

Query: 102 KEKERLEADRERVKKFLNE--------RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
              E  +A +    +  +         R    LP++ DWR++ +  ++PV+ Q  CGSCW
Sbjct: 109 MTWEEFQATKLGAAQTCSATLAGNHLMRDANALPETKDWRETGI--VSPVKDQASCGSCW 166

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQ 210
            F+TT  LE+           LS+ QLV+C   + N  CNGG    AFEY+K   G++++
Sbjct: 167 TFSTTGALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGIDTE 226

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVDHMMHLLQSGPIGVYLNHRLIES- 268
             YPY+    +   C Y  E A V V D+  +T   +  +         V +   +I+  
Sbjct: 227 ESYPYKGVNGV---CKYRPENAAVQVADSVNITLNAEDELKNAVGLVRPVSVAFEVIDGF 283

Query: 269 --YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
             Y       +     P  ++HAV  VGYG +NG+  W+++NSWG    + GYF++E G 
Sbjct: 284 KQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGEDGYFKMEMGK 343

Query: 327 NACGI---ESYAYLAS 339
           N C +    SY  LA+
Sbjct: 344 NMCAVATCASYPILAA 359


>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
          Length = 350

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 101/318 (31%), Positives = 153/318 (48%), Gaps = 37/318 (11%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K ++ F++++ +  + Y +  E   RFE FK + K  DE        + G S  +D S +
Sbjct: 43  KLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLSEFADLSHR 102

Query: 90  EILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E   +  GL++    +      RE  ++F    K   LPKS+DWR  K   + PV++QG 
Sbjct: 103 EFNNKYLGLKVDYSRR------RESPEEFT--YKDVELPKSVDWR--KKGAVAPVKNQGS 152

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYG 206
           CGSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AF + V+  G
Sbjct: 153 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGG 212

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL-- 261
           L  + DYPY  +E     C   KE+ +V     +     +    ++  L + P+ V +  
Sbjct: 213 LHKEEDYPYIMEEG---ACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEA 269

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           + R  + Y G      D  C    LDH VA VGYG   G+    V+NSWG    + GY +
Sbjct: 270 SGRDFQFYSGGVF---DGHCGS-DLDHGVAAVGYGTAKGVDYITVKNSWGSKWGEKGYIR 325

Query: 322 IERGA----NACGIESYA 335
           + R        CGI   A
Sbjct: 326 MRRNIGKPEGICGIYKMA 343


>gi|72389861|ref|XP_845225.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389863|ref|XP_845226.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359933|gb|AAX80358.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359934|gb|AAX80359.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801760|gb|AAZ11666.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801761|gb|AAZ11667.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/300 (27%), Positives = 143/300 (47%), Gaps = 25/300 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD--------EYYGTSGSSDRSPQEILQR 94
           F  +  K+ + Y D  E   RF  F+++ ++            +G +  SD + +E   R
Sbjct: 41  FAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRAR 100

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                         A ++R++K +N    G  P ++DWR+     + PV+ QG+CGSCWA
Sbjct: 101 YR-----NGASYFAAAQKRLRKTVN-VTTGRAPAAVDWREKGA--VTPVKDQGQCGSCWA 152

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY---GLESQA 211
           F+T   +E Q  +    L  LS+  LV CD  +  CNGG +D AF ++       + ++A
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEA 212

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRLIESY 269
            YPY +      +C     +    + D   +    D +  +L ++GP+ + ++      Y
Sbjct: 213 SYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDY 272

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
           +G  +     +C   +LDH V +VGY + +    WI++NSW ++  + GY +IE+G N C
Sbjct: 273 NGGILT----SCTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328


>gi|19909509|dbj|BAB86959.1| cathepsin L [Fasciola gigantica]
          Length = 324

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 84/225 (37%), Positives = 121/225 (53%), Gaps = 22/225 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 108 VPDKIDWRGSGY--VTTVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 165

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN  C+GG ++ A+EY+KQ+GLE+++ YPY   E    +C Y ++     V D + V 
Sbjct: 166 PWGNYGCSGGLMENAYEYLKQFGLETESSYPYTAVEG---QCRYNRQLGVAKVTDYYTVH 222

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIES----YDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           SG +  +  L    GP  + ++   +ES    Y G   +     C   +L+HAV  VGYG
Sbjct: 223 SGSEVELKNLVGAEGPAAIAVD---VESDFMMYSGGIYQSQ--TC--LRLNHAVLAVGYG 275

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
            + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 276 TQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGISSLASLPMV 320


>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
          Length = 329

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 84/238 (35%), Positives = 122/238 (51%), Gaps = 39/238 (16%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LP ++DWR      + PV++Q +CGSCWAF+TT  LE Q  L K TL  LS+ QLV+C  
Sbjct: 108 LPDTVDWRTKGA--VTPVKNQKQCGSCWAFSTTGSLEGQTFLKKGTLPSLSEQQLVDCSD 165

Query: 185 -HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
            +GN  C GG +D AF+Y++   G++S+A YPY  K     +C +++             
Sbjct: 166 KYGNHGCQGGLMDNAFKYIEANGGIDSEASYPYEAKNG---KCRFQQSAVAA------TC 216

Query: 243 SGVDHMMH---------LLQSGPIGVYLN--HRLIESYDG---NPIRRNDWACNPHKLDH 288
           +G   + H         +   GPI V ++  H   + Y     +P+      C+  +LDH
Sbjct: 217 TGYKDIPHDDIDGLQDAVANVGPISVAMDASHSSFQLYAAGVYDPLL-----CSSTRLDH 271

Query: 289 AVAIVGYG-EKNGILT-----WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
            V  VGYG E +G+       W+V+NSWG      GYF+I R  N CGI + A   +V
Sbjct: 272 GVLAVGYGTEPSGLFHEEKPYWLVKNSWGPDWGQQGYFKIVRKDNKCGIATDASYPTV 329


>gi|72389847|ref|XP_845218.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389849|ref|XP_845219.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389851|ref|XP_845220.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389857|ref|XP_845223.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359926|gb|AAX80351.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359927|gb|AAX80352.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359928|gb|AAX80353.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359931|gb|AAX80356.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801753|gb|AAZ11659.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801754|gb|AAZ11660.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801755|gb|AAZ11661.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801758|gb|AAZ11664.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 450

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/300 (27%), Positives = 143/300 (47%), Gaps = 25/300 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD--------EYYGTSGSSDRSPQEILQR 94
           F  +  K+ + Y D  E   RF  F+++ ++            +G +  SD + +E   R
Sbjct: 41  FAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRAR 100

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                         A ++R++K +N    G  P ++DWR+     + PV+ QG+CGSCWA
Sbjct: 101 YR-----NGASYFAAAQKRLRKTVN-VTTGRAPAAVDWREKGA--VTPVKDQGQCGSCWA 152

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY---GLESQA 211
           F+T   +E Q  +    L  LS+  LV CD  +  CNGG +D AF ++       + ++A
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEA 212

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRLIESY 269
            YPY +      +C     +    + D   +    D +  +L ++GP+ + ++      Y
Sbjct: 213 SYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDY 272

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
           +G  +     +C   +LDH V +VGY + +    WI++NSW ++  + GY +IE+G N C
Sbjct: 273 NGGILT----SCTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328


>gi|72389855|ref|XP_845222.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389865|ref|XP_845227.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389867|ref|XP_845228.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359930|gb|AAX80355.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359935|gb|AAX80360.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359936|gb|AAX80361.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801757|gb|AAZ11663.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801762|gb|AAZ11668.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801763|gb|AAZ11669.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/300 (27%), Positives = 143/300 (47%), Gaps = 25/300 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD--------EYYGTSGSSDRSPQEILQR 94
           F  +  K+ + Y D  E   RF  F+++ ++            +G +  SD + +E   R
Sbjct: 41  FAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRAR 100

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                         A ++R++K +N    G  P ++DWR+     + PV+ QG+CGSCWA
Sbjct: 101 YR-----NGASYFAAAQKRLRKTVN-VTTGRAPAAVDWREKGA--VTPVKDQGQCGSCWA 152

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY---GLESQA 211
           F+T   +E Q  +    L  LS+  LV CD  +  CNGG +D AF ++       + ++A
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEA 212

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRLIESY 269
            YPY +      +C     +    + D   +    D +  +L ++GP+ + ++      Y
Sbjct: 213 SYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDY 272

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
           +G  +     +C   +LDH V +VGY + +    WI++NSW ++  + GY +IE+G N C
Sbjct: 273 NGGILT----SCTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328


>gi|23956098|ref|NP_062412.1| cathepsin 7 precursor [Mus musculus]
 gi|81902493|sp|Q91ZF2.1|CAT7_MOUSE RecName: Full=Cathepsin 7; AltName: Full=Cathepsin 1; Flags:
           Precursor
 gi|16445017|gb|AAK00508.1| cathepsin 1 precursor [Mus musculus]
 gi|40352949|gb|AAH64740.1| Cathepsin 7 [Mus musculus]
 gi|148709372|gb|EDL41318.1| cathepsin 7, isoform CRA_a [Mus musculus]
          Length = 331

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 93/258 (36%), Positives = 137/258 (53%), Gaps = 25/258 (9%)

Query: 99  LTGKEKERL-EADRERVKKFLNERKKGP-LPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
           +TG+E + L E+    ++   + +K+ P +P +LDWR  K   + PV  QG CG+CWAF+
Sbjct: 83  MTGEEMKMLTESSSYPLRNGKHIQKRNPKIPPTLDWR--KEGYVTPVRRQGSCGACWAFS 140

Query: 157 TTAILESQVALLKKT--LYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQY-GLESQA 211
            TA +E Q  L KKT  L PLS   L++C   +G   C+GG    AF+YVK   GLE++A
Sbjct: 141 VTACIEGQ--LFKKTGKLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEA 198

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH--MMHLLQSGPIGVYLN--HRLIE 267
            YPY  K      C Y  E++ V V   +V    +   +  L+  GPI V ++  H    
Sbjct: 199 TYPYEAKAK---HCRYRPERSVVKVNRFFVVPRNEEALLQALVTHGPIAVAIDGSHASFH 255

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYG----EKNGILTWIVRNSWGDIGPDHGYFQIE 323
           SY G     ++  C    LDH + +VGYG    E      W+++NS G+   ++GY ++ 
Sbjct: 256 SYRGGIY--HEPKCRKDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGERWGENGYMKLP 313

Query: 324 RGANA-CGIESYAYLASV 340
           RG N  CGI SYA   ++
Sbjct: 314 RGQNNYCGIASYAMYPAL 331


>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 83/222 (37%), Positives = 120/222 (54%), Gaps = 20/222 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +P ++DWRQ     + PV+ QG CGSCW+F+ T  LE Q     K L  LS+  LV+C  
Sbjct: 120 IPDTVDWRQEGA--VTPVKDQGHCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSS 177

Query: 185 -HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKA--KVFVQDTW 240
             GN  CNGG +D AF Y+K   G++++A YPY   E+  FR + +   A  K FV    
Sbjct: 178 RFGNNGCNGGLMDNAFRYIKNNGGIDTEAAYPYMG-EDEKFRYSAKNRGATDKGFVD--- 233

Query: 241 VTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           + SG +  +   +   GPI + ++  H   + Y       +D  C+  +LDH V +VGYG
Sbjct: 234 IPSGDEDKLKAAVATVGPISIAIDASHESFQLYSNGVY--SDPTCSSTELDHGVLVVGYG 291

Query: 297 --EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
             EK G+  W+V+NSWGD     GY ++ R   N CG+ + A
Sbjct: 292 TDEKTGMDYWLVKNSWGDTWGLDGYIKMARNQDNQCGVATQA 333


>gi|358339355|dbj|GAA47435.1| cathepsin F [Clonorchis sinensis]
          Length = 1157

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 84/279 (30%), Positives = 133/279 (47%), Gaps = 17/279 (6%)

Query: 50  WNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEA 109
           W   + +++ IK     F Q  +     YG +  SD + +E  Q T L L      RL+ 
Sbjct: 645 WGMLWGEEDNIKQ--AEFYQTLERGTALYGVTQFSDLTGEE-FQETFLGL------RLDE 695

Query: 110 DRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLK 169
              + + ++ ++    +P++ DWR      + PV  QG CGSCWAF+    +E Q     
Sbjct: 696 QYSKSQSYVKKKHSVSIPENYDWR--PYGAVGPVLDQGHCGSCWAFSVIGNIEGQWFRKT 753

Query: 170 KTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYE 228
             L  LSK QLV+CD  +  C GG     ++ +++  GLE + DY Y  ++ +   C   
Sbjct: 754 GQLVSLSKQQLVDCDRSSRGCGGGYPPATYDSIRRIGGLEIELDYRYTGRDGV---CHQN 810

Query: 229 KEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKL 286
             K   +V  +   +  ++ +   L   GPI + LN RL++ Y    +      C    +
Sbjct: 811 PRKFVAYVNSSVALTKDENTIAEWLSYHGPISMALNARLLQFYVSGIMHPPAAYCPVKDI 870

Query: 287 DHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
            HAV  VG+G K  +  WIV+NSWG +  + GYF+I RG
Sbjct: 871 SHAVLSVGFGTKGNVPFWIVKNSWGTLWGEEGYFRIYRG 909



 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 92/345 (26%), Positives = 147/345 (42%), Gaps = 28/345 (8%)

Query: 6   CDHQETNTEQVTYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRT--YTDDNEIKTR 63
           C+ +E N   ++    T+  I  W        +  +   T + +W  T  +     I T+
Sbjct: 350 CNPEELNHAVLSVGFGTEQGIPYWIIKNSWGEQWGEQHLTKLKEWLNTQPFGHKRLIGTK 409

Query: 64  FEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKK 123
             Y +Q  ++    YG     D    + L  T +    K  +    + E V         
Sbjct: 410 SGYIRQSYEDFKRKYGKQFIGDAEEFKALYLTAMYDHRKLNQSKTTEPETV--------- 460

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G    S DWR      + PV  Q RCG+ WAF+    +E Q  +    L  LS+ QLV+C
Sbjct: 461 GEPQDSFDWR--DYGAVGPVLDQDRCGASWAFSAIGNIEGQYFMRVHRLLSLSEQQLVDC 518

Query: 184 DHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDT-WV 241
           D  +  C GG    AFE ++Q  GLE +ADYPY   ++    C     +  V +  +  +
Sbjct: 519 DRIDQGCAGGTPYGAFEGIQQLGGLELEADYPYLGHQD---NCQSNPLRFVVSINGSVQL 575

Query: 242 TSGVDHM-MHLLQSGPIGVYLNHRLIESYDGNPIRRNDW-ACNPHKLDHAVAIVGYGEKN 299
               D +  +L   GP+ V +N  L++ Y    I +  W  CNP +++HA   VG+G + 
Sbjct: 576 PKDEDQIAQYLFDHGPLSVGINGALLQYYSSG-IMQPLWDNCNPAEMNHAGLAVGFGFEQ 634

Query: 300 GILTWIVRNSWG-------DIGPDHGYFQIERGANACGIESYAYL 337
            +  W ++NSWG       +I     Y  +ERG    G+  ++ L
Sbjct: 635 DVPYWTIKNSWGMLWGEEDNIKQAEFYQTLERGTALYGVTQFSDL 679



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 90/324 (27%), Positives = 142/324 (43%), Gaps = 75/324 (23%)

Query: 1   MKSSQCDHQETNTEQVTYNVNTDSAIYVW---RDLAYDSIKQVDAFKTYIVKWNRTYTDD 57
           + + QCD +  N   +     TD +   W        D  +Q+D F+           D+
Sbjct: 121 LAAEQCDPEALNHAALAVGFGTDESTPFWIIKNTFGKDWGEQLDEFE-----------DE 169

Query: 58  NEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEA-DRERVKK 116
            E+   +E FK +  +T E             E  +R  L LT K  +  E  DR  V++
Sbjct: 170 REL---YENFKAEYDKTYE----------GRDEEFRR--LYLTYKSPDEHEPIDRIHVQE 214

Query: 117 FLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLS 176
                  G LP   DWR+     + PV +QG+CGSCWA +                    
Sbjct: 215 V------GQLPSYFDWRE--YGAVGPVRNQGQCGSCWAIS-------------------- 246

Query: 177 KSQLVECDHGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVF 235
            +++V+CDH +  C+GG    A+E V++ G LE    YPY   +       Y +   + F
Sbjct: 247 -AEVVDCDHADHGCSGGFPIHAYECVQRLGGLELAVRYPYVGYQQ------YCQADPRYF 299

Query: 236 VQDTWVTSGV------DHMMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDH 288
           V   ++   V      + +   L + GP+ V L+ RL++ Y    +  +   CNP +L+H
Sbjct: 300 V--AYINGSVALPKDSEQIAKFLATFGPLSVVLDARLLQYYRSGILNPSVAYCNPEELNH 357

Query: 289 AVAIVGYGEKNGILTWIVRNSWGD 312
           AV  VG+G + GI  WI++NSWG+
Sbjct: 358 AVLSVGFGTEQGIPYWIIKNSWGE 381



 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/142 (34%), Positives = 74/142 (52%), Gaps = 8/142 (5%)

Query: 124  GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
            G +P+  DWR+  +  + P++ QG CGSCWAF+T   +E Q       L  LS+ QL++C
Sbjct: 997  GEIPERFDWRE--LGAVGPIQDQGDCGSCWAFSTIGNIEGQWFKKTGQLLTLSEQQLIDC 1054

Query: 184  DHGNLNCNGG-NIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWV- 241
            D  +  C GG   D   + VK  GLE  ADYPY   + +   C  E+ K + +V  + V 
Sbjct: 1055 DSVDDGCGGGYPPDTYGDIVKMGGLELNADYPYIAADGV---CKMERSKFRAYVNKSLVL 1111

Query: 242  -TSGVDHMMHLLQSGPIGVYLN 262
             T      + L ++GP+   +N
Sbjct: 1112 PTKEDQQAVWLSKNGPLSAGIN 1133



 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 47/150 (31%), Positives = 74/150 (49%), Gaps = 7/150 (4%)

Query: 179 QLVECDHGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQ 237
           QLV+CDH +  C GG    AF  V++ G L+   DYPY         C +  ++A  FV 
Sbjct: 24  QLVDCDHVDRGCEGGFPLDAFMAVQRLGGLQLSIDYPYIASRQA---CQFNPKQAVAFVT 80

Query: 238 DTWVTSGVDHMM--HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
                   + ++  +L ++GP+ V LN R ++ Y+   +      C+P  L+HA   VG+
Sbjct: 81  GFAALPRNELLIAEYLHRNGPLSVGLNSRTLKFYNSGILNLAAEQCDPEALNHAALAVGF 140

Query: 296 GEKNGILTWIVRNSWG-DIGPDHGYFQIER 324
           G       WI++N++G D G     F+ ER
Sbjct: 141 GTDESTPFWIIKNTFGKDWGEQLDEFEDER 170


>gi|72389853|ref|XP_845221.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359929|gb|AAX80354.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801756|gb|AAZ11662.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/300 (27%), Positives = 143/300 (47%), Gaps = 25/300 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD--------EYYGTSGSSDRSPQEILQR 94
           F  +  K+ + Y D  E   RF  F+++ ++            +G +  SD + +E   R
Sbjct: 41  FAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRAR 100

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                         A ++R++K +N    G  P ++DWR+     + PV+ QG+CGSCWA
Sbjct: 101 YR-----NGASYFAAAQKRLRKTVN-VTTGRAPAAVDWREKGA--VTPVKDQGQCGSCWA 152

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY---GLESQA 211
           F+T   +E Q  +    L  LS+  LV CD  +  CNGG +D AF ++       + ++A
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEA 212

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRLIESY 269
            YPY +      +C     +    + D   +    D +  +L ++GP+ + ++      Y
Sbjct: 213 SYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDY 272

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
           +G  +     +C   +LDH V +VGY + +    WI++NSW ++  + GY +IE+G N C
Sbjct: 273 NGGILT----SCTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328


>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
          Length = 466

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 92/311 (29%), Positives = 155/311 (49%), Gaps = 32/311 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           +++++++  ++Y    E   RF+ FK + K  DE           G +  +D + +E   
Sbjct: 49  YESWLIEHGKSYNALGEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEY-- 106

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           R+    T    +R +  + +  ++L  +    LP+S+DWR   V V   V+ QG CGSCW
Sbjct: 107 RSIYLGTKSSGDRRKLSKNKSDRYL-PKVGDSLPESVDWRDKGVLV--GVKDQGSCGSCW 163

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQA 211
           AF+  A +ES  A++   L  LS+ +LV+CD   N  C+GG +D AFE+V    G++++ 
Sbjct: 164 AFSAVAAMESINAIVTGNLISLSEQELVDCDKSYNEGCDGGLMDYAFEFVINNGGIDTEE 223

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLL-----QSGPIGVYLNHRLI 266
           DYPY+ + ++   C   ++ AKV   D++    V++   L      Q   I +    R +
Sbjct: 224 DYPYKERNDV---CDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDL 280

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER-- 324
           + Y           C    +DH V   GYG +NG+  WIVRNSWG    + GY +++R  
Sbjct: 281 QHYKSGIFTGK---CGT-AVDHGVVAAGYGSENGMDYWIVRNSWGAKWGEKGYLRVQRNV 336

Query: 325 --GANACGIES 333
              +  CG+ +
Sbjct: 337 ASSSGLCGLAT 347


>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
          Length = 355

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 79/223 (35%), Positives = 124/223 (55%), Gaps = 23/223 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +P+S+DWR+  +  + PV++QG CGSCWAF++T  LE Q A     L  LS+  LV+C  
Sbjct: 138 IPESVDWREEGL--VTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCST 195

Query: 185 -HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKA----KVFVQD 238
            +GN  CNGG +D+AFEY+K+ +G++++  YPY  +E    +C +++       K FV  
Sbjct: 196 KYGNHGCNGGLMDLAFEYIKENHGVDTEDSYPYVGRET---KCHFKRNTVGADDKGFVD- 251

Query: 239 TWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVG 294
             +  G +  +   +   GPI + ++  HR  + Y        D  C+  +LDH V +VG
Sbjct: 252 --LPEGDEEALKKAVATQGPISIAIDAGHRSFQLYKKGVYF--DEECSSEELDHGVLLVG 307

Query: 295 YG-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           YG +      W+V+NSWG    + GY +I R   N CG+ + A
Sbjct: 308 YGTDPEAGDYWLVKNSWGPTWGEKGYIRIARNRNNHCGVATKA 350


>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
          Length = 469

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 94/308 (30%), Positives = 148/308 (48%), Gaps = 30/308 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           +  ++    RTY    E + RFE F+ + +  D +   + +   S +  L R    LT  
Sbjct: 46  YAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFA-DLTND 104

Query: 103 E--------KERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           E        + R + +R    ++L    +  LP+S+DWR      +  V+ QG CGSCWA
Sbjct: 105 EYRATYLGVRSRPQRERRLGDRYLAGDNE-DLPESVDWRAKGA--VAEVKDQGSCGSCWA 161

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQAD 212
           F+T A +E    ++   +  LS+ +LV+CD   N  CNGG +D AFE++    G++++ D
Sbjct: 162 FSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEED 221

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--NHRLIE 267
           YPY+  +    RC   ++ AKV   D++     +    +   + + PI V +    R  +
Sbjct: 222 YPYKGTDG---RCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQ 278

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            Y+          C    LDH V  VGYG +NG   WIV+NSWG    + GY ++ER   
Sbjct: 279 LYNSGIFTGT---CGT-ALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIK 334

Query: 328 A----CGI 331
           A    CGI
Sbjct: 335 ASSGKCGI 342


>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
          Length = 365

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 94/320 (29%), Positives = 157/320 (49%), Gaps = 38/320 (11%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLT 100
           + ++ ++ +  +TY    E ++RF  F  + K  DE+   + S +RS +  L +    LT
Sbjct: 34  NTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEH---NLSGNRSYKVGLNQFA-DLT 89

Query: 101 GKEKERL-------------EADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
            +E   +             +  R  + +    ++    P  +DWR+     ++PV++QG
Sbjct: 90  NEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGA--VSPVKNQG 147

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQY 205
            CGSCWAF+T A +E    ++   L  LS+ +LV+CD+  N  CNGG++D AF++ V   
Sbjct: 148 GCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYAFQFIVSNG 207

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPI--GVY 260
           G++S++DYPY+    +   C   + KAK+   D +          +M  +   P+  G+ 
Sbjct: 208 GIDSESDYPYKG---VGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPVSVGIE 264

Query: 261 LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
            + R  + Y    +     +C  + LDH V +VGYG +NG   WIVRNSWG    + GY 
Sbjct: 265 ASGRAFQLYTSGVLTG---SCGTN-LDHGVVVVGYGSENGKDYWIVRNSWGPEWGEDGYI 320

Query: 321 QIERG-----ANACGIESYA 335
           ++ER         CGI   A
Sbjct: 321 RMERNMVDTPVGMCGITLMA 340


>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 118/224 (52%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +PKS+DWR+     + PV++QG+CGSCWAF+ +  LE Q+ L    L  LS+  LV+C H
Sbjct: 114 IPKSVDWREKGC--VTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH 171

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF+Y+K+  GL+S+  YPY  K+     C Y  E A     DT   
Sbjct: 172 AQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDG---SCKYRAEFA--VANDTGFV 226

Query: 243 SGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
                   L+++    GPI V ++  H  ++ Y        +  C+   LDH V +VGYG
Sbjct: 227 DIPQQEEALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN--CSSKNLDHGVLLVGYG 284

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
               + N    W+V+NSWG      GY +I +   N CG+ + A
Sbjct: 285 YEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAA 328


>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
          Length = 588

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 85/224 (37%), Positives = 118/224 (52%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPKS+DWR  K   + PV++Q +CGSCWAF+ T  LE Q+      L  LS+  LV+C H
Sbjct: 114 LPKSVDWR--KKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSH 171

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG ++ AF+YVK+  GL+S+A YPY  K+     C Y+ E +     DT   
Sbjct: 172 PQGNQGCNGGFMNNAFQYVKENGGLDSEASYPYVAKDG---SCKYKPENS--VANDTGFV 226

Query: 243 SGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
               H   L+++    GPI V ++  H   + Y        D  C+   LDH V +VGYG
Sbjct: 227 VIPAHEKELMKAVATVGPISVAVDASHSSFQFYKSGIYFEQD--CSSKNLDHGVLVVGYG 284

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
                 N    W+++NSWG     +GY +I +   N CGI + A
Sbjct: 285 FEGTNSNNNNYWLIKNSWGPEWGSNGYIKIAKDRNNHCGIATAA 328


>gi|313221004|emb|CBY31836.1| unnamed protein product [Oikopleura dioica]
          Length = 323

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 80/229 (34%), Positives = 120/229 (52%), Gaps = 17/229 (7%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
           K P+P S DWR  K   ++PV+ QG+CGSCW F+TT  +E+  A+     + LS+ QLV+
Sbjct: 99  KQPMPTSWDWR--KDNKVSPVKDQGQCGSCWTFSTTGNVEAGEAIHLNEYHTLSEQQLVD 156

Query: 183 C--DHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDT 239
           C     N  CNGG    AFEY+    G+ ++ADYPY  K+     C ++++KA V V  +
Sbjct: 157 CAGAFNNHGCNGGLPSQAFEYIAAAPGIMTEADYPYTAKDG---NCVFDQKKAAVHVYGS 213

Query: 240 W-VTSG--VDHMMHLLQSGPIGVYLNHRLIES---YDGNPIRRNDWACNPHKLDHAVAIV 293
             +T G  V+    ++   PI +     +++    Y        D   +P  ++HAV  V
Sbjct: 214 VNITRGDEVEMAEAMVMYQPISIAF--EVVDDFMHYKSGTYSSKDCKGSPTDVNHAVLAV 271

Query: 294 GYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
           G+G +  G   W V+NSW     + GYF I+RG N CG+      A +K
Sbjct: 272 GFGTDGAGTDFWTVKNSWSKDWGNQGYFNIQRGVNMCGLSQCTSFALIK 320


>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
          Length = 337

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 84/227 (37%), Positives = 120/227 (52%), Gaps = 21/227 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +PKS+DWRQ     +  V+ QG CGSCWAF++TA LE Q       L  LS+  LV+C  
Sbjct: 120 IPKSVDWRQHGA--VTAVKDQGHCGSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCST 177

Query: 185 -HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
            +GN  CNGG +D AF Y+K   G++++  YPY   E I   C +   K+ V   DT   
Sbjct: 178 KYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY---EGIDDSCHF--TKSGVGATDTGFV 232

Query: 241 -VTSGVDH--MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            +  G +   M  +   GP+ V ++  H   + Y       N+  C+   LDH V +VGY
Sbjct: 233 DIPQGDEEALMKAVATMGPVSVAIDASHESFQLYSEGVY--NEPECDAQNLDHGVLVVGY 290

Query: 296 G-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           G +K G+  W+V+NSWG    D GY ++ R   N CGI + +   +V
Sbjct: 291 GTDKTGLDYWLVKNSWGTTWGDQGYIKMARNQDNQCGIATASSYPTV 337


>gi|340710428|ref|XP_003393792.1| PREDICTED: cathepsin O-like [Bombus terrestris]
          Length = 355

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 91/318 (28%), Positives = 152/318 (47%), Gaps = 37/318 (11%)

Query: 43  FKTYIVKWNRTYTDD-NEIKTRFEYFKQDGKETDE-----------YYGTSGSSDRSPQE 90
           F+ Y++++N++Y ++  E + RF+ F++  +  ++           YYG +  SD S  E
Sbjct: 36  FQNYVMRYNKSYRNNPTEYEERFKRFRKSLRHIEKMNGLRPSQESAYYGLTEFSDMSEDE 95

Query: 91  ILQRTGLR-LTGKEKERLEADRERVKKFLNE----RKKGPLPKSLDWRQSKVKVLNPVES 145
            L  T L  L+ + ++       R    L      +K   +P   DWR   V  + PV S
Sbjct: 96  FLSLTLLPDLSARGEKHANESYHRRHHLLQSTNRVKKSVSIPLRFDWRDKGV--ITPVRS 153

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-DHGNLNCNGGNIDVAFEYV-- 202
           QG CG+CWAF+T  ++ES  A+   TLY LS  ++++C  + N  C GG+I     ++  
Sbjct: 154 QGSCGACWAFSTIEVVESMYAIKNGTLYMLSVQEMIDCAKNKNFGCEGGDIYSLLSWLLA 213

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKE-------KAKVFVQDTWVTSGVDHMMHLLQSG 255
            +  +  ++ YP   K   T  C   K        K + F  D +V +  + ++ +   G
Sbjct: 214 SKVQIFQESTYPLVGK---TSMCKLGKMIDNAFGVKIRDFNCDNFVDAEDELLIKVATHG 270

Query: 256 PIGVYLNHRLIESYDGNPIRRNDWACNP--HKLDHAVAIVGYGEKNGILTWIVRNSWGDI 313
           P+   +N    ++Y G  I+   + C+      +HAV I+GY +   I  +I++NSWG  
Sbjct: 271 PVAAVVNALSWQNYLGGVIQ---YHCDSTYDNRNHAVQIIGYDKSAAIPHYIIKNSWGTN 327

Query: 314 GPDHGYFQIERGANACGI 331
             D GY  I  G N CGI
Sbjct: 328 FGDKGYMYIAIGNNLCGI 345


>gi|449471885|ref|XP_004186123.1| PREDICTED: LOW QUALITY PROTEIN: pro-cathepsin H [Taeniopygia
           guttata]
          Length = 334

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 83/229 (36%), Positives = 121/229 (52%), Gaps = 29/229 (12%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P S+DWR+ K   + PV+ QG CGSCW F+TT  LES +A+    L  L++ QLV+C  
Sbjct: 109 VPDSIDWRK-KGNFVTPVKIQGACGSCWTFSTTGCLESAIAIATGKLLSLAEQQLVDCAQ 167

Query: 186 G--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKE------KAKVFV 236
              N  C+GG    AFEY+    GL  +  YPYR K      C ++ +      KA  FV
Sbjct: 168 AFNNHGCSGGLPSQAFEYILYNRGLMGEDSYPYRAKNGT---CRFQPDNDIRVGKAIAFV 224

Query: 237 QDTWVTSGVDH--MMHLL-QSGPIGV-------YLNHRLIESYDGNPIRRNDWACNPHKL 286
           +D    +  D   M+  + +  P+         ++++R  +    NP   +     P K+
Sbjct: 225 KDVINITQYDEDGMVEAVGRHNPVSFAFEVTSDFMHYR--KGVYSNPRCEH----TPDKV 278

Query: 287 DHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           +HAV  VGYG+++G   WIV+NSWG +    GYF IERG N CG+ + A
Sbjct: 279 NHAVLAVGYGQEDGTPYWIVKNSWGRLWGMQGYFLIERGKNMCGLAACA 327


>gi|294883334|ref|XP_002770714.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239873999|gb|EER02719.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 330

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 115/224 (51%), Gaps = 16/224 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP S+DWR   V  L+PV+ QG CGSCWAF+    LE+Q A+    L PLS+ QLV+C H
Sbjct: 113 LPTSVDWRNKSV--LSPVKDQGSCGSCWAFSAAGALEAQYAIATGKLRPLSEQQLVDCSH 170

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAK-VFVQDTWVT 242
             G   C GG +  A++Y+K  GL+ ++ YPY+    +   C   ++KA  + V+    T
Sbjct: 171 KYGTNGCFGGFMADAYKYIKSAGLDQESTYPYK---GVNEPCRPREKKADGIPVRFVLDT 227

Query: 243 SGVDHMMHLLQSGPIGV--YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
                +M  L   P+ V  Y +  L   Y           CN  ++DHAV  VGYG   G
Sbjct: 228 KTEQSLMKALADAPVSVAMYASDFLFHLYLSGVYSST--TCN-GEIDHAVVAVGYGADEG 284

Query: 301 ILTWIVRNSWGDIGPDHGYFQIERGANA---CGIESYAYLASVK 341
              +I++NSWG      GYF ++RG      C I  Y  + ++K
Sbjct: 285 SDYFILKNSWGSSWGMGGYFFLKRGVGGHGECNILEYMVVPTLK 328


>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 118/224 (52%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +PKS+DWR+     + PV++QG+CGSCWAF+ +  LE Q+ L    L  LS+  LV+C H
Sbjct: 114 IPKSVDWREKGC--VTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH 171

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF+Y+K+  GL+S+  YPY  K+     C Y  E A     DT   
Sbjct: 172 AQGNQGCNGGLMDYAFQYIKENGGLDSEESYPYEAKDG---SCKYRAEFA--VANDTGFV 226

Query: 243 SGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
                   L+++    GPI V ++  H  ++ Y        +  C+   LDH V +VGYG
Sbjct: 227 DIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN--CSSKNLDHGVLLVGYG 284

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
               + N    W+V+NSWG      GY +I +   N CG+ + A
Sbjct: 285 YEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAA 328


>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
          Length = 333

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 86/244 (35%), Positives = 120/244 (49%), Gaps = 23/244 (9%)

Query: 111 RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKK 170
           + R  K   E     LPKS+DWR+     + PV++QG+CGSCWAF+    LE Q+ L   
Sbjct: 99  KHRKGKLFQEPLMLQLPKSVDWREKGC--VTPVKNQGQCGSCWAFSACGALEGQMCLKTG 156

Query: 171 TLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTY 227
            L  LS+  LV+C    GN  CNGG +D AF+YV    GL+S+  YPY  K+     C Y
Sbjct: 157 VLVSLSEQNLVDCSRGEGNQGCNGGLMDFAFQYVLNNKGLDSEESYPYEAKDGT---CKY 213

Query: 228 EKEKAKV----FVQDTWVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWAC 281
           + E A      +V    +   +  M  +   GPI V ++  H   + Y        +  C
Sbjct: 214 KPEFAAANDTGYVDIPQLEKAL--MKAVATVGPIAVAIDASHPSFQFYSSGIYFEPN--C 269

Query: 282 NPHKLDHAVAIVGYG----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAY 336
           +   LDH V ++GYG    + N    WIV+NSWG      G+F I +   N CGI + A 
Sbjct: 270 SSKDLDHGVLVIGYGFEGTDSNKKKYWIVKNSWGTGWGMGGFFHIAKDKNNHCGIATAAS 329

Query: 337 LASV 340
             +V
Sbjct: 330 YPTV 333


>gi|8917575|gb|AAF81274.1| EPCS24 [Mus musculus]
          Length = 329

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 93/253 (36%), Positives = 135/253 (53%), Gaps = 25/253 (9%)

Query: 99  LTGKEKERL-EADRERVKKFLNERKKGP-LPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
           +TG+E + L E+    ++   + +K+ P +P +LDWR  K   + PV  QG CG+CWAF+
Sbjct: 83  MTGEEMKMLTESSSYPLRNGKHIQKRNPKIPPTLDWR--KEGYVTPVRRQGSCGACWAFS 140

Query: 157 TTAILESQVALLKKT--LYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQY-GLESQA 211
            TA +E Q  L KKT  L PLS   L++C   +G   C+GG    AF+YVK   GLE++A
Sbjct: 141 VTACIEGQ--LFKKTGKLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEA 198

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH--MMHLLQSGPIGVYLN--HRLIE 267
            YPY  K      C Y  E++ V V   +V    +   +  L+  GPI V ++  H    
Sbjct: 199 TYPYEAKAK---HCRYRPERSVVKVNRFFVVPRNEEALLQALVTHGPIAVAIDGSHASFH 255

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYG----EKNGILTWIVRNSWGDIGPDHGYFQIE 323
           SY G     ++  C    LDH + +VGYG    E      W+++NS G+   ++GY ++ 
Sbjct: 256 SYRGGIY--HEPKCRKDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGERWGENGYMKLP 313

Query: 324 RGANA-CGIESYA 335
           RG N  CGI SYA
Sbjct: 314 RGQNNYCGIASYA 326


>gi|74200292|dbj|BAE22939.1| unnamed protein product [Mus musculus]
          Length = 308

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 118/224 (52%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +PKS+DWR+     + PV++QG+CGSCWAF+ +  LE Q+ L    L  LS+  LV+C H
Sbjct: 88  IPKSVDWREKGC--VTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH 145

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF+Y+K+  GL+S+  YPY  K+     C Y  E A     DT   
Sbjct: 146 AQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDG---SCKYRAEFA--VANDTGFV 200

Query: 243 SGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
                   L+++    GPI V ++  H  ++ Y        +  C+   LDH V +VGYG
Sbjct: 201 DIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN--CSSKNLDHGVLLVGYG 258

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
               + N    W+V+NSWG      GY +I +   N CG+ + A
Sbjct: 259 YEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAA 302


>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 84/221 (38%), Positives = 116/221 (52%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPK++DWR  K   + PV+ QG+CGSCWAF+TT  LE Q  L    L  LS+  LV+C  
Sbjct: 116 LPKAVDWR--KKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
             GN  C GG ++ AF+Y+K   G++++  YPY   E +   C ++KE   V   DT   
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPY---EAVDGECRFKKE--DVGATDTGYV 228

Query: 241 -VTSGV--DHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            + +G   D    +   GPI V ++  H   + Y        +  C+   LDH V +VGY
Sbjct: 229 EIKAGCEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPE--CSSEDLDHGVLVVGY 286

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIER-GANACGIESYA 335
           G K G   W+V+NSW +   D GY  + R   N CGI S A
Sbjct: 287 GVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327


>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
 gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
          Length = 354

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 79/223 (35%), Positives = 124/223 (55%), Gaps = 23/223 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +P+S+DWR+  +  + PV++QG CGSCWAF++T  LE Q A     L  LS+  LV+C  
Sbjct: 137 IPESVDWREEGL--VTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCST 194

Query: 185 -HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKA----KVFVQD 238
            +GN  CNGG +D+AFEY+K+ +G++++  YPY  +E    +C +++       K FV  
Sbjct: 195 KYGNHGCNGGLMDLAFEYIKENHGVDTEDSYPYVGRET---KCHFKRNAVGADDKGFVD- 250

Query: 239 TWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVG 294
             +  G +  +   +   GPI + ++  HR  + Y        D  C+  +LDH V +VG
Sbjct: 251 --LPEGDEEALKKAVATQGPISIAIDAGHRSFQLYKKGVYF--DEECSSEELDHGVLLVG 306

Query: 295 YG-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           YG +      W+V+NSWG    + GY +I R   N CG+ + A
Sbjct: 307 YGTDPEAGDYWLVKNSWGPTWGEKGYIRIARNRNNHCGVATKA 349


>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 324

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 80/255 (31%), Positives = 121/255 (47%), Gaps = 18/255 (7%)

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
           P+   +  GLR       +  A    + + ++      LP S+DWR     ++ P++ QG
Sbjct: 76  PEFAAKYLGLRFDATNATKSFAASTYLPRMVS------LPDSVDWR--TAGIVTPIKDQG 127

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEY-VKQ 204
           +CGSCW+F+TT  +E Q A     L  LS+  LV+C    GN  CNGG +D AF+Y +  
Sbjct: 128 QCGSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISN 187

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKVFV---QDTWVTSGVDHMMHLLQSGPIGVYL 261
            G+++++ YPY  ++     C +        V   QD    S  D    +   GPI V +
Sbjct: 188 NGIDTESSYPYTAQDGT---CQFNSANVGATVASYQDIASGSESDLQNAVATVGPISVAI 244

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           +         +    N+ AC+  +LDH V  VGYG       W+V+NSWG      GY  
Sbjct: 245 DASQPSFQFYSSGVYNEPACSSSQLDHGVLAVGYGTSGSSDYWLVKNSWGTSWGQSGYIW 304

Query: 322 IERGA-NACGIESYA 335
           + R + N CGI + A
Sbjct: 305 MTRNSNNQCGIATAA 319


>gi|405963298|gb|EKC28885.1| Cathepsin L [Crassostrea gigas]
          Length = 265

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 83/222 (37%), Positives = 120/222 (54%), Gaps = 14/222 (6%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP ++DW  SK   + PV++QG+CGSCWAF+TT  LE Q       L  LS+  L++C  
Sbjct: 51  LPDTVDW--SKEGYVTPVKNQGQCGSCWAFSTTGGLEGQHYRKTGKLVSLSEQNLLDCSK 108

Query: 186 GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRN-KENITFRCTYEKEKAKVFVQDTWVTS 243
            N+ CNGG    A++Y+K+  G++++  YPY   KE  +FR +        FVQ   VT+
Sbjct: 109 ENMGCNGGLPQKAYKYIKENGGIDTEESYPYLGKKETCSFRPSEVGATCTGFVQ---VTA 165

Query: 244 GVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN 299
           G +  +   +   GPI V ++      + Y G     ++ +CNP   DHAV IVGYG   
Sbjct: 166 GDELALKKAVASVGPITVCIDASQPSFQLYKGG--VYDEQSCNPIVFDHAVLIVGYGVYQ 223

Query: 300 GILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           G   W+V+NSWG      GY  + R   N CGI ++A   +V
Sbjct: 224 GKDYWLVKNSWGTSWGMDGYIMMSRNQNNQCGIANHAVYPTV 265


>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 496

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 105/314 (33%), Positives = 154/314 (49%), Gaps = 36/314 (11%)

Query: 40  VDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRL 99
           +  ++ ++VK  + Y    E + RF+ FK + +  D++   +   DR+ +  L R    L
Sbjct: 76  MSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDH---NSQEDRTYKLGLNRFA-DL 131

Query: 100 TGKE------KERLEADRERVKKFLNE---RKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
           T +E        +++ +R   K   N    R    LP+S+DWR  K   + PV+ QG CG
Sbjct: 132 TNEEYRAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLPESVDWR--KEGAVPPVKDQGGCG 189

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLE 208
           SCWAF+    +E    ++   L  LS+ +LV+CD G N  CNGG +D AFE++    G++
Sbjct: 190 SCWAFSAIGAVEGINKIVTGELISLSEQELVDCDTGYNEGCNGGLMDYAFEFIINNGGID 249

Query: 209 SQADYPYRNKENITFRC-TYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYL--NH 263
           S+ DYPYR  +    RC TY K    V + D       D +     + + P+ V +    
Sbjct: 250 SEEDYPYRGVDG---RCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGG 306

Query: 264 RLIESY-DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
           R  + Y  G    R   A     LDH V  VGYG  NG   WIVRNSWG    + GY ++
Sbjct: 307 REFQLYVSGVFTGRCGTA-----LDHGVVAVGYGTANGHDYWIVRNSWGPSWGEDGYIRL 361

Query: 323 ERG-ANA----CGI 331
           ER  AN+    CGI
Sbjct: 362 ERNLANSRSGKCGI 375


>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
          Length = 343

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 94/312 (30%), Positives = 152/312 (48%), Gaps = 28/312 (8%)

Query: 45  TYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTG-------- 96
           T+ ++ N+ Y +D E + R + F  +  +  ++ G       S +  + + G        
Sbjct: 30  TFKMEHNKVYKNDVEERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFV 89

Query: 97  LRLTGKEKE---RLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             L G  K    +L ++R  +     E     LPK++DWR+     + PV+ QG CGSCW
Sbjct: 90  NTLNGFNKSINTQLRSERLPIAASFIEPANVVLPKTVDWREHGA--VTPVKDQGHCGSCW 147

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQ 210
           +F+ T  LE Q       L PLS+  L++C   +GN  CNGG +D AF+Y+K   GL+++
Sbjct: 148 SFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTE 207

Query: 211 ADYPYRNKENITFRCTYEKEKAKVF-VQDTWVTSGVDHMMH--LLQSGPIGVYLN--HRL 265
             YPY  + +   +C Y    +    V    +  G +  +   +   GP+ V ++  H+ 
Sbjct: 208 VTYPYEAEND---KCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVSVAIDASHQS 264

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIER 324
            + Y        +  C+   LDH V  VGYG ++NG   W+V+NSWG+   D+GY ++ R
Sbjct: 265 FQFYSEGVYYEPE--CSSENLDHGVLAVGYGTDENGQDYWLVKNSWGETWGDNGYIKMAR 322

Query: 325 G-ANACGIESYA 335
              N CGI S A
Sbjct: 323 NKLNHCGIASTA 334


>gi|118156|sp|P14658.1|CYSP_TRYBB RecName: Full=Cysteine proteinase; Flags: Precursor
 gi|10393|emb|CAA34485.1| unnamed protein product [Trypanosoma brucei]
          Length = 450

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 82/300 (27%), Positives = 143/300 (47%), Gaps = 25/300 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD--------EYYGTSGSSDRSPQEILQR 94
           F  +  K+ + Y D  E   RF  F+++ ++            +G +  SD + +E   R
Sbjct: 41  FAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRAR 100

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                         A ++R++K +N    G  P ++DWR+     + PV+ QG+CGSCWA
Sbjct: 101 YR-----NGASYFAAAQKRLRKTVN-VTTGRAPAAVDWREKGA--VTPVKVQGQCGSCWA 152

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY---GLESQA 211
           F+T   +E Q  +    L  LS+  LV CD  +  CNGG +D AF ++       + ++A
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFTEA 212

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRLIESY 269
            YPY +      +C     +    + D   +    D +  +L ++GP+ + ++      Y
Sbjct: 213 SYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDAESFMDY 272

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
           +G  +     +C   +LDH V +VGY + +    WI++NSW ++  + GY +IE+G N C
Sbjct: 273 NGGILT----SCTSKQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328


>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
 gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
          Length = 469

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 93/308 (30%), Positives = 148/308 (48%), Gaps = 30/308 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           +  ++    RTY    E + RFE F+ + +  D +   + +   S +  L R    LT  
Sbjct: 46  YAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFA-DLTND 104

Query: 103 E--------KERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           E        + R + +R    ++L    +  LP+S+DWR      +  ++ QG CGSCWA
Sbjct: 105 EYRATYLGVRSRPQRERRLGDRYLAGDNE-DLPESVDWRAKGA--VAEIKDQGSCGSCWA 161

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQAD 212
           F+T A +E    ++   +  LS+ +LV+CD   N  CNGG +D AFE++    G++++ D
Sbjct: 162 FSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEED 221

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--NHRLIE 267
           YPY+  +    RC   ++ AKV   D++     +    +   + + PI V +    R  +
Sbjct: 222 YPYKGTDG---RCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQ 278

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            Y+          C    LDH V  VGYG +NG   WIV+NSWG    + GY ++ER   
Sbjct: 279 LYNSGIFTGT---CGT-ALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERNIK 334

Query: 328 A----CGI 331
           A    CGI
Sbjct: 335 ASSGKCGI 342


>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
 gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
 gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 102/310 (32%), Positives = 149/310 (48%), Gaps = 32/310 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           ++ ++VK  ++Y    E   RFE FK + K  DE+ G + +           T      K
Sbjct: 55  YEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSK 114

Query: 103 -EKERLEADRERVKKFLNE-------RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
               +++ +R R+KK           R    LP+S+DWR+    V   V+ Q  CGSCWA
Sbjct: 115 FLGTKIDPNR-RMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVV--GVKDQASCGSCWA 171

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQAD 212
           F+  A +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++    G++S+ D
Sbjct: 172 FSAIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDD 231

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHRLIE 267
           YPY+    +  RC   ++ AKV   D +        + L   + + PI V +    R  +
Sbjct: 232 YPYKA---VDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQ 288

Query: 268 SYD-GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG- 325
            Y+ G    R   A     LDH VA VGYG +NG   WIVRNSWG    + GY ++ER  
Sbjct: 289 LYEYGVFTGRCGTA-----LDHGVAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNL 343

Query: 326 ----ANACGI 331
               A  CGI
Sbjct: 344 ASSRAGKCGI 353


>gi|412992445|emb|CCO18425.1| unknown [Bathycoccus prasinos]
          Length = 500

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 78/231 (33%), Positives = 116/231 (50%), Gaps = 23/231 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP+  DWR      + PV+ QG+CGSCW F+TT  +E    +    L  LS+ QL++CD 
Sbjct: 272 LPQYYDWRARGA--VTPVKDQGQCGSCWTFSTTGAIEGANFIKTGKLVSLSEQQLLDCDV 329

Query: 186 G---------NLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVF 235
           G         +  CNGG    A EY+ ++ GL+++  YPY+  +  T R    K  A + 
Sbjct: 330 GCAPDIPNACDSGCNGGLPSNAMEYIVEHGGLDTEKSYPYKAYKEDTCRAKEGKLGATI- 388

Query: 236 VQDTWVTSGVDHMMH-LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVG 294
              T+V     HM H L++ GP+ + +N   ++SY G       W CN   LDH V IVG
Sbjct: 389 SNYTFVGKNETHMAHALVKYGPLSIGINAAWMQSYVGGVA--CPWLCNKDALDHGVLIVG 446

Query: 295 YGEKNGILT-------WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLA 338
           YGE+            W+++NSWG    + GY++I +    CG+ +    A
Sbjct: 447 YGEEGFAPARLHKEPYWVIKNSWGMGWGEEGYYRICKDKGNCGVNNMVVAA 497


>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 117/224 (52%), Gaps = 18/224 (8%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G LPKS+DWR S +  ++ V+ QG CGSCWAF+TT  LE Q +     L  LS+ QLV+C
Sbjct: 112 GTLPKSVDWRNSHM--VSEVKDQGECGSCWAFSTTGSLEGQHSSKTGKLVDLSEQQLVDC 169

Query: 184 --DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEK-AKVFVQDT 239
             D GN  C GG +D AF+Y+K   GL+++  YPY   ++    C ++        V   
Sbjct: 170 SKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKP--CKFDNSSVGATLVGYK 227

Query: 240 WVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            V SG +H +   +   GP+ V ++  H   + Y       ++  C+  +LDH V  VGY
Sbjct: 228 DVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYSSGVY--DEPQCSTEQLDHGVLAVGY 285

Query: 296 GEKNG---ILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G  N       WIV+NSWG    D GY  + R   N CGI + A
Sbjct: 286 GAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQCGIATSA 329


>gi|7271893|gb|AAF44677.1|AF239266_1 cathepsin L [Fasciola gigantica]
          Length = 326

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 118/224 (52%), Gaps = 12/224 (5%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
           K  +P+S+DWR      +  V+ QG+CGSCWAF+TT  +E Q    ++     S+ QLV+
Sbjct: 105 KPAVPESIDWRD--YYYVTEVKDQGQCGSCWAFSTTGAMEGQFRKNERASASFSEQQLVD 162

Query: 183 C--DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           C  + GN  C GG ++ A+EY+K  GLE+ + YPY+  E     C Y+   A   V D +
Sbjct: 163 CTRNFGNHGCGGGYMENAYEYLKHSGLETDSYYPYQAVEG---PCQYDGRLAYAKVTDYY 219

Query: 241 VTSGVDH--MMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
                D   + +L+ + GP  V L+         + I  ++  C P +L HAV  VGYG 
Sbjct: 220 TVHSGDEVELKNLVGTEGPAAVALDVDYDFMMYESGIYHSE-TCLPDRLTHAVLAVGYGA 278

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           ++G   WIV+NSWG    + GY +  R   N CGI S A +  V
Sbjct: 279 QDGTDYWIVKNSWGSSWGEKGYIRFARNRGNMCGIASLASVPMV 322


>gi|294879891|ref|XP_002768815.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239871742|gb|EER01533.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 247

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 115/230 (50%), Gaps = 22/230 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP S+DWR   V  L PV++QG CGSCWAF+TT  LE+Q A+    L  LS+ +LV+C H
Sbjct: 24  LPTSVDWRNKSV--LTPVKNQGSCGSCWAFSTTGALEAQYAIATGKLLSLSEQELVDCSH 81

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
             GN  C GG +  A+EY+   GL+ ++ YPY+  +     C   ++KA          +
Sbjct: 82  KYGNDGCIGGYMGAAYEYINSAGLDQESTYPYKGWDE---PCRPREKKADGIPAGE--VT 136

Query: 244 GV-------DHMMHLLQSGPIGV--YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVG 294
           GV         +M  L   P+ V  Y +      Y    I R    CN  + DHAV  VG
Sbjct: 137 GVHLLAKTEQSLMKALADAPVSVAMYASDPNFRFYRSGVILRVLATCN-GETDHAVVAVG 195

Query: 295 YGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA---CGIESYAYLASVK 341
           YG   G   +I++NSWG      GYF ++RG      C I  Y  + ++K
Sbjct: 196 YGADKGSDYFILKNSWGSKWGIGGYFFLKRGVGGHGECNILEYMLVPTLK 245


>gi|426369199|ref|XP_004051582.1| PREDICTED: cathepsin W [Gorilla gorilla gorilla]
          Length = 376

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 86/344 (25%), Positives = 153/344 (44%), Gaps = 40/344 (11%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGT 80
           +DL    ++  +AFK + +++NR+Y    E   R + F  +  +             +G 
Sbjct: 29  QDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGV 88

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           +  SD + +E  Q  G R        +       ++  +E  +  +P + DWR+     +
Sbjct: 89  TPFSDLTEEEFGQLYGYRRAAGGVPSMG------REIRSEEPEESVPFTCDWRKV-AGAI 141

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI-DVAF 199
           +P++ Q  C  CWA A    +E+   +       +S  +L++C      C+GG + D   
Sbjct: 142 SPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFI 201

Query: 200 EYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPI 257
             +   GL S+ DYP++ K      C  +K +   ++QD  +    +H +  +L   GPI
Sbjct: 202 TVLNNSGLASEKDYPFQGKVR-AHSCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPI 260

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE---KNGILT----------- 303
            V +N + +  Y    I+     C+P  +DH+V +VG+G    + GIL            
Sbjct: 261 TVTINMKPLRLYRKGVIKATPITCDPQLVDHSVLLVGFGSIKSEEGILAETVSSQSQPQP 320

Query: 304 ------WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASVK 341
                 WI++NSWG    + GYF++ RG+N CGI  +   A V+
Sbjct: 321 PHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLTARVQ 364


>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
 gi|194701798|gb|ACF84983.1| unknown [Zea mays]
 gi|194704800|gb|ACF86484.1| unknown [Zea mays]
 gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
 gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
          Length = 470

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 120/222 (54%), Gaps = 21/222 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           LP+S+DWR+     + PV++QG+CGSCWAF+  + +ES   ++   +  LS+ +LVEC  
Sbjct: 150 LPESVDWREKGA--VAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECST 207

Query: 184 DHGNLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
           D GN  CNGG +D AF+++ K  G++++ DYPYR    +  +C   ++ A+V   D +  
Sbjct: 208 DGGNSGCNGGLMDAAFDFIIKNGGIDTEDDYPYRA---VDGKCDMNRKNARVVSIDGFED 264

Query: 241 -VTSGVDHMMHLLQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
              +    +   +   P+ V +    R  + Y          +C  + LDH V  VGYG 
Sbjct: 265 VPENDEKSLQKAVAHQPVSVAIEAGGREFQLYKSGVF---SGSCTTN-LDHGVVAVGYGA 320

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYA 335
           +NG   WIVRNSWG    + GY ++ER  NA    CGI   A
Sbjct: 321 ENGKDYWIVRNSWGPKWGEAGYIRMERNVNASTGKCGIAMMA 362


>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
          Length = 423

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 99/310 (31%), Positives = 156/310 (50%), Gaps = 44/310 (14%)

Query: 47  IVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQR-TG 96
           +VK ++ Y      + RFE FK + +  DE+          G +  +D S +E      G
Sbjct: 11  LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70

Query: 97  LRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
            R+  ++++  E+DR   K  + +     LP+S+DWR+     + PV+ QG+CGSCWAF+
Sbjct: 71  GRMV-RDRKGFESDR--FKYGVGDE----LPQSVDWREKGA--VAPVKDQGQCGSCWAFS 121

Query: 157 TTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYGLESQADYP 214
           T A +E    +    L  LS+ +LV+CD G N  CNGG +D AFE+ VK  G++++ DYP
Sbjct: 122 TVAAVEGINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYP 181

Query: 215 YRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYLN-----HRLI 266
           Y+    +  +C   ++ AKV   + +     +    +   +   P+ V +       +L 
Sbjct: 182 YK---GVDGQCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLY 238

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA 326
           ES   N +   D       LDH V  VGYG ++G   WIVRNSWG    ++GY ++ER  
Sbjct: 239 ESGIFNGLCGTD-------LDHGVVAVGYGTEDGKDYWIVRNSWGPNWGENGYIRLERNV 291

Query: 327 NA-----CGI 331
            +     CGI
Sbjct: 292 ASTNTGKCGI 301


>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 81/220 (36%), Positives = 121/220 (55%), Gaps = 20/220 (9%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-- 184
           P ++DWR      + PV+ QG+CGSCWAF+TT  LE Q       L  LS+  LV+C   
Sbjct: 109 PDTVDWRNEGY--VTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTA 166

Query: 185 HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--- 240
           +GN  CNGG +D AF Y+K+  G++S+A YPY  ++    +C ++K    V   DT    
Sbjct: 167 YGNNGCNGGLMDNAFTYIKENKGIDSEASYPYTAEDG---KCVFKK--PSVAATDTGFVD 221

Query: 241 VTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           +  G ++ +   +   GPI V ++  H   + Y       N+ +C+  +LDH V +VGYG
Sbjct: 222 LPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVY--NEPSCSSTELDHGVLVVGYG 279

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            ++G   W+V+NSW     D GY ++ R A N CGI + A
Sbjct: 280 TESGKDYWLVKNSWNTSWGDKGYIKMRRNAKNQCGIATKA 319


>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 92/303 (30%), Positives = 153/303 (50%), Gaps = 31/303 (10%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K ++ F++++ +  + Y    E   RFE FK + K  DE        + G +  +D S Q
Sbjct: 42  KLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQ 101

Query: 90  EILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E   +  GL++   ++     + E   + ++      LPKS+DWR  K   + PV++QG+
Sbjct: 102 EFKNKYLGLKVNLSQRRESSNEEEFTYRDVD------LPKSVDWR--KKGAVTPVKNQGQ 153

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQY-G 206
           CGSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AF ++ Q  G
Sbjct: 154 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQNGG 213

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL-- 261
           L  + DYPY  +E+    C  +KE+ +V   + +     +    ++  L + P+ V +  
Sbjct: 214 LHKEDDYPYIMEEST---CEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEA 270

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           + R  + Y G      D  C    LDH V+ VGYG    +   IV+NSWG    + G+ +
Sbjct: 271 SSRDFQFYSGGVF---DGHCGS-DLDHGVSAVGYGTSKNLDYIIVKNSWGAKWGEKGFIR 326

Query: 322 IER 324
           ++R
Sbjct: 327 MKR 329


>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
          Length = 333

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 83/220 (37%), Positives = 115/220 (52%), Gaps = 18/220 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           LP ++DWR  K   + PV++QG+CGSCWAF+TT  LE Q       L  LS+  LV+C  
Sbjct: 117 LPTTVDWR--KKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSD 174

Query: 184 DHGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKV----FVQD 238
           D GN  CNGG +D  F+Y+K  G ++++  +PY  ++     C ++K         FV D
Sbjct: 175 DFGNQGCNGGLMDNGFQYIKANGGIDTEESHPYTAQDG---DCKFKKADVGATDAGFV-D 230

Query: 239 TWVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
               S  D    +   GP+ V ++  H   + Y        D  C+  +LDH V  VGYG
Sbjct: 231 IQQGSEDDLKKAVATVGPVSVAIDASHGSFQLYSQGVYDEPD--CSSSQLDHGVLTVGYG 288

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            KNG   W+V+NSWG    D+GY  + R   N CGI S A
Sbjct: 289 VKNGKKYWLVKNSWGGDWGDNGYILMSRDKDNQCGIASSA 328


>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
          Length = 335

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 81/221 (36%), Positives = 115/221 (52%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           LPK++DWR      + PV++QG+CGSCWAF+ T  LE Q      ++  LS+  LV+C  
Sbjct: 119 LPKTVDWRTKGA--VTPVKNQGQCGSCWAFSATGSLEGQHFRKSGSMVSLSEQNLVDCST 176

Query: 184 DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKV----FVQD 238
           D GN  C GG +D AF+Y++   G++++  YPY   +     C ++K         FV D
Sbjct: 177 DFGNNGCEGGLMDNAFKYIRANKGIDTEKSYPYNGTDGT---CHFKKSTVGATDSGFV-D 232

Query: 239 TWVTSGVDHMMHLLQSGPIGVYLN--HRLIESY-DGNPIRRNDWACNPHKLDHAVAIVGY 295
               S       +   GPI V ++  H   + Y DG     ++  C+   LDH V +VGY
Sbjct: 233 IKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG---VYDEPECDSESLDHGVLVVGY 289

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
           G  NG   W+V+NSWG    D GY ++ R   N CGI S A
Sbjct: 290 GTLNGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQCGIASSA 330


>gi|358255491|dbj|GAA57187.1| cathepsin L [Clonorchis sinensis]
          Length = 368

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 98/314 (31%), Positives = 151/314 (48%), Gaps = 35/314 (11%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYF-------------KQDGKETDEYYGTSGSSDRSP 88
           A++ +  +++R Y+D  E   R   F              ++G E+ +  G +  +DR P
Sbjct: 61  AWEQFKHQFDRVYSDAEESSKRLNVFCENFLYVRRHNNAYEEGTESFKL-GINQFADRLP 119

Query: 89  QEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           +E     G  +         A   R +K        P PKS+DWR  K   +  +  QGR
Sbjct: 120 KERENICGGHIPANLSSHGGA---RFRKI-----AAPPPKSIDWR--KKGAVTSIRKQGR 169

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQYG 206
           CGSCWAFA  A +E    +    L  LS  QL++C  ++GN  C GG+   +F+Y+K+ G
Sbjct: 170 CGSCWAFAAAAAVEGHTYIHNNQLETLSTQQLIDCSLEYGNGGCTGGDSVTSFKYLKESG 229

Query: 207 -LESQADYPYRNKENI--TFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS----GPIGV 259
            LE   DYPY + + I     C ++  K    V    V    D    +LQ+    GP+ +
Sbjct: 230 GLERDRDYPYVSDKTIRPNPECKFDWTKCAAEVTGFVVLPYHDEDA-ILQAVGFYGPVAI 288

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGY 319
            ++ RL    D      +D  C  +  DH++ +VGYGE+NG   WI++NSWG+   + GY
Sbjct: 289 SVDSRLQSFKDYKGDIYSDPLCGKNS-DHSMVVVGYGEENGTPYWIIKNSWGEHWGEKGY 347

Query: 320 FQIERGANACGIES 333
            ++ RG N CG+ S
Sbjct: 348 LRLRRGVNMCGVAS 361


>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 117/221 (52%), Gaps = 17/221 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +PKS+DWR+     + PV++QG+CGSCWAF+ +  LE Q+ L    L  LS+  LV+C H
Sbjct: 114 IPKSVDWREKGC--VTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH 171

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKE-NITFRCTYEKEKAKVFVQDTWV 241
             GN  CNGG +D AF+Y+K+  GL+S+  YPY  K+ +  +R  +       FV     
Sbjct: 172 AQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANGTGFVDIPQQ 231

Query: 242 TSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG--- 296
              +  M  +   GPI V ++  H  ++ Y        +  C+   LDH V +VGYG   
Sbjct: 232 EKAL--MKAVATVGPISVAMDASHPSLQFYSSGIYYEPN--CSSKNLDHGVLLVGYGYEG 287

Query: 297 -EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            + N    W+V+NSWG      GY +I +   N CG+ + A
Sbjct: 288 TDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAA 328


>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
          Length = 318

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 80/292 (27%), Positives = 140/292 (47%), Gaps = 10/292 (3%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRS-PQEILQRTGLRLTG 101
           F+++ +K +++Y++  E   R   F ++ ++ +E+     +   S  + + Q T L +  
Sbjct: 25  FQSFKLKHSKSYSNQVEEAKRLAIFTENLRDIEEHNALYAAGLVSYNKSVNQFTDLTIDE 84

Query: 102 KEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAIL 161
            +       +  +      R    +P +LDWR      +  V+ QG CGSCWAF+     
Sbjct: 85  FKAYLTLHSKPTLNTVPYVRTGLQVPTTLDWRSQGY--VTGVKDQGDCGSCWAFSVVGST 142

Query: 162 ESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQYGLESQADYPYRNKEN 220
           E         L  LS+ QL++C    N  C+GG ++  F YV+Q GL S++ YPY  ++ 
Sbjct: 143 EGAYYKSTGKLVSLSEQQLIDCTTNVNDGCDGGYLEETFPYVQQTGLVSESSYPYTGRDG 202

Query: 221 ITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDW 279
               C   +      V    +  G   ++  + S GP+ V ++   I SY       +  
Sbjct: 203 ---NCRISESDVVTKVSKYVLLGGEADLLEAVGSVGPVSVAMDATYIYSYASGVYESS-- 257

Query: 280 ACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGI 331
            C+ + L+H V +VGYG ++G   W+++NSWG+   + GY ++ RG N CGI
Sbjct: 258 LCSLYSLNHGVLVVGYGTQDGKDYWLIKNSWGNTWGEQGYLKLLRGTNECGI 309


>gi|545734|gb|AAB30089.1| cysteine protease [Fasciola sp.]
 gi|2662308|dbj|BAA23743.1| cathepsin L [Fasciola hepatica]
          Length = 325

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 118/224 (52%), Gaps = 19/224 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGSCWAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 108 VPDKIDWRESGY--VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSG 165

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN  C GG ++ A+EY+KQ+GLE+++ YPY   E    +C Y ++     V D + V 
Sbjct: 166 PWGNYGCMGGLMENAYEYLKQFGLETESSYPYTAVEG---QCRYNRQLGVAKVTDYYTVH 222

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIES----YDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           SG +  +  L    GP  V ++   +ES    Y G   +     C+  +++HAV  VGYG
Sbjct: 223 SGSEVELKNLVGAEGPAAVAVD---VESDFMMYSGGIYQSR--TCSSLRVNHAVLAVGYG 277

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
            + G   WIV+NSWG    +     +    N CGI S A L  V
Sbjct: 278 TQGGTDYWIVKNSWGSSWGERYIRMVRNRGNMCGIASLASLPMV 321


>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 84/221 (38%), Positives = 113/221 (51%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPK +DWR  K   + PV+ QG+CGSCWAF+ T  LE Q  L    L  LS+  LV+C  
Sbjct: 116 LPKVVDWR--KKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
             GN  C GG ++ AF+Y+K   G++++  YPY   E +   C ++KE   V   DT   
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPY---EAVDGECRFKKED--VGATDTGYV 228

Query: 241 ---VTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
                S VD    +   GPI V ++  H   + Y        +  C+   LDH V +VGY
Sbjct: 229 EIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPE--CSSEDLDHGVLVVGY 286

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIER-GANACGIESYA 335
           G K G   W+V+NSW +   D GY  + R   N CGI S A
Sbjct: 287 GVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327


>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
          Length = 338

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 81/247 (32%), Positives = 127/247 (51%), Gaps = 18/247 (7%)

Query: 99  LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVK-VLNPVESQGRCGSCWAFAT 157
           + G + ++  + R        E     +P+S+DWR   VK  + PV+ QG+CGSCWAF++
Sbjct: 95  MNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWR---VKGAITPVKDQGQCGSCWAFSS 151

Query: 158 TAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYP 214
           T  LE Q       L  LS+  L++C   +GN  CNGG +D AF+Y+K   G++++  YP
Sbjct: 152 TGALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYP 211

Query: 215 YRNKENITFRCTYE-KEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESY 269
           Y  ++N+   C Y  + +  +      + SG +  +   +   GP+ V ++  H   + Y
Sbjct: 212 YEAEDNV---CRYNPRNRGAIDRGFVHIPSGEEDKLKAAVATVGPVSVAIDASHESFQFY 268

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANA 328
                     +C+   LDH V +VGYG  NG   W+V+NSW +   D GY +I R   N 
Sbjct: 269 SKGVYYEP--SCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARNRKNH 326

Query: 329 CGIESYA 335
           CGI + A
Sbjct: 327 CGIATAA 333


>gi|313246319|emb|CBY35240.1| unnamed protein product [Oikopleura dioica]
          Length = 326

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 76/219 (34%), Positives = 113/219 (51%), Gaps = 14/219 (6%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P S DWR +   V+ PV+ QG+CGSCWAF+T A LESQ AL    L  LS+ QLV+C  
Sbjct: 108 MPASADWRTANPPVVTPVKDQGQCGSCWAFSTIASLESQWALAGNALTSLSEQQLVDCSM 167

Query: 186 --GNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-V 241
             GN  C+GG +   F Y+    G++++A YPY  ++    +C +        +   + +
Sbjct: 168 NWGNYGCSGGLMTQGFTYIHDNNGVDTEASYPYTAQDG---KCVFNPANVGTSLTSCYNI 224

Query: 242 TSGVDHMM--HLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
            SG +  +   +   GP+ V ++  H   + Y        +  C+   LDH V  VGYG 
Sbjct: 225 ASGDEAALANAVQMVGPMSVAIDASHMSFQLYTSGVYYEPN--CSSQFLDHGVTAVGYGS 282

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            +G   +IV+NSW     D+GY  + R   N CGI + A
Sbjct: 283 SSGNDFFIVKNSWAATWGDNGYIMMSRNKNNNCGIATSA 321


>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
           variegatum]
          Length = 337

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 82/221 (37%), Positives = 116/221 (52%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LPK++DWR  K   + PV++QG+CGSCWAF+TT  LE Q       +  LS+  LV+C  
Sbjct: 121 LPKTVDWR--KKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDCST 178

Query: 185 -HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  C GG +D AF+Y+K   G++++  YPY   +     C +  +K+ V   DT   
Sbjct: 179 AFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTDGT---CHF--KKSDVGATDTGFV 233

Query: 243 SGVDHMMHLLQS-----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
              +   HLL+      GPI V ++  H+  + Y        +  C+   LDH V +VGY
Sbjct: 234 DIPEGNEHLLKKAVATVGPISVAIDASHQSFQFYSQGVYDEPE--CSSENLDHGVLVVGY 291

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G K+    W+V+NSWG    D GY  + R   N CGI S A
Sbjct: 292 GTKDDQDYWLVKNSWGTTWGDGGYIYMTRNKDNQCGIASSA 332


>gi|261328617|emb|CBH11595.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
 gi|261328620|emb|CBH11598.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
          Length = 450

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 81/300 (27%), Positives = 142/300 (47%), Gaps = 25/300 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD--------EYYGTSGSSDRSPQEILQR 94
           F  +  K+ + Y D  E   RF  F+++ ++            +G +  SD + +E   R
Sbjct: 41  FAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRAR 100

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                         A ++R++K +N    G  P ++DWR+     + PV+ QG+CGSCWA
Sbjct: 101 YR-----NGASYFAAAQKRLRKTVN-VTTGRAPAAVDWREKGA--VTPVKDQGQCGSCWA 152

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY---GLESQA 211
           F+T   +E Q  +    L  LS+  LV CD  +  C GG +D AF ++       + ++A
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEA 212

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRLIESY 269
            YPY +      +C     +    + D   +    D +  +L ++GP+ + ++      Y
Sbjct: 213 SYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDY 272

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
           +G  +     +C   +LDH V +VGY + +    WI++NSW ++  + GY +IE+G N C
Sbjct: 273 NGGILT----SCTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328


>gi|261328615|emb|CBH11593.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
          Length = 451

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 81/300 (27%), Positives = 142/300 (47%), Gaps = 25/300 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD--------EYYGTSGSSDRSPQEILQR 94
           F  +  K+ + Y D  E   RF  F+++ ++            +G +  SD + +E   R
Sbjct: 41  FAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRAR 100

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                         A ++R++K +N    G  P ++DWR+     + PV+ QG+CGSCWA
Sbjct: 101 YR-----NGASYFAAAQKRLRKTVN-VTTGRAPAAVDWREKGA--VTPVKDQGQCGSCWA 152

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY---GLESQA 211
           F+T   +E Q  +    L  LS+  LV CD  +  C GG +D AF ++       + ++A
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEA 212

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRLIESY 269
            YPY +      +C     +    + D   +    D +  +L ++GP+ + ++      Y
Sbjct: 213 SYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDY 272

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
           +G  +     +C   +LDH V +VGY + +    WI++NSW ++  + GY +IE+G N C
Sbjct: 273 NGGILT----SCTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328


>gi|388521567|gb|AFK48845.1| unknown [Medicago truncatula]
          Length = 343

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 80/218 (36%), Positives = 113/218 (51%), Gaps = 18/218 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LP   DWR  K  +++ V+ QG CGSCW F+TT  LES  A        LS+ QLV+C  
Sbjct: 126 LPAEKDWR--KEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAG 183

Query: 185 -HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-V 241
            + N  CNGG    AFEY+K   GLE++  YPY  +  +   C +  E   V V  +  +
Sbjct: 184 AYNNFGCNGGLPSQAFEYIKYNGGLETEEVYPYTGQNGL---CKFTSENVAVQVLGSVNI 240

Query: 242 TSGV-DHMMHLLQ-SGPIGVYL----NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
           T G  D + H +  + P+ V      + RL   Y             P  ++HAV  VGY
Sbjct: 241 TLGAEDELKHAVAFARPVSVAFQVVDDFRL---YKKGVYTGTTCGSTPMDVNHAVLAVGY 297

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIES 333
           G ++G+  W+++NSWG    DHGYF++E G N CG+ +
Sbjct: 298 GIEDGVPYWLIKNSWGGEWGDHGYFKMEMGKNMCGVAT 335


>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
          Length = 433

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 153/313 (48%), Gaps = 42/313 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKT--RFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLR-- 98
           ++ ++VK  +  + ++ ++   RFE FK + +  DE+         + + +  R GL   
Sbjct: 50  YEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEH---------NEKNLSYRLGLTRF 100

Query: 99  --LTGKE------KERLEADRERVKKFLNERKKG-PLPKSLDWRQSKVKVLNPVESQGRC 149
             LT  E        ++E   ER      E + G  LP+S+DWR  K   +  V+ QG C
Sbjct: 101 ADLTNDEYRSKYLGAKMEKKGERRTSLRYEARVGDELPESIDWR--KKGAVAEVKDQGGC 158

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGL 207
           GSCWAF+T   +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++ K  G+
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--N 262
           ++  DYPY+  +     C   ++ AKV   D++    T   + +   +   PI + +   
Sbjct: 219 DTDKDYPYKGVDGT---CDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAG 275

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
            R  + YD       D +C   +LDH V  VGYG +NG   WIVRNSWG    + GY ++
Sbjct: 276 GRAFQLYDSGIF---DGSCGT-QLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRM 331

Query: 323 ER----GANACGI 331
            R     +  CGI
Sbjct: 332 ARNIASSSGKCGI 344


>gi|1666270|emb|CAA49713.1| envelope glycoprotein [Autographa californica nucleopolyhedrovirus]
          Length = 208

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 74/208 (35%), Positives = 117/208 (56%), Gaps = 13/208 (6%)

Query: 130 LDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLN 189
           + WR  ++  +  V++QG CG+CWAFAT A LESQ A+    L  LS+ Q+++CD  +  
Sbjct: 1   IHWR--RLNKVTSVKNQGMCGACWAFATLASLESQFAIKHNQLINLSEQQMIDCDFVDAG 58

Query: 190 CNGGNIDVAFE-YVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVD 246
           CNGG +  AFE  +K  G++ ++DYPY    N    C     K  V V+D   ++T   +
Sbjct: 59  CNGGLLHTAFEAIIKMGGVQLESDYPYEADNN---NCRMNSNKFLVQVKDCYRYITVYEE 115

Query: 247 HMMHLLQ-SGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWI 305
            +  LL+  GPI + ++   I +Y    I+     C    L+HAV +VGYG +N I  W 
Sbjct: 116 KLKDLLRLVGPIPMAIDAADIVNYKQGIIKY----CFNSGLNHAVLLVGYGVENNIPYWT 171

Query: 306 VRNSWGDIGPDHGYFQIERGANACGIES 333
            +N+WG    + G+F++++  NACG+ +
Sbjct: 172 FKNTWGTDWGEDGFFRVQQNINACGMRN 199


>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
          Length = 443

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 83/306 (27%), Positives = 137/306 (44%), Gaps = 21/306 (6%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           F  +     R Y   +E + RFE F  + K+       +  +   P E    T      +
Sbjct: 25  FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNPMATFGPNEFADMTSEEFQTR 84

Query: 103 EKERLEA------DRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
                          +  K F  E  K  + + +DWR      + PV++QG CGSCW+F+
Sbjct: 85  HNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQIDWRLKGA--VTPVKNQGACGSCWSFS 142

Query: 157 TTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQYGLESQADY 213
           TT  +E Q A+    L  +S+ +LV CD  +  CNGG +D AF ++    +  + ++A+Y
Sbjct: 143 TTGNIEGQHAIATGQLVAVSEQELVSCDPIDDGCNGGLMDNAFGWLISAHKGQIATEANY 202

Query: 214 PYRNKENITFRCTYEKEKAKV-----FVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIES 268
           PY +   I   C+   E   V       QD   T   D    + + GP+ + ++    +S
Sbjct: 203 PYVSGNGIVPACSSSPESKPVGATISAFQDIARTE-EDMAAFVFKHGPLSIGVDASTWQS 261

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y G  +      C   ++DH V IVG+ +      WI++NSW     + GY ++ +G+N 
Sbjct: 262 YAGGIMSY----CPQDQIDHGVLIVGFDDTASTPYWIIKNSWTANWGEEGYIRVAKGSNQ 317

Query: 329 CGIESY 334
           CG+ S+
Sbjct: 318 CGLTSH 323


>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
          Length = 428

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 83/306 (27%), Positives = 137/306 (44%), Gaps = 21/306 (6%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           F  +     R Y   +E + RFE F  + K+       +  +   P E    T      +
Sbjct: 10  FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNPMATFGPNEFADMTSEEFQTR 69

Query: 103 EKERLEA------DRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
                          +  K F  E  K  + + +DWR      + PV++QG CGSCW+F+
Sbjct: 70  HNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQIDWRLKGA--VTPVKNQGACGSCWSFS 127

Query: 157 TTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQYGLESQADY 213
           TT  +E Q A+    L  +S+ +LV CD  +  CNGG +D AF ++    +  + ++A+Y
Sbjct: 128 TTGNIEGQHAIATGQLVAVSEQELVSCDPIDDGCNGGLMDNAFGWLISAHKGQIATEANY 187

Query: 214 PYRNKENITFRCTYEKEKAKV-----FVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIES 268
           PY +   I   C+   E   V       QD   T   D    + + GP+ + ++    +S
Sbjct: 188 PYVSGNGIVPACSSSPESKPVGATISAFQDIARTE-EDMAAFVFKHGPLSIGVDASTWQS 246

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y G  +      C   ++DH V IVG+ +      WI++NSW     + GY ++ +G+N 
Sbjct: 247 YAGGIMSY----CPQDQIDHGVLIVGFDDTASTPYWIIKNSWTANWGEEGYIRVAKGSNQ 302

Query: 329 CGIESY 334
           CG+ S+
Sbjct: 303 CGLTSH 308


>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
 gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 84/221 (38%), Positives = 113/221 (51%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPK +DWR  K   + PV+ QG+CGSCWAF+ T  LE Q  L    L  LS+  LV+C  
Sbjct: 116 LPKVVDWR--KKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
             GN  C GG ++ AF+Y+K   G++++  YPY   E +   C ++KE   V   DT   
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPY---EAVDGECRFKKED--VGATDTGYV 228

Query: 241 ---VTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
                S VD    +   GPI V ++  H   + Y        +  C+   LDH V +VGY
Sbjct: 229 EIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPE--CSSEDLDHGVLVVGY 286

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIER-GANACGIESYA 335
           G K G   W+V+NSW +   D GY  + R   N CGI S A
Sbjct: 287 GVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327


>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
          Length = 332

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 84/221 (38%), Positives = 113/221 (51%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPK +DWR  K   + PV+ QG+CGSCWAF+ T  LE Q  L    L  LS+  LV+C  
Sbjct: 116 LPKVVDWR--KKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
             GN  C GG ++ AF+Y+K   G++++  YPY   E +   C ++KE   V   DT   
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPY---EAVDGECRFKKED--VGATDTGYV 228

Query: 241 ---VTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
                S VD    +   GPI V ++  H   + Y        +  C+   LDH V +VGY
Sbjct: 229 EIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPE--CSSEDLDHGVLVVGY 286

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIER-GANACGIESYA 335
           G K G   W+V+NSW +   D GY  + R   N CGI S A
Sbjct: 287 GVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327


>gi|15485586|emb|CAC67416.1| cysteine protease [Trypanosoma brucei rhodesiense]
          Length = 450

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 81/300 (27%), Positives = 142/300 (47%), Gaps = 25/300 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD--------EYYGTSGSSDRSPQEILQR 94
           F  +  K+ + Y D  E   RF  F+++ ++            +G +  SD + +E   R
Sbjct: 41  FAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRAR 100

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                         A ++R++K +N    G  P ++DWR+     + PV+ QG+CGSCWA
Sbjct: 101 YR-----NGASYFAAAQKRLRKTVN-VTTGRAPAAVDWREKGA--VTPVKDQGQCGSCWA 152

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY---GLESQA 211
           F+T   +E Q  +    L  LS+  LV CD  +  C GG +D AF ++       + ++A
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEA 212

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRLIESY 269
            YPY +      +C     +    + D   +    D +  +L ++GP+ + ++      Y
Sbjct: 213 SYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDY 272

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
           +G  +     +C   +LDH V +VGY + +    WI++NSW ++  + GY +IE+G N C
Sbjct: 273 NGGILT----SCTSEQLDHGVLLVGYNDSSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328


>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
 gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
          Length = 457

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 102/310 (32%), Positives = 149/310 (48%), Gaps = 32/310 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           ++ ++VK  ++Y    E   RFE FK + K  DE+ G + +           T      K
Sbjct: 55  YEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSK 114

Query: 103 -EKERLEADRERVKKF-------LNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
               +++ +R R+KK           R    LP+S+DWR+    V   V+ Q  CGSCWA
Sbjct: 115 FLGTKIDPNR-RMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVV--GVKDQASCGSCWA 171

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQAD 212
           F+  A +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++    G++S+ D
Sbjct: 172 FSAIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDD 231

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHRLIE 267
           YPY+    +  RC   ++ AKV   D +        + L   + + PI V +    R  +
Sbjct: 232 YPYKA---VDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQ 288

Query: 268 SYD-GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG- 325
            Y+ G    R   A     LDH VA VGYG +NG   WIVRNSWG    + GY ++ER  
Sbjct: 289 LYEYGVFTGRCGTA-----LDHGVAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNL 343

Query: 326 ----ANACGI 331
               A  CGI
Sbjct: 344 ASSRAGKCGI 353


>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
          Length = 343

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 94/312 (30%), Positives = 152/312 (48%), Gaps = 28/312 (8%)

Query: 45  TYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTG-------- 96
           T+ ++ N+ Y +D E + R + F  +  +  ++ G       S +  + + G        
Sbjct: 30  TFKMEHNKVYKNDIEERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFV 89

Query: 97  LRLTGKEKE---RLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             L G  K    +L ++R  +     E     LPK++DWR+     + PV+ QG CGSCW
Sbjct: 90  NTLNGFNKSINTQLRSERLPIGASFIEPANVVLPKTVDWREHGA--VTPVKDQGHCGSCW 147

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQ 210
           +F+ T  LE Q       L PLS+  L++C   +GN  CNGG +D AF+Y+K   GL+++
Sbjct: 148 SFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTE 207

Query: 211 ADYPYRNKENITFRCTYEKEKAKVF-VQDTWVTSGVDHMMH--LLQSGPIGVYLN--HRL 265
             YPY  + +   +C Y    +    V    +  G +  +   +   GP+ V ++  H+ 
Sbjct: 208 VTYPYEAEND---KCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVSVAIDASHQS 264

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIER 324
            + Y        +  C+   LDH V  VGYG ++NG   W+V+NSWG+   D+GY ++ R
Sbjct: 265 FQFYSEGVYYEPE--CSSENLDHGVLAVGYGTDENGQDYWLVKNSWGETWGDNGYIKMAR 322

Query: 325 G-ANACGIESYA 335
              N CGI S A
Sbjct: 323 NKLNHCGIASTA 334


>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
          Length = 324

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 80/220 (36%), Positives = 122/220 (55%), Gaps = 20/220 (9%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-- 184
           P ++DWR      + PV+ QG+CGSCWAF+TT  LE Q       L  LS+  LV+C   
Sbjct: 109 PDTVDWRNEGY--VTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTA 166

Query: 185 HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--- 240
           +GN  C+GG +D AF Y+K+  G++S+A YPY  ++    +C ++K  + V   DT    
Sbjct: 167 YGNNGCDGGLMDNAFTYIKENKGIDSEASYPYTAEDG---KCVFKK--SSVAATDTGFVD 221

Query: 241 VTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           +  G ++ +   +   GPI V ++  H   + Y       N+ +C+  +LDH V +VGYG
Sbjct: 222 IPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVY--NEPSCSSTELDHGVLVVGYG 279

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            ++G   W+V+NSW     D GY ++ R A N CGI + A
Sbjct: 280 TESGKDYWLVKNSWNTSWGDKGYIKMRRNAKNQCGIATKA 319


>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
          Length = 333

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 103/319 (32%), Positives = 149/319 (46%), Gaps = 42/319 (13%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQD----GKETDEY--------YGTSGSSDRSP 88
           +AFK+      +TY  + E   RF+ F ++     K   +Y         G +  +D  P
Sbjct: 28  EAFKS---THKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADLLP 84

Query: 89  QEILQRT----GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVE 144
            E ++      G RL G+    L          LN+     LPK++DWR  K   + PV+
Sbjct: 85  HEFVKMMNGYQGKRLAGRGSTYLPPAN------LNDSS---LPKTVDWR--KKGAVTPVK 133

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYV 202
            QG+CGSCWAF++T  LE Q  L    L  LS+  LV+C   +GN  CNGG +D +F Y+
Sbjct: 134 DQGQCGSCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYI 193

Query: 203 KQY-GLESQADYPYRNKENITFRCTYEKEKAKV----FVQDTWVTSGVDHMMHLLQSGPI 257
           K   G++++  YPY  ++     C Y+KE        FV D    S  D    +   GP+
Sbjct: 194 KANGGIDTEDSYPYEAEDG---DCRYKKEDVGATDTGFV-DIKEGSEKDLQKAVATVGPV 249

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            V ++         +    ++  C+   LDH V  VGYG KNG   W+V+NSW +     
Sbjct: 250 SVAIDASQQSFQLYSEGVYDEPNCSSESLDHGVLAVGYGVKNGKKYWLVKNSWAETWGQD 309

Query: 318 GYFQIERGA-NACGIESYA 335
           GY  + R   N CGI S A
Sbjct: 310 GYILMSRDKNNQCGIASSA 328


>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 94/315 (29%), Positives = 150/315 (47%), Gaps = 37/315 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSS---------DRSPQEILQ 93
           F+T+  +  +TY    E   R + F+ +     E+     SS         D +  E  +
Sbjct: 30  FETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHE-FK 88

Query: 94  RTGLRLTGKEKERLEADRE--RVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
            + L L+      L  DR   ++  F+ +     +P S+DWR  K   +  V+ QG CG+
Sbjct: 89  ASRLGLSSAASASLNVDRSNRQIPDFVAD-----VPASVDWR--KNGAVTQVKDQGNCGA 141

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLES 209
           CW+F+ T  +E    ++  +L  LS+ +LV+CD   N  C GG +D AF++V   +G+++
Sbjct: 142 CWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDT 201

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLL-----QSGPIGVYLNHR 264
           + DYPY+ ++     C  EK K  V   D +V    ++   LL     Q   +G+  + R
Sbjct: 202 EEDYPYQGRDR---SCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSER 258

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
             + Y           C+   LDHAV IVGYG +NG+  WIV+NSWG      GY  ++R
Sbjct: 259 AFQLYSKGIFTG---PCST-SLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQR 314

Query: 325 GANA----CGIESYA 335
            + +    CGI   A
Sbjct: 315 NSGSSRGLCGINMLA 329


>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
 gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 94/315 (29%), Positives = 153/315 (48%), Gaps = 36/315 (11%)

Query: 40  VDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRL 99
           +  +  ++ K ++TY    E + RFE FK + +  DE+   + S +R+ +  L R     
Sbjct: 45  ISMYNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEH---NNSKNRTYKVGLTRFADLT 101

Query: 100 TGKEKERLEADRERVKKFLNERKKGP-----------LPKSLDWRQSKVKVLNPVESQGR 148
             + + +    +   K+ L  + K P           LP+S+DWRQS    ++ ++ QG 
Sbjct: 102 NEEYRAKFLGTKSDPKRRL-MKSKNPSQRYAFKAGDVLPESIDWRQSGA--VSAIKDQGS 158

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYG 206
           CGSCWAF+T A +E    ++   L  LS+ +LV+CD   N  CNGG +D AF++ +   G
Sbjct: 159 CGSCWAFSTIAAVEGVNKIVTGELISLSEQELVDCDRSYNAGCNGGLMDNAFQFIINNGG 218

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS---GPIGVYL-- 261
           +++  DYPY   + +  +C   K K K    D +        M L ++    P+ V +  
Sbjct: 219 IDTDKDYPY---QAVDGKCDTTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQPVSVAIEA 275

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           +   ++ Y           C    LDH V IVGYG ++GI  W+VRNSWG    ++GY +
Sbjct: 276 SGMALQFYQSGVFTGE---CGS-ALDHGVVIVGYGTEDGIDYWLVRNSWGRDWGENGYIK 331

Query: 322 IERGA-----NACGI 331
           ++R         CGI
Sbjct: 332 MQRNVVDTFTGKCGI 346


>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 117/224 (52%), Gaps = 18/224 (8%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G LPKS+DWR S +  ++ V+ QG CGSCWAF+TT  LE Q +     L  LS+ QLV+C
Sbjct: 112 GTLPKSVDWRNSHM--VSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDC 169

Query: 184 --DHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEK-AKVFVQDT 239
             D GN  C GG +D AF+Y+K   GL+++  YPY   ++    C ++        V   
Sbjct: 170 SKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKP--CKFDNSSVGATLVGYK 227

Query: 240 WVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            V SG +H +   +   GP+ V ++  H   + Y       ++  C+  +LDH V  VGY
Sbjct: 228 DVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYSSGVY--DEPQCSTEQLDHGVLAVGY 285

Query: 296 GEKNG---ILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G  N       WIV+NSWG    D GY  + R   N CGI + A
Sbjct: 286 GAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQCGIATSA 329


>gi|5679322|gb|AAD46920.1|AF167986_1 putative cysteine proteinase GmPM33 [Glycine max]
          Length = 363

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 89/317 (28%), Positives = 148/317 (46%), Gaps = 40/317 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           FK ++  + R+Y+ + E   R   F Q+     E+     ++     +          G 
Sbjct: 54  FKVFMENYGRSYSTEEEYLRRLGIFAQNMVRAAEHQALDPTAVHGVTQFSLPVSNNAAGG 113

Query: 103 EKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILE 162
               LE D               LP++ DWR+     +  V+ QGRCGSCWAF+TT  +E
Sbjct: 114 IAPPLEVDG--------------LPENFDWREKGA--VTEVKLQGRCGSCWAFSTTGSIE 157

Query: 163 SQVALLKKTLYPLSKSQLVECDHG---------NLNCNGGNIDVAFEYVKQY-GLESQAD 212
               L    L  LS  QL++CD+          +  CNGG +  A+ Y+ +  GLE ++ 
Sbjct: 158 GANFLATGKLVSLSDQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESS 217

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRLIESYD 270
           YPY  +      C ++ EK  V + + T + +  + +  +L+++GP+ + +N   +++Y 
Sbjct: 218 YPYTGERG---ECKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQTYI 274

Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILT------WIVRNSWGDIGPDHGYFQIE 323
           G         C+  +L+H V +VGYG K   IL       WI++NSWG+   + GY+++ 
Sbjct: 275 GG--VSCPLICSKKRLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGEKWGEDGYYKLC 332

Query: 324 RGANACGIESYAYLASV 340
           RG   CGI +    A V
Sbjct: 333 RGHGMCGINTMVSAAMV 349


>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 332

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 84/220 (38%), Positives = 115/220 (52%), Gaps = 18/220 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP ++DWR  K   + PV+ QG+CGSCWAF+ T  LE Q  L    L  LS+  LV+C  
Sbjct: 116 LPSTVDWR--KKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCSQ 173

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKV----FVQD 238
             GN  C GG +D AF+Y+K   G++++  YPY   E +  +C ++KE        FV D
Sbjct: 174 SFGNNGCEGGLMDNAFKYIKANDGIDAEESYPY---EAMDDKCRFKKEDVGATDTGFV-D 229

Query: 239 TWVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
               S  D    +   GPI V ++  H   + Y        +  C+  +LDH V  VGYG
Sbjct: 230 IEGGSEDDLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPE--CSSEELDHGVLAVGYG 287

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            K+G   W+V+NSWG    D+GY  + R   N CGI S A
Sbjct: 288 VKDGKKYWLVKNSWGGSWGDNGYILMSRDKNNQCGIASAA 327


>gi|343471272|emb|CCD16264.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 447

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 87/311 (27%), Positives = 147/311 (47%), Gaps = 25/311 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQEILQR 94
           F  +  K++R+Y D  E   RF  FKQ  +   E         +G +  SD SP+E+   
Sbjct: 41  FAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEL--- 97

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
               L G +     A  +R +K +N    G  P ++DWR  K   + PV+ Q +CGSCWA
Sbjct: 98  RATYLNGAK--YYAAALKRPRKVVN-VSTGKAPPAVDWR--KKGAVTPVKDQRKCGSCWA 152

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQYGLESQA 211
           F+ T  +E Q  +    L  LS+  LV CD+ +  C GG +D A +++    +  + ++ 
Sbjct: 153 FSATGNIEGQWKVAGHELTSLSEQMLVSCDNMDDGCQGGLMDRALKWIVSSNKGNVFTEE 212

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIESY 269
            YPY + +     C    +     +         ++ +   L ++GP+ + ++      Y
Sbjct: 213 SYPYDSTDGDVPPCNMSGKVVGAKISGHINLPKDENAIAEWLAKNGPVAIAVDASSFLDY 272

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
            G  +     +C+   L+H V +VGY + +    WI++NSWG    + GY ++E+G N C
Sbjct: 273 KGGVLT----SCSSDALNHDVLLVGYDDTSKPPYWIIKNSWGKKWGEEGYIRVEKGTNQC 328

Query: 330 GIESYAYLASV 340
            ++ YA  A V
Sbjct: 329 LMKEYARSAVV 339


>gi|294897727|ref|XP_002776051.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239882576|gb|EER07867.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 361

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 82/227 (36%), Positives = 118/227 (51%), Gaps = 17/227 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           L  S+DWR   V  L P++ QG CGSCWAF++T  LE+Q A+    L  LS+ QLV+C  
Sbjct: 122 LAASVDWRNKSV--LTPIKDQGHCGSCWAFSSTGALEAQYAIATGKLLSLSEQQLVDCSS 179

Query: 184 DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---- 239
            +GN  CNGG +  A++Y+K  G++ ++ YPY   +N T + + EK    + V +     
Sbjct: 180 SYGNHGCNGGWMQYAYDYIKSSGIDQESTYPYEASDN-TCQKSLEKLSDGLPVGEVTGYH 238

Query: 240 WVTSGVDHMMHLLQSGPIGV--YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
            +      +M  L + P+ V  Y +    + Y       +   CN   LDHAV  VGYG 
Sbjct: 239 MLEQTEQALMTRLVAAPVSVAMYASDPDFQFYKSGVYSSD--TCNG-GLDHAVVAVGYGN 295

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERGA---NACGIESYAYLASVK 341
           +NG   +I RNSWG      GYF ++RG      C I  Y  +A +K
Sbjct: 296 ENGEDYFIGRNSWGTSWGQDGYFYLKRGVPGYGECTILEYMCVADLK 342


>gi|33333712|gb|AAQ11974.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 71/212 (33%), Positives = 110/212 (51%), Gaps = 13/212 (6%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC---DH 185
           ++DWR+     + PV+ Q  CGSCWAF+    +E Q      TL  LS  +LV+C   D+
Sbjct: 115 AVDWREEGA--VTPVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDY 172

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
           GN  C GG +  AF++V+  G++++  YPY  +     R + +K    V    T+V    
Sbjct: 173 GNNGCKGGLMGQAFDFVQDEGIQTEESYPYEGR-----RSSCKKSGEYVTKVKTYVFPLD 227

Query: 246 DHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHK-LDHAVAIVGYGEKNGIL 302
           +  M   +   GP+ V +    +  YD   +       N  + L+H V +VGYG +NG+ 
Sbjct: 228 EQEMARTVAAKGPVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYGSENGVD 287

Query: 303 TWIVRNSWGDIGPDHGYFQIERGANACGIESY 334
            WIV+NSWG    + GYF++++   ACGI  Y
Sbjct: 288 YWIVKNSWGADWGEKGYFRLKKDVKACGIGYY 319


>gi|194882211|ref|XP_001975206.1| GG20691 [Drosophila erecta]
 gi|190658393|gb|EDV55606.1| GG20691 [Drosophila erecta]
          Length = 378

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 84/255 (32%), Positives = 130/255 (50%), Gaps = 24/255 (9%)

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
           P+ + Q TGL+ + + K R  A  + V        K P+P + DWR+     + PV+ QG
Sbjct: 128 PEFLSQLTGLKRSPEAKARAAASLKEVI-----LPKKPIPDAFDWREHGG--VTPVKFQG 180

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC----DHGNLNCNGGNIDVAFEYVK 203
            CGSCWAFATT  +E        +L  LS+  LV+C    D     C+GG  + AF ++ 
Sbjct: 181 TCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPLEDFSLNGCDGGFQEAAFCFID 240

Query: 204 --QYGLESQADYPYR-NKENITFRCTYEKEKAKVFVQDTWVTSGVD--HMMHLLQS-GPI 257
             Q G+     YPY+ NKE     C Y+ +K+   ++        D   +  ++ + GP+
Sbjct: 241 EVQKGVSQAGAYPYKDNKET----CKYDGKKSGASLKGFAAIPPKDEEQLKKVVATLGPV 296

Query: 258 GVYLNH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPD 316
              +N    +++Y G     ND  CN  + +H++ +VGYG +NG   WI++NSW D   +
Sbjct: 297 ACSVNGLETLKNYAGG--IYNDDECNKGEPNHSILVVGYGSENGQDYWIIKNSWDDTWGE 354

Query: 317 HGYFQIERGANACGI 331
            GYF++ RG N C I
Sbjct: 355 QGYFRLPRGQNYCFI 369


>gi|72389859|ref|XP_845224.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359932|gb|AAX80357.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801759|gb|AAZ11665.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 450

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 81/300 (27%), Positives = 142/300 (47%), Gaps = 25/300 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD--------EYYGTSGSSDRSPQEILQR 94
           F  +  K+ + Y D  E   RF  F+++ ++            +G +  SD + +E   R
Sbjct: 41  FAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRAR 100

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
                         A ++R++K +N    G  P ++DWR+     + PV+ QG+CGSCWA
Sbjct: 101 YR-----NGASYFAAAQKRLRKTVN-VTTGRAPAAVDWREKGA--VTPVKDQGQCGSCWA 152

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY---GLESQA 211
           F+T   +E Q  +    L  LS+  LV CD  +  C GG +D AF ++       + ++A
Sbjct: 153 FSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEA 212

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRLIESY 269
            YPY +      +C     +    + D   +    D +  +L ++GP+ + ++      Y
Sbjct: 213 SYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDY 272

Query: 270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANAC 329
           +G  +     +C   +LDH V +VGY + +    WI++NSW ++  + GY +IE+G N C
Sbjct: 273 NGGILT----SCTSEQLDHGVLLVGYNDNSNPPYWIIKNSWSNMWGEDGYIRIEKGTNQC 328


>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
 gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
           Precursor
 gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
 gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
 gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
          Length = 462

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 153/313 (48%), Gaps = 42/313 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKT--RFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLR-- 98
           ++ ++VK  +  + ++ ++   RFE FK + +  DE+         + + +  R GL   
Sbjct: 50  YEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEH---------NEKNLSYRLGLTRF 100

Query: 99  --LTGKE------KERLEADRERVKKFLNERKKG-PLPKSLDWRQSKVKVLNPVESQGRC 149
             LT  E        ++E   ER      E + G  LP+S+DWR  K   +  V+ QG C
Sbjct: 101 ADLTNDEYRSKYLGAKMEKKGERRTSLRYEARVGDELPESIDWR--KKGAVAEVKDQGGC 158

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGL 207
           GSCWAF+T   +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++ K  G+
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--N 262
           ++  DYPY+  +     C   ++ AKV   D++    T   + +   +   PI + +   
Sbjct: 219 DTDKDYPYKGVDGT---CDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAG 275

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
            R  + YD       D +C   +LDH V  VGYG +NG   WIVRNSWG    + GY ++
Sbjct: 276 GRAFQLYDSGIF---DGSCGT-QLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRM 331

Query: 323 ER----GANACGI 331
            R     +  CGI
Sbjct: 332 ARNIASSSGKCGI 344


>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
 gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
          Length = 462

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 153/313 (48%), Gaps = 42/313 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKT--RFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLR-- 98
           ++ ++VK  +  + ++ ++   RFE FK + +  DE+         + + +  R GL   
Sbjct: 50  YEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEH---------NEKNLSYRLGLTRF 100

Query: 99  --LTGKE------KERLEADRERVKKFLNERKKG-PLPKSLDWRQSKVKVLNPVESQGRC 149
             LT  E        ++E   ER      E + G  LP+S+DWR  K   +  V+ QG C
Sbjct: 101 ADLTNDEYRSKYLGAKMEKKGERRTSLRYEARVGDELPESIDWR--KKGAVAEVKDQGGC 158

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGL 207
           GSCWAF+T   +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++ K  G+
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--N 262
           ++  DYPY+  +     C   ++ AKV   D++    T   + +   +   PI + +   
Sbjct: 219 DTDKDYPYKGVDGT---CDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAG 275

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
            R  + YD       D +C   +LDH V  VGYG +NG   WIVRNSWG    + GY ++
Sbjct: 276 GRAFQLYDSGIF---DGSCGT-QLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRM 331

Query: 323 ER----GANACGI 331
            R     +  CGI
Sbjct: 332 ARNIASSSGKCGI 344


>gi|33333706|gb|AAQ11971.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 71/212 (33%), Positives = 110/212 (51%), Gaps = 13/212 (6%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC---DH 185
           ++DWR+     + PV+ Q  CGSCWAF+    +E Q      TL  LS  +LV+C   D+
Sbjct: 115 AVDWREEGA--VTPVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDY 172

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
           GN  C GG +  AF++V+  G++++  YPY  +     R + +K    V    T+V    
Sbjct: 173 GNNGCKGGLMGQAFDFVQDEGIQTEESYPYEGR-----RSSCKKSGEYVTKVKTYVFPLD 227

Query: 246 DHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHK-LDHAVAIVGYGEKNGIL 302
           +  M   +   GP+ V +    +  YD   +       N  + L+H V +VGYG +NG+ 
Sbjct: 228 EQEMARTVAAKGPVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYGSENGVD 287

Query: 303 TWIVRNSWGDIGPDHGYFQIERGANACGIESY 334
            WIV+NSWG    + GYF++++   ACGI  Y
Sbjct: 288 YWIVKNSWGADWGEKGYFRLKKDVKACGIGYY 319


>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 117/224 (52%), Gaps = 18/224 (8%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G LPKS+DWR S +  ++ V+ QG CGSCWAF+TT  LE Q +     L  LS+ QLV+C
Sbjct: 112 GTLPKSVDWRNSHM--VSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDC 169

Query: 184 --DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEK-AKVFVQDT 239
             D GN  C GG +D AF+Y+K   GL+++  YPY   ++    C ++        V   
Sbjct: 170 SKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKP--CKFDNSSVGATLVGYK 227

Query: 240 WVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            V SG +H +   +   GP+ V ++  H   + Y       ++  C+  +LDH V  VGY
Sbjct: 228 DVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYSSGVY--DEPQCSTEQLDHGVLAVGY 285

Query: 296 GEKNG---ILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G  N       WIV+NSWG    D GY  + R   N CGI + A
Sbjct: 286 GAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQCGIATSA 329


>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
          Length = 367

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 93/305 (30%), Positives = 148/305 (48%), Gaps = 25/305 (8%)

Query: 49  KWNRTYTDD---NEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKE 105
           +W   YT      E + RF  FK++ K  +E        D+  +  L + G  LT  E  
Sbjct: 46  RWRSVYTSARSFGEKQNRFHVFKENVKYINEV----NKMDKPYKLRLNQFG-DLTPSEFA 100

Query: 106 RLEADRERVKKFLNER-----KKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAI 160
           R  A+ + ++   NE      +   +P+S+DWR      + PV++QGRCG CWAF+  A 
Sbjct: 101 RTYANSKIIEGTRNESGGFMYENVEVPRSIDWRVKGA--VTPVKNQGRCGGCWAFSAAAA 158

Query: 161 LESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKE 219
           +E    +    L  LS+ QL++CD  N  C GG +  AFEY+KQ  G+ S+A+YPY+ + 
Sbjct: 159 VEGINQITTGQLISLSEQQLIDCDTQNSGCRGGTMGRAFEYIKQRGGITSEANYPYKAQA 218

Query: 220 NITFRCTYEKEKAKVFVQDTW--VTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRN 277
            +      ++    +   D +  +    D ++ +L   P+ V ++     S D     + 
Sbjct: 219 GMCKNNLIQRPTVSI---DGYYNIRRSEDAVLKILAHQPVSVAVDATTWSSLDWMFYFQG 275

Query: 278 DWA--CNPHKLDHAVAIVGYGEKN-GILTWIVRNSWGDIGPDHGYFQIERGANACGIESY 334
            +   C   KL+H V  VGYG  N G   WI++NSWG+   + GY ++ RG +  G+   
Sbjct: 276 VFTGPCGT-KLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYMRMLRGVSPYGLCGI 334

Query: 335 AYLAS 339
           A  AS
Sbjct: 335 AMQAS 339


>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
          Length = 457

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 95/308 (30%), Positives = 147/308 (47%), Gaps = 32/308 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           ++ ++VK  + Y    E + RFE FK + +  DE+         G +  +D + +E    
Sbjct: 42  YEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDEHNSENRTYRVGLNRFADLTNEEYRSM 101

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
               L+G  + +L    +R       R    LP S+DWR+    V   V+ QG CGSCWA
Sbjct: 102 YLGALSGIRRNKLRKISDR----YTPRVGDSLPDSVDWRKEGAVV--GVKDQGSCGSCWA 155

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQAD 212
           F+  A +E    ++   L  LS+ +LV+CD+  N  CNGG +D  FE++    G++S+ D
Sbjct: 156 FSAVAAVEGINKIVTGDLISLSEQELVDCDNSYNEGCNGGLMDYGFEFIINNGGIDSEED 215

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHRLIE 267
           YPY  ++    RC   ++ A+V   D++    V++   L   + + P+ V +    R  +
Sbjct: 216 YPYLARDG---RCDTYRKNARVVSIDSYEDVPVNNEAALQKAVANQPVSVAIEAGGRDFQ 272

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG-- 325
            Y           C    LDH V  VGYG +NG   WIVRNSWG    + GY ++ R   
Sbjct: 273 LYSSGVFSGR---CGT-ALDHGVVAVGYGTENGQDYWIVRNSWGKSWGESGYLRMARNIR 328

Query: 326 --ANACGI 331
                CGI
Sbjct: 329 KPTGICGI 336


>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
          Length = 345

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 80/240 (33%), Positives = 127/240 (52%), Gaps = 17/240 (7%)

Query: 106 RLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQV 165
           +L ++R  +     E     LPK +DWR  K   + PV+ QG CGSCW+F+ T  LE Q 
Sbjct: 102 QLRSERMPIGASFIEPANVALPKKVDWR--KEGAVTPVKDQGHCGSCWSFSATGALEGQH 159

Query: 166 ALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENIT 222
                 L  LS+  L++C   +GN  CNGG +D AF+Y+K   GL+++A YPY  + +  
Sbjct: 160 FRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEASYPYEAEND-- 217

Query: 223 FRCTYEKEKAKVF-VQDTWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRN 277
            +C Y    +    V    + +G + ++   +   GP+ V ++  H+  + Y        
Sbjct: 218 -KCRYNPANSGAIDVGYIDIPTGNEKLLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEP 276

Query: 278 DWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
           +  C+  +LDH V ++GYG  +NG   W+V+NSWG+   ++GY ++ R   N CGI S A
Sbjct: 277 E--CSSEELDHGVLVIGYGTNENGEDYWLVKNSWGETWGNNGYIKMARNKLNHCGIASSA 334


>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
 gi|255636729|gb|ACU18700.1| unknown [Glycine max]
          Length = 341

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 93/312 (29%), Positives = 157/312 (50%), Gaps = 35/312 (11%)

Query: 46  YIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGS---------SDRSPQEILQRTG 96
           ++ ++ + Y D  E + RF+ FK + +  + +               +D   +E   +  
Sbjct: 38  WMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEF--KAL 95

Query: 97  LRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG-RCGSCWAF 155
           L    K+  R+E   E   ++ N  K   +P ++DWR  K   + P++ QG  CGSCWAF
Sbjct: 96  LNNVQKKASRVETATETSFRYENVTK---IPSTMDWR--KRGAVTPIKDQGYTCGSCWAF 150

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECDHGNL-NCNGGNIDVAFEYVK-QYGLESQADY 213
           AT A +ES   +    L  LS+ +LV+C  G+   C GG ++ AFE++  + G+ S+A Y
Sbjct: 151 ATVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEFIANKGGITSEAYY 210

Query: 214 PYRNKENITFRCTYEKEK---AKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLI--ES 268
           PY+ K+     C  +KE    A++   ++  ++    ++  + + P+ VY++   I  + 
Sbjct: 211 PYKGKDR---SCKVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSVYIDAGAIAFKF 267

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGE-KNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
           Y        +  C  H LDHAVA+VGYG+ ++G   W+V+NSW     + GY +I+R   
Sbjct: 268 YSSGIFEARN--CGTH-LDHAVAVVGYGKLRDGTKYWLVKNSWSTAWGEKGYMRIKRDIR 324

Query: 328 A----CGIESYA 335
           A    CGI S A
Sbjct: 325 AKKGLCGIASNA 336


>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
          Length = 328

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 79/220 (35%), Positives = 124/220 (56%), Gaps = 14/220 (6%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G LP  +DWRQ     + PV+  G+CGSCWAF++T  L  Q+ L  K L  LS+ QLV+C
Sbjct: 110 GKLPAKVDWRQKGA--VTPVKDPGQCGSCWAFSSTGSLGGQLFLKNKKLVSLSEQQLVDC 167

Query: 184 --DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
             ++GN  C+GG +  AF+Y+K   G++++  YPY  +++   +C Y K K+       +
Sbjct: 168 SGNYGNDGCDGGIMVQAFQYIKGNGGIDTEGSYPYEAEDD---KCRY-KTKSVAGTDKGY 223

Query: 241 V--TSGVDHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           V    G ++ +   + + GPI V ++   +     +    ++  C+  +LDH V +VGYG
Sbjct: 224 VDIAQGDENALKEAVAEIGPISVAIDAGNLSFQFYSEGIYDEPFCSNTELDHGVLVVGYG 283

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            +NG   W+V+NSWG    ++GY +I R   N CGI S A
Sbjct: 284 TENGQDYWLVKNSWGPSWGENGYIKIARNHNNHCGIASMA 323


>gi|432108215|gb|ELK33129.1| Cathepsin L1 [Myotis davidii]
          Length = 334

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 87/248 (35%), Positives = 125/248 (50%), Gaps = 22/248 (8%)

Query: 107 LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVA 166
            +  ++R  K   E     +P S+DWR      + PV++QG+CGSCWAF+ T  LE Q+ 
Sbjct: 95  FQNQKQRNGKVFREPLFAQIPSSVDWRDKGY--VTPVKNQGQCGSCWAFSATGSLEGQMF 152

Query: 167 LLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITF 223
                L  LS+  LV+C    GN  CNGG +D AF+YVK   GL+++  YPY  +E+ T 
Sbjct: 153 RKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFQYVKDNKGLDTEESYPYLARESNT- 211

Query: 224 RCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRN 277
            C Y  E +     DT           LL++    GPI V ++  H   + Y+       
Sbjct: 212 -CNYRPEYSA--ANDTGFVDIPQREKALLKAVATVGPISVAIDAGHSSFQFYNAGIYYEP 268

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILT----WIVRNSWGDIGPDHGYFQIERG-ANACGIE 332
           +  C+   LDH V +VGYG + G       WIV+NSWG     +GY ++ R  +N CGI 
Sbjct: 269 N--CSSKDLDHGVLVVGYGSEGGESKNNKFWIVKNSWGSGWGMNGYVKMARDQSNHCGIA 326

Query: 333 SYAYLASV 340
           + A   +V
Sbjct: 327 TAASYPTV 334


>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 117/224 (52%), Gaps = 18/224 (8%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G LPKS+DWR S +  ++ V+ QG CGSCWAF+TT  LE Q +     L  LS+ QLV+C
Sbjct: 112 GTLPKSVDWRNSHM--VSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDC 169

Query: 184 --DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEK-AKVFVQDT 239
             D GN  C GG +D AF+Y+K   GL+++  YPY   ++    C ++        V   
Sbjct: 170 SKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKP--CKFDNSSVGATLVGYK 227

Query: 240 WVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            V SG +H +   +   GP+ V ++  H   + Y       ++  C+  +LDH V  VGY
Sbjct: 228 DVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYSSGVY--DEPQCSTEQLDHGVLAVGY 285

Query: 296 GEKNG---ILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G  N       WIV+NSWG    D GY  + R   N CGI + A
Sbjct: 286 GAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQCGIATSA 329


>gi|33333702|gb|AAQ11969.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 71/212 (33%), Positives = 110/212 (51%), Gaps = 13/212 (6%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC---DH 185
           ++DWR+     + PV+ Q  CGSCWAF+    +E Q      TL  LS  +LV+C   D+
Sbjct: 115 AVDWREEGA--VTPVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDY 172

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
           GN  C GG +  AF++V+  G++++  YPY  +     R + +K    V    T+V    
Sbjct: 173 GNNGCKGGLMGQAFDFVQDEGIQTEESYPYEGR-----RSSCKKSGEYVTKVKTYVFPLD 227

Query: 246 DHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHK-LDHAVAIVGYGEKNGIL 302
           +  M   +   GP+ V +    +  YD   +       N  + L+H V +VGYG +NG+ 
Sbjct: 228 EQEMARTVAAKGPVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYGSENGVD 287

Query: 303 TWIVRNSWGDIGPDHGYFQIERGANACGIESY 334
            WIV+NSWG    + GYF++++   ACGI  Y
Sbjct: 288 YWIVKNSWGADWGEKGYFRLKKDVKACGIGYY 319


>gi|195584238|ref|XP_002081921.1| GD11280 [Drosophila simulans]
 gi|194193930|gb|EDX07506.1| GD11280 [Drosophila simulans]
          Length = 382

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 85/259 (32%), Positives = 131/259 (50%), Gaps = 23/259 (8%)

Query: 84  SDRSPQEIL-QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +D +  E L Q TGL+ + + K R  A  + V          P+P++ DWR+     + P
Sbjct: 127 ADLTHSEFLSQLTGLKRSPEAKARAAASLKEVA-----LPAKPIPEAFDWREHGG--VTP 179

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC----DHGNLNCNGGNIDVA 198
           V+ QG CGSCWAFATT  +E        +L  LS+  LV+C    D G   C+GG  + A
Sbjct: 180 VKFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVQDFGLNGCDGGFQEAA 239

Query: 199 FEYVK--QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD--HMMHLLQS 254
           F ++   Q G+     YPY + ++    C Y+  K+   +Q        D   +  ++ +
Sbjct: 240 FCFIDEVQKGVSQAGAYPYIDNKDT---CKYDGSKSGATLQGFAAIPPKDEEQLKKVVAT 296

Query: 255 -GPIGVYLNH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGD 312
            GP+   +N    +++Y G     ND  CN  + +H++ +VGYG +NG   WIV+NSW D
Sbjct: 297 LGPVACSVNGLETLKNYAGG--IYNDDECNKGEPNHSILVVGYGSENGQDYWIVKNSWDD 354

Query: 313 IGPDHGYFQIERGANACGI 331
              + GYF++ RG N C I
Sbjct: 355 TWGEQGYFRLPRGQNYCFI 373


>gi|74222595|dbj|BAE38161.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 118/224 (52%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +PKS+DWR+     + PV++QG+CGSCWAF+ +  LE Q+ L    L  LS+  LV+C H
Sbjct: 114 IPKSVDWREKGC--VTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH 171

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF+Y+K+  GL+S+  YPY  K+     C Y  E A     DT   
Sbjct: 172 AQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDG---SCKYRAEFA--VANDTGFV 226

Query: 243 SGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
                   L+++    GPI V ++  H  ++ Y        +  C+   LDH V +VGYG
Sbjct: 227 DIPQQEKALMKAVATVGPISVAMDASHPSLQFYSLGIYYEPN--CSSKNLDHGVLLVGYG 284

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
               + N    W+V+NSWG      GY +I +   N CG+ + A
Sbjct: 285 YEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAA 328


>gi|195335257|ref|XP_002034291.1| GM21790 [Drosophila sechellia]
 gi|194126261|gb|EDW48304.1| GM21790 [Drosophila sechellia]
          Length = 382

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 85/259 (32%), Positives = 132/259 (50%), Gaps = 23/259 (8%)

Query: 84  SDRSPQEIL-QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +D +  E L Q TGL+ + + K R  A  + V     +    P+P++ DWR+     + P
Sbjct: 127 ADLTHSEFLSQLTGLKRSPEAKARAAASLKEV-----DLPAKPIPEAFDWREHGG--VTP 179

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC----DHGNLNCNGGNIDVA 198
           V+ QG CGSCWAFATT  +E        +L  LS+  LV+C    D G   C+GG  + A
Sbjct: 180 VKFQGVCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVQDFGLNGCDGGFQEAA 239

Query: 199 FEYVK--QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD--HMMHLLQS 254
           F ++   Q G+     YPY + ++    C Y+  K+   +Q        D   +  ++ +
Sbjct: 240 FCFIDEVQKGVSQAEAYPYIDNKDT---CKYDGSKSGATLQGFAAIPPKDEEQLKKVVAT 296

Query: 255 -GPIGVYLNH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGD 312
            GP+   +N    +++Y G     ND  CN  + +H++ +VGYG +NG   WIV+NSW D
Sbjct: 297 LGPVACSVNGLETLKNYAGG--IYNDDECNKGEPNHSILVVGYGSENGQDYWIVKNSWDD 354

Query: 313 IGPDHGYFQIERGANACGI 331
              + GYF++ RG N C I
Sbjct: 355 TWGEQGYFRLPRGQNFCFI 373


>gi|403364285|gb|EJY81901.1| Cathepsin H [Oxytricha trifallax]
          Length = 363

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 76/215 (35%), Positives = 114/215 (53%), Gaps = 16/215 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LP + DWR+     + PV+ QG CGSCW F+T   LE+   +  +    LS+ QLV+C  
Sbjct: 135 LPANWDWREHNG--VTPVKDQGSCGSCWTFSTVGTLEAHFLIKYQQSRNLSEQQLVDCAG 192

Query: 185 -HGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWV- 241
            + N  CNGG    AF+Y+   G + ++A YPY  K+     CT ++ +  V V    V 
Sbjct: 193 AYDNYGCNGGLPSHAFQYISDNGGIATEAAYPYFAKDR---PCTIQQSQKSVGVVGGSVN 249

Query: 242 -TSGVDHM-MHLLQSGPIGVYLNHRLIES---YDGNPIRRNDWACNPHKLDHAVAIVGYG 296
            T   D + + + Q GP+ +   + +I+    Y        D    P  ++HAV  VG+G
Sbjct: 250 LTKSEDELAIAIFQHGPVSIA--YEVIDDFMDYHSGVYTTKDCKNGPDDVNHAVVAVGFG 307

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGANACGI 331
            +NG+  W+V+NSW     D+GYF+I+RG N CGI
Sbjct: 308 TENGVDYWLVKNSWSTKWGDNGYFKIQRGVNMCGI 342


>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 323

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 84/262 (32%), Positives = 127/262 (48%), Gaps = 14/262 (5%)

Query: 84  SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           +D + +E     GL+    + +R         +F  E     LP  +DWR+     + PV
Sbjct: 71  TDMTSEEFRNFKGLKFDATKTKR------NGTRFQKELLGEALPTQVDWREKGY--VTPV 122

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEY 201
           ++QG+CGSCWAF+TT  LE Q       L  LS+  LV+C    GN  CNGG +D  F Y
Sbjct: 123 KNQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTY 182

Query: 202 VKQYG-LESQADYPYRNKE-NITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGV 259
           ++Q G ++++  YPY  K+ +  F       + K FV D            +   GP+ V
Sbjct: 183 IQQNGGIDTEESYPYTGKDGDCAFNENSVGARVKGFV-DVPQRDEAALQAAVASVGPVSV 241

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGY 319
            ++              ++ +C+  +LDH V +VGYG +NG+  W+V+NSWG      GY
Sbjct: 242 AIDASNDSFQYYKEGVYDEPSCSFSQLDHGVLVVGYGTENGVDYWLVKNSWGPTWGQDGY 301

Query: 320 FQIERGA-NACGIESYAYLASV 340
            ++ R   N CGI S A   +V
Sbjct: 302 IKMMRNKENQCGIASMASYPTV 323


>gi|313221001|emb|CBY31833.1| unnamed protein product [Oikopleura dioica]
 gi|313229611|emb|CBY18426.1| unnamed protein product [Oikopleura dioica]
          Length = 362

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 82/221 (37%), Positives = 120/221 (54%), Gaps = 19/221 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           L KS+DWR+     +  V+ QG+CGSCW+F+ T  LE Q+A +   L  LS+  LV+C  
Sbjct: 135 LDKSVDWREKGA--VTEVKDQGQCGSCWSFSATGALEGQMAQVFGKLPDLSEQNLVDCSR 192

Query: 185 -HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--- 239
             GN  CNGG +D AF+YVK Q GL+ +  YPY   +N    C Y+K   +    DT   
Sbjct: 193 PEGNQGCNGGLMDAAFQYVKDQDGLDGEDWYPYEGVDNK--ECRYDKSHREA--DDTGFK 248

Query: 240 WVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            +  G +  +   L + GP+ V ++  +   + Y        +  C+P  LDH V  VGY
Sbjct: 249 MIPEGNEKALKHALAKVGPVSVAIDASNPSFQFYQSGVYYEPN--CSPENLDHGVLAVGY 306

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
           G ++G   ++V+NSW +   D+GY ++ R   N CGI SYA
Sbjct: 307 GTEDGEHYYLVKNSWSEAWGDNGYIKMARNKENHCGIASYA 347


>gi|148709374|gb|EDL41320.1| cathepsin 7, isoform CRA_c [Mus musculus]
          Length = 277

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 93/257 (36%), Positives = 136/257 (52%), Gaps = 25/257 (9%)

Query: 100 TGKEKERL-EADRERVKKFLNERKKGP-LPKSLDWRQSKVKVLNPVESQGRCGSCWAFAT 157
           TG+E + L E+    ++   + +K+ P +P +LDWR  K   + PV  QG CG+CWAF+ 
Sbjct: 30  TGEEMKMLTESSSYPLRNGKHIQKRNPKIPPTLDWR--KEGYVTPVRRQGSCGACWAFSV 87

Query: 158 TAILESQVALLKKT--LYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQY-GLESQAD 212
           TA +E Q  L KKT  L PLS   L++C   +G   C+GG    AF+YVK   GLE++A 
Sbjct: 88  TACIEGQ--LFKKTGKLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEAT 145

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYLN--HRLIES 268
           YPY  K      C Y  E++ V V   +V    +  +   L+  GPI V ++  H    S
Sbjct: 146 YPYEAKAK---HCRYRPERSVVKVNRFFVVPRNEEALLQALVTHGPIAVAIDGSHASFHS 202

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYG----EKNGILTWIVRNSWGDIGPDHGYFQIER 324
           Y G     ++  C    LDH + +VGYG    E      W+++NS G+   ++GY ++ R
Sbjct: 203 YRGGIY--HEPKCRKDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGERWGENGYMKLPR 260

Query: 325 GANA-CGIESYAYLASV 340
           G N  CGI SYA   ++
Sbjct: 261 GQNNYCGIASYAMYPAL 277


>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
          Length = 350

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 85/240 (35%), Positives = 123/240 (51%), Gaps = 21/240 (8%)

Query: 110 DRERVKKFLNERKKGP-----LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQ 164
           +R  V+  L+     P     +P  +DWR  K   + PV++QG+CGSCWAF+TT  LE Q
Sbjct: 113 NRTEVRDHLHANYISPAIPVSVPAEVDWR--KEGYVTPVKNQGQCGSCWAFSTTGSLEGQ 170

Query: 165 VALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENI 221
                  L  LS+  LV+C   +GN  CNGG +D AF+Y+K   G +++A YPY   E +
Sbjct: 171 HFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDYAFQYIKDNDGDDTEACYPY---EAV 227

Query: 222 TFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRR 276
              C ++           T +  G +  M   +   GP+ V ++  H   + Y       
Sbjct: 228 DGTCRFKSVCVGATCTGYTDLPKGDEAKMKEAVALVGPVSVAIDASHSSFQMYQSGIYVE 287

Query: 277 NDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            +  C+P +LDHAV +VGYG + G   W+V+NSWG    D GY ++ R   N CGI S A
Sbjct: 288 QE--CSPKQLDHAVLVVGYGTEQGQDYWLVKNSWGTTWGDEGYIKMARNMDNQCGIASQA 345


>gi|313213752|emb|CBY40632.1| unnamed protein product [Oikopleura dioica]
          Length = 440

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 76/219 (34%), Positives = 113/219 (51%), Gaps = 14/219 (6%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P S DWR +   V+ PV+ QG+CGSCWAF+T A LESQ AL    L  LS+ QLV+C  
Sbjct: 222 MPASADWRTANPPVVTPVKDQGQCGSCWAFSTIASLESQWALAGNALTSLSEQQLVDCSM 281

Query: 186 --GNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-V 241
             GN  C+GG +   F Y+    G++++A YPY  ++    +C +        +   + +
Sbjct: 282 NWGNYGCSGGLMTQGFTYIHDNNGVDTEASYPYTAQDG---KCVFNPANVGTSLTSCYNI 338

Query: 242 TSGVDHMM--HLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
            SG +  +   +   GP+ V ++  H   + Y        +  C+   LDH V  VGYG 
Sbjct: 339 ASGDEAALANAVQMVGPMSVAIDASHMSFQLYTSGVYYEPN--CSSQFLDHGVTAVGYGS 396

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            +G   +IV+NSW     D+GY  + R   N CGI + A
Sbjct: 397 SSGNDFFIVKNSWAATWGDNGYIMMSRNKNNNCGIATSA 435


>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 100/318 (31%), Positives = 153/318 (48%), Gaps = 37/318 (11%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K ++ F++++ +  + Y +  E   RFE FK + K  DE        + G +  +D S +
Sbjct: 43  KLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHR 102

Query: 90  EILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E   +  GL++    +      RE  ++F    K   LPKS+DWR  K   + PV++QG 
Sbjct: 103 EFNNKYLGLKVDYSRR------RESPEEFT--YKDVELPKSVDWR--KKGAVAPVKNQGS 152

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYG 206
           CGSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AF + V+  G
Sbjct: 153 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGG 212

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL-- 261
           L  + DYPY  +E     C   KE+ +V     +     +    ++  L + P+ V +  
Sbjct: 213 LHKEEDYPYIMEEGT---CEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEA 269

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           + R  + Y G      D  C    LDH VA VGYG   G+    V+NSWG    + GY +
Sbjct: 270 SGRDFQFYSGGVF---DGHCGS-DLDHGVAAVGYGTAKGVDYITVKNSWGSKWGEKGYIR 325

Query: 322 IERGA----NACGIESYA 335
           + R        CGI   A
Sbjct: 326 MRRNIGKPEGICGIYKMA 343


>gi|163310848|pdb|2O6X|A Chain A, Crystal Structure Of Procathepsin L1 From Fasciola
           Hepatica
          Length = 310

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 83/224 (37%), Positives = 121/224 (54%), Gaps = 18/224 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P  +DWR+S    +  V+ QG CGS WAF+TT  +E Q    ++T    S+ QLV+C  
Sbjct: 92  VPDKIDWRESGY--VTEVKDQGNCGSGWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSR 149

Query: 186 --GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             GN  C GG ++ A++Y+KQ+GLE+++ YPY   E    +C Y K+     V   + V 
Sbjct: 150 PWGNNGCGGGLMENAYQYLKQFGLETESSYPYTAVEG---QCRYNKQLGVAKVTGFYTVH 206

Query: 243 SGVDHMMHLL--QSGPIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVAIVGYGE 297
           SG +  +  L    GP  V ++   +ES D    R   +    C+P +++HAV  VGYG 
Sbjct: 207 SGSEVELKNLVGAEGPAAVAVD---VES-DFMMYRSGIYQSQTCSPLRVNHAVLAVGYGT 262

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           + G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 263 QGGTDYWIVKNSWGLSWGERGYIRMVRNRGNMCGIASLASLPMV 306


>gi|440290792|gb|ELP84121.1| cysteine proteinase ACP1 precursor, putative [Entamoeba invadens
           IP1]
          Length = 306

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 76/226 (33%), Positives = 116/226 (51%), Gaps = 15/226 (6%)

Query: 113 RVKKFLNERKK-GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKT 171
           +V + +NE++     P+S+DWR     ++NP + Q +CGSCW F TTA++E +V      
Sbjct: 76  KVPEVINEKRSVKSAPESVDWRS----IMNPAKDQAQCGSCWTFCTTAVMEGRVNKDLGK 131

Query: 172 LYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKE 230
           LY  S+ QL++CD  +  C+GG+ D +F ++K   G+  +A YPY+  +     C    +
Sbjct: 132 LYSYSEQQLIDCDTTDNGCSGGHPDNSFTFIKNNKGITLEASYPYKAADGT---CNTAVK 188

Query: 231 KAKVFVQDTWVTSGVDHMMHLLQS--GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKL 286
                     VT G +  +  + +  GPI V ++      + Y    I  ND  C    +
Sbjct: 189 NVATVAGHKRVTDGNEAGLQEITATYGPIAVGMDASRASFQLYKKGTI-YNDANCKRIVM 247

Query: 287 DHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGI 331
           DH V +VGYG+      WI+RNSWG    D GYF + R   N CGI
Sbjct: 248 DHCVTLVGYGKNTDGEYWIIRNSWGTSWGDEGYFLLARNQNNRCGI 293


>gi|307141900|gb|ADN34745.1| putative cysteine peptidase [Echinococcus granulosus]
          Length = 218

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 82/217 (37%), Positives = 111/217 (51%), Gaps = 12/217 (5%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           +P S+DWR  K  ++ P++ QG CGSCWAF+ T  LE Q+   K  L  LS+ QLV+C  
Sbjct: 7   VPDSIDWR--KKGLVTPIKDQGDCGSCWAFSATGALEGQLKRKKGKLISLSEQQLVDCST 64

Query: 184 DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKEN-ITFRCTYEKEKAKVFVQDTWVT 242
           D GN  CNGG ++ AF Y  Q G ES++DYPY   +    F  +    K   FV+   V 
Sbjct: 65  DMGNEGCNGGYMNDAFRYWMQNGAESESDYPYTAMDGKCKFNSSKVVTKVSKFVK---VP 121

Query: 243 SGVDHMMHL--LQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY-GEKN 299
              +  + L   Q GP+ V ++               D  C+   LDHAV +VGY  +  
Sbjct: 122 KKREDQLKLSVAQVGPVSVAIDAASSGFMLYKKGIYQDNTCSQQYLDHAVLVVGYDADMA 181

Query: 300 GILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
           G   WIV+NSWG+     GY  + R   N CGI + A
Sbjct: 182 GQKYWIVKNSWGEDWGQRGYIWMARDKGNMCGIATMA 218


>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
           Short=CP-2; AltName: Full=Major excreted protein;
           Short=MEP; Contains: RecName: Full=Procathepsin L;
           Contains: RecName: Full=Cathepsin L1 heavy chain;
           Contains: RecName: Full=Cathepsin L1 light chain; Flags:
           Precursor
 gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
 gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
 gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
          Length = 334

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 118/224 (52%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +PK++DWR+     + PV++QG+CGSCWAF+ +  LE Q+ L    L  LS+  LV+C H
Sbjct: 114 IPKTVDWREKGC--VTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH 171

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF+Y+K+  GL+S+  YPY  K+     C Y  E A     DT   
Sbjct: 172 DQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDG---SCKYRAEYA--VANDTGFV 226

Query: 243 SGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
                   L+++    GPI V ++  H  ++ Y        +  C+   LDH V +VGYG
Sbjct: 227 DIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN--CSSKDLDHGVLVVGYG 284

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
               + N    W+V+NSWG      GY +I +   N CG+ + A
Sbjct: 285 YEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAA 328


>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
          Length = 351

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 85/242 (35%), Positives = 123/242 (50%), Gaps = 25/242 (10%)

Query: 110 DRERVKKFLNERKKGP-----LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQ 164
           +R +V+  L+     P     LP  +DWR  K   + P++ QG CGSCW+F+TT  LE Q
Sbjct: 114 NRTKVRDHLHSHYISPAIPVSLPAEVDWR--KEGYVTPIKDQGHCGSCWSFSTTGALEGQ 171

Query: 165 VALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENI 221
                  L  LS+  L++C   +GN  CNGG +D AF+Y+K   G +++  YPY   +  
Sbjct: 172 HFRKTGKLVSLSEQNLIDCSTSYGNNGCNGGVMDYAFQYIKDNDGDDTEDSYPYEAADG- 230

Query: 222 TFRCTYEKEKAKVFVQDTWVTS---GVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPI 274
              C ++KE   V   DT  T    G +  M   +   GP+ V ++  H   + Y     
Sbjct: 231 --PCRFKKEY--VGATDTGYTDLPKGDEEKMKEAVAMVGPVSVAIDASHTSFQMYQSGVY 286

Query: 275 RRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIES 333
             ++  C+P  LDH V +VGYG + G   W+V+NSWG    D GY ++ R   N CGI S
Sbjct: 287 --DEVECDPEGLDHGVLVVGYGTELGQDYWLVKNSWGTKWGDEGYIKMSRNKNNQCGISS 344

Query: 334 YA 335
            A
Sbjct: 345 MA 346


>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 460

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 95/288 (32%), Positives = 140/288 (48%), Gaps = 35/288 (12%)

Query: 63  RFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERV 114
           RFE FK + +  DE+         G +  +D + +E      + L  K K+R+    +R 
Sbjct: 73  RFEIFKDNLRFIDEHNNKNLSYKLGLTRFADLTNEEYRS---IYLGAKSKKRVLKTSDRY 129

Query: 115 KKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYP 174
           +     R    +P S+DWR  K   +  V+ QG CGSCWAF+T   +E    ++   L  
Sbjct: 130 QP----RVGDAIPDSVDWR--KEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLIS 183

Query: 175 LSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKA 232
           LS+ +LV+CD   N  CNGG +D AFE++ K  G++++ DYPY+  +    RC   ++ A
Sbjct: 184 LSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADG---RCDQTRKNA 240

Query: 233 KVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLD 287
           KV   D +     ++   L   L + PI V +    R  + Y        D  C   +LD
Sbjct: 241 KVVTIDAYEDVPENNEAALKKTLANQPISVAIEAGGRAFQLYSSGVF---DGICGT-ELD 296

Query: 288 HAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG----ANACGI 331
           H V  VGYG +NG   WIVRNSWG    + GY ++ R        CGI
Sbjct: 297 HGVVAVGYGTENGKDYWIVRNSWGGSWGESGYIKMARNIAEPTGKCGI 344


>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
          Length = 338

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 89/303 (29%), Positives = 135/303 (44%), Gaps = 25/303 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL----- 97
           F  +  K+ + Y   NE   RF  FK +    D  Y T+  +      + + T L     
Sbjct: 27  FNNFKTKYGKVYNGINEDAVRFGIFKAN---VDIIYATNARNLTFALGVNEFTDLTQEEL 83

Query: 98  --RLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
               TG +   L +   R+    +E    PL  S+DW    V  + PV++QG+CGSCW+F
Sbjct: 84  AASYTGLKPASLWSGLPRLST--HEYNGAPLASSVDWTTQGV--VTPVKNQGQCGSCWSF 139

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPY 215
           +TT  LE   AL    L  LS+ Q V+CD  +  CNGG +D AF + K+  + ++  YPY
Sbjct: 140 STTGALEGAWALSTGNLVSLSEQQFVDCDTTDSGCNGGWMDNAFSFAKKNSICTEGSYPY 199

Query: 216 RNKENIT--FRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYL--NHRLIESYDG 271
              +       C     +  V       T     MM  +   P+ + +  +    + Y  
Sbjct: 200 TATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLYSS 259

Query: 272 NPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER---GANA 328
             +     +C   +LDH V  VGYG + G   W V+NSWG    + GY +++R   GA  
Sbjct: 260 GVLTA---SCG-TRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRGKGGAGE 315

Query: 329 CGI 331
           CG+
Sbjct: 316 CGL 318


>gi|403355691|gb|EJY77431.1| Cathepsin H [Oxytricha trifallax]
          Length = 363

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 112/224 (50%), Gaps = 16/224 (7%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G +P + DWR     V++PV++QG+CGSCW F+T   LES   L       LS+ QLV+C
Sbjct: 133 GSIPTNWDWR--TYGVVSPVKNQGKCGSCWTFSTVGALESHFLLKYGQFRNLSEQQLVDC 190

Query: 184 --DHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
             ++ N  CNGG    AFEY+K   G+  +  YPY     +T  C  +K    V V+   
Sbjct: 191 AGNYDNHGCNGGLPSHAFEYLKDNGGIAEETSYPYV---AVTNTCALKKGSQSVGVKGGA 247

Query: 241 VTSGV---DHMMHLLQSGPIGVYLN-HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           V   +   D    +   GP+ +          Y             P  ++HAV  VG+G
Sbjct: 248 VNVSLSEDDLKQAIYSHGPVSIAFQVASDFRDYRAGVYTSKVCKNGPQDVNHAVLAVGFG 307

Query: 297 -EKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIE---SYAY 336
            ++N +  WI++NSWG +  D GYF++ERG N CG+    SY Y
Sbjct: 308 TDENKVDYWIIKNSWGAVWGDQGYFKMERGVNMCGVSNCNSYPY 351


>gi|74142447|dbj|BAE31977.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 118/224 (52%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +PKS+DWR+     + PV+++G+CGSCWAF+ +  LE Q+ L    L  LS+  LV+C H
Sbjct: 114 IPKSVDWREKGC--VTPVKNKGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH 171

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF+Y+K+  GL+S+  YPY  K+     C Y  E A     DT   
Sbjct: 172 AQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDG---SCKYRAEFA--VANDTGFV 226

Query: 243 SGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
                   L+++    GPI V ++  H  ++ Y        +  C+   LDH V +VGYG
Sbjct: 227 DIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN--CSSKNLDHGVLLVGYG 284

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
               + N    W+V+NSWG      GY +I +   N CG+ + A
Sbjct: 285 YEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAA 328


>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
           supertexta]
          Length = 347

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 87/268 (32%), Positives = 133/268 (49%), Gaps = 22/268 (8%)

Query: 77  YYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSK 136
           Y G +  +D   +E     GLR        ++       ++L        P  +DWR  K
Sbjct: 88  YLGINQFADMKNEEFRMYNGLRRDYNYSREVQCSNHLTPEYL------VAPDEVDWR--K 139

Query: 137 VKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGN 194
              +  V++QG+CGSCW+F+TT  LE Q       L  LS+ QLV+C    GN  CNGG 
Sbjct: 140 KGYVTAVKNQGQCGSCWSFSTTGSLEGQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGL 199

Query: 195 IDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEK-EKAKVFVQDTWVTSG--VDHMMH 250
           +D AFEY +   G+E++ +YPY  ++    RC ++K E A        V SG   D    
Sbjct: 200 MDQAFEYIITNGGIETEEEYPYDARQE---RCHFKKSEVAATASGCVDVKSGDETDLKNS 256

Query: 251 LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRN 308
           + + GP+ + ++  H+  + Y G     ++  C+  +LDH V +VGYG  +G   W+V+N
Sbjct: 257 VAEVGPVSIAIDASHQSFQLYSGGVY--DEPKCSSTELDHGVLVVGYGTDDGQDYWLVKN 314

Query: 309 SWGDIGPDHGYFQIERGA-NACGIESYA 335
           SWG      GY ++ R   N CG+ + A
Sbjct: 315 SWGTTWGLEGYVKMSRNQDNQCGVATQA 342


>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 479

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 91/297 (30%), Positives = 143/297 (48%), Gaps = 28/297 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           +++++V   + Y    E + RFE FK + +  DE+   S    R+ +  L R       +
Sbjct: 62  YESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRES----RTYKVGLTRFADLTNEE 117

Query: 103 EKERLEADRERVKKFLNERKKG--------PLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
            + R    R   K  L+  K G         LP  +DWR  K   +  V+ QG+CGSCWA
Sbjct: 118 YRARFLGGRFSRKPRLSAAKSGRYAAALGDDLPDDVDWR--KKGAVATVKDQGQCGSCWA 175

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQAD 212
           F++ A +E    ++   L PLS+ +LV+CD   N+ CNGG +D AF+++    G++++ D
Sbjct: 176 FSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGIDTEED 235

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--NHRLIE 267
           YPY+ ++     C   ++ AKV   D +     +    +   + + P+ V +    R  +
Sbjct: 236 YPYKGRDAA---CDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQ 292

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
            Y           C    LDH V  VGYG  NG   WIVRNSWG    + GY ++ER
Sbjct: 293 LYQSGVFTGR---CGT-DLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIRLER 345


>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
 gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 105/316 (33%), Positives = 160/316 (50%), Gaps = 30/316 (9%)

Query: 40  VDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYY--GTSGSSDRSPQ 89
           ++ F+ +  +  + Y    E + RF  FK++        GKET   +  G +  +D S +
Sbjct: 40  IEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLRHRVGLNKFADLSNE 99

Query: 90  EILQRTGLRLTGK-EKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E  Q    ++     K R++A+ +R ++ L   +    P SLDWR  K  V+  V+ QG 
Sbjct: 100 EFKQLYLSKVKKPINKTRIDAE-DRSRRNL---QSCDAPSSLDWR--KKGVVTAVKDQGD 153

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV-KQYGL 207
           CGSCW+F+TT  +E   A++   L  LS+ +LV+CD  N  C GG +D AFE+V    G+
Sbjct: 154 CGSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVINNGGI 213

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--VTSGVDHMMHLLQSGPIGVYLNHRL 265
           +++A+YPY   +     C   KE+ KV   D +  V      ++      PI V ++   
Sbjct: 214 DTEANYPYTGVDGT---CNTAKEEIKVVSIDGYKDVDETDSALLCAAAQQPISVGIDGSA 270

Query: 266 I--ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
           I  + Y G  I   D + +P  +DHAV IVGYG +NG   WIV+NSWG      GYF I+
Sbjct: 271 IDFQLYTGG-IYDGDCSDDPDDIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYIK 329

Query: 324 RGAN----ACGIESYA 335
           R  +     C I + A
Sbjct: 330 RNTDLPYGVCAINAMA 345


>gi|390994425|gb|AFM37362.1| cathepsin L2 [Dictyocaulus viviparus]
          Length = 352

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 89/253 (35%), Positives = 133/253 (52%), Gaps = 26/253 (10%)

Query: 98  RLTGKEKERLEAD--RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
           +L G    RL  D  R+   KFL       +P S+DWR+  +  + PV++QG CGSCWAF
Sbjct: 106 KLNGYRHRRLFGDSMRKNGTKFLVPFNV-KVPDSVDWREHNL--VTPVKNQGMCGSCWAF 162

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQAD 212
           + T  LE Q       L  LS+  LV+C   +GN  CNGG +D+AFEY+K  +G++++  
Sbjct: 163 SATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEEG 222

Query: 213 YPYRNKENITFRCTYEKE----KAKVFVQDTWVTSGVDHMMHLLQS--GPIGVYLN--HR 264
           YPY  KE    RC ++K     + + FV    +  G +  + +  +  GPI + ++  HR
Sbjct: 223 YPYVGKE---MRCHFKKRDIGAEDRGFVD---LPEGDEDALKVAVATQGPISIAIDAGHR 276

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIE 323
             + Y        D  C+  +LDH V +VGYG +      WI++NSWG    + GY +I 
Sbjct: 277 SFQLYKKGVYF--DEECSSEELDHGVLLVGYGTDPEAGDYWIIKNSWGTKWGEKGYVRIA 334

Query: 324 RGA-NACGIESYA 335
           R   N CG+ + A
Sbjct: 335 RNRNNHCGVATKA 347


>gi|343412462|emb|CCD21670.1| cysteine peptidase (CP), putative [Trypanosoma vivax Y486]
          Length = 367

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 90/317 (28%), Positives = 140/317 (44%), Gaps = 37/317 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ R+Y    E   R   F+ + + +  Y        +G +  SD +P+E   R
Sbjct: 34  FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93

Query: 95  TGLRLTGKEKER-LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
                     ER  EA R RV+  + +   G  P ++DWR+     + PV+ QG CGSCW
Sbjct: 94  Y------HNGERHFEAARGRVRTLV-QVPPGKAPAAVDWRRKGA--VTPVKDQGTCGSCW 144

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG---LESQ 210
           +F+    +E Q A     L  LS+  LV CD  +  C GG +D AFE++ +     + ++
Sbjct: 145 SFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTE 204

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD-------HMMHLLQSGPIGVYLNH 263
             YPY +       C     K         +T  VD          +L  +GP+ V ++ 
Sbjct: 205 KSYPYVSGGGEEPPCKPRGHKVGAT-----ITGHVDIPHDEDAIAKYLADNGPVAVAVDA 259

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
               SY G  +     +C    L+H V +VGY + +    WI++NSW     + GY +IE
Sbjct: 260 TTFMSYSGGVVT----SCTSEALNHGVLLVGYNDSSKPPYWIIKNSWSSSWGEKGYIRIE 315

Query: 324 RGANACGIESYAYLASV 340
           +G N C +   A  A V
Sbjct: 316 KGTNQCLVAQLASSAVV 332


>gi|321476439|gb|EFX87400.1| hypothetical protein DAPPUDRAFT_312328 [Daphnia pulex]
          Length = 330

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 99/288 (34%), Positives = 136/288 (47%), Gaps = 26/288 (9%)

Query: 61  KTRFEYF----KQDGKETDEYYGT-----SGSSDRSPQEILQRTGLRLTGKEKERLEADR 111
           KTR E F    KQ  K   E  GT     +  SD  P E+    G+  T     +     
Sbjct: 48  KTRKELFRARDKQIKKHNSEKAGTFRKEHNQFSDLWPLELRSYLGVNATAVPSLKFMRSV 107

Query: 112 ERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKT 171
                 ++ + +   P S D R      L  +++QG+CGSCW+F + A LE       K 
Sbjct: 108 S-----VDLQSRAAPPASFDLRYDSC--LPAIKNQGQCGSCWSFTSIAPLEFSKCKKAKV 160

Query: 172 LYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLES-QADYPYRNKENITFRCTYEKE 230
              LS+  LV+CD  N  CNGG    A+ Y+K+ G  + Q  Y Y  K+N T R T    
Sbjct: 161 TTVLSEQHLVDCDTTNGGCNGGWYVTAWTYLKKAGGSAKQTLYNYTAKKN-TCRFTTAMI 219

Query: 231 KAKV----FVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKL 286
            AKV    +VQ    T+     + L Q GP+ V +   +   Y       +D AC+   +
Sbjct: 220 AAKVSSFGYVQSNNATA---MQLALQQYGPLAVAIT-VVPSFYSYASGVYDDNACDGQAV 275

Query: 287 DHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESY 334
           +HAV +VG+G  NG+  WIVRNSWG      GYF ++RG N CGIE+Y
Sbjct: 276 NHAVVLVGWGNLNGVDYWIVRNSWGTNWGLSGYFFMKRGVNKCGIETY 323


>gi|405966500|gb|EKC31778.1| Cathepsin L [Crassostrea gigas]
          Length = 271

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 83/242 (34%), Positives = 123/242 (50%), Gaps = 22/242 (9%)

Query: 106 RLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQV 165
           ++ A+R +   +++    G LP S+DWR  K   +  +++QG CGSCW+F+ T  LE Q 
Sbjct: 35  KMSANRTKGDLYMSPSNIGDLPDSVDWR--KEGYVTDIKNQGHCGSCWSFSATGSLEGQH 92

Query: 166 ALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENIT 222
               K L  LS+  LV+C    GN  C GG +D AF Y++   G++++  YPY  K    
Sbjct: 93  FKASKKLVSLSEQNLVDCSQREGNHGCQGGLMDNAFRYIESNKGIDTEESYPYTAKNGF- 151

Query: 223 FRCTYEKEKAKVFVQDTWVTSGVDHMMH------LLQSGPIGVYLN--HRLIESYDGNPI 274
             C ++KE   V   DT     + HM        +   GPI V ++  H+  + Y     
Sbjct: 152 --CHFKKEN--VGATDTGYVD-IPHMQEDKLQEAVATVGPISVAIDAGHKSFQLYREGVY 206

Query: 275 RRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIES 333
             ++ AC+  KLDH V  VGYG ++G   W+V+NSWG      GY  + R   N CGI +
Sbjct: 207 --SEPACSSSKLDHGVLAVGYGTESGDDYWLVKNSWGTSWGMQGYVMMARNKHNMCGIAT 264

Query: 334 YA 335
            A
Sbjct: 265 QA 266


>gi|33333694|gb|AAQ11965.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 70/212 (33%), Positives = 111/212 (52%), Gaps = 13/212 (6%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC---DH 185
           ++DWR+     + PV+ Q  CGSCWAF+    +E Q      TL  LS  +LV+C   ++
Sbjct: 115 AVDWREEGA--VTPVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEY 172

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
           GN  C GG +  AF++V+  G++++  YPY  +     R + +K    V    T+V    
Sbjct: 173 GNNGCRGGLMGQAFDFVQDEGIQTEESYPYEGR-----RSSCKKSGDYVTKVKTYVFPLD 227

Query: 246 DHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHK-LDHAVAIVGYGEKNGIL 302
           +  M   +   GP+ V +    +  YD   +       N  + L+H V +VGYG +NG+ 
Sbjct: 228 EQEMARTVAAKGPVAVAIEASQLSFYDKGIVDEKCRCSNKREDLNHGVLVVGYGSENGVD 287

Query: 303 TWIVRNSWGDIGPDHGYFQIERGANACGIESY 334
            WIV+NSWG    + GYF++++   ACGI+ Y
Sbjct: 288 YWIVKNSWGADWGEKGYFRLKKDVKACGIDYY 319


>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
 gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
 gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
          Length = 331

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 80/222 (36%), Positives = 121/222 (54%), Gaps = 21/222 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           LPKS+DWR+     + PV++QG CGSCW+F+TT  LE Q+      L  LS+  L++C  
Sbjct: 114 LPKSVDWREKGA--VTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCST 171

Query: 184 DHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
            +GN  C GG +D AF Y+K+ +G++++  YPY  K+    +C Y KE +    +DT   
Sbjct: 172 SYGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQG---KCRYHKEDSA--GRDTGFV 226

Query: 241 -VTSGVDHMM--HLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            + SG +  +   L   GP+ V ++  H   + Y        D  C+ H LDH V  VGY
Sbjct: 227 DIPSGNERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPD--CDSHSLDHGVLAVGY 284

Query: 296 GEK-NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G   +G   +I++NSWG+     GY  + R + N CG+ + A
Sbjct: 285 GTTDDGQDYYIIKNSWGERWGQEGYVLMARNSKNECGVATQA 326


>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
          Length = 334

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 78/246 (31%), Positives = 126/246 (51%), Gaps = 16/246 (6%)

Query: 99  LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
           + G + ++  + R        E     +P+S+DWR+     + PV+ QG+CGSCWAF++T
Sbjct: 91  MNGYQHKKQNSSRAESTFTFMEPANVTVPESVDWREKGA--ITPVKDQGQCGSCWAFSST 148

Query: 159 AILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPY 215
             LE Q       L  LS+  L++C   +GN  CNGG +D AF+Y+K   G++++  YPY
Sbjct: 149 GALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPY 208

Query: 216 RNKENITFRCTYE-KEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYD 270
             ++++   C Y  + +  V      + SG +  +   +   GP+ V ++  H   + Y 
Sbjct: 209 EAEDDV---CRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYS 265

Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANAC 329
                    +C+   LDH V +VGYG  NG   W+V+NSW +   D GY ++ R   N C
Sbjct: 266 KGVYYEP--SCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKMARNRKNHC 323

Query: 330 GIESYA 335
           G+ S A
Sbjct: 324 GVASAA 329


>gi|414589597|tpg|DAA40168.1| TPA: hypothetical protein ZEAMMB73_868349 [Zea mays]
          Length = 252

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 112/217 (51%), Gaps = 12/217 (5%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P++ DWR+  +  ++PV++QG CGSCW F+TT  LE+           LS+ QLV+C  
Sbjct: 35  MPETKDWREDGI--VSPVKNQGHCGSCWTFSTTGALEAAYTQATGKAISLSEQQLVDCGF 92

Query: 186 G--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-V 241
              N  C GG    AFEY+K   GL+++  YPY+    I   C ++ E   V V D+  +
Sbjct: 93  AFNNFGCKGGLPSQAFEYIKYNGGLDTEESYPYQGVNGI---CQFKAENVGVKVLDSVNI 149

Query: 242 TSGVDHMMHLLQSGPIGVYLNHRLIES---YDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
           T G +  +         V +   +I     Y       +     P  ++HAV  VGYG +
Sbjct: 150 TLGAEDELKDAVGLVRPVSVAFEVISGFRLYKTGVYTSDHCGTTPMDVNHAVLAVGYGVE 209

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           NG+  W+++NSWG    D GYF++E G N CG+ + A
Sbjct: 210 NGVPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCA 246


>gi|157819967|ref|NP_001099569.1| cathepsin 7 precursor [Rattus norvegicus]
 gi|374110484|sp|D3ZZ07.1|CAT7_RAT RecName: Full=Cathepsin 7; Flags: Precursor
 gi|149039730|gb|EDL93846.1| cathepsin 7 (predicted) [Rattus norvegicus]
          Length = 331

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 86/224 (38%), Positives = 121/224 (54%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKT--LYPLSKSQLVEC 183
           +PK+LDWR +    + PV SQG CG+CWAF+  A +ESQ  L KKT  L PLS   L++C
Sbjct: 112 IPKTLDWRDTGC--VAPVRSQGGCGACWAFSVAASIESQ--LFKKTGKLIPLSVQNLIDC 167

Query: 184 --DHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
              +GN +C+GG    AF+YVK   GLE++A YPY  K      C Y  E++ V +   +
Sbjct: 168 TVTYGNNDCSGGKPYTAFQYVKNNGGLEAEATYPYEAKLR---HCRYRPERSVVKIARFF 224

Query: 241 VTSGVDH--MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           V    +   M  L+  GPI V ++  H   + Y G     ++  C    LDH + +VGYG
Sbjct: 225 VVPRNEEALMQALVTYGPIAVAIDGSHASFKRYRGGIY--HEPKCRRDTLDHGLLLVGYG 282

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGANA-CGIESYA 335
               E      W+++NS G+   + GY ++ R  N  CGI SYA
Sbjct: 283 YEGHESENRKYWLLKNSHGEQWGERGYMKLPRDQNNYCGIASYA 326


>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
          Length = 326

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 96/318 (30%), Positives = 145/318 (45%), Gaps = 39/318 (12%)

Query: 46  YIVKWNRTYTDDNEIKTR-------FEYFKQDGKETDE-----YYGTSGSSDRSPQEILQ 93
           + V+ N+ Y D+ E   R        EY +Q   E D        G +  +D   +E ++
Sbjct: 25  FKVRHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGVHSFRVGINEYADMPNEEFVR 84

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
                       +++  R +   ++     G LP ++DWR      +  V++QG+CGSCW
Sbjct: 85  VM-------NGYKMQEQRPKAPTYMPPSNVGDLPATVDWRTKGY--VTEVKNQGQCGSCW 135

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQ 210
           AF++T  LE Q       L  LS+  LV+C  + GN+ C GG +D AF Y+K   G++++
Sbjct: 136 AFSSTGSLEGQTFKKYNKLISLSEQNLVDCSTEQGNMGCGGGLMDQAFTYIKVNDGIDTE 195

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVT-----SGVDHMMHLLQSGPIGVYLN--H 263
             YPY   E  + +C +   KA V   DT  T     S  D    +   GPI V ++  H
Sbjct: 196 TSYPY---EAASGKCRF--NKANVGANDTGYTDIKSKSESDLQSAVATVGPIAVAIDASH 250

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
              + Y           C+  +LDH V  VGYG  +G   W+V+NSWG      GY  + 
Sbjct: 251 MSFQLYKSGVYHY--IFCSQTRLDHGVLAVGYGTDSGKDYWLVKNSWGATWGQQGYIMMS 308

Query: 324 RGA-NACGIESYAYLASV 340
           R   N CGI + A   +V
Sbjct: 309 RNRDNNCGIATQASYPTV 326


>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 94/311 (30%), Positives = 156/311 (50%), Gaps = 37/311 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEI-LQ 93
           +++++VK  +TY    E   RF+ FK + +  DE+         G +  +D + +E  + 
Sbjct: 52  YESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMT 111

Query: 94  RTGLRLTGKEKE--RLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
            TG++    +K+  ++++DR         R    LP+ +DWR+     +  V+ QG CGS
Sbjct: 112 YTGIKTIDDKKKLSKMKSDRYAY------RSGDSLPEYVDWREQGA--VTDVKDQGSCGS 163

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYGLES 209
           CWAF+TT  +E    ++   L  +S+ +LV CD   N  CNGG +D AFE+ +K  G+++
Sbjct: 164 CWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDT 223

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHR 264
           + DYPY  K+    +C   K+ AKV   D++    V+    L   + + P+ V +    R
Sbjct: 224 EEDYPYTGKDG---KCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGR 280

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
             + Y          +C    LDH V   GYG ++G   W+V+NSWG    + GY ++ER
Sbjct: 281 DFQFYTSGIFTG---SCGT-ALDHGVLAAGYGTEDGKDYWLVKNSWGAEWGEGGYLKMER 336

Query: 325 G----ANACGI 331
                +  CGI
Sbjct: 337 NIADKSGKCGI 347


>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
          Length = 351

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 81/240 (33%), Positives = 127/240 (52%), Gaps = 17/240 (7%)

Query: 106 RLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQV 165
           +L ++R  V     E     LPK +DWR  K   + PV+ QG CGSCW+F+ T  LE Q 
Sbjct: 108 QLRSERLPVGASFIEPANVVLPKKVDWR--KEGAVTPVKDQGHCGSCWSFSATGALEGQH 165

Query: 166 ALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENIT 222
                 L  LS+  L++C   +GN  CNGG +D AF+Y+K   GL+++A YPY  + +  
Sbjct: 166 FRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEASYPYEAEND-- 223

Query: 223 FRCTYEKEKAKVF-VQDTWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRN 277
            +C Y    +    V    + +G + ++   +   GP+ V ++  H+  + Y        
Sbjct: 224 -KCRYNPANSGAIDVGYIDIPTGDEKLLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEP 282

Query: 278 DWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           +  C+  +LDH V ++GYG  +NG   W+V+NSWG+   ++GY ++ R   N CGI S A
Sbjct: 283 E--CSSEELDHGVLVIGYGTNENGQDYWLVKNSWGETWGNNGYIKMARNKLNHCGIASSA 340


>gi|340053965|emb|CCC48258.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 441

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 90/317 (28%), Positives = 140/317 (44%), Gaps = 37/317 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ R+Y    E   R   F+ + + +  Y        +G +  SD +P+E   R
Sbjct: 34  FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93

Query: 95  TGLRLTGKEKER-LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
                     ER  EA R RV+  + +   G  P ++DWR+     + PV+ QG CGSCW
Sbjct: 94  Y------HNGERHFEAARGRVRTLV-QVPPGKAPAAVDWRRKGA--VTPVKDQGSCGSCW 144

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG---LESQ 210
           +F+    +E Q A     L  LS+  LV CD  +  C GG +D AFE++ +     + ++
Sbjct: 145 SFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTE 204

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD-------HMMHLLQSGPIGVYLNH 263
             YPY +       C     K         +T  VD          +L  +GP+ V ++ 
Sbjct: 205 KSYPYVSGGGEEPPCKPRGHKVGAT-----ITGHVDIPHDEDAIAKYLADNGPVAVAVDA 259

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
               SY G  +     +C    L+H V +VGY + +    WI++NSW     + GY +IE
Sbjct: 260 TTFMSYSGGVVT----SCTSEALNHGVLLVGYNDSSKPPYWIIKNSWSSSWGEKGYIRIE 315

Query: 324 RGANACGIESYAYLASV 340
           +G N C +   A  A V
Sbjct: 316 KGTNQCLVAQLASSAVV 332


>gi|148575301|gb|ABQ95351.1| secreted cathepsin L2 [Fasciola hepatica]
          Length = 326

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 118/224 (52%), Gaps = 12/224 (5%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
           K  +P+S+DWR      +  V+ QG+CGSCWAF+TT  +E Q    ++     S+ QLV+
Sbjct: 105 KLAVPESIDWRD--YYYVTEVKDQGQCGSCWAFSTTGAVEGQFRKNERASASFSEQQLVD 162

Query: 183 C--DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           C  D GN  C GG ++ A+EY+K  GLE+++ YPY+  E     C Y+   A   V   +
Sbjct: 163 CTRDFGNYGCGGGYMENAYEYLKHNGLETESYYPYQAVEG---PCQYDGRLAYAKVTGYY 219

Query: 241 VTSGVDH--MMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
                D   + +L+ + GP  V L+         + I ++   C P +L HAV  VGYG 
Sbjct: 220 TVHSGDEIELKNLVGTEGPAAVALDADSDFMMYQSGIYQSQ-TCLPDRLTHAVLAVGYGS 278

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           ++G   WIV+NSWG    + GY +  R   N CGI S A +  V
Sbjct: 279 QDGTDYWIVKNSWGTWWGEDGYIRFARNRGNMCGIASLASVPMV 322


>gi|68137209|gb|AAY85545.1| male accessory gland protein [Drosophila simulans]
          Length = 362

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 85/259 (32%), Positives = 131/259 (50%), Gaps = 23/259 (8%)

Query: 84  SDRSPQEIL-QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +D +  E L Q TGL+ + + K R  A  + V          P+P++ DWR+     + P
Sbjct: 107 ADLTHSEFLSQLTGLKRSPEAKARAAASLKEVA-----LPAKPIPEAFDWREHGG--VTP 159

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC----DHGNLNCNGGNIDVA 198
           V+ QG CGSCWAFATT  +E        +L  LS+  LV+C    D G   C+GG  + A
Sbjct: 160 VKFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVQDFGLNGCDGGFQEAA 219

Query: 199 FEYVK--QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD--HMMHLLQS 254
           F ++   Q G+     YPY + ++    C Y+  K+   +Q        D   +  ++ +
Sbjct: 220 FCFIDEVQKGVSQAGAYPYIDNKDT---CKYDGSKSGATLQGFAAIPPKDEEQLKKVVAT 276

Query: 255 -GPIGVYLNH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGD 312
            GP+   +N    +++Y G     ND  CN  + +H++ +VGYG +NG   WIV+NSW D
Sbjct: 277 LGPVACSVNGLETLKNYAGG--IYNDDECNKGEPNHSILVVGYGSENGQDYWIVKNSWDD 334

Query: 313 IGPDHGYFQIERGANACGI 331
              + GYF++ RG N C I
Sbjct: 335 TWGEKGYFRLPRGKNYCFI 353


>gi|343472974|emb|CCD15016.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 89/312 (28%), Positives = 145/312 (46%), Gaps = 27/312 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQEILQR 94
           F  +  K++R+Y D  E   RF  FKQ+ +   E         +G +  SD SP+E    
Sbjct: 41  FAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEE---- 96

Query: 95  TGLRLT-GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
              R T     E   A  +R +K +N     P P ++DWR  K   + PV+ QG+C S W
Sbjct: 97  --FRATYHNGAEYYAAALKRPRKVVNVSTGRP-PMTVDWR--KKGAVTPVKDQGKCDSSW 151

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQYGLESQ 210
           AF+ T  +E Q  +    L  LS+  LV CD  +L C  G  D+AF ++    +  + ++
Sbjct: 152 AFSATGNIEGQWKVAGHELTSLSEQMLVSCDTDDLGCRDGFPDIAFNWIVSSNKGNVFTE 211

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIES 268
             YPY +       C    +     ++D    +  + M+   L + GP  + ++    + 
Sbjct: 212 QSYPYASGGGNVPTCDKSGKVVGAKIRDHVDLARDEDMIAEWLARKGPAAITVDATSFQR 271

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           Y G  +     +C   +++ A  +VGY + +    WI++NSWG    + GY +IE+G N 
Sbjct: 272 YTGGVLT----SCISKEMNSAALLVGYDDTSKPPYWIIKNSWGKGWGEEGYIRIEKGTNQ 327

Query: 329 CGIESYAYLASV 340
           C ++ YA  A V
Sbjct: 328 CLVQEYARSAVV 339


>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
          Length = 338

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 78/246 (31%), Positives = 126/246 (51%), Gaps = 16/246 (6%)

Query: 99  LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
           + G + ++  + R        E     +P+S+DWR+     + PV+ QG+CGSCWAF++T
Sbjct: 95  MNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGA--ITPVKDQGQCGSCWAFSST 152

Query: 159 AILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPY 215
             LE Q       L  LS+  L++C   +GN  CNGG +D AF+Y+K   G++++  YPY
Sbjct: 153 GALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPY 212

Query: 216 RNKENITFRCTYE-KEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYD 270
             ++++   C Y  + +  V      + SG +  +   +   GP+ V ++  H   + Y 
Sbjct: 213 EAEDDV---CRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYS 269

Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANAC 329
                    +C+   LDH V +VGYG  NG   W+V+NSW +   D GY +I R   N C
Sbjct: 270 KGVYYEP--SCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARNRKNHC 327

Query: 330 GIESYA 335
           G+ + A
Sbjct: 328 GVATAA 333


>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 91/303 (30%), Positives = 153/303 (50%), Gaps = 31/303 (10%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K ++ F++++ +  + Y    E   RFE FK + K  D+        + G +  +D S Q
Sbjct: 42  KLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQ 101

Query: 90  EILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E   +  GL++   ++     + E   + ++      LPKS+DWR  K   + PV++QG+
Sbjct: 102 EFKNKYLGLKVDLSQRRESSNEEEFTYRDVD------LPKSVDWR--KKGAVTPVKNQGQ 153

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQY-G 206
           CGSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AF ++ Q  G
Sbjct: 154 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQNGG 213

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL-- 261
           L  + DYPY  +E+    C  +KE+ +V   + +     +    ++  L + P+ V +  
Sbjct: 214 LHKEEDYPYIMEEST---CEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEA 270

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           + R  + Y G      D  C    LDH V+ VGYG    +   IV+NSWG    + G+ +
Sbjct: 271 SSRDFQFYSGGVF---DGHCGS-DLDHGVSAVGYGTSKNLDYIIVKNSWGAKWGEKGFIR 326

Query: 322 IER 324
           ++R
Sbjct: 327 MKR 329


>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 467

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 98/310 (31%), Positives = 149/310 (48%), Gaps = 35/310 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           ++ ++ K  ++Y    E + RF+ FK + +  DE+     + +R+ +  L R    LT +
Sbjct: 51  YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEH----NAENRTYKVGLNRFA-DLTNE 105

Query: 103 E------KERLEADRERVKKFLNE---RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           E        R  A R    K  +    R    LP+S+DWR+    V   V+ QG CGSCW
Sbjct: 106 EYRSMYLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVV--EVKDQGSCGSCW 163

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYGLESQA 211
           AF+T A +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE+ +   G++S+ 
Sbjct: 164 AFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEE 223

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHRLI 266
           DYPY+  +    RC   ++ AKV   D +     +    L   + + P+ V +    R  
Sbjct: 224 DYPYKASDG---RCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREF 280

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER-- 324
           + Y           C    LDH V  VGYG +NG+  WIV+NSWG    + GY ++ER  
Sbjct: 281 QLYQSGIFTGR---CGT-ALDHGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDL 336

Query: 325 ---GANACGI 331
                  CGI
Sbjct: 337 ATSATGKCGI 346


>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
          Length = 335

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 82/226 (36%), Positives = 115/226 (50%), Gaps = 20/226 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           LPK++DWR      + PV++QG+CGSCWAF+ T  LE Q      ++  LS+  LV C  
Sbjct: 119 LPKTVDWRTKGA--VTPVKNQGQCGSCWAFSATGSLEGQHFRKSGSMVSLSEQNLVGCST 176

Query: 184 DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKV----FVQD 238
           D GN  C GG +D AF+Y++   G++++  YPY   +     C ++K         FV D
Sbjct: 177 DFGNNGCEGGLMDDAFKYIRANKGIDTEKSYPYNGTDGT---CHFKKSTVGATDSGFV-D 232

Query: 239 TWVTSGVDHMMHLLQSGPIGVYLN--HRLIESY-DGNPIRRNDWACNPHKLDHAVAIVGY 295
               S       +   GPI V ++  H   + Y DG     ++  C+   LDH V +VGY
Sbjct: 233 IKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG---VYDEPECDSESLDHGVLVVGY 289

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           G  NG   W V+NSWG    D GY ++ R   N CGI S A +  V
Sbjct: 290 GTLNGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQCGIASSASIPLV 335


>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 83/221 (37%), Positives = 113/221 (51%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPK +DWR  K   + PV+ QG+CGSCWAF+ T  LE Q  L    L  LS+  LV+C  
Sbjct: 116 LPKVVDWR--KKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
             GN  C GG ++ AF+Y+K   G++++  YPY+    +   C ++KE   V   DT   
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYKA---VDGECRFKKED--VGATDTGYV 228

Query: 241 ---VTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
                S VD    +   GPI V ++  H   + Y        +  C+   LDH V +VGY
Sbjct: 229 EIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPE--CSSEDLDHGVLVVGY 286

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIER-GANACGIESYA 335
           G K G   W+V+NSW +   D GY  + R   N CGI S A
Sbjct: 287 GVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327


>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
 gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 76/223 (34%), Positives = 118/223 (52%), Gaps = 20/223 (8%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G +P S+DWR   V  +  V+ QG CG+CW+F+ T  +E    ++  +L  LS+ +L+EC
Sbjct: 112 GDIPASIDWRNKGV--VTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIEC 169

Query: 184 DHG-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWV 241
           D   N  C GG +D AF++V   +G++++ DYPYR ++     C  ++ K +V   D +V
Sbjct: 170 DKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGT---CNKDRMKRRVVTIDKYV 226

Query: 242 TSGVDHMMHLLQSGP-----IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
               ++   LLQ+       +G+  + R  + Y           C+   LDHAV IVGYG
Sbjct: 227 DVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTG---PCST-SLDHAVLIVGYG 282

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA----NACGIESYA 335
            +NG+  WIV+NSWG      GY  ++R +      CGI   A
Sbjct: 283 SENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLA 325


>gi|226477902|emb|CAX72658.1| Cathepsin L precursor [Schistosoma japonicum]
 gi|226488903|emb|CAX74801.1| Cathepsin L precursor [Schistosoma japonicum]
          Length = 372

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 95/319 (29%), Positives = 151/319 (47%), Gaps = 37/319 (11%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFK-------------QDGKETDEYYGTSGSSDRSP 88
           A+K + + + R Y +  E   RF  F              Q+GK T +  G +  +D++ 
Sbjct: 61  AWKFFKINFKRAYGNVMEETKRFLIFGTNFIKMMEHNRAYQEGKATYKM-GVNNFTDKTE 119

Query: 89  QEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
            E+ +  G R            + +   F++  +   LP  +DWR++    + PV++QG+
Sbjct: 120 YELRKLRGYR------SACRIAKPKGSTFISS-EHAKLPDRVDWRRNGA--VTPVKNQGQ 170

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QY 205
           CGSCWAF++T  +E Q       L  LS+ QL++C   +GN  C GG +D+AF+YV+   
Sbjct: 171 CGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNE 230

Query: 206 GLESQADYPY---RNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGV 259
           G++S+  YPY      EN+  RC +        V         D    M  +   GP+ V
Sbjct: 231 GIDSEISYPYISGDGDENV--RCLFNSTNIMAQVTGYINIHEGDERALMNAVATIGPVSV 288

Query: 260 YLNHRL--IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            +N  L     Y        + A     LDH V +VGYG ++G   W+++NSWG+   D 
Sbjct: 289 AINAGLSSFSMYKSGIYSDPECASASEDLDHGVLLVGYGIEDGKPYWLIKNSWGEDWGDK 348

Query: 318 GYFQIERGA-NACGIESYA 335
           GY +I + + N CG+ S A
Sbjct: 349 GYVKILKDSKNMCGVASAA 367


>gi|246148|gb|AAB21516.1| Cyclic Protein-2 [Rattus sp.]
          Length = 247

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 118/224 (52%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +PK++DWR+     + PV++QG+CGSCWAF+ +  LE Q+ L    L  LS+  LV+C H
Sbjct: 27  IPKTVDWREKGC--VTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH 84

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF+Y+K+  GL+S+  YPY  K+     C Y  E A     DT   
Sbjct: 85  DQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDG---SCKYRAEYA--VANDTGFV 139

Query: 243 SGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
                   L+++    GPI V ++  H  ++ Y        +  C+   LDH V +VGYG
Sbjct: 140 DIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN--CSSKDLDHGVLVVGYG 197

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
               + N    W+V+NSWG      GY +I +   N CG+ + A
Sbjct: 198 YEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAA 241


>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
          Length = 336

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 80/222 (36%), Positives = 121/222 (54%), Gaps = 21/222 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           LPKS+DWR+     + PV++QG CGSCW+F+TT  LE Q+      L  LS+  L++C  
Sbjct: 119 LPKSVDWREKGA--VTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCST 176

Query: 184 DHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
            +GN  C GG +D AF Y+K+ +G++++  YPY  K+    +C Y KE +    +DT   
Sbjct: 177 SYGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQG---KCRYHKEDSA--GRDTGFV 231

Query: 241 -VTSGVDHMM--HLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            + SG +  +   L   GP+ V ++  H   + Y        D  C+ H LDH V  VGY
Sbjct: 232 DIPSGNERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPD--CDSHSLDHGVLAVGY 289

Query: 296 GEK-NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G   +G   +I++NSWG+     GY  + R + N CG+ + A
Sbjct: 290 GTTDDGQDYYIIKNSWGERWGQEGYVLMARNSKNECGVATQA 331


>gi|56758090|gb|AAW27185.1| SJCHGC06231 protein [Schistosoma japonicum]
          Length = 372

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 95/319 (29%), Positives = 151/319 (47%), Gaps = 37/319 (11%)

Query: 42  AFKTYIVKWNRTYTDDNEIKTRFEYFK-------------QDGKETDEYYGTSGSSDRSP 88
           A+K + + + R Y +  E   RF  F              Q+GK T +  G +  +D++ 
Sbjct: 61  AWKFFKINFKRAYGNVMEETKRFLIFGTNFIKMMEHNRAYQEGKATYKM-GVNNFTDKTE 119

Query: 89  QEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
            E+ +  G R            + +   F++  +   LP  +DWR++    + PV++QG+
Sbjct: 120 YELRKLRGYR------SACRIAKPKGSTFISS-EHAKLPDRVDWRRNGA--VTPVKNQGQ 170

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QY 205
           CGSCWAF++T  +E Q       L  LS+ QL++C   +GN  C GG +D+AF+YV+   
Sbjct: 171 CGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNK 230

Query: 206 GLESQADYPY---RNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGV 259
           G++S+  YPY      EN+  RC +        V         D    M  +   GP+ V
Sbjct: 231 GIDSEISYPYISGDGDENV--RCLFNSTNIMAQVTGYINIHEGDERALMNAVATIGPVSV 288

Query: 260 YLNHRL--IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            +N  L     Y        + A     LDH V +VGYG ++G   W+++NSWG+   D 
Sbjct: 289 AINAGLPSFSMYKSGIYSDPECASASEDLDHGVLLVGYGIEDGKPYWLIKNSWGEDWGDK 348

Query: 318 GYFQIERGA-NACGIESYA 335
           GY +I + + N CG+ S A
Sbjct: 349 GYVKILKDSKNMCGVASAA 367


>gi|14422331|emb|CAC41636.1| early leaf senescence abundant cysteine protease [Pisum sativum]
          Length = 350

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 80/218 (36%), Positives = 111/218 (50%), Gaps = 18/218 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP   DWR  K  +++ V+ QG CGSCW F+TT  LES  A        LS+ QLV+C  
Sbjct: 133 LPDEKDWR--KEGIVSGVKDQGSCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAG 190

Query: 186 G--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-V 241
              N  C+GG    AFEY+K   GLE++  YPY     +   C +  E   V V  +  +
Sbjct: 191 AFNNFGCSGGLPSQAFEYIKYNGGLETEEAYPYTGSNGL---CKFRSEHVAVKVLGSVNI 247

Query: 242 TSGV-DHMMHLLQ-SGPIGVYL----NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
           T G  D + H +  + P+ V      + RL   Y             P  ++HAV  VGY
Sbjct: 248 TLGAEDELKHAIAFARPVSVAFEVVHDFRL---YKSGVYTSTACGSTPMDVNHAVLAVGY 304

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIES 333
           G ++GI  W+++NSWG    DHGYF++E G N CG+ +
Sbjct: 305 GIEDGIPYWLIKNSWGGDWGDHGYFKMEMGKNMCGVAT 342


>gi|291224872|ref|XP_002732426.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
          Length = 691

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 83/220 (37%), Positives = 112/220 (50%), Gaps = 20/220 (9%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-- 184
           P S+DWR      +  V+ QG CGSCWAF+TT  +E Q       L   S+ QLV+C   
Sbjct: 476 PDSVDWRTKGY--VTEVKDQGACGSCWAFSTTGSMEGQSFKNTGKLVSFSEQQLVDCSGS 533

Query: 185 HGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG 244
           +GN+ C GG +D AF Y++ YG+E +ADYPY  K++    C+Y+  KA     +T  T  
Sbjct: 534 YGNMGCGGGLMDQAFAYIEDYGIEPEADYPYTAKDD---PCSYDTSKA--VATNTGYTDI 588

Query: 245 VDHMMHLLQS-----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
                  LQ      GPI V ++  H     Y       ++ AC+   LDH V  VGYG 
Sbjct: 589 ATMDEKALQQAVATVGPISVAIDASHSSFRLYKSGVY--DEPACSQTMLDHGVLAVGYGT 646

Query: 298 K-NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
             +G   WIV+NSWG    + GY  + R   N CGI + A
Sbjct: 647 TDDGNDYWIVKNSWGSTWGNQGYIHMSRNNDNQCGIATNA 686


>gi|60679562|gb|AAX34043.1| Sui m 1 allergen [Suidasia medanensis]
          Length = 336

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 119/227 (52%), Gaps = 25/227 (11%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP + DWRQ   +    V +QG+CGSCWAFAT A +E+Q A+ K     LS+ QLV+CDH
Sbjct: 115 LPAAFDWRQ---QWNTAVRNQGQCGSCWAFATAATVEAQYAIRKNVHVTLSEQQLVDCDH 171

Query: 186 GNLN-------CNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYE-KEKAKVFVQ 237
                      C GGN  +A+ YV+Q GL  ++ YPY+ ++      T    ++  V   
Sbjct: 172 RPFQGQYEDHGCQGGNPIIAYAYVQQTGLVEESAYPYQARDGQCQSSTVNGHQRYHVSAG 231

Query: 238 DTWVTSGVDH--MMHLLQSGPIGVYLNHRLIESYDGNPIR--RN----DWACNPHKLDHA 289
                +  D   M  L Q GP+ V     LI + D N  R  RN    +   N  +++HA
Sbjct: 232 RELPFNATDETIMNSLHQIGPMAV-----LIFASD-NEFRFYRNGVIQNLRPNSRQINHA 285

Query: 290 VAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAY 336
           V +VG+G ++G   WIV+NSWG    + GYF++ R  N  GI +Y +
Sbjct: 286 VTLVGWGTEDGQDYWIVKNSWGPSWGESGYFRLGRHHNLIGINNYVF 332


>gi|270006364|gb|EFA02812.1| cathepsin O precursor [Tribolium castaneum]
          Length = 326

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 88/304 (28%), Positives = 142/304 (46%), Gaps = 37/304 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD----------EYYGTSGSSDRSPQEIL 92
           F+ Y+ ++N+TY D +  + R   FKQ  +  +            YG +  SD  P+E  
Sbjct: 35  FQEYLKRFNKTYDDPSVYQNRLHAFKQSLQTIETLNSKKRNGSALYGLTKFSDLLPEEFF 94

Query: 93  QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
           Q        ++    E  R       +  K+  +P  +DWR+     +  + +QG CG+C
Sbjct: 95  QTYLQSNLSQKTHSNEPKR-------HHHKRATVPNKVDWREKNA--VTRIYNQGSCGAC 145

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK--QYGLESQ 210
           WA++    +ES  A+       LS  ++++C   N  CNGG+I     ++K   + ++  
Sbjct: 146 WAYSVIETVESMNAIKTNKSEELSVQEIIDCAGNNKGCNGGDICTLLSWIKATNFTIQRH 205

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQ-SGPIGVYLNHRLIESY 269
           ADY          +C   +  A V V+D  +    D M+ LL  +GP+ V +N +  ++Y
Sbjct: 206 ADY--------GGKCG--RGSAGVHVRDFILVGSEDVMLRLLADNGPLAVAINAQTWQNY 255

Query: 270 DGNPIRRNDWAC--NPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            G  I   ++ C  +P KL+HAV IVGY     I  +IVRN+WG    D G+  I    N
Sbjct: 256 IGGVI---EYHCDGDPSKLNHAVQIVGYDLTASIPHYIVRNTWGVDFGDGGFLYIAVDKN 312

Query: 328 ACGI 331
            CGI
Sbjct: 313 MCGI 316


>gi|81294188|gb|AAI08032.1| Cathepsin L, 1 b [Danio rerio]
          Length = 336

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 83/240 (34%), Positives = 127/240 (52%), Gaps = 30/240 (12%)

Query: 119 NERKKGPL---------PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLK 169
           N+  +GPL         P+ +DWRQ     + PV+ Q +CGSCW+F++T  LE Q+    
Sbjct: 99  NQTSQGPLFMEPSFFAAPQQVDWRQRGY--VTPVKDQKQCGSCWSFSSTGALEGQLFRKT 156

Query: 170 KTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCT 226
             L  +S+  LV+C    GN  CNGG +D+AF+YVK+  GL+S+  YPY  ++++  R  
Sbjct: 157 GKLISMSEQNLVDCSRPQGNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYD 216

Query: 227 YEKEKAKV--FVQDTWVTSG--VDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWA 280
                AK+  FV    + SG  +  M  +   GP+ V ++  H+ ++ Y          A
Sbjct: 217 PRFNVAKITGFVD---IPSGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYER--A 271

Query: 281 CNPHKLDHAVAIVGYGEKNGILT----WIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           C+  +LDHAV +VGYG +   +     WIV+NSW D   D GY  + +   N CG+ + A
Sbjct: 272 CSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKA 331


>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 95/311 (30%), Positives = 153/311 (49%), Gaps = 38/311 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQ 93
           +++++VK  ++Y    E + RF+ FK + +  DE+          G +  +D + +E  +
Sbjct: 50  YESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNEE-YR 108

Query: 94  RTGLRLTGKEK-ERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
            T L    K K  ++++DR         R    LP+S+DWR      + P++ QG CGSC
Sbjct: 109 STYLGAKSKPKLSKVKSDR------YAPRVGDSLPESVDWRAKGA--VAPIKDQGSCGSC 160

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQ 210
           WAF+T   +E    ++   L  LS+ +LV+CD   N  C+GG +D  FE++    G+++ 
Sbjct: 161 WAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGGLMDYGFEFIINNGGIDTD 220

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPI--GVYLNHRL 265
            DYPY  ++    RC   ++ AKV   D++    V++   +   + S P+  G+    R 
Sbjct: 221 KDYPYLGRDA---RCDQYRKNAKVVTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRA 277

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER- 324
            + YD          C    LDH V +VGYG + G   WIVRNSWG    + GY ++ER 
Sbjct: 278 FQFYDSGIFTGK---CGT-ALDHGVNVVGYGTEKGKDYWIVRNSWGSSWGEAGYIRMERN 333

Query: 325 ----GANACGI 331
                   CGI
Sbjct: 334 LAGTSVGKCGI 344


>gi|256535829|gb|ACU82389.1| cathepsin L 1 [Pheronema raphanus]
          Length = 328

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 79/218 (36%), Positives = 114/218 (52%), Gaps = 21/218 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           L  S+DWR      + PV++QG+CGSCWAF++T  LE Q  +    L   S+S+LV+C  
Sbjct: 115 LSDSVDWRSKGA--VTPVKNQGQCGSCWAFSSTGSLEGQYFINNDKLLSFSESELVDCSR 172

Query: 185 -HGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
            +GN  C GG +D AF Y + Y  E ++DYPY  K+     C Y ++K    +       
Sbjct: 173 RYGNNGCKGGLMDNAFRYWEVYKEELESDYPYVAKDG---PCRYSQDKGVTTISS---YK 226

Query: 244 GVDHMMHL-LQS-----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            V H   + LQ      GPI V ++  H+  + Y       ++  C+  KLDH V +VGY
Sbjct: 227 NVPHFSQISLQDAVRTIGPISVAMDASHKSFQLYHSGVYSESE--CSQTKLDHGVLVVGY 284

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIES 333
           G  +    W+V+NSWG      GYF+I    N CG+E+
Sbjct: 285 GTSSEPF-WLVKNSWGAGWGMDGYFEIAMRNNMCGLET 321


>gi|148283737|gb|ABN50361.2| cathepsin L [Fasciola hepatica]
          Length = 326

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 83/224 (37%), Positives = 119/224 (53%), Gaps = 15/224 (6%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
           K  +P+S+DWR      +  V++QG+CGSCWAF+TT  +E Q    ++     S+ QLV 
Sbjct: 105 KLAVPESIDWRD--YYYVTEVKNQGQCGSCWAFSTTGAVEGQFRKNERASASFSEQQLVN 162

Query: 183 C--DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           C  D GN  C GG ++ A+EY+K  GLE+++ YPY+  E     C Y+   A   V   +
Sbjct: 163 CTRDFGNYGCGGGYVENAYEYLKHNGLETESYYPYQAVEG---PCQYDGRLAYAKVTGYY 219

Query: 241 VTSGVDH--MMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
                D   + +L+ + GP  V L+         + I ++   C P +L HAV  VGYG 
Sbjct: 220 TVHSGDEIELKNLVGTEGPAAVALDADSDFMMYQSGIYQSQ-TCLPDRLTHAVLAVGYGS 278

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           ++G   WIV+NSWG    + GY +  R   N CGI S   LASV
Sbjct: 279 QDGTDYWIVKNSWGTWWGEDGYIRFARNRGNMCGIAS---LASV 319


>gi|25956267|dbj|BAC41322.1| hypothetical protein [Lotus japonicus]
          Length = 358

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 95/322 (29%), Positives = 150/322 (46%), Gaps = 50/322 (15%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  ++ + Y  + E   RF  FK +      +        +G +  SD +P E    
Sbjct: 45  FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDPSAVHGVTQFSDLTPMEFQHS 104

Query: 95  T-GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             GLR  G     L +D +       +     LPK  DWR      + PV++QG CGSCW
Sbjct: 105 VLGLRGVG-----LPSDADSAPILPTDN----LPKDFDWRGHGA--VTPVKNQGSCGSCW 153

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGN------------IDVAFEY 201
           +F+ T  LE    L    L  LS+ QLV+CDH    C+               ++ AFEY
Sbjct: 154 SFSATGALEGAHFLSTGELVSLSEQQLVDCDH---QCDPEEAGSCGSGCNGGLMNSAFEY 210

Query: 202 V-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIG 258
           +    G+  + DYPY      T  C ++K K    V +  V S  +  +  +L+++GP+ 
Sbjct: 211 ILNNGGVMREEDYPYSGTNGGT--CKFDKAKIAASVANFSVVSRDEDQIAANLVKNGPLA 268

Query: 259 VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWG 311
           V +N   +++Y G       + C+  KL+H V +VGYG ++           WI++NSWG
Sbjct: 269 VAINAVYMQTYVGG--VSCPYVCS-KKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWG 325

Query: 312 DIGPDHGYFQIERGANACGIES 333
           +   ++GY++I RG N CG++S
Sbjct: 326 ENWGENGYYKICRGRNICGVDS 347


>gi|54020916|ref|NP_001005702.1| cathepsin K (pycnodysostosis) precursor [Xenopus (Silurana)
           tropicalis]
 gi|49671274|gb|AAH75275.1| cathepsin K (pycnodysostosis) [Xenopus (Silurana) tropicalis]
          Length = 329

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 86/259 (33%), Positives = 128/259 (49%), Gaps = 24/259 (9%)

Query: 85  DRSPQEILQRT-GLRLTGKEKER---LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           D + +E++Q+  GL++    +     +     R+ ++++ RKKG               +
Sbjct: 82  DMTSEEVVQKMMGLKVPPNHRPNNTYIPEWNSRIPEYIDYRKKG--------------YV 127

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKT--LYPLSKSQLVECDHGNLNCNGGNIDVA 198
            PV +QG CGSCWAF++   LE Q  L+KKT  L  LS   LV+CD  N  C GG +  A
Sbjct: 128 TPVHNQGICGSCWAFSSVGALEGQ--LMKKTGKLVSLSPQNLVDCDTDNYGCEGGYMTNA 185

Query: 199 FEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPI 257
           F YV+   G++S A+YPY  ++        +K       ++  V S       +   GP+
Sbjct: 186 FGYVRDNGGIDSDAEYPYVGQDEGCHYNPADKAATCKGYKEIPVGSEKALKRAVANVGPV 245

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDH 317
            V ++  L            D +CNP  ++HAV +VGYG + GI  WI++NSWGD     
Sbjct: 246 SVSIDASLPSFQFYKKGVYYDSSCNPDAVNHAVLVVGYGNEKGIKHWIIKNSWGDWWGKK 305

Query: 318 GYFQIERG-ANACGIESYA 335
           GY  + R   NACGI S A
Sbjct: 306 GYVLLARDKKNACGIASLA 324


>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
          Length = 350

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 80/218 (36%), Positives = 111/218 (50%), Gaps = 18/218 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP   DWR  K  +++ V+ QG CGSCW F+TT  LES  A        LS+ QLV+C  
Sbjct: 133 LPDEKDWR--KEGIVSGVKDQGSCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAG 190

Query: 186 G--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-V 241
              N  C+GG    AFEY+K   GLE++  YPY     +   C +  E   V V  +  +
Sbjct: 191 AFNNFGCSGGLPSQAFEYIKYNGGLETEEAYPYTGSNGL---CKFRSEHVAVKVLGSVNI 247

Query: 242 TSGV-DHMMHLLQ-SGPIGVYL----NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
           T G  D + H +  + P+ V      + RL   Y             P  ++HAV  VGY
Sbjct: 248 TLGAEDELKHAIAFARPVSVAFEVVHDFRL---YKSGVYTSTACGSTPMDVNHAVLAVGY 304

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIES 333
           G ++GI  W+++NSWG    DHGYF++E G N CG+ +
Sbjct: 305 GIEDGIPYWLIKNSWGGDWGDHGYFKMEMGKNMCGVAT 342


>gi|21483188|gb|AAK77918.1| cathepsin L 1 [Dictyocaulus viviparus]
          Length = 347

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 89/253 (35%), Positives = 133/253 (52%), Gaps = 26/253 (10%)

Query: 98  RLTGKEKERLEAD--RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
           +L G    RL  D  R+   KFL       +P S+DWR+  +  + PV++QG CGSCWAF
Sbjct: 101 KLNGYRHRRLFGDSMRKNGTKFLVPFNV-KVPDSVDWREHNL--VTPVKNQGMCGSCWAF 157

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQAD 212
           + T  LE Q       L  LS+  LV+C   +GN  CNGG +D+AFEY+K  +G++++  
Sbjct: 158 SATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEEG 217

Query: 213 YPYRNKENITFRCTYEKE----KAKVFVQDTWVTSGVDHMMHLLQS--GPIGVYLN--HR 264
           YPY  KE    RC ++K     + + FV    +  G +  + +  +  GPI + ++  HR
Sbjct: 218 YPYVGKE---MRCHFKKRDIGAEDRGFVD---LPEGDEDALKVAVATQGPISIAIDAGHR 271

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIE 323
             + Y        D  C+  +LDH V +VGYG +      WI++NSWG    + GY +I 
Sbjct: 272 SFQLYKKGVYF--DEECSSEELDHGVLLVGYGTDPEAGDYWIIKNSWGTKWGEKGYVRIA 329

Query: 324 RGA-NACGIESYA 335
           R   N CG+ + A
Sbjct: 330 RNRNNHCGVATKA 342


>gi|431917800|gb|ELK17041.1| Cathepsin L1 [Pteropus alecto]
          Length = 334

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 86/248 (34%), Positives = 125/248 (50%), Gaps = 22/248 (8%)

Query: 107 LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVA 166
            +  + +  K   E     +P S+DWRQ     + PV++QG+CGSCWAF+ T  LE Q+ 
Sbjct: 95  FQNQKHKKGKVFREPLFAQIPPSVDWRQKGY--VTPVKNQGQCGSCWAFSATGSLEGQMF 152

Query: 167 LLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITF 223
                L  LS+  LV+C    GN  CNGG +D AF+Y+K   GL+S+  YPY  KE+ T 
Sbjct: 153 RKTGKLVSLSEQNLVDCSRSQGNEGCNGGLMDNAFQYIKDNGGLDSEESYPYLAKESDT- 211

Query: 224 RCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRN 277
            C Y+ E +     DT           L+++    GPI V ++  H   + Y+       
Sbjct: 212 -CNYKPEYSA--ANDTGFVDIPQREKSLMKAVATVGPISVAIDAGHSSFQFYNKGIYYEP 268

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILT----WIVRNSWGDIGPDHGYFQIERGA-NACGIE 332
           D  C+   LDH V ++GYG + G       WIV+NSWG     +GY ++ +   N CGI 
Sbjct: 269 D--CSSKDLDHGVLVIGYGSEGGDPKSNKFWIVKNSWGPEWGMNGYVKMAKDQNNHCGIA 326

Query: 333 SYAYLASV 340
           + A   +V
Sbjct: 327 TAASYPTV 334


>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
          Length = 341

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 83/224 (37%), Positives = 122/224 (54%), Gaps = 27/224 (12%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-- 184
           P ++DWRQ     + PV+ QG+CGSCW+F+TT  LE Q       L  LS+  L++C   
Sbjct: 125 PPTVDWRQHGA--VTPVKDQGKCGSCWSFSTTGALEGQHFRKSGFLVSLSEQNLIDCSSA 182

Query: 185 HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKV----FVQDT 239
           +GN  CNGG +D AF+Y+K   G++++  YPY   E +  +C Y  + +      FV   
Sbjct: 183 YGNNGCNGGLMDNAFKYIKDNDGIDTEKTYPY---EAVDDKCRYNPKNSGAEDVGFVD-- 237

Query: 240 WVTSGVDH--MMHLLQSGPIGVYLNHRLIESY----DGNPIRRNDWACNPHKLDHAVAIV 293
            + +G +H  M+ L   GP+ V ++    ES+    DG     N   C+   LDH V +V
Sbjct: 238 -IPAGDEHKLMLALATVGPVSVAIDASQ-ESFQLYSDGVYYDEN---CSSENLDHGVLVV 292

Query: 294 GYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           GYG +++G   W+V+NSWG    D GY ++ R   N CGI S A
Sbjct: 293 GYGTDEDGGDYWLVKNSWGPSWGDEGYIKMARNRDNHCGIASSA 336


>gi|340053963|emb|CCC48256.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 452

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 90/317 (28%), Positives = 141/317 (44%), Gaps = 37/317 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F  +  K+ R+Y    E   R   F+ + + +  Y        +G +  SD +P+E   R
Sbjct: 34  FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93

Query: 95  TGLRLTGKEKER-LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
                     ER  EA R RV+  + +   G  P ++DWR+     + PV+ QG CGSCW
Sbjct: 94  Y------HNGERHFEAARGRVRTLV-QVPPGKAPAAVDWRRKGA--VTPVKDQGSCGSCW 144

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYG---LESQ 210
           +F+    +E Q A     L  LS+  LV CD  +  C GG +D AFE++ +     + ++
Sbjct: 145 SFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDSKDNGCGGGFMDNAFEWIVKENSGKVYTE 204

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD-------HMMHLLQSGPIGVYLNH 263
             YPY +       C     K +       +T  VD          +L  +GP+ V ++ 
Sbjct: 205 KSYPYVSGGGEEPPC-----KPRGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDA 259

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
               SY G  +     +C    L+H V +VGY + +    WI++NSW     + GY +IE
Sbjct: 260 TTFMSYSGGVVT----SCTSEALNHGVLLVGYNDSSKPPYWIIKNSWSSSWGEKGYIRIE 315

Query: 324 RGANACGIESYAYLASV 340
           +G N C +   A  A V
Sbjct: 316 KGTNQCLVAQLASSAVV 332


>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
          Length = 388

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 89/245 (36%), Positives = 127/245 (51%), Gaps = 18/245 (7%)

Query: 103 EKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILE 162
           E+   E +R     FL E     +P S+DWR      + PV++QG CGSCWAF+TT  LE
Sbjct: 145 ERHFSEGNRINGSAFL-EVNYVQVPTSVDWRDHGY--VTPVKNQGHCGSCWAFSTTGALE 201

Query: 163 SQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKE 219
            Q+      L  LS+  LV+C    GN  CNGG +D AF+Y+ +  G++S+  YPY  K+
Sbjct: 202 GQLFRKSGRLVSLSEQNLVDCSWQQGNQGCNGGIVDFAFQYILENRGIDSEDCYPYTAKD 261

Query: 220 NITFRCTYEKEKAKVFVQ---DTWVTSGVDHMMHLLQSGPIGVYLN-HRLIESYDGNPIR 275
             T +C ++ E A   V    D    S    M  +   GP+ V ++ H     +  + I 
Sbjct: 262 --TAQCAFKPECATARVTGFVDIPPHSEEALMKAVATVGPVSVAIDAHPTSFRFYQSGIF 319

Query: 276 RNDWACNPHKLDHAVAIVGYG----EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACG 330
                C+  +L+HAV +VGYG    ++ G   WIV+NSWG    DHGYF + +   N CG
Sbjct: 320 YEP-KCSSERLNHAVLVVGYGYEGEDEAGKKYWIVKNSWGKQWGDHGYFYLSKDRGNHCG 378

Query: 331 IESYA 335
           I + A
Sbjct: 379 IATTA 383


>gi|79314271|ref|NP_001030812.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
 gi|332644501|gb|AEE78022.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
          Length = 357

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 87/302 (28%), Positives = 142/302 (47%), Gaps = 19/302 (6%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL 97
           + V +F  +  ++ + Y    E+K RF  FK++    D    T+         + Q   L
Sbjct: 54  RHVLSFSRFTHRYGKKYQSVEEMKLRFSVFKEN---LDLIRSTNKKGLSYKLSLNQFADL 110

Query: 98  RLTGKEKERLEADRERVKKFLNERK--KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
                ++ +L A +          K  +  +P + DWR+  +  ++PV+ QG CGSCW F
Sbjct: 111 TWQEFQRYKLGAAQNCSATLKGSHKITEATVPDTKDWREDGI--VSPVKEQGHCGSCWTF 168

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQAD 212
           +TT  LE+           LS+ QLV+C     N  C+GG    AFEY+K   GL+++  
Sbjct: 169 STTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEA 228

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGVYLNHRLIE 267
           YPY  K+     C +  +   V V+D+  +T G +    H + L++   +   + H    
Sbjct: 229 YPYTGKDG---GCKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEF-R 284

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            Y       N     P  ++HAV  VGYG ++ +  W+++NSWG    D+GYF++E G N
Sbjct: 285 FYKKGVFTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKN 344

Query: 328 AC 329
            C
Sbjct: 345 MC 346


>gi|6967097|emb|CAB72480.1| cysteine protease-like protein [Arabidopsis thaliana]
          Length = 377

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 87/302 (28%), Positives = 142/302 (47%), Gaps = 19/302 (6%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL 97
           + V +F  +  ++ + Y    E+K RF  FK++    D    T+         + Q   L
Sbjct: 54  RHVLSFSRFTHRYGKKYQSVEEMKLRFSVFKEN---LDLIRSTNKKGLSYKLSLNQFADL 110

Query: 98  RLTGKEKERLEADRERVKKFLNERK--KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
                ++ +L A +          K  +  +P + DWR+  +  ++PV+ QG CGSCW F
Sbjct: 111 TWQEFQRYKLGAAQNCSATLKGSHKITEATVPDTKDWREDGI--VSPVKEQGHCGSCWTF 168

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QYGLESQAD 212
           +TT  LE+           LS+ QLV+C     N  C+GG    AFEY+K   GL+++  
Sbjct: 169 STTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEA 228

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGVD----HMMHLLQSGPIGVYLNHRLIE 267
           YPY  K+     C +  +   V V+D+  +T G +    H + L++   +   + H    
Sbjct: 229 YPYTGKDG---GCKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEF-R 284

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            Y       N     P  ++HAV  VGYG ++ +  W+++NSWG    D+GYF++E G N
Sbjct: 285 FYKKGVFTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKN 344

Query: 328 AC 329
            C
Sbjct: 345 MC 346


>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
 gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
          Length = 479

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 108/344 (31%), Positives = 161/344 (46%), Gaps = 58/344 (16%)

Query: 27  YVWRDLAYD-----SIKQVDA-FKTYIVKWNRTYTDD--------NEIKTRFEYFK---- 68
           Y   DL YD     S +++ A F +++++  ++Y D+         E  TR+  FK    
Sbjct: 35  YSILDLGYDPQDLSSEERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLR 94

Query: 69  ----QDGKETDEYYGTSGSSDRSPQEI-LQRTGLRLTGKEKERLEADRERVKKFLNERKK 123
               ++ K    + G +  +D + +E   QR G R           DR R +    E + 
Sbjct: 95  FIHGENEKNQGYFLGLNAFADLTNEEFRAQRHGGRF----------DRSRERTSHEEFRY 144

Query: 124 GP-----LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKS 178
           G      LP S+DWR+    V   V+ QG CGSCWAF+  A +E    L    L  LS+ 
Sbjct: 145 GSVQLKDLPDSIDWREKGAVV--GVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQ 202

Query: 179 QLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFV 236
           +LV+CD G +  CNGG +D AF +V K  GL+++ADYPY+       RC   K  AKV  
Sbjct: 203 ELVDCDKGEDEGCNGGLMDYAFGFVIKNGGLDTEADYPYKGYGT---RCDRSKMNAKVVT 259

Query: 237 QDTWVTSGVDHMMHLLQS---GPIGVYLNH--RLIESYDGNPIRRNDWACNPHKLDHAVA 291
            D +    V+    LL++    P+ V ++     ++ Y           C    LDH V 
Sbjct: 260 IDGYEDVPVNDETALLKAVAHQPVSVAIDAGGSSMQFYRSGIFTGR---CGT-DLDHGVT 315

Query: 292 IVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER----GANACGI 331
            VGYG+++G   WI++NSWG    + GY ++ R     A  CGI
Sbjct: 316 NVGYGKEDGKAYWIIKNSWGSNWGEKGYVKMARNTGLAAGLCGI 359


>gi|6978723|ref|NP_037288.1| cathepsin L1 preproprotein [Rattus norvegicus]
 gi|55888|emb|CAA68691.1| prepro-cathepsin L [Rattus norvegicus]
          Length = 334

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 117/224 (52%), Gaps = 23/224 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +PK++DWR+     + PV++QG+CGSCWAF+ +  LE Q+ L    L  LS+  LV+C H
Sbjct: 114 IPKTVDWREKGC--VTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH 171

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF+Y+K+  GL+S+  YPY  K+     C Y  E A     DT   
Sbjct: 172 DQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDG---SCKYRAEYA--VANDTGFV 226

Query: 243 SGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
                   L++     GPI V ++  H  ++ Y        +  C+   LDH V +VGYG
Sbjct: 227 DIPQQEKALMKPVATVGPISVAMDASHPSLQFYSSGIYYEPN--CSSKDLDHGVLVVGYG 284

Query: 297 ----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
               + N    W+V+NSWG      GY +I +   N CG+ + A
Sbjct: 285 YEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAA 328


>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
           pulchellus]
          Length = 331

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 118/221 (53%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPK++DWR  K   + PV++QG+CGSCW+F+TT  LE Q       L  LS+  L++C  
Sbjct: 115 LPKTVDWR--KKGAVTPVKNQGQCGSCWSFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSR 172

Query: 186 --GNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
             GN  C GG +D AF+Y+K   G++++  YPY   + +   C + K  + V   DT   
Sbjct: 173 SFGNNGCEGGLMDYAFKYIKANKGIDTEQSYPYNATDGV---CHFNK--SAVGATDTGFV 227

Query: 241 -VTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            +  G ++ +   +   GP+ V ++  H   + Y        +  C+  +LDH V +VGY
Sbjct: 228 DIPEGDENKLKKAVATVGPVSVAIDASHESFQFYSEGVYDEPE--CDSEQLDHGVLVVGY 285

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G K+G   W+V+NSWG    D GY  + R   N CGI S A
Sbjct: 286 GTKDGQDYWLVKNSWGTTWGDGGYIYMSRNKDNQCGIASAA 326


>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 101/318 (31%), Positives = 151/318 (47%), Gaps = 37/318 (11%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K ++ F+++I +  + Y    E   RFE FK + K  DE        + G +  +D S Q
Sbjct: 43  KLIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQ 102

Query: 90  EILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E   +  GL++    +      RE  ++F    K   LPKS+DWR  K   +  V++QG 
Sbjct: 103 EFKNKYLGLKVDYSRR------RESPEEFT--YKDVELPKSVDWR--KKGAVTQVKNQGS 152

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYG 206
           CGSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AF + V+  G
Sbjct: 153 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENDG 212

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL-- 261
           L  + DYPY  +E     C   KE+ +V     +     +    ++  L + P+ V +  
Sbjct: 213 LHKEEDYPYIMEEGT---CEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEA 269

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           + R  + Y G      D  C    LDH VA VGYG   G+    V+NSWG    + GY +
Sbjct: 270 SGRDFQFYSGGVF---DGHCGS-DLDHGVAAVGYGTAKGVDYITVKNSWGSKWGEKGYIR 325

Query: 322 IERGA----NACGIESYA 335
           + R        CGI   A
Sbjct: 326 MRRNIGKPEGICGIYKMA 343


>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
           [Vitis vinifera]
          Length = 374

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 96/317 (30%), Positives = 150/317 (47%), Gaps = 47/317 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           ++ ++ K  + Y    E + RFE FK + K  DE+     + +R+ +  L R    LT +
Sbjct: 46  YQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEH----NAQNRTYKVGLNRFA-DLTNE 100

Query: 103 EKER--LEADRERVKKFLNERKKGP---------LPKSLDWRQSKVKVLNPVESQGRCGS 151
           E     L    +  ++F   +   P         LP+S+DWR++    +NPV+ Q  CGS
Sbjct: 101 EYRAIYLGTRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGA--VNPVKDQRSCGS 158

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECD-HGNLNCNGGNIDVAFEY-VKQYGLES 209
           CWAF+T A +E    ++   L  LS+ +LV+CD   ++ CNGG +D AF++ +K  GL++
Sbjct: 159 CWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKNGGLDT 218

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTW----------VTSGVDHMMHLLQSGPIGV 259
           + DYPY   +     C    + +KV   D +          +   V H     Q   + V
Sbjct: 219 EKDYPYTGFDG---ECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAH-----QPVSVAV 270

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGY 319
               R ++ Y           C    LDH +  VGYG +NG   WIVRNSWG    ++GY
Sbjct: 271 EAGGRALQLYVSGIFTGE---CGT-ALDHGIVAVGYGTENGTDYWIVRNSWGSSWGENGY 326

Query: 320 FQIERG-----ANACGI 331
            ++ER      +  CGI
Sbjct: 327 IRMERNMADAFSGKCGI 343


>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
          Length = 359

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 101/336 (30%), Positives = 148/336 (44%), Gaps = 76/336 (22%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE-------------------------- 76
           ++ ++VK ++ Y    E   RFE FK +    DE                          
Sbjct: 35  YEEWLVKHHKVYNGLGEKDQRFEIFKDNLGFIDEHNAQNYTYKVGLNKFADTTNEEYRNM 94

Query: 77  YYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSK 136
           Y GT   + R+  +I   TG R      +RL                   P  +DWR SK
Sbjct: 95  YLGTKNDAKRNVMKIKITTGHRYAFNSGDRL-------------------PVHVDWR-SK 134

Query: 137 VKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNI 195
             V + ++ QG CGSCWAF+T A +E+   ++   L  LS+ +LV+CD   N  CNGG +
Sbjct: 135 GAVAH-IKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFNEGCNGGLM 193

Query: 196 DVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW----------VTSG 244
           D AFE+ V+  G++++ DYPY+  E    RC   ++ AKV   D +          +   
Sbjct: 194 DYAFEFIVENGGIDTEQDYPYKGFEG---RCDPTRKNAKVVSIDGYEDVPAYNENALKKA 250

Query: 245 VDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTW 304
           V H     Q   + +    R ++ Y           C  + LDH V +VGYG +NG+  W
Sbjct: 251 VFH-----QPVSVAIEAGGRALQLYQSGVFTGR---CGTN-LDHGVVVVGYGFENGVDYW 301

Query: 305 IVRNSWGDIGPDHGYFQIERGA-----NACGIESYA 335
           +VRNSWG    + GYF++ER         CGI   A
Sbjct: 302 LVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQA 337


>gi|146215994|gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]
          Length = 358

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 81/220 (36%), Positives = 112/220 (50%), Gaps = 18/220 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP+  DWR S +  ++PV+ QG CGSCW F+TT  LE+           LS+ QLV+C  
Sbjct: 141 LPEMKDWRVSGI--VSPVKDQGHCGSCWTFSTTGALEAAYKQAFGKGISLSEQQLVDCAG 198

Query: 186 G--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-V 241
              N  C+GG    AFEYVK   GL+++  YPY  K      C +  E   V V D+  +
Sbjct: 199 AFNNFGCSGGLPSQAFEYVKYNGGLDTEEAYPYTGKNG---ECKFSSENVGVQVLDSVNI 255

Query: 242 TSGV-DHMMHLLQ-SGPIGVYL----NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
           T G  D + H +    P+ V        RL   Y       +     P  ++HAV  VGY
Sbjct: 256 TLGAEDELKHAVAFVRPVSVAFQVVNGFRL---YKEGVYTSDTCGRTPMDVNHAVLAVGY 312

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           G +NG+  W+++NSWG    D GYF++E G N CG+ + A
Sbjct: 313 GVENGVPYWLIKNSWGADWGDSGYFKMEMGKNMCGVATCA 352


>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
 gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
 gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
          Length = 498

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 95/301 (31%), Positives = 146/301 (48%), Gaps = 28/301 (9%)

Query: 40  VDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLR- 98
            + FK +  K  + Y    E + R   FK++ K   E  G   S       +  + GL  
Sbjct: 47  TEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSG------LEHKVGLNK 100

Query: 99  ---LTGKEKERLEADRERVKKFLNERKK------GPLPKSLDWRQSKVKVLNPVESQGRC 149
              L+ +E   +   + +    + E++K         P SLDWR   V  +  V+ QG C
Sbjct: 101 FADLSNEEFREMYLSKVKKPITIEEKRKHRHLQTCDAPSSLDWRNKGV--VTAVKDQGDC 158

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH-GNLNCNGGNIDVAFEYV-KQYGL 207
           GSCW+F+TT  +E+  A++   L  LS+ +LV+CD   N  C GG++D AF++V    G+
Sbjct: 159 GSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGI 218

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS--GPIGVYLNHRL 265
           +++ADYPY     +   C   KE+ KV   + +V         L  +   PI V ++   
Sbjct: 219 DTEADYPYTG---VDGTCNTAKEEKKVVSIEGYVDVDPSDSALLCATVQQPISVGMDGSA 275

Query: 266 I--ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
           +  + Y G  I   D + +P+ +DHA+ IVGYG +N    WIV+NSWG      GYF I 
Sbjct: 276 LDFQLYTGG-IYDGDCSGDPNDIDHAILIVGYGSENDEDYWIVKNSWGTEWGMEGYFYIR 334

Query: 324 R 324
           R
Sbjct: 335 R 335


>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
          Length = 358

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 144/308 (46%), Gaps = 29/308 (9%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-RLTG 101
           F  +  ++ + Y    E+K RF  F ++ +          S++R  + +  + G+ R   
Sbjct: 58  FARFAHRYGKRYQSVEEMKLRFAIFMENLELIR-------STNR--RGLPYKLGINRYAD 108

Query: 102 KEKERLEADRERVKKFLNERKKG-------PLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
              E   A R    +  +   KG        LPK+ DWR+  +  ++PV+ QG CGSCW 
Sbjct: 109 MSWEEFRASRLGAAQNCSATLKGNHKMTDELLPKTKDWREDGI--VSPVKDQGSCGSCWT 166

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLESQA 211
           F+TT  LE+           LS+ QLV+C +   N  CNGG    AFEY+K   GL+++ 
Sbjct: 167 FSTTGALEAAYTQATGKGISLSEQQLVDCAYAFNNFGCNGGLPSQAFEYIKYNGGLDTEE 226

Query: 212 DYPYRNKENITFRCTYEKEKAKV-FVQDTWVTSGV-DHMMHLLQ-SGPIGVYLNH-RLIE 267
            YPY         C ++ E   V  V+   +T G  D ++H +    P+ +         
Sbjct: 227 SYPYAGVNGF---CHFKPENVGVKVVESVNITLGAEDELLHAVGLVRPVSIAFEVVSGFR 283

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN 327
            Y G     +        ++HAV  VGYG +NG+  W+++NSWG+     GYF++E G N
Sbjct: 284 FYKGGVYTSDTCGRTQMDVNHAVLAVGYGVENGVPYWLIKNSWGEEWGVDGYFKMELGKN 343

Query: 328 ACGIESYA 335
            CGI + A
Sbjct: 344 MCGIATCA 351


>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
 gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
 gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
 gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
          Length = 437

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 77/223 (34%), Positives = 119/223 (53%), Gaps = 24/223 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P S+DWR  K   +  V+ QG CG+CW+F+ T  +E    ++   L  LS+ +L++CD 
Sbjct: 118 VPDSVDWR--KKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDK 175

Query: 186 G-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
             N  CNGG +D AFE+V K +G++++ DYPY+ ++     C  +K K KV   D++  +
Sbjct: 176 SYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGT---CKKDKLKQKVVTIDSY--A 230

Query: 244 GVDH-----MMHLLQSGPI--GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           GV       +M  + + P+  G+  + R  + Y           C+   LDHAV IVGYG
Sbjct: 231 GVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSGIFSG---PCST-SLDHAVLIVGYG 286

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA----NACGIESYA 335
            +NG+  WIV+NSWG      G+  ++R        CGI   A
Sbjct: 287 SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLA 329


>gi|343474734|emb|CCD13687.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 524

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 90/311 (28%), Positives = 144/311 (46%), Gaps = 28/311 (9%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           +Q  AFK    K++R+Y D  E   RF  FKQ  +   E         +G +  SD SP+
Sbjct: 118 QQFAAFKQ---KYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPE 174

Query: 90  EILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
           E        L G +     A  +R +K +N    G  P ++DWR  K   + PV+ QG C
Sbjct: 175 EF---RATYLNGAK--YYAAALKRPRKVVNV-STGKAPPAVDWR--KKGAVTPVKDQGSC 226

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQYG 206
           GSCWAFA    +E Q  +    L  LS+  LV CD    NC GG  D AF+++    +  
Sbjct: 227 GSCWAFAAIGNIEGQWKIAGHELTSLSEQMLVSCDTTEDNCGGGFADRAFKWIVSSNKGN 286

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHR 264
           + ++  YPY + +     C    +     +         ++ +   L ++GP+ + ++  
Sbjct: 287 VFTERSYPYASIDGYVPPCNKSGKVVGAKISGHINLPKDENAIAEWLARNGPVAIAVDAS 346

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
               Y G  +     +C+   ++H V +VGY + +    WI++NSW     + GY +IE+
Sbjct: 347 TFLDYKGGVLT----SCSSKHVNHEVLLVGYNDTSKPPYWIIKNSWDKEWGEEGYIRIEK 402

Query: 325 GANACGIESYA 335
           G N C ++ YA
Sbjct: 403 GTNLCLMKEYA 413


>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 86/253 (33%), Positives = 121/253 (47%), Gaps = 21/253 (8%)

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
           +   TG R+ G  K        +   FL     G LPK++DWR      + PV+ QG+CG
Sbjct: 89  VAMMTGFRVNGTSKA------AKGSTFLPPNNVGKLPKTVDWRTKGY--VTPVKDQGQCG 140

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEY-VKQYGLES 209
           SCWAF+ T  LE Q       L  LS+  LV+C   N  CNGG +D AF+Y +   G+++
Sbjct: 141 SCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSDKNYGCNGGLMDRAFQYIIDAGGIDT 200

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH--LLQSGPIGVYLN--HR 264
           +  YPY   +     C ++       V   T VTSG +  +   +   GPI V ++  H 
Sbjct: 201 EESYPYIAMDG---NCHFKTANVGATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHF 257

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK-NGILTWIVRNSWGDIGPDHGYFQIE 323
             + Y       N+  C+   LDH V  VGYG   +G   WIV+NSW +    +GY  + 
Sbjct: 258 SFQLYQSGV--YNEPGCSSTLLDHGVLAVGYGTTIDGTDYWIVKNSWAETWGMNGYIWMS 315

Query: 324 RGA-NACGIESYA 335
           R   N CGI + A
Sbjct: 316 RNKDNQCGIATQA 328


>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
           purpuratus]
          Length = 336

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 76/219 (34%), Positives = 117/219 (53%), Gaps = 16/219 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +P ++DWR      + PV++QG+CGSCWAF+TT  LE Q       L  LS+  LV+C  
Sbjct: 120 VPDTVDWRTKGY--VTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKLVSLSEQNLVDCSR 177

Query: 185 -HGNLNCNGGNIDVAFEYV-KQYGLESQADYPY-RNKENITFRCTYEKEKAKVFVQDTWV 241
             GN+ C GG +D  F+YV   +G++S+  YPY    E   ++ + +  +   F     V
Sbjct: 178 TEGNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDETCHYKASCDSAEVTGFTD---V 234

Query: 242 TSGVDH--MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
           TSG +   M  +   GP+ V ++  H+  + Y+       +  C+  +LDH V +VGYG 
Sbjct: 235 TSGDEQALMEAVASVGPVSVAIDASHQSFQLYESGVYDEPE--CSSSELDHGVLVVGYGT 292

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
             G   W+V+NSWG+     GY ++ R  +N CGI + A
Sbjct: 293 DGGKDYWLVKNSWGETWGLSGYIKMSRNKSNQCGIATSA 331


>gi|123484966|ref|XP_001324382.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121907264|gb|EAY12159.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 310

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 88/268 (32%), Positives = 129/268 (48%), Gaps = 27/268 (10%)

Query: 84  SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           S  +P E     G + +G         R      +NE+  G  P S DWR     V+ PV
Sbjct: 58  SHLTPSEYHSLLGYKNSG---------RNHKYSIINEKNAGSHPDSFDWRDHP-GVIGPV 107

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEY-V 202
           + Q  CGSCWAF+T   LES  A+     Y LS+  LV+C      CNGG    A+++ +
Sbjct: 108 KDQNDCGSCWAFSTIFGLESNWAVKHNAAYILSEQNLVDCCSSAAGCNGGFPADAWDWMI 167

Query: 203 KQYGLES--QADYPYRNKENITFRCTYEKEKAK-------VFVQDTWVTSGVDHMMHLLQ 253
            + G ++  + DYPY ++E     C + K+KA        V V D       + + +L  
Sbjct: 168 DEQGGKTMLEVDYPYTSQEGT---CKWNKKKAAPPQVKGYVEVADCDENDLAEKIANL-- 222

Query: 254 SGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDI 313
            GP  + ++  L           +D  C+   LDHAV  VGYG +NG   WIVRNSWG++
Sbjct: 223 -GPASIAIDASLYSFMMYQSGIYDDPKCSSMNLDHAVGCVGYGVENGAKYWIVRNSWGEM 281

Query: 314 GPDHGYFQIERGA-NACGIESYAYLASV 340
             + GY ++ R   N CG+ + A++A V
Sbjct: 282 WGEKGYIRMARDKHNQCGVATEAFIAQV 309


>gi|340380717|ref|XP_003388868.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
          Length = 337

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 149/316 (47%), Gaps = 33/316 (10%)

Query: 39  QVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQ---EILQRT 95
             ++F  ++ K+ +TY+   E   R   +      ++ YY    + +  P    E+ Q +
Sbjct: 31  MAESFNMWMKKYEKTYSTMEEYNERLRVYT-----SNYYYIEQLNKEHGPHTEYELNQFS 85

Query: 96  GL------RLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
            L      ++   E +   A     +K +N R     P ++DWR+  V  + PV+ QG+C
Sbjct: 86  DLTFAEFKKIYLTEPQHCSATNGNFQKPVNARD----PVAVDWREKNV--ITPVKDQGKC 139

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVK-QYG 206
           GSCW F+TT  LE+  A+    L  LS+ QLV+C     N  CNGG    AFEY+K   G
Sbjct: 140 GSCWTFSTTGCLEAHHAIKTGQLISLSEQQLVDCAGAFNNHGCNGGLPSQAFEYIKYNGG 199

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VTSGV--DHMMHLLQSGPIGVYLN- 262
           +ES+++Y Y  K+ +   C +        V D   +T     D    +   GP+ +    
Sbjct: 200 IESESNYNYTAKDGV---CRFNSSLVAATVSDVVNITKDAEGDIGTAVANVGPVSIAFEV 256

Query: 263 HRLIESYDGNPIRRNDWAC--NPHKLDHAVAIVGYGE-KNGILTWIVRNSWGDIGPDHGY 319
            +  + Y     +     C  +P K++HAV +VGY + K G   WIV+NSW       GY
Sbjct: 257 TKSFQHYKKGVYQGEIEVCSQSPDKVNHAVLVVGYNQTKLGEEYWIVKNSWSASWGMDGY 316

Query: 320 FQIERGANACGIESYA 335
           F I RG NACG+ + A
Sbjct: 317 FWIRRGHNACGLATCA 332


>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
 gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 99/315 (31%), Positives = 150/315 (47%), Gaps = 34/315 (10%)

Query: 40  VDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQEI 91
           +D F+++I K  + Y    E   RFE FK +    DE        + G +  +D S +E 
Sbjct: 30  IDLFESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKKVVNYWLGLNEFADLSHEEF 89

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
             +  L L      R E   E   K ++      +PKS+DWR  K   +  V++QG CGS
Sbjct: 90  KNKY-LGLNVDLSNRRECSEEFTYKDVS-----SIPKSVDWR--KKGAVTDVKNQGSCGS 141

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYGLES 209
           CWAF+T A +E    ++   L  LS+ +LV+CD   N  CNGG +D AF Y +   GL  
Sbjct: 142 CWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTYNNGCNGGLMDYAFAYIISNGGLHK 201

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYLNH--R 264
           + DYPY  +E     C   K +++V     +     +  + ++  L + P+ V ++   R
Sbjct: 202 EEDYPYIMEEGT---CEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLSVAIDASGR 258

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
             + Y G      D  C   +LDH VA VGYG   G+   +V+NSWG    + G+ +++R
Sbjct: 259 DFQFYSGGVF---DGHCGT-ELDHGVAAVGYGSAKGLDFIVVKNSWGSKWGEKGFIRMKR 314

Query: 325 G----ANACGIESYA 335
                A  CGI   A
Sbjct: 315 NTGKPAGLCGINKMA 329


>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
          Length = 338

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/246 (31%), Positives = 125/246 (50%), Gaps = 16/246 (6%)

Query: 99  LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
           + G + ++  + R        E     +P+S+DWR+     + PV+ QG+CGSCWAF++T
Sbjct: 95  MNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGA--ITPVKDQGQCGSCWAFSST 152

Query: 159 AILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYPY 215
             LE Q       L  LS+  L++C   +GN  CNGG +D AF+Y+K   G++++  YPY
Sbjct: 153 GALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPY 212

Query: 216 RNKENITFRCTYE-KEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYD 270
             ++ +   C Y  + +  V      + SG +  +   +   GP+ V ++  H   + Y 
Sbjct: 213 EAEDGV---CRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYS 269

Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG-ANAC 329
                    +C+   LDH V +VGYG  NG   W+V+NSW +   D GY +I R   N C
Sbjct: 270 KGXYYEP--SCDSDDLDHGVLVVGYGSDNGEDYWLVKNSWSEHWGDEGYIKIARNRKNHC 327

Query: 330 GIESYA 335
           G+ + A
Sbjct: 328 GVATAA 333


>gi|1809288|gb|AAC47721.1| secreted cathepsin L 2 [Fasciola hepatica]
          Length = 326

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 119/224 (53%), Gaps = 12/224 (5%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
           K  +P+S+DWR      +  V++QG+CGSCWAF+TT  +E Q    ++     S+ QLV+
Sbjct: 105 KLAVPESIDWRD--YYYVTEVKNQGQCGSCWAFSTTGAVEGQFRKNERASASFSEQQLVD 162

Query: 183 C--DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           C  D GN  C GG ++ A+EY+K  GLE+++ YPY+  E     C Y+   A   V   +
Sbjct: 163 CPRDLGNYGCGGGYMENAYEYLKHNGLETESYYPYQAVEG---PCQYDGRLAYAKVTGYY 219

Query: 241 VTSGVDH--MMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
                D   + +L+ + GP  V L+         + I ++   C P +L HAV  VGYG 
Sbjct: 220 TVHSGDEIELKNLVGTEGPAAVALDADSDFMMYQSGIYQSQ-TCLPDRLTHAVLAVGYGS 278

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           ++G   WIV+NSWG    + GY +  R   N CGI S A +  V
Sbjct: 279 QDGTDYWIVKNSWGTWWGEDGYIRFARNRGNMCGIASLASVPMV 322


>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
 gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
          Length = 467

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 80/227 (35%), Positives = 117/227 (51%), Gaps = 31/227 (13%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           LP+S+DWR+     + PV++QG+CGSCWAF+  + +ES   ++   +  LS+ +LVEC  
Sbjct: 145 LPESVDWREKGA--VAPVKNQGQCGSCWAFSAVSSVESINQIVTGEMVTLSEQELVECST 202

Query: 184 DHGNLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
           D GN  CNGG +D AF ++ K  G++++ DYPY+    +  +C   +  AKV   D +  
Sbjct: 203 DGGNSGCNGGLMDAAFNFIIKNGGIDTEDDYPYKA---VDGKCDINRRNAKVVSIDAFED 259

Query: 241 --------VTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAI 292
                   +   V H     Q   + +    R  + Y          +C  + LDH V  
Sbjct: 260 VPENDEKSLQKAVAH-----QPVSVAIEAGGRQFQLYKSGVF---SGSCTTN-LDHGVVA 310

Query: 293 VGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYA 335
           VGYG +NG   WIVRNSWG    + GY ++ER  NA    CGI   A
Sbjct: 311 VGYGTENGKDYWIVRNSWGPKWGEAGYIRMERNINATTGKCGIAMMA 357


>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 83/221 (37%), Positives = 115/221 (52%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPK++DWR  K   + PV+ QG+CGSCWAF+ T  LE Q  L    L  LS+  LV+C  
Sbjct: 116 LPKAVDWR--KKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
             GN  C GG ++ AF+Y+K   G++++  YPY   E +   C ++KE   V   DT   
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPY---EAVDGECRFKKED--VGATDTGYV 228

Query: 241 -VTSGV--DHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            + +G   D    +   GPI V ++  H   + Y        +  C+   LDH V +VGY
Sbjct: 229 EIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPE--CSSEDLDHGVLVVGY 286

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIER-GANACGIESYA 335
           G K G   W+V+NSW +   D GY  + R   N CGI S A
Sbjct: 287 GVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327


>gi|157862757|gb|ABV90501.1| cathepsin L, partial [Fasciola gigantica]
          Length = 244

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 118/224 (52%), Gaps = 18/224 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           +P  +DWR S    +  V+ Q  CGSCWAF+TT  +E Q           S+ QLV+C  
Sbjct: 26  VPDRIDWRDSGY--VTKVKDQEDCGSCWAFSTTGTMEGQFMKNIGFNVSFSEQQLVDCSS 83

Query: 184 DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
           D GN  C GG +++A+EY++++GLE ++ YPYR  E     C Y++      V   ++  
Sbjct: 84  DFGNNGCRGGLMEIAYEYLRRFGLEIESTYPYRAVEG---PCRYDRRLGVAKVTGYYIVH 140

Query: 244 GVDH--MMHLLQ-SGPIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVAIVGYGE 297
             D   + +L+   GP  V L+   +ES D    R   +    C+P +L+H V  VGYG 
Sbjct: 141 SGDEVELQNLVGIEGPAAVALD---VES-DFVMYRSGIYQSQTCSPDRLNHGVLAVGYGT 196

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           ++G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 197 QSGTDYWIVKNSWGTWWGEGGYIRMVRNRGNMCGIASMASLPMV 240


>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 83/221 (37%), Positives = 115/221 (52%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPK++DWR  K   + PV+ QG+CGSCWAF+ T  LE Q  L    L  LS+  LV+C  
Sbjct: 116 LPKAVDWR--KKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
             GN  C GG ++ AF+Y+K   G++++  YPY   E +   C ++KE   V   DT   
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPY---EAVDGECRFKKED--VGATDTGYV 228

Query: 241 -VTSGV--DHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            + +G   D    +   GPI V ++  H   + Y        +  C+   LDH V +VGY
Sbjct: 229 EIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPE--CSSEDLDHGVLVVGY 286

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIER-GANACGIESYA 335
           G K G   W+V+NSW +   D GY  + R   N CGI S A
Sbjct: 287 GVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327


>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 476

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 104/311 (33%), Positives = 151/311 (48%), Gaps = 36/311 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           ++ ++VK  + Y    E + RF+ FK + +  D++   + + DR+ +  L R    LT +
Sbjct: 59  YEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDH---NSAEDRTYKLGLNRFA-DLTNE 114

Query: 103 E------KERLEADRERVKKFLNE---RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           E        +++ +R   K   N    R    LP S+DWR  K   + PV+ QG CGSCW
Sbjct: 115 EYRAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLPDSVDWR--KEGAVPPVKDQGGCGSCW 172

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQA 211
           AF+    +E    ++   L  LS+ +LV+CD G N  CNGG +D AFE++    G++S  
Sbjct: 173 AFSAIGAVEGINKIVTGELISLSEQELVDCDTGYNQGCNGGLMDYAFEFIINNGGIDSDE 232

Query: 212 DYPYRNKENITFRC-TYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGPIGVYL--NHRLI 266
           DYPYR  +    RC TY K    V + D       D +     + + P+ V +    R  
Sbjct: 233 DYPYRGVDG---RCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREF 289

Query: 267 ESY-DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
           + Y  G    R   A     LDH V  VGYG   G   WIVRNSWG    + GY ++ER 
Sbjct: 290 QLYVSGVFTGRCGTA-----LDHGVVAVGYGTAKGHDYWIVRNSWGSSWGEDGYIRLERN 344

Query: 326 -ANA----CGI 331
            AN+    CGI
Sbjct: 345 LANSRSGKCGI 355


>gi|33333700|gb|AAQ11968.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 70/212 (33%), Positives = 109/212 (51%), Gaps = 13/212 (6%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC---DH 185
           ++DWR+     + P + Q  CGSCWAF+    +E Q      TL  LS  +LV+C   D+
Sbjct: 115 AVDWREEGA--VTPAKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDY 172

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
           GN  C GG +  AF++V+  G++++  YPY  +     R + +K    V    T+V    
Sbjct: 173 GNNGCKGGLMGQAFDFVQDEGIQTEESYPYEGR-----RSSCKKSGEYVTKVKTYVFPLD 227

Query: 246 DHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHK-LDHAVAIVGYGEKNGIL 302
           +  M   +   GP+ V +    +  YD   +       N  + L+H V +VGYG +NG+ 
Sbjct: 228 EQEMARTVAAKGPVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYGSENGVD 287

Query: 303 TWIVRNSWGDIGPDHGYFQIERGANACGIESY 334
            WIV+NSWG    + GYF++++   ACGI  Y
Sbjct: 288 YWIVKNSWGADWGEKGYFRLKKDVKACGIGYY 319


>gi|4581057|gb|AAD24589.1|AF139913_1 cysteine protease [Trypanosoma congolense]
          Length = 440

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 90/306 (29%), Positives = 142/306 (46%), Gaps = 30/306 (9%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           +Q  AFK    K++R+Y D  E   RF  FKQ+ +   E         +G +  SD SP+
Sbjct: 39  QQFAAFKQ---KYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPE 95

Query: 90  EILQRTGLRLT-GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E       R T     E   A  +R +K +N    G  P ++DWR  K   + PV+ QG+
Sbjct: 96  E------FRATYHNGAEYYAAALKRPRKVVN-VSTGKAPPAIDWR--KKGAVTPVKDQGQ 146

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQY 205
           C S WAF+    +E Q  +    L  LS+  LV CD  +  C GG  D AF+++    + 
Sbjct: 147 CHSSWAFSAIGNIEGQWKIAGHELTSLSEQMLVSCDTNDFGCGGGFSDPAFKWIVSSNKG 206

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNH 263
            + ++  YPY +       C    +     ++D       ++ +   L + GP+ + ++ 
Sbjct: 207 NVFTEQSYPYASGGGNVPTCDKSGKVVGAKIRDRVDLPRDENAIAEWLAKKGPVAIAVDA 266

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
              +SY G  +     +C    LDH V +VGY + +    WI++NSWG    + GY +IE
Sbjct: 267 TSFQSYTGGVLT----SCISEHLDHGVLLVGYDDTSKPPYWIIKNSWGKGWGEEGYIRIE 322

Query: 324 RGANAC 329
           +G N C
Sbjct: 323 KGTNQC 328


>gi|108735858|gb|ABG00260.1| cathepsin L1 [Fasciola hepatica]
          Length = 219

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/216 (36%), Positives = 115/216 (53%), Gaps = 12/216 (5%)

Query: 131 DWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH--GNL 188
           DWR+S    +  V+ QG CGSCWAF+TT  ++ Q    ++T    S+ QLV+C    GN 
Sbjct: 6   DWRESGY--VTEVKDQGNCGSCWAFSTTGTMKGQYMKNERTSISFSEQQLVDCSRPWGNN 63

Query: 189 NCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH- 247
            C GG ++ A+EY+KQ+GLE+++ YPY   E     C Y+++     V   +     D  
Sbjct: 64  GCGGGLMENAYEYLKQFGLETESSYPYSAVEG---PCRYDRKLGVAKVTGYYTVHSGDEV 120

Query: 248 -MMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWI 305
            + +L+   GP  V L+  L      + I  +   C+P +L H V  VGYG ++G   WI
Sbjct: 121 ELQNLVGGEGPPAVALDAELDFMMYRSGIYXSQ-TCSPDRLSHGVLAVGYGTQDGTDYWI 179

Query: 306 VRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           V+NSWG    + GY ++ R   N CGI S A +  V
Sbjct: 180 VKNSWGTWWGEDGYIRMVRNRGNMCGIASLASVPMV 215


>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
 gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
          Length = 339

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 82/220 (37%), Positives = 117/220 (53%), Gaps = 21/220 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +PKS+DWR      +  V+ QG CGSCWAF++T  LE Q      TL  LS+  LV+C  
Sbjct: 122 IPKSVDWRSKGA--VTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCST 179

Query: 185 -HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
            +GN  CNGG +D AF Y+K   G++++  YPY   E I   C +   KA +   D    
Sbjct: 180 KYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY---EGIDDSCHF--NKATIGATDRGSV 234

Query: 241 -VTSGVDHMM--HLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            +  G +  M   +   GP+ V ++  H   + Y       N+  C+P  LDH V +VGY
Sbjct: 235 DIPQGDEKKMAEAVATIGPVSVAIDASHESFQFYSEGIY--NEPQCDPQNLDHGVLVVGY 292

Query: 296 G-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIES 333
           G +++G   W+V+NSWG    D G+ ++ R A N CGI S
Sbjct: 293 GTDESGQDYWLVKNSWGTTWGDKGFIKMARNADNQCGIAS 332


>gi|410904753|ref|XP_003965856.1| PREDICTED: cathepsin S-like [Takifugu rubripes]
          Length = 334

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 81/234 (34%), Positives = 123/234 (52%), Gaps = 13/234 (5%)

Query: 109 ADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALL 168
           ++R+R +      +   LP+++DWR   +  +  V+ QG CGSCWAF+    LE Q+A  
Sbjct: 102 SERQRGQSIFVTSEGADLPQTVDWRDKGL--VTSVKKQGSCGSCWAFSAAGALEGQLAKT 159

Query: 169 KKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRC 225
              L  LS   LV+C   +GN  CNGG +  AF+YV    G++S+A YPYR +     +C
Sbjct: 160 TGRLVDLSPQNLVDCSGKYGNHGCNGGYMHRAFQYVIDNQGIDSEASYPYRGQVQ---QC 216

Query: 226 TYEKE-KAKVFVQDTWVTSGVDHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACN 282
            Y    +A    Q  ++T G +  +   +   GPI V ++ +  + Y       +D +C+
Sbjct: 217 HYNPAFRAANCSQYRFLTQGDEGNLQAAVASIGPISVAIDAKQPKFYFYKSGVYDDPSCS 276

Query: 283 PHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN-ACGIESYA 335
              ++HAV  VGYG  NG   W+V+NSWG    D GY ++ R  N  CGI  +A
Sbjct: 277 -QTINHAVLAVGYGTLNGQDYWLVKNSWGVKFGDKGYIRMVRNKNDQCGIAQFA 329


>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 101/318 (31%), Positives = 153/318 (48%), Gaps = 37/318 (11%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           K ++ F++++ +  + Y    E   RF+ FK + K  DE        + G +  +D S Q
Sbjct: 42  KLIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQ 101

Query: 90  EILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E   +  GL++    +      RE  ++F    K   LPKS+DWR  K   +  V++QG 
Sbjct: 102 EFKNKYLGLKVDYSRR------RESPEEFT--YKDFELPKSVDWR--KKGAVTQVKNQGS 151

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYG 206
           CGSCWAF+T A +E    ++   L  LS+ +L++CD   N  CNGG +D AF + V+  G
Sbjct: 152 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGG 211

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS---GPIGVYL-- 261
           L  + DYPY  +E     C   KE+ +V     +     ++   LL++    P+ V +  
Sbjct: 212 LHKEEDYPYIMEEGT---CEMTKEETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEA 268

Query: 262 NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
           + R  + Y G      D  C    LDH VA VGYG   G+   IV+NSWG    + GY +
Sbjct: 269 SGRDFQFYSGGVF---DGHCGS-DLDHGVAAVGYGTSKGVNYIIVKNSWGSKWGEKGYIR 324

Query: 322 IERGA----NACGIESYA 335
           + R        CGI   A
Sbjct: 325 MRRNIGKPEGICGIYKMA 342


>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
 gi|255640677|gb|ACU20623.1| unknown [Glycine max]
          Length = 366

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 77/222 (34%), Positives = 112/222 (50%), Gaps = 30/222 (13%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP  +DWR      + P++ QG CGSCWAF+T A +E+   ++      LS+ +LV+CD 
Sbjct: 128 LPVHVDWRMKGA--VAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDR 185

Query: 186 G-NLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--- 240
             N  CNGG +D AFE++ Q G +++  DYPYR  + I   C   K+ AKV   D +   
Sbjct: 186 AYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGI---CDPTKKNAKVVNIDGYEDV 242

Query: 241 -------VTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIV 293
                  +   V H     Q   + +  + R ++ Y           C    LDH V +V
Sbjct: 243 PPYDENALKKAVAH-----QPVSVAIEASGRALQLYQSGVFTGK---CGT-SLDHGVVVV 293

Query: 294 GYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGI 331
           GYG +NG+  W+VRNSWG    + GYF+++R        CGI
Sbjct: 294 GYGSENGVDYWLVRNSWGTGWGEDGYFKMQRNVRTSTGKCGI 335


>gi|303275866|ref|XP_003057227.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461579|gb|EEH58872.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 329

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 91/330 (27%), Positives = 152/330 (46%), Gaps = 44/330 (13%)

Query: 43  FKTYIVKWNRTYTDD-NEIKTRFEYFKQDGKETDEY-------YGTSGSSDRSPQEILQR 94
           F  ++++  +TY  D  E   R E F ++     E        YG +  +D +  E    
Sbjct: 8   FDAFVLEHGKTYASDAKEYAKRLEIFAENMARAKEMSARDGAEYGATPFADLTEDEFASS 67

Query: 95  TGLR--LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
             +R  +     ERL+  R    + L       +P + DWR   +  + PV++QG CGSC
Sbjct: 68  LLMREPIDAARVERLK--RHESSRVLPHLPTENIPLNFDWRA--LGAVTPVKNQGMCGSC 123

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHG---------NLNCNGGNIDVAFEY-V 202
           W+F+ T  +E    +    L  LS+ QLV+CDH          +  C+GG    A  Y V
Sbjct: 124 WSFSATGAVEGAHFVKSGALVSLSEQQLVDCDHTCDPDSGTACDSGCDGGLPANAMAYVV 183

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKE--KAKVFVQDTWVTSGVDHM-MHLLQSGPIGV 259
           K+ GL+++A YPY        RC  +++   A      ++V++    +   L++ GP+ V
Sbjct: 184 KRGGLDAEAAYPYLGARG-DGRCKSKEDGPPAATITNYSFVSADESQIAAALVKHGPLSV 242

Query: 260 YLNHRLIESYDGNPIRRN---DWACNPHKLDHAVAIVGYGEKNGILT--------WIVRN 308
            ++ R ++ Y     RR     WAC+  +LDH V IVG+G +             W+++N
Sbjct: 243 GIDARWMQLY-----RRGVACPWACDKTRLDHGVLIVGFGAEGRAPARGFRREPFWLIKN 297

Query: 309 SWGDIGPDHGYFQIERGANACGIESYAYLA 338
           SWG    + GY++I +   +CG+ +    A
Sbjct: 298 SWGARWGEEGYYKICKDKGSCGVNTMVLAA 327


>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
 gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
          Length = 336

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 83/238 (34%), Positives = 125/238 (52%), Gaps = 26/238 (10%)

Query: 119 NERKKGPL---------PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLK 169
           N+  +GPL         P+ +DWRQ     + PV+ Q +CGSCW+F++T  LE Q+    
Sbjct: 99  NQTSQGPLFMEPSFFAAPQQVDWRQRGY--VTPVKDQKQCGSCWSFSSTGALEGQLFRKT 156

Query: 170 KTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCT 226
             L  +S+  LV+C    GN  CNGG +D AF+YVK+  GL+S+  YPY  ++++  R  
Sbjct: 157 GKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYD 216

Query: 227 YEKEKAKV--FVQDTWVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACN 282
                AK+  FV D    + +  M  +   GP+ V ++  H+ ++ Y          AC+
Sbjct: 217 PRFNVAKITGFV-DIPRGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYER--ACS 273

Query: 283 PHKLDHAVAIVGYGEKNGILT----WIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
             +LDHAV +VGYG +   +     WIV+NSW D   D GY  + +   N CGI + A
Sbjct: 274 SSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMA 331


>gi|1619905|gb|AAB16997.1| thiol protease isoform A, partial [Glycine max]
          Length = 318

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 98/316 (31%), Positives = 147/316 (46%), Gaps = 52/316 (16%)

Query: 59  EIKTRFEYFKQDGKETDEYYGTSGSSDR-----SPQEILQRTGLRLTGKEKERLEADR-- 111
           E  TR    +   ++   + G + SS R      PQ    R    L  +  + L ++   
Sbjct: 1   EFGTRLRELQGQVRQDLRHQGGARSSLRRLQVQPPQSQAARQARPLASRRHQILRSNSGR 60

Query: 112 -------ERVKKFLNERKKGP------LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
                   +  +F    +K P      LPK  DWR  K  V N V+  G CGSCW+F+TT
Sbjct: 61  VPPPVPRPQAVRFPAHAQKAPILPTKDLPKDFDWR-DKGAVTN-VKDLGGCGSCWSFSTT 118

Query: 159 AILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEYVKQYGLES 209
             LE    L    L  LS+ QLV+CDH          +  CNGG ++ AFE ++  G++ 
Sbjct: 119 GALEVSFYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEILQSGGVQK 178

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYLNHRLI 266
           + D PY  ++      T + +K KV   D      +D      +L+++GP+ V +N   +
Sbjct: 179 EKDIPYTGRDG-----TCKFDKTKVAATDLIKRVSLDEEQIAANLVKNGPLAVAINAVFM 233

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGE--------KNGILTWIVRNSWGDI-GPDH 317
           ++Y G       + C  H LDH V +VGYGE        KN    WI++NSWG+  G + 
Sbjct: 234 QTYVGG--VSCPYICGKH-LDHGVLLVGYGEGRYAPIRFKNKPY-WIIKNSWGESWGEND 289

Query: 318 GYFQIERGANACGIES 333
           GY +I RG N CG+++
Sbjct: 290 GYDEICRGRNVCGVDA 305


>gi|21483190|gb|AAL14223.1| cathepsin L [Dictyocaulus viviparus]
          Length = 347

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 89/253 (35%), Positives = 132/253 (52%), Gaps = 26/253 (10%)

Query: 98  RLTGKEKERLEAD--RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
           +L G    RL  D  R+   KFL        P S+DWR+  +  + PV++QG CGSCWAF
Sbjct: 101 KLNGYRHRRLFGDSMRKNGTKFLVPFNV-KAPDSVDWREHNL--VTPVKNQGMCGSCWAF 157

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQAD 212
           + T  LE Q       L  LS+  LV+C   +GN  CNGG +D+AFEY+K  +G++++  
Sbjct: 158 SATGALEGQHFRATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEEG 217

Query: 213 YPYRNKENITFRCTYEKE----KAKVFVQDTWVTSGVDHMMHLLQS--GPIGVYLN--HR 264
           YPY  KE    RC ++K     + + FV    +  G +  + +  +  GPI + ++  HR
Sbjct: 218 YPYVGKE---MRCHFKKRDIGAEDRGFVD---LPEGDEDALKVAVATQGPISIAIDAGHR 271

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIE 323
             + Y        D  C+  +LDH V +VGYG +      WI++NSWG    + GY +I 
Sbjct: 272 SFQLYKKGVYF--DEECSSEELDHGVLLVGYGTDPEAGDYWIIKNSWGTKWGEKGYVRIA 329

Query: 324 RGA-NACGIESYA 335
           R   N CG+ + A
Sbjct: 330 RNRNNHCGVATKA 342


>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
 gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
          Length = 337

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 79/236 (33%), Positives = 119/236 (50%), Gaps = 24/236 (10%)

Query: 104 KERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILES 163
           ++RL A+ E V           LP SLDWRQ     + P++ QG CGSCWAF+  A +ES
Sbjct: 108 QDRLPAEDEDVDV-------SSLPTSLDWRQKGA--VTPIKDQGDCGSCWAFSAIASIES 158

Query: 164 QVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEY-VKQYGLESQADYPYRNKENIT 222
              L  K L  LS+ QL++CD  +  C+GG ++ AF++ VK  G+ ++A YPY       
Sbjct: 159 AHFLATKELVSLSEQQLMDCDTVDAGCDGGLMETAFKFVVKNGGVTTEAAYPYTGSVG-- 216

Query: 223 FRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--NHRLIESYDGNPIRRN 277
             C   K K KV     +        D +M  +   P+ V +  +    ++Y    +   
Sbjct: 217 -SCNANKAKNKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGK 275

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER--GANACGI 331
              C+   LDH V ++GYG + G+  WI++NSWG    + G+ +IER  G   CG+
Sbjct: 276 ---CD-DSLDHGVLLIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIERKDGDGMCGM 327


>gi|7271897|gb|AAF44679.1|AF239268_1 cathepsin L, partial [Fasciola gigantica]
          Length = 219

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 118/224 (52%), Gaps = 18/224 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           +P  +DWR S    +  V+ Q  CGSCWAF+TT  +E Q           S+ QLV+C  
Sbjct: 1   VPDKIDWRDSGY--VTKVKDQEDCGSCWAFSTTGTMEGQFMKNIGFNVSFSEQQLVDCSS 58

Query: 184 DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
           D GN  C GG +++A+EY++++GLE ++ YPYR  E     C Y++      V   ++  
Sbjct: 59  DFGNNGCRGGLMEIAYEYLRRFGLEIESTYPYRAVEG---PCRYDRRLGVAKVTGYYIVH 115

Query: 244 GVDH--MMHLLQ-SGPIGVYLNHRLIESYDGNPIRRNDW---ACNPHKLDHAVAIVGYGE 297
             D   + +L+   GP  V L+   +ES D    R   +    C+P +L+H V  VGYG 
Sbjct: 116 SGDEVELQNLVGIEGPAAVALD---VES-DFVMYRSGIYQSQTCSPDRLNHGVLAVGYGT 171

Query: 298 KNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           ++G   WIV+NSWG    + GY ++ R   N CGI S A L  V
Sbjct: 172 QSGTDYWIVKNSWGTWWGEGGYIRMVRNRGNMCGIASMASLPMV 215


>gi|355681660|gb|AER96816.1| cathepsin L2 [Mustela putorius furo]
          Length = 334

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/244 (35%), Positives = 123/244 (50%), Gaps = 22/244 (9%)

Query: 111 RERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKK 170
           + R  K   E     +PKS+DW Q     + PV++QG+CGSCWAF+ T  LE Q+     
Sbjct: 99  KHRKGKVFQEPLFAEIPKSVDWTQKGY--VTPVKNQGQCGSCWAFSATGALEGQMFRKTG 156

Query: 171 TLYPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTY 227
            L  LS+  LV+C    GN  CNGG +D AF+Y+K   GL+S+  YPY  ++  T  C Y
Sbjct: 157 KLVSLSEQNLVDCSRSQGNQGCNGGLMDFAFQYIKDNGGLDSEESYPYLARD--TDSCNY 214

Query: 228 EKEKAKVFVQDTWVTSGVDHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWAC 281
           + E +     DT           L+++    GPI V ++  H+  + Y        D  C
Sbjct: 215 KPEYS--VANDTGFVDIPQRERALMKAVATVGPISVAIDAGHQSFQFYKSGIYFDPD--C 270

Query: 282 NPHKLDHAVAIVGYG----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAY 336
           +   LDH V +VGYG    + N    WIV+NSWG     +GY ++ +   N CGI + A 
Sbjct: 271 SSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGCNGYVKMAKDQNNHCGIATAAS 330

Query: 337 LASV 340
             +V
Sbjct: 331 YPTV 334


>gi|34811401|pdb|1M6D|A Chain A, Crystal Structure Of Human Cathepsin F
 gi|34811402|pdb|1M6D|B Chain B, Crystal Structure Of Human Cathepsin F
          Length = 214

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 78/217 (35%), Positives = 118/217 (54%), Gaps = 8/217 (3%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P   DWR      +  V+ QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  
Sbjct: 2   PPEWDWRSKGA--VTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKM 59

Query: 187 NLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
           +  C GG    A+  +K  G LE++ DY Y+        C +  EKAKV++QD+   S  
Sbjct: 60  DKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQ---SCQFSAEKAKVYIQDSVELSQN 116

Query: 246 DHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT 303
           +  +   L + GPI V +N   ++ Y     R     C+P  +DHAV +VGYG+++ +  
Sbjct: 117 EQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQRSDVPF 176

Query: 304 WIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
           W ++NSWG    + GY+ + RG+ ACG+ + A  A V
Sbjct: 177 WAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVV 213


>gi|302790930|ref|XP_002977232.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
 gi|300155208|gb|EFJ21841.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
          Length = 353

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 77/216 (35%), Positives = 110/216 (50%), Gaps = 12/216 (5%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P   DWR  K+  ++PV++Q  CGSCW F+TT  LES  A     +  LS+ QLV+C  G
Sbjct: 133 PSKKDWRDDKI--VSPVKNQQHCGSCWTFSTTGALESAHAQATGKMVVLSEQQLVDCAGG 190

Query: 187 --NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             N  CNGG    AFEY++   GL+++  YPY   +    +CTY +      V D   +T
Sbjct: 191 YNNFGCNGGLPSQAFEYIRYNGGLDTEDSYPYTGHDG---KCTYNQNSIGAKVYDVVNIT 247

Query: 243 SGV-DHMMHLLQ-SGPIGV-YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN 299
            G  D ++H +  + P+ + Y   +    Y       N     P  ++HAV  VGY    
Sbjct: 248 EGAEDELIHAVAFNRPVSIAYEVLKDFRFYKSGVYTSNVCGTGPDTVNHAVLAVGYNRDA 307

Query: 300 GILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
            +  WI++NSWG+     GYF +E G N CGI + A
Sbjct: 308 PVPYWIIKNSWGESFGLDGYFYMEMGKNMCGIATCA 343


>gi|343477446|emb|CCD11725.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 90/321 (28%), Positives = 147/321 (45%), Gaps = 45/321 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQEILQR 94
           F  +  K++R+Y D  E   RF  FKQ+ +   E         +G +  SD SP+E    
Sbjct: 41  FAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEE---- 96

Query: 95  TGLRLT-GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
              R T     E   A  +R +K +N    G  P+++DWR  K   + PV+ QG+C S W
Sbjct: 97  --FRATYHNGAEYYAAALKRPRKVVN-VSTGKAPEAVDWR--KKGAVTPVKDQGKCDSSW 151

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYV---KQYGLESQ 210
           AF     +E Q  +    L  LS+  LV CD  +L C  G +D AF+++       + ++
Sbjct: 152 AFTVIGNIEGQWKIAGHELTSLSEQMLVSCDTNDLGCRAGFMDTAFKWIVSPNDGNVFTE 211

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLL-----------QSGPIGV 259
             YPY +       C    +  KV      V + +D  +H+L           ++GP+ +
Sbjct: 212 QSYPYASGGGNVPAC---NKSGKV------VGANIDDHVHILDNENAIAEWLAKNGPVAI 262

Query: 260 YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGY 319
            ++    + Y G  +     +C   +++ A  +VGY + +    WI++NSWG    + GY
Sbjct: 263 AVDATSFQRYTGGVLT----SCISKEVNSAALLVGYDDTSKPPYWIIKNSWGKGWGEEGY 318

Query: 320 FQIERGANACGIESYAYLASV 340
            +IE+G N C ++ Y   A V
Sbjct: 319 IRIEKGTNQCRMKDYVSSAVV 339


>gi|255088003|ref|XP_002505924.1| cysteine endopeptidase [Micromonas sp. RCC299]
 gi|226521195|gb|ACO67182.1| cysteine endopeptidase [Micromonas sp. RCC299]
          Length = 291

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 84/282 (29%), Positives = 131/282 (46%), Gaps = 30/282 (10%)

Query: 78  YGTSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSK 136
           +G +  SD +P E      G +L  ++   + +    +  +        LP   DWR+  
Sbjct: 17  HGVTQFSDLTPTEFASTFLGTKLANEDVAAIRSGMTTLPDYPAHD----LPLEFDWRERG 72

Query: 137 VKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG---------N 187
              + PV++QG CGSCW F+ T  +E    L    L  LS+ QLV+CDH          +
Sbjct: 73  A--VTPVKNQGACGSCWTFSATGAVEGANFLKTGELVSLSEQQLVDCDHTCDPSAPRNCD 130

Query: 188 LNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
             CNGG    A  YV+++GL+++++YPY+  +       +    A V   +   T+    
Sbjct: 131 YGCNGGLPLNAMRYVQKHGLDTESNYPYKGVDGKCASARHGPAAASVSSFNLVSTNETQI 190

Query: 248 MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT---- 303
              LL+ GP+ + ++   +++Y G       W CN   LDH V IVGYG  NG       
Sbjct: 191 AAALLKHGPLSIGIDAAWMQTYVGG--VACPWICNKAGLDHGVLIVGYG-VNGTAPARPW 247

Query: 304 ------WIVRNSWG-DIGPDHGYFQIERGANACGIESYAYLA 338
                 WIV+NSWG + G + GY+ I +   ACG+ +    A
Sbjct: 248 HRRQDYWIVKNSWGPNWGVEGGYYHICKDRAACGLNTMVVAA 289


>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
           heavy chain; Contains: RecName: Full=Cathepsin L light
           chain; Flags: Precursor
 gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
          Length = 339

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 96/332 (28%), Positives = 158/332 (47%), Gaps = 36/332 (10%)

Query: 36  SIKQVDAFK----TYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEI 91
           +I  +D  K    TY ++  + Y ++ E + R + F ++  +  ++         S +  
Sbjct: 17  AISPLDLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLG 76

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNER-----------KKGPLPKSLDWRQSKVKVL 140
           L +    L  + KE +      +++ + ER               +PKS+DWR+     +
Sbjct: 77  LNKYADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGA--V 134

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVA 198
             V+ QG CGSCWAF++T  LE Q       L  LS+  LV+C   +GN  CNGG +D A
Sbjct: 135 TGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 194

Query: 199 FEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMH--LL 252
           F Y+K   G++++  YPY   E I   C +   KA +   DT    +  G +  M   + 
Sbjct: 195 FRYIKDNGGIDTEKSYPY---EGIDDSCHF--NKATIGATDTGFVDIPEGDEEKMKKAVA 249

Query: 253 QSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNS 309
             GP+ V ++  H   + Y       N+  C+   LDH V +VGYG +++G+  W+V+NS
Sbjct: 250 TMGPVSVAIDASHESFQLYSEGVY--NEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNS 307

Query: 310 WGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           WG    + GY ++ R   N CGI + +   +V
Sbjct: 308 WGTTWGEQGYIKMARNQNNQCGIATASSYPTV 339


>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
          Length = 464

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 101/311 (32%), Positives = 143/311 (45%), Gaps = 35/311 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           ++ ++VK  + Y    E + RFE FK +    DE+         G +  +D + +E   R
Sbjct: 47  YEEWLVKHGKNYNALGEKEKRFEIFKDNLGFIDEHNSKNLSFRLGLNRFADLTNEEYRTR 106

Query: 95  -TGLRLTGKEKER-LEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSC 152
             G R+    + R + +   R    + ++    LP+S+DWR+    V   V+ QG CGSC
Sbjct: 107 FLGTRINPNRRNRKVNSQTNRYATRVGDK----LPESVDWRKEGAVV--GVKDQGSCGSC 160

Query: 153 WAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQ 210
           WAF+  A +E    L    L  LS+ +LV+CD   N  CNGG +D AFE++     L  +
Sbjct: 161 WAFSAIAAVEGVNKLATGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINMVALTPE 220

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDTW-----VTSGVDHMMHLLQSGPIGVYLNHRL 265
            DYPYR    I  RC   ++ AKV   D +        G        Q   + V    R 
Sbjct: 221 EDYPYRA---IDGRCDQNRKNAKVVSIDQYEDVPAYDEGALKKAVANQVIAVAVEGGGRE 277

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
            + YD          C    LDH VA VGYG +NG   WIVRNSWG    + GY ++ER 
Sbjct: 278 FQLYDSGVFTGR---CGT-ALDHGVAAVGYGTENGKDYWIVRNSWGGSWGEAGYIRLERN 333

Query: 326 -----ANACGI 331
                +  CGI
Sbjct: 334 LATSKSGKCGI 344


>gi|189525868|ref|XP_001341714.2| PREDICTED: cathepsin L1-like isoform 1 [Danio rerio]
          Length = 336

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/240 (34%), Positives = 126/240 (52%), Gaps = 30/240 (12%)

Query: 119 NERKKGPL---------PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLK 169
           N+  +GPL         P+ +DWRQ     + PV+ Q +CGSCW+F++T  LE Q+    
Sbjct: 99  NQTSQGPLFMEPSFFAAPQQVDWRQRGY--VTPVKDQKQCGSCWSFSSTGALEGQLFRKT 156

Query: 170 KTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCT 226
             L  +S+  LV+C    GN  CNGG +D AF+YVK+  GL+S+  YPY  ++++  R  
Sbjct: 157 GKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYD 216

Query: 227 YEKEKAKV--FVQDTWVTSG--VDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWA 280
                AK+  FV    + SG  +  M  +   GP+ V ++  H+ ++ Y          A
Sbjct: 217 PRFNVAKITGFVD---IPSGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYER--A 271

Query: 281 CNPHKLDHAVAIVGYGEKNGILT----WIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           C+  +LDHAV +VGYG +   +     WIV+NSW D   D GY  + +   N CG+ + A
Sbjct: 272 CSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKA 331


>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 117/224 (52%), Gaps = 18/224 (8%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G LPKS+DWR S +  ++ V+ QG CGSCWAF+TT  LE Q +     L  LS+ QLV+C
Sbjct: 112 GTLPKSVDWRNSHM--VSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDC 169

Query: 184 --DHGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEK-AKVFVQDT 239
             D GN  C GG +D AF+Y+K   GL+++  YPY   ++    C ++        +   
Sbjct: 170 SKDFGNQGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKP--CKFDNSSVGATLIGYK 227

Query: 240 WVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            V S  +H +   +   GP+ V ++  H   + Y       ++  C+  +LDH V +VGY
Sbjct: 228 DVKSSNEHALKRAVATVGPVSVAIDAGHESFQFYSSGVY--DEPQCSTEQLDHGVLVVGY 285

Query: 296 GEKNG---ILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G  N       WIV+NSWG    D GY  + R   N CGI + A
Sbjct: 286 GAMNDNSHQAFWIVKNSWGPNWGDQGYIMMSRNKNNQCGIATSA 329


>gi|33333708|gb|AAQ11972.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 70/212 (33%), Positives = 110/212 (51%), Gaps = 13/212 (6%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC---DH 185
           ++DWR+     + PV+ Q  CGSCWAF+    +E Q      TL  LS  +LV+C   ++
Sbjct: 115 AVDWREEGA--VTPVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEY 172

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
           GN  C GG +  AF++V+  G++++  YPY  +     R + +K    V    T+V    
Sbjct: 173 GNNGCRGGLMGQAFDFVQDEGIQTEESYPYEGR-----RSSCKKSGDYVTKVKTYVFPLD 227

Query: 246 DHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHK-LDHAVAIVGYGEKNGIL 302
           +  M   +   GP+ V +    +  YD   +       N  + L+H V +VGYG +NG+ 
Sbjct: 228 EQEMARTVAAKGPVAVAIEASQLSFYDKGIVDETCRCSNKREDLNHGVLVVGYGSENGVD 287

Query: 303 TWIVRNSWGDIGPDHGYFQIERGANACGIESY 334
            WIV+NSWG    + GYF++++   ACGI  Y
Sbjct: 288 YWIVKNSWGADWGEKGYFRLKKDVKACGIGYY 319


>gi|297684914|ref|XP_002820054.1| PREDICTED: cathepsin L2 isoform 2 [Pongo abelii]
          Length = 334

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 116/230 (50%), Gaps = 24/230 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPKS+DWR  K   + PV++Q +CGSCWAF+ T  LE Q+      L  LS+  LV+C H
Sbjct: 114 LPKSVDWR--KKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSH 171

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF+YVK+  GL+S+  YPY   + I   C Y  E +     DT  T
Sbjct: 172 PQGNQGCNGGFMDKAFQYVKENGGLDSEESYPYVAMDEI---CKYRPENS--VANDTGFT 226

Query: 243 SGVDH-----MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
             +       M  +   GPI V ++  H   + Y        D  C+   LDH V +VGY
Sbjct: 227 VILPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPD--CSSKNLDHGVLVVGY 284

Query: 296 G----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           G      +    W+V+NSWG     +GY +I +   N CGI + A    V
Sbjct: 285 GFEGANSDNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPDV 334


>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
 gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
          Length = 364

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 98/328 (29%), Positives = 155/328 (47%), Gaps = 47/328 (14%)

Query: 32  LAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEI 91
           + Y   + +D ++ ++VK  + Y   +E + RF+ FK +              D + Q  
Sbjct: 25  INYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDN---------LGFIQDHNAQNN 75

Query: 92  LQRTGLR----LTGKE------KERLEADRERVKKFLNERKK------GPLPKSLDWRQS 135
               GL     +T KE        R +A R RV K  N   +        LP  +DWR  
Sbjct: 76  TYTLGLNKFADITNKEYRAMYLGTRTDAKR-RVMKTQNTGHRYAYNSGDQLPVHVDWRLK 134

Query: 136 KVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGN 194
               + P++ QG CGSCWAF+T A +E    ++      LS+ +LV+CD   +  CNGG 
Sbjct: 135 GA--VGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREYDEGCNGGL 192

Query: 195 IDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMH 250
           +D AF+++ Q G ++++ DYPY   + I   C   K+K KV   D +    ++  + +  
Sbjct: 193 MDYAFQFIIQNGGIDTEEDYPY---QGIDGTCDETKKKTKVVQIDGYEDVPSNNENALKK 249

Query: 251 LLQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRN 308
            +   P+ V +  + R ++ Y           C    LDH V +VGYG +NG+  W+VRN
Sbjct: 250 AVSHQPVSVAIEASGRALQLYQSGVFTGK---CGT-ALDHGVVVVGYGTENGVDYWLVRN 305

Query: 309 SWGDIGPDHGYFQIERGANA-----CGI 331
           SWG    + GYF++ER   +     CGI
Sbjct: 306 SWGTGWGEDGYFKMERNVRSTSEGKCGI 333


>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 77/223 (34%), Positives = 119/223 (53%), Gaps = 24/223 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P S+DWR  K   +  V+ QG CG+CW+F+ T  +E    ++   L  LS+ +L++CD 
Sbjct: 118 VPDSVDWR--KKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDK 175

Query: 186 G-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
             N  CNGG +D AFE+V K +G++++ DYPY+ ++     C  +K K KV   D++  +
Sbjct: 176 SYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGT---CKKDKLKQKVVTIDSY--A 230

Query: 244 GVDH-----MMHLLQSGPI--GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           GV       +M  + + P+  G+  + R  + Y           C+   LDHAV IVGYG
Sbjct: 231 GVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRGIFSG---PCST-SLDHAVLIVGYG 286

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA----NACGIESYA 335
            +NG+  WIV+NSWG      G+  ++R        CGI   A
Sbjct: 287 SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLA 329


>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/224 (37%), Positives = 116/224 (51%), Gaps = 18/224 (8%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G LPKS+DWR S +  ++ V+ QG CGSCWAF+TT  LE Q +     L  LS+ QLV+C
Sbjct: 112 GTLPKSVDWRNSHM--VSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDC 169

Query: 184 --DHGNLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEK-AKVFVQDT 239
             D GN  C GG +D AF+Y+    GL+++  YPY   ++    C ++        V   
Sbjct: 170 SKDFGNQGCGGGLMDQAFQYITANGGLDTEESYPYTATDDEP--CKFDNSSVGATLVGYK 227

Query: 240 WVTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            V SG +H +   +   GP+ V ++  H   + Y       ++  C+  +LDH V  VGY
Sbjct: 228 DVKSGNEHALKRAVATVGPVSVAIDAGHESFQFYSSGVY--DEPQCSTEQLDHGVLAVGY 285

Query: 296 GEKNG---ILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G  N       WIV+NSWG    D GY  + R   N CGI + A
Sbjct: 286 GAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSRNKNNQCGIATSA 329


>gi|33333698|gb|AAQ11967.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 70/212 (33%), Positives = 110/212 (51%), Gaps = 13/212 (6%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC---DH 185
           ++DWR+     + PV+ Q  CGSCWAF+    +E Q      TL  LS  +LV+C   ++
Sbjct: 115 AVDWREEGA--VTPVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEY 172

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
           GN  C GG +  AF++V+  G++++  YPY  +     R + +K    V    T+V    
Sbjct: 173 GNNGCRGGLMGQAFDFVQDEGIQTEESYPYEGR-----RSSCKKSGDYVTKVKTYVFPLD 227

Query: 246 DHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHK-LDHAVAIVGYGEKNGIL 302
           +  M   +   GP+ V +    +  YD   +       N  + L+H V +VGYG +NG+ 
Sbjct: 228 EQEMARTVAAKGPVAVAIEASQLSFYDKGIVDEKCRCSNKREDLNHGVLVVGYGSENGVD 287

Query: 303 TWIVRNSWGDIGPDHGYFQIERGANACGIESY 334
            WIV+NSWG    + GYF++++   ACGI  Y
Sbjct: 288 YWIVKNSWGADWGEKGYFRLKKDVKACGIGYY 319


>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
 gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 99/299 (33%), Positives = 141/299 (47%), Gaps = 36/299 (12%)

Query: 56  DDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR---TGLRLTGKEK 104
           D +E   RFE FK + K  DE+         G +  +D S +E   R   T +   G   
Sbjct: 68  DGSEKDKRFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMMM 127

Query: 105 ERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQ 164
            R +    R    + ++    LPKS+DWR     V   V+ QG CGSCWAF+T A +E  
Sbjct: 128 ARTKTRSNRYAPSVGDK----LPKSVDWRSQGAVV--QVKDQGSCGSCWAFSTIAAVEGI 181

Query: 165 VALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENIT 222
             ++   L  LS+ +LV+CD   N  C+GG ++ AFE++    G++S  DYPYR    + 
Sbjct: 182 NKIVTGELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRG---VD 238

Query: 223 FRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHRLIESYDGNPIRRN 277
            +C   K+ A+V   D +        + L   + + PI V +    R  + Y        
Sbjct: 239 GKCDQYKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGK 298

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG-----ANACGI 331
              C    LDH V  VGYG +NG+  WIVRNSWG    + GY ++ER      A  CGI
Sbjct: 299 ---CGT-ALDHGVTAVGYGTENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGI 353


>gi|444514070|gb|ELV10520.1| Cathepsin L1 [Tupaia chinensis]
          Length = 450

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 87/247 (35%), Positives = 126/247 (51%), Gaps = 30/247 (12%)

Query: 108 EADRERVKKFLNERKK------GPL----PKSLDWRQSKVKVLNPVESQGRCGSCWAFAT 157
           E  R+ +  F N+++K       PL    PKS+DWR+     + PV++QG+CGSCWAF+ 
Sbjct: 210 EEFRQVMNGFRNQKQKSGKVFHAPLLLQAPKSVDWREKGF--VTPVKNQGQCGSCWAFSA 267

Query: 158 TAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVK-QYGLESQADYP 214
           T  LE Q+      L  LS+  LV+C    GNL C GG +D AF+Y+K   GL+S+  YP
Sbjct: 268 TGALEGQMFRKTGKLISLSEQNLVDCSRRQGNLGCQGGLMDNAFQYIKDNGGLDSEESYP 327

Query: 215 YRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGN 272
           Y+  +     C Y+ E A     DT     +  M  +   GPI V ++  H   + Y   
Sbjct: 328 YKGMDGT---CQYKAEWA--VANDTGFEKAL--MKAVASVGPISVAIDAGHASFQFYKDG 380

Query: 273 PIRRNDWACNPHKLDHAVAIVGYG---EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NA 328
                D  C+   LDH V +VGYG     +    W+++NSWG+    +GY +I +   N 
Sbjct: 381 IYYEPD--CSSENLDHGVLVVGYGVEKRNSNDKYWLIKNSWGEQWGANGYVKIAKDRNNH 438

Query: 329 CGIESYA 335
           CG+ S A
Sbjct: 439 CGVASAA 445


>gi|156125004|gb|ABU50820.1| Aca s 1 allergen [Acarus siro]
          Length = 331

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 91/314 (28%), Positives = 152/314 (48%), Gaps = 29/314 (9%)

Query: 39  QVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEIL-----Q 93
           ++  F+ +   + + Y    E   R   F+   K   E     G +  +  +       +
Sbjct: 22  EITTFEQFKAVFGKVYATPEEESIRRANFEASLKWIQENDRKDGGAHLAVNQFADLGANE 81

Query: 94  RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
             G+ LT +   R EA  E V   ++   +G LP++ DWR      L P+E+QGRCG+CW
Sbjct: 82  SVGVNLTAR---RGEAFFEAVT--IHVTPEGNLPETFDWRSK----LGPIENQGRCGACW 132

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVEC----DHG---NLNCNGGNIDVAFEYVKQYG 206
           AFA+ A +E+  A+   T   LSK +LVEC    DH    N  C GG    A +YV+  G
Sbjct: 133 AFASLATVEAAFAIKYNTHIRLSKQELVECTRESDHTPYENSGCQGGYSWEALKYVQVTG 192

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFV---QDTWVTSGVDHMMHLLQS-GPIGVYL- 261
           +  +A YPY  K+N     ++ + + +  +       + +  + +M +L++ GP+ V + 
Sbjct: 193 VVEEAAYPYEAKDNQACYDSHLRSEKRYHINAFHRLQMAAPDESIMTVLKTHGPVAVDID 252

Query: 262 -NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
            +H   + Y    IR         +++H + IVG+G +NG+  W++RNSWG    + GY 
Sbjct: 253 ADHNGFKHYKSGVIRLTRGGTT--EVNHVINIVGWGRENGLDYWLIRNSWGTHWGEAGYG 310

Query: 321 QIERGANACGIESY 334
           ++ER  N  GI  +
Sbjct: 311 KVERHHNNMGINHF 324


>gi|33333696|gb|AAQ11966.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 70/212 (33%), Positives = 110/212 (51%), Gaps = 13/212 (6%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC---DH 185
           ++DWR+     + PV+ Q  CGSCWAF+    +E Q      TL  LS  +LV+C   ++
Sbjct: 115 AVDWREEGA--VTPVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEY 172

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
           GN  C GG +  AF++V+  G++++  YPY  +     R + +K    V    T+V    
Sbjct: 173 GNNGCRGGLMGQAFDFVQDEGIQTEESYPYEGR-----RSSCKKSGDYVTKVKTYVFPLD 227

Query: 246 DHMMH--LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHK-LDHAVAIVGYGEKNGIL 302
           +  M   +   GP+ V +    +  YD   +       N  + L+H V +VGYG +NG+ 
Sbjct: 228 EQEMARTVAAKGPVAVAIEASQLSFYDKGIVDETCRCSNKREDLNHGVLVVGYGSENGVD 287

Query: 303 TWIVRNSWGDIGPDHGYFQIERGANACGIESY 334
            WIV+NSWG    + GYF++++   ACGI  Y
Sbjct: 288 YWIVKNSWGADWGEKGYFRLKKDVKACGIGYY 319


>gi|33622213|ref|NP_891858.1| cathepsin [Cryptophlebia leucotreta granulovirus]
 gi|33569322|gb|AAQ21608.1| cathepsin [Cryptophlebia leucotreta granulovirus]
          Length = 332

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 88/320 (27%), Positives = 158/320 (49%), Gaps = 20/320 (6%)

Query: 24  SAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE------- 76
           SA      + Y+  +    F +++ ++N+TY  + E   +F+ FK + +  +E       
Sbjct: 11  SAFSFIESVIYNLEQSEKLFDSFVKQYNKTYLTEEERMIKFDNFKNNLRIINEKNRGSKH 70

Query: 77  -YYGTSGSSDRSPQEILQRT-GLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQ 134
             +  +  SD +  ++L+ T G +L  K+       +E     + E  +  LP++ DWR 
Sbjct: 71  AVFDINKYSDLNKNDLLRHTTGFKLGLKKNYSFTTVKECGVVEIKEEPQVLLPETFDWRD 130

Query: 135 SKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGN 194
                + PV++Q  CGSCWAF+T   +ES   +    +  LS+  L+ CD  N  CNGG 
Sbjct: 131 KHG--VTPVKNQLICGSCWAFSTIGNIESLYNIKYDKVIDLSEQHLINCDLVNNGCNGGL 188

Query: 195 IDVAFEYVKQYG--LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLL 252
           +  A E + Q G  + S+ + PY   +++  +  +E   +       ++    + +  LL
Sbjct: 189 MHWALENILQEGGGVVSEENDPYYGLDSVCKKTPWELNISGC---KRYILQNENKLKELL 245

Query: 253 Q-SGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG 311
             +GPI V ++   + +Y        D   N + L+HAV +VGYGE + +  WI++NSWG
Sbjct: 246 VVNGPISVAIDVSDVINYKSGIA---DICENNNGLNHAVLLVGYGEYDEVPYWILKNSWG 302

Query: 312 DIGPDHGYFQIERGANACGI 331
               + G+F+I+R  N+CG+
Sbjct: 303 IEWGEDGFFRIQRNKNSCGL 322


>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 370

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/255 (33%), Positives = 130/255 (50%), Gaps = 26/255 (10%)

Query: 88  PQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
           P+      G+R  G+    L +DR R       R    LP S+DWR+    V  P++ QG
Sbjct: 9   PRRRTTYFGVRGAGRRTPGLASDRYRY------RAGDALPDSVDWREKGAVV--PIKDQG 60

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQY 205
            CGSCWAF+T A +E    ++   L  LS+ +LV+CD   N  CNGG +D AF+++    
Sbjct: 61  GCGSCWAFSTIASVEGINKIVTGDLISLSEQELVDCDKTYNDGCNGGLMDYAFQFIIDNG 120

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGVYLN 262
           G++++ DYPY  ++    RC   ++ AKV   +++    V+    L     S PI V ++
Sbjct: 121 GIDTEKDYPYTEQDG---RCDSYRKNAKVVSINSYEDVPVNDEQALKKAAASQPIAVAID 177

Query: 263 H--RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
              R  + Y+          C    LDH V +VGYG ++G   WIVRNSWG+   + GY 
Sbjct: 178 GGGRSFQLYNSGIFTGK---CG-TSLDHGVTVVGYGSESGKDYWIVRNSWGESWGEKGYI 233

Query: 321 QIERGANA----CGI 331
           ++ R  ++    CGI
Sbjct: 234 RMARNIDSPSGICGI 248


>gi|403223167|dbj|BAM41298.1| cysteine protease precursor TacP [Theileria orientalis strain
           Shintoku]
          Length = 417

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 98/309 (31%), Positives = 146/309 (47%), Gaps = 43/309 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGT-------SGSSDRSPQEILQR- 94
           F+++  K++R Y  D + + RF  F+    E  E  GT       +  SD S +E  Q  
Sbjct: 122 FESFNAKYHRVYKYDKDRRERFVNFRDSYLEVKEQRGTEMYTKGINRFSDLSEKEFYQMF 181

Query: 95  ------TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
                 T  +L     + L+  +      LN        + LDWR  K K ++ V+ QG 
Sbjct: 182 KPVKVPTFEKLPSSSTDDLDLSK------LN-------GEDLDWR--KAKTVSQVKDQGD 226

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLE 208
           CG CWAFAT   +ES     K  +Y LS+ +L++CD  +  CNGG  + A  YV++YGL 
Sbjct: 227 CGGCWAFATVGSVESFYLTHKDVVYSLSEQELLDCDPNSFGCNGGFPESALNYVRRYGLA 286

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLN-HRLIE 267
           S  D P+   +    +C+    K KV + D +V  G + M   L   P   ++       
Sbjct: 287 SANDLPFVGHDK---KCSVPDVK-KVKISDYYVFKGKNIMNKSLVVTPTVTFMGVSPEFT 342

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIV--GYGEKNGILTWIVRNSWGDIGPDHGYFQIER- 324
            Y G      +  C   KL+HAV +V  GY EK     W+V+NSWG    + GYF+++R 
Sbjct: 343 KYQGGVY---NGVC-ADKLNHAVLLVGEGYDEKLKTRYWVVKNSWGTDWGEDGYFRLQRT 398

Query: 325 --GANACGI 331
             G++ CGI
Sbjct: 399 DEGSDMCGI 407


>gi|18858809|ref|NP_571273.1| cathepsin L, 1 b precursor [Danio rerio]
 gi|1752664|emb|CAA69623.1| cathepsin L [Danio rerio]
          Length = 336

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/240 (34%), Positives = 126/240 (52%), Gaps = 30/240 (12%)

Query: 119 NERKKGPL---------PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLK 169
           N+  +GPL         P+ +DWRQ     + PV+ Q +CGSCW+F++T  LE Q+    
Sbjct: 99  NQTSQGPLFMEPSFFAAPQQVDWRQRGY--VTPVKDQKQCGSCWSFSSTGALEGQLFRKT 156

Query: 170 KTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCT 226
             L  +S+  LV+C    GN  CNGG +D AF+YVK+  GL+S+  YPY  ++++  R  
Sbjct: 157 GKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYD 216

Query: 227 YEKEKAKV--FVQDTWVTSG--VDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWA 280
                AK+  FV    + SG  +  M  +   GP+ V ++  H+ ++ Y          A
Sbjct: 217 PRFNVAKITGFVD---IPSGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYER--A 271

Query: 281 CNPHKLDHAVAIVGYGEKNGILT----WIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           C+  +LDHAV +VGYG +   +     WIV+NSW D   D GY  + +   N CG+ + A
Sbjct: 272 CSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKA 331


>gi|94420703|gb|ABF18679.1| cysteine protease [Medicago sativa]
          Length = 350

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 80/218 (36%), Positives = 111/218 (50%), Gaps = 18/218 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP   DWR  K  +++ V+ QG CGSCW F+TT  LES  A        LS+ QLV+C  
Sbjct: 133 LPAEKDWR--KEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAG 190

Query: 186 G--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-V 241
              N  CNGG    AFEY+K   GLE++  YPY  +      C +  E   V V  +  +
Sbjct: 191 AFNNFGCNGGLPSQAFEYIKYNGGLETEEAYPYTGQNG---PCKFTSEDVAVQVLGSVNI 247

Query: 242 TSGV-DHMMHLLQ-SGPIGVYL----NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
           T G  D + H +  + P+ V      + RL   Y             P  ++HAV  VGY
Sbjct: 248 TLGAEDELKHAVAFARPVSVAFEVVDDFRL---YKKGVYTSTTCGNTPMDVNHAVLAVGY 304

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIES 333
           G ++G+  W+++NSWG    DHGYF++E G N CG+ +
Sbjct: 305 GIEDGVPYWLIKNSWGGEWGDHGYFKMEMGKNMCGVAT 342


>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
          Length = 337

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 79/220 (35%), Positives = 117/220 (53%), Gaps = 17/220 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP  +DWR      + PV+ QG+CGSCW+F+ T  LE Q       L  LS+  LV+C  
Sbjct: 120 LPGQIDWRDKGA--VTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSE 177

Query: 186 --GNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYE-KEKAKVFVQDTWV 241
             GN  CNGG +D AF Y+K   G++++  YPY+ ++    +C Y+ K K         +
Sbjct: 178 KFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDE---KCHYKPKNKGATDRGYVDI 234

Query: 242 TSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG- 296
            SG +  +   +   GP+ V ++  H+  + Y G      +  C+P +LDH V +VGYG 
Sbjct: 235 ESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPE--CSPSQLDHGVLVVGYGT 292

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           E +G   W+V+NSWG    D GY ++ R   N CGI + A
Sbjct: 293 EDDGTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEA 332


>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
          Length = 324

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 82/223 (36%), Positives = 118/223 (52%), Gaps = 28/223 (12%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LP S+DWR      + PV+ QG+CGSCW F+ T  LE Q       L  LS+ QLV+C  
Sbjct: 108 LPSSVDWRNQGY--VTPVKDQGQCGSCWTFSATGSLEGQHFAKTGNLLSLSEQQLVDCAG 165

Query: 185 -HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
            +GN  CNGG ++ A++Y+K   G+E ++ YPY  ++    RC +++ K  V     +V 
Sbjct: 166 RYGNYGCNGGLMESAYDYIKGVGGVELESAYPYTARDG---RCKFDRSKV-VATCKGYVV 221

Query: 243 SGVDHMMHLLQS----GPIGVYLN-----HRLIES--YDGNPIRRNDWACNPHKLDHAVA 291
             V     L+Q+    GP+ V ++      +L ES  YD    RR    C+   LDH V 
Sbjct: 222 IPVGDEQALMQAVGTIGPVAVSIDASGYSFQLYESGVYD---FRR----CSSTNLDHGVL 274

Query: 292 IVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIES 333
            VGYG + G   W+V+NSWG    D GY ++ +   N CGI +
Sbjct: 275 AVGYGTEGGQNYWLVKNSWGPGWGDQGYIKMSKDKNNQCGIAT 317


>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
          Length = 330

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/252 (34%), Positives = 127/252 (50%), Gaps = 21/252 (8%)

Query: 99  LTGKEKERL----EADRERVKKFLNERKKGP---LPKSLDWRQSKVKVLNPVESQGRCGS 151
           LT  E  RL      D  +  K      + P   +P   DWRQ     +  V++QG+CGS
Sbjct: 80  LTNAEFNRLFKGLAFDYSKHAKIHTAAPEAPATGIPSEFDWRQKGA--VTHVKNQGQCGS 137

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYV-KQYGLE 208
           CW+F+TT   E    L    L  LS+  L++C   +GN  CNGG +D AFEY+    G++
Sbjct: 138 CWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNGCNGGLMDYAFEYIINNRGID 197

Query: 209 SQADYPYRNKENITFRCTYEK-EKAKVFVQDTWVTSGVDH-MMHLLQSGPIGVYLN--HR 264
           ++A YPY+    +T  C Y    K       T VTSG ++ +++     P+ V ++  H 
Sbjct: 198 TEASYPYQTAGPLT--CQYNAANKGGSLTGYTDVTSGDENALLNAAVKEPVSVAIDASHN 255

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
             + Y G     +  AC+  +LDH V +VG+G +NG   W V+NSWG     +GY ++ R
Sbjct: 256 SFQFYSGGVYYES--ACSSTQLDHGVLVVGWGSENGQDFWWVKNSWGASWGLNGYIKMSR 313

Query: 325 GA-NACGIESYA 335
              N CGI + A
Sbjct: 314 NQNNNCGIATAA 325


>gi|348525618|ref|XP_003450319.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
          Length = 330

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 87/265 (32%), Positives = 134/265 (50%), Gaps = 22/265 (8%)

Query: 79  GTSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKV 137
           G +   D + +E++++ TGL++          ++ER      +     +PKS+D+R  K 
Sbjct: 75  GMNHLGDMTSEEVVEKMTGLQIP--------MNQERSFTLAMDDMPSKIPKSVDYR--KK 124

Query: 138 KVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNI 195
            ++  V++QG CGSCWAF+    LE Q+A     L  LS   LV+C   +GN  CNGG +
Sbjct: 125 GMVTSVKNQGACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFM 184

Query: 196 DVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKE-KAKVFVQDTWVTSGVDHMMH--L 251
             AF+YV   +G++S A YPY  ++    +C Y    +A       ++  G ++ +   L
Sbjct: 185 TRAFQYVIDNHGIDSDASYPYTGRDE---QCRYNPATRAANCSSYQFLPEGDENALKQAL 241

Query: 252 LQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG 311
              GPI V ++ R            ND +C   +++H V  VGYG  NG   W+V+NSWG
Sbjct: 242 ATIGPISVAIDARRPRFSFYRSGVYNDPSCT-QEVNHGVLAVGYGSLNGQDYWLVKNSWG 300

Query: 312 DIGPDHGYFQIERG-ANACGIESYA 335
               D GY ++ R   N CGI  YA
Sbjct: 301 STFGDQGYIRMARNTGNQCGIALYA 325


>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
          Length = 335

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/225 (38%), Positives = 117/225 (52%), Gaps = 25/225 (11%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-- 184
           PKS+DWR  K   + PV+ QG+CGSCWAF+TT  LE Q       L  LS+  LV+C   
Sbjct: 115 PKSVDWR--KKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKTSKLISLSEQNLVDCSRA 172

Query: 185 HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKV----FVQDT 239
            GN  CNGG +D AF+YVK   G++S+  YPY  K++    C Y+          FV   
Sbjct: 173 QGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQ--ECHYDPNNNSANDTGFVD-- 228

Query: 240 WVTSGV--DHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            V SG   D M  +   GP+ V ++  H+  + Y        +  C+   LDH V +VGY
Sbjct: 229 -VQSGCEKDLMKAVASVGPVSVAIDAGHQSFQFYQSGIYYEPE--CSSEDLDHGVLVVGY 285

Query: 296 G----EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
           G    + +G   WIV+NSW +   D+GY  I +   N CGI + A
Sbjct: 286 GFESEDVDGKKYWIVKNSWSEKWGDNGYINIAKDRHNHCGIATAA 330


>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
 gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
          Length = 330

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 118/221 (53%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +P ++DWR  K   + PV++QG+CGSCWAF+TT  LE Q       L  LS+  LV+C  
Sbjct: 114 VPDTVDWR--KEGYVTPVKNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCST 171

Query: 185 -HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
            +GN  C GG +D AF+Y+K+  G++++  YPY  + +   RC +  +K+ +   DT   
Sbjct: 172 AYGNNGCQGGLMDYAFKYIKENGGIDTEESYPYEARND---RCRF--QKSNIGAVDTGFV 226

Query: 241 -VTSGVDHMMHLLQS--GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            VT G +  +       GPI V ++  H   + Y       N+  C+   LDH V +VGY
Sbjct: 227 DVTHGDEEALKTAAGTVGPISVAIDAGHMSFQFYHSGVY--NNAGCSSTSLDHGVLVVGY 284

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G   G   W+V+NSWG+     GY  + R   N CG+ + A
Sbjct: 285 GTYQGSDYWLVKNSWGERWGMEGYIMMSRNKNNQCGVATQA 325


>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
 gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
          Length = 337

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 89/318 (27%), Positives = 157/318 (49%), Gaps = 36/318 (11%)

Query: 24  SAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD---------GKET 74
           S++   R+L+  ++  V+  + ++V++ R Y D  E   RFE FK +          K+ 
Sbjct: 19  SSVLAARELSDAAM--VERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKN 76

Query: 75  DEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNER-KKGPLPKSLDWR 133
             + G +  +D + +E     G + T        A++     F  E      LP ++DWR
Sbjct: 77  KFWLGVNQFADLTTEEFKANKGFKPT--------AEKVPTTGFKYENLSVSALPTAVDWR 128

Query: 134 QSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLN--CN 191
                 + P+++QG+CG CWAF+  A +E  V L    L  LS+ +LV+CD  +++  C 
Sbjct: 129 TKGA--VTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCE 186

Query: 192 GGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCT-YEKEKAKVFVQDTWVTSGVDHMM 249
           GG +D AFE+V K  GL ++++YPY+  +    +C    K  A +   +    +    +M
Sbjct: 187 GGWMDSAFEFVIKNGGLATESNYPYKAVDG---KCKGGSKSAATIKGHEDVPVNNEAALM 243

Query: 250 HLLQSGPIGVYLNH--RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIV 306
             + + P+ V ++   R    Y G  +     +C   +LDH +A +GYG E +G   WI+
Sbjct: 244 KAVANQPVSVAVDASDRTFMLYSGGVMTG---SCGT-ELDHGIAAIGYGMESDGTKYWIL 299

Query: 307 RNSWGDIGPDHGYFQIER 324
           +NSWG    + G+ ++E+
Sbjct: 300 KNSWGTTWGEKGFLRMEK 317


>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
 gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
 gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
 gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
 gi|1096153|prf||2111244A Cys protease
          Length = 380

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/325 (28%), Positives = 153/325 (47%), Gaps = 39/325 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           FK ++  + R+Y+ + E   R   F Q+     E+     ++     +        LT  
Sbjct: 54  FKVFMENYGRSYSTEEEYLRRLGIFAQNMVRAAEHQALDPTAVHGVTQFSD-----LTED 108

Query: 103 EKERLEADRERVKKFLNERKKG--------PLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
           E E+L           N    G         LP++ DWR+     +  V+ QGRCGSCWA
Sbjct: 109 EFEKLYTGVNGGFPSSNNAAGGIAPPLEVDGLPENFDWREKGA--VTEVKLQGRCGSCWA 166

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDHG---------NLNCNGGNIDVAFEYVKQY 205
           F+TT  +E    L    L  LS+ QL++CD+          +  CNGG +  A+ Y+ + 
Sbjct: 167 FSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLES 226

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLN 262
            GLE ++ YPY  +      C ++ EK  V + + T + +  + +  +L+++GP+ + +N
Sbjct: 227 GGLEEESSYPYTGERG---ECKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVN 283

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILT------WIVRNSWGDIGP 315
              +++Y G         C+  +L+H V +VGYG K   IL       WI++NSWG+   
Sbjct: 284 AIFMQTYIGG--VSCPLICSKKRLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGEKWG 341

Query: 316 DHGYFQIERGANACGIESYAYLASV 340
           + GY+++ RG   CGI +    A V
Sbjct: 342 EDGYYKLCRGHGMCGINTMVSAAMV 366


>gi|332373716|gb|AEE61999.1| unknown [Dendroctonus ponderosae]
          Length = 346

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 91/308 (29%), Positives = 147/308 (47%), Gaps = 27/308 (8%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY----------YGTSGSSDRSPQE 90
           + F  Y+  +N++Y  + E + RF  FK+     ++           YG +  SD + +E
Sbjct: 39  EQFHEYLSDFNKSYPQEAEFQFRFAAFKKSLANIEQLNANKTKSSAQYGLTKFSDFTAEE 98

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKF-LNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
            L     R  G  ++   A + R+KK  L    +  LP+ +DWR   V  ++ V++Q  C
Sbjct: 99  FLDLQNNR-AGVRRDLRGAAQSRLKKVALRSAYEKELPQIVDWRNKNV--VSKVKNQKNC 155

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK--QYGL 207
           G+CWAFA +  +ES  A+  + L  LS  QL++C   N  C GG+      ++K     +
Sbjct: 156 GACWAFAVSETIESMQAIKTQQLTDLSIQQLIDCSSYNNGCKGGDTCALLRWIKVNNIAI 215

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV---DHMMHLLQ-SGPIGVYLNH 263
            ++ DYP   ++    +C        V V      S V   D ++ LL  +GP+ V ++ 
Sbjct: 216 MNETDYPLVLEDQ---KCQKTDMSEGVKVGTYQCNSFVGREDIILKLLAINGPVAVAISG 272

Query: 264 RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
              ++Y G  I+   + C    L HAV IVGY     +  +IVRNSWG+   D+GY  + 
Sbjct: 273 ETWQNYVGGVIQ---FHCEGD-LSHAVQIVGYNLTAKVPFYIVRNSWGEDFGDNGYLYVA 328

Query: 324 RGANACGI 331
            G N CG+
Sbjct: 329 IGGNVCGL 336


>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
           sativus]
          Length = 317

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 76/221 (34%), Positives = 119/221 (53%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LP ++DWRQ     + P+++QG+CGSCWAF+  A +E    +    L  LS+ +LV+CD 
Sbjct: 100 LPTNVDWRQEGA--VTPIKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDV 157

Query: 185 -HGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
             GN  CNGG +  AFE++K+ GL ++ +YPY+  E+    C  +KEK +      +   
Sbjct: 158 TSGNQGCNGGYMYKAFEFIKRTGLTTEIEYPYQGAESA---CNEQKEKYQFVSISGYEKV 214

Query: 244 GVDHMMHL---LQSGPIGVYLNHRL--IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
            V+    L   + + P+ V ++      + Y G     N   C  ++L+H VAIVGYGE 
Sbjct: 215 PVNDEKSLKAAVANQPVSVAIDAEGNNFQFYSGGIFSGN---CG-NQLNHGVAIVGYGET 270

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGAN----ACGIESYA 335
           +    W+V+NSWG    + GY +++R +      CGI   A
Sbjct: 271 SNQAYWLVKNSWGTDWGESGYIRMKRDSTDRQGTCGIAMMA 311


>gi|297684916|ref|XP_002820055.1| PREDICTED: cathepsin L2 isoform 3 [Pongo abelii]
          Length = 345

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 116/230 (50%), Gaps = 24/230 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPKS+DWR  K   + PV++Q +CGSCWAF+ T  LE Q+      L  LS+  LV+C H
Sbjct: 125 LPKSVDWR--KKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSH 182

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF+YVK+  GL+S+  YPY   + I   C Y  E +     DT  T
Sbjct: 183 PQGNQGCNGGFMDKAFQYVKENGGLDSEESYPYVAMDEI---CKYRPENS--VANDTGFT 237

Query: 243 SGVDH-----MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
             +       M  +   GPI V ++  H   + Y        D  C+   LDH V +VGY
Sbjct: 238 VILPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPD--CSSKNLDHGVLVVGY 295

Query: 296 G----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           G      +    W+V+NSWG     +GY +I +   N CGI + A    V
Sbjct: 296 GFEGANSDNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPDV 345


>gi|28974202|gb|AAO61485.1| cathepsin H [Sterkiella histriomuscorum]
          Length = 366

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 116/218 (53%), Gaps = 17/218 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           +P   DWR     V++PV++QG+CGSCW F+T   +ES   L       LS+ QLV+C  
Sbjct: 135 IPTEWDWR--TFGVVSPVKNQGKCGSCWTFSTVGCVESHYLLKYGAFRNLSEQQLVDCAG 192

Query: 184 DHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
           D+ N  C+GG    AFEY+K   GL  +  YPY+       +C+ +K +  V ++   V 
Sbjct: 193 DYDNHGCSGGLPSHAFEYIKDNGGLALETTYPYKAANG---QCSIQKGQQSVGIRGGAVN 249

Query: 243 SGV---DHMMHLLQSGPIGVYLNHRLIES---YDGNPIRRNDWACNPHKLDHAVAIVGYG 296
             +   D    +   GP+ V    R+I+    Y          A  P+ ++HAV  VG+G
Sbjct: 250 ISLNEDDLKQAIYLHGPVSVAF--RVIDGFRDYKSGVYAVEGCANGPNDVNHAVLAVGFG 307

Query: 297 -EKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIES 333
            ++N +  WI++NSWG    D G+F+++RG N CGI++
Sbjct: 308 TDENKVDYWIIKNSWGAAWGDQGFFKMKRGVNMCGIQN 345


>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
 gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
          Length = 330

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 77/221 (34%), Positives = 117/221 (52%), Gaps = 16/221 (7%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HG 186
           ++DWR   +  +  ++ QG+CGSCWAF+TT  LE Q A    TL  LS+  LV+C    G
Sbjct: 117 TVDWRDKGL--VTDIKDQGQCGSCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQEG 174

Query: 187 NLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG 244
           N  C GG++D  F+Y+ Q  G++++  YPY+ K +   RC ++       +   T VTSG
Sbjct: 175 NKGCEGGDMDQGFQYIIQNKGIDTEQCYPYKAKNH---RCKFDNSCIGATMSSFTDVTSG 231

Query: 245 VDHMMH--LLQSGPI--GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
            +  +       GPI  G+  +H+  + Y       N++ C+  KLDH V +VGYG    
Sbjct: 232 DEDALKQACANIGPISVGIDASHQSFQFYSSGVY--NEFECSSTKLDHGVLVVGYGTYGS 289

Query: 301 ILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
              W+V+NSWG +  + GY  + R   N CG+ + A    V
Sbjct: 290 KDYWLVKNSWGTVWGNEGYIMMSRNKDNQCGVATDASFPVV 330


>gi|473159|emb|CAA83538.1| cathepsin L [Schistosoma mansoni]
          Length = 317

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 96/318 (30%), Positives = 149/318 (46%), Gaps = 31/318 (9%)

Query: 36  SIKQVDAFKTYIVKWNRTYTDDNEIKTR------FEYFKQDGKETD---EYY--GTSGSS 84
           S++  D +K + +K+N+TY+D NEI+ +       E  +Q     D   E Y  G +   
Sbjct: 8   SLQYDDIWKQWKLKYNKTYSDSNEIRRKAIFMRYVEKIQQHNLRHDLGLEGYTMGLNQFC 67

Query: 85  DRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVE 144
           D   +EI      ++ G     L  D++   +  N+    PLP   DWR      + PV+
Sbjct: 68  DMDWEEIKTIMLSKVFGNSP--LWDDKKEELELSND----PLPSKWDWRDH--GAVTPVK 119

Query: 145 SQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYV 202
           +QG CGSCWAF+    +E Q+    K L  LS+ QLV+C   +GN  C GG +D +F Y+
Sbjct: 120 NQGLCGSCWAFSAAGAVEGQLVKKHKKLISLSEQQLVDCSYKYGNDGCQGGTMDQSFAYL 179

Query: 203 KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLN 262
           ++Y +ES+ DY Y   ++       +         D            L   GPI V   
Sbjct: 180 EKYPIESEKDYKYIGHDSSCHFRKSKGVVKVKKFVDLPARDEEKLQKALYHYGPISV--- 236

Query: 263 HRLIESYDGNPIRRNDW----ACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHG 318
              I++ D   + ++       C+   L+H V  VGYG +N    W+++NSWG     +G
Sbjct: 237 --AIDALDDLILYKSGIYESKQCSSFLLNHGVLAVGYGRENRKDYWLIKNSWGTTWGMNG 294

Query: 319 YFQIERGA-NACGIESYA 335
           YF++ R   N CGI + A
Sbjct: 295 YFKLRRNKHNMCGIATNA 312


>gi|253796148|gb|ACT35690.1| cathepsin L-like cysteine proteinase [Ditylenchus destructor]
          Length = 376

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 78/222 (35%), Positives = 124/222 (55%), Gaps = 21/222 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +P+S+DWR      +  V++QG CGSCWAF+ T  LE Q    K TL  LS+  LV+C  
Sbjct: 159 VPESMDWRDHGY--VTEVKNQGMCGSCWAFSATGSLEGQHKRSKGTLVSLSEQNLVDCSA 216

Query: 185 -HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
            +GN  CNGG +D AF+Y+K+ +G++++  YPY+ ++    +C +  +++ V   DT   
Sbjct: 217 AYGNNGCNGGLMDFAFQYIKENHGIDTETSYPYKARQK---KCHF--QRSSVGADDTGFM 271

Query: 241 -VTSGVDHMMHLLQS--GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            +  G +  + +  +  GPI V ++  HR  + Y        +  C+  +LDH V +VGY
Sbjct: 272 DLPEGDEDQLKIAVATQGPISVAIDAGHRSFQLYKTGVYYEKE--CSSEQLDHGVLVVGY 329

Query: 296 G-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G + +    WIV+NSWG    + GY ++ R   N CGI + A
Sbjct: 330 GTDPDHGDYWIVKNSWGTTWGEQGYVRMARNKNNHCGIATKA 371


>gi|24654434|ref|NP_725686.1| CG4847, isoform D [Drosophila melanogaster]
 gi|21645235|gb|AAM70880.1| CG4847, isoform D [Drosophila melanogaster]
 gi|255653098|gb|ACU24747.1| RH39096p [Drosophila melanogaster]
          Length = 420

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/259 (33%), Positives = 131/259 (50%), Gaps = 23/259 (8%)

Query: 84  SDRSPQEIL-QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +D +  E L Q TGL+ + + K R  A      K +N   K P+P + DWR+     + P
Sbjct: 165 ADLTHSEFLSQLTGLKRSPEAKARAAASL----KLVNLPAK-PIPDAFDWREHGG--VTP 217

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC----DHGNLNCNGGNIDVA 198
           V+ QG CGSCWAFATT  +E        +L  LS+  LV+C    D G   C+GG  + A
Sbjct: 218 VKFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQEAA 277

Query: 199 FEYVK--QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD--HMMHLLQS 254
           F ++   Q G+  +  YPY + +     C Y+  K+   +Q        D   +  ++ +
Sbjct: 278 FCFIDEVQKGVSQEGAYPYIDNKGT---CKYDGSKSGATLQGFAAIPPKDEEQLKKVVAT 334

Query: 255 -GPIGVYLNH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGD 312
            GP+   +N    +++Y G     ND  CN  + +H++ +VGYG + G   WIV+NSW D
Sbjct: 335 LGPVACSVNGLETLKNYAGG--IYNDDECNKGEPNHSILVVGYGSEKGQDYWIVKNSWDD 392

Query: 313 IGPDHGYFQIERGANACGI 331
              + GYF++ RG N C I
Sbjct: 393 TWGEKGYFRLPRGKNYCFI 411


>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 517

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 100/316 (31%), Positives = 152/316 (48%), Gaps = 28/316 (8%)

Query: 40  VDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL-R 98
           ++ F+ +  +  + Y   ++ K RFE FK++ K    Y     S   SP    Q  GL R
Sbjct: 47  IELFQRWKEENKKIYRSPDQEKLRFENFKRNLK----YIAEKNSKRISPYG--QSLGLNR 100

Query: 99  LTGKEKERLEAD-RERVKKFLNERKKGP--------LPKSLDWRQSKVKVLNPVESQGRC 149
                 E  ++    +VKK  ++R             P SLDWR  K  V+  V+ QG C
Sbjct: 101 FADMSNEEFKSKFTSKVKKPFSKRNGLSGKDHSCEDAPYSLDWR--KKGVVTAVKDQGYC 158

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLE 208
           G CWAF++T  +E   A++   L  LS+ +LV+CD  N  C+GG++D AFE+V    G++
Sbjct: 159 GCCWAFSSTGAIEGINAIVSGDLISLSEPELVDCDRTNDGCDGGHMDYAFEWVMHNGGID 218

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTW--VTSGVDHMMHLLQSGPI--GVYLNHR 264
           ++ +YPY   +     C   KE+ KV   D +  V      ++      PI  G+  +  
Sbjct: 219 TETNYPYSGADGT---CNVAKEETKVIGIDGYYNVEQSDRSLLCATVKQPISAGIDGSSW 275

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER 324
             + Y G  I   D + +P  +DHA+ +VGYG +     WIV+NSWG      GY  I R
Sbjct: 276 DFQLYIGG-IYDGDCSSDPDDIDHAILVVGYGSEGDEDYWIVKNSWGTSWGMEGYIYIRR 334

Query: 325 GANA-CGIESYAYLAS 339
             N   G+ +  Y+AS
Sbjct: 335 NTNLKYGVCAINYMAS 350


>gi|356565778|ref|XP_003551114.1| PREDICTED: thiol protease aleurain-like [Glycine max]
          Length = 353

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 76/217 (35%), Positives = 112/217 (51%), Gaps = 12/217 (5%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP   DWR  K  +++ V+ QG CGSCW F+TT  LE+  A        LS+ QLV+C  
Sbjct: 136 LPDEKDWR--KEGIVSQVKDQGNCGSCWTFSTTGALEAAYAQAFGKNISLSEQQLVDCAG 193

Query: 186 G--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-V 241
              N  CNGG    AFEY+K   GL+++  YPY  K+ +   C +  +   V V D+  +
Sbjct: 194 AFNNFGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGV---CKFTAKNVAVRVIDSINI 250

Query: 242 TSGVDHMMH--LLQSGPIGVYLN-HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
           T G +  +   +    P+ V     +    Y+            P  ++HAV  VGYG +
Sbjct: 251 TLGAEDELKQAVAFVRPVSVAFEVAKDFRFYNNGVYTSTICGSTPMDVNHAVLAVGYGVE 310

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           +G+  WI++NSWG    D+GYF++E G N CG+ + A
Sbjct: 311 DGVPYWIIKNSWGSNWGDNGYFKMELGKNMCGVATCA 347


>gi|118145|sp|P20721.1|CYSPL_SOLLC RecName: Full=Low-temperature-induced cysteine proteinase; Flags:
           Precursor
 gi|806314|gb|AAA66308.1| thiol protease, partial [Solanum lycopersicum]
          Length = 346

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 77/217 (35%), Positives = 120/217 (55%), Gaps = 20/217 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP+S+DWR+  V V   V+ QG CGSCWAF+  A +ES  A++   L  LS+ +LV+CD 
Sbjct: 18  LPESIDWREKGVLV--GVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDR 75

Query: 186 G-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
             N  C+GG +D AFE+V K  G++++ DYPY+ +  +   C   ++ AKV   D++   
Sbjct: 76  SYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGV---CDQYRKNAKVVKIDSYEDV 132

Query: 244 GVDH---MMHLLQSGPIGVYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
            V++   +   +   P+ + L    R  + Y           C    +DH V I GYG +
Sbjct: 133 PVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGK---CGT-AVDHGVVIAGYGTE 188

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGANA----CGI 331
           NG+  WIVRNSWG    ++GY +++R  ++    CG+
Sbjct: 189 NGMDYWIVRNSWGANCRENGYLRVQRNVSSSSGLCGL 225


>gi|118366977|ref|XP_001016704.1| Cysteine proteinase 3 precursor, putative [Tetrahymena thermophila]
 gi|89298471|gb|EAR96459.1| Cysteine proteinase 3 precursor, putative [Tetrahymena thermophila
           SB210]
          Length = 343

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 79/217 (36%), Positives = 116/217 (53%), Gaps = 21/217 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVAL-LKKTLYPLSKSQLVEC- 183
           +P+ +DWR   +  + PV++QG+CGSCW F+TT  LES  AL        LS+ QL++C 
Sbjct: 122 IPEFVDWRTKGI--VTPVKNQGQCGSCWTFSTTGALESHWALHTGNAPLLLSEQQLIDCA 179

Query: 184 -DHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWV 241
            D  N  C+GG    AFEY+    GL+++ DYPY   +N    C +++  A   V  ++ 
Sbjct: 180 GDFNNFGCSGGLPSQAFEYISYAGGLDTEGDYPYEATDN---ECEFKRSHAAAKVVRSFN 236

Query: 242 TSGVDH---MMHLLQSGPIGV-YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE 297
            +  D    + HL  +GPI + Y        YDG        + +P  ++HAV  VGY  
Sbjct: 237 ITFQDEDELIYHLATAGPISIAYQVTDDFFKYDGGIYSNPYCSTSPDMVNHAVLAVGYN- 295

Query: 298 KNGILT---WIVRNSWGDIGPDHGYFQIERGANACGI 331
               LT   +IV+NSWG+   + GYF IE G+N CG+
Sbjct: 296 ----LTGRYYIVKNSWGEHWGNEGYFNIELGSNMCGL 328


>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
 gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
          Length = 343

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 81/241 (33%), Positives = 122/241 (50%), Gaps = 32/241 (13%)

Query: 104 KERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILES 163
           ++RL A+ E V           LP SLDWRQ     + P++ QG CGSCWAF+  A +ES
Sbjct: 112 QDRLPAEDEDVDV-------SSLPTSLDWRQKGA--VTPIKDQGDCGSCWAFSAIASIES 162

Query: 164 QVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEY-VKQYGLESQADYPY------- 215
              L  K L  LS+ QL++CD  +  C+GG ++ AF++ VK  G+ ++A YPY       
Sbjct: 163 AHFLATKELVSLSEQQLMDCDTVDAGCDGGLMETAFKFVVKNGGVTTEASYPYTGSVGSC 222

Query: 216 -RNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYL--NHRLIESYDGN 272
             NK  I  +   E    KV  +D+      D +M  +   P+ V +  +    ++Y   
Sbjct: 223 NANKVAIINKVA-EITGFKVVTEDS-----ADALMKAVSKTPVTVSICGSDENFQNYKSG 276

Query: 273 PIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER--GANACG 330
            +      C    LDH V ++GYG + G+  WI++NSWG    + G+ +IER  G   CG
Sbjct: 277 ILSGQ---CG-DSLDHGVLLIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIERKDGDGICG 332

Query: 331 I 331
           +
Sbjct: 333 M 333


>gi|123480189|ref|XP_001323249.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121906110|gb|EAY11026.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 315

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 82/226 (36%), Positives = 121/226 (53%), Gaps = 15/226 (6%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
           K  +P ++DWRQS +  +NP+++QG CGSCWAF+T    E   A     LY LS+  LV+
Sbjct: 97  KDDVPDTVDWRQSGL--VNPIKNQGNCGSCWAFSTIQAQEGVYAKNHGNLYSLSEQNLVD 154

Query: 183 CDHGNLNCNGGNIDVAFEYV--KQYGLES-QADYPYRNKENITFRCTYEKEKAKVFVQDT 239
           C      CNGG +  A++YV   Q GL + + DYPY  K+  T +    K  AKV   D 
Sbjct: 155 CVTSCSGCNGGLMHEAYQYVIANQQGLFNLEVDYPYTAKDG-TCKFDVSKGYAKV-TGDF 212

Query: 240 WVTSGVDHMMHLLQS--GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            VT G ++ + +  +  GPI + ++  H   + Y       + W C+   LDHAV ++GY
Sbjct: 213 QVTQGDENALKVASATYGPIAIAIDASHFTFQLYHSGIY--DPWFCSSSNLDHAVGLIGY 270

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           G       W+VRNSWG    + GY ++ R   N CG+ + A++  V
Sbjct: 271 GTDKKDY-WLVRNSWGTSWGESGYIRMVRNKNNKCGVATMAFVPQV 315


>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
           [Cucumis sativus]
          Length = 314

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 76/221 (34%), Positives = 119/221 (53%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LP ++DWRQ     + P+++QG+CGSCWAF+  A +E    +    L  LS+ +LV+CD 
Sbjct: 100 LPTNVDWRQEGA--VTPIKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDV 157

Query: 185 -HGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
             GN  CNGG +  AFE++K+ GL ++ +YPY+  E+    C  +KEK +      +   
Sbjct: 158 TSGNQGCNGGYMYKAFEFIKRTGLTTEIEYPYQGAESA---CNEQKEKYQFVSISGYEKV 214

Query: 244 GVDHMMHL---LQSGPIGVYLNHRL--IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
            V+    L   + + P+ V ++      + Y G     N   C  ++L+H VAIVGYGE 
Sbjct: 215 PVNDEKSLKAAVANQPVSVAIDAEGNNFQFYSGGIFSGN---CG-NQLNHGVAIVGYGET 270

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGA----NACGIESYA 335
           +    W+V+NSWG    + GY +++R +      CGI   A
Sbjct: 271 SNQAYWLVKNSWGTDWGESGYIRMKRDSTDKQGTCGIAMMA 311


>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
 gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
          Length = 479

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 106/340 (31%), Positives = 160/340 (47%), Gaps = 58/340 (17%)

Query: 31  DLAYD-----SIKQVDA-FKTYIVKWNRTYTDD--------NEIKTRFEYFK-------- 68
           DL YD     S +++ A F +++++  ++Y ++         E  TR+  FK        
Sbjct: 39  DLGYDPQDLSSEERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHG 98

Query: 69  QDGKETDEYYGTSGSSDRSPQEI-LQRTGLRLTGKEKERLEADRERVKKFLNERKKGP-- 125
           ++ K    + G +  +D + +E   QR G R           DR R +    E + G   
Sbjct: 99  ENEKNQGYFLGLNAFADLTNEEFRAQRHGGRF----------DRSRERTSYEEFRYGSVQ 148

Query: 126 ---LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
              LP S+DWR+    V   V+ QG CGSCWAF+  A +E    L    L  LS+ +LV+
Sbjct: 149 LKDLPDSIDWREKGAVV--GVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVD 206

Query: 183 CDHG-NLNCNGGNIDVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           CD G +  CNGG +D AF +V K  GL+++ADYPY+       RC   K  AKV   D +
Sbjct: 207 CDKGEDEGCNGGLMDYAFGFVIKNGGLDTEADYPYKGYGT---RCDRSKMNAKVVTIDGY 263

Query: 241 VTSGVDHMMHLLQS---GPIGVYLNH--RLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
               V+    LL++    P+ V ++     ++ Y           C    LDH V  VGY
Sbjct: 264 EDVPVNDETALLKAVAHQPVSVAIDAGGSSMQFYRSGIFTGR---CGT-DLDHGVTNVGY 319

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIER----GANACGI 331
           G+++G   WI++NSWG    + GY ++ R     A  CGI
Sbjct: 320 GKEDGKAYWIIKNSWGSNWGEKGYIKMARNTGLAAGLCGI 359


>gi|387915678|gb|AFK11448.1| cathepsin L1 [Callorhinchus milii]
          Length = 336

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 77/216 (35%), Positives = 110/216 (50%), Gaps = 12/216 (5%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LP S+DWR      + PV++QG CGSCWAF++T  LE Q       L PLS+  LV+C  
Sbjct: 120 LPGSVDWRDKGY--VTPVKNQGACGSCWAFSSTGALEGQTFKKTGKLIPLSEQNLVDCSQ 177

Query: 185 -HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
             GN  CNGG +D AF Y++Q  G++++A YPY  KE+    C Y+             +
Sbjct: 178 KQGNHGCNGGMMDRAFTYIQQNNGIDTEASYPYTAKEH---PCNYDPRHNAATCHGYRYS 234

Query: 243 SGVDHMM---HLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN 299
              D M     +   GPI V ++ + I           +  C  + ++HAV +VGY  + 
Sbjct: 235 EQYDEMALAETVATIGPISVAIDAKHISFQFYKSGIYQEPRCQSYNINHAVLVVGYNSQG 294

Query: 300 GILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESY 334
           G   WIV+NS+G    + GY  + +   N CGI SY
Sbjct: 295 GNNYWIVKNSFGSRWGNKGYIWMPKDKNNHCGIASY 330


>gi|256077195|ref|XP_002574893.1| cathepsin F (C01 family) [Schistosoma mansoni]
 gi|353230782|emb|CCD77199.1| cathepsin F (C01 family) [Schistosoma mansoni]
          Length = 456

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 100/308 (32%), Positives = 149/308 (48%), Gaps = 30/308 (9%)

Query: 48  VKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQRTGLR 98
           +K+ + Y + +EI  RF  FK +  +   Y         YG +  SD +  E   RT L 
Sbjct: 163 LKYRKQYHETDEI--RFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTTDE-FARTHLT 219

Query: 99  LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
            +             + K +N      +PK+ DWR+     +  V++QG CGSCWAF+TT
Sbjct: 220 ASWVVPSSRSNTPTSLGKEVNN-----IPKNFDWREKGA--VTEVKNQGMCGSCWAFSTT 272

Query: 159 AILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFE-YVKQYGLESQADYPYRN 217
             +ESQ       L  LS+ QLV+CD  +  CNGG    A+E  +K  GL  + +YPY  
Sbjct: 273 GNVESQWFRKTGKLLSLSEQQLVDCDGLDDGCNGGLPSNAYESIIKMGGLMLEDNYPYDA 332

Query: 218 KENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIR 275
           K     +C  + +   V++  +        +    L  +  I V +N  L++ Y  + I 
Sbjct: 333 KNE---KCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQFYQ-HGIS 388

Query: 276 RNDWA-CNPHKLDHAVAIVGYG--EKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIE 332
              W  C+ + LDHAV +VGYG  EKN    WIV+NSWG    ++GYF++ RG   CGI 
Sbjct: 389 HPWWIFCSKYLLDHAVLLVGYGVSEKNEPF-WIVKNSWGVEWGENGYFRMYRGDGTCGIN 447

Query: 333 SYAYLASV 340
           + A  A +
Sbjct: 448 TVATSALI 455


>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
          Length = 319

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 80/223 (35%), Positives = 116/223 (52%), Gaps = 21/223 (9%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-- 184
           P+S+DWR+     + PV+ QG+CGSCWAF+TT  LE Q       L  LS+  LV+C   
Sbjct: 99  PRSVDWREKGY--VTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRP 156

Query: 185 HGNLNCNGGNIDVAFEYVKQY-GLESQADYPY--RNKENITFRCTYEKEKAKVFVQDTWV 241
            GN  CNGG +D AF+YV+   G++S+  YPY  ++ E+  ++  Y       FV    +
Sbjct: 157 EGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVD---I 213

Query: 242 TSGVDH--MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG- 296
             G +   M  +   GP+ V ++  H   + Y        D  C+   LDH V +VGYG 
Sbjct: 214 PQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPD--CSSEDLDHGVLVVGYGF 271

Query: 297 ---EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
              + +G   WIV+NSWG+   D GY  + +   N CGI + A
Sbjct: 272 EGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAA 314


>gi|162138968|ref|NP_001104662.1| uncharacterized protein LOC567623 precursor [Danio rerio]
 gi|158254065|gb|AAI54241.1| Zgc:174153 protein [Danio rerio]
          Length = 336

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 83/242 (34%), Positives = 127/242 (52%), Gaps = 34/242 (14%)

Query: 119 NERKKGPL---------PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLK 169
           N   +GPL         P+ +DWRQ     + PV+ Q +CGSCW+F++T  LE Q+    
Sbjct: 99  NRTSQGPLFMEPSFFAAPQQVDWRQRGY--VTPVKDQKQCGSCWSFSSTGALEGQLFRKT 156

Query: 170 KTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCT 226
             L  +S+  LV+C    GN  CNGG +D+AF+YVK+  GL+S+  YPY  ++++   C 
Sbjct: 157 GKLISMSEQNLVDCSRPQGNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLP--CR 214

Query: 227 YEKE----KAKVFVQDTWVTSGVDH--MMHLLQSGPIGVYLN--HRLIESYDGNPIRRND 278
           Y+      K+  FV    + SG +   M  +   GP+ V ++  H+ ++ Y         
Sbjct: 215 YDPRFNVAKSTGFVD---IPSGNEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYER- 270

Query: 279 WACNPHKLDHAVAIVGYGEKNGILT----WIVRNSWGDIGPDHGYFQIERGA-NACGIES 333
            AC+  +LDHAV +VGYG +   +     WIV+NSW D   D GY  + +   N CG+ +
Sbjct: 271 -ACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVAT 329

Query: 334 YA 335
            A
Sbjct: 330 KA 331


>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
          Length = 368

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 94/316 (29%), Positives = 150/316 (47%), Gaps = 45/316 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLR---- 98
           ++ ++VK  + Y    E   RF+ FK +    DE+         + Q      GL     
Sbjct: 39  YEEWLVKHQKVYNGLREKDQRFQIFKDNLNFIDEH---------NAQNYTYIVGLNKFAD 89

Query: 99  LTGKE-KERLEADRERVKKFLNERK----------KGPLPKSLDWRQSKVKVLNPVESQG 147
           +T +E ++     R  +K+ + + K             LP  +DWR      +  ++ QG
Sbjct: 90  MTNEEYRDMYLGTRSDIKRRIMKNKITGHRYAYNSGDRLPVHVDWRLKGA--ITHIKDQG 147

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQY 205
            CGSCWAF+T A +E+   ++   L  LS+ +LV+CD   N  CNGG +D AFE++    
Sbjct: 148 SCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIGNG 207

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL- 261
           G+++   YPY+  E    RC   ++KAK+   D +    ++  + +   +   P+ V + 
Sbjct: 208 GIDTDQHYPYKGFEG---RCDPTRKKAKIVSIDGYEDVPSNNENALKKAVAHQPVSVAIE 264

Query: 262 -NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
            + R ++ Y           C    LDHAV IVGYG +NG+  W+VRNSWG    + GYF
Sbjct: 265 ASGRALQLYQSGVFTGK---CGT-SLDHAVVIVGYGSENGLDYWLVRNSWGTNWGEDGYF 320

Query: 321 QIERGANA-----CGI 331
           ++ER         CGI
Sbjct: 321 KMERNVKGTHTGKCGI 336


>gi|321476443|gb|EFX87404.1| hypothetical protein DAPPUDRAFT_307061 [Daphnia pulex]
          Length = 332

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 92/260 (35%), Positives = 126/260 (48%), Gaps = 14/260 (5%)

Query: 79  GTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVK 138
           G +  SD  P E  +  GL        RL+A   + K  +  R   P   +LD R     
Sbjct: 76  GLNKFSDMLPSEWSRYLGLNKAALVAARLKAGPTQFK--VESRDVSP---TLDLRYDSC- 129

Query: 139 VLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVA 198
            L  V+ QG+CGSCWAFA  A LE        T   LS+ QLV+CD  +  CNGG    A
Sbjct: 130 -LPEVKDQGQCGSCWAFAAVAPLEFAQCKKDSTRVVLSERQLVDCDRLDSGCNGGMYTDA 188

Query: 199 FEYVKQYG-LESQADY-PYRNKENIT-FRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSG 255
           + Y+K  G    Q  Y PY  ++N   FR +    +   F       + +   + + Q G
Sbjct: 189 WTYIKNAGGCAKQTLYSPYNARKNFCKFRSSMVGAQVSTF-DFLPANNPLAMQVAMEQHG 247

Query: 256 PIGVYLNH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIG 314
           PI V +       S+ G+    N  AC+  +++HAV +VG+G  NG+  W+VRNSWG   
Sbjct: 248 PIAVAIAVVPSFLSFHGDVYDDN--ACDGAEINHAVVVVGWGTLNGVDYWMVRNSWGTNW 305

Query: 315 PDHGYFQIERGANACGIESY 334
              GY +I+RG N CGIESY
Sbjct: 306 GLSGYIRIKRGVNKCGIESY 325


>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
          Length = 422

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 151/313 (48%), Gaps = 39/313 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           ++ ++VK  + Y    E   RF+ FK + +  D++     + +R+ +  L R    LT +
Sbjct: 4   YEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDH----NADNRTYKLGLNRFA-DLTNE 58

Query: 103 E------KERLEADRERVKKFLNERKKGP-----LPKSLDWRQSKVKVLNPVESQGRCGS 151
           E        R++ +R  VK      +  P     LP+S+DWR     +  PV+ QG CGS
Sbjct: 59  EYRARYLGTRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVL--PVKDQGNCGS 116

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYGLES 209
           CWAF+T   +E    ++   L  LS+ +LV+CD   N  CNGG +D A+E+ +   G++S
Sbjct: 117 CWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGIDS 176

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHR 264
           + DYPYR  +     C   ++ AKV   D++     +  + L   + + P+ V +    R
Sbjct: 177 EEDYPYRAVDGT---CDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGR 233

Query: 265 LIESY-DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIE 323
             + Y  G    R   A     LDH V  VGYG   G   WIVRNSWG    + GY ++E
Sbjct: 234 EFQLYVSGVFTGRCGTA-----LDHGVVAVGYGSVKGHDYWIVRNSWGASWGEEGYVRLE 288

Query: 324 RG-----ANACGI 331
           R      +  CGI
Sbjct: 289 RNLAKSRSGKCGI 301


>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 364

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 93/320 (29%), Positives = 153/320 (47%), Gaps = 31/320 (9%)

Query: 32  LAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFK------QDGKETDEYY--GTSGS 83
           + Y   + +D ++ ++VK  + Y   +E + RF+ FK      QD    +  Y  G +  
Sbjct: 25  INYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKF 84

Query: 84  SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPV 143
           +D + +E   R     T  + +R     +             LP  +DWR      + P+
Sbjct: 85  ADITNEEY--RAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQLPVHVDWRLKGA--VGPI 140

Query: 144 ESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV 202
           + QG CGSCWAF+T A +E    ++      LS+ +LV+CD   +  CNGG +D AF+++
Sbjct: 141 KDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREYDEGCNGGLMDYAFQFI 200

Query: 203 KQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIG 258
            Q G ++++ DYPY   + I   C   K+K KV   D +    ++  + +   +   P+ 
Sbjct: 201 IQNGGIDTEEDYPY---QGIDGTCDQTKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVS 257

Query: 259 VYL--NHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPD 316
           V +  + R ++ Y           C    LDH V +VGYG +NG+  W+VRNSWG    +
Sbjct: 258 VAIEASGRALQLYQSGVFTGK---CGT-ALDHGVVVVGYGTENGVDYWLVRNSWGTGWGE 313

Query: 317 HGYFQIERGANA-----CGI 331
            GYF++ER   +     CGI
Sbjct: 314 DGYFKMERNVRSTSEGKCGI 333


>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
          Length = 338

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 98/297 (32%), Positives = 142/297 (47%), Gaps = 21/297 (7%)

Query: 50  WNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEA 109
           W RT  ++N    +    +Q         G +   D + +E  +     LTG E+   + 
Sbjct: 47  WRRTVWEENLKAIQLHNLEQSLGLHTYRLGMNQFGDLTNEEFQEI----LTG-ERHFSKG 101

Query: 110 DRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLK 169
           +R     FL E     +P S+DWR      + PV++QG CGSCWAF+TT  LE Q+    
Sbjct: 102 NRINGSAFL-EANFVQVPTSVDWRDHGY--VTPVKNQGHCGSCWAFSTTGALEGQLFRKS 158

Query: 170 KTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCT 226
             L  LS+  LV+C    GN  C+GG +D+AF+Y+ Q  G++S+  YPY  K+  T +CT
Sbjct: 159 GRLISLSEQNLVDCSWQQGNQGCHGGIVDLAFQYILQNQGIDSEDCYPYTAKD--TAQCT 216

Query: 227 YEKEKAKVFVQ---DTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNP 283
           ++ E A   V    D    S    M  +   GP+ V ++               D  C+ 
Sbjct: 217 FKPECATAPVTGFVDIPPHSEEALMKAVATVGPVSVGIDASSTSFRFYQSGIFYDPKCSS 276

Query: 284 HKLDHAVAIVGYG----EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
             LDHAV +VGYG    ++ G   WIV+NSWG    D GY  + +   N CGI + A
Sbjct: 277 ESLDHAVLVVGYGYEREDEAGKKYWIVKNSWGKHWGDRGYVYMSKDRGNHCGIATVA 333


>gi|260821804|ref|XP_002606293.1| hypothetical protein BRAFLDRAFT_57270 [Branchiostoma floridae]
 gi|229291634|gb|EEN62303.1| hypothetical protein BRAFLDRAFT_57270 [Branchiostoma floridae]
          Length = 246

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 87/236 (36%), Positives = 121/236 (51%), Gaps = 23/236 (9%)

Query: 115 KKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYP 174
           KK   + + GP+P   DWR  K  V++ V+ QG CGSCW F+ T  LES  A+       
Sbjct: 13  KKARVQSRAGPVPAKKDWR-DKPGVVSGVKDQGHCGSCWTFSATGCLESVTAITFGAPMN 71

Query: 175 LSKSQLVECDHG--NLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEK 231
           LS+ QLV C  G  N  C GG    A+EYVK   G+ES+ DYPY  K+    +C +   K
Sbjct: 72  LSEQQLVSCAQGFNNHGCEGGLPSQAWEYVKWAQGIESEKDYPYTAKDG---KCMFNTNK 128

Query: 232 AKVFVQDTW-VTSG-VDHMMHLLQS-GPIG----VYLNHRLIES--YDGNPIRRNDWACN 282
              +V+D   +T G  D ++  + +  P+     V  + +L +   Y      R+     
Sbjct: 129 TIAYVRDVVNITQGDEDEILQAVGTLNPVSIAYQVVADFKLYKKGVYSSKLCHRDQ---- 184

Query: 283 PHKLDHAVAIVGYGEKNGILT-WIVRNSWGDIGPDHGYFQIERGANACGI-ESYAY 336
              ++HAV +VGYGE   ++  WIV+NSWG      GYF IER  N CG+ E  AY
Sbjct: 185 -EHVNHAVLVVGYGEDESVIPYWIVKNSWGPSWGMDGYFLIERNQNMCGLAECAAY 239


>gi|213512532|ref|NP_001134063.1| Cathepsin O precursor [Salmo salar]
 gi|209730446|gb|ACI66092.1| Cathepsin O precursor [Salmo salar]
          Length = 341

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 85/294 (28%), Positives = 137/294 (46%), Gaps = 11/294 (3%)

Query: 43  FKTYIVKWNRTYTDDNEI-KTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTG 101
           F+++  +++R Y   ++    R  YFK   K        S   D +   I Q + L +  
Sbjct: 45  FESFREQFHRNYKLHSDCYHRRRSYFKNSIKRHAYLNSLSTDKDSAKYGINQFSDLSIHE 104

Query: 102 KEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAIL 161
             +  L A  E V  +   + +G LP   DWR      +  V++Q  CG CWAF+    +
Sbjct: 105 FRELYLTATAETVPPYSGLKTEG-LPAKFDWRVKAA--VGSVQNQQACGGCWAFSVVGAI 161

Query: 162 ESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ--YGLESQADYPYRNKE 219
           ES  A   +    LS  Q+++C + N  CNGG+I  A  ++KQ    L  Q++YPY+ + 
Sbjct: 162 ESVYAKSGQPFKQLSVQQVIDCSYKNQGCNGGSITRALSWLKQTRVKLVKQSEYPYKAET 221

Query: 220 NIT--FRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRN 277
            I   F  +++    K F    +       M  L++ GP+ V ++    + Y G  ++ +
Sbjct: 222 GICHLFSQSHDGVLVKDFAAHDYSGHEEAMMGRLVEWGPLAVTVDAISWQDYLGGIMQHH 281

Query: 278 DWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGI 331
              C+ H  +HAV + GY     +  WIV+NSWG    + GY  I+ G N CGI
Sbjct: 282 ---CSCHHANHAVLVTGYDTTGDVPYWIVQNSWGTSWGNEGYVYIKMGGNVCGI 332


>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
 gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
          Length = 341

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 122/227 (53%), Gaps = 31/227 (13%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +PKS+DWR  K   + PV++QG+CGSCW+F+ T  LE Q       L  LS+  L++C  
Sbjct: 124 IPKSVDWR--KKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181

Query: 185 -HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKA----KVFVQD 238
            +GN  C GG +D+AF+Y+K   GL+++  YPY  +++   +C Y  E +    K FV  
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDD---KCRYNPENSGATDKGFVD- 237

Query: 239 TWVTSG-VDHMMHLLQS-GPIGVYLN--HRLIESYDG----NPIRRNDWACNPHKLDHAV 290
             +  G  D +MH L + GP+ + ++      + Y      NP       C+  +LDH V
Sbjct: 238 --IPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNP------RCSSTELDHGV 289

Query: 291 AIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
             VG+G +K G   WIV+NSWG    D GY  + R   N CG+ S A
Sbjct: 290 LAVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSA 336


>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
 gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
          Length = 339

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 81/221 (36%), Positives = 116/221 (52%), Gaps = 23/221 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LPKS+DWR+     +  V+ QG CGSCWAF++T  LE Q      TL  LS+  LV+C  
Sbjct: 122 LPKSVDWREKGA--VTAVKDQGHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSA 179

Query: 185 -HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKA----KVFVQD 238
            +GN  CNGG +D AF Y+K   G++++  YPY   E I   C + K+      + F   
Sbjct: 180 KYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY---EGIDDSCHFNKDSVGATDRGFAD- 235

Query: 239 TWVTSGVDHMM--HLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVG 294
             +  G +  M   +   GP+ V ++  H   + Y       N+  CN   LDH V +VG
Sbjct: 236 --IPQGNEKKMAEAVATIGPVSVAIDASHESFQFYSEGIY--NEPECNSQNLDHGVLVVG 291

Query: 295 YG-EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIES 333
           YG +++G   W+V+NSWG    D G+ ++ R   N CGI S
Sbjct: 292 YGTDESGKDYWLVKNSWGTTWGDKGFIKMARNEDNQCGIAS 332


>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
          Length = 469

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 97/310 (31%), Positives = 148/310 (47%), Gaps = 35/310 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           ++ ++ K  ++Y    E + RF+ FK + +  DE+     + +R+ +  L R    LT +
Sbjct: 53  YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEH----NAENRTYKVGLNRFA-DLTNE 107

Query: 103 E------KERLEADRERVKKFLNE---RKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCW 153
           E        R  A R    K  +    R    LP+S+DWR+    V   V+ QG CGSCW
Sbjct: 108 EYRSMYLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVV--EVKDQGSCGSCW 165

Query: 154 AFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY-VKQYGLESQA 211
           AF+T A +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE+ +   G++S+ 
Sbjct: 166 AFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEE 225

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHL---LQSGPIGVYL--NHRLI 266
           DYPY+  +    RC   ++ A V   D +     +    L   + + P+ V +    R  
Sbjct: 226 DYPYKASDG---RCDQYRKNAXVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREF 282

Query: 267 ESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER-- 324
           + Y           C    LDH V  VGYG +NG+  WIV+NSWG    + GY ++ER  
Sbjct: 283 QLYQSGIFTGR---CGT-ALDHGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDL 338

Query: 325 ---GANACGI 331
                  CGI
Sbjct: 339 ATSATGKCGI 348


>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
 gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 91/305 (29%), Positives = 149/305 (48%), Gaps = 27/305 (8%)

Query: 44  KTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKE 103
           + ++ K+ R Y D++E + RFE F+ +  E  E +   G+    P ++       LT +E
Sbjct: 39  EMWMAKYGRVYKDNSEKERRFEIFRNN-VEFIESFNKLGNR---PYKLDINEFADLTNEE 94

Query: 104 KERLEADRERVKKF-LNERKK------GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFA 156
            +  +   +R     L E+          +P S+DWRQ+    + P++ QG+CG CWAF+
Sbjct: 95  FKVSKNGYKRSSGVGLTEKSSFRYANVTAVPTSMDWRQNGA--VTPIKDQGQCGCCWAFS 152

Query: 157 TTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGNIDVAFEYVKQY-GLESQADY 213
             A +E    L    L  LS+ +LV+CD    +  C GG +D AFE++KQ  GL ++A+Y
Sbjct: 153 AVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTEANY 212

Query: 214 PYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNH--RLIESYDG 271
           PY+  +          + AK+   +    +  D ++  + S P+ V ++      + Y G
Sbjct: 213 PYQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQFYSG 272

Query: 272 NPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGANA-- 328
                +   C   +LDH V  VGYG   +G   W+V+NSWG    + GY ++ER   A  
Sbjct: 273 GVFTGD---CGT-ELDHGVTAVGYGTSDDGTKYWLVKNSWGTSWGEDGYIRMERDIEAKE 328

Query: 329 --CGI 331
             CGI
Sbjct: 329 GLCGI 333


>gi|19922450|ref|NP_611221.1| CG4847, isoform A [Drosophila melanogaster]
 gi|24654437|ref|NP_725687.1| CG4847, isoform B [Drosophila melanogaster]
 gi|24654439|ref|NP_725688.1| CG4847, isoform C [Drosophila melanogaster]
 gi|45552699|ref|NP_995874.1| CG4847, isoform E [Drosophila melanogaster]
 gi|7302775|gb|AAF57850.1| CG4847, isoform A [Drosophila melanogaster]
 gi|15010382|gb|AAK77239.1| GH01592p [Drosophila melanogaster]
 gi|21645236|gb|AAM70881.1| CG4847, isoform B [Drosophila melanogaster]
 gi|21645237|gb|AAM70882.1| CG4847, isoform C [Drosophila melanogaster]
 gi|45445496|gb|AAS64820.1| CG4847, isoform E [Drosophila melanogaster]
 gi|220944958|gb|ACL85022.1| CG4847-PA [synthetic construct]
 gi|220954732|gb|ACL89909.1| CG4847-PA [synthetic construct]
          Length = 390

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 86/259 (33%), Positives = 131/259 (50%), Gaps = 23/259 (8%)

Query: 84  SDRSPQEIL-QRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNP 142
           +D +  E L Q TGL+ + + K R  A      K +N   K P+P + DWR+     + P
Sbjct: 135 ADLTHSEFLSQLTGLKRSPEAKARAAASL----KLVNLPAK-PIPDAFDWREHGG--VTP 187

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC----DHGNLNCNGGNIDVA 198
           V+ QG CGSCWAFATT  +E        +L  LS+  LV+C    D G   C+GG  + A
Sbjct: 188 VKFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQEAA 247

Query: 199 FEYVK--QYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD--HMMHLLQS 254
           F ++   Q G+  +  YPY + +     C Y+  K+   +Q        D   +  ++ +
Sbjct: 248 FCFIDEVQKGVSQEGAYPYIDNKGT---CKYDGSKSGATLQGFAAIPPKDEEQLKKVVAT 304

Query: 255 -GPIGVYLNH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGD 312
            GP+   +N    +++Y G     ND  CN  + +H++ +VGYG + G   WIV+NSW D
Sbjct: 305 LGPVACSVNGLETLKNYAGG--IYNDDECNKGEPNHSILVVGYGSEKGQDYWIVKNSWDD 362

Query: 313 IGPDHGYFQIERGANACGI 331
              + GYF++ RG N C I
Sbjct: 363 TWGEKGYFRLPRGKNYCFI 381


>gi|226476132|emb|CAX72156.1| cathepsin L, a [Schistosoma japonicum]
          Length = 331

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 103/333 (30%), Positives = 152/333 (45%), Gaps = 55/333 (16%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYT-DDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQ 89
           D  YD I     ++ + +K+N+TYT +D+E++ +  + ++ GK                Q
Sbjct: 20  DKQYDEI-----WRQWKLKYNKTYTSNDDEMRRKMIFMRRIGK---------------IQ 59

Query: 90  EILQRTGLRLTGK--------EKERLEADRERVKKFLN------------ERKKGPLPKS 129
           E   R  L L G         + E  E +R    K               E    P+P  
Sbjct: 60  EHNLRHDLGLEGYTMGLNQFCDMEWEEVNRIMFPKVFGNSPLWNDDGNELELTNKPVPSK 119

Query: 130 LDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGN 187
            DWR      +  V++QG CGSCWAF+ T  +E Q+    K L  LS+ QLV+C   +GN
Sbjct: 120 WDWRDHGA--VTAVKNQGLCGSCWAFSATGAIEGQLRRKHKKLISLSEQQLVDCSTPYGN 177

Query: 188 LNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
             C GG +D AF Y++ + +ES+ DY Y   +     C Y K K  V V+        D 
Sbjct: 178 YGCGGGFMDHAFNYLESHYIESENDYKYLGYDA---NCHYRKSKGVVKVKKFVDLPSKDE 234

Query: 248 ---MMHLLQSGPIGV-YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT 303
                 + Q GPI V  +    +  Y       ND  C    ++H V +VGYGE++G   
Sbjct: 235 KTLQKAVYQYGPISVGIVALDSLTMYKSGVFESND--CKHADINHGVLVVGYGEEHGKDY 292

Query: 304 WIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           W+++NSWGD+    GYF++ R   N CG+ S A
Sbjct: 293 WLIKNSWGDLWGSKGYFKLRRNKHNMCGVASNA 325


>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
          Length = 340

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 81/215 (37%), Positives = 114/215 (53%), Gaps = 15/215 (6%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +P+S+DWR+     +N V+ QG+CGSCWAF+T A LES+  +    L  LS+ QLV+C  
Sbjct: 125 IPESIDWREKGA--VNAVKDQGQCGSCWAFSTIASLESRYFIETGKLQSLSEQQLVDCSK 182

Query: 185 HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYE--KEKAKVFVQDTWV 241
           +GN  CNGG++ +A +Y+    G+E++ DYPY  K+     C +E  KE A        V
Sbjct: 183 NGNEGCNGGDMGLAMDYIASAGGVETEKDYPYVGKDQT---CAFEASKEVATDKGHINIV 239

Query: 242 TSGVDHMMHLLQSGPIGVYLN-HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
                 +   +  GP+ V +    L   +  + I  + W C  + LDH VA VGYG  NG
Sbjct: 240 PGKFATLQAAIAEGPVSVAIEADSLFFQFYRSGIFDSSW-CGTN-LDHGVAAVGYGVDNG 297

Query: 301 ILTWIVRNSWGDIGPDHGYFQI---ERGANACGIE 332
              +IVRNSW D     GY  I     G   CGI+
Sbjct: 298 KQYYIVRNSWSDSWGLKGYINIIANGDGNGMCGIQ 332


>gi|343470212|emb|CCD17026.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 93/326 (28%), Positives = 149/326 (45%), Gaps = 48/326 (14%)

Query: 38  KQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDE--------YYGTSGSSDRSPQ 89
           +Q  AFK    K++R+Y D  E   RF  FKQ+ +   E         +G +  SD SP+
Sbjct: 39  QQFAAFKQ---KYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPE 95

Query: 90  EILQRTGLRLT-GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           E       R T     E   A  +R +K +N    G  P+++DWR  K   + PV+ QG+
Sbjct: 96  E------FRATYHNGAEYYAAALKRPRKVVN-VSTGKAPEAVDWR--KKGAVTPVKDQGK 146

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY--- 205
           C S WAF     +E Q  +    L  LS+  LV CD  +L C  G +D AF+++      
Sbjct: 147 CDSSWAFTVIGNIEGQWKIAGHELTSLSEQMLVSCDTNDLGCRAGFMDTAFKWIVSSNNG 206

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLL-----------QS 254
            + ++  YPY +       C    +  KV      V + +D  +H+L           + 
Sbjct: 207 NVFTEQSYPYASGGGNVPTC---NKSGKV------VGANIDDHVHILDNENAIAEWLAKK 257

Query: 255 GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIG 314
           GP+ + ++    +SY G  +     +C   +++ A  +VGY + +    WI++NSW    
Sbjct: 258 GPVAIAVDATSFQSYTGGVLT----SCISKEVNSAALLVGYDDTSKPPYWIIKNSWSKGW 313

Query: 315 PDHGYFQIERGANACGIESYAYLASV 340
            + GY +IE+G N C ++ Y   A V
Sbjct: 314 GEEGYIRIEKGTNQCRMKEYVSSAVV 339


>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
          Length = 326

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 81/228 (35%), Positives = 116/228 (50%), Gaps = 30/228 (13%)

Query: 125 PLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC- 183
           P+P   +W       + PV+ QG+CGSCWAF+ T  +E Q+ L KK L  LS+ QLV+C 
Sbjct: 107 PVPSYANWTAKGA--VTPVKDQGKCGSCWAFSATGSVEGQIFLKKKKLMSLSEQQLVDCS 164

Query: 184 -DHGNLNCNGGNIDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWV 241
            D GNL C GG +D AF+Y +   G+ ++  YPY  K+N    C Y+K  +   +     
Sbjct: 165 GDEGNLGCGGGLMDNAFKYFIANKGIANEKSYPYTAKDN---DCKYKKSMSVATISSFKD 221

Query: 242 TSGVDH---MMHLLQSGPIGVYLN-----HRLIES---YDGNPIRRNDWACNPHKLDHAV 290
               D     M +   GP+ V ++      +  ES   YD N        C+   LDH V
Sbjct: 222 VKHKDEDQLKMAVANVGPVSVAIDASSSKFQFYESGVYYDEN--------CSSEVLDHGV 273

Query: 291 AIVGYG--EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
             VGYG  +K+G+  W+V+NSW      +GY ++ R   N CGI + A
Sbjct: 274 LAVGYGTDKKSGMDFWLVKNSWAASWGLNGYIKMARNKDNNCGIATMA 321


>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
          Length = 341

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 122/227 (53%), Gaps = 31/227 (13%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +PKS+DWR  K   + PV++QG+CGSCW+F+ T  LE Q       L  LS+  L++C  
Sbjct: 124 IPKSVDWR--KKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181

Query: 185 -HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKA----KVFVQD 238
            +GN  C GG +D+AF+Y+K   GL+++  YPY  +++   +C Y  E +    K FV  
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDD---KCRYNPENSGATDKGFVD- 237

Query: 239 TWVTSG-VDHMMHLLQS-GPIGVYLN--HRLIESYDG----NPIRRNDWACNPHKLDHAV 290
             +  G  D +MH L + GP+ + ++      + Y      NP       C+  +LDH V
Sbjct: 238 --IPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNP------RCSSTELDHGV 289

Query: 291 AIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
             VG+G +K G   WIV+NSWG    D GY  + R   N CG+ S A
Sbjct: 290 LAVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSA 336


>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
          Length = 366

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 78/222 (35%), Positives = 112/222 (50%), Gaps = 30/222 (13%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP  +DWR      + P++ QG CGSCWAF+T A +E+   ++      LS+ +LV+CD 
Sbjct: 130 LPVHVDWRVKGA--VAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDR 187

Query: 186 G-NLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW--- 240
             N  CNGG +D AFE++ Q  G+++  DYPYR  + I   C   K+ AKV   D +   
Sbjct: 188 AYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGI---CDPTKKNAKVVNIDGFEDV 244

Query: 241 -------VTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIV 293
                  +   V H     Q   I +  + R ++ Y           C    LDH V +V
Sbjct: 245 PPYDENALKKAVAH-----QPVSIAIEASGRDLQLYQSGVFTGK---CGT-SLDHGVVVV 295

Query: 294 GYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGI 331
           GYG +NG+  W+VRNSWG    + GYF+++R        CGI
Sbjct: 296 GYGSENGVDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGI 337


>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
 gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
          Length = 455

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 150/313 (47%), Gaps = 42/313 (13%)

Query: 43  FKTYIVKWNRTYTDDNEIKT--RFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLR-- 98
           ++ ++VK  +    ++ ++   RFE FK + +  D         D + + +  R GL   
Sbjct: 43  YEAWLVKHGKAQNQNSLVEKDRRFEIFKDNLRFID---------DHNKKNLSYRLGLTRF 93

Query: 99  --LTGKE------KERLEADRERVKKFLNERKKG-PLPKSLDWRQSKVKVLNPVESQGRC 149
             LT  E        ++E   ER      E + G  LP+S+DWR  K   +  V+ QG C
Sbjct: 94  ADLTNDEYRSKYLGAKMEKKGERRTSQRYEARVGDELPESIDWR--KKGAVAEVKDQGSC 151

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYV-KQYGL 207
           GSCWAF+T   +E    ++   L  LS+ +LV+CD   N  CNGG +D AFE++ K  G+
Sbjct: 152 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 211

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTW---VTSGVDHMMHLLQSGPIGVYL--N 262
           ++  DYPY+  +     C   ++ AKV   D++    T   + +   +   P+ V +   
Sbjct: 212 DTDKDYPYKGVDGT---CDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPVSVAIEAG 268

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
            R  + YD       D  C   +LDH V  VGYG +NG   WIVRNSWG    + GY ++
Sbjct: 269 GRAFQLYDSGIF---DGTCGT-QLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLKM 324

Query: 323 ER----GANACGI 331
            R     +  CGI
Sbjct: 325 ARNIASSSGKCGI 337


>gi|54020908|ref|NP_001005695.1| cathepsin S precursor [Xenopus (Silurana) tropicalis]
 gi|49522293|gb|AAH75261.1| cathepsin S [Xenopus (Silurana) tropicalis]
          Length = 333

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 88/264 (33%), Positives = 133/264 (50%), Gaps = 19/264 (7%)

Query: 79  GTSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKV 137
           G +  +D + +EI  + TGL L  + + +     ++   F      G +P S+DWR    
Sbjct: 75  GMNHLADMTSEEIKSKLTGLILPPQSERQATFSSQKNSTF-----GGKVPDSIDWRDKGC 129

Query: 138 KVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNI 195
             ++ V++QG CGSCWAF+    LE Q+ L    L  LS   LV+C   +GN  C GG +
Sbjct: 130 --VSDVKNQGGCGSCWAFSAVGALEGQLMLKTGKLVSLSPQNLVDCSSKYGNKGCGGGFM 187

Query: 196 DVAFEYV-KQYGLESQADYPYRNKENITFRCTYEKE-KAKVFVQDTWVTSGV-DHMMHLL 252
             AF+YV    G++S + YPY   +    +C Y+   KA    + T +  G  D++   L
Sbjct: 188 TQAFQYVIDNKGIDSDSYYPYHAMDE---KCHYDPTGKASTCAKYTEIVPGTEDNLKQAL 244

Query: 253 QS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG 311
            S GPI V ++      +       +D  C+ H+++H V  VGYG  NG   W+++NSWG
Sbjct: 245 GSIGPISVAIDGTRPSFFLYRSGVYSDPTCS-HEVNHGVLAVGYGNLNGQDFWLLKNSWG 303

Query: 312 DIGPDHGYFQIERG-ANACGIESY 334
               D GY +I R   N CG+ SY
Sbjct: 304 TKYGDQGYVRIARNKGNLCGVASY 327


>gi|52546912|gb|AAU81589.1| cysteine proteinase [Petunia x hybrida]
          Length = 257

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 82/255 (32%), Positives = 127/255 (49%), Gaps = 41/255 (16%)

Query: 102 KEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAIL 161
           K K +L  D+  +    +      LP   DWR+     +  V++QG CGSCW+F+TT  +
Sbjct: 6   KAKPKLSTDKAPILPTSD------LPDDFDWREKGA--VTGVKNQGSCGSCWSFSTTGAV 57

Query: 162 ESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQYGLESQA 211
           E    L    L  LS+ QLV+CDH          +  C GG +  AFEY +K  GL+ + 
Sbjct: 58  EGAHFLATGELVSLSEQQLVDCDHECDAEQQNECDAGCGGGLMTTAFEYTLKAGGLQREK 117

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPIGVYLNHRLIES 268
           DYPY  ++    +C ++K K    V +  V  G+D      +L++ GP+ V +N   +++
Sbjct: 118 DYPYTGRDG---KCHFDKSKIAASVANFSVV-GLDEDQIAANLVKHGPLAVGINAAWMQT 173

Query: 269 YDGN---PIRRNDWACNPHKLDHAVAIVGYGEKNGILT-------WIVRNSWGDIGPDHG 318
           Y G    P+      C   + DH V +VGYG              WI++NSWG+   + G
Sbjct: 174 YVGGVSCPL-----ICFKRQ-DHGVLLVGYGSAGFAPIRLKEKPYWIIKNSWGESWGEQG 227

Query: 319 YFQIERGANACGIES 333
           Y++I RG N CG+++
Sbjct: 228 YYKICRGRNICGVDA 242


>gi|313224805|emb|CBY20597.1| unnamed protein product [Oikopleura dioica]
          Length = 343

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 117/226 (51%), Gaps = 18/226 (7%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-- 183
           LP++ DWR+     ++PV++QG CGSCW F+TT  LES   +  K  Y LS+ QLV+C  
Sbjct: 125 LPENFDWREHGG--VSPVKNQGHCGSCWTFSTTGCLESAHLIHHKKAYNLSEQQLVDCAQ 182

Query: 184 DHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT 242
           D  N  CNGG    AFEY+    GLE + DY Y  +E +   C ++  K    V++ +  
Sbjct: 183 DFDNHGCNGGLPSHAFEYIHYVGGLEEEQDYSYHAEEGL---CEFDPTKTAGTVREVFNI 239

Query: 243 SGVDH---MMHLLQSGPIGVYLNHRLIES---YDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           +  D     + L    P+ V     +++    Y     + +     P  ++HAV  VGYG
Sbjct: 240 TETDEDQLTIALAYFNPVSVAF--EVVDGFRFYKEGVYQSDTCKSGPEDVNHAVLAVGYG 297

Query: 297 --EKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
             +K     +IV+NSWG    D G+F+I+RG N CGI + A    V
Sbjct: 298 MCKKCETPYFIVKNSWGAEWGDEGFFKIKRGENMCGIATCASFPIV 343


>gi|281209544|gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum PN500]
          Length = 465

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 92/309 (29%), Positives = 144/309 (46%), Gaps = 32/309 (10%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQ----EILQRTGLR 98
           F+ + +K+N+ YT  +E   RF  FK + K  DE    + S   S +    E    +   
Sbjct: 28  FRQFQIKYNKQYTS-SEYAERFATFKSNLKVIDEKNRDAASRKSSVRFGVNEFADLSQSE 86

Query: 99  LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATT 158
                   ++A R+       +     LP + DWR      +  V++QG+CGSCW+F+TT
Sbjct: 87  FRATYLNSVQAVRDPNAAVAADLPVEDLPTAFDWRTKGA--VTGVKNQGQCGSCWSFSTT 144

Query: 159 AILESQVALLKKTLYPLSKSQLVECDHGNL----------NCNGGNIDVAFEY-VKQYGL 207
             +E Q  L   TL  LS+  LV+CDH  +           CNGG    A+ Y +K  G+
Sbjct: 145 GNVEGQWFLAGNTLTGLSEQNLVDCDHECMEYLGDNVCDQGCNGGLQPNAYTYIIKNGGI 204

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHM-MHLLQSGPIGVYLNHRL 265
           +++A YPY+  +     C+++       + + T+V+S    M  +L+ +GP+ +  +   
Sbjct: 205 DTEASYPYQGVDGT---CSFKAANIGAKISNWTYVSSNETQMAAYLVANGPLAIAADAVE 261

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-----WIVRNSWGDIGPDHGYF 320
            + Y G      D  C  + LDH + IVGY  +N I       WIV+NSWG    + GY 
Sbjct: 262 WQFYLGGVF---DVPCG-NTLDHGILIVGYSAENTIFHKDKAYWIVKNSWGATWGEQGYI 317

Query: 321 QIERGANAC 329
            I RG   C
Sbjct: 318 YISRGNGEC 326


>gi|157311713|ref|NP_001098585.1| uncharacterized protein LOC564979 precursor [Danio rerio]
 gi|156230121|gb|AAI52284.1| Wu:fa26c03 protein [Danio rerio]
          Length = 336

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 82/238 (34%), Positives = 124/238 (52%), Gaps = 26/238 (10%)

Query: 119 NERKKGPL---------PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLK 169
           N   +GPL         P+ +DWRQ     + PV+ Q +CGSCW+F++T  LE Q+    
Sbjct: 99  NRTSQGPLFMEPSFFAAPQQVDWRQRGF--VTPVKDQKQCGSCWSFSSTGALEGQLFRKT 156

Query: 170 KTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCT 226
             L  +S+  LV+C    GN  CNGG +D AF+YVK+  GL+S+  YPY  ++++  R  
Sbjct: 157 GKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYD 216

Query: 227 YEKEKAKV--FVQDTWVTSGVDHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACN 282
                AK+  FV D    + +  M  +   GP+ V ++  H+ ++ Y          AC+
Sbjct: 217 PRFNVAKITGFV-DIPRGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYER--ACS 273

Query: 283 PHKLDHAVAIVGYGEKNGILT----WIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
             +LDHAV +VGYG +   +     WIV+NSW D   D GY  + +   N CG+ + A
Sbjct: 274 SSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATSA 331


>gi|3023456|sp|Q26534.1|CATL_SCHMA RecName: Full=Cathepsin L; AltName: Full=SMCL1; Flags: Precursor
 gi|555663|gb|AAC46485.1| preprocathepsin L [Schistosoma mansoni]
 gi|1094710|prf||2106314A cathepsin L
          Length = 319

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 99/315 (31%), Positives = 152/315 (48%), Gaps = 29/315 (9%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEI 91
           + +  + +K+ + Y  + E + RF  FK +  +   Y         YG +  SD +  E 
Sbjct: 18  EKYVQFKLKYRKQY-HETEDEIRFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTTDE- 75

Query: 92  LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
             RT L  +             + K +N      +PK+ DWR+     +  V++QG CGS
Sbjct: 76  FARTHLTASWVVPSSRSNTPTSLGKEVNN-----IPKNFDWREKGA--VTEVKNQGMCGS 128

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFE-YVKQYGLESQ 210
           CWAF+TT  +ESQ       L  LS+ QLV+CD  +  CNGG    A+E  +K  GL  +
Sbjct: 129 CWAFSTTGNVESQWFRKTGKLLSLSEQQLVDCDGLDDGCNGGLPSNAYESIIKMGGLMLE 188

Query: 211 ADYPYRNKENITFRCTYEKEKAKVFVQDT--WVTSGVDHMMHLLQSGPIGVYLNHRLIES 268
            +YPY  K     +C  + +   V++  +        +    L  +  I V +N  L++ 
Sbjct: 189 DNYPYDAKNE---KCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQF 245

Query: 269 YDGNPIRRNDWA-CNPHKLDHAVAIVGYG--EKNGILTWIVRNSWGDIGPDHGYFQIERG 325
           Y  + I    W  C+ + LDHAV +VGYG  EKN    WIV+NSWG    ++GYF++ RG
Sbjct: 246 YQ-HGISHPWWIFCSKYLLDHAVLLVGYGVSEKNEPF-WIVKNSWGVEWGENGYFRMYRG 303

Query: 326 ANACGIESYAYLASV 340
             +CGI + A  A +
Sbjct: 304 DGSCGINTVATSAMI 318


>gi|340370388|ref|XP_003383728.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 398

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 77/216 (35%), Positives = 113/216 (52%), Gaps = 14/216 (6%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P ++DWR+     + P++ QG+CGSCWAF+    LE Q  +    L  LS+ QLV+C   
Sbjct: 184 PDTVDWREKGA--VTPIKDQGQCGSCWAFSAIGSLEGQHFINTGNLVSLSEQQLVDCSLK 241

Query: 187 NLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG 244
           N  CNGG +  AF+Y++   G ES+ DYPY  K      C Y+  KA   V   T + SG
Sbjct: 242 NDGCNGGMLSTAFKYIESVAGEESETDYPYTAKNG---TCQYDPSKAVAKVTGYTALPSG 298

Query: 245 VDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
            +  ++  +   GPI V ++  H+  + Y          +C+   LDH V +VGYG ++ 
Sbjct: 299 DEDSLNDAVTSKGPISVCIDASHKSFQLYSEGVYYEK--SCSYFLLDHCVLVVGYGTEDT 356

Query: 301 ILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
              W+V+NSWG      GY ++ R   N CGI + A
Sbjct: 357 ADYWLVKNSWGTSWGMKGYIRMSRNRKNNCGIATNA 392



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 3/60 (5%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G +P S+DWR  K   + PV SQG+CG  W +     +ESQ  +   TL PLS  Q+++C
Sbjct: 109 GNVPNSIDWR--KKGAVTPVSSQGQCG-VWPWPIVGSVESQYFIKTGTLVPLSVQQILDC 165


>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 349

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 96/328 (29%), Positives = 150/328 (45%), Gaps = 28/328 (8%)

Query: 30  RDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGS------ 83
           RDL  D+       + ++ K  R Y DD E   R E F+ +    +     +        
Sbjct: 28  RDL-VDAAAMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLE 86

Query: 84  ----SDRSPQEI-LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVK 138
               +D +  E    RTGLR +     R         ++ N    G LP S+DWR     
Sbjct: 87  ENQFADLTNAEFRATRTGLRPSSSRGNRAPTSF----RYAN-VSTGDLPASVDWRGKGA- 140

Query: 139 VLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNID 196
            +NPV+ QG CG CWAF+  A +E  V L    L  LS+ QLV CD    +  C GG +D
Sbjct: 141 -VNPVKDQGDCGCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMD 199

Query: 197 VAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSG 255
            AF++ +K  GL +++DYPY   ++           A +   +    +    ++  + + 
Sbjct: 200 DAFDFIIKNGGLAAESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQ 259

Query: 256 PIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGD 312
           P+ V ++   R  + Y G  +  +  A    +LDHA+  VGYG   +G   W+++NSWG 
Sbjct: 260 PVSVAIDGGDRHFQFYKGGVL--SGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGT 317

Query: 313 IGPDHGYFQIERG-ANACGIESYAYLAS 339
              + GY ++ERG A+  G+   A +AS
Sbjct: 318 SWGEDGYVRMERGVADKEGVCGLAMMAS 345


>gi|357116897|ref|XP_003560213.1| PREDICTED: probable cysteine proteinase A494-like [Brachypodium
           distachyon]
          Length = 373

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 89/288 (30%), Positives = 133/288 (46%), Gaps = 39/288 (13%)

Query: 78  YGTSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSK 136
           +G +  SD +P+E   R TGL+  G       A R   ++         LP S DWR   
Sbjct: 98  HGVTPFSDLTPEEFQARLTGLQQQGTNNNMPAAARATAEELAT------LPASFDWRAKG 151

Query: 137 VKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG---------N 187
              +  V+ QG CGSCWAF+TT  +E    +    L  LS+ QLV+CDH          +
Sbjct: 152 A--VTEVKMQGMCGSCWAFSTTGAVEGAHFVATGKLLNLSEQQLVDCDHTCDAVAKNECD 209

Query: 188 LNCNGGNIDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD 246
             C+GG +  A+ Y ++  GL  QA YPY   +     C ++  K  V V         D
Sbjct: 210 SGCSGGLMTNAYTYLIRAGGLMEQAAYPYTGAQGT---CRFDANKVAVRVTSFTAVPPDD 266

Query: 247 H---MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT 303
                  L+++GP+ V LN   +++Y G         C    ++H V +VGYG + G+  
Sbjct: 267 EDQIRASLVRAGPLAVGLNAAFMQTYLGG--VSCPLLCPRKLINHGVLLVGYGAR-GLAP 323

Query: 304 --------WIVRNSWGDIGPDHGYFQIERGA---NACGIESYAYLASV 340
                   WI++NSWG    + GY+++ RGA   N CG++S     +V
Sbjct: 324 LRLGYRPYWIIKNSWGKEWGEGGYYRLCRGARNRNVCGVDSMVSAVAV 371


>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 97/291 (33%), Positives = 145/291 (49%), Gaps = 35/291 (12%)

Query: 63  RFEYFKQDGKETDEYYGTSGS--------SDRSPQEILQRTGLRLTGKEKERLEADRERV 114
           RF  FK + K   E      S         D + +E  +RT      K     + +R+  
Sbjct: 57  RFNVFKHNVKHIHETNKKENSYKLKLNKFGDMTSEE-FRRTYAGSNIKHHRMFQGERQTT 115

Query: 115 KKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYP 174
           K F+       LP S+DWR++    + PV++QG+CGSCWAF+T   +E    +  K L  
Sbjct: 116 KSFMYANVD-TLPTSVDWRKNGA--VTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTS 172

Query: 175 LSKSQLVECD-HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKA 232
           LS+ +LV+CD + N  CNGG +D+AFE++K+  GL S+  YPY+  +     C   KE A
Sbjct: 173 LSEQELVDCDTNKNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDET---CDTNKENA 229

Query: 233 KVFV----QDTWVTSGVDHMMHLLQSGPIGVYLNH--RLIESY-DGNPIRRNDWACNPHK 285
            V      +D    S VD +M  +   P+ V ++      + Y +G    R    C   +
Sbjct: 230 PVVSIDGHEDVPKNSEVD-LMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGR----CGT-E 283

Query: 286 LDHAVAIVGYGEK-NGILTWIVRNSWGDIGPDHGYFQIERGA----NACGI 331
           L+H VA+VGYG   +G   WIV+NSWG+   + GY +++RG       CGI
Sbjct: 284 LNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGI 334


>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
          Length = 374

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 95/315 (30%), Positives = 154/315 (48%), Gaps = 35/315 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGK 102
           ++ ++ +  R Y    E + RFE FK + +  +   G + S +R+ +  L +    LT +
Sbjct: 50  YEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIE---GHNNSGNRTYKVGLNQFA-DLTNE 105

Query: 103 EKERL------EADRERVK-----KFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
           E   +      +A R  VK     +    R    +P S+DWR  K   + P+++QG CGS
Sbjct: 106 EYRTMYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWR--KRGAVAPIKNQGSCGS 163

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDH-GNLNCNGGNIDVAFEY-VKQYGLES 209
           CWAF+T A +E    ++   +  LS+ +LV+CD   N  CNGG +D AFE+ +   G+++
Sbjct: 164 CWAFSTVAAVEGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGMDT 223

Query: 210 QADYPYRNKENITFRCTYEKEKAKVFVQDTW--VTSGVDHMMHLLQSGPIGVYL--NHRL 265
           +  YPYR  E    RC   ++  KV   D +  V      +   +   P+ V +  + R 
Sbjct: 224 EKHYPYRGVEG---RCDPVRKNYKVVSIDGYEDVPRNERALQKAVAHQPVCVAIEASGRA 280

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG 325
            + Y           C   ++DH V +VGYG ++G+  WIVRNSWG    ++GY ++ER 
Sbjct: 281 FQLYSSGVFTGE---CG-EEVDHGVVVVGYGSEDGVDYWIVRNSWGTKWGENGYVKMERN 336

Query: 326 A-----NACGIESYA 335
                   CGI + A
Sbjct: 337 VKKSHLGKCGIMTEA 351


>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
          Length = 321

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 92/318 (28%), Positives = 150/318 (47%), Gaps = 40/318 (12%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F+ ++ K+ + Y+   E   R   F ++     E+        +G +  SD S +E  +R
Sbjct: 7   FRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPXALHGVTPFSDLSEEE-FER 65

Query: 95  TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWA 154
               + G+   +       V +     +   LP+S DWR+     +  V+ QG CGSCWA
Sbjct: 66  MFTGVVGRPHMK-----GGVAETAAALEVDGLPESFDWREKGA--VTEVKMQGTCGSCWA 118

Query: 155 FATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCNGGNIDVAFEY-VKQ 204
           F+TT  +E    +  K L  LS+ QLV+CDH          +  C GG +  A++Y ++ 
Sbjct: 119 FSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYKYLIEA 178

Query: 205 YGLESQADYPYRNKENITFRCTYEKEKAKV-FVQDTWVTSGVDHM-MHLLQSGPIGVYLN 262
            GLE ++ YPY  K      C ++ ++  V  V  T V    + +  +L+  GP+ V LN
Sbjct: 179 GGLEEESSYPYTGKHG---ECKFKPDRVAVRVVNFTEVPIBENQIAANLVCHGPLAVGLN 235

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILT------WIVRNSWGDIGP 315
              +++Y G         C    ++H V +VGYG K   IL       WI++NSWG    
Sbjct: 236 AXFMQTYIGG--VSCPLICPKRWINHGVLLVGYGAKGYSILRFGYKPYWIIKNSWGXRWG 293

Query: 316 DHGYFQIERGANACGIES 333
           +HGY+++ RG   CG+ +
Sbjct: 294 EHGYYRLCRGHGMCGMNT 311


>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
          Length = 469

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 90/313 (28%), Positives = 147/313 (46%), Gaps = 36/313 (11%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQR-------- 94
           ++ +  +  R+Y   +E + R E F+ + +  D++   + +   S +  L R        
Sbjct: 47  YQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRFADLTNEE 106

Query: 95  -----TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRC 149
                 G+R  G  + R         +F   R    LP S+DWR     V   V+ QG C
Sbjct: 107 YRSTYLGVRTAGSRRRRNSTVGSNRYRF---RSSDDLPDSIDWRDKGAVV--DVKDQGSC 161

Query: 150 GSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-HGNLNCNGGNIDVAFEYV-KQYGL 207
           GSCWAF+T A +E    ++   L  LS+ +LV+CD + N  CNGG +D AFE++    G+
Sbjct: 162 GSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIISNGGI 221

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD---HMMHLLQSGPIGVYL--N 262
           ++  DYPY  ++     C   ++ A V   D++    ++    +   + + P+ V +   
Sbjct: 222 DTDEDYPYTGRDG---SCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEAG 278

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
            R  + Y+          C   +LDH V  +GYG +NG   WIV+NSWG    + GY ++
Sbjct: 279 GRAFQLYESGIFTG---YCGT-ELDHGVTAIGYGSENGKYYWIVKNSWGSDWGESGYIRM 334

Query: 323 ERGANA----CGI 331
           ER  N+    CGI
Sbjct: 335 ERNINSATGKCGI 347


>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 335

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 118/221 (53%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPK++DWR  K   + PV++QG+CGSCWAF+TT  LE       + L  LS+  LV+C  
Sbjct: 119 LPKTVDWR--KKGAVTPVKNQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSR 176

Query: 186 --GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
             GN  C GG +D AF+Y+K   G++++  YPY   + +   C + +  + V   DT   
Sbjct: 177 SFGNNGCEGGLMDNAFKYIKSNKGIDTEWSYPYNATDGV---CHFNR--SDVGATDTGFV 231

Query: 241 -VTSGVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            +  G ++ +   +   GP+ V ++  H   + Y        +  C+  +LDH V +VGY
Sbjct: 232 DIPEGDENKLKKAVAAVGPVSVAIDASHESFQFYSEGVYDEPE--CSSEQLDHGVLVVGY 289

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G K+G   W+V+NSWG    D GY  + R   N CGI S A
Sbjct: 290 GTKDGQDYWLVKNSWGTTWGDEGYIYMTRNKDNQCGIASSA 330


>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
          Length = 335

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 81/235 (34%), Positives = 119/235 (50%), Gaps = 22/235 (9%)

Query: 113 RVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTL 172
           R   F    +   LP ++DWR      +  V+ Q +CGSCWAF+ T  LE Q       L
Sbjct: 106 RGSAFFRLAEGTHLPTTVDWRDKGY--VTGVKDQKQCGSCWAFSATGSLEGQNFRKTGKL 163

Query: 173 YPLSKSQLVEC--DHGNLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEK 229
             LS+ QLV+C  D+GN+ CNGG +D AF+Y+++ G ++++  YPY  ++    +C ++ 
Sbjct: 164 VSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDTEKSYPYEAEDG---QCRFKP 220

Query: 230 E----KAKVFVQDTWVTSGVDHMMH--LLQSGPI--GVYLNHRLIESYDGNPIRRNDWAC 281
           E    K   +V    VT G +  +   +   GP+  G+  +H   + YD       D  C
Sbjct: 221 ENVGAKCTGYVD---VTVGDEDALKEAVATIGPVSVGIDASHSSFQLYDSGVYDEQD--C 275

Query: 282 NPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           +   LDH V  VGYG  NG   W+V+NSWG      GY  + R   N CGI + A
Sbjct: 276 SSQDLDHGVLAVGYGTDNGQDYWLVKNSWGLGWGQEGYIMMSRNKDNQCGIATAA 330


>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
          Length = 443

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 80/223 (35%), Positives = 116/223 (52%), Gaps = 21/223 (9%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-- 184
           P+S+DWR+     + PV+ QG+CGSCWAF+TT  LE Q       L  LS+  LV+C   
Sbjct: 223 PRSVDWREKGY--VTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRP 280

Query: 185 HGNLNCNGGNIDVAFEYVKQY-GLESQADYPY--RNKENITFRCTYEKEKAKVFVQDTWV 241
            GN  CNGG +D AF+YV+   G++S+  YPY  ++ E+  ++  Y       FV    +
Sbjct: 281 EGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVD---I 337

Query: 242 TSGVDH--MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG- 296
             G +   M  +   GP+ V ++  H   + Y        D  C+   LDH V +VGYG 
Sbjct: 338 PQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPD--CSSEDLDHGVLVVGYGF 395

Query: 297 ---EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
              + +G   WIV+NSWG+   D GY  + +   N CGI + A
Sbjct: 396 EGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAA 438


>gi|56758920|gb|AAW27600.1| SJCHGC00098 protein [Schistosoma japonicum]
 gi|226476138|emb|CAX72159.1| cathepsin L, a [Schistosoma japonicum]
          Length = 331

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 101/326 (30%), Positives = 157/326 (48%), Gaps = 41/326 (12%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYT-DDNEIKTRFEYFKQDGKETDEY------------ 77
           D  YD I     ++ + +K+N+TYT +D+E++ +  + ++ G E  E+            
Sbjct: 20  DKQYDEI-----WRQWKLKYNKTYTSNDDEMRRKMIFMRRIG-EIQEHNLRHDLGLEGYT 73

Query: 78  YGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKV 137
            G +   D   +E+ +    ++ G      +   E       E    P+P + DWR    
Sbjct: 74  MGLNQFCDMEWEEVNRIMFPKVFGNSPLWNDDGNEL------ELTNKPVPSTWDWRDHGA 127

Query: 138 KVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNI 195
             +  V+ QG CGSCWAF+ T  +E Q+    K L  LS+ QLV+C   +GN  C GG +
Sbjct: 128 --VTAVKHQGLCGSCWAFSATGAIEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCEGGYM 185

Query: 196 DVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLL 252
           D AF Y++ + +ES+ DY Y   +     C Y K K  V V+        D       + 
Sbjct: 186 DHAFNYLESHYIESENDYKYLGYDA---NCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVY 242

Query: 253 QSGPI--GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
           Q GPI  G+   + LI  Y       ND  C    ++HAV +VGYG+++G   W+++NSW
Sbjct: 243 QYGPISVGIVALNSLI-MYKSGVFESND--CKYADINHAVLVVGYGKEHGKDYWLIKNSW 299

Query: 311 GDIGPDHGYFQIERGA-NACGIESYA 335
           GD+    GYF++ R   N CG+ S A
Sbjct: 300 GDLWGSKGYFKLRRNKHNMCGVASNA 325


>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
          Length = 338

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 88/303 (29%), Positives = 134/303 (44%), Gaps = 25/303 (8%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGL----- 97
           F  +  K+ + Y   NE   RF  FK +    D  Y T+  +      + + T L     
Sbjct: 27  FNNFKTKYGKVYNGINEDAVRFGIFKAN---VDIIYATNARNLTFALGVNEFTDLTQEEF 83

Query: 98  --RLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
               TG +   L +   R+    +E    PL  S+DW    V  + PV++QG+CGSCW+F
Sbjct: 84  AASYTGLKPASLWSGLPRLST--HEYNGAPLASSVDWTTQGV--VTPVKNQGQCGSCWSF 139

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPY 215
           +TT  LE   AL    L  LS+ Q  +CD  +  CNGG +D AF + K+  + ++  YPY
Sbjct: 140 STTGALEGAWALSTGNLVSLSEQQFEDCDTTDSGCNGGWMDNAFSFAKKNSICTEGSYPY 199

Query: 216 RNKENIT--FRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYL--NHRLIESYDG 271
              +       C     +  V       T     MM  +   P+ + +  +    + Y  
Sbjct: 200 TATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLYSS 259

Query: 272 NPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIER---GANA 328
             +     +C   +LDH V  VGYG + G   W V+NSWG    + GY +++R   GA  
Sbjct: 260 GVLTA---SCG-TRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRGKGGAGE 315

Query: 329 CGI 331
           CG+
Sbjct: 316 CGL 318


>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 81/246 (32%), Positives = 122/246 (49%), Gaps = 20/246 (8%)

Query: 106 RLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQV 165
           ++     R   +L     G LP ++DWR      + P+++QG+CGSCW+F+ T  LE Q 
Sbjct: 94  KMRNGTSRGSLYLPPSNIGDLPDTVDWRPKGY--VTPIKNQGQCGSCWSFSATGSLEGQT 151

Query: 166 ALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENIT 222
                 L  LS+  LV+C    GN  C GG +D AF+Y+K   G+++++ YPY  K    
Sbjct: 152 FKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNGIDTESSYPYEAKNG-- 209

Query: 223 FRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS-----GPIGVYLN--HRLIESYDGNPIR 275
            +C +    A V   D+  T         LQS     GPI V ++  H   + Y      
Sbjct: 210 -KCRFNA--ANVGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKSGVY- 265

Query: 276 RNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESY 334
            +++ C+  +LDH V  VGYG ++G   W+V+NSWG+     GY  + R   N CGI + 
Sbjct: 266 -HEFFCSETRLDHGVLAVGYGTESGKDYWLVKNSWGESWGQKGYIMMSRNKRNNCGIATS 324

Query: 335 AYLASV 340
           A   +V
Sbjct: 325 ASYPTV 330


>gi|56752859|gb|AAW24641.1| unknown [Schistosoma japonicum]
          Length = 331

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 104/334 (31%), Positives = 154/334 (46%), Gaps = 57/334 (17%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYT-DDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQ 89
           D  YD I     ++ + +K+N+TYT +D+E++ +  + ++ GK                Q
Sbjct: 20  DKQYDEI-----WRQWRLKYNKTYTSNDDEMRRKMIFMRRIGK---------------IQ 59

Query: 90  EILQRTGLRLTGK--------EKERLEADRERVKKFLN------------ERKKGPLPKS 129
           E   R  L L G         + E  E +R    K               E    P+P +
Sbjct: 60  EHNLRHDLGLEGYTMGLNQFCDMEWEEVNRIMFPKVFGNSPLWNDDGNELELTNKPVPST 119

Query: 130 LDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGN 187
            DWR      +  V++QG CGSCWAF+ T  +E Q+    K L  LS+ QLV+C   +GN
Sbjct: 120 WDWRDHGA--VTAVKNQGMCGSCWAFSATGAIEGQLRRKHKKLISLSEQQLVDCSTPYGN 177

Query: 188 LNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
             C GG +D AF Y++ + +ES+ DY Y   +     C Y K K  V V+        D 
Sbjct: 178 YGCEGGYMDHAFNYLESHYIESENDYKYLGYDA---NCHYRKSKGVVKVKKFVDLPSKDE 234

Query: 248 ---MMHLLQSGPI--GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGIL 302
                 + Q GPI  G+     LI  Y       ND  C    ++H V +VGYG+++G  
Sbjct: 235 KTLQKAVYQYGPISVGIVAVDSLI-MYKSGVFESND--CKYADINHGVLVVGYGKEHGKD 291

Query: 303 TWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            W+++NSWGD+    GYF++ R   N CG+ S A
Sbjct: 292 YWLIKNSWGDLWGSKGYFKLRRNKHNMCGVASNA 325


>gi|405966499|gb|EKC31777.1| Cathepsin L [Crassostrea gigas]
          Length = 331

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 82/242 (33%), Positives = 119/242 (49%), Gaps = 22/242 (9%)

Query: 106 RLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQV 165
           ++ A+R +   +++    G LP S+DWR  K   +  +++QG CGSCW+F+ T  LE Q 
Sbjct: 95  KMSANRTKGDLYMSPSNIGDLPDSVDWR--KEGYVTDIKNQGHCGSCWSFSATGSLEGQH 152

Query: 166 ALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENIT 222
               K L  LS+  LV+C    GN  C GG +D AF Y++   G++++  YPY  K    
Sbjct: 153 FKASKKLVSLSEQNLVDCSKKEGNHGCQGGLMDNAFRYIESNKGIDTEESYPYTAKNGF- 211

Query: 223 FRCTYEKEKAKVFVQDTWVTSGVDHMMH------LLQSGPI--GVYLNHRLIESYDGNPI 274
             C ++ E   V   DT     + HM        +   GPI  G+   H+  + Y     
Sbjct: 212 --CHFKAEN--VGATDTGYVD-IPHMQEDKLQEAVATVGPISVGIDAGHKSFQLYREGVY 266

Query: 275 RRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIES 333
                AC+  KLDH V  VGYG ++G   W+V+NSWG      GY  + R   N CGI +
Sbjct: 267 SEP--ACSSSKLDHGVLAVGYGTESGDDYWLVKNSWGTSWGMQGYVMMARNKHNMCGIAT 324

Query: 334 YA 335
            A
Sbjct: 325 QA 326


>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
          Length = 375

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 94/332 (28%), Positives = 154/332 (46%), Gaps = 41/332 (12%)

Query: 30  RDLAYDSIKQVDA-FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGT 80
           R    D +   +  F+ ++ K+ + Y+   E   R   F ++     E+        +G 
Sbjct: 47  RKFGVDGVLGTEKEFRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPTALHGV 106

Query: 81  SGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
           +  SD S +E  +R    + G+   +       V +     +   LP+S DWR+     +
Sbjct: 107 TPFSDLSEEE-FERMFTGVVGRPHMK-----GGVAETAAALEVDGLPESFDWREKGA--V 158

Query: 141 NPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---------GNLNCN 191
             V+ QG CGSCWAF+TT  +E    +  K L  LS+ QLV+CDH          +  C 
Sbjct: 159 TEVKMQGTCGSCWAFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCE 218

Query: 192 GGNIDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKV-FVQDTWVTSGVDHM- 248
           GG +  A++Y ++  GLE ++ YPY  K      C ++ ++  V  V  T V    + + 
Sbjct: 219 GGLMTNAYKYLIEAGGLEEESSYPYTGKHG---ECKFKPDRVAVRVVNFTEVPINENQIA 275

Query: 249 MHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILT---- 303
            +L+  GP+ V LN   +++Y G         C    ++H V +VGYG K   IL     
Sbjct: 276 ANLVCHGPLAVGLNAIFMQTYIGG--VSCPLICPKRWINHGVLLVGYGAKGYSILRFGYK 333

Query: 304 --WIVRNSWGDIGPDHGYFQIERGANACGIES 333
             WI++NSWG    +HGY+++ RG   CG+ +
Sbjct: 334 PYWIIKNSWGKRWGEHGYYRLCRGHGMCGMNT 365


>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
          Length = 353

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 80/223 (35%), Positives = 116/223 (52%), Gaps = 21/223 (9%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-- 184
           P+S+DWR+     + PV+ QG+CGSCWAF+TT  LE Q       L  LS+  LV+C   
Sbjct: 133 PRSVDWREKGY--VTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRP 190

Query: 185 HGNLNCNGGNIDVAFEYVKQY-GLESQADYPY--RNKENITFRCTYEKEKAKVFVQDTWV 241
            GN  CNGG +D AF+YV+   G++S+  YPY  ++ E+  ++  Y       FV    +
Sbjct: 191 EGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVD---I 247

Query: 242 TSGVDH--MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG- 296
             G +   M  +   GP+ V ++  H   + Y        D  C+   LDH V +VGYG 
Sbjct: 248 PQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPD--CSSEDLDHGVLVVGYGF 305

Query: 297 ---EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
              + +G   WIV+NSWG+   D GY  + +   N CGI + A
Sbjct: 306 EGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAA 348


>gi|56754277|gb|AAW25326.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 102/333 (30%), Positives = 153/333 (45%), Gaps = 55/333 (16%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYT-DDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQ 89
           D  YD I     ++ + +K+N+TYT +D+E++ +  + ++ GK                Q
Sbjct: 31  DKQYDEI-----WRQWKLKYNKTYTSNDDEMRRKMIFMRRIGK---------------IQ 70

Query: 90  EILQRTGLRLTGK--------EKERLEADRERVKKFLN------------ERKKGPLPKS 129
           E   R  L L G         + E  E +R    K               E    P+P +
Sbjct: 71  EHNLRHDLGLEGYTMGLNQFCDMEWEEVNRIMFPKVFGNSPLWNDDGNELELTNKPVPST 130

Query: 130 LDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGN 187
            DWR      +  V++QG CGSCWAF+ T  +E Q+    K L  LS+ QLV+C   +GN
Sbjct: 131 WDWRDHGA--VTAVKNQGMCGSCWAFSATGAIEGQLRRKHKKLISLSEQQLVDCSTPYGN 188

Query: 188 LNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
             C GG +D AF Y++ + +ES+ DY Y   +     C Y K K  V V+        D 
Sbjct: 189 YGCGGGFMDHAFNYLESHYIESENDYKYLGYDA---NCHYRKSKGVVKVKKFVDLPSKDE 245

Query: 248 ---MMHLLQSGPIGV-YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT 303
                 + Q GPI V  +    +  Y       ND  C    ++H V +VGYG+++G   
Sbjct: 246 KTLQKAVYQYGPISVGIVALDSLTMYKSGVFESND--CKYADINHGVLVVGYGKEHGKDY 303

Query: 304 WIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           W+++NSWGD+    GYF++ R   N CG+ S A
Sbjct: 304 WLIKNSWGDLWGSKGYFKLRRNKHNMCGVASNA 336


>gi|301609082|ref|XP_002934106.1| PREDICTED: cathepsin S-like [Xenopus (Silurana) tropicalis]
          Length = 333

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 79/223 (35%), Positives = 117/223 (52%), Gaps = 18/223 (8%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH- 185
           P S+DWR      +  V++QG CGSC+AF T   LE Q      TL   S  +LV+C + 
Sbjct: 120 PASIDWRTQGC--VTSVKNQGSCGSCYAFGTVGALECQWKKKMGTLVSFSPQELVDCSYT 177

Query: 186 -GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG 244
            GN  C GG +  +F Y+K+YG+  ++ YPY  KE    RC  +K      V+  +V   
Sbjct: 178 EGNNGCKGGYLQASFRYMKKYGIMEESSYPYTAKEG---RCKKDKPSNVGVVKTFYVVPA 234

Query: 245 VDHM--MHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPH---KLDHAVAIVGYGEK 298
              +  M +L + GP+ V ++     S +G  + ++    +P+   K+DHAV +VGYG  
Sbjct: 235 GKELLLMKVLGTVGPVSVAIDC----SREGFRMYKSGVYYDPYCTTKVDHAVLVVGYGTD 290

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAYLASV 340
           NG   W+V+NSWG    D GY ++ R   N C I S+A   +V
Sbjct: 291 NGKDYWLVKNSWGVGYGDKGYIKMARNRGNNCAIASHAVYPTV 333


>gi|403300987|ref|XP_003941193.1| PREDICTED: cathepsin L2 [Saimiri boliviensis boliviensis]
          Length = 333

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 90/252 (35%), Positives = 127/252 (50%), Gaps = 33/252 (13%)

Query: 108 EADRERVKKFLNERKK------GPL----PKSLDWRQSKVKVLNPVESQGRCGSCWAFAT 157
           E  R+ +  F N++ K      GPL    PKS+DWR  K   + PV++Q +CGSCWAF+ 
Sbjct: 86  EEFRQVMVCFRNQKHKNGKVFRGPLLLDLPKSVDWR--KKGYVTPVKNQKQCGSCWAFSA 143

Query: 158 TAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYVKQY-GLESQADYP 214
           T  LE Q+      L  LS+  LV+C    GN  CNGG ++ AF YVK+  GL+S+A YP
Sbjct: 144 TGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMNYAFRYVKENGGLDSEASYP 203

Query: 215 YRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS----GPIGVYLN--HRLIES 268
           Y  K+ I   C Y+ E +     DT       H   L+++    GPI V ++  H   + 
Sbjct: 204 YEAKDGI---CKYKPENS--VANDTGFVVIPTHEKELMKAVATVGPISVAVDASHSSFQF 258

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT----WIVRNSWGDIGPDHGYFQIER 324
           Y           C+   LDH V +VGYG +         W+++NSWG     +GY +I +
Sbjct: 259 YKSGIYFEKK--CSSKNLDHGVLVVGYGFEGANSKDNKYWLIKNSWGPEWGLNGYIKIAK 316

Query: 325 GA-NACGIESYA 335
              N CGI + A
Sbjct: 317 DQNNHCGIATAA 328


>gi|348504496|ref|XP_003439797.1| PREDICTED: digestive cysteine proteinase 2-like [Oreochromis
           niloticus]
          Length = 352

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 83/217 (38%), Positives = 118/217 (54%), Gaps = 18/217 (8%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKT--LYPLSKSQLVECD-- 184
           ++D+R   +  +  V+ QG CGSCWAF+TT  +E+Q  L KKT  L  LS+  LV+C   
Sbjct: 139 AVDYR--SMGFVTEVKDQGFCGSCWAFSTTGAIEAQ--LYKKTGQLISLSEQNLVDCSKS 194

Query: 185 HGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTS 243
            G   C+G  +  A++YV   GLES   YPY + +  T  C Y+   A   ++D  ++  
Sbjct: 195 FGTYGCSGAWMANAYDYVVSNGLESSNTYPYTSVD--TQPCFYDSSLAVAHIRDYRFIPR 252

Query: 244 GVDHMMH--LLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN 299
           G +  M   L   GPI V ++  H     Y        +  CNP+ L+HAV +VGYG + 
Sbjct: 253 GDEQAMADALATIGPITVTIDADHASFLFYSSGIYDEPN--CNPNNLNHAVLLVGYGSQE 310

Query: 300 GILTWIVRNSWGDIGPDHGYFQIER-GANACGIESYA 335
           G   WI++NSWG    + GY +I R G NACG+ SYA
Sbjct: 311 GQDYWIIKNSWGTGWGEGGYMRIVRNGQNACGLASYA 347


>gi|109112413|ref|XP_001106814.1| PREDICTED: cathepsin L2 isoform 3 [Macaca mulatta]
 gi|297271422|ref|XP_002800251.1| PREDICTED: cathepsin L2 [Macaca mulatta]
          Length = 334

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 118/230 (51%), Gaps = 24/230 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPKS+DWR  K   + PV++Q +CGSCWAF+ T  LE Q+      L  LS+  LV+C H
Sbjct: 114 LPKSVDWR--KKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSH 171

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
             GN  CNGG ++ AF YVK+  GL+S+  YPY   + I   C Y  E +     DT   
Sbjct: 172 PQGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGI---CKYRSENS--VANDTGFK 226

Query: 241 -VTSGVDH--MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            V +G +   M  +   GPI V ++  H   + Y        D  C+   LDH V +VGY
Sbjct: 227 VVPAGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPD--CSSKNLDHGVLVVGY 284

Query: 296 G----EKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAYLASV 340
           G      +    W+V+NSWG     +GY +I +   N CGI + A   +V
Sbjct: 285 GFEGANSDNNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334


>gi|56756955|gb|AAW26649.1| unknown [Schistosoma japonicum]
          Length = 331

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 99/325 (30%), Positives = 155/325 (47%), Gaps = 39/325 (12%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYT-DDNEIKTRFEYFKQDGKETDE-----------YY 78
           D  YD I     ++ + +K+N+TYT +D+E++ +  + ++ GK  +              
Sbjct: 20  DKQYDEI-----WRQWKLKYNKTYTSNDDEMRRKMIFMRRIGKIQEHNLRHDLGLEGYTM 74

Query: 79  GTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVK 138
           G +   D   +E+ +    ++ G      +  +E       E    P+P   DWR     
Sbjct: 75  GLNQFCDMEWEEVNRIMFPKVFGNSPLWNDDGKEL------ELTNKPVPSKWDWRDHGA- 127

Query: 139 VLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNID 196
            +  V++QG CGSCWAF+ T  +E Q+    K L  LS+ QLV+C   +GN  C GG +D
Sbjct: 128 -VTAVKNQGMCGSCWAFSATGAIEGQLRRKHKKLISLSEQQLVDCSTPYGNYGCGGGFMD 186

Query: 197 VAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQ 253
            AF Y++ + +ES+ DY Y   +     C Y K K  V V+        D       + Q
Sbjct: 187 HAFNYLESHYIESENDYKYLGYDA---NCHYRKSKGVVKVKKFVDLPSKDEKTLQKAVYQ 243

Query: 254 SGPI--GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG 311
            GPI  G+     LI  Y       ND  C    ++H V +VGYG+++G   W+++NSWG
Sbjct: 244 YGPISVGIVALDSLI-MYKSGVFESND--CKYGDINHGVLVVGYGKEHGKDYWLIKNSWG 300

Query: 312 DIGPDHGYFQIERGA-NACGIESYA 335
           D+    GYF++ R   N CG+ S A
Sbjct: 301 DLWGSKGYFKLRRNKHNMCGVASNA 325


>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
          Length = 337

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 82/226 (36%), Positives = 118/226 (52%), Gaps = 25/226 (11%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           +P SLDWR+     + PV+ QG CGSCWAF+TT  +E Q+      L  LS+  LV+C  
Sbjct: 116 VPNSLDWREKGY--VTPVKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSR 173

Query: 185 -HGNLNCNGGNIDVAFEYVK-QYGLESQADYPYRNKENITFRCTYEKEKAKV----FVQD 238
             GN  CNGG +D AF+Y+K Q GL+S+  YPY   ++    C Y+ + +      FV  
Sbjct: 174 PEGNEGCNGGLMDQAFQYIKDQNGLDSEESYPYVGTDDQP--CHYDPKYSAANDTGFVD- 230

Query: 239 TWVTSGVDH--MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVG 294
             + SG +H  M  +   GP+ V ++  H   + Y        +  C+  +LDH V  VG
Sbjct: 231 --IPSGKEHALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKE--CSSEELDHGVLAVG 286

Query: 295 YG----EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
           YG    + +G   WIV+NSW +   D GY  + +   N CGI + A
Sbjct: 287 YGFEGEDVDGKKYWIVKNSWSENWGDKGYVYMAKDRHNHCGIATAA 332


>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 356

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 97/314 (30%), Positives = 142/314 (45%), Gaps = 28/314 (8%)

Query: 46  YIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---------YGTSGSSDRSPQEILQRTG 96
           ++ K+ R YTD  E   R E F  + +  D            G +  SD +  E +Q T 
Sbjct: 42  WMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFVQ-TH 100

Query: 97  LRLTGKEKERLEADRERVKKFLN-ERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAF 155
           L   G ++  L  + E V K       +  +P+S+DWR      +  V++QG CG CWAF
Sbjct: 101 LGYRGHQQGGLRPEEENVSKVAALGYGQADMPESVDWRAQGA--VTGVKNQGSCGCCWAF 158

Query: 156 ATTAILESQVALLKKTLYPLSKSQLVEC-----DHGNLN-CNGGNIDVAFEYV-KQYGLE 208
           A  A  E  V +    L  +S+ Q+++C       GN N C+GG+ID A  YV    GL+
Sbjct: 159 AAVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYVAASRGLQ 218

Query: 209 SQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVD--HMMHLLQSGPIGVYLNHR-L 265
            +A Y Y   +    +  +    A  F +   VT   D   +  L+   PI V +     
Sbjct: 219 PEAAYAYTGLQGAC-QSGFTPNSAASFGEPQTVTLQGDEGRLQGLVAGQPIAVSVEASDD 277

Query: 266 IESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT-WIVRNSWGDIGPDHGYFQIER 324
              Y          +C   +L+HAV +VGYG  +G    W+V+N WG    + GY +I R
Sbjct: 278 FRHYMSGVFTAGTSSCG-QRLNHAVTVVGYGSADGGQEYWLVKNQWGTSWGEGGYMRIAR 336

Query: 325 GANA--CGIESYAY 336
           G  A  CGI +YAY
Sbjct: 337 GNGAPNCGISAYAY 350


>gi|297816790|ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 368

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 95/329 (28%), Positives = 151/329 (45%), Gaps = 60/329 (18%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           F+ ++  + + Y+   E   R   F ++  +  E+        +G +  SD + +E  + 
Sbjct: 51  FRVFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDPTAVHGVTQFSDLTEEEFKRM 110

Query: 95  -TGL--------RLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVES 145
            TG+           G E   +E D               LP+  DWR+     +  V++
Sbjct: 111 YTGVADVGGSRGHAVGAEAPMVEVD--------------GLPEDFDWREKGG--VTEVKN 154

Query: 146 QGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLN----------CNGGNI 195
           QG CGSCWAF+TT   E    +    L  LS+ QLV+CD    +          C GG +
Sbjct: 155 QGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQAVCDPKDKKACDNGCGGGLM 214

Query: 196 DVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHL 251
             A+EY+ +  GLE +  YPY  K      C ++ EK  V V + + T  +D      +L
Sbjct: 215 TNAYEYLMEAGGLEEERSYPYTGKRG---HCKFDPEKVAVRVVN-FTTIPLDEDQIAANL 270

Query: 252 LQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILT------W 304
           ++ GP+ V LN   +++Y G         C+  K++H V +VGYG K   IL       W
Sbjct: 271 VRQGPLAVGLNAVFMQTYIGGV--SCPLICSKRKVNHGVLLVGYGSKGFSILRLSNKPYW 328

Query: 305 IVRNSWGDIGPDHGYFQIERGANACGIES 333
           I++NSWG    ++GY+++ RG + CGI S
Sbjct: 329 IIKNSWGKKWGENGYYKLCRGHDICGINS 357


>gi|56756677|gb|AAW26511.1| unknown [Schistosoma japonicum]
          Length = 331

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 101/333 (30%), Positives = 155/333 (46%), Gaps = 55/333 (16%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYT-DDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQ 89
           D  YD I     ++ + +K+N+TYT +D+E++ +  + ++ GK                Q
Sbjct: 20  DKQYDEI-----WRQWKLKYNKTYTSNDDEMRRKMIFMRRIGK---------------IQ 59

Query: 90  EILQRTGLRLTGKE---KERLEADRERVKKFLNERKKG-----------------PLPKS 129
           E   R  L L G      +  + + E VK+ +  +  G                 P+P  
Sbjct: 60  EHNLRHDLGLEGYTMGLNQFCDMEWEEVKRIMFPKVFGNSPLWNDDGNELELTNKPVPSK 119

Query: 130 LDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGN 187
            DWR      +  V++QG CGSCWAF+ T  +E Q+    K L  LS+ QLV+C   +GN
Sbjct: 120 WDWRDHGA--VTAVKNQGLCGSCWAFSATGAIEGQLRRKHKKLISLSEQQLVDCSTPYGN 177

Query: 188 LNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
             C GG +D AF Y++ + +ES+ DY Y   +     C Y K K  V V+        D 
Sbjct: 178 YGCGGGFMDHAFNYLESHYIESENDYKYLGYDA---NCHYRKSKGVVKVKKFVDLPSKDE 234

Query: 248 ---MMHLLQSGPIGV-YLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT 303
                 + Q GPI V  +    +  Y       ND  C    ++H V +VGYG+++G   
Sbjct: 235 KTLQKAVYQYGPISVGIVALDSLTMYKSGVFESND--CKYGDINHGVLVVGYGKEHGKDY 292

Query: 304 WIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           W+++NSWGD+    GYF++ R   N CG+ S A
Sbjct: 293 WLIKNSWGDLWGSKGYFKLRRNKHNMCGVASNA 325


>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
 gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
          Length = 327

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 76/217 (35%), Positives = 113/217 (52%), Gaps = 17/217 (7%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH--G 186
           ++DWRQ     + P++ QG CGSCWAF+TT  LE Q  +    L  LS+  L++C    G
Sbjct: 113 TVDWRQKGA--VTPIKDQGHCGSCWAFSTTGSLEGQHFMKTGKLVSLSEQNLLDCSRRFG 170

Query: 187 NLNCNGGNIDVAFEYVKQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
           N  C GG +D AF Y+K  G ++++  YPY  K+     C Y+   +   +        +
Sbjct: 171 NKGCEGGLMDQAFRYIKSNGGIDTEECYPYMAKDEKV--CDYKTSCSGATLSSYTDIKAM 228

Query: 246 DHMMHLLQS----GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN 299
           D M  L+Q+    GP+ V ++  H+ +  Y        +  C+  KLDH V  VGYG  +
Sbjct: 229 DEMA-LMQAVGTVGPVSVAIDASHKSLRFYKSGIYDEPE--CSRTKLDHGVLAVGYGSMD 285

Query: 300 GILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           G+  W+V+NSWG    D GY ++ R   N CGI + A
Sbjct: 286 GMDYWLVKNSWGSAWGDMGYVKMTRNKNNQCGIATKA 322


>gi|67469932|ref|XP_650937.1| cysteine proteinase [Entamoeba histolytica HM-1:IMSS]
 gi|1929343|emb|CAA62835.1| cysteine proteinase [Entamoeba histolytica]
 gi|56467606|gb|EAL45551.1| cysteine proteinase, putative [Entamoeba histolytica HM-1:IMSS]
 gi|449710372|gb|EMD49461.1| cysteine proteinase, putative [Entamoeba histolytica KU27]
          Length = 318

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 82/223 (36%), Positives = 120/223 (53%), Gaps = 15/223 (6%)

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYP-----LSK 177
           +G +P+S+DWR +K KV   +  Q  CGSC++FA+ A +E ++ +     +      LS+
Sbjct: 92  RGDVPESVDWR-AKGKV-PAIRDQASCGSCYSFASVAAIEGRLLVAGSKKFTVDDLDLSE 149

Query: 178 SQLVECDH--GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVF 235
            QLV+C    GN  CNGG++ ++F YVK  G+  + DYPY   E     CTY+K+K  V 
Sbjct: 150 QQLVDCSVSVGNKGCNGGSLLLSFRYVKLNGIMQEKDYPYVAAEET---CTYDKKKVAVK 206

Query: 236 VQ-DTWVTSGVDH-MMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIV 293
           +     V  G +  +M     GP+   ++   ++         N   C+  +L+H VA+V
Sbjct: 207 ITGQKLVRPGSEKALMRAAAEGPVAAAIDASGVKFQLYKSGIYNSKECSSTQLNHGVAVV 266

Query: 294 GYGEKNGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
           GYG +NG   WIVRNSWG I  D GY  + R   N CGI S A
Sbjct: 267 GYGTQNGTEYWIVRNSWGTIWGDQGYVLMSRNKNNQCGIASGA 309


>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 82/221 (37%), Positives = 115/221 (52%), Gaps = 20/221 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LPK +DWR  K   + PV+ QG+CGSCWAF+ T  LE +  L    L  LS+  LV+C  
Sbjct: 116 LPKVVDWR--KKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQ 173

Query: 186 --GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-- 240
             GN  C GG ++ AF+Y+K+  G++++  YPY   E +   C ++KE   V   DT   
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKENDGIDTEKSYPY---EAVDGECRFKKED--VGATDTGYV 228

Query: 241 -VTSGV--DHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            + +G   D    +   GPI V ++  H   + Y        +  C+   LDH V +VGY
Sbjct: 229 EIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPE--CSSEDLDHGVLVVGY 286

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIER-GANACGIESYA 335
           G K G   W+V+NSW +   D GY  + R   N CGI S A
Sbjct: 287 GVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQA 327


>gi|356576257|ref|XP_003556249.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
           [Glycine max]
          Length = 374

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 96/330 (29%), Positives = 153/330 (46%), Gaps = 54/330 (16%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY--------YGTSGSSDRSPQEILQR 94
           FK ++  + R+Y+   E   R   F Q+     E+        +G +  SD +  E  + 
Sbjct: 54  FKVFMENYGRSYSTREEYLRRLGIFSQNMLRAAEHQALDPTAVHGVTQFSDLTEVEFEKL 113

Query: 95  -TGLRLT---GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
            TG   T   G     LE +               LP++ DWR+     +  V+ QGRCG
Sbjct: 114 YTGXPSTNTAGGVAPPLEVEG--------------LPENFDWREKGA--VTEVKIQGRCG 157

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG---------NLNCNGGNIDVAFEY 201
           SCWAF+TT  +E    L    L  LS+ QL++CD+          +  CNGG +  A+ Y
Sbjct: 158 SCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCEITEKTSCDNGCNGGLMTNAYNY 217

Query: 202 VKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH---MMHLLQSGPI 257
           + +  GLE ++ YPY  +      C ++ EK  V + + +    VD      +L+++GP+
Sbjct: 218 LLESGGLEEESSYPYTGERG---ECKFDPEKITVRITN-FTNIPVDENQIAAYLVKNGPL 273

Query: 258 GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILT------WIVRNSW 310
            + +N   +++Y G         C+  +L+H V +VGYG K   IL       WI++NSW
Sbjct: 274 AMGVNAIFMQTYIGG--VSCPLICSKKRLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSW 331

Query: 311 GDIGPDHGYFQIERGANACGIESYAYLASV 340
           G    + GY+++ RG   CGI +    A V
Sbjct: 332 GKKWGEDGYYKLCRGHGMCGINTMVSAAMV 361


>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 360

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 85/316 (26%), Positives = 143/316 (45%), Gaps = 22/316 (6%)

Query: 44  KTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKE 103
           ++++ +  RTY D  E   R E F+ + +  D +   + ++     +  +    R     
Sbjct: 44  ESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATNRFADLT 103

Query: 104 KERLEADRERVKK-------------FLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCG 150
            E   A R  +++             + N   +     S+DWR   +  +  V+ QG CG
Sbjct: 104 DEEFRAARTGLRRPAAVAGAVGGGFRYENFSLQADAAGSMDWR--AMGAVTGVKDQGSCG 161

Query: 151 SCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNCNGGNIDVAFEYV-KQYGL 207
            CWAF+  A +E    +    L  LS+ QLV+CD    +  C GG +D AF+Y+ +Q GL
Sbjct: 162 CCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYISRQGGL 221

Query: 208 ESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHR--L 265
            S++ YPY  ++  + R    +  A +   +    +    +M  +   P+ V +N    +
Sbjct: 222 ASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVSVAINGGDYV 281

Query: 266 IESYD-GNPIRRNDWACNPHKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIE 323
              YD G      +  C   +LDHA+  VGYG   +G   W+++NSWG    + GY +I 
Sbjct: 282 FRFYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTGYWLMKNSWGSGWGESGYVRIR 341

Query: 324 RGANACGIESYAYLAS 339
           RG+   G+   A LAS
Sbjct: 342 RGSRGEGVCGLAKLAS 357


>gi|226476108|emb|CAX72144.1| cathepsin L, a [Schistosoma japonicum]
          Length = 331

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 104/334 (31%), Positives = 154/334 (46%), Gaps = 57/334 (17%)

Query: 31  DLAYDSIKQVDAFKTYIVKWNRTYT-DDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQ 89
           D  YD I     ++ + +K+N+TYT +D+E++ +  + ++ GK                Q
Sbjct: 20  DKQYDEI-----WRQWRLKYNKTYTSNDDEMRRKMIFMRRIGK---------------IQ 59

Query: 90  EILQRTGLRLTGK--------EKERLEADRERVKKFLN------------ERKKGPLPKS 129
           E   R  L L G         + E  E +R    K               E    P+P +
Sbjct: 60  EHNLRHDLGLEGYTMGLNQFCDMEWEEVNRIMFPKVFGNSPLWNDDGNELELTNKPVPST 119

Query: 130 LDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGN 187
            DWR      +  V++QG CGSCWAF+ T  +E Q+    K L  LS+ QLV+C   +GN
Sbjct: 120 WDWRDHGA--VTAVKNQGMCGSCWAFSATGAIEGQLRRKHKKLISLSEQQLVDCSTPYGN 177

Query: 188 LNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDH 247
             C GG +D AF Y++ + +ES+ DY Y   +     C Y K K  V V+        D 
Sbjct: 178 YGCEGGYMDHAFNYLESHYIESENDYKYLGYDA---NCHYRKSKGVVKVKKFVDLPSKDE 234

Query: 248 ---MMHLLQSGPI--GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGIL 302
                 + Q GPI  G+     LI  Y       ND  C    ++H V +VGYG+++G  
Sbjct: 235 KTLQKAVYQYGPISVGIVALDSLI-MYKSGVFESND--CKYAGINHGVLVVGYGKEHGKD 291

Query: 303 TWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
            W+++NSWGD+    GYF++ R   N CG+ S A
Sbjct: 292 YWLIKNSWGDLWGSKGYFKLRRNKHNMCGVASNA 325


>gi|123470506|ref|XP_001318458.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121901218|gb|EAY06235.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 317

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 88/258 (34%), Positives = 129/258 (50%), Gaps = 27/258 (10%)

Query: 94  RTGL----RLTGKEKERLEADRERVKKFLNERKKGPL-----PKSLDWRQSKVKVLNPVE 144
           R GL     LT  E + L   ++  +K   E +  PL     P S DWR+  V  +N ++
Sbjct: 61  RCGLNQFAHLTPSEYQELLGYKQMKQK--EEVEFAPLKNFNAPDSFDWREKGV--VNAIK 116

Query: 145 SQGRCGSCWAFATTAILESQVALLKK-TLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVK 203
            QG+CGSCWAF++    ESQ A+     LY L++ QLV+C H    CNGGN+  A+ +VK
Sbjct: 117 DQGQCGSCWAFSSIQAQESQWAIHHPGELYDLAEQQLVDCVHDCFGCNGGNVGWAYTWVK 176

Query: 204 --QYGL-ESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMM--HLLQSGP-- 256
             ++G+   Q DYPY  K+    +C ++K K    +      S  +  +   + ++GP  
Sbjct: 177 LFEHGMFMLQKDYPYTAKDG---KCAFDKSKGITKITTHKKASHDEEALKTSVAENGPHA 233

Query: 257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPD 316
           I +   H     Y+       D +C+   LDHAV +VGYG       W+VRNSW     +
Sbjct: 234 IAIDAGHDSFMMYESGVYE--DASCSSSTLDHAVGLVGYGVDGDKDFWLVRNSWSTTWGE 291

Query: 317 HGYFQIERG-ANACGIES 333
            GY +I R   N CG+ S
Sbjct: 292 QGYVRIRRNYHNMCGVAS 309


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.316    0.133    0.410 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,543,905,539
Number of Sequences: 23463169
Number of extensions: 237389341
Number of successful extensions: 633677
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5615
Number of HSP's successfully gapped in prelim test: 1386
Number of HSP's that attempted gapping in prelim test: 613376
Number of HSP's gapped (non-prelim): 9183
length of query: 341
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 198
effective length of database: 9,003,962,200
effective search space: 1782784515600
effective search space used: 1782784515600
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)