BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 022276
         (300 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|317106675|dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas]
          Length = 368

 Score =  451 bits (1160), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 217/298 (72%), Positives = 255/298 (85%), Gaps = 9/298 (3%)

Query: 3   RLILSSLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
           R ++S L+  LLS  +AS  + ++ DD +IRQVVP DG+Q  DHLLNAEHHF+ FK+KF 
Sbjct: 4   RCLISFLVYALLSFTIASTTSPDELDDPLIRQVVP-DGDQ--DHLLNAEHHFTTFKAKFG 60

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           KTYATQEEHDYRF++FKANLRRA++ Q++DPTAVHGVT FSDLTP EFRRQ+LGL RRLR
Sbjct: 61  KTYATQEEHDYRFKLFKANLRRARKHQMMDPTAVHGVTMFSDLTPREFRRQYLGL-RRLR 119

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
           LPADA +APILPTNDLPTDFDWRDHGAVT VK+QG+CGSCWSFSA GALEGAHFL+TGEL
Sbjct: 120 LPADAHEAPILPTNDLPTDFDWRDHGAVTNVKNQGSCGSCWSFSAAGALEGAHFLATGEL 179

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQQLVDCDHECDPEE G+CDSGCNGGLM +AFEY LKAGG+ERE+DYPYTG D G 
Sbjct: 180 VSLSEQQLVDCDHECDPEEYGACDSGCNGGLMTTAFEYTLKAGGLEREEDYPYTGNDRGP 239

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           CKFD++KI A+VSNFSV+S DEDQ+AANLVKHGPLA  + ++ +     +++  VS P
Sbjct: 240 CKFDRNKIVASVSNFSVVSIDEDQIAANLVKHGPLAVGINAVFMQ----TYMGGVSCP 293


>gi|224066056|ref|XP_002302004.1| predicted protein [Populus trichocarpa]
 gi|222843730|gb|EEE81277.1| predicted protein [Populus trichocarpa]
          Length = 367

 Score =  449 bits (1154), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 213/263 (80%), Positives = 236/263 (89%), Gaps = 5/263 (1%)

Query: 16  SVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
           S +AS V+ ND DD +IRQVV SDGE   D LLNAEHHF+ FKSKF KTYATQEEHDYRF
Sbjct: 17  SAVASTVSSNDLDDPLIRQVV-SDGE---DDLLNAEHHFTSFKSKFGKTYATQEEHDYRF 72

Query: 75  RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
            VFKANLRRAK+ Q++DPTA HG+TKFSDLTP EFRRQFLGL R LRLP DA KAPILPT
Sbjct: 73  GVFKANLRRAKKHQMIDPTAAHGITKFSDLTPKEFRRQFLGLKRWLRLPTDANKAPILPT 132

Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
            DLPTD+DWRDHGAVT VKDQG+CGSCWSFSATGALEGAH+L+TGEL SLSEQQLVDCDH
Sbjct: 133 TDLPTDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDH 192

Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
           ECDPEE G+CDSGC+GGLMN+AFEY LKAGG+ERE+DYPYTGTDGG+CKFDKSK+ A+VS
Sbjct: 193 ECDPEEYGACDSGCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVS 252

Query: 255 NFSVISSDEDQMAANLVKHGPLA 277
           NFSV+S DEDQ+AANLVKHGPL+
Sbjct: 253 NFSVVSIDEDQIAANLVKHGPLS 275


>gi|118485796|gb|ABK94746.1| unknown [Populus trichocarpa]
          Length = 367

 Score =  448 bits (1152), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 213/263 (80%), Positives = 235/263 (89%), Gaps = 5/263 (1%)

Query: 16  SVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
           S +AS V+ ND DD +IRQVV SDGE   D LLNAEHHF+ FKSKF KTYATQEEHDYRF
Sbjct: 17  SAVASTVSSNDLDDPLIRQVV-SDGE---DDLLNAEHHFTSFKSKFGKTYATQEEHDYRF 72

Query: 75  RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
            VFKANLRRAK+ Q++DPTA HG+TKFSDLTP EFRRQFLGL R LRLP DA KAPILPT
Sbjct: 73  GVFKANLRRAKKHQMIDPTAAHGITKFSDLTPKEFRRQFLGLKRWLRLPTDANKAPILPT 132

Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
            DLPTD+DWRDHGAVT VKDQG+CGSCWSFSATGALEGAH+L+TGEL SLSEQQLVDCDH
Sbjct: 133 TDLPTDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDH 192

Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
           ECDPEE G+CDSGC+GGLMN+AFEY LKAGG+ERE DYPYTGTDGG+CKFDKSK+ A+VS
Sbjct: 193 ECDPEEYGACDSGCDGGLMNNAFEYALKAGGLEREADYPYTGTDGGTCKFDKSKVVASVS 252

Query: 255 NFSVISSDEDQMAANLVKHGPLA 277
           NFSV+S DEDQ+AANLVKHGPL+
Sbjct: 253 NFSVVSIDEDQIAANLVKHGPLS 275


>gi|118489556|gb|ABK96580.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 367

 Score =  447 bits (1149), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 213/263 (80%), Positives = 235/263 (89%), Gaps = 5/263 (1%)

Query: 16  SVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
           S +AS V+  D DD +I QVV SDGE   D LLNAEHHF+ FKSKF KTYATQEEHDYRF
Sbjct: 17  SAVASTVSSTDLDDPLIIQVV-SDGE---DDLLNAEHHFTSFKSKFGKTYATQEEHDYRF 72

Query: 75  RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
            VFKANLRRAK+ Q++DPTA HGVTKFSDLTP EFRRQFLGL RRLRLP DA KAPILPT
Sbjct: 73  GVFKANLRRAKKHQMIDPTAAHGVTKFSDLTPKEFRRQFLGLKRRLRLPTDANKAPILPT 132

Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
            DLPTD+DWRDHGAVT VKDQG+CGSCWSFSATGALEGAH+L+TGEL SLSEQQLVDCDH
Sbjct: 133 TDLPTDYDWRDHGAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDH 192

Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
           ECDPEE G+CDSGC+GGLMN+AFEY LKAGG+ERE+DYPYTGTDGG+CKFDKSK+ A+VS
Sbjct: 193 ECDPEEYGACDSGCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVS 252

Query: 255 NFSVISSDEDQMAANLVKHGPLA 277
           NFSV+S DEDQ+AANLVKHGPL+
Sbjct: 253 NFSVVSIDEDQIAANLVKHGPLS 275


>gi|118485910|gb|ABK94801.1| unknown [Populus trichocarpa]
          Length = 367

 Score =  442 bits (1138), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 208/251 (82%), Positives = 228/251 (90%), Gaps = 4/251 (1%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           DD +I QVV SDGE   D LLNAEHHF+ FKSKF KTYATQEEHDYRF VFKANLRRAK+
Sbjct: 29  DDPLIIQVV-SDGE---DDLLNAEHHFTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKK 84

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
            Q++DPTA HGVTKFSDLTP EFRRQFLGL RRLRLP DA KAPILPT DLPTD+DWRDH
Sbjct: 85  HQMIDPTAAHGVTKFSDLTPKEFRRQFLGLKRRLRLPTDANKAPILPTTDLPTDYDWRDH 144

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAVT VKDQG+CGSCWSFSATGALEGAH+L+TGEL SLSEQQLVDCDHECDPEE G+CDS
Sbjct: 145 GAVTEVKDQGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDS 204

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GC+GGLMN+AFEY LKAGG+ERE+DYPYTGTDGG+CKFDKSK+ A+VSNFSV+S DEDQ+
Sbjct: 205 GCDGGLMNNAFEYALKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSIDEDQI 264

Query: 267 AANLVKHGPLA 277
           AANLVKHGPL+
Sbjct: 265 AANLVKHGPLS 275


>gi|255538808|ref|XP_002510469.1| cysteine protease, putative [Ricinus communis]
 gi|223551170|gb|EEF52656.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score =  430 bits (1105), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 206/301 (68%), Positives = 252/301 (83%), Gaps = 11/301 (3%)

Query: 1   MERLILSSLLLL--LLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
           MER    SL++   L SS+L +A +   DD +IRQVVP      ED+LL+A+HHF+ FK+
Sbjct: 1   MERSCFLSLIVFAFLSSSILFTATSDELDDPLIRQVVP----DVEDYLLSAQHHFTAFKA 56

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
           KF K YATQEEHDYRF+VFKANLRRA++ QL+DP+AVHGVTKFSDLTP EFRRQ+LGL +
Sbjct: 57  KFGKNYATQEEHDYRFKVFKANLRRAQKHQLMDPSAVHGVTKFSDLTPREFRRQYLGL-K 115

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
           +LRLPADA +APILPT+ +P DFDWRDHGAVT VK+QG+CGSCWSFSA GALEGAHFL+T
Sbjct: 116 KLRLPADAHEAPILPTDGIPEDFDWRDHGAVTNVKNQGSCGSCWSFSAAGALEGAHFLAT 175

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GELVSLSEQQLVDCDHECDP E G+CDSGCNGGLM +AFEYILKAGG+ERE+DYPYTG+D
Sbjct: 176 GELVSLSEQQLVDCDHECDPTEYGACDSGCNGGLMTNAFEYILKAGGLEREEDYPYTGSD 235

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSS 298
            G CKF+++KIAA+V+NFSV+S DEDQ+AANLV++GPLA  + ++ +     +++  VS 
Sbjct: 236 RGPCKFERAKIAASVNNFSVVSVDEDQIAANLVQNGPLAVGINAVFMQ----TYIGGVSC 291

Query: 299 P 299
           P
Sbjct: 292 P 292


>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
          Length = 366

 Score =  429 bits (1104), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 211/300 (70%), Positives = 249/300 (83%), Gaps = 9/300 (3%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDD-AMIRQVVPSDGEQSEDHLLNAEHHFSLFKSK 59
           M  L +    LLL S+ +A+   ++D+D  +IRQVVP   +  + HLLNAEHHFS FK+K
Sbjct: 1   MANLSILFFGLLLFSAAVATVERIDDEDNLLIRQVVP---DAEDHHLLNAEHHFSAFKTK 57

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
           F+KTYATQEEHD+RFR+FK NL RAK  Q LDP+AVHGVT+FSDLTPSEFR QFLGL + 
Sbjct: 58  FAKTYATQEEHDHRFRIFKNNLLRAKSHQKLDPSAVHGVTRFSDLTPSEFRGQFLGL-KP 116

Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
           LRLP+DAQKAPILPT+DLPTDFDWRDHGAVTGVK+QG+CGSCWSFSA GALEGAHFLSTG
Sbjct: 117 LRLPSDAQKAPILPTSDLPTDFDWRDHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLSTG 176

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
            LVSLSEQQLVDCDHECDPEE G+CDSGCNGGLM +AFEY LKAGG+ RE+DYPYTG D 
Sbjct: 177 GLVSLSEQQLVDCDHECDPEERGACDSGCNGGLMTTAFEYTLKAGGLMREEDYPYTGRDR 236

Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           G CKFDKSKIAA+V+NFSV+S DE+Q+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 237 GPCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVGINAVFMQ----TYIGGVSCP 292


>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
 gi|255639509|gb|ACU20049.1| unknown [Glycine max]
          Length = 366

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 206/285 (72%), Positives = 244/285 (85%), Gaps = 9/285 (3%)

Query: 16  SVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
           + +A+A  ++D DD +IRQVVP   +  + HLLNAEHHFS FK+KF KTYATQEEHD+RF
Sbjct: 16  ATVAAAERIDDEDDLLIRQVVP---DAEDHHLLNAEHHFSAFKTKFGKTYATQEEHDHRF 72

Query: 75  RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
           R+FK NL RAK  Q LDP+AVHGVT+FSDLTP+EFRRQFLGL + LRLP+DAQKAPILPT
Sbjct: 73  RIFKNNLLRAKSHQKLDPSAVHGVTRFSDLTPAEFRRQFLGL-KPLRLPSDAQKAPILPT 131

Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
           NDLPTDFDWR+HGAVTGVK+QG+CGSCWSFSA GALEGAHFLSTGELVSLSEQQLVDCDH
Sbjct: 132 NDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLSTGELVSLSEQQLVDCDH 191

Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
           ECDPEE G+CDSGCNGGLM +AFEY L+AGG+ REKDYPYTG D G CKFDKSK+AA+V+
Sbjct: 192 ECDPEERGACDSGCNGGLMTTAFEYTLQAGGLMREKDYPYTGRDRGPCKFDKSKVAASVA 251

Query: 255 NFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           NFSV+S DE+Q+AANLV++GPLA  + ++ +     +++  VS P
Sbjct: 252 NFSVVSLDEEQIAANLVQNGPLAVGINAVFMQ----TYIGGVSCP 292


>gi|124484383|dbj|BAF46302.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 369

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 204/284 (71%), Positives = 242/284 (85%), Gaps = 14/284 (4%)

Query: 22  VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
           V   D+D +IRQVV SDGE  +D LLNA+HHF+LFKSK+ K+YATQEEHDYR  VFKANL
Sbjct: 19  VVRADEDPLIRQVV-SDGE--DDALLNADHHFTLFKSKYGKSYATQEEHDYRLSVFKANL 75

Query: 82  RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL------NRRLRLPADAQKAPILPTN 135
           RRAKR QLLDP+AVHGVTKFSDLTP EFRR FLG+       R+L+LPADA  A ILPT+
Sbjct: 76  RRAKRHQLLDPSAVHGVTKFSDLTPKEFRRTFLGIRKSSSGKRKLKLPADAHAAEILPTS 135

Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
           DLP+DFDWRD+GAVTGVKDQG+CGSCWSFS TGALEGA+FL+TGELVSLSEQQLVDCDH 
Sbjct: 136 DLPSDFDWRDYGAVTGVKDQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHL 195

Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
           CDPEE+G+CDSGCNGGLM +A+EY+L++GG+E+EKDYPYTG D G+CKFDKSKIAAAV+N
Sbjct: 196 CDPEEAGACDSGCNGGLMTTAYEYVLQSGGLEKEKDYPYTGKD-GTCKFDKSKIAAAVAN 254

Query: 256 FSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           FSV+S DEDQ+AANLVKHGPL+  + ++ +     +++  VS P
Sbjct: 255 FSVVSLDEDQIAANLVKHGPLSVGINAVFMQ----TYIGGVSCP 294


>gi|5051468|emb|CAB44983.1| putative preprocysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score =  420 bits (1079), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 203/277 (73%), Positives = 236/277 (85%), Gaps = 8/277 (2%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           MERL L SLL  +L    +SA+A +D+D +IRQVV    E  + HLLNAEHHFSLFKSKF
Sbjct: 1   MERLFLLSLLAFVL---FSSAIAFSDEDPLIRQVV---SETDDSHLLNAEHHFSLFKSKF 54

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
            K YA++EEHD+RF+VFKANLRRA+R QLLDP+A HG+TKFSDLTPSEFRR +LGL++  
Sbjct: 55  GKIYASEEEHDHRFKVFKANLRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP- 113

Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
           +   +A+KAPILPT+DLP DFDWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 114 KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 173

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           LVSLSEQQLVDCDHECDPE+  +CD+GC GGLM +AFEY LKAGG++ EKDYPYTG D G
Sbjct: 174 LVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLKAGGLQLEKDYPYTGKD-G 232

Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            C FDKSKIAAAV+NFSVI  DEDQ+AANLVKHGPLA
Sbjct: 233 KCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLA 269


>gi|225431287|ref|XP_002275759.1| PREDICTED: cysteine proteinase RD19a isoform 1 [Vitis vinifera]
 gi|297735094|emb|CBI17456.3| unnamed protein product [Vitis vinifera]
          Length = 367

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 205/297 (69%), Positives = 240/297 (80%), Gaps = 15/297 (5%)

Query: 8   SLLLLLLSSVLASAVAVNDDDA-----MIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           S LL L+ ++L SA   +         +IRQVVP       D LL+AEH F LFK+KF K
Sbjct: 7   SALLFLIPTLLFSAAVSDISSDESDDLLIRQVVPEG-----DDLLSAEHQFGLFKAKFGK 61

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
           TY+T EEHDYRF VF+ANLRRA+R QLLDP+AVHGVT+FSDLTP EFRR +LGL + LRL
Sbjct: 62  TYSTVEEHDYRFSVFEANLRRARRHQLLDPSAVHGVTRFSDLTPDEFRRDYLGL-KPLRL 120

Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
           PADAQKAPILPTNDLPTDFDWRDHGAVT VKDQG+CGSCWSFSA GALEGAHFL+TG L+
Sbjct: 121 PADAQKAPILPTNDLPTDFDWRDHGAVTPVKDQGSCGSCWSFSAIGALEGAHFLTTGNLI 180

Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
           S+SEQQLVDCDHECDPEE G+CD GCNGGLM SAFEYILKAGGVERE+ YPY G+D GSC
Sbjct: 181 SMSEQQLVDCDHECDPEEYGACDQGCNGGLMTSAFEYILKAGGVEREETYPYIGSDRGSC 240

Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           KF+KS+I A+VSNFSV+S DEDQ+AAN+VK+GPLA  + ++ +     +++  VS P
Sbjct: 241 KFNKSQIVASVSNFSVVSLDEDQIAANMVKNGPLAVGINAVFMQ----TYMKGVSCP 293


>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 365

 Score =  417 bits (1073), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 204/273 (74%), Positives = 236/273 (86%), Gaps = 8/273 (2%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           DD +IRQVVP +GE  EDHLLNAEHHFS FKSKF KTYAT+EEHD+RF VFK+N+RRA+ 
Sbjct: 27  DDILIRQVVP-EGE-VEDHLLNAEHHFSTFKSKFGKTYATKEEHDHRFGVFKSNMRRARL 84

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
              LDP+AVHGVTKFSDLTP+EF R+FLGL + LRLPA AQKAPILPTN+LP DFDWRD 
Sbjct: 85  HAQLDPSAVHGVTKFSDLTPAEFHRKFLGL-KPLRLPAHAQKAPILPTNNLPKDFDWRDK 143

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAVT VKDQG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDH CDPEE GSCDS
Sbjct: 144 GAVTNVKDQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDS 203

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLMN+AFEY++ +GGV+REKDYPYTG D G+CKFDKSKIAA+VSN+SVIS DE+Q+
Sbjct: 204 GCNGGLMNNAFEYLIGSGGVQREKDYPYTGRD-GTCKFDKSKIAASVSNYSVISLDEEQI 262

Query: 267 AANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 263 AANLVKNGPLAVAINAVYMQ----TYVGGVSCP 291


>gi|28192375|gb|AAK07731.1| CPR2-like cysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score =  417 bits (1072), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 202/277 (72%), Positives = 235/277 (84%), Gaps = 8/277 (2%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           MERL L SLL  +L    +SA+A +D+D +IRQVV    E  + HLLNAEHHFSLFKSKF
Sbjct: 1   MERLFLLSLLAFVL---FSSAIAFSDEDPLIRQVV---SETDDSHLLNAEHHFSLFKSKF 54

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
            K YA++EEHD+RF+VFKAN RRA+R QLLDP+A HG+TKFSDLTPSEFRR +LGL++  
Sbjct: 55  GKIYASEEEHDHRFKVFKANRRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP- 113

Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
           +   +A+KAPILPT+DLP DFDWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 114 KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 173

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           LVSLSEQQLVDCDHECDPE+  +CD+GC GGLM +AFEY LKAGG++ EKDYPYTG D G
Sbjct: 174 LVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYTLKAGGLQLEKDYPYTGKD-G 232

Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            C FDKSKIAAAV+NFSVI  DEDQ+AANLVKHGPLA
Sbjct: 233 KCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLA 269


>gi|225427714|ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
          Length = 377

 Score =  417 bits (1071), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 201/279 (72%), Positives = 238/279 (85%), Gaps = 9/279 (3%)

Query: 25  NDDDAMIRQVVPSDGE---QSEDHLLNAEHH-FSLFKSKFSKTYATQEEHDYRFRVFKAN 80
           +DDD +IRQVVP  G+     E++LL A+HH FS+FK +F K+YA+QEEHDYRF+VFKAN
Sbjct: 30  SDDDIIIRQVVPELGDVEGSEEENLLTADHHHFSIFKRRFGKSYASQEEHDYRFKVFKAN 89

Query: 81  LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD 140
           LRRA+R Q LDP+A HGVT+FSDLTP+EFR  +LGL R L+LP DAQKAPILPTNDLP D
Sbjct: 90  LRRARRHQQLDPSATHGVTQFSDLTPAEFRGTYLGL-RPLKLPHDAQKAPILPTNDLPED 148

Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
           FDWRDHGAVT VK+QG+CGSCWSFS TGALEGA+FL+TG LVSLSEQQLV+CDHECDPEE
Sbjct: 149 FDWRDHGAVTAVKNQGSCGSCWSFSTTGALEGANFLATGNLVSLSEQQLVECDHECDPEE 208

Query: 201 SGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS 260
            GSCDSGCNGGLMN+AFEY LKAGG+ +E+DYPYTGTD GSCKFDK+KIAA+VSNFSVIS
Sbjct: 209 MGSCDSGCNGGLMNTAFEYTLKAGGLMKEEDYPYTGTDRGSCKFDKTKIAASVSNFSVIS 268

Query: 261 SDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
            DEDQ+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 269 LDEDQIAANLVKNGPLAVAINAVFMQ----TYVGGVSCP 303


>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
 gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 377

 Score =  417 bits (1071), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 204/301 (67%), Positives = 250/301 (83%), Gaps = 15/301 (4%)

Query: 10  LLLLLSSVLASAVAV------NDDDAMIRQVVP----SDGEQSEDHLLNAEHHFSLFKSK 59
           L+++LS + ASA+        +D D +IRQVV     ++G   +D LL A+HHFS+FK K
Sbjct: 7   LIVVLSLLAASAIGSEVISGESDGDFIIRQVVDDGGVNEGSNGDDLLLGADHHFSVFKQK 66

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-NR 118
           F K+YA++EEHD+RFRVFKANL+RA+R Q LDP+A HGVT+FSDLTPSEFRR FLGL +R
Sbjct: 67  FGKSYASKEEHDHRFRVFKANLKRAQRHQALDPSATHGVTQFSDLTPSEFRRSFLGLRSR 126

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
           RL LPADA KAPILPT+ LPTDFDWRD GAV+ VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 127 RLGLPADANKAPILPTDGLPTDFDWRDKGAVSEVKNQGSCGSCWSFSATGALEGANFLAT 186

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+LVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEY LK+GG+ +E+DYPYTGTD
Sbjct: 187 GKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKSGGLMKEQDYPYTGTD 246

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSS 298
            G+CKFDKSKIAA+V+NFSV+S DE+Q+AANLVK+GPLA  + ++ +     +++  VS 
Sbjct: 247 RGTCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQ----TYIKGVSC 302

Query: 299 P 299
           P
Sbjct: 303 P 303


>gi|7381221|gb|AAF61441.1|AF138265_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
          Length = 366

 Score =  416 bits (1069), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 197/281 (70%), Positives = 233/281 (82%), Gaps = 5/281 (1%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           R  L  L  LL ++ L  A   + DD +IRQVV   G+     LLNA+HHF++FK +F K
Sbjct: 4   RFSLLFLCTLLATTYLVFAAEDDGDDILIRQVVGDGGD-----LLNADHHFTVFKRRFGK 58

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
            YA+ EEHDYR  VFKAN+RRAK+ Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+ 
Sbjct: 59  VYASDEEHDYRLSVFKANMRRAKQHQELDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLKF 118

Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
           PADA+ APILPT++LP+DFDWRDHGAVT VK+QG CGSCWSFS TGALEGA+FL+TG+LV
Sbjct: 119 PADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLV 178

Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
           SLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D   C
Sbjct: 179 SLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQVC 238

Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           +FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLA  + ++
Sbjct: 239 RFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAV 279


>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
          Length = 367

 Score =  416 bits (1069), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 200/277 (72%), Positives = 233/277 (84%), Gaps = 10/277 (3%)

Query: 27  DDAMIRQVVP----SDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR 82
           DD +IRQVVP       E+ EDHLLNAEHHF+ FK+KF K YAT+EEHD RF VFK+NLR
Sbjct: 23  DDILIRQVVPDAVGEAAEKEEDHLLNAEHHFASFKAKFGKKYATKEEHDRRFGVFKSNLR 82

Query: 83  RAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFD 142
           RA+    LDP+AVHGVTKFSDLTP+EFRRQFLG  + LRLPA+AQKAPILPT DLP DFD
Sbjct: 83  RARLHAKLDPSAVHGVTKFSDLTPAEFRRQFLGF-KPLRLPANAQKAPILPTKDLPKDFD 141

Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
           WRD GAVT VKDQGACGSCWSFS TGALEGAH+L+TGELVSLSEQQLVDCDH CDPEE G
Sbjct: 142 WRDKGAVTNVKDQGACGSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYG 201

Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
           +CDSGCNGGLMN+AFEYIL++GGV++EKDYPYTG D G+CKFDK+K+AA VSN+SV+S D
Sbjct: 202 ACDSGCNGGLMNNAFEYILQSGGVQKEKDYPYTGRD-GTCKFDKTKVAATVSNYSVVSLD 260

Query: 263 EDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           EDQ+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 261 EDQIAANLVKNGPLAVGINAVFMQ----TYIGGVSCP 293


>gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera]
          Length = 377

 Score =  415 bits (1066), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 201/279 (72%), Positives = 237/279 (84%), Gaps = 9/279 (3%)

Query: 25  NDDDAMIRQVVPSDGE---QSEDHLLNAEHH-FSLFKSKFSKTYATQEEHDYRFRVFKAN 80
           +DDD +IRQVVP  G+     E++LL A+HH FS+FK +F K+YA+QEEHDYRF+VFKAN
Sbjct: 30  SDDDIIIRQVVPELGDVEGGEEENLLTADHHHFSIFKRRFGKSYASQEEHDYRFKVFKAN 89

Query: 81  LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD 140
           LRRA+R Q LDP+A HGVT+FSDLTP+EFR  +LGL R L+LP DAQKAPILPTNDLP D
Sbjct: 90  LRRARRHQQLDPSATHGVTQFSDLTPAEFRGTYLGL-RPLKLPHDAQKAPILPTNDLPED 148

Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
           FDWRDHGAVT VK+QG+CGSCWSFS TGALEGA+FL+TG LVSLSEQQLV+CDHECDPEE
Sbjct: 149 FDWRDHGAVTAVKNQGSCGSCWSFSTTGALEGANFLATGNLVSLSEQQLVECDHECDPEE 208

Query: 201 SGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS 260
            GSCDSGCNGGLMN+AFEY LKAGG+ +E+DYPYTGTD GSCKFDK+KIAA+VSNFSVIS
Sbjct: 209 MGSCDSGCNGGLMNTAFEYTLKAGGLMKEEDYPYTGTDRGSCKFDKTKIAASVSNFSVIS 268

Query: 261 SDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
            DEDQ+AANLVK GPLA  + ++ +     +++  VS P
Sbjct: 269 LDEDQIAANLVKIGPLAVAINAVFMQ----TYVGGVSCP 303


>gi|224082940|ref|XP_002306900.1| predicted protein [Populus trichocarpa]
 gi|118481986|gb|ABK92924.1| unknown [Populus trichocarpa]
 gi|222856349|gb|EEE93896.1| predicted protein [Populus trichocarpa]
          Length = 367

 Score =  415 bits (1066), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 208/273 (76%), Positives = 235/273 (86%), Gaps = 8/273 (2%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           DD +IRQVV     + EDHLLNAEHHF+ FKSKF K YATQEEHDYRF VFKANL RAK+
Sbjct: 29  DDPLIRQVV----SEGEDHLLNAEHHFTTFKSKFGKNYATQEEHDYRFSVFKANLLRAKK 84

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
            Q++DPTA HGVTKFSDLTP EFRRQ LGL RRLRLP DA KAPILPT DLPTDFDWRDH
Sbjct: 85  HQIMDPTAAHGVTKFSDLTPKEFRRQLLGLKRRLRLPTDANKAPILPTGDLPTDFDWRDH 144

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAVT VKDQG+CGSCWSFSATGALEGAH+L+TGELVSLSEQQLVDCDHECDPEE G+CDS
Sbjct: 145 GAVTSVKDQGSCGSCWSFSATGALEGAHYLATGELVSLSEQQLVDCDHECDPEEYGACDS 204

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GC+GGLMN+AFEY LKAGG+EREKDYPYTG D G+CKF+KSK+AA+VSNFSV+S DEDQ+
Sbjct: 205 GCSGGLMNNAFEYALKAGGLEREKDYPYTGNDRGACKFEKSKVAASVSNFSVVSLDEDQI 264

Query: 267 AANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           AANLVKHGPL+  + ++ +     +++  VS P
Sbjct: 265 AANLVKHGPLSVAINAVFMQ----TYIGGVSCP 293


>gi|7381219|gb|AAF61440.1|AF138264_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
          Length = 368

 Score =  415 bits (1066), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 203/298 (68%), Positives = 242/298 (81%), Gaps = 9/298 (3%)

Query: 3   RLILSSLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
           R  L  L  LL ++ L  A   +D DD +IRQVV  DG+     LLNA+HHF++FK +F 
Sbjct: 4   RFSLLFLCTLLATTSLVFAAEDDDGDDVLIRQVV-GDGDGD---LLNADHHFTVFKRRFG 59

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           K YA+ EEHDYR  VFKAN+RRAKR Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+
Sbjct: 60  KAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLK 119

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
            PADA+ APILPT++LP+DFDWRDHGAVT VK+QG CGSCWSFS TGALEGA+FL+TG+L
Sbjct: 120 FPADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKL 179

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D   
Sbjct: 180 VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQV 239

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           C+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 240 CRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 293


>gi|359492179|ref|XP_002280808.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
 gi|302142580|emb|CBI19783.3| unnamed protein product [Vitis vinifera]
          Length = 365

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 197/257 (76%), Positives = 222/257 (86%), Gaps = 7/257 (2%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           DD +IRQVV      + D LL+AEHHF+ FK++F KTYAT EEHDYRF +FKANLRRAKR
Sbjct: 31  DDLLIRQVV-----SNSDDLLSAEHHFAAFKARFRKTYATAEEHDYRFSIFKANLRRAKR 85

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
            QLLDP+AVHGVT+FSDLTP+EFR+ +LGL + LR P D Q+APILPTNDLPTDFDWRDH
Sbjct: 86  NQLLDPSAVHGVTRFSDLTPAEFRQNYLGL-KPLRFPIDTQQAPILPTNDLPTDFDWRDH 144

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAVT VKDQG CGSCWSFS TGALEGAHFL+TG LVSLSEQQLVDCDHECDPEE G+CD 
Sbjct: 145 GAVTAVKDQGECGSCWSFSTTGALEGAHFLATGNLVSLSEQQLVDCDHECDPEEYGACDR 204

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLMN+AFEYILKAGGV R +DYPYTGTD G CKFDK+KIAA+VSNFS +S DEDQ+
Sbjct: 205 GCNGGLMNTAFEYILKAGGVVRGEDYPYTGTD-GHCKFDKTKIAASVSNFSTVSIDEDQI 263

Query: 267 AANLVKHGPLAGNVASI 283
           AANLVK+GPLA  + +I
Sbjct: 264 AANLVKNGPLAVGINAI 280


>gi|7211741|gb|AAF40414.1|AF216783_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
          Length = 368

 Score =  414 bits (1064), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 203/298 (68%), Positives = 242/298 (81%), Gaps = 9/298 (3%)

Query: 3   RLILSSLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
           R  L  L  LL ++ L  A   +D DD +IRQVV  DG+     LLNA+HHF++FK +F 
Sbjct: 4   RFSLLFLCTLLATTSLVFAAEDDDGDDILIRQVV-GDGDGD---LLNADHHFTVFKRRFG 59

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           K YA+ EEHDYR  VFKAN+RRAKR Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+
Sbjct: 60  KAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLK 119

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
            PADA+ APILPT++LP+DFDWRDHGAVT VK+QG CGSCWSFS TGALEGA+FL+TG+L
Sbjct: 120 FPADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKL 179

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D   
Sbjct: 180 VSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQV 239

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           C+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 240 CRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 293


>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
          Length = 365

 Score =  412 bits (1060), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 201/270 (74%), Positives = 234/270 (86%), Gaps = 8/270 (2%)

Query: 30  MIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL 89
           +IRQVVP +GE  EDHLLNAEHHFS FK+KF KTYAT+EEHD+RF VFK+N+RRA+    
Sbjct: 30  LIRQVVP-EGE-VEDHLLNAEHHFSTFKAKFGKTYATKEEHDHRFGVFKSNMRRARLHAQ 87

Query: 90  LDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAV 149
           LDP+AVHGVTKFSDLTP+EF R+FLGL + LRLPA AQKAPILPTN+LP DFDWRD GAV
Sbjct: 88  LDPSAVHGVTKFSDLTPAEFHRKFLGL-KPLRLPAHAQKAPILPTNNLPKDFDWRDKGAV 146

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
           T VKDQG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDH CDPEE GSCDSGCN
Sbjct: 147 TNVKDQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCN 206

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
           GGLMN+AFEY++ +GGV+REKDYPYTG D G+CKFDKSKIAA+VSN+SVIS DE+Q+AAN
Sbjct: 207 GGLMNNAFEYLIGSGGVQREKDYPYTGRD-GTCKFDKSKIAASVSNYSVISLDEEQIAAN 265

Query: 270 LVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           LVK+GPLA  + ++ +     +++  VS P
Sbjct: 266 LVKNGPLAVAINAVYMQ----TYVGGVSCP 291


>gi|223049408|gb|ACM80348.1| cysteine proteinase [Solanum lycopersicum]
          Length = 368

 Score =  412 bits (1059), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 195/282 (69%), Positives = 236/282 (83%), Gaps = 12/282 (4%)

Query: 10  LLLLLSSVLASA--VAVN------DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
           L+ +LS +L ++  +AVN      DDD +IRQVV  +    + H+LNAEHHF+LFK +F 
Sbjct: 7   LVFVLSILLTTSFLLAVNGEIKGGDDDILIRQVVGDE----DHHMLNAEHHFTLFKKRFG 62

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           KTYA+ EEH YRF VFKANLRRA R Q LDP+AVHGVT+FSD+TP EF ++FLG+NRRLR
Sbjct: 63  KTYASDEEHHYRFSVFKANLRRAMRHQKLDPSAVHGVTQFSDMTPDEFSQKFLGVNRRLR 122

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
            P+DA KAPILPT DLP+DFDWR+HGAVT VK+QG+CGSCWSFS TGALEGA+FL+TG+L
Sbjct: 123 FPSDANKAPILPTEDLPSDFDWREHGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGKL 182

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQQLVDCDHECDPEE  SCDSGC+GGLMNSAFEY LKAGG+ RE+DYPYTGTD  +
Sbjct: 183 VSLSEQQLVDCDHECDPEEKDSCDSGCSGGLMNSAFEYTLKAGGLMREEDYPYTGTDKAT 242

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           CKFD +K+AA V+NFSV+S DE+Q+AANLVK+GPLA  + ++
Sbjct: 243 CKFDNTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAV 284


>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
          Length = 370

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 198/271 (73%), Positives = 234/271 (86%), Gaps = 7/271 (2%)

Query: 30  MIRQVVPSDGE-QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQ 88
           +IRQVVP  GE + ED+LLNAEHHF+ FK+KF+KTYAT+EEHD+RF VFK+NLRRA+   
Sbjct: 32  LIRQVVPDVGEAEEEDNLLNAEHHFASFKAKFAKTYATKEEHDHRFGVFKSNLRRARLHA 91

Query: 89  LLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGA 148
            LDP+AVHGVTKFSDLTP+EFRRQFLGL + LR PA AQKAPILPT DLP DFDWRD GA
Sbjct: 92  KLDPSAVHGVTKFSDLTPAEFRRQFLGL-KPLRFPAHAQKAPILPTKDLPKDFDWRDKGA 150

Query: 149 VTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGC 208
           VT VKDQGACGSCWSFS TGALEGAH+L+TGELVSLSEQQLVDCDH CDPEE G+CDSGC
Sbjct: 151 VTNVKDQGACGSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGC 210

Query: 209 NGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAA 268
           NGGLMN+AFEYIL++GGV++EKDYPYTG D G+CKFDK+K+AA VSN+SV+S DE+Q+AA
Sbjct: 211 NGGLMNNAFEYILQSGGVQKEKDYPYTGRD-GTCKFDKTKVAATVSNYSVVSLDEEQIAA 269

Query: 269 NLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           NLVK+GPLA  + ++ +     +++  VS P
Sbjct: 270 NLVKNGPLAVAINAVFMQ----TYVGGVSCP 296


>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
          Length = 358

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 196/251 (78%), Positives = 223/251 (88%), Gaps = 5/251 (1%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           DD +IRQVV    +  EDHLLNAEHHF+ FKSKFSK+YAT+EEHDYRF VFKANL +AK 
Sbjct: 21  DDFLIRQVV----DNEEDHLLNAEHHFTSFKSKFSKSYATKEEHDYRFGVFKANLIKAKL 76

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
            Q LDPTA HG+TKFSDLT SEFRRQFLGLN+RLRLPA AQKAPILPT +LP DFDWR+ 
Sbjct: 77  HQKLDPTAEHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKAPILPTTNLPEDFDWREK 136

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+LVSLSEQQLVDCDH CDPEE+GSCDS
Sbjct: 137 GAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEEAGSCDS 196

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLMN+AFEY+L++GGV +EKDY YTG D GSCKFDKSK+ A+VSNFSV+S DE+Q+
Sbjct: 197 GCNGGLMNNAFEYLLQSGGVVQEKDYAYTGRD-GSCKFDKSKVVASVSNFSVVSLDEEQI 255

Query: 267 AANLVKHGPLA 277
           AANLVK+GPLA
Sbjct: 256 AANLVKNGPLA 266


>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
          Length = 363

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 201/294 (68%), Positives = 240/294 (81%), Gaps = 15/294 (5%)

Query: 12  LLLSSVLASAVAV------NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
            + + VL +AVA       N DD +IRQVV    +  EDHLLNAEHHF+ FKSKFSK+Y+
Sbjct: 5   FIFAIVLFAAVATSSTDNTNTDDFIIRQVV----DNEEDHLLNAEHHFTSFKSKFSKSYS 60

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           T+EEHDYRF VFK+NL +AK  Q LDPTA HG+TKFSDLT SEFRRQFLGL +RLRLPA 
Sbjct: 61  TKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASEFRRQFLGLKKRLRLPAH 120

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
           AQKAPILPT +LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+LVSLS
Sbjct: 121 AQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLS 180

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQQLVDCDH CDPE++GSCDSGCNGGLMN+AFEY+L++GGV +EKDY YTG D GSCKFD
Sbjct: 181 EQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQEKDYAYTGRD-GSCKFD 239

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           KSK+ A+VSNFSV+S DE+Q+AANLVK+GPLA  + +  +     +++  VS P
Sbjct: 240 KSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQ----TYMSGVSCP 289


>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
          Length = 363

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 201/294 (68%), Positives = 240/294 (81%), Gaps = 15/294 (5%)

Query: 12  LLLSSVLASAVAV------NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
            + + VL +AVA       N DD +IRQVV    +  EDHLLNAEHHF+ FKSKFSK+Y+
Sbjct: 5   FIFAIVLFAAVATSSTDDTNTDDFIIRQVV----DNEEDHLLNAEHHFTSFKSKFSKSYS 60

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           T+EEHDYRF VFK+NL +AK  Q LDPTA HG+TKFSDLT SEFRRQFLGL +RLRLPA 
Sbjct: 61  TKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASEFRRQFLGLKKRLRLPAH 120

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
           AQKAPILPT +LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+LVSLS
Sbjct: 121 AQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLS 180

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQQLVDCDH CDPE++GSCDSGCNGGLMN+AFEY+L++GGV +EKDY YTG D GSCKFD
Sbjct: 181 EQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQEKDYAYTGRD-GSCKFD 239

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           KSK+ A+VSNFSV+S DE+Q+AANLVK+GPLA  + +  +     +++  VS P
Sbjct: 240 KSKVVASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQ----TYMSGVSCP 289


>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
 gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
          Length = 368

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 203/281 (72%), Positives = 236/281 (83%), Gaps = 9/281 (3%)

Query: 21  AVAVNDDDAMIRQVVPSDGEQ-SEDHLLNAE-HHFSLFKSKFSKTYATQEEHDYRFRVFK 78
           A  +N DD +IR+VV  DG+  S  +LL+AE HHFSLFKSKF K+Y +QEEHDYRF VFK
Sbjct: 21  AETLNGDDPLIREVV--DGQDASSSNLLSAEQHHFSLFKSKFKKSYGSQEEHDYRFSVFK 78

Query: 79  ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
           ANLRRA R Q LDPTA HGVT+FSDLTP+EFR+Q LGL RRLRLP DA +APILPT+DLP
Sbjct: 79  ANLRRAARHQELDPTASHGVTQFSDLTPAEFRKQVLGL-RRLRLPKDANEAPILPTSDLP 137

Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
            DFDWRD GAV  +K+QG+CGSCWSFSATGALEGAHFL+TGELVSLSEQQLVDCDHECDP
Sbjct: 138 EDFDWRDKGAVGPIKNQGSCGSCWSFSATGALEGAHFLATGELVSLSEQQLVDCDHECDP 197

Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
           EE GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTGTD  +CKFDK+K+AA V+NFSV
Sbjct: 198 EEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRDACKFDKNKVAARVANFSV 257

Query: 259 ISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           +S DEDQ+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 258 VSLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 294


>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
 gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
          Length = 364

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 195/254 (76%), Positives = 222/254 (87%), Gaps = 5/254 (1%)

Query: 24  VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR 83
            N DD +IRQVV    + +EDH+LNAEHHF+ FKSKFSK YAT+EEHDYRF VFK+NL +
Sbjct: 26  TNSDDLLIRQVV----DTAEDHILNAEHHFTSFKSKFSKNYATKEEHDYRFGVFKSNLIK 81

Query: 84  AKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDW 143
           AK  Q LDP+A HG+TKFSDLT SEFRRQFLGLN+RLRLPA AQKAPILPTN+LP DFDW
Sbjct: 82  AKLHQKLDPSAQHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKAPILPTNNLPEDFDW 141

Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
           R+ GAVT VKDQG+CGSCW+FS TGALEGA++L+TG+L SLSEQQLVDCDH CDPEE GS
Sbjct: 142 REKGAVTPVKDQGSCGSCWAFSTTGALEGANYLATGKLTSLSEQQLVDCDHVCDPEERGS 201

Query: 204 CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDE 263
           CDSGCNGGLMN+AFEYIL++GGV  EKDY YTG D GSCKFDKSK+ A+VSNFSV+S DE
Sbjct: 202 CDSGCNGGLMNNAFEYILQSGGVVSEKDYAYTGRD-GSCKFDKSKVVASVSNFSVVSLDE 260

Query: 264 DQMAANLVKHGPLA 277
           DQ+AANLVK+GPLA
Sbjct: 261 DQIAANLVKNGPLA 274


>gi|7242888|dbj|BAA92495.1| cysteine protease [Vigna mungo]
          Length = 364

 Score =  410 bits (1053), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 202/270 (74%), Positives = 232/270 (85%), Gaps = 8/270 (2%)

Query: 30  MIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL 89
           +IRQVVP +GE  EDHLLNAEHHFS FK+KF KTYAT+EEHD+RF VFK+NLRRA+    
Sbjct: 29  LIRQVVP-EGE-VEDHLLNAEHHFSNFKAKFGKTYATKEEHDHRFGVFKSNLRRARLHAQ 86

Query: 90  LDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAV 149
           LDP+AVHGVTKFSDLT +EF+RQFLGL + L LPA+AQKAPILPTN+LP DFDWRD GAV
Sbjct: 87  LDPSAVHGVTKFSDLTAAEFQRQFLGL-KPLGLPANAQKAPILPTNNLPKDFDWRDKGAV 145

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
           T VKDQGACGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDH CDPEE G+CDSGCN
Sbjct: 146 TNVKDQGACGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCN 205

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
           GGLMN+AFEYIL AGGV+RE+DYPY G D  SCKFDKSKIAA+V+N+SVIS DEDQ+AAN
Sbjct: 206 GGLMNNAFEYILGAGGVQREEDYPYAGRD-SSCKFDKSKIAASVANYSVISLDEDQIAAN 264

Query: 270 LVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           LVK+GPLA  + ++ +     +++  VS P
Sbjct: 265 LVKNGPLAVGINAVYMQ----TYIGGVSCP 290


>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
          Length = 368

 Score =  410 bits (1053), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 197/280 (70%), Positives = 228/280 (81%), Gaps = 5/280 (1%)

Query: 20  SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
           SA   N DD++IRQVV    E S + L   +HHFSLFK KF K+Y +QEEHDYRF VFK+
Sbjct: 20  SAETFNGDDSLIRQVVEGQDESSSNLLTAEQHHFSLFKRKFKKSYLSQEEHDYRFSVFKS 79

Query: 80  NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT 139
           NLRRA R Q LDPTA HGVT+FSDLT +EFR+Q LGL R+LRLP DA  APILPTNDLP 
Sbjct: 80  NLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGL-RKLRLPKDANTAPILPTNDLPE 138

Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
           DFDWR+ GAV  VK+QG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDHECDPE
Sbjct: 139 DFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPE 198

Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
           E GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D G+CKFDK+K+AA V+NFSV+
Sbjct: 199 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGMDRGACKFDKNKVAAGVANFSVV 258

Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           S DEDQ+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 259 SLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 294


>gi|19851|emb|CAA78365.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
          Length = 365

 Score =  410 bits (1053), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 200/277 (72%), Positives = 233/277 (84%), Gaps = 6/277 (2%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M+RL L SL    L    +SA+A  D+D +IRQVV S+ E  + HLLNAEHHFSLFKSKF
Sbjct: 1   MDRLFLLSLPRFAL---FSSAIAFPDEDPLIRQVV-SETETDDSHLLNAEHHFSLFKSKF 56

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
            K YA++EEHD+RF+VFKANLRRA+  QLLDP+A HG+TKFSDLTPSEFRR +LGL++  
Sbjct: 57  GKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP- 115

Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
           +   +A+KAPILPT+DLP D+DWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 116 KPKVNAEKAPILPTSDLPADYDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 175

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           LVSLSEQQLVDCDHECD E+  SCD+GC GGLM +AFEY LKAGG++ EKDYPYTG D G
Sbjct: 176 LVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLMTTAFEYTLKAGGLQLEKDYPYTGKD-G 234

Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            C FDKSKIAAAV+NFSVI  DEDQ+AANLVKHGPLA
Sbjct: 235 KCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLA 271


>gi|42407296|dbj|BAD10859.1| cysteine protease [Aster tripolium]
          Length = 363

 Score =  409 bits (1052), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 195/280 (69%), Positives = 233/280 (83%), Gaps = 6/280 (2%)

Query: 21  AVAVNDDDAMIRQVVPSDGEQSE-DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
           AV  +  D +IRQVV +D  + E D LL+ EHHF LFK+KF +TY T+EEH+YR  VFK+
Sbjct: 17  AVTADSSDPLIRQVVQNDETEIESDPLLDPEHHFKLFKNKFGRTYDTEEEHEYRLTVFKS 76

Query: 80  NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT 139
           NLRRAKR Q+LDPTA HGVTKFSDLTPSEFR+++LGL  +L+LPADA KAPILPT++LP 
Sbjct: 77  NLRRAKRHQVLDPTAKHGVTKFSDLTPSEFRKKYLGLKSKLKLPADANKAPILPTSNLPQ 136

Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
           DFDWRD GAVT VK+QG+CGSCWSFS TGALEG+HFL TGELVSLSEQQLVDCDHECDP 
Sbjct: 137 DFDWRDKGAVTPVKNQGSCGSCWSFSTTGALEGSHFLQTGELVSLSEQQLVDCDHECDPA 196

Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
           E  SCDSGCNGGLMN+AFEYILKAGG+++E DYPYTG D G+CKFDKSKIAA+V+NFSV+
Sbjct: 197 EYNSCDSGCNGGLMNNAFEYILKAGGLQKEADYPYTGRD-GTCKFDKSKIAASVANFSVV 255

Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           S+DEDQ+AANLV +GPLA  + +  +     +++  VS P
Sbjct: 256 STDEDQIAANLVTNGPLAIGINAAWMQ----TYIGQVSCP 291


>gi|19849|emb|CAA78361.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score =  409 bits (1051), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 199/277 (71%), Positives = 232/277 (83%), Gaps = 8/277 (2%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           MERL L SLL  +L    +SA+A +D+D +IRQVV    E  + HLLNAEHHFSLFKSKF
Sbjct: 1   MERLFLLSLLAFVL---FSSAIAFSDEDPLIRQVV---SETDDSHLLNAEHHFSLFKSKF 54

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
            K YA++EEHD+RF+VFKANLRRA+  QLLDP+A HG+TKFSDLTPSEFRR +LGL++  
Sbjct: 55  GKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP- 113

Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
           +   +A+KAPILPT+DLP DFDWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 114 KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 173

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           LVSLSEQQLVDCDHECDPE+  +CD+GC GG   +AFEY LKAGG++ EKDYPYTG D G
Sbjct: 174 LVSLSEQQLVDCDHECDPEQQDACDAGCGGGHYATAFEYTLKAGGLQLEKDYPYTGKD-G 232

Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            C FDKSKI AAV+NFSVI  DEDQ+AANLVKHGPLA
Sbjct: 233 KCHFDKSKICAAVTNFSVIGLDEDQIAANLVKHGPLA 269


>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
           Full=Turgor-responsive protein 15A; Flags: Precursor
 gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
          Length = 363

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 196/262 (74%), Positives = 228/262 (87%), Gaps = 7/262 (2%)

Query: 18  LASAVA--VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
           +A+AV    N+DD +IRQVV    +  EDHLLNAEHHF+ FKSKFSK+YAT+EEHDYRF 
Sbjct: 15  VATAVTDDTNNDDFIIRQVV----DNEEDHLLNAEHHFTSFKSKFSKSYATKEEHDYRFG 70

Query: 76  VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN 135
           VFK+NL +AK  Q  DPTA HG+TKFSDLT SEFRRQFLGL +RLRLPA AQKAPILPT 
Sbjct: 71  VFKSNLIKAKLHQNRDPTAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKAPILPTT 130

Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
           +LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGAH+L+TG+LVSLSEQQLVDCDH 
Sbjct: 131 NLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHV 190

Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
           CDPE++GSCDSGCNGGLMN+AFEY+L++GGV +EKDY YTG D GSCKFDKSK+ A+VSN
Sbjct: 191 CDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEKDYAYTGRD-GSCKFDKSKVVASVSN 249

Query: 256 FSVISSDEDQMAANLVKHGPLA 277
           FSV++ DEDQ+AANLVK+GPLA
Sbjct: 250 FSVVTLDEDQIAANLVKNGPLA 271


>gi|255543801|ref|XP_002512963.1| cysteine protease, putative [Ricinus communis]
 gi|223547974|gb|EEF49466.1| cysteine protease, putative [Ricinus communis]
          Length = 373

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 202/297 (68%), Positives = 245/297 (82%), Gaps = 8/297 (2%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSED-HLLNAEHHFSLFKSKFSK 62
            ++SS+L +  S+V A  +  + +D +IRQV     E S + +LL AEHHFSLFK KF K
Sbjct: 10  FVISSILFV--SAVTAETLTTDGEDPLIRQVTDGQDESSANPNLLGAEHHFSLFKKKFKK 67

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
           TYA+QEEHDYRF++FK+NLRRA+R Q LDPTA HGVT+FSDLT SEFRRQFLGL RRLRL
Sbjct: 68  TYASQEEHDYRFKIFKSNLRRAERHQKLDPTATHGVTQFSDLTHSEFRRQFLGL-RRLRL 126

Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
           P DA +AP+LPTNDLP DFDWR+ GAVT VK+QG+CGSCWSFS TGALEGA++L+TG+LV
Sbjct: 127 PKDANEAPMLPTNDLPADFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATGKLV 186

Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
           SLSEQQLVDCDHECDP E G+CDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTGTD G+C
Sbjct: 187 SLSEQQLVDCDHECDPAEEGACDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGAC 246

Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           +FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 247 QFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 299


>gi|71482944|gb|AAZ32411.1| cysteine proteinase glycinain type [Nicotiana benthamiana]
          Length = 355

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 197/278 (70%), Positives = 235/278 (84%), Gaps = 7/278 (2%)

Query: 22  VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
           VA +D+D +IRQVV S+ E  + HLLNAEHHFSLFKSKF K YA++EEHD+RF+VFKANL
Sbjct: 19  VAFSDEDPLIRQVV-SETETDDSHLLNAEHHFSLFKSKFGKIYASEEEHDHRFKVFKANL 77

Query: 82  RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF 141
           RRA+R QLLDP+A HG+TKFSDLTPSEFRR +LGL++  +   +A+KAPILPT+DLP D+
Sbjct: 78  RRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLHKP-KPKLNAEKAPILPTSDLPADY 136

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWRDHGAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGELVSLSEQQLVDCDHECDPE+ 
Sbjct: 137 DWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQ 196

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
            SCD+GC+GGLM +AFEY LKAGG++REKDYPYTG   G C FDKSKIAAAV+NFSVI  
Sbjct: 197 DSCDAGCSGGLMTTAFEYTLKAGGLQREKDYPYTGKX-GKCHFDKSKIAAAVTNFSVIGL 255

Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           DEDQ+AANLVKHGPLA  + +  +     +++  VS P
Sbjct: 256 DEDQIAANLVKHGPLAVGINAAWMQ----TYVGGVSCP 289


>gi|7211745|gb|AAF40416.1|AF216785_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
 gi|7381223|gb|AAF61442.1|AF138266_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
          Length = 366

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 194/273 (71%), Positives = 230/273 (84%), Gaps = 8/273 (2%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           DD +IRQVV  DG+     LLNA+HHF++FK +F K YA+ EEHDYR  VFKAN+RRAKR
Sbjct: 27  DDILIRQVV-GDGDGD---LLNADHHFAVFKRRFGKAYASDEEHDYRLSVFKANMRRAKR 82

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
            Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+ PADA+ APILPT++LP+DFDWRD 
Sbjct: 83  HQQLDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDR 142

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAVT VK+QG CGSCWSFS TGALEGA+FL+TG+LVSLSEQQLVDCDHECDPEE+GSCDS
Sbjct: 143 GAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDS 202

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLMNSAFEY LKAGG+ RE+DYPYTG D   C+FDK+KIAA V+NFSV+S DEDQ+
Sbjct: 203 GCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQI 262

Query: 267 AANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 263 AANLVKNGPLAVAINAVFMQ----TYIGGVSCP 291


>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
          Length = 374

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 194/258 (75%), Positives = 217/258 (84%), Gaps = 1/258 (0%)

Query: 20  SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
           SA   N DD++IRQVV    E S + L   +HH SLFK KF K+Y +QEEHDYRF VFK+
Sbjct: 26  SAETFNGDDSLIRQVVEGQDESSPNLLTAEQHHLSLFKRKFKKSYLSQEEHDYRFSVFKS 85

Query: 80  NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT 139
           NLRRA R Q LDPTA HGVT+FSDLT +EFR+Q LGL R+LRLP DA KAPILPTNDLP 
Sbjct: 86  NLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGL-RKLRLPKDANKAPILPTNDLPE 144

Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
           DFDWR+ GAV  VK+QG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDHECDPE
Sbjct: 145 DFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPE 204

Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
           E GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D G+CKFDK K+AA V+NFSV+
Sbjct: 205 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGMDRGACKFDKDKVAAGVANFSVV 264

Query: 260 SSDEDQMAANLVKHGPLA 277
           S DEDQ+AANLVK+GPLA
Sbjct: 265 SLDEDQIAANLVKNGPLA 282


>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
 gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
          Length = 368

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 196/280 (70%), Positives = 227/280 (81%), Gaps = 5/280 (1%)

Query: 20  SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
           SA   N DD++IRQVV    E S + L   +HHFSLFK KF K+Y +QEEHDYRF VFK+
Sbjct: 20  SAETFNGDDSLIRQVVEGQDESSSNLLTAEQHHFSLFKRKFKKSYLSQEEHDYRFSVFKS 79

Query: 80  NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT 139
           NLRRA R Q LDPTA HGVT+FSDLT +EFR+Q LGL R+LRLP DA  APILPTNDLP 
Sbjct: 80  NLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGL-RKLRLPKDANTAPILPTNDLPE 138

Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
           DFDWR+ GAV  VK+QG+CGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDHECDPE
Sbjct: 139 DFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPE 198

Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
           E GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTG D G+CKFDK+K+AA V+NFS +
Sbjct: 199 EPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGMDRGACKFDKNKVAAGVANFSAV 258

Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           S DEDQ+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 259 SLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 294


>gi|356541074|ref|XP_003539008.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
          Length = 363

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 196/277 (70%), Positives = 229/277 (82%), Gaps = 6/277 (2%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M    L    L++ S   A++    DD+ +I QVV   G +     L AEHHF  FK +F
Sbjct: 1   MNNPTLIIFFLVIFSVFFAASADGGDDEPLIMQVVEGSGVR-----LGAEHHFLDFKRRF 55

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
            K YA+QEEH+YRF VFKAN+RRA+R Q LDP+A HGVT+FSDLT SEFR + LGL R +
Sbjct: 56  GKAYASQEEHNYRFEVFKANMRRARRHQSLDPSAAHGVTRFSDLTASEFRNKVLGL-RGV 114

Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
           RLP++A KAPILPT++LP+DFDWRDHGAVT VK+QG+CGSCWSFS TGALEGAHFLSTGE
Sbjct: 115 RLPSNANKAPILPTDNLPSDFDWRDHGAVTPVKNQGSCGSCWSFSTTGALEGAHFLSTGE 174

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           LVSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEYILK+GGV RE+DYPY+GTD G
Sbjct: 175 LVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYILKSGGVMREEDYPYSGTDRG 234

Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +CKFDK+KIAA+V+NFSVIS DEDQ+AANLVK+GPLA
Sbjct: 235 NCKFDKAKIAASVANFSVISLDEDQIAANLVKNGPLA 271


>gi|297804580|ref|XP_002870174.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316010|gb|EFH46433.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 373

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 208/297 (70%), Positives = 245/297 (82%), Gaps = 8/297 (2%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
           LI ++LL + L S + S          IRQVVP   E++++HLLNAEHHFSLFKSK+ KT
Sbjct: 9   LIAATLLAVSLGSAVISGEVNYGFVNPIRQVVP---EENDEHLLNAEHHFSLFKSKYEKT 65

Query: 64  YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR-LRL 122
           YATQEEHD+RFRVFKANLRRA+R QLLDP+AVHGVT+FSDLTP EFRR+FLGL RR  RL
Sbjct: 66  YATQEEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRKFLGLKRRGFRL 125

Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
           P D Q APILPT+DLPT+FDWR+ GAVT VK+QG CGSCWSFSA GALEGAHFL+T ELV
Sbjct: 126 PTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKELV 185

Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
           SLSEQQLVDCDHECDP ++ SCDSGC+GGLMN+AFEY LKAGG+ +E+DYPYTG D  +C
Sbjct: 186 SLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDYPYTGRDNTAC 245

Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           KFDKSKIAA+VSNFSV+SSDEDQ+AANLVKHGPLA  + ++ +     +++  VS P
Sbjct: 246 KFDKSKIAASVSNFSVVSSDEDQIAANLVKHGPLAIAINAMWMQ----TYIGGVSCP 298


>gi|171854651|dbj|BAG16515.1| putative cysteine proteinase [Capsicum chinense]
          Length = 367

 Score =  407 bits (1046), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 200/299 (66%), Positives = 241/299 (80%), Gaps = 9/299 (3%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M+RL L SLL+  + S  +SA A +D+D +IRQV  S+ + + +HLLNAEHHFSLFKSKF
Sbjct: 1   MDRLFLLSLLVFTIFS--SSAFAFSDEDPLIRQVT-SESDDNNNHLLNAEHHFSLFKSKF 57

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
            K YATQEEHD+R +VFKANLRRA+R QLLDPTA HG+TKFSDLTPSEFRR +LGL++  
Sbjct: 58  GKIYATQEEHDHRLKVFKANLRRARRHQLLDPTAEHGITKFSDLTPSEFRRTYLGLHKP- 116

Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
           +      KAPILPT+DLP DFDWR+ GAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGE
Sbjct: 117 KPKLSTTKAPILPTSDLPEDFDWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGE 176

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           LVSLSEQQLVDCDHECD E+   CD+GC GGLM +AFEY LKAGG++REKDYPYTG + G
Sbjct: 177 LVSLSEQQLVDCDHECDAEQKSECDAGCGGGLMTTAFEYTLKAGGLQREKDYPYTGRN-G 235

Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
            C FDKSKIAA+V+N+SV+  DEDQ+AANLVKHGPLA  + S  +     +++  VS P
Sbjct: 236 QCHFDKSKIAASVTNYSVVGLDEDQIAANLVKHGPLAVGINSAWMQ----TYIGGVSCP 290


>gi|13491752|gb|AAK27969.1|AF242373_1 cysteine protease [Ipomoea batatas]
          Length = 366

 Score =  406 bits (1043), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 197/297 (66%), Positives = 238/297 (80%), Gaps = 9/297 (3%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           R  L  L  LL ++ L  A   + DD +IRQVV   G+     LLNA+HHF++FK +F K
Sbjct: 4   RFSLLFLCTLLATTYLVFAAEDDGDDILIRQVVGDGGD-----LLNADHHFTVFKRRFGK 58

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
            YA+ EEHDYR   FKAN+RRAK+ Q LDP AVHGVT+FSDLTP+EFRR+FLGLNRRL+ 
Sbjct: 59  VYASDEEHDYRLSEFKANMRRAKQHQELDPAAVHGVTQFSDLTPTEFRRKFLGLNRRLKF 118

Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
           PADA+ APILPT++LP+DFDWRDHGAVT VK+QG CGSC SFS TGALEGA+FL+TG+LV
Sbjct: 119 PADAKTAPILPTDELPSDFDWRDHGAVTPVKNQGTCGSCCSFSTTGALEGANFLATGKLV 178

Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
           SLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+D+PYTG D   C
Sbjct: 179 SLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDHPYTGNDLQVC 238

Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           +FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 239 RFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 291


>gi|4757570|gb|AAD29084.1|AF082181_1 cysteine proteinase precursor [Solanum melongena]
          Length = 363

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 195/278 (70%), Positives = 231/278 (83%), Gaps = 9/278 (3%)

Query: 22  VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
           +A +DDD +IRQVV    E  ++H+LNAEHHFSLFKSK+ K YA+QEEHD+R +VFKANL
Sbjct: 19  IAFSDDDPLIRQVV---SETDDNHMLNAEHHFSLFKSKYGKIYASQEEHDHRLKVFKANL 75

Query: 82  RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF 141
           RRA+R QLLDPTA HG+T+FSDLTPSEFRR +LGL++  R   +AQKAPILPT+DLP DF
Sbjct: 76  RRARRHQLLDPTAEHGITQFSDLTPSEFRRTYLGLHKP-RPKLNAQKAPILPTSDLPEDF 134

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWR+ GAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGELVSLSEQQLVDCDHECD EE 
Sbjct: 135 DWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEEK 194

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
             CD+GCNGGLM +AFEY LKAGG++REKDYPYTG D G C FDKSKIAA+V+NFSVI  
Sbjct: 195 SECDAGCNGGLMTTAFEYTLKAGGLQREKDYPYTGRD-GKCHFDKSKIAASVANFSVIGL 253

Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           DEDQ+AANLVKHGPLA  + +  +     +++  VS P
Sbjct: 254 DEDQIAANLVKHGPLAVGINAAWMQ----TYMRGVSCP 287


>gi|34761156|gb|AAQ81938.1| cysteine proteinase precursor [Ipomoea batatas]
          Length = 371

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 200/307 (65%), Positives = 246/307 (80%), Gaps = 19/307 (6%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M+R  L SLL+  L+   A+ V   D+D +IRQVV SDGE  +D LLNA+HHF+LFKSK+
Sbjct: 1   MDRFSLPSLLIHALT---AACVVRADEDPLIRQVV-SDGE--DDALLNADHHFTLFKSKY 54

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
            K+YATQEEHDYR  VFKANLRRAKR Q+LDP+AVHGVTKFSDLTP EFRR +LG+ +  
Sbjct: 55  GKSYATQEEHDYRLSVFKANLRRAKRHQMLDPSAVHGVTKFSDLTPKEFRRTYLGIRKSS 114

Query: 121 RL--------PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
                     PADA  A ILPT+DLP DF+WRD+GAVTGVKDQG CGSCWSFS TG LEG
Sbjct: 115 SSKQKLKLKLPADAHAAEILPTSDLPFDFEWRDYGAVTGVKDQGLCGSCWSFSTTGTLEG 174

Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
            +FL+TGEL+SL+EQ+LVDCDH CDP+++G+CD+GCNGGLM +A+EY+L++GG+E+EKDY
Sbjct: 175 TNFLATGELLSLNEQELVDCDHLCDPKKAGACDAGCNGGLMTTAYEYVLQSGGLEKEKDY 234

Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           PYTG D G+CKFDKSKIAAAV+NFSV+S DEDQ+AANLVKHGPL+  + SI +     ++
Sbjct: 235 PYTGRD-GTCKFDKSKIAAAVANFSVVSLDEDQIAANLVKHGPLSVGINSIFMQ----TY 289

Query: 293 LFTVSSP 299
           +  VS P
Sbjct: 290 IGGVSCP 296


>gi|7211743|gb|AAF40415.1|AF216784_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
          Length = 368

 Score =  403 bits (1036), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 199/298 (66%), Positives = 238/298 (79%), Gaps = 9/298 (3%)

Query: 3   RLILSSLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
           R  L  L  LL ++ L  A   +D DD +IRQVV  DG+     LLNA+HHF++FK +F 
Sbjct: 4   RFSLLFLCTLLATTSLVFAAEDDDGDDILIRQVV-GDGDGD---LLNADHHFTVFKRRFG 59

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           K YA+ EEHDYR  VFKAN+RRAKR Q LDP AVHGVT+FSD TP+EFRR+FLGLNRRL+
Sbjct: 60  KAYASDEEHDYRLSVFKANMRRAKRHQELDPAAVHGVTQFSDSTPTEFRRKFLGLNRRLK 119

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
            PADA+ APILPT++LP+DFDWRD GAVT VK+QG CG CWSFS TGALEGA+FL+TG+L
Sbjct: 120 FPADAKTAPILPTDELPSDFDWRDRGAVTPVKNQGTCGLCWSFSTTGALEGANFLATGKL 179

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQQLVDCDHECDPEE+GSCD GCNGGLMNSAFEY LKAGG+ RE+DYPYTG D   
Sbjct: 180 VSLSEQQLVDCDHECDPEEAGSCDFGCNGGLMNSAFEYTLKAGGLMREEDYPYTGNDLQV 239

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           C+FDK+KIAA V+NFSV+S DEDQ+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 240 CRFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQ----TYIGGVSCP 293


>gi|449469923|ref|XP_004152668.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
 gi|449520697|ref|XP_004167370.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 371

 Score =  402 bits (1033), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 201/292 (68%), Positives = 234/292 (80%), Gaps = 14/292 (4%)

Query: 1   MERLILSSLLL-LLLSSVLASAV-------AVNDD-DAMIRQVVPSDGEQSEDHLLNAEH 51
           MER     L   +LLS+ +A  V       AV+D+ D +IRQVV      ++D  L AE 
Sbjct: 1   MERFNAIPLFFAILLSATVAYGVSSDQINSAVSDEEDILIRQVVSG----ADDRPLTAEQ 56

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           HF  FK KF KTY T EEHDYRFRVFKANLR+AKR Q LDP AVHGVT+FSDLT SEFR 
Sbjct: 57  HFQDFKLKFGKTYTTDEEHDYRFRVFKANLRKAKRHQKLDPDAVHGVTRFSDLTESEFRE 116

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
            F+GLNR LRLPADA +APILPT++L +DFDWRD GAVT VKDQG+CGSCWSFSA GALE
Sbjct: 117 NFVGLNR-LRLPADAHQAPILPTDNLASDFDWRDQGAVTPVKDQGSCGSCWSFSAVGALE 175

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           GA+FLSTG+L+SLSEQQLVDCDHECDPEE+G+CD+GCNGGLM SAFEYI+KAGG+ERE+D
Sbjct: 176 GANFLSTGKLISLSEQQLVDCDHECDPEEAGACDAGCNGGLMTSAFEYIVKAGGLEREED 235

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           YPYTGTD GSCKF   KIAA+ +NFSVIS+D DQ+AANLVK+GPLA  + ++
Sbjct: 236 YPYTGTDRGSCKFQNGKIAASAANFSVISNDADQIAANLVKNGPLAIGINAV 287


>gi|356545108|ref|XP_003540987.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
          Length = 365

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 196/272 (72%), Positives = 229/272 (84%), Gaps = 9/272 (3%)

Query: 9   LLLLLLSSVLASAVAVND---DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           LLL+  S V A+  A +D   ++ +I QVV  DG    D  L AEHHF  FK +F K Y 
Sbjct: 8   LLLVAFSLVFAAVSASSDGGNEEPLIMQVV--DGG---DVRLGAEHHFLEFKRRFGKAYD 62

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           +++EHDYR++VFKAN+RRA+R Q LDP+A HGVT+FSDLTPSEFR + LGL R +RLP D
Sbjct: 63  SEDEHDYRYKVFKANMRRARRHQSLDPSAAHGVTRFSDLTPSEFRNKVLGL-RGVRLPLD 121

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
           A KAPILPT++LP+DFDWRDHGAVT VK+QG+CGSCWSFS TGALEGAHFLSTGELVSLS
Sbjct: 122 ANKAPILPTDNLPSDFDWRDHGAVTPVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLS 181

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEYILK+GGV RE+DYPY+G D G+CKFD
Sbjct: 182 EQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYILKSGGVMREEDYPYSGADSGTCKFD 241

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           K+KIAA+V+NFSV+S DEDQ+AANLVK+GPLA
Sbjct: 242 KTKIAASVANFSVVSLDEDQIAANLVKNGPLA 273


>gi|357473427|ref|XP_003606998.1| Cysteine proteinase [Medicago truncatula]
 gi|355508053|gb|AES89195.1| Cysteine proteinase [Medicago truncatula]
          Length = 362

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 191/277 (68%), Positives = 229/277 (82%), Gaps = 6/277 (2%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M+   L   ++L + SV A +     +D +IRQVV  +G +     L AEHHF+LFK KF
Sbjct: 1   MDHRTLLLFVVLFIFSVSAFSTPDEGEDPIIRQVVDEEGVR-----LGAEHHFNLFKHKF 55

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
            K Y++++EHDYRF++FK+NL RAKR QL+DP+AVHGVT+FSDLTP EFR+  LGL R +
Sbjct: 56  GKVYSSKDEHDYRFKIFKSNLNRAKRHQLMDPSAVHGVTRFSDLTPREFRKSVLGL-RGV 114

Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
            LP DA  APILPT++LP DFDWR+ GAVT VK+QG+CGSCWSFS TGALEGAHFLSTG+
Sbjct: 115 GLPKDANAAPILPTDNLPKDFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGAHFLSTGK 174

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           LVSLSEQQLVDCDHECDPE+ GSCD+GCNGGLMNSAFEYILK+GGV RE+DYPY+GTD G
Sbjct: 175 LVSLSEQQLVDCDHECDPEQPGSCDAGCNGGLMNSAFEYILKSGGVMREEDYPYSGTDRG 234

Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           SCKFDK KIAA+V+NFSV+S DEDQ+AANLVK+GPLA
Sbjct: 235 SCKFDKKKIAASVANFSVVSLDEDQIAANLVKNGPLA 271


>gi|146215998|gb|ABQ10201.1| cysteine protease Cp3 [Actinidia deliciosa]
          Length = 365

 Score =  400 bits (1028), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 193/281 (68%), Positives = 233/281 (82%), Gaps = 11/281 (3%)

Query: 19  ASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFK 78
           AS  + + +D +I+Q+V  DG    DH L+A+HHF LFK +F K+YATQE+HDYRF VFK
Sbjct: 22  ASGKSSDGEDLVIQQIV--DG----DHPLSADHHFRLFKRRFGKSYATQEDHDYRFSVFK 75

Query: 79  ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
            NLRRA+  Q LDP+AVHGVT+FSDLTP+EFRR  LGL +RLR PADA KAPILPT DLP
Sbjct: 76  TNLRRARHHQRLDPSAVHGVTQFSDLTPAEFRRNHLGL-KRLRFPADANKAPILPTEDLP 134

Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
            DFDWRDHGAV  VK+QG+CGSCWSFS TGALEGA+FL+TG+LVSLSEQQLVDCDHECDP
Sbjct: 135 ADFDWRDHGAVASVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDP 194

Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
           EE GSCDSGCNGGLMNSA EY LKAGG+ RE+DYPY+GTD G+CKFD++KIAA+V+NFSV
Sbjct: 195 EEPGSCDSGCNGGLMNSALEYTLKAGGLMREEDYPYSGTDRGTCKFDETKIAASVANFSV 254

Query: 259 ISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           +S DE+Q+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 255 VSLDENQIAANLVKNGPLAVAINAVFMQ----TYVGGVSCP 291


>gi|359492709|ref|XP_002280798.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
 gi|147841854|emb|CAN73591.1| hypothetical protein VITISV_022889 [Vitis vinifera]
 gi|302142582|emb|CBI19785.3| unnamed protein product [Vitis vinifera]
          Length = 371

 Score =  399 bits (1026), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 192/257 (74%), Positives = 221/257 (85%), Gaps = 6/257 (2%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           +D +I QVV SDG    D LLNAE+ F+ FK+KF KTYAT EEHD+RF VFKANLRRAKR
Sbjct: 35  EDLLIHQVV-SDG----DDLLNAEYQFAEFKTKFGKTYATAEEHDHRFNVFKANLRRAKR 89

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
            QLLDP+A HGVT+FSDLTP EFR+ +LGL +RL+LPADAQKAPILPT DLPTDFDWRDH
Sbjct: 90  HQLLDPSAEHGVTQFSDLTPREFRQNYLGL-KRLQLPADAQKAPILPTKDLPTDFDWRDH 148

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAVT VKDQG CGSCWSFS  GALEGAHFL+TG LVSLS QQL+DCD ECDPEE  +CD 
Sbjct: 149 GAVTAVKDQGYCGSCWSFSTIGALEGAHFLATGNLVSLSTQQLLDCDTECDPEEYDACDD 208

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLMN+AFEYILKAGGV +E+DYPYTGTD G C+F+K+KIAA+V+NFSV+S DEDQ+
Sbjct: 209 GCNGGLMNNAFEYILKAGGVAQEEDYPYTGTDRGLCRFNKTKIAASVANFSVVSLDEDQI 268

Query: 267 AANLVKHGPLAGNVASI 283
           AANLVK+GPLA  + ++
Sbjct: 269 AANLVKNGPLAVGINAV 285


>gi|297801998|ref|XP_002868883.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314719|gb|EFH45142.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 368

 Score =  399 bits (1026), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 197/279 (70%), Positives = 228/279 (81%), Gaps = 6/279 (2%)

Query: 1   MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
           M+RL L  S+ +L    V  S+  VND DD +IRQVV      +E  +L +E HFSLFKS
Sbjct: 1   MDRLKLCFSVFVLFFLIVSVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKS 56

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
           KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+  
Sbjct: 57  KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSARHGVTQFSDLTRSEFRKKHLGVRA 116

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
             +LP DA KAPILPT +LP DFDWRD GAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+LVSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LK GG+ +E+DYPYTG D
Sbjct: 177 GKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKD 236

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G +CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLA
Sbjct: 237 GKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLA 275


>gi|19195|emb|CAA78403.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
          Length = 361

 Score =  399 bits (1026), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 190/269 (70%), Positives = 227/269 (84%), Gaps = 5/269 (1%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           LL  L  ++ +SA+A +DDD +IRQVV  +    ++H+LNAEHHFSLFK+KF K YA+QE
Sbjct: 4   LLSFLAFALFSSAIAFSDDDPLIRQVVSGN---DDNHMLNAEHHFSLFKAKFGKIYASQE 60

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK 128
           EHD+R +VFKANL RAKR QLLDP+A HG+T+FSDLTPSEFRR +LGLN+  R   +A+K
Sbjct: 61  EHDHRLKVFKANLHRAKRHQLLDPSAEHGITQFSDLTPSEFRRTYLGLNKP-RPNLNAEK 119

Query: 129 APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
           APILPT DLP+DFDWR+ GAVT VK+QG+CGSCWSFS TGA+EGAHFL+TGELVSLSEQQ
Sbjct: 120 APILPTKDLPSDFDWREKGAVTDVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQ 179

Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK 248
           LVDCDHECDP E   CD+GCNGGLM +AFEY LKAGG++ EKDYPYTG + G C FDKS+
Sbjct: 180 LVDCDHECDPVEKNDCDAGCNGGLMTTAFEYTLKAGGLQLEKDYPYTGRN-GKCHFDKSR 238

Query: 249 IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           IAA+VSNFSV+  DEDQ+AANL+KHGPLA
Sbjct: 239 IAASVSNFSVVGLDEDQIAANLLKHGPLA 267


>gi|225458119|ref|XP_002279862.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
 gi|302142581|emb|CBI19784.3| unnamed protein product [Vitis vinifera]
          Length = 368

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 190/257 (73%), Positives = 215/257 (83%), Gaps = 6/257 (2%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           D+ MIRQV     E   D  LNAE HF  FK++F KTYAT EEHDYRF VFKANLRRAKR
Sbjct: 31  DNLMIRQV-----ESHVDDFLNAERHFEKFKARFQKTYATPEEHDYRFNVFKANLRRAKR 85

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
            QLLDP+AVHGVT+FSDLTP+EFRR +LGLN  LR PADAQ+APILPT++LPTDFDWR++
Sbjct: 86  HQLLDPSAVHGVTQFSDLTPAEFRRDYLGLNP-LRFPADAQQAPILPTDNLPTDFDWREN 144

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAVT VK+QG CGSCWSFS  GALEGAHFL+TG L SLSEQQLVDCD ECDPEE  +CD 
Sbjct: 145 GAVTPVKNQGNCGSCWSFSTIGALEGAHFLATGNLESLSEQQLVDCDRECDPEEYDACDD 204

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLMN+AFEYILK GGVEREKDYPYTG D   CKF++SKI A+VSNFSV+S DEDQ+
Sbjct: 205 GCNGGLMNNAFEYILKTGGVEREKDYPYTGRDRSPCKFNESKIVASVSNFSVVSIDEDQI 264

Query: 267 AANLVKHGPLAGNVASI 283
           AANLVK+GPLA  + ++
Sbjct: 265 AANLVKNGPLAVGINAV 281


>gi|312281839|dbj|BAJ33785.1| unnamed protein product [Thellungiella halophila]
          Length = 373

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/280 (69%), Positives = 228/280 (81%), Gaps = 9/280 (3%)

Query: 3   RLILSSLLLLLL-----SSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFK 57
           +L  S  +LL+L     S ++A   + + DD +IRQVV  DG  +E  +L++E HFSLFK
Sbjct: 5   KLSFSVFVLLILFVSVSSGIVAETSSSDGDDLVIRQVV--DG--AEPKVLSSEDHFSLFK 60

Query: 58  SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
            KF K YA+ EEHDYR  VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+ 
Sbjct: 61  RKFGKVYASSEEHDYRLSVFKANLRRARRHQKLDPSARHGVTQFSDLTRSEFRKKHLGVR 120

Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
              +LP DA KAPILPT +LP DFDWRD GAVT VK+QG+CGSCWSFSATGALEGA+FL+
Sbjct: 121 GGFKLPKDANKAPILPTENLPEDFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLA 180

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
           TG+LVSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LK GG+ RE+DYPYTG 
Sbjct: 181 TGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMREEDYPYTGK 240

Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DG +CK DKSKI A+VSNFSVIS DEDQ+AANLVK+GPLA
Sbjct: 241 DGPTCKLDKSKIVASVSNFSVISIDEDQIAANLVKNGPLA 280


>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 368

 Score =  397 bits (1019), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/278 (70%), Positives = 231/278 (83%), Gaps = 4/278 (1%)

Query: 1   MERLILS-SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSK 59
           M+RL LS S+  LL   V AS+     DD +I+QVV  DG  +E ++L++E HFSLFK K
Sbjct: 1   MDRLKLSLSVFALLFIVVSASSDGNEGDDLVIKQVV--DG-GAEPNVLSSEDHFSLFKKK 57

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
           F K YA++EEHDYRF VFK+NLRRA+R Q LDP+A HGVT+FSDLT SEF+R+ LG+   
Sbjct: 58  FGKVYASREEHDYRFSVFKSNLRRARRHQKLDPSARHGVTQFSDLTRSEFKRKHLGVKGG 117

Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
            +LP DA KAPILPT +LP +FDWR+ GAVT VK+QG+CGSCWSFSATGALEGA+FL+TG
Sbjct: 118 FKLPKDANKAPILPTENLPEEFDWRERGAVTPVKNQGSCGSCWSFSATGALEGANFLATG 177

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
           +LVSLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LK GG+ RE+DYPYTG DG
Sbjct: 178 KLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMREEDYPYTGKDG 237

Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            +CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLA
Sbjct: 238 ATCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLA 275


>gi|18414611|ref|NP_567489.1| Papain family cysteine protease [Arabidopsis thaliana]
 gi|2244977|emb|CAB10398.1| cysteine proteinase like protein [Arabidopsis thaliana]
 gi|7268368|emb|CAB78661.1| cysteine proteinase like protein [Arabidopsis thaliana]
 gi|14517442|gb|AAK62611.1| AT4g16190/dl4135w [Arabidopsis thaliana]
 gi|22136546|gb|AAM91059.1| AT4g16190/dl4135w [Arabidopsis thaliana]
 gi|22530956|gb|AAM96982.1| cysteine proteinase [Arabidopsis thaliana]
 gi|23397184|gb|AAN31875.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|110740834|dbj|BAE98514.1| cysteine proteinase like protein [Arabidopsis thaliana]
 gi|332658313|gb|AEE83713.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 373

 Score =  396 bits (1017), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 203/297 (68%), Positives = 242/297 (81%), Gaps = 8/297 (2%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
           LI ++LL   L S + S    +     IRQVVP   E++++ LLNAEHHF+LFKSK+ KT
Sbjct: 9   LIAATLLAGSLGSTVISGEVTDGFVNPIRQVVP---EENDEQLLNAEHHFTLFKSKYEKT 65

Query: 64  YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR-LRL 122
           YATQ EHD+RFRVFKANLRRA+R QLLDP+AVHGVT+FSDLTP EFRR+FLGL RR  RL
Sbjct: 66  YATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRKFLGLKRRGFRL 125

Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
           P D Q APILPT+DLPT+FDWR+ GAVT VK+QG CGSCWSFSA GALEGAHFL+T ELV
Sbjct: 126 PTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKELV 185

Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
           SLSEQQLVDCDHECDP ++ SCDSGC+GGLMN+AFEY LKAGG+ +E+DYPYTG D  +C
Sbjct: 186 SLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDYPYTGRDHTAC 245

Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           KFDKSKI A+VSNFSV+SSDEDQ+AANLV+HGPLA  + ++ +     +++  VS P
Sbjct: 246 KFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQ----TYIGGVSCP 298


>gi|164605518|dbj|BAF98584.1| CM0216.500.nc [Lotus japonicus]
          Length = 360

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 190/272 (69%), Positives = 224/272 (82%), Gaps = 12/272 (4%)

Query: 28  DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
           D MI QVV  +G       L AEHHF  FK +F K YAT+EEH YRF VFK+N+ RA+R 
Sbjct: 27  DPMICQVVDDEG-------LGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRH 79

Query: 88  QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
           QLLDP+AVHGVT+FSDLTP EFR   LGL R + LP+DA  APILPT++LP DFDWR+HG
Sbjct: 80  QLLDPSAVHGVTRFSDLTPMEFRHSVLGL-RGVGLPSDADSAPILPTDNLPKDFDWREHG 138

Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
           AVT VK+QG+CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH+CDPEE+GSCDSG
Sbjct: 139 AVTPVKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCDSG 198

Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
           CNGGLMNSAFEYIL  GGV RE+DYPY+GT+GG+CKFDK+KIAA+V+NFSV+S DEDQ+A
Sbjct: 199 CNGGLMNSAFEYILNNGGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQIA 258

Query: 268 ANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           ANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 259 ANLVKNGPLAVAINAVYMQ----TYVGGVSCP 286


>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
 gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
           Precursor
 gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
 gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
 gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
 gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
          Length = 368

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 196/279 (70%), Positives = 227/279 (81%), Gaps = 6/279 (2%)

Query: 1   MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
           M+RL L  S+ +L    V  S+  VND DD +IRQVV      +E  +L +E HFSLFK 
Sbjct: 1   MDRLKLYFSVFVLSFFIVSVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKR 56

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
           KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+  
Sbjct: 57  KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS 116

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
             +LP DA KAPILPT +LP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+LVSLSEQQLVDCDHECDPEE+ SCDSGCNGGLMNSAFEY LK GG+ +E+DYPYTG D
Sbjct: 177 GKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKD 236

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G +CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLA
Sbjct: 237 GKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLA 275


>gi|218137972|gb|ACK57563.1| cysteine protease-like protein [Arachis hypogaea]
          Length = 364

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 193/275 (70%), Positives = 227/275 (82%), Gaps = 9/275 (3%)

Query: 25  NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
           +DD+ +IRQVV    E  ++HLLNAEHHFS FK+KFSKTYAT+EEHDYRF VFK+NL RA
Sbjct: 25  DDDNILIRQVV----EDGDEHLLNAEHHFSAFKTKFSKTYATKEEHDYRFGVFKSNLLRA 80

Query: 85  KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWR 144
           K  Q LDP+A+HGVTKFSDLTPSEFR QFLGL + L LP+DA  APILPT++LP DFDWR
Sbjct: 81  KSHQELDPSAIHGVTKFSDLTPSEFRSQFLGL-KPLSLPSDAHNAPILPTDNLPKDFDWR 139

Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
           DHGAVT VK+QG  GSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDHECDP+ + +C
Sbjct: 140 DHGAVTNVKNQGTGGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPDLNDAC 199

Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDED 264
           DSGCNGGLM +AF Y  KAGG+ RE+DY YTG D G CKFDKSKIAA+VSNFSV+S DED
Sbjct: 200 DSGCNGGLMTTAFGYTKKAGGLVREEDYLYTGRDRGPCKFDKSKIAASVSNFSVVSLDED 259

Query: 265 QMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           Q+AANLVK+GPL+  + ++ +     +++  VS P
Sbjct: 260 QIAANLVKNGPLSVGINAVYMQ----TYIGGVSCP 290


>gi|57282617|emb|CAE54306.1| putative papain-like cysteine proteinase [Gossypium hirsutum]
          Length = 373

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 191/297 (64%), Positives = 239/297 (80%), Gaps = 9/297 (3%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG-EQSEDHLLNAEHHFSLFKSKFSK 62
           +I S  ++ ++ +   SA    + D +I QV  +DG E +E  LL AEHH+SLFK +F K
Sbjct: 11  VIFSFFIVGVICTETFSAEGF-EVDPLIEQV--TDGHEGAEPQLLTAEHHYSLFKKRFKK 67

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
           +Y +Q+EHDYRF++F+ NLRRA R Q LDP+A HGVT+FSDLTP EFR+ +LGL RRLRL
Sbjct: 68  SYGSQKEHDYRFKIFQVNLRRAARHQNLDPSATHGVTQFSDLTPGEFRKAYLGL-RRLRL 126

Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
           P DA +APILPT++LP DFDWR+ GAVT VK+QG+CGSCWSFS TGALEGA+FL+TG+LV
Sbjct: 127 PKDATEAPILPTDNLPQDFDWREKGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGKLV 186

Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
           SLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTGTD G+C
Sbjct: 187 SLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGTC 246

Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           KFD +K+AA V+NFSV+S DEDQ+AANL K+GPLA  + ++ +     +++  VS P
Sbjct: 247 KFDNTKVAAKVANFSVVSLDEDQIAANLFKNGPLAVAINAVFMQ----TYIGGVSCP 299


>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
          Length = 360

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 191/272 (70%), Positives = 228/272 (83%), Gaps = 11/272 (4%)

Query: 28  DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
           D +IRQV  +DG+    H+LNAEHHF+ FK+KF K+YATQEEHDYRF VF+ANLRRAK  
Sbjct: 24  DPLIRQV--TDGDH---HMLNAEHHFTTFKTKFGKSYATQEEHDYRFGVFRANLRRAKLH 78

Query: 88  QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
             LDP+A HGVTKFSDLTP EF+RQ+LGL + LRLP+ A KAPILPT+DLP +FDWRD G
Sbjct: 79  AKLDPSAEHGVTKFSDLTPEEFKRQYLGL-KPLRLPSTANKAPILPTSDLPENFDWRDKG 137

Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
           AVT VK+QG+CGSCW+FS TGALEGAH+LSTGELVSLSEQQLVDCDH CDPEE G+CD+G
Sbjct: 138 AVTPVKNQGSCGSCWAFSTTGALEGAHYLSTGELVSLSEQQLVDCDHVCDPEEYGACDAG 197

Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
           CNGGLMN+AF+YIL+AGGV+ EKDYPY+G D  +CKFDKSK+AA V+NFSV+S DEDQ+A
Sbjct: 198 CNGGLMNNAFDYILQAGGVQTEKDYPYSGRD-ETCKFDKSKVAATVANFSVVSLDEDQIA 256

Query: 268 ANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           ANLVKHGPLA  + +I +     +++  VS P
Sbjct: 257 ANLVKHGPLAVGINAIFMQ----TYIGGVSCP 284


>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
          Length = 368

 Score =  393 bits (1009), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 195/279 (69%), Positives = 227/279 (81%), Gaps = 6/279 (2%)

Query: 1   MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
           M+RL L  S+ +L    V  S+  VND DD +IRQVV      +E  +L +E HFSLFK 
Sbjct: 1   MDRLKLYFSVFVLSFFIVSVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKR 56

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
           KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+  
Sbjct: 57  KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS 116

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
             +LP DA KAPILPT +LP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+LVSLSEQQLVDCDHECDPEE+ SCDSGCNGGLMNSAFE+ LK GG+ +E+DYPYTG D
Sbjct: 177 GKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEHTLKTGGLMKEEDYPYTGKD 236

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G +CK DKSKI A+VSNFSVIS DE+Q+AANLVK+GPLA
Sbjct: 237 GKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLA 275


>gi|297824991|ref|XP_002880378.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326217|gb|EFH56637.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 360

 Score =  392 bits (1007), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 188/275 (68%), Positives = 223/275 (81%), Gaps = 8/275 (2%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           R++ S  LL     V  S     D+D +IRQVV    +++E  +L++E HF+LFK KF K
Sbjct: 5   RVLFSVSLLF----VFVSVSICGDEDLLIRQVV----DEAEPKVLSSEDHFTLFKKKFGK 56

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRL 122
            Y + EEH YRF VFKANLRRA R Q +DP+A HGVT+FSDLT SEFRR+ LG+    +L
Sbjct: 57  DYGSIEEHYYRFSVFKANLRRAMRHQKMDPSARHGVTQFSDLTGSEFRRKHLGVTGGFKL 116

Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
           P DA +APILPT++LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGAHFL+TG+LV
Sbjct: 117 PKDANQAPILPTHNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLV 176

Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
           SLSEQQLVDCDHECDPEE+GSCDSGCNGGLMNSAFEY LK GG+ RE+DYPYTGTDGGSC
Sbjct: 177 SLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMREEDYPYTGTDGGSC 236

Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           K D+SKI A+VSNFSV+S +EDQ+AANLVK+GPLA
Sbjct: 237 KLDRSKIVASVSNFSVVSINEDQIAANLVKNGPLA 271


>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 387

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 189/299 (63%), Positives = 237/299 (79%), Gaps = 9/299 (3%)

Query: 5   ILSSLLLLLLSS--VLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           +++++   L SS  +++     +D D +IRQVV +DG+ +  H L AEHHFSLFK +F K
Sbjct: 10  VITAVTATLCSSEPLVSQHSVEHDGDPLIRQVVENDGDFNH-HALGAEHHFSLFKRRFGK 68

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-NRRLR 121
           +YAT+EEHD RF++FKAN+RRA+R Q  DP+A+HGVT+FSDLTP EFR+ FLGL   RLR
Sbjct: 69  SYATEEEHDRRFKIFKANMRRAERHQSFDPSAIHGVTQFSDLTPFEFRKAFLGLRGHRLR 128

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
           LP D   APILPT +LP DFDWR HG VT VK+QG+CGSCWSFS TGALEGA+FL+TGEL
Sbjct: 129 LPVDTNAAPILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWSFSTTGALEGANFLATGEL 188

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQQLVDCDHECDPEE  +CDSGCNGGLMNSAFEY LKAGG+ +E+DYPY G D  +
Sbjct: 189 VSLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLKAGGLMKEQDYPYAGIDRNT 248

Query: 242 CKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           C FDKSKIAA+++NFSV++S DEDQ+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 249 CNFDKSKIAASIANFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQ----TYIGGVSCP 303


>gi|18399697|ref|NP_565512.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
 gi|12643282|sp|P43295.2|A494_ARATH RecName: Full=Probable cysteine proteinase A494; Flags: Precursor
 gi|4567274|gb|AAD23687.1| cysteine proteinase [Arabidopsis thaliana]
 gi|116325924|gb|ABJ98563.1| At2g21430 [Arabidopsis thaliana]
 gi|330252083|gb|AEC07177.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
          Length = 361

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 186/272 (68%), Positives = 218/272 (80%), Gaps = 4/272 (1%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           L  L  + L  V  S     D+D +IRQVV    +++E  +L++E HF+LFK KF K Y 
Sbjct: 5   LRVLFSVSLIFVFVSVSVCGDEDVLIRQVV----DETEPKVLSSEDHFTLFKKKFGKVYG 60

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           + EEH YRF VFKANL RA R Q +DP+A HGVT+FSDLT SEFRR+ LG+    +LP D
Sbjct: 61  SIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKGGFKLPKD 120

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
           A +APILPT +LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGAHFL+TG+LVSLS
Sbjct: 121 ANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLS 180

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEY LK GG+ REKDYPYTGTDGGSCK D
Sbjct: 181 EQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLD 240

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +SKI A+VSNFSV+S +EDQ+AANL+K+GPLA
Sbjct: 241 RSKIVASVSNFSVVSINEDQIAANLIKNGPLA 272


>gi|3377952|emb|CAA08906.1| cysteine proteinase [Cicer arietinum]
          Length = 362

 Score =  390 bits (1002), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 197/263 (74%), Positives = 229/263 (87%), Gaps = 6/263 (2%)

Query: 16  SVLASAVAV-NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
           SV+A+A    N+DD +IRQV     +  +D LLNAEHHF+ FKSKFSK+YAT+EEHDYRF
Sbjct: 13  SVVATATKDDNNDDFLIRQVT----DHEDDQLLNAEHHFTTFKSKFSKSYATKEEHDYRF 68

Query: 75  RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT 134
            VFK+NL++AK  Q LDP+A HGVTKFSDLT SEFRRQFLGL +RLRLPA AQKAPILPT
Sbjct: 69  GVFKSNLKKAKLHQKLDPSAEHGVTKFSDLTASEFRRQFLGLKKRLRLPAHAQKAPILPT 128

Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
           N+LP DFDWR+ GAVT VKDQG+CGSCW+FS TGALEGA++L+TG+LVSLSEQQLVDCDH
Sbjct: 129 NNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGANYLATGKLVSLSEQQLVDCDH 188

Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
            CDP+E  SCDSGCNGGLMN+AFEY+L++GGV RE+DY YTG D GSCKFDKSKIAA+VS
Sbjct: 189 VCDPDEYNSCDSGCNGGLMNNAFEYLLQSGGVVREQDYSYTGRD-GSCKFDKSKIAASVS 247

Query: 255 NFSVISSDEDQMAANLVKHGPLA 277
           NFSV+S DEDQ+AANLVK+GPLA
Sbjct: 248 NFSVVSVDEDQIAANLVKNGPLA 270


>gi|94556727|gb|ABF46642.1| papain-like cysteine proteinase [Pachysandra terminalis]
          Length = 374

 Score =  390 bits (1001), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 202/302 (66%), Positives = 242/302 (80%), Gaps = 12/302 (3%)

Query: 5   ILSSLLLLLLSSVL-----ASAVAVND-DDAMIRQVVP-SDGEQSEDHLLNAEHHFSLFK 57
           +LS  +LLL SS L     AS V+ ++ DD +IRQVV  +D   ++D LLNAEHHFS FK
Sbjct: 3   LLSRFVLLLFSSSLVFAATASTVSSDESDDLLIRQVVAGADDHDNDDLLLNAEHHFSSFK 62

Query: 58  SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
            +F K Y + +EHD RF VFKANLRRAKR Q+LDP+AVHGVT+F DLTP+EFRR +LGL 
Sbjct: 63  KRFGKAYTSCDEHDRRFGVFKANLRRAKRNQILDPSAVHGVTQFFDLTPAEFRRTYLGL- 121

Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
           +RLRLPAD  +APILPTNDLP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+
Sbjct: 122 KRLRLPADTHEAPILPTNDLPADFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLA 181

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
           TG+LVSLSEQQLVDCDH CD E+  SCDSGCNGGLM SAFEY LKAGG+ERE+DYPYTGT
Sbjct: 182 TGKLVSLSEQQLVDCDHVCDSEDPSSCDSGCNGGLMTSAFEYTLKAGGLEREEDYPYTGT 241

Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVS 297
           D   CKFDK+KIA + SNFSV+S DE+Q+AANLV +GPLA  + ++ +     +++  VS
Sbjct: 242 DHSKCKFDKTKIAVSASNFSVVSLDENQIAANLVTNGPLAIGINAMFMQ----TYIGGVS 297

Query: 298 SP 299
            P
Sbjct: 298 CP 299


>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
          Length = 335

 Score =  389 bits (999), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 192/272 (70%), Positives = 229/272 (84%), Gaps = 11/272 (4%)

Query: 28  DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
           D +IRQVV    + +EDH+LNAEHHFS FKSKFSKTYAT+EEHDYRF VFK+N+RRAK  
Sbjct: 1   DLLIRQVV----DDNEDHVLNAEHHFSTFKSKFSKTYATKEEHDYRFGVFKSNVRRAKLH 56

Query: 88  QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
             LDP+AVHGVTKFSDLTPSEFRRQFLGL + LRLP  AQKAPILPT+DLP DFDWRD G
Sbjct: 57  AKLDPSAVHGVTKFSDLTPSEFRRQFLGL-KPLRLPEHAQKAPILPTHDLPEDFDWRDKG 115

Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
           AVT VK+QG+CGSCW+FS TGALEG+HFL+TGELVSLS+QQLVDCDH CDPE+ G+CDSG
Sbjct: 116 AVTHVKNQGSCGSCWAFSTTGALEGSHFLATGELVSLSDQQLVDCDHVCDPEQYGACDSG 175

Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
           CNGGLMN+AFEYIL++GGV+RE+DYPYTG D G    D++  AA+VSNFSV+S DEDQ++
Sbjct: 176 CNGGLMNNAFEYILESGGVQREEDYPYTGRDRGPA-IDEAN-AASVSNFSVVSLDEDQIS 233

Query: 268 ANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           ANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 234 ANLVKNGPLAIGINAVFMQ----TYIGGVSCP 261


>gi|33945877|emb|CAE45588.1| papain-like cysteine proteinase-like protein 1 [Lotus japonicus]
          Length = 359

 Score =  389 bits (999), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 189/273 (69%), Positives = 224/273 (82%), Gaps = 13/273 (4%)

Query: 28  DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
           D MI QVV  +G       L AEHHF  FK +F K YAT+EEH YRF VFK+N+ RA+R 
Sbjct: 27  DPMICQVVDDEG-------LGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRH 79

Query: 88  QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
           QLLDP+AVHGVT+FSDLTP EF+   LGL R + LP+DA  APILPT++LP DFDWR+HG
Sbjct: 80  QLLDPSAVHGVTQFSDLTPMEFQHSVLGL-RGVGLPSDADSAPILPTDNLPKDFDWREHG 138

Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE-CDPEESGSCDS 206
           AVT VK+QG+CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH+ CDPEE+GSCDS
Sbjct: 139 AVTPVKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQQCDPEEAGSCDS 198

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLMNSAFEYIL  GGV RE+DYPY+GT+GG+CKFDK+KIAA+V+NFSV+S DEDQ+
Sbjct: 199 GCNGGLMNSAFEYILNNGGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQI 258

Query: 267 AANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 259 AANLVKNGPLAVAINAVYMQ----TYVGGVSCP 287


>gi|51969854|dbj|BAD43619.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 185/272 (68%), Positives = 217/272 (79%), Gaps = 4/272 (1%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           L  L  + L  V  S     D+D +IRQVV    +++E  +L++E HF+LFK KF K Y 
Sbjct: 5   LRVLFSVSLIFVFVSVSVCGDEDVLIRQVV----DETEPKVLSSEDHFTLFKKKFGKVYG 60

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           + EEH YRF VFKANL RA R Q +DP+A HGVT+FSDLT SEFRR+ LG+    +LP D
Sbjct: 61  SIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKGGFKLPKD 120

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
           A +APILPT +LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGAHFL+TG+LVSLS
Sbjct: 121 ANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLS 180

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQQLVDCDHECDPEE GSCDSGCNG LMNSAFEY LK GG+ REKDYPYTGTDGGSCK D
Sbjct: 181 EQQLVDCDHECDPEEEGSCDSGCNGRLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLD 240

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +SKI A+VSNFSV+S +EDQ+AANL+K+GPLA
Sbjct: 241 RSKIVASVSNFSVVSINEDQIAANLIKNGPLA 272


>gi|33945878|emb|CAE45589.1| papain-like cysteine proteinase-like protein 2 [Lotus japonicus]
          Length = 361

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 188/273 (68%), Positives = 222/273 (81%), Gaps = 13/273 (4%)

Query: 28  DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
           D MI QVV  +G       L AEHHF  FK +F K YAT+EEH YRF VFK+N+ RA+R 
Sbjct: 27  DPMICQVVDDEG-------LGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRH 79

Query: 88  QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
           QLLDP+AVHGVT+FSDLTP EFR   LGL R + LP+DA  APILPT++LP DFDWR+HG
Sbjct: 80  QLLDPSAVHGVTRFSDLTPMEFRHSVLGL-RGVGLPSDADSAPILPTDNLPKDFDWREHG 138

Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE-CDPEESGSCDS 206
           AVT VK+QG+CGSCWSFSATGALEGAHFLSTG+LVSLSEQQLVDCDHE CDPEE+GSCDS
Sbjct: 139 AVTPVKNQGSCGSCWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHEQCDPEEAGSCDS 198

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GC GGLMNSAFEYIL  GGV RE+DYPY+GT GG+CKFD++KIAA+V+NFSV+S DEDQ+
Sbjct: 199 GCKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQI 258

Query: 267 AANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 259 AANLVKNGPLAVAINAVYMQ----TYVGGVSCP 287


>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
          Length = 360

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 181/257 (70%), Positives = 211/257 (82%), Gaps = 4/257 (1%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           +D +IRQVV  D +Q    LL+AE HFS F S++ K+YA + EH YRF VFK+NLRRA+R
Sbjct: 23  EDPVIRQVVSDDQQQ----LLSAEAHFSSFLSRYGKSYADEAEHAYRFSVFKSNLRRARR 78

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
            Q LDPTAVHGVT+F+DLTPSEFRR +LGL RR R       APILPTN+LP DFDWRDH
Sbjct: 79  HQRLDPTAVHGVTRFADLTPSEFRRTYLGLRRRPRTAGSTHDAPILPTNELPADFDWRDH 138

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAVT VK+QG+CGSCWSFSA GALEGA++LSTG LVSLSEQQLVDCDHECD  E  SCD 
Sbjct: 139 GAVTPVKNQGSCGSCWSFSAAGALEGANYLSTGNLVSLSEQQLVDCDHECDSSEPDSCDQ 198

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLM +AFEYILK+GG+ERE DYPYTGTD G+CKF+K+KI+A  SNFSV+S DEDQ+
Sbjct: 199 GCNGGLMTTAFEYILKSGGLEREADYPYTGTDRGTCKFNKAKISAVASNFSVVSIDEDQI 258

Query: 267 AANLVKHGPLAGNVASI 283
           AANLVKHGPLA  + ++
Sbjct: 259 AANLVKHGPLAVGINAV 275


>gi|164605519|dbj|BAF98585.1| CM0216.510.nc [Lotus japonicus]
          Length = 360

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 184/272 (67%), Positives = 221/272 (81%), Gaps = 12/272 (4%)

Query: 28  DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
           D +IRQVV  +G       L AEHHF  FK +F K Y ++EEH YRF VFK+N+ RA+R 
Sbjct: 27  DPLIRQVVDGEG-------LGAEHHFLEFKRRFGKVYVSEEEHGYRFNVFKSNMHRARRH 79

Query: 88  QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
           QLLDP+AVHGVT+FSDLTP EFR   LGL R + LP+DA  APIL T++LP DFDWR+HG
Sbjct: 80  QLLDPSAVHGVTRFSDLTPMEFRHSVLGL-RGVGLPSDADSAPILRTDNLPKDFDWREHG 138

Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
           AVT VK+QG+CG+CWSFSATGALEGAHFLSTG+LVSLSEQQLVDCDHECDPEE+GSCDSG
Sbjct: 139 AVTPVKNQGSCGACWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSG 198

Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
           C GGLMNSAFEYIL  GGV RE+DYPY+GT GG+CKFD++KIAA+V+NFSV+S DEDQ+A
Sbjct: 199 CKGGLMNSAFEYILNNGGVMREEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQIA 258

Query: 268 ANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           ANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 259 ANLVKNGPLAVAINAVYMQ----TYVGGVSCP 286


>gi|449461649|ref|XP_004148554.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD19a-like
           [Cucumis sativus]
          Length = 381

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 182/299 (60%), Positives = 230/299 (76%), Gaps = 15/299 (5%)

Query: 5   ILSSLLLLLLSS--VLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           +++++   L SS  +++     +D D +IRQVV +DG+ +  H L AEHHFSLFK +F K
Sbjct: 10  VITAVTATLCSSEPLVSQHSVEHDGDPLIRQVVENDGDFNH-HALGAEHHFSLFKRRFGK 68

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-NRRLR 121
           +YAT+EEHD RF++FKAN+RRA+R Q  DP+A+HGVT+FSDLTP EFR+ FLGL   RLR
Sbjct: 69  SYATEEEHDRRFKIFKANMRRAERHQSFDPSAIHGVTQFSDLTPFEFRKAFLGLRGHRLR 128

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
           LP D   APILPT +LP DFDWR HG VT VK+QG+CGSCWSFS TGALEGA+FL     
Sbjct: 129 LPVDTNAAPILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWSFSTTGALEGANFL----- 183

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
             LSEQQLVDCDHECDPEE  +CDSGCNGGLMNSAFEY LKAGG+ +E+DYPY G D  +
Sbjct: 184 -XLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLKAGGLMKEQDYPYAGIDRNT 242

Query: 242 CKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           C FDKSKIAA++++FSV++S DEDQ+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 243 CNFDKSKIAASIASFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQ----TYIGGVSCP 297


>gi|516865|emb|CAA52403.1| putative thiol protease [Arabidopsis thaliana]
          Length = 313

 Score =  366 bits (939), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 169/224 (75%), Positives = 192/224 (85%)

Query: 54  SLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQF 113
           +LFK KF K Y + EEH YRF VFKANL RA R Q +DP+A HGVT+FSDLT SEFRR+ 
Sbjct: 1   ALFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKH 60

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
           LG+    +LP DA +APILPT +LP +FDWRD GAVT VK+QG+CGSCWSFS TGALEGA
Sbjct: 61  LGVKGGFKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGA 120

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
           HFL+TG+LVSLSEQQLVDCDHECDPEE GSCDSGCNGGLMNSAFEY LK GG+ REKDYP
Sbjct: 121 HFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYP 180

Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YTGTDGGSCK D+SKI A+VSNFSV+S +EDQ+AANL+K+GPLA
Sbjct: 181 YTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLA 224


>gi|25956267|dbj|BAC41322.1| hypothetical protein [Lotus japonicus]
          Length = 358

 Score =  364 bits (934), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 188/272 (69%), Positives = 222/272 (81%), Gaps = 12/272 (4%)

Query: 28  DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
           D MI QVV  +G       L AEHHF  FK +F K YAT+EEH YRF VFK+N+ RA+R 
Sbjct: 27  DPMICQVVDDEG-------LGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRH 79

Query: 88  QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHG 147
           QLLDP+AVHGVT+FSDLTP EF+   LGL R + LP+DA  APILPT++LP DFDWR HG
Sbjct: 80  QLLDPSAVHGVTQFSDLTPMEFQHSVLGL-RGVGLPSDADSAPILPTDNLPKDFDWRGHG 138

Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
           AVT VK+QG+CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH+CDPEE+GSC SG
Sbjct: 139 AVTPVKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCGSG 198

Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
           CNGGLMNSAFEYIL  GGV RE+DYPY+GT+GG+CKFDK+KIAA+V+NFSV+S DEDQ+A
Sbjct: 199 CNGGLMNSAFEYILNNGGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQIA 258

Query: 268 ANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           ANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 259 ANLVKNGPLAVAINAVYMQ----TYVGGVSCP 286


>gi|357148994|ref|XP_003574963.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
          Length = 377

 Score =  357 bits (916), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 176/289 (60%), Positives = 214/289 (74%), Gaps = 12/289 (4%)

Query: 16  SVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
           S  A+     D+D +IRQVV   G   +D+ L    HF+ F  +F KTY   EEH +R  
Sbjct: 18  SPAAATATAGDEDPLIRQVV--GGADGDDNDLELSSHFTSFVQRFGKTYKDAEEHAHRLS 75

Query: 76  VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAP 130
           VFKANLRRA+R QLLDP+A HG+TKFSDLTP+EFRR FLGL    R     +   A  AP
Sbjct: 76  VFKANLRRARRHQLLDPSAEHGITKFSDLTPAEFRRTFLGLKTSRRSFLREIGGSAHDAP 135

Query: 131 ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLV 190
           +LPT+ LP DFDWRDHGAV  VK+QG+CGSCWSFSA+GALEGA++L+TG++  LSEQQ V
Sbjct: 136 VLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLATGKMEVLSEQQFV 195

Query: 191 DCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA 250
           DCDHECDPEE  SCD+GCNGGLM SAF Y+LK+GG+EREKDYPYTG D G+CKFDKSKI 
Sbjct: 196 DCDHECDPEEPDSCDAGCNGGLMTSAFSYLLKSGGLEREKDYPYTGRD-GTCKFDKSKIV 254

Query: 251 AAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           A+V NFSV+S DE+Q+AANLVKHGPLA  + +  +     +++  VS P
Sbjct: 255 ASVQNFSVVSVDEEQIAANLVKHGPLAIGINAAYMQ----TYIGGVSCP 299


>gi|357162946|ref|XP_003579573.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
          Length = 376

 Score =  353 bits (905), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 178/304 (58%), Positives = 222/304 (73%), Gaps = 14/304 (4%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           + RL +    +LLLS V A +  V  +D +I QVV   G++  +  LNAE HF+ F  +F
Sbjct: 4   LRRLPIVVAAVLLLSGVAALSSPV--EDPLIEQVV--GGDEKNELELNAEAHFASFVQRF 59

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
           +K+Y   +EH +R  VF ANLRRA+R Q LDP+AVHGVTKFSDLTP EFR +FLGL +  
Sbjct: 60  NKSYRDADEHAHRLSVFTANLRRARRHQRLDPSAVHGVTKFSDLTPDEFRDRFLGLRKYR 119

Query: 121 R-----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
           R     L   A  AP LPT+ LPT+FDWR+HGAV  VKDQG+CGSCWSFS +GALEGAH+
Sbjct: 120 RSFLKGLSGSAHDAPALPTDGLPTEFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHY 179

Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
           L+TG+L  LSEQQ+VDCDHECDP E  +CD+GCNGGLM +AF Y+ KAGG+E EKDYPYT
Sbjct: 180 LATGKLEVLSEQQMVDCDHECDPSEPRACDAGCNGGLMTTAFSYLAKAGGLETEKDYPYT 239

Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
           G  GG+CKFDKSKIAA V NFS ++ DEDQ+AANLVKHGPLA  + ++ +     +++  
Sbjct: 240 GR-GGACKFDKSKIAAQVKNFSTVAVDEDQIAANLVKHGPLAIGINAVFMQ----TYIGG 294

Query: 296 VSSP 299
           VS P
Sbjct: 295 VSCP 298


>gi|194705198|gb|ACF86683.1| unknown [Zea mays]
 gi|413936851|gb|AFW71402.1| cysteine protease1 [Zea mays]
          Length = 371

 Score =  352 bits (903), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 174/278 (62%), Positives = 208/278 (74%), Gaps = 12/278 (4%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           +D +IRQVVP  G    D  LNAE HF  F  +F K+Y   +EH YR  VFKANLRRA+R
Sbjct: 24  EDPLIRQVVP--GGDDNDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVFKANLRRARR 81

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
            QLLDP+A HGVTKFSDLTP+EFRR +LGL +  R     L   A +AP+LPT+ LP DF
Sbjct: 82  HQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDF 141

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWRDHGAV  VK+QG+CGSCWSFSA+GALEGAH+L+TG+L  LSEQQ VDCDHECD  E 
Sbjct: 142 DWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEP 201

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
            SCDSGCNGGLM +AF Y+ KAGG+E EKDYPYTG+D G CKFDKSKI A+V NFSV+S 
Sbjct: 202 DSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSD-GKCKFDKSKIVASVQNFSVVSV 260

Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           DE Q++ANL+KHGPLA  + +  +     +++  VS P
Sbjct: 261 DEAQISANLIKHGPLAIGINAAYMQ----TYIGGVSCP 294


>gi|115446097|ref|NP_001046828.1| Os02g0469600 [Oryza sativa Japonica Group]
 gi|47497527|dbj|BAD19579.1| putative cysteine proteinase 1 precursor [Oryza sativa Japonica
           Group]
 gi|113536359|dbj|BAF08742.1| Os02g0469600 [Oryza sativa Japonica Group]
 gi|215701326|dbj|BAG92750.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215704370|dbj|BAG93804.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215708762|dbj|BAG94031.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218200777|gb|EEC83204.1| hypothetical protein OsI_28465 [Oryza sativa Indica Group]
 gi|222622835|gb|EEE56967.1| hypothetical protein OsJ_06681 [Oryza sativa Japonica Group]
          Length = 373

 Score =  349 bits (896), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 173/290 (59%), Positives = 215/290 (74%), Gaps = 12/290 (4%)

Query: 15  SSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRF 74
           S  +A+A    +++ +IRQVV   G    +  LNAE HF+ F  +F K+Y   +EH YR 
Sbjct: 14  SPAVAAASVPGEEEPLIRQVV--GGGDDNELELNAERHFASFVQRFGKSYRDADEHAYRL 71

Query: 75  RVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKA 129
            VFKANLRRA+R QLLDP+A HGVTKFSDLTP+EFRR +LGL    R     L   A +A
Sbjct: 72  SVFKANLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRAYLGLRTSRRAFLRGLGGSAHEA 131

Query: 130 PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
           P+LPT+ LP DFDWRDHGAV  VK+QG+CGSCWSFSA+GALEGA++L+TG++  LSEQQ+
Sbjct: 132 PVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLATGKMDVLSEQQM 191

Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
           VDCDHECD  E  SCD+GCNGGLM +AF Y+LK+GG+E EKDYPYTG D G+CKFDKSKI
Sbjct: 192 VDCDHECDSSEPDSCDAGCNGGLMTNAFSYLLKSGGLESEKDYPYTGRD-GTCKFDKSKI 250

Query: 250 AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
             +V NFSV+S DEDQ+AANLVKHGPLA  + +  +     +++  VS P
Sbjct: 251 VTSVQNFSVVSVDEDQIAANLVKHGPLAIGINAAYMQ----TYIGGVSCP 296


>gi|162459555|ref|NP_001105685.1| cysteine proteinase 1 precursor [Zea mays]
 gi|1706260|sp|Q10716.1|CYSP1_MAIZE RecName: Full=Cysteine proteinase 1; Flags: Precursor
 gi|643597|dbj|BAA08244.1| cysteine proteinase [Zea mays]
          Length = 371

 Score =  349 bits (896), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 173/278 (62%), Positives = 207/278 (74%), Gaps = 12/278 (4%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           +D +IRQVVP  G    D  LNAE HF  F  +F K+Y   +EH YR  VFK NLRRA+R
Sbjct: 24  EDPLIRQVVP--GGDDNDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARR 81

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
            QLLDP+A HGVTKFSDLTP+EFRR +LGL +  R     L   A +AP+LPT+ LP DF
Sbjct: 82  HQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDF 141

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWRDHGAV  VK+QG+CGSCWSFSA+GALEGAH+L+TG+L  LSEQQ VDCDHECD  E 
Sbjct: 142 DWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEP 201

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
            SCDSGCNGGLM +AF Y+ KAGG+E EKDYPYTG+D G CKFDKSKI A+V NFSV+S 
Sbjct: 202 DSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSD-GKCKFDKSKIVASVQNFSVVSV 260

Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           DE Q++ANL+KHGPLA  + +  +     +++  VS P
Sbjct: 261 DEAQISANLIKHGPLAIGINAAYMQ----TYIGGVSCP 294


>gi|242061538|ref|XP_002452058.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
 gi|241931889|gb|EES05034.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
          Length = 371

 Score =  349 bits (895), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 172/278 (61%), Positives = 207/278 (74%), Gaps = 12/278 (4%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           +D +IRQVVP  G    +  LNAE HF  F  +F K+Y   EEH YR  +FKANLRRA+R
Sbjct: 24  EDPLIRQVVP--GGDDNELELNAESHFLSFVQRFGKSYKDAEEHAYRLSIFKANLRRARR 81

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
            QLLDP+A HGVTKFSDLTP+EFRR +LGL +  R     L   A +AP+LPT+ LP DF
Sbjct: 82  HQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGKSANEAPVLPTDGLPDDF 141

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWRDHGAVT VK+QG+CGSCWSFS +GALEGAH+L+TG+L  LSEQQ+VDCDH CD  E 
Sbjct: 142 DWRDHGAVTPVKNQGSCGSCWSFSTSGALEGAHYLATGKLEVLSEQQMVDCDHVCDTSEP 201

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
            SCDSGCNGGLM +AF Y+ KAGG+E EKDYPYTG+D   CKFDKSKI A+V NFSV+S 
Sbjct: 202 DSCDSGCNGGLMTNAFSYLQKAGGLESEKDYPYTGSD-DKCKFDKSKIVASVQNFSVVSV 260

Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           DE Q+AANL+KHGPLA  + +  +     +++  VS P
Sbjct: 261 DEGQIAANLIKHGPLAIGINAAYMQ----TYIGGVSCP 294


>gi|41019551|tpe|CAD66657.1| TPA: putative cysteine proteinase precursor [Hordeum vulgare subsp.
           vulgare]
 gi|326489967|dbj|BAJ94057.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525847|dbj|BAJ93100.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 377

 Score =  345 bits (886), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 170/285 (59%), Positives = 212/285 (74%), Gaps = 12/285 (4%)

Query: 20  SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKA 79
           +  A  D++ +IRQVV   G    D+ L  +  F  F  +F KTY   EEH +R  VFKA
Sbjct: 22  ATAAAGDEEPLIRQVV--GGADPLDNDLELDSQFVGFVQRFGKTYRDAEEHAHRLSVFKA 79

Query: 80  NLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPT 134
           NLRRA+R QLLDP+A HGVTKFSDLTP+EFRR +LGL    R     +   A  AP+LPT
Sbjct: 80  NLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRTYLGLKTTRRSFLREMAGSAHDAPVLPT 139

Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
           + LP DFDWRDHGAV  VK+QG+CGSCWSFSA+GALEGA++L++G++  LSEQQLVDCDH
Sbjct: 140 DGLPEDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLASGKMEVLSEQQLVDCDH 199

Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
           ECDP E  SCD+GCNGGLM SAF Y+LK+GG+EREKDYPYTG D G+CKFDKSKIAA+V 
Sbjct: 200 ECDPSEPDSCDAGCNGGLMTSAFSYLLKSGGLEREKDYPYTGKD-GTCKFDKSKIAASVQ 258

Query: 255 NFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           N+SV++ DE+Q+AANLVK+GPLA  + +  +     +++  VS P
Sbjct: 259 NYSVVAVDEEQIAANLVKYGPLAIGINAAYMQ----TYIGGVSCP 299


>gi|1619903|gb|AAB16996.1| thiol protease isoform B, partial [Glycine max]
          Length = 319

 Score =  345 bits (884), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 167/230 (72%), Positives = 195/230 (84%), Gaps = 4/230 (1%)

Query: 55  LFKSKF-SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQF 113
           L + KF  + YAT+EEHD+RF VFK+NLRRA       P  VHGVTKFSDLTP+EFRRQF
Sbjct: 7   LSRPKFRPRPYATKEEHDHRFGVFKSNLRRASCTPSSTPR-VHGVTKFSDLTPAEFRRQF 65

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
           LGL + +R PA AQKAPILPT DLP DFDWRD GAVT VKDQG CGSCWSFS TGALEGA
Sbjct: 66  LGL-KAVRFPAHAQKAPILPTKDLPKDFDWRDKGAVTNVKDQGGCGSCWSFSTTGALEGA 124

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
           ++L+TGELVSLSEQQLVDCDH CDPEE G+CDSGCNGGLMN+AFEYIL++GGV++EKDYP
Sbjct: 125 YYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQSGGVQKEKDYP 184

Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           YTG D G+CKFDK+K+AA VSN+SV+  DE+Q+AANLVK+GPLA  + ++
Sbjct: 185 YTGRD-GTCKFDKTKVAATVSNYSVVCLDEEQIAANLVKNGPLAVAINAV 233


>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 381

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 166/278 (59%), Positives = 208/278 (74%), Gaps = 12/278 (4%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           +D +I QVV  D E   +  LNAE HF+ F  +F K+Y   +EH++R  VF+ANLRRA+R
Sbjct: 34  EDPLIEQVVGGDAENELE--LNAEAHFASFVRRFGKSYRDADEHEHRLSVFRANLRRARR 91

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTDF 141
            Q LDP+AVHG+TKFSDLTP EFR +FLGL +  R     +   A  AP LPT+ LPT+F
Sbjct: 92  HQRLDPSAVHGITKFSDLTPDEFRERFLGLRKSRRSFLKGISGSAHDAPALPTDGLPTEF 151

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWR+HGAV  VKDQG+CGSCWSFS +GALEGA++L+TG+L  LSEQQLVDCDHECDP E 
Sbjct: 152 DWREHGAVGPVKDQGSCGSCWSFSTSGALEGANYLATGKLEVLSEQQLVDCDHECDPSEP 211

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
            +CD+GCNGGLM +AF Y+ KAGG+E EKDYPYTG +  +CKFDKSKIAA V NFS ++ 
Sbjct: 212 RACDAGCNGGLMTTAFSYLAKAGGLETEKDYPYTGRN-SACKFDKSKIAAQVKNFSTVAI 270

Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           DEDQ+AANLVKHGPLA  + ++ +     +++  VS P
Sbjct: 271 DEDQIAANLVKHGPLAIGINAVFMQ----TYIGGVSCP 304


>gi|302771610|ref|XP_002969223.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
 gi|300162699|gb|EFJ29311.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
          Length = 367

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 160/278 (57%), Positives = 208/278 (74%), Gaps = 11/278 (3%)

Query: 28  DAMIRQVVPSDGEQSEDHL------LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
           D+ IR+V  +  ++S   L      L+ E HF  F ++F K YAT E + +R +VF+ANL
Sbjct: 27  DSGIREVTDTARDESNGRLDAAKALLDVETHFKSFIARFGKAYATAEAYAHRLKVFEANL 86

Query: 82  RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF 141
            RA   Q LDP+AVHG+T+FSDLT  EF++QFLGL    RL  +A KAP+LPTNDLP DF
Sbjct: 87  VRAVSHQALDPSAVHGITQFSDLTEEEFKQQFLGLRVPSRL-REANKAPVLPTNDLPEDF 145

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWR+HGAVT VK+QGACGSCW+FS TGA+EGAHFL TG+L+SLSEQQLVDCDH CDP + 
Sbjct: 146 DWREHGAVTEVKNQGACGSCWAFSTTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDK 205

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
            SCD+GCNGGLM +A++Y++K+GG+E E DYPYTG   G C+F+ +KI A+V+NFS +S 
Sbjct: 206 VSCDAGCNGGLMTNAYDYVMKSGGLETETDYPYTGNSNGKCQFNANKIVASVANFSTVSL 265

Query: 262 DEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           DEDQ+AANLVKHGPLA  + ++ +     +++  VS P
Sbjct: 266 DEDQIAANLVKHGPLAIGINAVFMQ----TYIGGVSCP 299


>gi|56682917|gb|AAW21813.1| cysteine protease [Triticum aestivum]
          Length = 377

 Score =  337 bits (863), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 167/279 (59%), Positives = 208/279 (74%), Gaps = 12/279 (4%)

Query: 26  DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
           D++ +IRQVV   G    D+ L  +     F  +F KTY   EEH +R  VFKANLRRA+
Sbjct: 28  DEEPLIRQVV--GGADPLDNDLELDSQLLGFVQRFGKTYRDAEEHAHRLSVFKANLRRAR 85

Query: 86  RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-----LPADAQKAPILPTNDLPTD 140
           R Q+LDP+A HGVTKFSDLTP+EFRR FLGL    R     +   A  AP+LPT+ LP D
Sbjct: 86  RHQMLDPSAEHGVTKFSDLTPAEFRRTFLGLKTTRRSFLREMAGSAHDAPVLPTDGLPED 145

Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
           FDWRDHGAV  VK+QG+C SCWSFSA+GALEGA++L+TG++  LSEQQLVDCDHECDP E
Sbjct: 146 FDWRDHGAVGPVKNQGSCWSCWSFSASGALEGANYLATGKMEVLSEQQLVDCDHECDPAE 205

Query: 201 SGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS 260
             SCD+GCNGGLM SAF Y+LK+GG+EREKDYPYTG D G+CKF+KSKIAA+V NFSV++
Sbjct: 206 PDSCDAGCNGGLMTSAFSYLLKSGGLEREKDYPYTGKD-GTCKFEKSKIAASVQNFSVVA 264

Query: 261 SDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
            DE+Q+AANLV++GPLA  + +  +     +++  VS P
Sbjct: 265 VDEEQIAANLVEYGPLAIGINAAYMQ----TYIGGVSCP 299


>gi|302754322|ref|XP_002960585.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
 gi|300171524|gb|EFJ38124.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
          Length = 330

 Score =  337 bits (863), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 158/265 (59%), Positives = 204/265 (76%), Gaps = 7/265 (2%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           V  +G++S   LL+ E HF  F ++F K YAT E + +R +VF+ANL RA   Q LDP+A
Sbjct: 5   VVDNGDRSA--LLDVETHFKSFIARFGKAYATAEAYAHRLKVFEANLVRAVSHQALDPSA 62

Query: 95  VHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKD 154
           VHG+T+FSDLT  EF++QFLGL    RL  +A KAP+LPTNDLP DFDWR+HGAVT VK+
Sbjct: 63  VHGITQFSDLTEEEFKQQFLGLRVPSRL-REANKAPVLPTNDLPEDFDWREHGAVTEVKN 121

Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
           QGACGSCW+FS TGA+EGAHFL TG+L+SLSEQQLVDCDH CDP +  SCD+GCNGGLM 
Sbjct: 122 QGACGSCWAFSTTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLMT 181

Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
           +A++Y++K+GG+E E DYPYTG   G C+F+ +KI A+V+NFS +S DEDQ+AANLVKHG
Sbjct: 182 NAYDYVMKSGGLETETDYPYTGNSNGKCQFNANKIVASVANFSTVSLDEDQIAANLVKHG 241

Query: 275 PLAGNVASIELPHISFSFLFTVSSP 299
           PLA  + ++ +     +++  VS P
Sbjct: 242 PLAIGINAVFMQ----TYIGGVSCP 262


>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
 gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
 gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
          Length = 381

 Score =  337 bits (863), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 164/256 (64%), Positives = 196/256 (76%), Gaps = 6/256 (2%)

Query: 25  NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
            ++D +I QVV   G + ED  L+AE HF+ F+ +F +TY    E  YR  VF ANLRRA
Sbjct: 32  GEEDPLIEQVV--GGGEEEDAQLDAEAHFASFERRFGRTYRDAGERAYRMSVFAANLRRA 89

Query: 85  KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RLRLPADAQKAPILPTNDLPTDF 141
           +R Q LDPTA HGVTKFSDLTP EFR +FLGL R      +  +  +APILPT+ LP DF
Sbjct: 90  RRHQRLDPTATHGVTKFSDLTPGEFRDRFLGLRRPSLEGLVGGEPHEAPILPTDGLPDDF 149

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWR+HGAV  VKDQG+CGSCWSFS +GALEGAHFL+TG+L  LSEQQ+VDCDHECD  ES
Sbjct: 150 DWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASES 209

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
            +CDSGCNGGLM +AF Y++K+GG++ EKDYPY G +  +CKFDKSKI A V NFSVIS 
Sbjct: 210 RACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGRE-NTCKFDKSKIVAQVKNFSVISV 268

Query: 262 DEDQMAANLVKHGPLA 277
           +EDQ+AANLVKHGPLA
Sbjct: 269 NEDQIAANLVKHGPLA 284


>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
 gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
 gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
          Length = 366

 Score =  336 bits (862), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 175/280 (62%), Positives = 209/280 (74%), Gaps = 14/280 (5%)

Query: 7   SSLLL--LLLSSVLASAVAVNDDDAMIRQV---VPSDGE--QSEDHLLNAEHHFSLFKSK 59
           S+LL     + SV+  + A   DD +IRQV   V SD +   +   L NAE HF  F  +
Sbjct: 4   STLLFSAFCIFSVIFLSSATKPDDDLIRQVTDEVVSDPQILDARSALFNAEVHFRHFIRR 63

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
           + K Y+  EEH++RF VFK+NL RA   Q LDP A HGVTKFSDLT  EFR Q+LGL   
Sbjct: 64  YGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEEFRHQYLGL--- 120

Query: 120 LRLPA--DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
            R P   DA  APILPTNDLP DFDWR+ GAVT VK+QG+CGSCW+FS TGALEGA+FL 
Sbjct: 121 -RAPPLRDAHDAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
           TGELVSLSEQQLVDCDHECDP ++ SCDSGCNGGLM SA++Y LK+GG+E+E+DYPYTG 
Sbjct: 180 TGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGK 239

Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           D G+C F+K+KI A VSNFSV+S DE Q+AANLVK+GPL+
Sbjct: 240 D-GTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLS 278


>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
 gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
 gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
 gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
 gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
          Length = 366

 Score =  336 bits (861), Expect = 8e-90,   Method: Compositional matrix adjust.
 Identities = 175/280 (62%), Positives = 209/280 (74%), Gaps = 14/280 (5%)

Query: 7   SSLLL--LLLSSVLASAVAVNDDDAMIRQV---VPSDGE--QSEDHLLNAEHHFSLFKSK 59
           S+LL     + SV+  + A   DD +IRQV   V SD +   +   L NAE HF  F  +
Sbjct: 4   STLLFSAFCIFSVIFLSSATRPDDDLIRQVTDEVVSDPQILDARSALFNAEVHFRHFIRR 63

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
           + K Y+  EEH++RF VFK+NL RA   Q LDP A HGVTKFSDLT  EFR Q+LGL   
Sbjct: 64  YGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEEFRHQYLGL--- 120

Query: 120 LRLPA--DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
            R P   DA  APILPTNDLP DFDWR+ GAVT VK+QG+CGSCW+FS TGALEGA+FL 
Sbjct: 121 -RAPPLRDAHDAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
           TGELVSLSEQQLVDCDHECDP ++ SCDSGCNGGLM SA++Y LK+GG+E+E+DYPYTG 
Sbjct: 180 TGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGK 239

Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           D G+C F+K+KI A VSNFSV+S DE Q+AANLVK+GPL+
Sbjct: 240 D-GTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLS 278


>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
          Length = 366

 Score =  333 bits (854), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 174/280 (62%), Positives = 208/280 (74%), Gaps = 14/280 (5%)

Query: 7   SSLLL--LLLSSVLASAVAVNDDDAMIRQV---VPSDGE--QSEDHLLNAEHHFSLFKSK 59
           S+LL     + SV+  + A   DD +IRQV   V SD +   +   L NAE HF  F  +
Sbjct: 4   STLLFSAFCIFSVIFLSSATRPDDDLIRQVTDEVVSDPQILDARSALFNAEVHFRHFIRR 63

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
           + K Y+  EEH++RF VFK+NL RA   Q LDP A HGVTKFSDLT   FR Q+LGL   
Sbjct: 64  YGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEGFRHQYLGL--- 120

Query: 120 LRLPA--DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
            R P   DA  APILPTNDLP DFDWR+ GAVT VK+QG+CGSCW+FS TGALEGA+FL 
Sbjct: 121 -RAPPLRDAHDAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
           TGELVSLSEQQLVDCDHECDP ++ SCDSGCNGGLM SA++Y LK+GG+E+E+DYPYTG 
Sbjct: 180 TGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKSGGLEKEEDYPYTGK 239

Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           D G+C F+K+KI A VSNFSV+S DE Q+AANLVK+GPL+
Sbjct: 240 D-GTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLS 278


>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
          Length = 394

 Score =  333 bits (854), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 171/283 (60%), Positives = 207/283 (73%), Gaps = 11/283 (3%)

Query: 5   ILSSLLLLLLSSVLASA-VAVNDDDAM----IRQVVPSDGEQSEDHL----LNAEHHFSL 55
           ILS  LL L+ ++ A    A +D +A+    IR+V   DGE   D L    LNAE HF+ 
Sbjct: 18  ILSLALLFLVPTITAHVHEASSDLNAVLPNPIREVTDMDGEGVIDDLRRGLLNAEAHFAH 77

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
           F  KF+K Y+  EEH  RF +FK NL +A R Q LD  A+HG+ KFSDLT  EF  Q+LG
Sbjct: 78  FVKKFNKEYSGAEEHARRFSIFKKNLHKALRHQKLDRDAIHGINKFSDLTEEEFHEQYLG 137

Query: 116 LNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
           L    R L    Q APILPT+DLP DFDWR+ GAVT VK+QGACGSCW+FS TGA+EGA+
Sbjct: 138 LTTPPRSLSQRTQPAPILPTDDLPPDFDWRELGAVTPVKNQGACGSCWTFSTTGAMEGAN 197

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
           F+ TG+L+SLSEQQLVDCDHECD  E   CDSGCNGGLM +A++Y LKAGG++RE+DYPY
Sbjct: 198 FMKTGKLISLSEQQLVDCDHECDSSEPDVCDSGCNGGLMTTAYQYALKAGGLQREEDYPY 257

Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           TG D GSCKFD +K+AA V+NFS +S DEDQ+AANLVK+GPLA
Sbjct: 258 TGID-GSCKFDNTKVAAMVANFSTVSIDEDQIAANLVKNGPLA 299


>gi|388519111|gb|AFK47617.1| unknown [Medicago truncatula]
          Length = 241

 Score =  332 bits (852), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 155/202 (76%), Positives = 177/202 (87%), Gaps = 4/202 (1%)

Query: 24  VNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR 83
            N DD +IRQVV    + +EDH+LNAEHHF+ FKSKFSK YAT+EEHDYRF VFK+NL +
Sbjct: 26  TNSDDLLIRQVV----DTAEDHILNAEHHFTSFKSKFSKNYATKEEHDYRFGVFKSNLIK 81

Query: 84  AKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDW 143
           AK  Q LDP+A HG+TKFSDLT SEFRRQFLGLN+RLRLPA AQKAPILPTN+LP DFDW
Sbjct: 82  AKLHQKLDPSAQHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKAPILPTNNLPEDFDW 141

Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
           R+ GAVT VKDQG+CGSCW+FS TGALEGA++L+TG+L SLSEQQLVDCDH CDPEE GS
Sbjct: 142 REKGAVTPVKDQGSCGSCWAFSTTGALEGANYLATGKLTSLSEQQLVDCDHVCDPEERGS 201

Query: 204 CDSGCNGGLMNSAFEYILKAGG 225
           CDSGCNGGLMN+AFEYIL++GG
Sbjct: 202 CDSGCNGGLMNNAFEYILQSGG 223


>gi|353441136|gb|AEQ94152.1| drought-inducible cysteine proteinase [Elaeis guineensis]
          Length = 252

 Score =  330 bits (845), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 167/241 (69%), Positives = 195/241 (80%), Gaps = 7/241 (2%)

Query: 11  LLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHL-LNAEHHFSLFKSKFSKTYATQEE 69
           + L +SV +S  +  +DD +I QVVP   E  ED L LNAE HFS F  +F K+YA ++E
Sbjct: 15  VALSASVASSWPSYAEDDPLIVQVVP---ESDEDELRLNAEAHFSSFLRRFGKSYADEKE 71

Query: 70  HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP---ADA 126
           H YRF VFKANLRRA+R Q +DPTAVHG+TKFSDLTP+EFRR +LGL    RL    A +
Sbjct: 72  HAYRFSVFKANLRRARRHQKMDPTAVHGITKFSDLTPAEFRRTYLGLRGGRRLRRALASS 131

Query: 127 QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
            +APILPTN+LPTDFDWRDHGAVTGVKDQG+CGSCWSFSA+GALEGA+FL+TG+L SLSE
Sbjct: 132 HEAPILPTNNLPTDFDWRDHGAVTGVKDQGSCGSCWSFSASGALEGANFLATGQLESLSE 191

Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
           QQLVDCDHECD  E  SCDSGCNGGLM +AFEY+LK+GG+E EKDYPYTGTD G CKFD+
Sbjct: 192 QQLVDCDHECDSSEPDSCDSGCNGGLMTTAFEYLLKSGGLELEKDYPYTGTDRGRCKFDE 251

Query: 247 S 247
           S
Sbjct: 252 S 252


>gi|40806502|gb|AAR92156.1| putative cysteine protease 3 [Iris x hollandica]
          Length = 292

 Score =  313 bits (803), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 146/220 (66%), Positives = 180/220 (81%), Gaps = 5/220 (2%)

Query: 81  LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR-RLRLPADAQKAPILPTNDLPT 139
           +RRA+R Q LDPTAVHGVT+FSDLTP EF+R +LGL + +  L   A +AP+LPTNDLP 
Sbjct: 1   MRRARRHQQLDPTAVHGVTQFSDLTPGEFKRTYLGLRKGKKHLVGSAHEAPLLPTNDLPE 60

Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
           DFDWRD GAVTGVK+QG+CGSCWSFS +GALEGA+FL+TG+L +LSEQQ+VDCDHECD E
Sbjct: 61  DFDWRDKGAVTGVKNQGSCGSCWSFSTSGALEGANFLATGKLETLSEQQMVDCDHECDAE 120

Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
           E   CD GCNGGLMN+AF+Y+ K GG+E EKDYPYTGTD G+CKFD+SKI A+V NFSV+
Sbjct: 121 EPDDCDQGCNGGLMNTAFQYLQKVGGLESEKDYPYTGTDRGTCKFDESKIKASVHNFSVV 180

Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           S DE+Q+AANLVKHGPLA  + ++ +     +++  VS P
Sbjct: 181 SIDEEQIAANLVKHGPLAIAINAVFMQ----TYIGGVSCP 216


>gi|222628593|gb|EEE60725.1| hypothetical protein OsJ_14236 [Oryza sativa Japonica Group]
          Length = 364

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 155/256 (60%), Positives = 187/256 (73%), Gaps = 23/256 (8%)

Query: 25  NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
            ++D +I QVV   G + ED  L+AE HF+ F+ +F +TY                 RRA
Sbjct: 32  GEEDPLIDQVV--GGGEEEDAQLDAEAHFASFERRFGRTYP--------------GPRRA 75

Query: 85  KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RLRLPADAQKAPILPTNDLPTDF 141
           +R   LDPTA HGVTKFSDLTP EFR +FLGL R      +  +  +APILPT+ LP DF
Sbjct: 76  RR---LDPTATHGVTKFSDLTPGEFRDRFLGLRRPSLEGLVGGEPHEAPILPTDGLPDDF 132

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWR+HGAV  VKDQG+CGSCWSFS +GALEGAHFL+TG+L  LSEQQ+VDCDHECD  ES
Sbjct: 133 DWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASES 192

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
            +CDSGCNGGLM +AF Y++K+GG++ EKDYPY G +  +CKFDKSKI A V NFSVIS 
Sbjct: 193 RACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGRE-NTCKFDKSKIVAQVKNFSVISV 251

Query: 262 DEDQMAANLVKHGPLA 277
           +EDQ+AANLVKHGPLA
Sbjct: 252 NEDQIAANLVKHGPLA 267


>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 369

 Score =  300 bits (768), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 150/262 (57%), Positives = 189/262 (72%), Gaps = 4/262 (1%)

Query: 18  LASAVAVNDDDAMIRQVVPSDG--EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
           L +++ + D    +   V  DG  EQ    LL AE  F  F  +F K Y T EE+++RF+
Sbjct: 19  LVASLPLRDVIQQVTDGVRVDGSVEQFAHALLGAEKQFESFIKEFGKVYHTVEEYEHRFK 78

Query: 76  VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN 135
           VFK+NL RA + Q LDPTA HGVT FSDLT  EF  Q+LGL R   L + A  A  LPT 
Sbjct: 79  VFKSNLLRALKHQALDPTASHGVTMFSDLTEEEFATQYLGLKRPSAL-STAPTAEPLPTG 137

Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
           DLP  FDWR+ GAV  VK+QG+CGSCW+FS TGA+EGAHFL+TG+L+SLSEQQLVDCDH+
Sbjct: 138 DLPPSFDWREKGAVGPVKNQGSCGSCWAFSTTGAVEGAHFLATGKLLSLSEQQLVDCDHQ 197

Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
           CDPEE+ +CD+GC GGLM +A++Y+ +AGG+E E DYPY G D G C+F+ +K+AA VSN
Sbjct: 198 CDPEEAQACDAGCGGGLMTNAYKYVEEAGGLELESDYPYKGRD-GKCQFNPNKVAAKVSN 256

Query: 256 FSVISSDEDQMAANLVKHGPLA 277
           F+ I  DEDQ+AA L+K GPLA
Sbjct: 257 FTNIPIDEDQVAAYLIKSGPLA 278


>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 369

 Score =  297 bits (760), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 152/279 (54%), Positives = 192/279 (68%), Gaps = 11/279 (3%)

Query: 4   LILSSLLLLLLSSVLAS-----AVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
           L+L  +++L  +   AS      +    DDA+    V    EQ    L+ AE  F  F  
Sbjct: 6   LLLVGIVVLGFAGFAASLPTGDTIREVTDDALSNGSV----EQFAHALIGAEKRFESFMK 61

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
            F K Y + EE+++RF VFK+NL +A + Q LDPTA HGVT FSDLT  EF  ++LGL R
Sbjct: 62  DFGKVYHSVEEYEHRFGVFKSNLLKALKHQALDPTASHGVTMFSDLTEEEFTSKYLGLKR 121

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
              L + A +AP LPT DLP +FDWR+ GAV  VKDQG CGSCW+FS TGA+EGAHFL++
Sbjct: 122 PSVL-SSAPQAPPLPTEDLPPNFDWREKGAVGPVKDQGGCGSCWAFSTTGAVEGAHFLNS 180

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+LVSLSEQQLVDCDH+CD EE+ +CD+GCNGG M +A++Y+  AGG+E E DYPY G D
Sbjct: 181 GKLVSLSEQQLVDCDHQCDREEADACDAGCNGGFMTNAYQYVEAAGGLELESDYPYEGRD 240

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G CKFD +K+A  VSNF+ I  DEDQ+AA L+K GPLA
Sbjct: 241 -GKCKFDSNKVAVKVSNFTNIPVDEDQVAAYLIKSGPLA 278


>gi|240255643|ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
 gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 367

 Score =  290 bits (742), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 143/300 (47%), Positives = 202/300 (67%), Gaps = 10/300 (3%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLL--NAEHHFSLFKSKFS 61
           ++  +L  L+   +L   V  + +D  IRQV  +D  +   +LL  + E  F LF S + 
Sbjct: 1   MVAKALAQLITCIILFCHVVASVEDLTIRQVT-ADNRRIRPNLLGTHTESKFRLFMSDYG 59

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN--RR 119
           K Y+T+EE+ +R  +F  N+ +A   Q++DP+AVHGVT+FSDLT  EF+R + G+     
Sbjct: 60  KNYSTREEYIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGG 119

Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
            R      +AP++  + LP DFDWR+ G VT VK+QGACGSCW+FS TGA EGAHF+STG
Sbjct: 120 SRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTG 179

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
           +L+SLSEQQLVDCD  CDP++  +CD+GC GGLM +A+EY+++AGG+E E+ YPYTG   
Sbjct: 180 KLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKR- 238

Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           G CKFD  K+A  V NF+ I  DE+Q+AANLV+HGPLA  + ++ +     +++  VS P
Sbjct: 239 GHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQ----TYIGGVSCP 294


>gi|297816790|ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 368

 Score =  284 bits (727), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 143/301 (47%), Positives = 201/301 (66%), Gaps = 11/301 (3%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLL--NAEHHFSLFKSKFS 61
           ++  +L  L+   +    V  + +D  IRQV  +D  +   +LL  + E  F +F S + 
Sbjct: 1   MVAKALAQLITCIIFFCHVVASVEDLTIRQVT-ADERRVRPNLLGTHTESKFRVFMSDYG 59

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN--RR 119
           K Y+T+EE+ +R  +F  N+ +A   Q++DPTAVHGVT+FSDLT  EF+R + G+     
Sbjct: 60  KNYSTREEYIHRLGIFAKNVLKAAEHQMMDPTAVHGVTQFSDLTEEEFKRMYTGVADVGG 119

Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
            R  A   +AP++  + LP DFDWR+ G VT VK+QGACGSCW+FS TGA EGAHF+STG
Sbjct: 120 SRGHAVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTG 179

Query: 180 ELVSLSEQQLVDCDHE-CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           +L+SLSEQQLVDCD   CDP++  +CD+GC GGLM +A+EY+++AGG+E E+ YPYTG  
Sbjct: 180 KLLSLSEQQLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKR 239

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSS 298
            G CKFD  K+A  V NF+ I  DEDQ+AANLV+ GPLA  + ++ +     +++  VS 
Sbjct: 240 -GHCKFDPEKVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQ----TYIGGVSC 294

Query: 299 P 299
           P
Sbjct: 295 P 295


>gi|125547724|gb|EAY93546.1| hypothetical protein OsI_15336 [Oryza sativa Indica Group]
          Length = 348

 Score =  281 bits (718), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 135/198 (68%), Positives = 157/198 (79%), Gaps = 4/198 (2%)

Query: 83  RAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RLRLPADAQKAPILPTNDLPT 139
           R  R   LDPTA HGVTKFSDLTP EFR + LGL R      +  +  +APILPT+ LP 
Sbjct: 55  RELRAARLDPTATHGVTKFSDLTPGEFRDRLLGLRRPSLEGLVGGEPHEAPILPTDGLPD 114

Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
           DFDWR+HGAV  VKDQG+CGSCWSFS +GALEGAHFL+TG+L  LSEQQ+VDCDHECD  
Sbjct: 115 DFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDAS 174

Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
           ES +CDSGCNGGLM +AF Y++K+GG++ EKDYPY G +  +CKFDKSKI A V NFSVI
Sbjct: 175 ESRACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGRE-NTCKFDKSKIVAQVKNFSVI 233

Query: 260 SSDEDQMAANLVKHGPLA 277
           S +EDQ+AANLVKHGPLA
Sbjct: 234 SVNEDQIAANLVKHGPLA 251


>gi|294462776|gb|ADE76932.1| unknown [Picea sitchensis]
          Length = 403

 Score =  276 bits (706), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 137/282 (48%), Positives = 194/282 (68%), Gaps = 12/282 (4%)

Query: 4   LILSS-LLLLLLSSVLASAVAVND----DDAMIRQVVPSDGEQSEDHLLN--AEHHFSLF 56
           L+L+  + LL++S+ ++ ++ +++    +   I QV     + + +HLLN  ++  F  F
Sbjct: 37  LVLAGCMFLLVISTQISFSLGLDNGRVSEGGFIAQVTE---KFNREHLLNLRSKTLFDKF 93

Query: 57  KSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL 116
             +  K Y+T EE+  R R+F+ NL +A   Q LDPTAVHG+T FSDLT  EF  ++ GL
Sbjct: 94  IVEHGKVYSTIEEYVRRLRIFEKNLLKAAENQALDPTAVHGITPFSDLTEYEFESRYTGL 153

Query: 117 -NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
              R  L  + Q A ILP +DLP +FDWR+ GAVT VK QG CGSCW+FS TG +EGA+F
Sbjct: 154 LGVRQGLVNEKQTAEILPVDDLPANFDWREKGAVTEVKTQGNCGSCWAFSTTGVVEGANF 213

Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
           L+TG+L++LSEQQL+DCDH+CDP  + +CD+GC+GGLM +A+ Y+++AGG+E  K+YPYT
Sbjct: 214 LATGKLLNLSEQQLIDCDHKCDPLNTKACDNGCHGGLMTNAYNYLMEAGGIEEAKNYPYT 273

Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G   G CKF+    A    NF+ ++ DE Q+AANLVKHGPLA
Sbjct: 274 GVQ-GDCKFNPDLAAVKAINFTTVNLDEKQIAANLVKHGPLA 314


>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score =  276 bits (706), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 142/301 (47%), Positives = 195/301 (64%), Gaps = 12/301 (3%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
           L+  ++ L LL S + SA A+  D   +RQV  +DGE   +    +E  F +F  K+ K+
Sbjct: 42  LLACAISLALLISAIPSATALRRDPEFLRQV--TDGEIFNNLPAGSERKFVMFMEKYGKS 99

Query: 64  YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-- 121
           Y T++E+ +RF +F  NL RA   Q LDPTAVHGVT+FSDL+  EF R F+G+       
Sbjct: 100 YPTRKEYLHRFGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFERMFMGVRGGAGGE 159

Query: 122 -LPADAQKAPILP--TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
            LP   Q   +       LP  FDWRD GAVT VK QG CGSCW+FS  GA+EGA+F++T
Sbjct: 160 GLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWAFSTCGAVEGANFIAT 219

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L++LSEQQLVDCDH CDP +  +C++GCNGGLM +A++Y++++GG+E E  YPYTG  
Sbjct: 220 GNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYTGRS 279

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSS 298
            G C F   KIA  VSNF+ I  DE+Q+AA+LV+ GPLA  + ++ +     +++  VS 
Sbjct: 280 -GQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQ----TYIGGVSC 334

Query: 299 P 299
           P
Sbjct: 335 P 335


>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score =  276 bits (706), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 142/301 (47%), Positives = 195/301 (64%), Gaps = 12/301 (3%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
           L+  ++ L LL S + SA A+  D   +RQV  +DGE   +    +E  F +F  K+ K+
Sbjct: 42  LLACAISLALLISAIPSATALRRDPEFLRQV--TDGEIFNNLPAGSERKFVMFMEKYGKS 99

Query: 64  YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR-- 121
           Y T++E+ +RF +F  NL RA   Q LDPTAVHGVT+FSDL+  EF R F+G+       
Sbjct: 100 YPTRKEYLHRFGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFERMFMGVRGGAGGE 159

Query: 122 -LPADAQKAPILP--TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
            LP   Q   +       LP  FDWRD GAVT VK QG CGSCW+FS  GA+EGA+F++T
Sbjct: 160 GLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWAFSTCGAVEGANFIAT 219

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L++LSEQQLVDCDH CDP +  +C++GCNGGLM +A++Y++++GG+E E  YPYTG  
Sbjct: 220 GNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYTGRS 279

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSS 298
            G C F   KIA  VSNF+ I  DE+Q+AA+LV+ GPLA  + ++ +     +++  VS 
Sbjct: 280 -GQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQ----TYIGGVSC 334

Query: 299 P 299
           P
Sbjct: 335 P 335


>gi|4678299|emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
          Length = 363

 Score =  275 bits (704), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 140/300 (46%), Positives = 198/300 (66%), Gaps = 14/300 (4%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLL--NAEHHFSLFKSKFS 61
           ++  +L  L+   +L   V  + +D  IRQV  +D  +   +LL  + E  F LF S + 
Sbjct: 1   MVAKALAQLITCIILFCHVVASVEDLTIRQVT-ADNRRIRPNLLGTHTESKFRLFMSDYG 59

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN--RR 119
           K Y+T+EE+ +R  +F  N+ +A   Q++DP+AVHGVT+FSDLT  EF+R + G+     
Sbjct: 60  KNYSTREEYIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGG 119

Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
            R      +AP++  + LP DFDWR+ G VT VK+QGACGSCW+FS TGA EGAHF+STG
Sbjct: 120 SRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTG 179

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
           +L+SLSEQQLVDCD      +  +CD+GC GGLM +A+EY+++AGG+E E+ YPYTG   
Sbjct: 180 KLLSLSEQQLVDCDQ----ADKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKR- 234

Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           G CKFD  K+A  V NF+ I  DE+Q+AANLV+HGPLA  + ++ +     +++  VS P
Sbjct: 235 GHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQ----TYIGGVSCP 290


>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
 gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
 gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
 gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
 gi|1096153|prf||2111244A Cys protease
          Length = 380

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 141/298 (47%), Positives = 196/298 (65%), Gaps = 12/298 (4%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
           + L+ + L L +  L++A        + R++   D E     LL  E  F +F   + ++
Sbjct: 10  MCLARVSLFLCALTLSAAHGSTTVQDIARKLKLGDNE-----LLRTEKKFKVFMENYGRS 64

Query: 64  YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
           Y+T+EE+  R  +F  N+ RA   Q LDPTAVHGVT+FSDLT  EF + + G+N      
Sbjct: 65  YSTEEEYLRRLGIFAQNMVRAAEHQALDPTAVHGVTQFSDLTEDEFEKLYTGVNGGFPSS 124

Query: 124 ADAQK--APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
            +A    AP L  + LP +FDWR+ GAVT VK QG CGSCW+FS TG++EGA+FL+TG+L
Sbjct: 125 NNAAGGIAPPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKL 184

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQQL+DCD++CD  E  SCD+GCNGGLM +A+ Y+L++GG+E E  YPYTG + G 
Sbjct: 185 VSLSEQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESSYPYTG-ERGE 243

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           CKFD  KIA  ++NF+ I +DE+Q+AA LVK+GPLA  V +I +     +++  VS P
Sbjct: 244 CKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQ----TYIGGVSCP 297


>gi|2511695|emb|CAB17077.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 377

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 137/293 (46%), Positives = 191/293 (65%), Gaps = 14/293 (4%)

Query: 8   SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
           SL+L  L+   A    V+D        +    +  ++ LL  E  F++F   + K Y+T+
Sbjct: 16  SLVLFALTLSSARQTTVHD--------IAKKLKLQDNQLLRTEKKFNVFMENYGKKYSTR 67

Query: 68  EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
           EE+  R  +F  N+ RA   Q LDPTA+HGVT+FSDLT  EF+R + G+N         +
Sbjct: 68  EEYLQRLEIFAGNMLRAPENQALDPTAIHGVTQFSDLTEDEFQRHYTGVNGGFPWNNGVR 127

Query: 128 K-APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
             AP L  + LP DFDWR+ GAVT VK QG CGSCW+FS TG++EGA+F++TG+L++LSE
Sbjct: 128 DVAPPLKVDGLPEDFDWREKGAVTEVKMQGKCGSCWAFSTTGSIEGANFIATGKLLNLSE 187

Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
           QQLVDCD +CD  ES +CD+GC GGLM +A++Y+L++GG+E E  YPYTG   G CKFD 
Sbjct: 188 QQLVDCDSQCDITESTTCDNGCMGGLMTNAYKYLLQSGGLEEESSYPYTGAK-GECKFDP 246

Query: 247 SKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
            K+A  ++NF+ I  DE+Q+AA LVKHGPLA  + +I +     +++  VS P
Sbjct: 247 GKVAVRITNFTNIPVDENQIAAYLVKHGPLAVGLNAIFMQ----TYIGGVSCP 295


>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
          Length = 379

 Score =  270 bits (690), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 138/278 (49%), Positives = 182/278 (65%), Gaps = 15/278 (5%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           L  L LSS L     + D        V    E  ++ LL  E  F LF   +SK Y+T E
Sbjct: 19  LCALTLSSSLHHETLIQD--------VARKLELKDNDLLTTEKKFKLFMKDYSKKYSTTE 70

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP---AD 125
           E+  R  +F  N+ +A   Q LDPTA+HGVT+FSDL+  EF R + G   +   P   A 
Sbjct: 71  EYLLRLGIFAKNMVKAAEHQALDPTAIHGVTQFSDLSEEEFERFYTGF--KGGFPSSNAA 128

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
              AP L     P +FDWR+ GAVTG+K QG CGSCW+F+ TG++EGA+FL+TG+LVSLS
Sbjct: 129 GGVAPPLDVKGFPENFDWREKGAVTGIKTQGKCGSCWAFTTTGSIEGANFLATGKLVSLS 188

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQQLVDCD++CD  ++ SCD+GCNGGLM +A++Y+++AGG+E E  YPYTG   G CKFD
Sbjct: 189 EQQLVDCDNKCDITKT-SCDNGCNGGLMTTAYDYLMEAGGLEEETSYPYTGAQ-GECKFD 246

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
            +K+A  VSNF+ I +DE+Q+AA LV HGPLA  V ++
Sbjct: 247 PNKVAVRVSNFTNIPADENQIAAYLVNHGPLAIAVNAV 284


>gi|224113123|ref|XP_002316398.1| predicted protein [Populus trichocarpa]
 gi|222865438|gb|EEF02569.1| predicted protein [Populus trichocarpa]
          Length = 327

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 128/255 (50%), Positives = 173/255 (67%), Gaps = 5/255 (1%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDL 104
           +LL  E  F +F  + +K YAT+EE+ +RF +F  NL RA   Q LDPTA+HGVT F DL
Sbjct: 6   NLLGTEEKFKMFIKEHNKEYATREEYVHRFGIFGKNLIRAVEHQALDPTAIHGVTPFMDL 65

Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           T  EF R + G+     +P +      +  + LP  FDWR+ GAVT VK QG+CGSCW+F
Sbjct: 66  TEEEFERMYAGVLGGGTVPVEKGSVSFMDASGLPDSFDWREKGAVTDVKIQGSCGSCWAF 125

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           S TG++EGA+F++TG+L++LSEQQLVDCD  CD  +  SCD GC GGLM +A+ Y+++AG
Sbjct: 126 STTGSVEGANFIATGKLLNLSEQQLVDCDRVCDKTDKASCDDGCGGGLMTNAYRYLIEAG 185

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
           G++ E  YPYTG   G CKFD  KIA  V+NF+ I+ DE+Q+AANLV HGPLA  + +I 
Sbjct: 186 GLQEESSYPYTGKS-GECKFDPEKIAVKVANFTSIAVDENQIAANLVHHGPLAIGLNAIF 244

Query: 285 LPHISFSFLFTVSSP 299
           +     +++  VS P
Sbjct: 245 MQ----TYIGGVSCP 255


>gi|356576257|ref|XP_003556249.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
           [Glycine max]
          Length = 374

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 138/294 (46%), Positives = 188/294 (63%), Gaps = 13/294 (4%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           L+ + L L +  L+SA        + R++   D E     LL  E  F +F   + ++Y+
Sbjct: 12  LARVSLFLFALTLSSAHESTTVHDIARKLKVGDNE-----LLRTEKKFKVFMENYGRSYS 66

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           T+EE+  R  +F  N+ RA   Q LDPTAVHGVT+FSDLT  EF + + G          
Sbjct: 67  TREEYLRRLGIFSQNMLRAAEHQALDPTAVHGVTQFSDLTEVEFEKLYTGXPST---NTA 123

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
              AP L    LP +FDWR+ GAVT VK QG CGSCW+FS TG++EGA+FL+TG+LVSLS
Sbjct: 124 GGVAPPLEVEGLPENFDWREKGAVTEVKIQGRCGSCWAFSTTGSIEGANFLATGKLVSLS 183

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQQL+DCD++C+  E  SCD+GCNGGLM +A+ Y+L++GG+E E  YPYTG + G CKFD
Sbjct: 184 EQQLLDCDNKCEITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESSYPYTG-ERGECKFD 242

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
             KI   ++NF+ I  DE+Q+AA LVK+GPLA  V +I +     +++  VS P
Sbjct: 243 PEKITVRITNFTNIPVDENQIAAYLVKNGPLAMGVNAIFMQ----TYIGGVSCP 292


>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
          Length = 375

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 128/257 (49%), Positives = 173/257 (67%), Gaps = 6/257 (2%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSD 103
           D +L  E  F +F  K+ K Y+++EE+ +R  +F  N+ RA   Q LDPTA+HGVT FSD
Sbjct: 52  DGVLGTEKEFRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPTALHGVTPFSD 111

Query: 104 LTPSEFRRQFLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           L+  EF R F G+  R  +    A+ A  L  + LP  FDWR+ GAVT VK QG CGSCW
Sbjct: 112 LSEEEFERMFTGVVGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCW 171

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FS TGA+EGAHF+ST +L++LSEQQLVDCDH CD  +  +CDSGC GGLM +A++Y+++
Sbjct: 172 AFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIE 231

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVAS 282
           AGG+E E  YPYTG   G CKF   ++A  V NF+ +  +E+Q+AANLV HGPLA  + +
Sbjct: 232 AGGLEEESSYPYTGKH-GECKFKPDRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNA 290

Query: 283 IELPHISFSFLFTVSSP 299
           I +     +++  VS P
Sbjct: 291 IFMQ----TYIGGVSCP 303


>gi|351629613|gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
          Length = 397

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 141/322 (43%), Positives = 193/322 (59%), Gaps = 31/322 (9%)

Query: 4   LILSSLLLLLLSSVLASAVAVN-------DDDAMIRQVVPSD------GEQSEDHLL--- 47
           ++  +L + LLS  L S+            D  MIRQV  +       G  S +H L   
Sbjct: 9   MLTCTLAITLLSCALISSTTFQHEIQYRVQDPLMIRQVTDNHHHRHHPGRSSANHRLLGT 68

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPS 107
             E HF  F  ++ KTY+T EE+ +R  +F  NL +A   Q +DP+A+HGVT+FSDLT  
Sbjct: 69  TTEVHFKSFVEEYEKTYSTHEEYVHRLGIFAKNLIKAAEHQAMDPSAIHGVTQFSDLTEE 128

Query: 108 EFRRQFLGLNRRLRLPADAQKAP----------ILPTNDLPTDFDWRDHGAVTGVKDQGA 157
           EF   ++GL     +    Q             ++  +DLP  FDWR+ GAVT VK QG 
Sbjct: 129 EFEATYMGLKGGAGVGGTTQLGKDDGDESAAEVMMDVSDLPESFDWREKGAVTEVKTQGR 188

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FS TGA+EGA+F++TG+L+SLSEQQLVDCDH CD +E   CD GC+GGLM +AF
Sbjct: 189 CGSCWAFSTTGAIEGANFIATGKLLSLSEQQLVDCDHMCDLKEKDDCDDGCSGGLMTTAF 248

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            Y+++AGG+E E  YPYTG   G CKF+  K+A  V NF+ I  DE Q+AAN+V +GPLA
Sbjct: 249 NYLIEAGGIEEEVTYPYTGKR-GECKFNPEKVAVKVRNFAKIPEDESQIAANVVHNGPLA 307

Query: 278 GNVASIELPHISFSFLFTVSSP 299
             + ++ +     +++  VS P
Sbjct: 308 IGLNAVFMQ----TYIGGVSCP 325


>gi|24417396|gb|AAN60308.1| unknown [Arabidopsis thaliana]
          Length = 193

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 132/197 (67%), Positives = 155/197 (78%), Gaps = 6/197 (3%)

Query: 1   MERLILS-SLLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS 58
           M+RL L  S+ +L    VL S+  VND DD +IRQVV      +E  +L +E HFSLFK 
Sbjct: 1   MDRLKLYFSVFVLSFFIVLVSSSDVNDGDDLVIRQVVGG----AEPQVLTSEDHFSLFKR 56

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
           KF K YA+ EEHDYRF VFKANLRRA+R Q LDP+A HGVT+FSDLT SEFR++ LG+  
Sbjct: 57  KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS 116

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
             +LP DA KAPILPT +LP DFDWRDHGAVT VK+QG+CGSCWSFSATGALEGA+FL+T
Sbjct: 117 GFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176

Query: 179 GELVSLSEQQLVDCDHE 195
           G+LVSLSEQQLVDCDH+
Sbjct: 177 GKLVSLSEQQLVDCDHQ 193


>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
          Length = 321

 Score =  260 bits (665), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 121/232 (52%), Positives = 160/232 (68%), Gaps = 2/232 (0%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           +  E  F +F  K+ K Y+++EE+ +R  +F  N+ RA   Q LDP A+HGVT FSDL+ 
Sbjct: 1   MGGEKEFRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPXALHGVTPFSDLSE 60

Query: 107 SEFRRQFLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
            EF R F G+  R  +    A+ A  L  + LP  FDWR+ GAVT VK QG CGSCW+FS
Sbjct: 61  EEFERMFTGVVGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFS 120

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGA+EGAHF+ST +L++LSEQQLVDCDH CD  +  +CDSGC GGLM +A++Y+++AGG
Sbjct: 121 TTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYKYLIEAGG 180

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +E E  YPYTG   G CKF   ++A  V NF+ +  BE+Q+AANLV HGPLA
Sbjct: 181 LEEESSYPYTGKH-GECKFKPDRVAVRVVNFTEVPIBENQIAANLVCHGPLA 231


>gi|255585361|ref|XP_002533377.1| cysteine protease, putative [Ricinus communis]
 gi|223526784|gb|EEF29008.1| cysteine protease, putative [Ricinus communis]
          Length = 381

 Score =  259 bits (663), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 131/253 (51%), Positives = 172/253 (67%), Gaps = 6/253 (2%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPS 107
           N E +F +F  K+ K Y T+EE+ +R  VF  NL RA   Q+LDPTAVHG+T F DLT  
Sbjct: 62  NTEENFKMFMIKYDKEYDTREEYMHRLGVFAKNLIRAAEHQVLDPTAVHGITPFMDLTEE 121

Query: 108 EFRRQFLGLNRRLRLPADAQKAP-ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           EF R + G+     + A+   A   L T  LP+ FDWR  GAVT VK QGACGSCW+FS 
Sbjct: 122 EFERMYTGVVGGGAVGAEGVTATSFLETAGLPSSFDWRKKGAVTDVKMQGACGSCWAFST 181

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TGA+EGA+F++TG+L++LSEQQLVDCD  CD +E  +CD GC GGLM +A+ Y+++AGG+
Sbjct: 182 TGAIEGANFIATGKLLNLSEQQLVDCDRVCDIKEKTACDDGCGGGLMTNAYRYLIEAGGL 241

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELP 286
           E E  YPYTG   G CKFD+ KIA  V NF+ I  DE+Q+AA+LV HGPLA  + ++ + 
Sbjct: 242 EDEISYPYTGKP-GKCKFDEKKIAVRVVNFTSIPIDENQIAAHLVHHGPLAIGLNAVFMQ 300

Query: 287 HISFSFLFTVSSP 299
               +++  VS P
Sbjct: 301 ----TYIGGVSCP 309


>gi|52546912|gb|AAU81589.1| cysteine proteinase [Petunia x hybrida]
          Length = 257

 Score =  258 bits (659), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 122/172 (70%), Positives = 143/172 (83%), Gaps = 5/172 (2%)

Query: 128 KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
           KAPILPT+DLP DFDWR+ GAVTGVK+QG+CGSCWSFS TGA+EGAHFL+TGELVSLSEQ
Sbjct: 15  KAPILPTSDLPDDFDWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQ 74

Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
           QLVDCDHECD E+   CD+GC GGLM +AFEY LKAGG++REKDYPYTG D G C FDKS
Sbjct: 75  QLVDCDHECDAEQQNECDAGCGGGLMTTAFEYTLKAGGLQREKDYPYTGRD-GKCHFDKS 133

Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           KIAA+V+NFSV+  DEDQ+AANLVKHGPLA  + +  +     +++  VS P
Sbjct: 134 KIAASVANFSVVGLDEDQIAANLVKHGPLAVGINAAWMQ----TYVGGVSCP 181


>gi|53748485|emb|CAH59428.1| cysteine protease 2 [Plantago major]
          Length = 245

 Score =  257 bits (656), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 122/177 (68%), Positives = 149/177 (84%), Gaps = 6/177 (3%)

Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
           AD  KAP LPT++LP +FDWR+ GAVT VK+QG+CGSCWSFS TGALEGA++L+TGEL+S
Sbjct: 1   ADENKAPKLPTSNLPEEFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATGELIS 60

Query: 184 LSEQQLVDCDHECDPEESG-SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
           LSEQQLVDCDHECDPEE   SCD+GCNGGLMN+AFEY LKAGG+++EKDYPYTG D G+C
Sbjct: 61  LSEQQLVDCDHECDPEEGADSCDAGCNGGLMNNAFEYALKAGGLQKEKDYPYTGKD-GTC 119

Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           KFDK+KIAA+V NFSV+S DEDQ+AANLVK+GPLA  + +  +     +++  VS P
Sbjct: 120 KFDKTKIAASVHNFSVVSIDEDQIAANLVKYGPLAVGINAAWMQ----TYIGGVSCP 172


>gi|357473429|ref|XP_003606999.1| Cysteine proteinase [Medicago truncatula]
 gi|355508054|gb|AES89196.1| Cysteine proteinase [Medicago truncatula]
          Length = 210

 Score =  256 bits (654), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 124/196 (63%), Positives = 152/196 (77%), Gaps = 6/196 (3%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M+   L   ++L + SV A +     +D +IRQVV  +G +     L AEHHF+LFK KF
Sbjct: 1   MDHRTLLLFVVLFIFSVSAFSTPDEGEDPIIRQVVDEEGVR-----LGAEHHFNLFKHKF 55

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
            K Y++++EHDYRF++FK+NL RAKR QL+DP+AVHGVT+FSDLTP EFR+  LGL R +
Sbjct: 56  GKVYSSKDEHDYRFKIFKSNLNRAKRHQLMDPSAVHGVTRFSDLTPREFRKSVLGL-RGV 114

Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
            LP DA  APILPT++LP DFDWR+ GAVT VK+QG+CGSCWSFS TGALEGAHFLSTG+
Sbjct: 115 GLPKDANAAPILPTDNLPKDFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGAHFLSTGK 174

Query: 181 LVSLSEQQLVDCDHEC 196
           LVSLSEQQLVDCDHE 
Sbjct: 175 LVSLSEQQLVDCDHEV 190


>gi|357473651|ref|XP_003607110.1| Cysteine proteinase [Medicago truncatula]
 gi|355508165|gb|AES89307.1| Cysteine proteinase [Medicago truncatula]
          Length = 331

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 136/284 (47%), Positives = 177/284 (62%), Gaps = 42/284 (14%)

Query: 2   ERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFS 61
           +  +L S+L L  S  LA ++  + +D +I+QVV   G         AE+ F+ FK +F 
Sbjct: 6   QTFMLFSVLFLFFSVDLAFSMPKDREDPIIQQVVDKGG---------AEYQFNEFKQRFG 56

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           K Y++++EHDYRF VFK+NL RAKR  ++DP+A HGVT+FSDLTP EFR   LGL + + 
Sbjct: 57  KVYSSKDEHDYRFNVFKSNLHRAKRHGIMDPSATHGVTRFSDLTPREFRNSILGL-KGVG 115

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
           LP  A+ APIL T +LP DFDWR+ GAVT V++QG CGS WSFS  GALEGAHFLS+GEL
Sbjct: 116 LPRHAKAAPILSTENLPRDFDWREKGAVTPVRNQGFCGSSWSFSTIGALEGAHFLSSGEL 175

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQ  VDCDH                       EYI K GG+ R +DY Y  T+   
Sbjct: 176 VSLSEQHHVDCDH-----------------------EYIQKYGGLMRVEDYTYYKTNTAR 212

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
                    +  +NFS IS D++Q+ ANLVKHGPLA  + ++ +
Sbjct: 213 ---------SVAANFSSISVDDNQITANLVKHGPLAAAINAVYM 247


>gi|5679322|gb|AAD46920.1|AF167986_1 putative cysteine proteinase GmPM33 [Glycine max]
          Length = 363

 Score =  253 bits (646), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 135/300 (45%), Positives = 187/300 (62%), Gaps = 33/300 (11%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
           + L+ + L L +  L++A        + R++   D E     LL  E  F +F   + ++
Sbjct: 10  MCLARVSLFLCALTLSAAHGSTTVQDIARKLKLGDNE-----LLRTEKKFKVFMENYGRS 64

Query: 64  YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
           Y+T+EE+  R  +F  N+ RA   Q LDPTAVHGVT+FS                   LP
Sbjct: 65  YSTEEEYLRRLGIFAQNMVRAAEHQALDPTAVHGVTQFS-------------------LP 105

Query: 124 ----ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
               A    AP L  + LP +FDWR+ GAVT VK QG CGSCW+FS TG++EGA+FL+TG
Sbjct: 106 VSNNAAGGIAPPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATG 165

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
           +LVSLS+QQL+DCD++CD  E  SCD+GCNGGLM +A+ Y+L++GG+E E  YPYTG + 
Sbjct: 166 KLVSLSDQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESSYPYTG-ER 224

Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           G CKFD  KIA  ++NF+ I +DE+Q+AA LVK+GPLA  V +I +     +++  VS P
Sbjct: 225 GECKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQ----TYIGGVSCP 280


>gi|357473731|ref|XP_003607150.1| Cysteine proteinase [Medicago truncatula]
 gi|355508205|gb|AES89347.1| Cysteine proteinase [Medicago truncatula]
          Length = 326

 Score =  252 bits (643), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 138/275 (50%), Positives = 177/275 (64%), Gaps = 44/275 (16%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
           L+L S+L L  S  LA +   + +D +I+QVV   G         AEH F+ FK +F K 
Sbjct: 7   LMLFSVLFLFFSVDLAFSTPNDREDPIIQQVVDKGG---------AEHQFNEFKQRFGKV 57

Query: 64  YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
           Y++++EHDYRF VFK+NL RAKR  ++DP+A HGVT+FSDLTP EFR   LGL + + LP
Sbjct: 58  YSSKDEHDYRFNVFKSNLHRAKRHVIMDPSATHGVTRFSDLTPREFRNSILGL-KGVGLP 116

Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
             A+ APIL + +LP DFDWR+ GAVT V++QG CGS WSFS  GALEGA+FLSTGELVS
Sbjct: 117 RHAKAAPILSSENLPRDFDWREKGAVTPVRNQGFCGSSWSFSTIGALEGANFLSTGELVS 176

Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
           LS+QQ VDCDH                       EYI K+GG+ R +DY Y         
Sbjct: 177 LSDQQHVDCDH-----------------------EYIKKSGGLMRVEDYTYY-------- 205

Query: 244 FDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPLA 277
             K+ IA +V +NFS +  D+DQ+AANL+K+GPLA
Sbjct: 206 --KTNIARSVAANFSSVLVDDDQIAANLLKYGPLA 238


>gi|302774134|ref|XP_002970484.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
 gi|300162000|gb|EFJ28614.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
          Length = 343

 Score =  247 bits (631), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 139/302 (46%), Positives = 191/302 (63%), Gaps = 30/302 (9%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           L+ +L+ LL  V+  + +   D   IRQV  +D  + +D     E HF  F  KF K Y 
Sbjct: 5   LAIILVGLLILVVCCSSSNRLDIGKIRQV--TDNLEVKD----VEGHFKHFMQKFGKVYG 58

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           T EE+ +R +VF+ANL      +  DPTA+HG+T F+DLTP E  R FLG  R+      
Sbjct: 59  TTEEYVHRLKVFQANLAHVMSLKKQDPTAIHGITSFADLTPEELSR-FLGF-RKAYSNRV 116

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
             +AP+LPT++LP  FDWR+HGAVT VK QG CGSCW+FS TG +EGA+FL TG+L+SLS
Sbjct: 117 VNQAPLLPTDNLPEAFDWREHGAVTPVKFQGRCGSCWTFSTTGVVEGANFLKTGKLISLS 176

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD------G 239
           E+QL+DCD++         D+GC GG M SA+EY+ KA G+E E+DYPY           
Sbjct: 177 EEQLIDCDYK---------DNGCEGGDMLSAYEYV-KARGLEAEEDYPYEELGYRHKPVR 226

Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL-PHISFSFLFTVSS 298
           G C++  SK+ A ++N+S +S DEDQ+AANLVK+GPL     SI L  ++ F++   V+ 
Sbjct: 227 GPCRYQPSKVVATIANYSRVSEDEDQIAANLVKNGPL-----SIALRGNVLFTYEGGVAC 281

Query: 299 PK 300
           P+
Sbjct: 282 PR 283


>gi|302793594|ref|XP_002978562.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
 gi|300153911|gb|EFJ20548.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
          Length = 343

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 138/302 (45%), Positives = 191/302 (63%), Gaps = 30/302 (9%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           L+ +L+ LL  V+  + +   D   IRQV  +D  + +D     E HF  F  KF K Y 
Sbjct: 5   LAIILVGLLILVICCSSSNRLDIGKIRQV--TDNLEVDD----VEGHFKHFMQKFGKVYG 58

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           T EE+ +R +VF+ANL      +  DPTA+HG+T F+DLTP E  R FLG  R+      
Sbjct: 59  TTEEYVHRLKVFQANLVHVMSLKKQDPTAIHGITSFADLTPEELSR-FLGF-RKAYSNRV 116

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
             +AP+LPT++LP  FDWR+HGAVT VK QG CGSCW+FS TG +EGA+FL TG+L+SLS
Sbjct: 117 VNQAPLLPTDNLPEAFDWREHGAVTPVKFQGRCGSCWTFSTTGVVEGANFLKTGKLISLS 176

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD------G 239
           E+QL+DCD++         D+GC GG M SA+EY+ KA G+E ++DYPY           
Sbjct: 177 EEQLIDCDYK---------DNGCEGGDMLSAYEYV-KARGLEADEDYPYEELGYRHKPVR 226

Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL-PHISFSFLFTVSS 298
           G C++  SK+ A ++N+S +S DEDQ+AANLVK+GPL     SI L  ++ F++   V+ 
Sbjct: 227 GPCRYQPSKVVATIANYSRVSEDEDQIAANLVKNGPL-----SIALRGNVLFTYEGGVAC 281

Query: 299 PK 300
           P+
Sbjct: 282 PR 283


>gi|218199600|gb|EEC82027.1| hypothetical protein OsI_25996 [Oryza sativa Indica Group]
          Length = 709

 Score =  242 bits (618), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 137/264 (51%), Positives = 170/264 (64%), Gaps = 21/264 (7%)

Query: 31  IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
           IRQV  +DG      LL  E  F+ F  +  + Y+  EE+  R RVF ANL RA   Q L
Sbjct: 29  IRQV--TDGGYWPPGLL-PEAQFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQAL 85

Query: 91  DPTAVHGVTKFSDLTPSEFRRQFLGLN--------RRLRLP-ADAQKAPILPTNDLPTDF 141
           DPTA HGVT FSDLT  EF  +  GL         RR RLP   A  A     + LP+ F
Sbjct: 86  DPTARHGVTPFSDLTREEFEARLTGLATDVGDDDVRRRRLPMPSAAPATEEEVSGLPSSF 145

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWRD GAVTGVK QGACGSCW+FS TGA+EGA+FL+TG L+ LSEQQLVDCDH CD E+ 
Sbjct: 146 DWRDRGAVTGVKMQGACGSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKK 205

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS- 260
             CDSGC GGLM +A+ Y++ +GG+  +  YPYTG   G+C+FD +++A  V+NF+V++ 
Sbjct: 206 TECDSGCGGGLMTNAYAYLMSSGGLMEQSAYPYTGAQ-GACRFDANRVAVRVANFTVVAP 264

Query: 261 ------SDED-QMAANLVKHGPLA 277
                 +D D QM A LV+HGPLA
Sbjct: 265 AAGPGGNDGDAQMRAALVRHGPLA 288


>gi|1619905|gb|AAB16997.1| thiol protease isoform A, partial [Glycine max]
          Length = 318

 Score =  240 bits (612), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 119/180 (66%), Positives = 143/180 (79%), Gaps = 7/180 (3%)

Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
           +R PA AQKAPILPT DLP DFDWRD GAVT VKD G CGSCWSFS TGALE + +L+TG
Sbjct: 71  VRFPAHAQKAPILPTKDLPKDFDWRDKGAVTNVKDLGGCGSCWSFSTTGALEVSFYLATG 130

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
           ELVSLSEQQLVDCDH CDPEE G+CDSGCNGGLMN+AFE IL++GGV++EKD PYTG D 
Sbjct: 131 ELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE-ILQSGGVQKEKDIPYTGRD- 188

Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           G+CKFDK+K+ AA      +S DE+Q+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 189 GTCKFDKTKV-AATDLIKRVSLDEEQIAANLVKNGPLAVAINAVFMQ----TYVGGVSCP 243


>gi|242045644|ref|XP_002460693.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
 gi|241924070|gb|EER97214.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
          Length = 373

 Score =  237 bits (605), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 132/269 (49%), Positives = 168/269 (62%), Gaps = 21/269 (7%)

Query: 25  NDDDAMIRQVVPSDGEQSEDHL----LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN 80
           + DD  IRQV  +DG +S        L  E  F+ F  +  + Y+  EE+  R RVF AN
Sbjct: 19  STDDGFIRQV--TDGRRSRAGAGALGLLPEAQFAAFVRRHGRRYSGPEEYARRLRVFAAN 76

Query: 81  LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK-----APILP-- 133
           L RA   Q LDPTA HGVT FSDLT  EF  +  G+  R     D Q+     AP  P  
Sbjct: 77  LARAAAHQALDPTARHGVTPFSDLTREEFEARLTGV--RAGAGGDVQRLVMSGAPAAPPA 134

Query: 134 ----TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
                + LP  FDWRD GAVTGVK QGACGSCW+FS TGA+EGA+FL+TG+L+ LSEQQL
Sbjct: 135 SQEEVSRLPASFDWRDKGAVTGVKMQGACGSCWAFSTTGAVEGANFLATGKLLELSEQQL 194

Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
           VDCDH C       C++GC GGLM +A+ Y++K+GG+  ++ YPYTG   G C+FD +K 
Sbjct: 195 VDCDHTCSAVAQNECNNGCAGGLMTNAYAYLMKSGGLMEQRAYPYTGAP-GPCRFDPAKA 253

Query: 250 AAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
           A  V+NF+ + + DE Q+ A LV+ GPLA
Sbjct: 254 AVRVANFTAVPAGDEAQIRAALVRRGPLA 282


>gi|222637029|gb|EEE67161.1| hypothetical protein OsJ_24244 [Oryza sativa Japonica Group]
          Length = 309

 Score =  237 bits (604), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 134/261 (51%), Positives = 168/261 (64%), Gaps = 19/261 (7%)

Query: 31  IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
           IRQV  +DG      LL  E  F+ F  +  + Y+  EE+  R RVF ANL RA   Q L
Sbjct: 29  IRQV--TDGGYWPPGLL-PEAQFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQAL 85

Query: 91  DPTAVHGVTKFSDLTPSEFRRQFLGLN-------RRLRLPADAQKAPILPTNDLPTDFDW 143
           DPTA HGVT FSDLT  EF  +  GL        RR  +P+ A  A     + LP  FDW
Sbjct: 86  DPTARHGVTPFSDLTREEFEARLTGLAADVGDDVRRRPMPS-AAPATEEEVSGLPASFDW 144

Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
           RD GAVT VK QGACGSCW+FS TGA+EGA+FL+TG L+ LSEQQLVDCDH CD E+   
Sbjct: 145 RDRGAVTDVKMQGACGSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTE 204

Query: 204 CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS--- 260
           CDSGC GGLM +A+ Y++ +GG+  +  YPYTG   G+C+FD +++A  V+NF+V++   
Sbjct: 205 CDSGCGGGLMTNAYAYLMSSGGLMEQSAYPYTGAQ-GTCRFDANRVAVRVANFTVVAPPG 263

Query: 261 -SDED---QMAANLVKHGPLA 277
            +D D   QM A LV+HGPLA
Sbjct: 264 GNDGDGDAQMRAALVRHGPLA 284


>gi|414590229|tpg|DAA40800.1| TPA: putative cysteine protease family protein [Zea mays]
          Length = 381

 Score =  237 bits (604), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 128/262 (48%), Positives = 160/262 (61%), Gaps = 16/262 (6%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR 86
           DD  IRQV            L  E  F+ F  +  + Y+  +E+  R RVF ANL RA  
Sbjct: 34  DDKFIRQVTTQGTRAGAGPGLLPEAQFAAFVRRHGRRYSGPKEYARRLRVFAANLARAAA 93

Query: 87  RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK----APILP------TND 136
            Q LDPTA HGVT FSDLT  EF  +  GL    R   D Q+     P  P         
Sbjct: 94  HQALDPTARHGVTPFSDLTREEFEARLTGL----RAGGDVQRLMSGVPAAPPASKEEVAR 149

Query: 137 LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC 196
           LP  FDWRD GAVTGVK QGACGSCW+FS TGA+EGA+FL+TGELV LSEQQLVDCDH C
Sbjct: 150 LPASFDWRDKGAVTGVKTQGACGSCWAFSTTGAVEGANFLATGELVDLSEQQLVDCDHTC 209

Query: 197 DPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF 256
                  C++GC GGLM +A+ Y++++GG+  +  YPYTG   G C+FD +++A  V+NF
Sbjct: 210 SAVAQNECNNGCAGGLMTNAYSYLMESGGLMEQSAYPYTGA-AGPCRFDPTQVAVRVANF 268

Query: 257 SVI-SSDEDQMAANLVKHGPLA 277
           + + + DE Q+ A LV+ GPLA
Sbjct: 269 TAVPAGDEAQIRAALVRRGPLA 290


>gi|115472081|ref|NP_001059639.1| Os07g0480900 [Oryza sativa Japonica Group]
 gi|27261016|dbj|BAC45132.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113611175|dbj|BAF21553.1| Os07g0480900 [Oryza sativa Japonica Group]
 gi|215693312|dbj|BAG88694.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 376

 Score =  236 bits (603), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 134/261 (51%), Positives = 168/261 (64%), Gaps = 19/261 (7%)

Query: 31  IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
           IRQV  +DG      LL  E  F+ F  +  + Y+  EE+  R RVF ANL RA   Q L
Sbjct: 29  IRQV--TDGGYWPPGLL-PEAQFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQAL 85

Query: 91  DPTAVHGVTKFSDLTPSEFRRQFLGLN-------RRLRLPADAQKAPILPTNDLPTDFDW 143
           DPTA HGVT FSDLT  EF  +  GL        RR  +P+ A  A     + LP  FDW
Sbjct: 86  DPTARHGVTPFSDLTREEFEARLTGLAADVGDDVRRRPMPS-AAPATEEEVSGLPASFDW 144

Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
           RD GAVT VK QGACGSCW+FS TGA+EGA+FL+TG L+ LSEQQLVDCDH CD E+   
Sbjct: 145 RDRGAVTDVKMQGACGSCWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTE 204

Query: 204 CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS--- 260
           CDSGC GGLM +A+ Y++ +GG+  +  YPYTG   G+C+FD +++A  V+NF+V++   
Sbjct: 205 CDSGCGGGLMTNAYAYLMSSGGLMEQSAYPYTGAQ-GTCRFDANRVAVRVANFTVVAPPG 263

Query: 261 -SDED---QMAANLVKHGPLA 277
            +D D   QM A LV+HGPLA
Sbjct: 264 GNDGDGDAQMRAALVRHGPLA 284


>gi|357116897|ref|XP_003560213.1| PREDICTED: probable cysteine proteinase A494-like [Brachypodium
           distachyon]
          Length = 373

 Score =  234 bits (597), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 134/268 (50%), Positives = 173/268 (64%), Gaps = 14/268 (5%)

Query: 19  ASAVAVNDDDAMIRQVV----PSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT-QEEHDYR 73
           A+A A  DD  +IRQV     P+        LL  E  F+ F  +  K Y+   EE+  R
Sbjct: 19  AAAGASGDD--VIRQVTDNGAPAARRPPSPGLL-PEAKFAAFVRRHGKEYSGGAEEYARR 75

Query: 74  FRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL---RLPADAQKAP 130
            RVF ANL RA   Q LDP A HGVT FSDLTP EF+ +  GL ++     +PA A +A 
Sbjct: 76  LRVFAANLARAAAHQALDPGARHGVTPFSDLTPEEFQARLTGLQQQGTNNNMPA-AARAT 134

Query: 131 ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLV 190
                 LP  FDWR  GAVT VK QG CGSCW+FS TGA+EGAHF++TG+L++LSEQQLV
Sbjct: 135 AEELATLPASFDWRAKGAVTEVKMQGMCGSCWAFSTTGAVEGAHFVATGKLLNLSEQQLV 194

Query: 191 DCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA 250
           DCDH CD      CDSGC+GGLM +A+ Y+++AGG+  +  YPYTG   G+C+FD +K+A
Sbjct: 195 DCDHTCDAVAKNECDSGCSGGLMTNAYTYLIRAGGLMEQAAYPYTGAQ-GTCRFDANKVA 253

Query: 251 AAVSNFSVISS-DEDQMAANLVKHGPLA 277
             V++F+ +   DEDQ+ A+LV+ GPLA
Sbjct: 254 VRVTSFTAVPPDDEDQIRASLVRAGPLA 281


>gi|115457680|ref|NP_001052440.1| Os04g0311400 [Oryza sativa Japonica Group]
 gi|113564011|dbj|BAF14354.1| Os04g0311400, partial [Oryza sativa Japonica Group]
          Length = 384

 Score =  229 bits (584), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 105/144 (72%), Positives = 123/144 (85%), Gaps = 1/144 (0%)

Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
           T+ LP DFDWR+HGAV  VKDQG+CGSCWSFS +GALEGAHFL+TG+L  LSEQQ+VDCD
Sbjct: 145 TDGLPDDFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCD 204

Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
           HECD  ES +CDSGCNGGLM +AF Y++K+GG++ EKDYPY G +  +CKFDKSKI A V
Sbjct: 205 HECDASESRACDSGCNGGLMTTAFSYLMKSGGLQSEKDYPYAGRE-NTCKFDKSKIVAQV 263

Query: 254 SNFSVISSDEDQMAANLVKHGPLA 277
            NFSVIS +EDQ+AANLVKHGPLA
Sbjct: 264 KNFSVISVNEDQIAANLVKHGPLA 287


>gi|194352748|emb|CAQ00102.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 368

 Score =  229 bits (583), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 126/254 (49%), Positives = 157/254 (61%), Gaps = 8/254 (3%)

Query: 30  MIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL 89
           +IRQV  S        LL  E  F+ F  +  K Y+  EE+  R RVF AN+ RA   Q 
Sbjct: 28  VIRQVTDSGHGAGHPGLL-PEAQFAAFVRRHGKEYSGPEEYARRLRVFAANVARAAAHQA 86

Query: 90  LDPTAVHGVTKFSDLTPSEFRRQFLGL---NRRLRLPADAQKAPILPTND---LPTDFDW 143
           LDP A HGVT FSDLT  EF  +  GL      LR       A      +   LP  FDW
Sbjct: 87  LDPGARHGVTPFSDLTREEFEARLTGLVGAGDVLRSARRMPAAAPATEEEVAALPASFDW 146

Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
           RD GAVT VK QG CGSCW+FS TGA+EGA+F++TG+L+ LSEQQLVDCDH CD      
Sbjct: 147 RDKGAVTDVKMQGVCGSCWAFSTTGAVEGANFVATGKLLDLSEQQLVDCDHTCDAVAKTE 206

Query: 204 CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDE 263
           C+SGC+GGLM +A+ Y++ +GG+  +  YPYTG   G C+FD+ K+A  V+NF+ +  DE
Sbjct: 207 CNSGCSGGLMTNAYRYLMSSGGLMEQAAYPYTGAQ-GPCRFDRGKVAVRVANFTAVPLDE 265

Query: 264 DQMAANLVKHGPLA 277
           DQM A LV+ GPLA
Sbjct: 266 DQMRAALVRGGPLA 279


>gi|308808478|ref|XP_003081549.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
 gi|116060014|emb|CAL56073.1| Cysteine proteinase Cathepsin F (ISS), partial [Ostreococcus tauri]
          Length = 293

 Score =  228 bits (581), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 117/203 (57%), Positives = 145/203 (71%), Gaps = 7/203 (3%)

Query: 81  LRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRRQFLG---LNRRLRLPADAQKAPI--LPT 134
           L RA  +Q  D  +A HGVT+FSDLTP EF  ++LG   L+   R    A+   I  LPT
Sbjct: 3   LIRAATQQANDRGSAKHGVTRFSDLTPEEFAERYLGHVKLSSEHREKVRARGGVIEDLPT 62

Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
             LP +FDWR  GAV+ VKDQG CGSCW+FS TGA+EGAHF+STG+LV LSEQQL+DCD 
Sbjct: 63  KHLPAEFDWRFKGAVSRVKDQGQCGSCWTFSTTGAIEGAHFISTGKLVELSEQQLLDCDV 122

Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
            CDP+   +CDSGCNGGL ++A EYI++ GG++ EK YPY G + G CK D+  + A + 
Sbjct: 123 GCDPDVPNACDSGCNGGLPSNAMEYIVEHGGIDTEKSYPYVG-EKGECKADEGTLGATLK 181

Query: 255 NFSVISSDEDQMAANLVKHGPLA 277
           NFS +SSDE QMAA LVKHGPL+
Sbjct: 182 NFSYVSSDEKQMAAALVKHGPLS 204


>gi|1353726|gb|AAB01769.1| cysteine proteinase homolog, partial [Naegleria fowleri]
          Length = 347

 Score =  227 bits (579), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 117/235 (49%), Positives = 159/235 (67%), Gaps = 15/235 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  F  K++K Y T EEH+ R+++FKAN+ +++    +      G+TKFSDLTP EF+R 
Sbjct: 33  FIKFSRKYAKVYGT-EEHNNRYQIFKANVEKSRYYNHVGKRENFGITKFSDLTPEEFKRM 91

Query: 113 FLGLNRRLRLPADAQKAPILPTNDL---------PTDFDWRDHGAVTGVKDQGACGSCWS 163
           FL    +   P +A+K    P + +         PT FDWR HGAVT VK+QGACGSCW+
Sbjct: 92  FL---MKTYTPEEAKKILAAPQHAVLSEKEVQTAPTSFDWRQHGAVTRVKNQGACGSCWT 148

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYILK 222
           FS TG +EG   +  G+LVSLSEQQLVDCDH C   +   +CDSGCNGGLM SAF+Y++K
Sbjct: 149 FSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGGLMWSAFQYVIK 208

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            GG++ E  YPY G D  +C+F+KS +AA +S+++ ISSDE+QMAA L  +GP++
Sbjct: 209 NGGLDTEDSYPYEGVD-DTCRFNKSNVAATISSWTSISSDENQMAAWLAANGPIS 262


>gi|290980288|ref|XP_002672864.1| predicted protein [Naegleria gruberi]
 gi|284086444|gb|EFC40120.1| predicted protein [Naegleria gruberi]
          Length = 356

 Score =  226 bits (576), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 125/289 (43%), Positives = 174/289 (60%), Gaps = 30/289 (10%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M +LIL  ++LL+ S +LA   A    +A+          +SE   L     F+ F+ K 
Sbjct: 1   MNKLIL--VVLLVASFILAIEAAKGPFNAL---------PESEMQQL-----FTQFRRKH 44

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG----- 115
            K Y T++  D R+++FK N+ RA+    L      GVT+FSDLTP EF+  FL      
Sbjct: 45  VKLYGTKQVQDRRYQIFKQNVERARFENYLTERDNMGVTRFSDLTPDEFKSMFLMKSYTP 104

Query: 116 ------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
                 L+   + PA+A K  +   +D P +FDWR+H AVT VKDQG CGSCW+FS TG 
Sbjct: 105 KQARELLSGMRQYPANA-KLTMKQVSDAPKEFDWREHNAVTPVKDQGNCGSCWTFSTTGN 163

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEES-GSCDSGCNGGLMNSAFEYILKAGGVER 228
           +EG +   TG+L+SLSEQQLVDCDH C   E   +C++GCNGGLM S+FE+I+K GG+  
Sbjct: 164 VEGMYAAKTGKLISLSEQQLVDCDHNCVVWEGEKTCNAGCNGGLMWSSFEHIIKTGGLVT 223

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           E+ YPY   D   C+F+ S     +SN++ +SS+ED+MAA L  +GP+A
Sbjct: 224 EESYPYEAVD-NRCRFNVSNAVVKISNWTFVSSNEDEMAAWLANNGPIA 271


>gi|303275866|ref|XP_003057227.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461579|gb|EEH58872.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 329

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 120/243 (49%), Positives = 153/243 (62%), Gaps = 21/243 (8%)

Query: 50  EHHFSLFKSKFSKTYATQ-EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSE 108
           E  F  F  +  KTYA+  +E+  R  +F  N+ RAK     D  A +G T F+DLT  E
Sbjct: 5   ERDFDAFVLEHGKTYASDAKEYAKRLEIFAENMARAKEMSARD-GAEYGATPFADLTEDE 63

Query: 109 FRRQFLGLNRRLRLPADAQKA------------PILPTNDLPTDFDWRDHGAVTGVKDQG 156
           F    L     +R P DA +             P LPT ++P +FDWR  GAVT VK+QG
Sbjct: 64  FASSLL-----MREPIDAARVERLKRHESSRVLPHLPTENIPLNFDWRALGAVTPVKNQG 118

Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
            CGSCWSFSATGA+EGAHF+ +G LVSLSEQQLVDCDH CDP+   +CDSGC+GGL  +A
Sbjct: 119 MCGSCWSFSATGAVEGAHFVKSGALVSLSEQQLVDCDHTCDPDSGTACDSGCDGGLPANA 178

Query: 217 FEYILKAGGVEREKDYPYTGTDG-GSCKF-DKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
             Y++K GG++ E  YPY G  G G CK  +    AA ++N+S +S+DE Q+AA LVKHG
Sbjct: 179 MAYVVKRGGLDAEAAYPYLGARGDGRCKSKEDGPPAATITNYSFVSADESQIAAALVKHG 238

Query: 275 PLA 277
           PL+
Sbjct: 239 PLS 241


>gi|290997496|ref|XP_002681317.1| cysteine protease [Naegleria gruberi]
 gi|284094941|gb|EFC48573.1| cysteine protease [Naegleria gruberi]
          Length = 350

 Score =  222 bits (566), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 110/235 (46%), Positives = 151/235 (64%), Gaps = 15/235 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  F  K +K Y   E+H  R+++FK+N+ +A+    +      GV+KF DLTP EF+R 
Sbjct: 36  FVKFSKKHAKLYGA-EDHGKRYQIFKSNVEKARYYNHVGKRETFGVSKFMDLTPEEFKRM 94

Query: 113 FLGLNRRLRLPADAQKAPILP---------TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           FL    +   P +A+K    P           D PT +DWR  GAVT VK+QGACGSCW+
Sbjct: 95  FL---MKTYTPEEARKILAAPKEAVVTAQQVKDTPTSWDWRQKGAVTPVKNQGACGSCWT 151

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYILK 222
           FS TG +EG H + TG+LVSLSEQQLVDCDH C   +   +CD+GCNGGLM SAF+Y++K
Sbjct: 152 FSTTGNVEGIHQIKTGKLVSLSEQQLVDCDHNCVTYQGQQACDAGCNGGLMWSAFQYVIK 211

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            GG+  E  YPY G D  +C+F+KS +A  +++++ I SDE +MAA L  +GP++
Sbjct: 212 TGGLVTEDSYPYEGVD-DTCRFNKSNVAVTINSWTSIPSDEGKMAAWLAANGPIS 265


>gi|281209544|gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum PN500]
          Length = 465

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 118/244 (48%), Positives = 151/244 (61%), Gaps = 14/244 (5%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANL-------RRAKRRQLLDPTAVHGVTKFS 102
           E  F  F+ K++K Y T  E+  RF  FK+NL       R A  R+    +   GV +F+
Sbjct: 25  ETQFRQFQIKYNKQY-TSSEYAERFATFKSNLKVIDEKNRDAASRK---SSVRFGVNEFA 80

Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           DL+ SEFR  +L   + +R P +A  A  LP  DLPT FDWR  GAVTGVK+QG CGSCW
Sbjct: 81  DLSQSEFRATYLNSVQAVRDP-NAAVAADLPVEDLPTAFDWRTKGAVTGVKNQGQCGSCW 139

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYIL 221
           SFS TG +EG  FL+   L  LSEQ LVDCDHEC +      CD GCNGGL  +A+ YI+
Sbjct: 140 SFSTTGNVEGQWFLAGNTLTGLSEQNLVDCDHECMEYLGDNVCDQGCNGGLQPNAYTYII 199

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
           K GG++ E  YPY G D G+C F  + I A +SN++ +SS+E QMAA LV +GPLA    
Sbjct: 200 KNGGIDTEASYPYQGVD-GTCSFKAANIGAKISNWTYVSSNETQMAAYLVANGPLAIAAD 258

Query: 282 SIEL 285
           ++E 
Sbjct: 259 AVEW 262


>gi|353441042|gb|AEQ94105.1| putative drought-inducible cysteine proteinase [Elaeis guineensis]
          Length = 187

 Score =  218 bits (556), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 115/175 (65%), Positives = 138/175 (78%), Gaps = 7/175 (4%)

Query: 11  LLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHL-LNAEHHFSLFKSKFSKTYATQEE 69
           + L +SV +S  +  +DD +I QVVP   E  ED L LNAE HFS F  +F K+YA ++E
Sbjct: 15  VALSASVASSWPSYAEDDPLIVQVVP---ESDEDELRLNAEAHFSSFLRRFGKSYADEKE 71

Query: 70  HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP---ADA 126
           H YRF VFKANLRRA+R Q +DPTAVHG+TKFSDLTP+EFRR +LGL    RL    A +
Sbjct: 72  HAYRFSVFKANLRRARRHQKMDPTAVHGITKFSDLTPAEFRRTYLGLRGGRRLRRALASS 131

Query: 127 QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
            +APILPTN+LPTDFDWRDHGAVTGVKDQG+CGSCWSFSA+GALEGA+FL+TG+L
Sbjct: 132 HEAPILPTNNLPTDFDWRDHGAVTGVKDQGSCGSCWSFSASGALEGANFLATGQL 186


>gi|290984408|ref|XP_002674919.1| predicted protein [Naegleria gruberi]
 gi|284088512|gb|EFC42175.1| predicted protein [Naegleria gruberi]
          Length = 353

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 104/236 (44%), Positives = 152/236 (64%), Gaps = 14/236 (5%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           HF  F  KF + Y   EE++YR +VF+ N+  ++R  + +    +G+TKFSDLT  EFR+
Sbjct: 36  HFLDFTRKFQRFYKGPEEYEYRLKVFRENIETSRRMNIREGNNNYGITKFSDLTSDEFRK 95

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL---------PTDFDWRDHGAVTGVKDQGACGSCW 162
            +L      + P + QK   + +N +         P  +DWR+HGA+TGVKDQG CGSCW
Sbjct: 96  FYL---MEKKTPKEIQKMMRMDSNKMVSNSYAKPAPDHYDWRNHGAITGVKDQGQCGSCW 152

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP-EESGSCDSGCNGGLMNSAFEYIL 221
           +FSA G++EG++ +   +LVS SEQQLVDCD+ C   E   SCD GCNGGL  SA++Y++
Sbjct: 153 AFSAIGSIEGSYAIKHKQLVSFSEQQLVDCDNNCVTFENQQSCDDGCNGGLQWSAYQYLM 212

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           KAGGV  EKDYPY   +   C+   +   A +SN++++S++E +MA  L ++GP+A
Sbjct: 213 KAGGVVTEKDYPYYA-ERYKCEVKPANFVAKLSNWTMLSTNETEMANWLAENGPIA 267


>gi|66803148|ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
 gi|166201987|sp|P04988.2|CYSP1_DICDI RecName: Full=Cysteine proteinase 1; Flags: Precursor
 gi|60463731|gb|EAL61909.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
          Length = 343

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 111/239 (46%), Positives = 145/239 (60%), Gaps = 10/239 (4%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFS 102
           L  +  F  F+ KF+K Y + EE+  RF +FK+NL + +   L+          GV KF+
Sbjct: 23  LEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFA 81

Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACG 159
           DL+  EF+  +L  N+      D   A  L     N +PT FDWR  GAVT VK+QG CG
Sbjct: 82  DLSSDEFKNYYLN-NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCG 140

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFE 218
           SCWSFS TG +EG HF+S  +LVSLSEQ LVDCDHEC + E   +CD GCNGGL  +A+ 
Sbjct: 141 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYN 200

Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YI+K GG++ E  YPYT   G  C F+ + I A +SNF++I  +E  MA  +V  GPLA
Sbjct: 201 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLA 259


>gi|255088003|ref|XP_002505924.1| cysteine endopeptidase [Micromonas sp. RCC299]
 gi|226521195|gb|ACO67182.1| cysteine endopeptidase [Micromonas sp. RCC299]
          Length = 291

 Score =  215 bits (547), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 110/199 (55%), Positives = 138/199 (69%), Gaps = 6/199 (3%)

Query: 84  AKRRQLLD-PTAVHGVTKFSDLTPSEFRRQFLGLNRR----LRLPADAQKAPILPTNDLP 138
           A  RQ  D  +AVHGVT+FSDLTP+EF   FLG          + +     P  P +DLP
Sbjct: 4   AAERQAQDRGSAVHGVTQFSDLTPTEFASTFLGTKLANEDVAAIRSGMTTLPDYPAHDLP 63

Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
            +FDWR+ GAVT VK+QGACGSCW+FSATGA+EGA+FL TGELVSLSEQQLVDCDH CDP
Sbjct: 64  LEFDWRERGAVTPVKNQGACGSCWTFSATGAVEGANFLKTGELVSLSEQQLVDCDHTCDP 123

Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
               +CD GCNGGL  +A  Y+ K  G++ E +YPY G DG          AA+VS+F++
Sbjct: 124 SAPRNCDYGCNGGLPLNAMRYVQKH-GLDTESNYPYKGVDGKCASARHGPAAASVSSFNL 182

Query: 259 ISSDEDQMAANLVKHGPLA 277
           +S++E Q+AA L+KHGPL+
Sbjct: 183 VSTNETQIAAALLKHGPLS 201


>gi|1617037|emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium discoideum]
          Length = 343

 Score =  214 bits (544), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 110/236 (46%), Positives = 144/236 (61%), Gaps = 10/236 (4%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLT 105
           +  F  F+ KF+K Y + EE+  RF +FK+NL + +   L+          GV KF+DL+
Sbjct: 26  QSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
             EF+  +L  N+      D   A  L     N +PT FDWR  GAVT VK+QG CGSCW
Sbjct: 85  SDEFKNYYLN-NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCW 143

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYIL 221
           SFS TG +EG HF+S  +LVSLSEQ LVDCDHEC + E   +CD GCNGGL  +A+ YI+
Sbjct: 144 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 203

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           K GG++ E  YPYT   G  C F+ + I A +SNF++I  +E  MA  +V  GPLA
Sbjct: 204 KNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLA 259


>gi|412992445|emb|CCO18425.1| unknown [Bathycoccus prasinos]
          Length = 500

 Score =  213 bits (542), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 106/229 (46%), Positives = 149/229 (65%), Gaps = 21/229 (9%)

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLD----PTAVHGVTKFSDLTPSEFRRQFLGL----- 116
           T+EE++ R  +F+ N +RA  R++ D     +A HGVTKF DL+  EFR Q+LGL     
Sbjct: 188 TEEEYEKRMEIFQENWKRAIEREIDDRKGGGSAKHGVTKFFDLSEEEFREQYLGLLSTST 247

Query: 117 --------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
                    R+ ++ A +++        LP  +DWR  GAVT VKDQG CGSCW+FS TG
Sbjct: 248 SSSASKDAFRKHQMEAPSEE----DLEKLPQYYDWRARGAVTPVKDQGQCGSCWTFSTTG 303

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           A+EGA+F+ TG+LVSLSEQQL+DCD  C P+   +CDSGCNGGL ++A EYI++ GG++ 
Sbjct: 304 AIEGANFIKTGKLVSLSEQQLLDCDVGCAPDIPNACDSGCNGGLPSNAMEYIVEHGGLDT 363

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           EK YPY      +C+  + K+ A +SN++ +  +E  MA  LVK+GPL+
Sbjct: 364 EKSYPYKAYKEDTCRAKEGKLGATISNYTFVGKNETHMAHALVKYGPLS 412


>gi|145351119|ref|XP_001419933.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580166|gb|ABO98226.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 272

 Score =  209 bits (533), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 107/205 (52%), Positives = 135/205 (65%), Gaps = 11/205 (5%)

Query: 101 FSDLTPSEFRRQFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKD 154
           FSDLT  EF  ++LG        R  R     +    LP   LP +FDWR  GAVT VKD
Sbjct: 2   FSDLTAEEFAARYLGHVRLSSEEREKRKARGGETLETLPVEHLPEEFDWRFKGAVTRVKD 61

Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
           QG CGSCW+FS TGA+EGAHF+STG+LV LSEQQLVDCD  CDP+   +CDSGCNGGL +
Sbjct: 62  QGQCGSCWTFSTTGAIEGAHFISTGKLVELSEQQLVDCDVGCDPDVPNACDSGCNGGLPS 121

Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
           +A EYI++ GG++ EK YPY G + G CK  K K+ A + NFS +S DE QMAA LVK+G
Sbjct: 122 NAMEYIVEHGGIDTEKSYPYVG-EKGECKAKKGKLGATLKNFSFVSDDEKQMAAALVKYG 180

Query: 275 PLAGNVASIELPHISFSFLFTVSSP 299
           PL+  + +  +     S++  V+ P
Sbjct: 181 PLSIGINAAWMQ----SYIGGVACP 201


>gi|330792958|ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
 gi|325085467|gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
          Length = 346

 Score =  209 bits (533), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 107/237 (45%), Positives = 149/237 (62%), Gaps = 12/237 (5%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLT 105
           +  F  F+ K++K Y++  E+  +F  FKANL    +  ++ +L       GV +F+DL+
Sbjct: 26  QTQFVAFQQKYNKVYSS-NEYSAKFETFKANLGVIAQLNQKAKLHKSDTKFGVNEFADLS 84

Query: 106 PSEFRRQFLGLNRRLRLP-ADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACGSC 161
            +EFR+ +L  N ++  P A    AP+L    L   PT FDWR  GAVTGVK+QG CGSC
Sbjct: 85  AAEFRKYYL--NAQVAKPDASLPMAPLLTEEVLETIPTAFDWRTKGAVTGVKNQGQCGSC 142

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCNGGLMNSAFEYI 220
           WSFS TG +EG  +L+   LV LSEQ LVDCDH+C + +   SCD+GC+GGL  +A+ Y+
Sbjct: 143 WSFSTTGNIEGQWYLAGNTLVGLSEQNLVDCDHQCMEYDGQKSCDAGCDGGLQPNAYRYV 202

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           ++ GG++ E  YPY    G SCKF    +AA +SNF++I  +E QMA  L  HGPLA
Sbjct: 203 IENGGLDSENSYPYLAVTGDSCKFKSGNVAAKISNFTMIPQNETQMAGYLATHGPLA 259


>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
 gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
          Length = 2676

 Score =  205 bits (522), Expect = 2e-50,   Method: Composition-based stats.
 Identities = 108/236 (45%), Positives = 143/236 (60%), Gaps = 13/236 (5%)

Query: 45   HLLNAEHHFSLFKSKFSKTYAT-QEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
            H L AEH F  F S +   Y   + +   RF +FK N+R+       +  TA +GVT+F+
Sbjct: 2363 HHLQAEHLFYEFLSTYKPEYIDDRHQMRQRFEIFKENVRKMHELNTHERGTATYGVTRFA 2422

Query: 103  DLTPSEFRRQFLGLNRRLRLPADAQ-KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
            DLT  EF  + +G+   LR P   Q +  ++P    P  FDWRDHGAVTGVKDQG+CGSC
Sbjct: 2423 DLTYEEFSTKHMGMKASLRDPNQVQFRKAVIPNVTAPDSFDWRDHGAVTGVKDQGSCGSC 2482

Query: 162  WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
            W+FS TG +EG   + TG+LVSLSEQ+LVDCD           D GCNGGL ++A+  I 
Sbjct: 2483 WAFSVTGNIEGQWKMKTGDLVSLSEQELVDCD---------KLDQGCNGGLPDNAYRAIE 2533

Query: 222  KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            + GG+E E DYPY G+D   C F+K+     +S    I+S+E  MA  LVKHGP++
Sbjct: 2534 QLGGLESEDDYPYEGSD-DKCSFNKTLARVQISGAVNITSNETDMAKWLVKHGPIS 2588


>gi|405977658|gb|EKC42097.1| Cathepsin F [Crassostrea gigas]
          Length = 715

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 105/226 (46%), Positives = 148/226 (65%), Gaps = 12/226 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F++ F + Y +++E   RF++F  N+R+AK+ Q ++  TAV+GVTKF+D++ SEF+ 
Sbjct: 418 FQQFQAAFKRLYMSKQEEKTRFKIFCENMRKAKKLQDVEKGTAVYGVTKFADMSESEFK- 476

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
           Q++G           +KA I   N LP  FDWR+HGAVT VK+QG+CGSCW+FS TG +E
Sbjct: 477 QYVGKVWDQNANKGMKKAKIPEMNSLPNSFDWREHGAVTEVKNQGSCGSCWAFSTTGNIE 536

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           G   +S  +LVSLSEQ+LVDCD           D GCNGGL + A++ I++ GG+E E D
Sbjct: 537 GQWAISKKKLVSLSEQELVDCD---------KVDEGCNGGLPSQAYKEIIRLGGLETETD 587

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           Y Y G +   C  DKSKI   ++    ISS+E +MAA LVK+GP++
Sbjct: 588 YKYRGHN-EKCSMDKSKIRVKINGSVSISSNETEMAAWLVKNGPIS 632


>gi|5777611|emb|CAB53397.1| cysteine protease [Medicago sativa]
          Length = 209

 Score =  201 bits (510), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 99/140 (70%), Positives = 114/140 (81%), Gaps = 10/140 (7%)

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGS W+FS TGALEGA++L+TG+LVSLSEQQLVDCDH CDPEE  SCDSGCNGGLMN+AF
Sbjct: 1   CGSGWAFSTTGALEGANYLATGKLVSLSEQQLVDCDHVCDPEERNSCDSGCNGGLMNNAF 60

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           EYIL++GGV  EKDY YTG D GSCKFDKSKI A+VSNFSV+S DEDQ+AANLVK+GPLA
Sbjct: 61  EYILQSGGVVSEKDYAYTGRD-GSCKFDKSKIVASVSNFSVVSLDEDQIAANLVKNGPLA 119

Query: 278 GNV---------ASIELPHI 288
             +         + +  PHI
Sbjct: 120 VAINAAWMQTYMSGVSCPHI 139


>gi|66803062|ref|XP_635374.1| cysteine protease [Dictyostelium discoideum AX4]
 gi|60463697|gb|EAL61879.1| cysteine protease [Dictyostelium discoideum AX4]
          Length = 352

 Score =  200 bits (509), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 106/248 (42%), Positives = 152/248 (61%), Gaps = 26/248 (10%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLT 105
           E  F  F++K++K Y+  EE+  +F  FK+NL       K+   +      GV KF+DL+
Sbjct: 24  ESQFIAFQNKYNKIYSA-EEYLVKFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLS 82

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILP--TNDL----PTDFDWRDHGA---------VT 150
             EF++ +L  ++  RL  D    P+LP  ++D+    P  FDWR+ G          VT
Sbjct: 83  KEEFKKYYLS-SKEARLTDDL---PMLPNLSDDIISATPAAFDWRNTGGSTKFPQGTPVT 138

Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC-DPEESGSCDSGCN 209
            VK+QG CGSCWSFS TG +EG H+LSTG LV LSEQ LVDCDH C   E    C++GC+
Sbjct: 139 AVKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCD 198

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
           GGL  +A+ YI+K GG++ E  YPYT  D G CKF+ +++ A +S+F+++  +E Q+A+ 
Sbjct: 199 GGLQPNAYNYIIKNGGIQTEATYPYTAVD-GECKFNSAQVGAKISSFTMVPQNETQIASY 257

Query: 270 LVKHGPLA 277
           L  +GPLA
Sbjct: 258 LFNNGPLA 265


>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
          Length = 586

 Score =  200 bits (509), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 101/239 (42%), Positives = 146/239 (61%), Gaps = 11/239 (4%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
           D  L  +  F  F    +K Y + EE   RFR+F AN+++ K  Q  +  +A++G T+F+
Sbjct: 271 DDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFA 330

Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           DLT +EF++++LGL+  +        A I  +  +P +FDWR+H  VT VK+QGACGSCW
Sbjct: 331 DLTKNEFKKKYLGLDSSMTSKKTLPMAVIPQSASIPNEFDWRNHNVVTPVKNQGACGSCW 390

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA   +EG + L + EL+SLSEQ+L+DCD+          D+GC GGLM  AFE +  
Sbjct: 391 AFSAIANIEGQYALKSKELLSLSEQELIDCDN---------LDNGCGGGLMTQAFEAVEN 441

Query: 223 AGGVEREKDYPYTG-TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
            GG+E E DYPY G  D   C+  KS +  ++S    +S+DE+ +A  LVKHGPL+  V
Sbjct: 442 LGGLETESDYPYEGHADRKGCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGV 500


>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
          Length = 586

 Score =  200 bits (508), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 101/239 (42%), Positives = 146/239 (61%), Gaps = 11/239 (4%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
           D  L  +  F  F    +K Y + EE   RFR+F AN+++ K  Q  +  +A++G T+F+
Sbjct: 271 DDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFA 330

Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           DLT +EF++++LGL+  +        A I  +  +P +FDWR+H  VT VK+QGACGSCW
Sbjct: 331 DLTKNEFKKKYLGLDSSMTSKKTLPMAVIPQSASIPNEFDWRNHNVVTPVKNQGACGSCW 390

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA   +EG + L + EL+SLSEQ+L+DCD+          D+GC GGLM  AFE +  
Sbjct: 391 AFSAIANIEGQYALKSKELLSLSEQELIDCDN---------LDNGCGGGLMTQAFEAVEN 441

Query: 223 AGGVEREKDYPYTG-TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
            GG+E E DYPY G  D   C+  KS +  ++S    +S+DE+ +A  LVKHGPL+  V
Sbjct: 442 LGGLETESDYPYEGHADRKGCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGV 500


>gi|223648298|gb|ACN10907.1| Cathepsin F precursor [Salmo salar]
          Length = 474

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 104/227 (45%), Positives = 144/227 (63%), Gaps = 11/227 (4%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
            F  F  ++++TY++QEE D R RVF  NL+ A++ Q LD  TA +GVTKFSDLT  EFR
Sbjct: 175 QFKEFMVRYNRTYSSQEEADRRLRVFHENLKTAEKLQSLDQGTAEYGVTKFSDLTEEEFR 234

Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
             +L      +    + K   +P    P  +DWR+HGAV+ VK+QG CGSCW+FS TG +
Sbjct: 235 TLYLNPLLSQQNLQQSMKPAAMPRGPAPPSWDWREHGAVSPVKNQGMCGSCWAFSVTGNI 294

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  F  TG+LVSLSEQ+LVDCD         + D  C GGL ++A+E I K GG+E E 
Sbjct: 295 EGQWFAKTGKLVSLSEQELVDCD---------TVDQACGGGLPSNAYEAIEKLGGLETET 345

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY YTG    SC F   K+ A +++   +S+DE+++AA L ++GP++
Sbjct: 346 DYSYTGKK-QSCDFTTDKVIAYINSSVELSTDENEIAAWLAENGPVS 391


>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
 gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
          Length = 475

 Score =  199 bits (505), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 106/239 (44%), Positives = 150/239 (62%), Gaps = 12/239 (5%)

Query: 40  EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGV 98
           E++ED  +     F  F  ++++TY++QE+ D R R+F  NL+ A++ Q LD  TA +GV
Sbjct: 165 EETED-FVELLGQFKEFMVRYNRTYSSQEDTDRRLRIFHENLKTAEKLQSLDLGTAEYGV 223

Query: 99  TKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGAC 158
           TKFSDLT  EFR  +L      +    + K   +P    P  +DWR+HGAV+ VK+QG C
Sbjct: 224 TKFSDLTEEEFRTLYLNPLLSQQKLQRSMKPAAMPHGPAPPSWDWREHGAVSPVKNQGMC 283

Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
           GSCW+FS TG +EG  F+ TG+LVSLSEQ+LVDCD         + D  C GGL ++A+E
Sbjct: 284 GSCWAFSVTGNIEGQWFVKTGKLVSLSEQELVDCD---------TADQACGGGLPSNAYE 334

Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            I K GGVE E DY YTG    SC F   K+ A +++   +S DE+++AA L ++GP++
Sbjct: 335 AIEKLGGVETETDYSYTGKK-QSCDFTTDKVTAYINSSVELSKDENEIAAWLAENGPVS 392


>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
          Length = 1036

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 109/257 (42%), Positives = 151/257 (58%), Gaps = 16/257 (6%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLT 105
           L  E  F  F  K+ K Y  +EE + RF++FK NL   +  Q  +  T  +GVT+F+DLT
Sbjct: 725 LKEEILFHEFMGKYKKMYHNKEEKEMRFQIFKDNLNLIEELQRNEMGTGRYGVTQFTDLT 784

Query: 106 PSEFRRQFLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
            +EF+ + LGL   L+   D       +P  +LP+D+DWR H  VT VKDQG+CGSCW+F
Sbjct: 785 KAEFKARHLGLKPTLKSENDIPMPMATIPDIELPSDYDWRHHNVVTPVKDQGSCGSCWAF 844

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           S TG +EG + +  GEL+SLSEQ+LVDCD           DSGCNGGL ++A+  I + G
Sbjct: 845 SVTGNIEGQYAIKHGELLSLSEQELVDCD---------KLDSGCNGGLPDTAYRAIEELG 895

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA----GNV 280
           G+E E DYPY   D   C F+K+K+   + +   I+S+E QMA  LVK+GP++     N 
Sbjct: 896 GLELESDYPYDAED-EKCHFNKNKVKVNIVSGLNITSNETQMAQWLVKNGPMSIGINANA 954

Query: 281 ASIELPHISFSFLFTVS 297
               +  +S  F F  S
Sbjct: 955 MQFYMGGVSHPFKFLCS 971


>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
          Length = 326

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 109/238 (45%), Positives = 144/238 (60%), Gaps = 15/238 (6%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQ-LLD---PTAVHGVTK 100
           H L+ +  +  FK + +K+Y    E   RF +F+ +LR+ +      D    T   GVTK
Sbjct: 15  HALSDKEEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVTK 74

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
           F+DLT  EF    LG++R  +         + P  DLP+ FDWR+ GAVT VKDQG+CGS
Sbjct: 75  FADLTEKEFS-DMLGISRSTKSSRPRVIHSLTPVKDLPSKFDWREKGAVTEVKDQGSCGS 133

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CWSFS TG +EGA+FL TG+LVSLSEQ LVDC  E        C  GC+GG M+ A EYI
Sbjct: 134 CWSFSTTGTVEGAYFLKTGKLVSLSEQNLVDCAKE-------DC-YGCSGGYMDKALEYI 185

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
             AGG+  E DYPY G D   C+FD SK+AA +SNF+ I  +DED +   ++  GP++
Sbjct: 186 ETAGGIMSENDYPYEGID-DKCRFDSSKVAAKISNFTYIKKNDEDDLKNAVIAKGPIS 242


>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus
           pulchellus]
          Length = 475

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 108/228 (47%), Positives = 152/228 (66%), Gaps = 13/228 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           FS+F   ++KTY  +EEH+ RF +FK NL+R A   +L + TA +G+T+FSDL+PSEF R
Sbjct: 166 FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 225

Query: 112 QFLGLNRRL-RLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
            +LGL + L    A+ +   + P N+ LP  FDWR  GAVT VK+QG CGSCW+FS TG 
Sbjct: 226 HYLGLKKDLAEHKAEVKPIKVGPVNEPLPDLFDWRTKGAVTEVKNQGMCGSCWAFSVTGN 285

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG  FLS  +L+SLSEQ+LVDCDH          D GC GG M  A + +++ GG+E E
Sbjct: 286 VEGQWFLSRSKLLSLSEQELVDCDHG---------DHGCKGGYMGQAMKAVIEMGGLETE 336

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            +YPY G D G+C+F+K++  A V +F  +  +E ++A  L+KHGP++
Sbjct: 337 SEYPYKGVD-GTCEFNKTESKARVQSFVGLPQNETELAYWLMKHGPVS 383


>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
 gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
          Length = 463

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 110/259 (42%), Positives = 155/259 (59%), Gaps = 18/259 (6%)

Query: 22  VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
           V + D +   +Q VPS   + ED +L     F  F + ++K Y+ QEE   R ++F  NL
Sbjct: 137 VELTDTETSQKQNVPSS--ELEDEMLKTLTLFKDFVTTYNKKYSDQEEAARRLQIFSQNL 194

Query: 82  RRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLG--LNRRLRLPADAQKAPILPTNDLP 138
           ++A+  Q +D  TA +GVTK+SDLT  EFR  +L   L+ +   P    K  I+P    P
Sbjct: 195 KKAQMIQEMDQGTAEYGVTKYSDLTEDEFRSLYLNPLLSSK---PLYQMKKAIVPNMSAP 251

Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
             +DWRDHGAVT VK+QG CGSCW+FS  G +EG  FL  G LVSLSEQ+LVDCD     
Sbjct: 252 DQWDWRDHGAVTEVKNQGMCGSCWAFSVIGNIEGQWFLKKGSLVSLSEQELVDCD----- 306

Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
                 D  C GGL ++A+E I K GG+E E++Y Y G    +C F  SK++A +++   
Sbjct: 307 ----GVDHACAGGLPSNAYEAIEKLGGIETEQEYSYEGHK-NTCSFSTSKVSAYINSSVE 361

Query: 259 ISSDEDQMAANLVKHGPLA 277
           I  DE+++AA L ++GP++
Sbjct: 362 IPKDENEIAAWLAQNGPIS 380


>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
          Length = 325

 Score =  197 bits (500), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 110/236 (46%), Positives = 144/236 (61%), Gaps = 16/236 (6%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTKFS 102
           LN +  +  FK K +K+Y +  E   RFR+F+ NLR+ +    +    + T   GVTKF+
Sbjct: 17  LNDKEEWVQFKVKNNKSYKSYVEEQTRFRIFQENLRKIENHNEKYNNGESTFKFGVTKFT 76

Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           DLT  EF    L L++  R         + P  DLP+ FDWRD GAVT VKDQG CGSCW
Sbjct: 77  DLTEKEFL-DLLVLSKNARPNRTHATHLLAPLRDLPSAFDWRDKGAVTEVKDQGMCGSCW 135

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FS TG++E AHFL TG LVSLSEQ LVDC  +       +C  GC GG M+ A EYI K
Sbjct: 136 TFSTTGSVEAAHFLKTGNLVSLSEQNLVDCAKD-------TC-YGCGGGWMDKALEYIEK 187

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
            GG+  EKDYPY G D  +C+FD SK+AA +SNF+ I  +DE+ +   +   GP++
Sbjct: 188 -GGIMSEKDYPYEGVD-DNCRFDISKVAAKISNFTYIKKNDEEDLKNAVAAKGPIS 241


>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
          Length = 803

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 96/217 (44%), Positives = 140/217 (64%), Gaps = 11/217 (5%)

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
           ++Y T EE   RFR+F+AN+++A   Q  +  TA +GVT FSD++  EF++ +LGL +R 
Sbjct: 509 RSYKTTEELKKRFRIFRANMKKADYLQKTEQGTAKYGVTIFSDISSKEFKKHYLGLKKRT 568

Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
                 Q+   +P   LP ++DWR++ AVT VK+QG CGSCW+FS TG +EG + + TG 
Sbjct: 569 PDIKFKQEMAQIPNITLPEEYDWRNYNAVTPVKNQGMCGSCWAFSVTGNIEGQYAIKTGN 628

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           LVSLSEQ+LVDCD           D GC GGL  +A+  I + GG+E E DYPY+G D  
Sbjct: 629 LVSLSEQELVDCDKY---------DDGCEGGLFETAYHAIEELGGLELESDYPYSGRD-N 678

Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +C F+ S++  ++++   IS+DE  MA  LV +GP++
Sbjct: 679 TCHFNSSEVRVSITSSVNISNDETDMAKWLVANGPIS 715


>gi|296085959|emb|CBI31400.3| unnamed protein product [Vitis vinifera]
          Length = 257

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 93/174 (53%), Positives = 124/174 (71%), Gaps = 5/174 (2%)

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
           A+ A  L  + LP  FDWR+ GAVT VK QG CGSCW+FS TGA+EGAHF+ST +L++LS
Sbjct: 6   AETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFSTTGAVEGAHFISTKKLLTLS 65

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQQLVDCDH CD  +  +CDSGC GGLM +A++Y+++AGG+E E  YPYTG   G CKF 
Sbjct: 66  EQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIEAGGLEEESSYPYTGKH-GECKFK 124

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
             ++A  V NF+ +  +E+Q+AANLV HGPLA  + +I +     +++  VS P
Sbjct: 125 PDRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNAIFMQ----TYIGGVSCP 174


>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
          Length = 475

 Score =  194 bits (494), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 107/234 (45%), Positives = 143/234 (61%), Gaps = 25/234 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
            F  F  K++K Y++Q+E D R  +F  NL+ A++ Q LD  +A +GVTKFSDLT  EFR
Sbjct: 176 QFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSDLTEEEFR 235

Query: 111 RQFLG-------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
             +L        L+R ++ PA   K P       P  +DWRDHGAV+ VK+QG CGSCW+
Sbjct: 236 STYLNPLLSQWTLHRPMK-PASPAKGPA------PASWDWRDHGAVSSVKNQGMCGSCWA 288

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG +EG  FL  G LVSLSEQ+LVDCD           D  CNGGL ++A+E I K 
Sbjct: 289 FSVTGNIEGQWFLKNGTLVSLSEQELVDCD---------GLDQACNGGLPSNAYEAIEKL 339

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG+E E DY Y G    SC F   K+AA +++   +S DE ++AA L ++GP++
Sbjct: 340 GGLETETDYSYIGKK-QSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVS 392


>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
          Length = 475

 Score =  194 bits (494), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 107/234 (45%), Positives = 143/234 (61%), Gaps = 25/234 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
            F  F  K++K Y++Q+E D R  +F  NL+ A++ Q LD  +A +GVTKFSDLT  EFR
Sbjct: 176 QFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSDLTEEEFR 235

Query: 111 RQFLG-------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
             +L        L+R ++ PA   K P       P  +DWRDHGAV+ VK+QG CGSCW+
Sbjct: 236 STYLNPLLSQWTLHRPMK-PASPAKGPA------PASWDWRDHGAVSSVKNQGMCGSCWA 288

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG +EG  FL  G LVSLSEQ+LVDCD           D  CNGGL ++A+E I K 
Sbjct: 289 FSVTGNIEGQWFLKNGTLVSLSEQELVDCD---------GLDQACNGGLPSNAYEAIEKL 339

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG+E E DY Y G    SC F   K+AA +++   +S DE ++AA L ++GP++
Sbjct: 340 GGLETETDYSYIGKK-QSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVS 392


>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
          Length = 884

 Score =  194 bits (493), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 105/233 (45%), Positives = 141/233 (60%), Gaps = 18/233 (7%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSE 108
           E  F  F  KF KTY + +E   RF++FK NL+  +  Q  +  TA +GVT F+DLTP E
Sbjct: 576 ETLFEAFIKKFGKTYNSADEKLDRFKIFKQNLKIIEELQTFERGTAEYGVTMFADLTPKE 635

Query: 109 FRRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           F+ ++LGL   L+      + P+    +P   LP  FDWRDH  VT VKDQG CGSCW+F
Sbjct: 636 FKARYLGLRPELK---HENEIPLPEAEIPDVSLPLKFDWRDHSVVTPVKDQGQCGSCWAF 692

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           S TG +EG + +   +L+SLSEQ+LVDCD         S D GCNGG M +A++ I + G
Sbjct: 693 SVTGNVEGQYAIKHNQLLSLSEQELVDCD---------SLDEGCNGGDMENAYKAIERLG 743

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G+E E DYPY   D   C F ++K    V +   I+SDE +MA  LVK+GP++
Sbjct: 744 GLELESDYPYDAKD-EKCHFLQNKAKVQVVSAVNITSDEKRMAQWLVKNGPIS 795


>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
           rotundata]
          Length = 884

 Score =  193 bits (491), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 105/242 (43%), Positives = 150/242 (61%), Gaps = 19/242 (7%)

Query: 39  GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHG 97
            E  +D LL     F  F   ++KTY + +E   R++VF+ NL+  ++ R+    TAV+G
Sbjct: 570 AEDYKDELL-----FEDFVKTYNKTYLSAKEKADRYKVFRKNLKMIEKLRKFEQGTAVYG 624

Query: 98  VTKFSDLTPSEFRRQFLGLNRRLRLPADA--QKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
           VT F+DLTP EF+ ++LGL   L    D   Q+A ++P  DLP  FDWR++ AVT VKDQ
Sbjct: 625 VTMFADLTPEEFKTKYLGLKTNLNQENDIPLQEA-VIPDIDLPPKFDWREYNAVTPVKDQ 683

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
           G CGSCW+FSA G +EG + +   +L+SLSEQ+LVDCD+          D GC GG M +
Sbjct: 684 GQCGSCWAFSAIGNIEGQYAIKHKKLLSLSEQELVDCDN---------LDDGCGGGYMIN 734

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
           A++ + K GG+E E DYPY   +   C F K+K    V++   I++DE +MA  LVK+GP
Sbjct: 735 AYKTVEKLGGLELETDYPYDARN-EKCHFLKNKAKVQVASALNITNDEKKMAQWLVKNGP 793

Query: 276 LA 277
           ++
Sbjct: 794 IS 795


>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
          Length = 476

 Score =  193 bits (491), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 105/232 (45%), Positives = 144/232 (62%), Gaps = 23/232 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F +K++K Y++QEE D R ++FK NL+ A++ Q LD  +A +GVTKFSDLT  EFR 
Sbjct: 178 FKEFMTKYNKVYSSQEEADRRLQIFKENLKTAEKIQSLDEGSAEYGVTKFSDLTEEEFRL 237

Query: 112 QFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
            +L         RR   PA   ++P       P  +DWRDHGAV+ VK+QG CGSCW+FS
Sbjct: 238 TYLNPLLSQWTLRRPMKPASPARSPA------PASWDWRDHGAVSPVKNQGLCGSCWAFS 291

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TG +EG  FL  G+L+SLSEQ+LVDCD           D  C GGL ++A+E I   GG
Sbjct: 292 VTGNIEGQWFLKHGKLLSLSEQELVDCD---------GLDHACRGGLPSNAYEAIEGLGG 342

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +E E DY Y+G     C F   K+AA +++   + SDE++MAA L ++GP++
Sbjct: 343 LEAENDYTYSGHK-QKCSFATEKVAAYINSSVELPSDENEMAAWLAENGPVS 393


>gi|356519401|ref|XP_003528361.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
           [Glycine max]
          Length = 205

 Score =  190 bits (482), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 102/179 (56%), Positives = 120/179 (67%), Gaps = 14/179 (7%)

Query: 21  AVAVNDDDAMIRQVVP-----SDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
            V    DD +IRQVVP     +  ++ EDHLLN EHHF+ FK+KF K Y T+EEH+ RF 
Sbjct: 17  VVTSTTDDILIRQVVPDAVSEATEKEDEDHLLNEEHHFTSFKAKFGKKYVTKEEHNRRFG 76

Query: 76  VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN 135
           VFK+NL RA+    LDP+ VH +TK SDLT +EFRR        L   A+  KAP     
Sbjct: 77  VFKSNLHRARLHAKLDPSVVHNITKLSDLTSTEFRRX-FLSLXLLCFLANTHKAP----- 130

Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
               DFDW D GA+T VKDQGACG CWSFS T +LEGAH+L+TGEL SLSEQQLVDCDH
Sbjct: 131 ---KDFDWXDKGAITNVKDQGACGLCWSFSTTRSLEGAHYLATGELGSLSEQQLVDCDH 186


>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
          Length = 325

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 98/231 (42%), Positives = 142/231 (61%), Gaps = 16/231 (6%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           NA   +  FK  + K YA +++   RF +FK NL RA++ Q+ +  TA +GVT+FSDLTP
Sbjct: 27  NARELYEQFKRDYGKAYANEDDQK-RFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTP 85

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            EF  ++LGL    R+     +  +      P   DWR+ GAV  +++QG+CGSCW+FS 
Sbjct: 86  EEFEAKYLGL----RIDEQVDRVQLNDLQTAPASVDWREKGAVGPIENQGSCGSCWAFSV 141

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
            G +EG  FL TG LVSLS+QQLVDCD         + D+GC GG     ++ I + GG+
Sbjct: 142 VGNIEGQWFLKTGYLVSLSKQQLVDCD---------TVDNGCYGGYPPYTYKEIKRMGGL 192

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           E + DYPYTG  G  C+ D+SK+ A + +  V+ +DE++ AA L +HGP++
Sbjct: 193 ELQSDYPYTGW-GHGCRLDRSKLFAKIDDSIVLEADEEKQAAWLAEHGPMS 242


>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
          Length = 475

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 104/234 (44%), Positives = 142/234 (60%), Gaps = 25/234 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
            F  F +K++K Y++QEE D R R+F  NL+ A++ Q LD  +A +GVTKFSDLT  EFR
Sbjct: 176 QFKEFMTKYNKVYSSQEEVDRRLRIFHENLKTAEKLQALDQGSAEYGVTKFSDLTEEEFR 235

Query: 111 RQFLG-------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
             +L        L++ ++ PA   K P       P  +DWRDHGAV+ VK+QG CGSCW+
Sbjct: 236 STYLNPLLSQWTLHQPMK-PATPAKGPS------PDSWDWRDHGAVSPVKNQGMCGSCWA 288

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS  G +EG  FL  G L+SLSEQ+LVDCD           D  C GGL ++A+E I K 
Sbjct: 289 FSVIGNIEGQWFLKNGTLLSLSEQELVDCD---------GLDQACRGGLPSNAYEAIEKL 339

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG+E E DY YTG     C F   K+AA +++   +  DE ++AA L ++GP++
Sbjct: 340 GGLETESDYSYTGHK-QRCDFTTGKVAAYINSSVELPKDEKEIAAWLAENGPVS 392


>gi|85068708|gb|ABC69434.1| cysteine protease [Clonorchis sinensis]
 gi|85068710|gb|ABC69435.1| cysteine protease [Clonorchis sinensis]
          Length = 328

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 115/298 (38%), Positives = 157/298 (52%), Gaps = 39/298 (13%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           RL +  +L+  + S LA    V  D                    NA   +  FK K+ K
Sbjct: 2   RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           TY+  ++ + RF +FK NL RAKR Q ++  TA +GVT+FSDLT  EF+ ++L    R+R
Sbjct: 42  TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96

Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                    + P  D+  D   FDWR+HGAV  V DQG CGSCW+FS  G +EG  F  T
Sbjct: 97  FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+L++LSEQQLVDCDH          D GCNGG     +  I K GG+E   DYPYTG D
Sbjct: 157 GDLLALSEQQLVDCDH---------LDKGCNGGYPPKTYGEIEKMGGLELASDYPYTGVD 207

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTV 296
            G C  ++SK  A V++ +V+   E   A  L + GPL+  + ++ L       +F +
Sbjct: 208 -GICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 264


>gi|156389068|ref|XP_001634814.1| predicted protein [Nematostella vectensis]
 gi|156221901|gb|EDO42751.1| predicted protein [Nematostella vectensis]
          Length = 276

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 95/207 (45%), Positives = 137/207 (66%), Gaps = 17/207 (8%)

Query: 74  FRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFL--GLNRRLRLPADAQKAP 130
            ++F++N+R+A + Q +D  TA +G T FSDL+  EFR+Q +  G  + L    DA+   
Sbjct: 1   MKIFESNMRKAAKMQKMDSGTAQYGPTIFSDLSEEEFRKQKMMPGWGKPLYEMKDAE--- 57

Query: 131 ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLV 190
            +P  D+P   DWRD G VT VK+QG+CGSCW+FS TG +EG + + TG+LVSLSEQ+LV
Sbjct: 58  -IPLGDIPESVDWRDKGVVTPVKNQGSCGSCWAFSTTGNIEGQYAIKTGKLVSLSEQELV 116

Query: 191 DCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA 250
           DCD         + D GC GGL ++A++ I K GG+E E DYPY G D   CKF+K+++ 
Sbjct: 117 DCD---------TIDKGCEGGLPSNAYKQIEKLGGLESESDYPYKGAD-SKCKFNKAEVK 166

Query: 251 AAVSNFSVISSDEDQMAANLVKHGPLA 277
             +++  VIS DE ++AA L K+GP++
Sbjct: 167 VTINSSVVISKDEKEIAAWLAKNGPIS 193


>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
          Length = 361

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 108/246 (43%), Positives = 152/246 (61%), Gaps = 31/246 (12%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           FS+F   ++KTY  +EEH+ RF +FK NL+R A   +L + TA +G+T+FSDL+PSEF R
Sbjct: 34  FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 93

Query: 112 QFLGLNRRL-RLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWS------ 163
            +LGL + L    A+ +   + P N+ LP  FDWR  GAVT VK+QG CGSCW+      
Sbjct: 94  HYLGLKKDLAEHKAEVKPIKVGPVNEPLPDLFDWRTKGAVTEVKNQGMCGSCWAFSXXTE 153

Query: 164 ------------FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGG 211
                       FS TG +EG  FLS  +L+SLSEQ+LVDCDH          D GC GG
Sbjct: 154 VKNQGMCGSCWAFSVTGNVEGQWFLSRSKLLSLSEQELVDCDHG---------DHGCKGG 204

Query: 212 LMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLV 271
            M  A + +++ GG+E E +YPY G D G+C+F+K++  A V +F  +  +E ++A  L+
Sbjct: 205 YMGQAMKAVIEMGGLETESEYPYKGVD-GTCEFNKTESKARVQSFVGLPQNETELAYWLM 263

Query: 272 KHGPLA 277
           KHGP++
Sbjct: 264 KHGPVS 269


>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
          Length = 325

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 98/239 (41%), Positives = 146/239 (61%), Gaps = 16/239 (6%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           +A   +  FK  + K+YA  ++   RF +FK NL RA+  QL +  TA +GVT+FSDLTP
Sbjct: 27  SARELYEQFKRDYGKSYANDDDEK-RFAIFKDNLVRAQNYQLQEQGTARYGVTQFSDLTP 85

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            EF  +FL      R     ++  +      P   DWR+ GAV  V+DQG+CGSCW+FS 
Sbjct: 86  EEFAAKFLSS----RFDDQVERVQLNDLKAAPESVDWRELGAVAPVEDQGSCGSCWAFSV 141

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
            G +EG  FL TG+LVSLS+QQLVDCD +         DSGC+GG   + +  I++ GG+
Sbjct: 142 AGNVEGQWFLKTGQLVSLSKQQLVDCDVQ---------DSGCDGGYPPTTYGEIIRMGGL 192

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
           E ++DYPY G +   CK D+SK+ A +++  V+ ++E + AA + +HGP++  + ++ L
Sbjct: 193 EAQRDYPYVGRE-QPCKLDESKLLAKINSSIVLEANEKKQAAYIAEHGPMSSGINAVTL 250


>gi|4760897|gb|AAD29130.1| cysteine proteinase 1 precursor [Clonorchis sinensis]
          Length = 328

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 115/298 (38%), Positives = 155/298 (52%), Gaps = 39/298 (13%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           RL +  +L+  + S LA    V  D                    NA   +  FK K+ K
Sbjct: 2   RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           TY+  ++ + RF +FK NL RAKR Q ++  TA +GVT+FSDLT  EF+ ++L    R+R
Sbjct: 42  TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96

Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                      P  D+  D   FDWR+HGAV  V DQG CGSCW+FS  G +EG  F  T
Sbjct: 97  FDGPIVSEDPSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+L++LSEQQLVDCDH          D GCNGG     +  I K GG+E   DYPYTG D
Sbjct: 157 GDLLALSEQQLVDCDH---------LDKGCNGGYPPKTYGEIEKMGGLELASDYPYTGVD 207

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTV 296
            G C  ++SK  A V+  +V+   E   A  L + GPL+  + ++ L       +F +
Sbjct: 208 -GICYMNQSKFVAYVNESTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 264


>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
          Length = 326

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 111/279 (39%), Positives = 148/279 (53%), Gaps = 39/279 (13%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           RL +  +L+  + S LA    V  D                    NA   +  FK K+ K
Sbjct: 2   RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           TY + ++ + RF +FK NL RAKR Q ++  TA +GVT+FSDLT  EF+ ++L    R+R
Sbjct: 42  TY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96

Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                    + P  D+  D   FDWR+HGAV  V DQG CGSCW+FS  G +EG  F  T
Sbjct: 97  FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+L++LSEQQLVDCD+          D GC+GG     +  I K GG+E   DYPYTG  
Sbjct: 157 GDLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG C  DKSK  A ++  +++   E   A  L   GPL+
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLS 245


>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
          Length = 322

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 105/249 (42%), Positives = 148/249 (59%), Gaps = 19/249 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           +A   +  FK  + K YA +++   RF +FK NL RA++ QL D  TA +GVT+FSDLTP
Sbjct: 22  SARELYEQFKRDYGKVYANEDDQK-RFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTP 80

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
            EF  ++L    R  +  D Q   + PT     P   DWR+ GAVT V++QG+CGSCW+F
Sbjct: 81  EEFAAKYL----RAAVNND-QVERVRPTGLKAAPERMDWREKGAVTAVENQGSCGSCWAF 135

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           SA G +EG  F+ TG+LVSLS+QQLVDCD   +         GCNGG   S++  I   G
Sbjct: 136 SAAGNVEGQWFIKTGQLVSLSKQQLVDCDRVAE---------GCNGGWPVSSYLEIKHMG 186

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
           G+E E DYPY G +  +C  +K K+ A + +  V+ + E++ AA L +HGPL+  + ++ 
Sbjct: 187 GLESESDYPYVGAE-QTCALNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSTLLNAVA 245

Query: 285 LPHISFSFL 293
           L H     L
Sbjct: 246 LQHYQSGVL 254


>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
          Length = 326

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 111/279 (39%), Positives = 148/279 (53%), Gaps = 39/279 (13%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           RL +  +L+  + S LA    V  D                    NA   +  FK K+ K
Sbjct: 2   RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           TY + ++ + RF +FK NL RAKR Q ++  TA +GVT+FSDLT  EF+ ++L    R+R
Sbjct: 42  TY-SNDDDELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96

Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                    + P  D+  D   FDWR+HGAV  V DQG CGSCW+FS  G +EG  F  T
Sbjct: 97  FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+L++LSEQQLVDCD+          D GC+GG     +  I K GG+E   DYPYTG  
Sbjct: 157 GDLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG C  DKSK  A ++  +++   E   A  L   GPL+
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLS 245


>gi|390994427|gb|AFM37363.1| cathepsin F1 [Dictyocaulus viviparus]
          Length = 459

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 101/238 (42%), Positives = 141/238 (59%), Gaps = 23/238 (9%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPS 107
           A + F  F  +  K Y ++ +   RFRVFK NL+  +  Q  +  TAV+G+T+FSDLTP 
Sbjct: 153 AWNQFVDFMGRHEKVYNSKHDTLKRFRVFKRNLKAIRSWQEKEEGTAVYGITQFSDLTPE 212

Query: 108 EFRRQFLGL--------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
           EF++ +L          NR + L A+     +     LP  FDWRDHGAVT VK+QG CG
Sbjct: 213 EFKKIYLPYIWDEPIVPNRMVDLTAEG----VHLNETLPESFDWRDHGAVTDVKNQGFCG 268

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FS TG +EG  FL+  +LVSLSEQ+LVDCD           D GC GGL + A++ 
Sbjct: 269 SCWAFSTTGNIEGQWFLAKKKLVSLSEQELVDCD---------KVDDGCEGGLPSQAYKE 319

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           I++ GG+E E  YPY G  G  C  ++++ A  +++   +  DE+ M A LVK GP++
Sbjct: 320 IMRMGGLETESAYPYDGR-GEECHINRTEFAVYINDSVELPHDEESMKAWLVKKGPIS 376


>gi|6649575|gb|AAF21461.1|U69120_1 cysteine proteinase PWCP1 [Paragonimus westermani]
          Length = 427

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 100/236 (42%), Positives = 139/236 (58%), Gaps = 15/236 (6%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           N    F  F+ KF K+Y++      R+ +FK NL + +  Q L+  TA +G+TKFSDL+ 
Sbjct: 122 NTSRLFEEFQRKFRKSYSSDTAK--RYALFKYNLLKMQLIQRLEKGTANYGITKFSDLSA 179

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
            EFR     + RR +      +  I PT    LP  FDWR +GAVT VKDQG CGSCW+F
Sbjct: 180 EEFRHSLANMKRR-KSKGSQMETAIFPTTIQSLPPSFDWRANGAVTEVKDQGMCGSCWAF 238

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           + TG +EG  F  T +L+SLSEQQL+DCD +         D  CNGGL   A++ I+K G
Sbjct: 239 ATTGNIEGQWFRKTNKLISLSEQQLLDCDTK---------DEACNGGLPEWAYDEIVKMG 289

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
           G+  EKDYPY      SC   +  I+A ++  + + SDE ++AA LV++GP++  V
Sbjct: 290 GLMSEKDYPYEAMKEQSCHLRRPNISAYINGSATLPSDEAKLAAWLVQNGPISVGV 345


>gi|85068712|gb|ABC69436.1| cysteine protease [Clonorchis sinensis]
          Length = 328

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 114/298 (38%), Positives = 156/298 (52%), Gaps = 39/298 (13%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           RL +  +L+  + S LA    V  D                    NA   +  FK K+ K
Sbjct: 2   RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           TY+  ++ + RF +FK NL RAKR Q ++  TA +GVT+FSDLT  EF+ ++L    R+R
Sbjct: 42  TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96

Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                      P  D+  D   FDWR+HGAV  V DQG CGSCW+FS  G +EG  F  T
Sbjct: 97  FDGPIVSEDPSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKT 156

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+L++LSEQQLVDCDH          + GCNGG     +  I K GG+E   DYPYTG D
Sbjct: 157 GDLLALSEQQLVDCDH---------LEKGCNGGYPPKTYGEIEKMGGLELASDYPYTGVD 207

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTV 296
            G C  ++SK  A V++ +V+   E   A  L + GPL+  + ++ L       +F +
Sbjct: 208 -GICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 264


>gi|324522685|gb|ADY48108.1| Cathepsin L, partial [Ascaris suum]
          Length = 308

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 100/228 (43%), Positives = 141/228 (61%), Gaps = 18/228 (7%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFL 114
           F  ++++TY+ ++E   RFR++K NLR AK  Q  +  TA++G T+FSDLT +EFR+  +
Sbjct: 10  FIGRYNRTYSNKKEMLKRFRIYKRNLRAAKIWQANEQGTAIYGETQFSDLTQAEFRK--I 67

Query: 115 GLNRRLRLPADAQKAPI-----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
            L  +   P    K        +  ND+P  FDWR+  AVT VK+QG+CGSCW+FS TG 
Sbjct: 68  MLPYKWETPKVPNKMANFKEFGIAQNDIPESFDWREKNAVTEVKNQGSCGSCWAFSVTGN 127

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EGA  + T +LVSLSEQ+LVDCD           D GCNGGL ++A+  I++ GG+E E
Sbjct: 128 IEGAWAIKTSKLVSLSEQELVDCD---------IIDQGCNGGLPSNAYREIIRMGGLEAE 178

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            DYPY G  G  C   K  IA  +++   +  DE++MAA LV  GP++
Sbjct: 179 SDYPYDGR-GEKCHLMKKDIAVYINDSLQLPHDEEKMAAWLVAKGPIS 225


>gi|196014793|ref|XP_002117255.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
 gi|190580220|gb|EDV20305.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
          Length = 353

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 105/265 (39%), Positives = 153/265 (57%), Gaps = 22/265 (8%)

Query: 31  IRQVVPSDGEQSEDHLLNAEHHFSLFKS------KFSKTYATQEEHDYRFRVFKANLRRA 84
           + Q+ P+    S+D    A HH  +FK+      +++K+Y   +E +YR++VF  N+ RA
Sbjct: 30  MMQLQPATRRFSQD---TATHHDPMFKNYLQFIKEYNKSYNNIQELNYRYQVFTKNMARA 86

Query: 85  KRRQLLD-PTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDW 143
              Q  D  T  +G TK SDLT  E +  F  + +  +     +KA I   N LP  FDW
Sbjct: 87  MLFQKHDNATGRYGFTKLSDLTDQEVK-SFYAMKKWPQQLYPTKKANIPQLNSLPQSFDW 145

Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
           R  GAVT VKDQ  CG+CW+F+ TG +EG  +L+ G+L SLSEQ+LVDCD          
Sbjct: 146 RSKGAVTAVKDQKRCGACWAFATTGNIEGQWYLNKGKLYSLSEQELVDCD---------K 196

Query: 204 CDSGCNGGLMNSAFEYIL-KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
            D GC GGL  +A+  I+ + GG+E EKDYPY   + G CK +KS+    +++   +S++
Sbjct: 197 IDEGCKGGLPLNAYHSIMNRLGGLETEKDYPYVAKN-GKCKLNKSEEVVYINSSVKVSTN 255

Query: 263 EDQMAANLVKHGPLAGNVASIELPH 287
           E  +AA LV HGP+A  + S+ + H
Sbjct: 256 ETDLAAWLVAHGPVAIGINSVNMLH 280


>gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
           str. Neff]
          Length = 330

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 101/238 (42%), Positives = 140/238 (58%), Gaps = 13/238 (5%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKF 101
           +E   + AE  F  F +++ K+YA+ EE   R R+F+ NL R       +  A +GV KF
Sbjct: 21  AEAGTMTAEQQFRQFAAQYGKSYAS-EEFGERLRIFRDNLDRIDALNSANTGARYGVNKF 79

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
           +DLTP EF+  +L   R       A  A +  T  LP+ FDWRD GAVT  KDQG CG  
Sbjct: 80  ADLTPKEFKATYLKGARSAGQKKAAATAKLDMTGPLPSQFDWRDKGAVTPTKDQGQCG-- 137

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS T A+E   FLS  +LVSL+ QQ+VDCD        G+ D GC+GG   +A+EY++
Sbjct: 138 WAFSVTEAIESQWFLSGRKLVSLAPQQIVDCDQ-------GNGDYGCDGGDPPTAYEYVI 190

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS--DEDQMAANLVKHGPLA 277
           KAGG++ E+ YPYT  D G C F  S + A +SN++ I++  +E +M   L   GPL+
Sbjct: 191 KAGGLDTEESYPYTAED-GQCAFKPSAVGAKISNWTYITTTKNETEMQYGLASRGPLS 247


>gi|313235882|emb|CBY11269.1| unnamed protein product [Oikopleura dioica]
          Length = 371

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 111/256 (43%), Positives = 152/256 (59%), Gaps = 27/256 (10%)

Query: 33  QVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK-RRQLLD 91
           +  P D + SED   +A   F  F  +  K Y+ QE H  RF+ F  NL+R K    +  
Sbjct: 33  KTTPEDFDVSED---DARKQFENFLLEHPKMYSEQESHS-RFQTFWENLKRIKFHNHIEQ 88

Query: 92  PTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA----------DAQKAPILPTNDLPTDF 141
            +A +GVT+F+DL+  EFRR +LGL   L++P            ++K     T D    F
Sbjct: 89  GSAKYGVTEFADLSDFEFRRHYLGLKPELKIPNRKKYERKSRNSSKKLKFAKTVD--ETF 146

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DW + GAVT VK+QG CGSCW+FS TG +EGA F +TG+LVSLSEQ+LVDCD +      
Sbjct: 147 DWVEKGAVTEVKNQGMCGSCWAFSTTGNIEGAWFKATGDLVSLSEQELVDCDQK------ 200

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
              DSGCNGGLM+ AFE +++ GG+E E+ YPY G    +C F+KS     + +F  I  
Sbjct: 201 ---DSGCNGGLMDQAFEEVIRIGGLETEQQYPYDGVQ-ETCNFEKSLSKVQIDDFMDIGE 256

Query: 262 DEDQMAANLVKHGPLA 277
           DE+++A  L +HGPL+
Sbjct: 257 DEEEIAEALEEHGPLS 272


>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
          Length = 887

 Score =  184 bits (466), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 104/265 (39%), Positives = 151/265 (56%), Gaps = 27/265 (10%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL------RRAKRRQLLDPTAVHGVTK 100
           + +E  F+ F   +++TY+T EE + R R+F+ NL      R+ +R      TA + V  
Sbjct: 576 VRSEQLFNNFVVTYNRTYSTPEERNLRLRIFRENLGIIQLLRKTERG-----TAHYDVNM 630

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQ-KAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
           F+D++P EFR ++LGL   LR   D   +   +P  +LP  FDWR+   VT VKDQG CG
Sbjct: 631 FADMSPEEFRSRYLGLRPDLRSENDIPLREAEIPDVELPPKFDWREKSVVTPVKDQGMCG 690

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FS TG +EG + +  G L+SLSEQ+LVDCD           D GCNGGL ++A+  
Sbjct: 691 SCWAFSVTGNIEGQYAIKHGRLLSLSEQELVDCD---------DLDEGCNGGLPDNAYRA 741

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA-- 277
           I K GG+E E DYPY   +   C F K+     +++   I+S+E QMA  LV++GP++  
Sbjct: 742 IEKLGGLELESDYPYEA-ENEKCHFKKNLAKVQLASAVNITSNETQMAQWLVQNGPISIG 800

Query: 278 --GNVASIELPHISFSFLFTVSSPK 300
              N     +  +S  F F + +PK
Sbjct: 801 INANAMQFYVGGVSHPFKF-LCNPK 824


>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
          Length = 327

 Score =  184 bits (466), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 103/249 (41%), Positives = 143/249 (57%), Gaps = 19/249 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           +A   +  FK  + K YA +++   RF +FK NL RA++ QL D  TA +GVT+FSDLTP
Sbjct: 27  SARELYEQFKRGYGKVYANEDDQK-RFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTP 85

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
            EF  ++L          D Q   + PT     P   DWR  GAVT V++QG+CGSCW+F
Sbjct: 86  EEFAAKYLSAPVN-----DDQVKRMRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAF 140

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           S  G +EG  F+ TG+LVSLS+QQLVDCD             GCNGG   S++  I+  G
Sbjct: 141 STAGNVEGQWFIKTGQLVSLSKQQLVDCDRAA---------QGCNGGWPASSYLEIMYMG 191

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
           G+E E DYPY G +  +C  +K K+ A + +  V+  +E+  AA L +HGPL+  + ++ 
Sbjct: 192 GLESESDYPYVGVE-QTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVA 250

Query: 285 LPHISFSFL 293
           L H     L
Sbjct: 251 LQHYQSGVL 259


>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
          Length = 537

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 102/237 (43%), Positives = 139/237 (58%), Gaps = 14/237 (5%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQE-EHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
           H + AE  F  F + +   Y     E   RF +FK N+++       +  T V+ VT+F+
Sbjct: 223 HHVQAEQLFFNFITTYKPEYINDHVEMTKRFEIFKENVKKIHELNTHERGTGVYAVTRFT 282

Query: 103 DLTPSEFRRQFLGLNRRLRLPAD--AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
           DLT  EF+ ++LGLN  L+ P     ++A I   + LP  FDWR  GAVT VKDQGACGS
Sbjct: 283 DLTYEEFKSKYLGLNPNLKKPNQIPMRQAEIPKVHQLPASFDWRPLGAVTEVKDQGACGS 342

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS TG +EG   L TG+L+SLSEQ+LVDCD           D GC+GG M++A+  I
Sbjct: 343 CWAFSVTGNIEGQWKLKTGKLLSLSEQELVDCD---------KMDDGCDGGYMDNAYRAI 393

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            + GG+E E++YPY   D   C F+KS     +S    ISS+E  MA  LV +GP++
Sbjct: 394 EQLGGLETEEEYPYEAED-DKCSFNKSLSKVQISGAVNISSNETNMAKWLVHNGPIS 449


>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
          Length = 774

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 97/241 (40%), Positives = 149/241 (61%), Gaps = 21/241 (8%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTK 100
           SED  + AE  F+ F + +++TY++ E  + RF++F+ NL   +  R+    T ++GV  
Sbjct: 461 SED--MKAERLFNNFMTTYNRTYSSLE-RNLRFKIFRENLNFIEELRETEQGTGIYGVNM 517

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQG 156
           F+D++  EFR ++LGL   L+      + P+    +P  DLP+ FDWR  G VT VK+QG
Sbjct: 518 FADMSQKEFRTRYLGLRPDLQ---SENEIPLPKAEIPDIDLPSSFDWRQKGVVTPVKNQG 574

Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
            CGSCW+FS TG +EG + +  G+L+SLSEQ+LVDCDH          D GCNGGL ++A
Sbjct: 575 QCGSCWAFSVTGNVEGQYAIKHGQLLSLSEQELVDCDH---------LDEGCNGGLPDNA 625

Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
           +  I + GG+E E DYPY   +   C F ++ +   +++   I+S+E Q+A  LV++GP+
Sbjct: 626 YRAIEQLGGLELESDYPYEA-ENEKCHFKQNLVKVELASAVNITSNETQIAQWLVQNGPI 684

Query: 277 A 277
           A
Sbjct: 685 A 685


>gi|328866896|gb|EGG15279.1| cysteine protease [Dictyostelium fasciculatum]
          Length = 347

 Score =  183 bits (465), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 103/241 (42%), Positives = 139/241 (57%), Gaps = 20/241 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV-------HGVTKFS 102
           E  F  F+ K++K Y + E    +F  FK NL R      L+  A         GV +F+
Sbjct: 24  EIQFRDFQVKYNKVYGSHE-FSQKFVTFKDNLNRIDT---LNANAAASGSDTKFGVNEFA 79

Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACG 159
           DL+  EFR+ ++       +P+DAQ A       L   P+ FDWR  GAVT VK+QG CG
Sbjct: 80  DLSVQEFRKFYMNA-VPASVPSDAQVAGDYSDETLASIPSSFDWRTKGAVTPVKNQGQCG 138

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC---DPEESGSCDSGCNGGLMNSA 216
           SCWSFS TG +EG  FL+   L  LSEQ LVDCDH C   D ++  SCD GCNGGL  +A
Sbjct: 139 SCWSFSTTGNVEGQWFLAGNTLTGLSEQNLVDCDHHCMTYDGQQ--SCDDGCNGGLQPNA 196

Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
           F+YI+  GG++ E  YPY       C+F  S I A +SN+ ++S++E Q+AA L  +GP+
Sbjct: 197 FQYIIGNGGIDTETSYPYLAVAQDKCQFKASNIGAKISNWQMLSTNETQIAAYLALNGPV 256

Query: 277 A 277
           +
Sbjct: 257 S 257


>gi|55979119|gb|AAV69023.1| cysteine protease [Opisthorchis viverrini]
 gi|224923980|gb|ACN68966.1| cathepsin F-like cysteine protease [Opisthorchis viverrini]
          Length = 326

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 104/242 (42%), Positives = 140/242 (57%), Gaps = 19/242 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           +A   +  FK K+ KTY+  ++ + RFR+FK NL RAKR Q ++  TA +GVT+FSDLT 
Sbjct: 27  DARALYEEFKLKYKKTYSNDDD-ELRFRIFKDNLERAKRLQAMEQGTAEYGVTQFSDLTS 85

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWS 163
            EF+ ++L    R+R           P  D+  D   FDWRDHGAV  V DQG CGSCW+
Sbjct: 86  EEFKTRYL----RMRFDEPIVNEDPTPQEDVTMDNSNFDWRDHGAVGPVLDQGDCGSCWA 141

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS  G +EG  F  TG+L+ LSEQQL+DCDH          D GC+GG     +  I + 
Sbjct: 142 FSVIGNVEGQWFRKTGDLLGLSEQQLIDCDHS---------DQGCDGGYPPQTYSAIEEM 192

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG+E   DYPYTG D G C  D+SK  A V+  + +   E   A +L + GPL+  + ++
Sbjct: 193 GGLELRSDYPYTGKD-GICYMDQSKFVAYVNGSTRLPWCEKTQAKSLKEIGPLSSGLNAV 251

Query: 284 EL 285
            L
Sbjct: 252 LL 253


>gi|209978824|ref|YP_002300567.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
 gi|192758806|gb|ACF05341.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
          Length = 337

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 99/265 (37%), Positives = 148/265 (55%), Gaps = 29/265 (10%)

Query: 30  MIRQVVPSDGEQSEDHLL----NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
           MI  ++     Q E HL     +A+H+F  F   ++K YA  +  +YRF++F  NL    
Sbjct: 5   MIFTILLVASSQIEGHLKFDIHDAQHYFETFIVNYNKQYADTKTKNYRFKIFVQNLEYIN 64

Query: 86  RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK------------APILP 133
            +  L+ +A++ + KFSDL+ +E   ++ GL  R   P++  K            AP   
Sbjct: 65  EKNKLNDSAIYNINKFSDLSKNELLTKYTGLTSRK--PSNMVKSTSNFCNVIHLDAPPDA 122

Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
            ++LP +FDWR +  +T VKDQGACGSCW+ +A G LE  + +    L++LSEQQL+DCD
Sbjct: 123 RDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLIDCD 182

Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
                    S +  C+GGLM++AFE ++ AGG+  E DYPY GT  G CK D  K A +V
Sbjct: 183 ---------SANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTK-GICKIDNKKFALSV 232

Query: 254 SNFS-VISSDEDQMAANLVKHGPLA 277
           S+    I  +E+ +   L+  GP+A
Sbjct: 233 SSCKRYIFQNEENLKKELITTGPIA 257


>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
          Length = 326

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 111/279 (39%), Positives = 146/279 (52%), Gaps = 39/279 (13%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           RL +  +L+  + S LA    V  D                    NA   +  FK K+ K
Sbjct: 2   RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           TY+  ++ + RF +FK NL RAKR Q ++  TA +GVT+FSDLT  EF+ ++L    R+R
Sbjct: 42  TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96

Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                    + P  D+  D   FDWR+HGAV  V DQG CGSCW+FS  G + G  F  T
Sbjct: 97  FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L++LSEQQLVDCD+          D GC+GG     +  I K GG+E   DYPYTG  
Sbjct: 157 GHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG C  DKSK  A V+  +++   E   A  L   GPL+
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLS 245


>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
          Length = 325

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 100/233 (42%), Positives = 137/233 (58%), Gaps = 20/233 (8%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
           NA   +  FK  + K YA  ++   RF +FK NL RA++ QL D  TA +GVT+FSDLTP
Sbjct: 27  NARELYEQFKRDYGKVYANDDDQK-RFAIFKDNLVRAQKLQLKDRGTARYGVTQFSDLTP 85

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
            EF  ++L        P + Q   + PT     P   DWR+ GAV  V++QG+CGSCW+F
Sbjct: 86  EEFAAKYL------SRPMNDQVERVRPTGLKAAPERMDWREWGAVGPVENQGSCGSCWAF 139

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           S  G +EG  FL TG+LVSLS+QQLVDCD           D GC GG   +A+  I++ G
Sbjct: 140 SVAGNVEGQWFLKTGQLVSLSKQQLVDCD---------VMDYGCGGGWPTNAYMEIMRMG 190

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G+E + DYPY G     C  +K K+ A + +  V+ + E++ AA L +HGPL+
Sbjct: 191 GLELQSDYPYVGVQ-QQCYLNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLS 242


>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
          Length = 326

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 111/279 (39%), Positives = 146/279 (52%), Gaps = 39/279 (13%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           RL +  +L+  + S LA    V  D                    NA   +  FK K+ K
Sbjct: 2   RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           TY+  ++ + RF +FK NL RAKR Q ++  TA +GVT+FSDLT  EF+ ++L    R+R
Sbjct: 42  TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96

Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                    + P  D+  D   FDWR+HGAV  V DQG CGSCW+FS  G + G  F  T
Sbjct: 97  FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L++LSEQQLVDCD+          D GC+GG     +  I K GG+E   DYPYTG  
Sbjct: 157 GHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG C  DKSK  A V+  +++   E   A  L   GPL+
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLS 245


>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 110/279 (39%), Positives = 146/279 (52%), Gaps = 39/279 (13%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           RL +  +L+  + S LA    V  D                    NA   +  FK K+ K
Sbjct: 2   RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           TY+  ++ + RF +FK NL RAKR Q ++  TA +GVT+FSDLT  EF+ ++L    R+R
Sbjct: 42  TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96

Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                    + P  D+  D   FDWR+HGAV  V DQG CGSCW+FS  G + G  F  T
Sbjct: 97  FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L++LSEQQLVDCD+          D GC+GG     +  I K GG+E   DYPYTG  
Sbjct: 157 GHLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG C  DKSK  A ++  +++   E   A  L   GPL+
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLS 245


>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 596

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 94/227 (41%), Positives = 139/227 (61%), Gaps = 12/227 (5%)

Query: 53  FSLFKSKFSKTYATQ-EEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFR 110
           F +F  K+ +TY++  +E++ RF +FK N +  +   ++   TAV+G+TKF D++  E+ 
Sbjct: 169 FDMFLEKYPRTYSSSSDEYNERFEIFKTNYQVVQHLNEIERGTAVYGITKFMDMSEEEYH 228

Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
           R       R  +P     +  L T ++P   DWR HGAVT VK+QG+CGSCW+FS TG +
Sbjct: 229 RTLAPGFTRPLVPIQTLNSAELDTTNIPDSMDWRKHGAVTEVKNQGSCGSCWAFSTTGNV 288

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL   +L+SLSEQ+LVDCD         + DSGC GGL ++A++ I K GG+E EK
Sbjct: 289 EGQWFLKHKKLISLSEQELVDCD---------TLDSGCGGGLPSNAYKSIEKLGGLEPEK 339

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DYPY G +G  C   +S     V+N   +  DE ++AA L ++GP++
Sbjct: 340 DYPYVG-EGEKCAIKQSDFKVFVNNSVALPKDEVKLAAWLAQNGPIS 385



 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 53/115 (46%), Positives = 70/115 (60%), Gaps = 9/115 (7%)

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           E+ R       R  +P     +  L T ++P   DWR HGAVT VK+QG+CGSCW+FS T
Sbjct: 446 EYHRTLAPGFTRPLVPIQTLNSAELDTTNIPDSMDWRKHGAVTEVKNQGSCGSCWAFSTT 505

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           G +EG  FL   +L+SLSEQ+LVDCD         + DSGC GGL ++A++ I K
Sbjct: 506 GNVEGQWFLKHKKLISLSEQELVDCD---------TLDSGCGGGLPSNAYKSIEK 551


>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
          Length = 322

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 102/250 (40%), Positives = 144/250 (57%), Gaps = 21/250 (8%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           +A   +  FK  + K YA +++   RF +FK NL RA++ QL D  TA +GVT+FSDLTP
Sbjct: 22  SARELYEQFKRDYGKVYANEDDQK-RFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTP 80

Query: 107 SEFRRQFLGLNRRLRLPADA-QKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            EF  ++L        P +  Q   + PT     P   DWR  GAVT V++QG+CGSCW+
Sbjct: 81  EEFAAKYLSA------PVNNDQVKRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWA 134

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS  G +EG  F+ TG+LVSLS+QQLVDCD             GCNGG   S++  I+  
Sbjct: 135 FSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAA---------QGCNGGWPASSYLEIMYM 185

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG+E E DYPY G +  +C  +K K+ A + +  V+  +E+  AA L +HGPL+  + ++
Sbjct: 186 GGLESESDYPYVGVE-QTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAV 244

Query: 284 ELPHISFSFL 293
            L +     L
Sbjct: 245 ALQYYQSGVL 254


>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 111/279 (39%), Positives = 145/279 (51%), Gaps = 39/279 (13%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           RL +  +L+  + S LA    V  D                    NA   +  FK K+ K
Sbjct: 2   RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           TY+  ++ + RF +FK NL RAKR Q ++  TA +GVT+FSDLT  EF  ++L    R+R
Sbjct: 42  TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFETRYL----RMR 96

Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                    + P  D+  D   FDWR+HGAV  V DQG CGSCW+FS  G + G  F  T
Sbjct: 97  FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L++LSEQQLVDCD+          D GC+GG     +  I K GG+E   DYPYTG  
Sbjct: 157 GHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG C  DKSK  A V+  +++   E   A  L   GPL+
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLS 245


>gi|340053965|emb|CCC48258.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 441

 Score =  181 bits (458), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 99/233 (42%), Positives = 131/233 (56%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           E  F+ FK K+ ++Y T  E  +R RVF+ N+RR++     +P A  GVT FSDLTP EF
Sbjct: 31  EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90

Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R ++    R         +  + +P    P   DWR  GAVT VKDQG+CGSCWSFSA G
Sbjct: 91  RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIG 150

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
            +EG    +   L SLSEQ LV CD +         D+GC GGLM++AFE+I+K  +G V
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDTK---------DNGCGGGLMDNAFEWIVKENSGKV 201

Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             EK YPY   G +   CK    K+ A ++    I  DED +A  L  +GP+A
Sbjct: 202 YTEKSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVA 254


>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
          Length = 478

 Score =  181 bits (458), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 102/258 (39%), Positives = 148/258 (57%), Gaps = 17/258 (6%)

Query: 25  NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
           +DD   ++++  +   +  D+++   + F  F  +  K Y  + E   RFRVFK N +  
Sbjct: 150 HDDSVTVQELRKAKIIKPRDYVI--WNSFLDFIDRHEKRYENKREVLKRFRVFKRNAKVI 207

Query: 85  KRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA----QKAPILPTNDLPT 139
           +  Q  +  TAV+G TKFSD+T  EF+   L       +P D     ++   +   DLP 
Sbjct: 208 RELQKNEQGTAVYGFTKFSDMTTMEFKETMLPYQWEQPVPMDQANFEKEGVTISEEDLPD 267

Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
            FDWR+HGAVT VK+QG+CGSCW+FS TG +EGA FL+  +LVSLSEQ+LVDCD      
Sbjct: 268 SFDWREHGAVTQVKNQGSCGSCWAFSTTGNIEGAWFLAKKKLVSLSEQELVDCD------ 321

Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
              S D GCNGGL ++A++ I++ GG+E E  YPY G  G +C   +  IA  ++    +
Sbjct: 322 ---SVDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGR-GETCHLVRKDIAVYINGSVEL 377

Query: 260 SSDEDQMAANLVKHGPLA 277
             DE +M   LV  GP++
Sbjct: 378 PHDEVEMQKWLVTKGPIS 395


>gi|68304200|ref|YP_249668.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
 gi|67973029|gb|AAY83995.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
          Length = 344

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 102/281 (36%), Positives = 151/281 (53%), Gaps = 17/281 (6%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           ++L    V AS    N  DA+I  V  +   + + +L  A  +F  F++K+ K YA   E
Sbjct: 4   IILFFVFVFASGGFDNGVDAIIDYVTAAPQFKLQYNLERAPQYFETFQTKYKKVYADDNE 63

Query: 70  HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA 129
            DYR+++FK NL     +   + +AV+ + KF+DLT +E   +F GL   +R PA     
Sbjct: 64  RDYRYKIFKTNLEIINLKNQQNDSAVYNINKFADLTKNEVIAKFTGLG--IRSPALKNSC 121

Query: 130 -PIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
            P++   P+      FDWR    +T VKDQG CGSCW+FS    LE  + +   E V LS
Sbjct: 122 EPVIVDGPSKYTQETFDWRQFNKITSVKDQGFCGSCWAFSTIAGLESQYAIKYNEHVDLS 181

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQQLVDCD         + D GC GGL+++A+E I+  GG+E E+DYPY     G C+  
Sbjct: 182 EQQLVDCD---------TIDMGCAGGLLHTAYEEIMAMGGLEYEEDYPYRSVQ-GPCRLQ 231

Query: 246 KSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIEL 285
             K   +V N +  +   ED++   L + GP+A  V +++L
Sbjct: 232 SDKFEVSVDNCYRYVLYSEDKLKDVLHEMGPIAVAVDAVDL 272


>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
          Length = 478

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 102/258 (39%), Positives = 148/258 (57%), Gaps = 17/258 (6%)

Query: 25  NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
           +DD   ++++  +   +  D+++   + F  F  +  K Y  + E   RFRVFK N +  
Sbjct: 150 HDDSVTVQELRKAKIIKPRDYVV--WNSFLDFIDRHEKRYENKREVLKRFRVFKRNAKVI 207

Query: 85  KRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA----QKAPILPTNDLPT 139
           +  Q  +  TAV+G TKFSD+T  EF+   L       +P D     ++   +   DLP 
Sbjct: 208 RELQKNEQGTAVYGFTKFSDMTTMEFKETMLPYQWEQPVPMDQANFEKEGVTISEEDLPD 267

Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
            FDWR+HGAVT VK+QG+CGSCW+FS TG +EGA FL+  +LVSLSEQ+LVDCD      
Sbjct: 268 SFDWREHGAVTQVKNQGSCGSCWAFSTTGNIEGAWFLAKKKLVSLSEQELVDCD------ 321

Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
              S D GCNGGL ++A++ I++ GG+E E  YPY G  G +C   +  IA  ++    +
Sbjct: 322 ---SVDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGR-GETCHLVRKDIAVYINGSVEL 377

Query: 260 SSDEDQMAANLVKHGPLA 277
             DE +M   LV  GP++
Sbjct: 378 PHDEVEMQKWLVTKGPIS 395


>gi|343412462|emb|CCD21670.1| cysteine peptidase (CP), putative [Trypanosoma vivax Y486]
          Length = 367

 Score =  180 bits (457), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 98/230 (42%), Positives = 129/230 (56%), Gaps = 14/230 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F+ FK K+ ++Y T  E  +R RVF+ N+RR++     +P A  GVT FSDLTP EFR +
Sbjct: 34  FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93

Query: 113 FLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
           +    R         +  + +P    P   DWR  GAVT VKDQG CGSCWSFSA G +E
Sbjct: 94  YHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIGNIE 153

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGVERE 229
           G    +   L SLSEQ LV CD +         D+GC GGLM++AFE+I+K  +G V  E
Sbjct: 154 GQWAAAGNPLTSLSEQMLVSCDTK---------DNGCGGGLMDNAFEWIVKENSGKVYTE 204

Query: 230 KDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           K YPY   G +   CK    K+ A ++    I  DED +A  L  +GP+A
Sbjct: 205 KSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVA 254


>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
          Length = 326

 Score =  180 bits (457), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 110/279 (39%), Positives = 145/279 (51%), Gaps = 39/279 (13%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           RL +  +L+  + S LA    V  D                    NA   +  F  K+ K
Sbjct: 2   RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFTLKYKK 41

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           TY+  ++ + RF +FK NL RAKR Q ++  TA +GVT+FSDLT  EF+ ++L    R+R
Sbjct: 42  TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96

Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                    + P  D+  D   FDWR+HGAV  V DQG CGSCW+FS  G + G  F  T
Sbjct: 97  FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L++LSEQQLVDCD+          D GC+GG     +  I K GG+E   DYPYTG  
Sbjct: 157 GHLLALSEQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG C  DKSK  A V+  +++   E   A  L   GPL+
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLS 245


>gi|72389861|ref|XP_845225.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389863|ref|XP_845226.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359933|gb|AAX80358.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359934|gb|AAX80359.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801760|gb|AAZ11666.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801761|gb|AAZ11667.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score =  180 bits (456), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 110/288 (38%), Positives = 152/288 (52%), Gaps = 43/288 (14%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M R +   ++LL +++ LAS              V       E+ L   E  F+ FK K+
Sbjct: 6   MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
            K Y   +E  +RFR F+ N+ +AK +   +P A  GVT FSD+T  EFR +       F
Sbjct: 49  GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
               +RLR      K   + T   P   DWR+ GAVT VKDQG CGSCW+FS  G +EG 
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
             ++   LVSLSEQ LV CD         + DSGCNGGLM++AF +I+ +  G V  E  
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213

Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   +G    C+ +  +I AA+++   +  DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261


>gi|72389847|ref|XP_845218.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389849|ref|XP_845219.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389851|ref|XP_845220.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389857|ref|XP_845223.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359926|gb|AAX80351.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359927|gb|AAX80352.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359928|gb|AAX80353.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359931|gb|AAX80356.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801753|gb|AAZ11659.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801754|gb|AAZ11660.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801755|gb|AAZ11661.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801758|gb|AAZ11664.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 450

 Score =  180 bits (456), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 110/288 (38%), Positives = 152/288 (52%), Gaps = 43/288 (14%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M R +   ++LL +++ LAS              V       E+ L   E  F+ FK K+
Sbjct: 6   MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
            K Y   +E  +RFR F+ N+ +AK +   +P A  GVT FSD+T  EFR +       F
Sbjct: 49  GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
               +RLR      K   + T   P   DWR+ GAVT VKDQG CGSCW+FS  G +EG 
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
             ++   LVSLSEQ LV CD         + DSGCNGGLM++AF +I+ +  G V  E  
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213

Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   +G    C+ +  +I AA+++   +  DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261


>gi|72389855|ref|XP_845222.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389865|ref|XP_845227.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389867|ref|XP_845228.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359930|gb|AAX80355.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359935|gb|AAX80360.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359936|gb|AAX80361.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801757|gb|AAZ11663.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801762|gb|AAZ11668.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801763|gb|AAZ11669.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score =  180 bits (456), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 110/288 (38%), Positives = 152/288 (52%), Gaps = 43/288 (14%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M R +   ++LL +++ LAS              V       E+ L   E  F+ FK K+
Sbjct: 6   MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
            K Y   +E  +RFR F+ N+ +AK +   +P A  GVT FSD+T  EFR +       F
Sbjct: 49  GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
               +RLR      K   + T   P   DWR+ GAVT VKDQG CGSCW+FS  G +EG 
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
             ++   LVSLSEQ LV CD         + DSGCNGGLM++AF +I+ +  G V  E  
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213

Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   +G    C+ +  +I AA+++   +  DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261


>gi|72389853|ref|XP_845221.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359929|gb|AAX80354.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801756|gb|AAZ11662.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score =  180 bits (456), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 110/288 (38%), Positives = 152/288 (52%), Gaps = 43/288 (14%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M R +   ++LL +++ LAS              V       E+ L   E  F+ FK K+
Sbjct: 6   MVRFVRLPVVLLAIAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
            K Y   +E  +RFR F+ N+ +AK +   +P A  GVT FSD+T  EFR +       F
Sbjct: 49  GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
               +RLR      K   + T   P   DWR+ GAVT VKDQG CGSCW+FS  G +EG 
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
             ++   LVSLSEQ LV CD         + DSGCNGGLM++AF +I+ +  G V  E  
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213

Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   +G    C+ +  +I AA+++   +  DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261


>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
           heavy chain; Contains: RecName: Full=Cathepsin L light
           chain; Flags: Precursor
 gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
          Length = 339

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 105/258 (40%), Positives = 141/258 (54%), Gaps = 24/258 (9%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKF 101
           L+  E H   +K +  K YA + E  +R ++F  N  + AK  QL     V    G+ K+
Sbjct: 23  LIKEEWH--TYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKY 80

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN------DLPTDFDWRDHGAVTGVKDQ 155
           +D+   EF+    G N  LR     +   +  T        +P   DWR+HGAVTGVKDQ
Sbjct: 81  ADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQ 140

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
           G CGSCW+FS+TGALEG HF   G LVSLSEQ LVDC        +   ++GCNGGLM++
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCS-------TKYGNNGCNGGLMDN 193

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHG 274
           AF YI   GG++ EK YPY G D  SC F+K+ I A  + F  I   DE++M   +   G
Sbjct: 194 AFRYIKDNGGIDTEKSYPYEGID-DSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMG 252

Query: 275 PLAGNVASIELPHISFSF 292
           P++    +I+  H SF  
Sbjct: 253 PVS---VAIDASHESFQL 267


>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
          Length = 322

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 102/249 (40%), Positives = 145/249 (58%), Gaps = 19/249 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           +A   +  FK  + K YA +++   RF +FK NL RA++ QL D  TA +GVT+FSDLTP
Sbjct: 22  SARELYEQFKRDYGKVYANEDDQK-RFAIFKDNLVRAQKLQLRDQGTARYGVTQFSDLTP 80

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
            EF  ++L       L +D Q   + PT     P   DWR  GAVT V++QG CGSCW+F
Sbjct: 81  EEFAAKYLSP----PLNSD-QVERVQPTGLKAAPERMDWRAKGAVTPVENQGECGSCWAF 135

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           S  G +EG  F+ TG+LVSLS+QQLVDCD   +         GCNGG  +S++  I+  G
Sbjct: 136 STAGNVEGQWFIKTGQLVSLSKQQLVDCDMAAE---------GCNGGWPSSSYLEIMDMG 186

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
           G+E E DYPY G +  +C  +K K+ A + +  V+ + E++    L +HGPL+  + ++ 
Sbjct: 187 GLESENDYPYVGVE-QTCALNKEKLVAKIDDAVVLGASENEHVDYLAEHGPLSTLLNAVA 245

Query: 285 LPHISFSFL 293
           L H     L
Sbjct: 246 LQHYQSGIL 254


>gi|29567137|ref|NP_818699.1| cathepsin [Adoxophyes honmai NPV]
 gi|37076951|sp|Q80LP4.1|CATV_NPVAH RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|29467913|dbj|BAC67303.1| cathepsin [Adoxophyes honmai NPV]
          Length = 337

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 97/265 (36%), Positives = 148/265 (55%), Gaps = 29/265 (10%)

Query: 30  MIRQVVPSDGEQSEDHLL----NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
           MI  ++     Q E HL     +A+H+F  F   ++K Y   +  +YRF++FK NL    
Sbjct: 5   MIFTILLVASSQIEGHLKFDIHDAQHYFETFIINYNKQYPDTKTKNYRFKIFKQNLEDIN 64

Query: 86  RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK------------APILP 133
            +  L+ +A++ + KFSDL+ +E   ++ GL  +   P++  +            AP   
Sbjct: 65  EKNKLNDSAIYNINKFSDLSKNELLTKYTGLTSKK--PSNMVRSTSNFCNVIHLDAPPDV 122

Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
            ++LP +FDWR +  +T VKDQGACGSCW+ +A G LE  + +    L++LSEQQL+DCD
Sbjct: 123 HDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLIDCD 182

Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
                    S +  C+GGLM++AFE ++ AGG+  E DYPY GT  G CK D  K A +V
Sbjct: 183 ---------SANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTK-GVCKIDNKKFALSV 232

Query: 254 SNFS-VISSDEDQMAANLVKHGPLA 277
           S+    I  +E+ +   L+  GP+A
Sbjct: 233 SSCKRYIFQNEENLKKELITMGPIA 257


>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 110/279 (39%), Positives = 145/279 (51%), Gaps = 39/279 (13%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           RL +  +L+  + S LA    V  D                    NA   +  FK K+ K
Sbjct: 2   RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           TY+  ++ + RF +FK NL RAKR Q ++  TA +GVT+FSDLT  EF+ ++L    R+R
Sbjct: 42  TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96

Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                      P  D+  D   FDWR+HGAV  V DQG CGSCW+FS  G + G  F  T
Sbjct: 97  FDGPIVSEDPSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L++LSEQQLVDCD+          D GC+GG     +  I K GG+E   DYPYTG  
Sbjct: 157 GHLLALSEQQLVDCDY---------LDGGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG C  DKSK  A ++  +++   E   A  L   GPL+
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLS 245


>gi|85068706|gb|ABC69433.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 110/279 (39%), Positives = 145/279 (51%), Gaps = 39/279 (13%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           RL +  +L+  + S LA    V  D                    NA   +  FK K+ K
Sbjct: 2   RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           TY+  ++ + RF +FK NL RAKR Q ++  TA +GVT+FSDLT  EF+ ++L    R+R
Sbjct: 42  TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96

Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                    + P  D+  D   FDWR+HGAV  V DQG CGSCW+FS  G + G  F  T
Sbjct: 97  FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRET 156

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L++LS QQLVDCD+          D GC+GG     +  I K GG+E   DYPYTG  
Sbjct: 157 GHLLALSGQQLVDCDY---------LDDGCDGGYPPQTYTAIQKMGGLELASDYPYTGV- 206

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG C  DKSK  A V+  +++   E   A  L   GPL+
Sbjct: 207 GGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLS 245


>gi|417401303|gb|JAA47542.1| Putative cathepsin f [Desmodus rotundus]
          Length = 459

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 100/233 (42%), Positives = 139/233 (59%), Gaps = 26/233 (11%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F + +++TY T+EE  +R  +F  N+ RA+  Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 162 FKHFIATYNRTYETEEEAQWRMSIFINNMVRAQEIQALDRGTAQYGVTKFSDLTEEEFRT 221

Query: 112 QFL------GLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSF 164
            +L      GL +++RL          P +D  P ++DWR+ GAVT VK+QG CGSCW+F
Sbjct: 222 FYLNPLLKEGLGKKMRLAK--------PVDDPAPPEWDWRNKGAVTKVKNQGMCGSCWAF 273

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           S TG +EG  FL  G+L+SLSEQ+LVDCD         + D  C GGL ++A+  I   G
Sbjct: 274 SVTGNVEGQWFLKQGDLLSLSEQELVDCD---------TLDKACMGGLPSNAYSAIKTLG 324

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G+E E DY Y G    +C F   K+   +++   +S DE ++AA L K GP++
Sbjct: 325 GLETEDDYSYHG-HLQTCSFTAEKVKVYINDSVELSKDEQKLAAWLAKKGPIS 376


>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
           kowalevskii]
          Length = 352

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 115/304 (37%), Positives = 160/304 (52%), Gaps = 25/304 (8%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD----GEQSEDHLLNAEHHFSLFKSK 59
           + + +L+ + LS+V   + A+      I  V   D           +   +  F  F   
Sbjct: 1   MAILTLIAVFLSTVALGSQAIGPRTITINNVPMIDEIERNTNESGSVDKTQDLFQDFMKT 60

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
           + K Y T+EEH  R+++F+ NL +A+R +Q    T  +GVTKF DL+  EFR+ +L    
Sbjct: 61  YDKKYDTEEEHQLRYQIFQDNLLKAERLQQTEQATGQYGVTKFMDLSEEEFRKYYLTPVW 120

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRD--HGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
           R   P   +KA I P    P  FDWRD    AVT VK+QG CGSCW+FS TG +EG   +
Sbjct: 121 RGSDP-HMKKAEI-PKGTPPAAFDWRDADKNAVTKVKNQGTCGSCWAFSTTGNIEGQWKI 178

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
             G LVSLSEQ+LVDCD           D GCNGGL ++A++ I++ GG+  E DYPYTG
Sbjct: 179 KKGTLVSLSEQELVDCD---------KLDQGCNGGLPSNAYQEIMRFGGIMSEDDYPYTG 229

Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLF-T 295
            D   CK + +     ++    IS DE  MA+ L  +GP+     SI +   +  F F  
Sbjct: 230 RD-QDCKLNATLNKVYINGSMNISKDEGDMASWLAANGPI-----SIGINANAMQFYFGG 283

Query: 296 VSSP 299
           VS P
Sbjct: 284 VSHP 287


>gi|241602000|ref|XP_002405373.1| cathepsin-like protease, putative [Ixodes scapularis]
 gi|215502535|gb|EEC12029.1| cathepsin-like protease, putative [Ixodes scapularis]
          Length = 273

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 106/250 (42%), Positives = 140/250 (56%), Gaps = 22/250 (8%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
            +  FK+ + K+Y++Q E  +R  V+  N L+ AK  +      V     + KFSDL   
Sbjct: 27  EWETFKANYGKSYSSQAEEQFRMTVYMNNKLKVAKHNEQYAEGKVSYQLAMNKFSDLLHE 86

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND----LPTDFDWRDHGAVTGVKDQGACGSCWS 163
           EF R   G  RR+R P       + P N      P   DWR  GAVT VK+Q  CGSCW+
Sbjct: 87  EFVRSRNGF-RRIR-PVKQASTYMEPANIEDVCFPQTVDWRKKGAVTPVKNQEQCGSCWA 144

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSATG+LEG HFL TG+LVSLSEQ LVDC  +         + GC+GG+M+ AF YI   
Sbjct: 145 FSATGSLEGQHFLRTGKLVSLSEQNLVDCSDDFG-------NLGCSGGVMDDAFRYIKAN 197

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
           GG++ EK YPYTG D G C FDKS + A  + F  V + DE Q+   +   GP++    +
Sbjct: 198 GGIDTEKSYPYTGED-GQCVFDKSNVGATDTGFVDVQTGDETQLMKAVASVGPIS---VA 253

Query: 283 IELPHISFSF 292
           I+  H+SF F
Sbjct: 254 IDASHLSFQF 263


>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
          Length = 325

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 96/231 (41%), Positives = 136/231 (58%), Gaps = 16/231 (6%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           NA   +  FK  + K YA +++   RF +FK NL RA++ Q+ +  TA +GVT+FSDLTP
Sbjct: 27  NARELYEQFKRDYGKAYANEDDQK-RFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTP 85

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            EF   +LG     R+     +  +      P   DWR  GAV  V+DQG+CGSCW+FS 
Sbjct: 86  EEFAAMYLGS----RIDERVDRVQLNDLQTAPASVDWRKKGAVGPVEDQGSCGSCWAFSV 141

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           T  +EG  FL TG LVSLS+QQLVDCD           D GC+GG     ++ I + GG+
Sbjct: 142 TANVEGQWFLKTGRLVSLSKQQLVDCDR---------LDHGCSGGYPPYTYKEIKRMGGL 192

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           E +  YPYT     +C+ D+SK+ A + +  V+ +DE++ AA L +HGP++
Sbjct: 193 ELQSAYPYTSWK-QACRIDRSKLVAKIDDSIVLETDEEKQAAWLAEHGPMS 242


>gi|242014216|ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
 gi|212512256|gb|EEB15049.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
          Length = 434

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 93/236 (39%), Positives = 141/236 (59%), Gaps = 18/236 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
           ++LL +   F L   KF+K Y ++EE   RFR+F+AN+++       +  TA +G+T+FS
Sbjct: 128 EYLLQSFKDFVL---KFNKVYFSKEEFKKRFRIFRANMKKINFLNKAEKGTAQYGITEFS 184

Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           DL+ +EF+  +LGL ++   P        +P   LP +FDWR + AVT VK+QG+CGSCW
Sbjct: 185 DLSVTEFK-NYLGLKKK---PESKLPTAEIPDVKLPDNFDWRHYNAVTPVKNQGSCGSCW 240

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FS TG +EG   +   EL+SLSEQ+L+DCD           D+GCNGG M   +E I+K
Sbjct: 241 AFSVTGNIEGLWAIKKHELLSLSEQELIDCD---------KIDNGCNGGYMPETYEAIMK 291

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAG 278
            GG+E E DYPY   +   C  +K++I   ++    ++  E  +A  L K+GP++ 
Sbjct: 292 LGGLETETDYPYEA-ENEKCNLNKTEIKVKINGAVNLTKSELDIAKWLYKNGPVSA 346


>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
          Length = 448

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 109/252 (43%), Positives = 147/252 (58%), Gaps = 20/252 (7%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFS 102
           +N    F  FK+KF+K Y + EE   RF VF  N+    R        VH     V +F+
Sbjct: 24  VNKGRLFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHTVDVNQFA 83

Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           DLT  E+R+ +L       L  + Q+  +   N      DWR  GAVT +K+QG CGSCW
Sbjct: 84  DLTNEEYRQLYLRPYPTELLGRERQEVWLDGPN--AGSVDWRQKGAVTPIKNQGQCGSCW 141

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYIL 221
           SFS TG++EGAH ++TG LVSLSEQQLVDC        SGS  + GCNGGLM++AF+YI+
Sbjct: 142 SFSTTGSVEGAHAIATGNLVSLSEQQLVDC--------SGSFGNQGCNGGLMDNAFKYII 193

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNV 280
             GG++ E+DYPYT  DG   K  +SK A ++S +  V  ++EDQ+AA  V+ GP++   
Sbjct: 194 SNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAA-AVEKGPVS--- 249

Query: 281 ASIELPHISFSF 292
            +IE    SF  
Sbjct: 250 VAIEADQQSFQM 261


>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
          Length = 471

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 97/240 (40%), Positives = 141/240 (58%), Gaps = 13/240 (5%)

Query: 40  EQSEDHLLN-AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHG 97
           ++S  H LN  EH F+ F+ KF + Y T  E   RFR+FK NL+  +     +  +A +G
Sbjct: 152 KKSNHHNLNKVEHLFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYG 211

Query: 98  VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
           +T+F+D+T  E++ Q  GL +R    A +     +P  DLP +FDWR+ GA++ VK+QG 
Sbjct: 212 ITEFADMTSPEYK-QRTGLWQRDPQKAASNPKAEIPNIDLPKEFDWREKGAISAVKNQGN 270

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FS TG +EG H + TG L   SEQ+L+DCD         + DS CNGGL ++A+
Sbjct: 271 CGSCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDCD---------TSDSACNGGLPDNAY 321

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           E I K GG+E E DYPY       C F+ +KI   V     +  +E  +A  L+ +GP++
Sbjct: 322 EAIEKIGGLELESDYPYHARK-DQCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPIS 380


>gi|313220237|emb|CBY31096.1| unnamed protein product [Oikopleura dioica]
          Length = 371

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 109/256 (42%), Positives = 150/256 (58%), Gaps = 27/256 (10%)

Query: 33  QVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK-RRQLLD 91
           +  P D + SED   +A   F  F  +  K Y+ QE H  RF+ F  NL+R K    +  
Sbjct: 33  KTTPEDFDVSED---DARKQFENFLLEHPKMYSEQESHS-RFQTFWENLKRIKFHNHIEQ 88

Query: 92  PTAVHGVTKFSDLTPSEFRRQFLGLNRRLR----------LPADAQKAPILPTNDLPTDF 141
            +A +GVT+F+DL+  EFRR +LGL   L+              ++K     T D    F
Sbjct: 89  GSAKYGVTEFTDLSDFEFRRHYLGLKPELKNLNRKKYERKSRNSSKKLKFAKTAD--ETF 146

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DW + GAVT VK+QG CGSCW+FS TG +EGA F +TG+L+SLSEQ+LVDCD +      
Sbjct: 147 DWVEKGAVTEVKNQGMCGSCWAFSTTGNIEGAWFKATGDLISLSEQELVDCDQK------ 200

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
              DSGCNGGLM+ AFE +++ GG+E E+ YPY G    +C F+KS     + +F  I  
Sbjct: 201 ---DSGCNGGLMDQAFEEVIRIGGLETEQQYPYDGVQ-ETCNFEKSLSKVQIDDFMDIGE 256

Query: 262 DEDQMAANLVKHGPLA 277
           DE+++A  L +HGPL+
Sbjct: 257 DEEEIAEALEEHGPLS 272


>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
          Length = 471

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 97/240 (40%), Positives = 141/240 (58%), Gaps = 13/240 (5%)

Query: 40  EQSEDHLLN-AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHG 97
           ++S  H LN  EH F+ F+ KF + Y T  E   RFR+FK NL+  +     +  +A +G
Sbjct: 152 KKSNHHNLNKVEHLFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYG 211

Query: 98  VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
           +T+F+D+T  E++ Q  GL +R    A +     +P  DLP +FDWR+ GA++ VK+QG 
Sbjct: 212 ITEFADMTSPEYK-QRTGLWQRDPQKAASNPKAEIPNIDLPKEFDWREKGAISAVKNQGN 270

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FS TG +EG H + TG L   SEQ+L+DCD         + DS CNGGL ++A+
Sbjct: 271 CGSCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDCD---------TSDSACNGGLPDNAY 321

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           E I K GG+E E DYPY       C F+ +KI   V     +  +E  +A  L+ +GP++
Sbjct: 322 EAIEKIGGLELESDYPYHARK-DQCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPIS 380


>gi|431910221|gb|ELK13294.1| Cathepsin F [Pteropus alecto]
          Length = 458

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 101/239 (42%), Positives = 142/239 (59%), Gaps = 18/239 (7%)

Query: 43  EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
           ED ++     F  F   +++TY T+EE  +R  VF  N+ RA++ Q LD  TA +GVTKF
Sbjct: 151 EDFVMQVASIFKEFVITYNRTYETKEEAQWRMSVFINNMMRAQKIQALDRGTARYGVTKF 210

Query: 102 SDLTPSEFRRQFLG-LNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGAC 158
           SDLT  EFR  +L  L + LR    +++ P+    +   P ++DWR+ GAVT VKDQG C
Sbjct: 211 SDLTEEEFRTIYLNPLLKELR----SKRMPLAMSVSGPAPPEWDWRNKGAVTKVKDQGMC 266

Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
           GSCW+FS TG +EG  FL  G+L+SLSEQ+LVDCD           D  C GGL ++A+ 
Sbjct: 267 GSCWAFSVTGNVEGQWFLKRGDLLSLSEQELVDCDK---------LDKACLGGLPSNAYS 317

Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            I   GG+E E DY Y G    +C F   K    +++   +S +E ++AA L K+GP++
Sbjct: 318 AIKTLGGLETEDDYGYNG-HLQTCNFSAEKAKVYINDSVELSQNEQKLAAWLAKNGPIS 375


>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
          Length = 473

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 134/227 (59%), Gaps = 11/227 (4%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
            F  F  K+ K Y++QEE + R ++F+ NL+ A++ Q LD  +A +GVTKFSDLT  EFR
Sbjct: 174 QFKDFMVKYKKDYSSQEEAERRLQIFQENLKTAEKLQALDQGSAEYGVTKFSDLTEEEFR 233

Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
             +L             K         P  +DWRDHGAV+ VK+QG CGSCW+FS TG +
Sbjct: 234 STYLNPLLSQWTLHRGMKPAPPAKTPAPDSWDWRDHGAVSPVKNQGMCGSCWAFSVTGNI 293

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL  G L+SLSEQ+LVDCD           D  C GGL ++A+E I K GG+E E 
Sbjct: 294 EGQWFLKNGTLLSLSEQELVDCD---------GLDQACRGGLPSNAYEAIEKLGGLESET 344

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY YTG     C F   K+AA +++   +  DE ++AA L ++GP++
Sbjct: 345 DYSYTGHK-QKCDFTNRKVAAYINSSVELPKDEREIAAWLAENGPIS 390


>gi|6649577|gb|AAF21462.1|U69121_1 cysteine proteinase PWCP2 [Paragonimus westermani]
          Length = 260

 Score =  177 bits (450), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 101/244 (41%), Positives = 140/244 (57%), Gaps = 21/244 (8%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           +A   +  FK  + K YA +++   RF +FK NL RA++ QL D  TA +GVT+FSDLTP
Sbjct: 1   SARELYEQFKRXYGKVYANEDDQK-RFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTP 59

Query: 107 SEFRRQFLGLNRRLRLPADA-QKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            EF  ++L        P +  Q   + PT     P   DWR  GAVT V++QG+CGSCW+
Sbjct: 60  EEFAAKYLSA------PVNNDQVKRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWA 113

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS  G +EG  F+ TG+LVSLS+QQLVDCD   D         GCNGG   S++  I+  
Sbjct: 114 FSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD---------GCNGGWPASSYLEIMHM 164

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG+E + DYPY G     C  +K ++ A + +   +   ED  AA L +HGPL+  + +I
Sbjct: 165 GGLESQDDYPYAGVK-EQCFMEKERLLAKIDDSIALXPSEDDNAAYLAEHGPLSTLLNAI 223

Query: 284 ELPH 287
            L +
Sbjct: 224 TLQY 227


>gi|340053963|emb|CCC48256.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 452

 Score =  177 bits (450), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 96/230 (41%), Positives = 129/230 (56%), Gaps = 14/230 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F+ FK K+ ++Y T  E  +R RVF+ N+RR++     +P A  GVT FSDLTP EFR +
Sbjct: 34  FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93

Query: 113 FLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
           +    R         +  + +P    P   DWR  GAVT VKDQG+CGSCWSFSA G +E
Sbjct: 94  YHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIGNIE 153

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGVERE 229
           G    +   L SLSEQ LV CD +         D+GC GG M++AFE+I+K  +G V  E
Sbjct: 154 GQWAAAGNPLTSLSEQMLVSCDSK---------DNGCGGGFMDNAFEWIVKENSGKVYTE 204

Query: 230 KDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           K YPY   G +   CK    ++ A ++    I  DED +A  L  +GP+A
Sbjct: 205 KSYPYVSGGGEEPPCKPRGHEVGATITGHVDIPHDEDAIAKYLADNGPVA 254


>gi|340053966|emb|CCC48259.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
           Y486]
          Length = 447

 Score =  177 bits (450), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 97/233 (41%), Positives = 129/233 (55%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           E  F+ FK K+ ++Y T  E  +R RVF+ N+RR++     +P A  GVT FSDLTP EF
Sbjct: 23  EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 82

Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R ++    R         +  + +P    P   DWR  GAVT VKDQG+CGSCWSFSA G
Sbjct: 83  RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIG 142

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
            +EG    +   L SLSEQ LV CD +         D+GC GG M++AFE+I+K  +G V
Sbjct: 143 NIEGQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKV 193

Query: 227 EREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             EK YPY   DG    C     ++ A ++    I  DED +A  L  +GP+A
Sbjct: 194 YTEKSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVA 246


>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
          Length = 473

 Score =  177 bits (450), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 97/227 (42%), Positives = 140/227 (61%), Gaps = 13/227 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F   +++TY++QEE + R R+F+ N++ A+  Q L+  +A +G+TKFSDLT  EFR 
Sbjct: 175 FKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLTEDEFRM 234

Query: 112 QFLG-LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  +  +  L  + + A         T +DWRDHGAV+ VK+QG CGSCW+FS TG +
Sbjct: 235 MYLNPMLSQWSLKKEMKPAIPASAPAPDT-WDWRDHGAVSPVKNQGMCGSCWAFSVTGNI 293

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  F  TG+L+SLSEQ+LVDCD           D  C GGL ++A+E I   GG+E E 
Sbjct: 294 EGQWFKKTGQLLSLSEQELVDCD---------KLDQACGGGLPSNAYEAIENLGGLETET 344

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY YTG    SC F   K+AA +++   +  DE ++AA L ++GP++
Sbjct: 345 DYSYTGHK-QSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVS 390


>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
          Length = 338

 Score =  177 bits (450), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 101/236 (42%), Positives = 132/236 (55%), Gaps = 19/236 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           +  FK +  K Y ++ E  +R ++F  N  + AK  QL     V    G+ K++D+   E
Sbjct: 27  WQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADMLHHE 86

Query: 109 FRRQFLGLNRRLRLPADAQKA-----PILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCW 162
           F+    G N  +R    AQ+       I P N  +P   DWR HGAVT VKDQG CGSCW
Sbjct: 87  FKETMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGHCGSCW 146

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           SFS+TG+LEG HF   G LVSLSEQ LVDC        +   ++GCNGGLM++AF YI  
Sbjct: 147 SFSSTGSLEGQHFRKAGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKD 199

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
            GGV+ EK YPY G D  SC F+K+ + A  + F  I   DE+ M   +   GP+A
Sbjct: 200 NGGVDTEKSYPYEGID-DSCHFNKATVGATDTGFVDIPQGDEEAMMKAVATMGPVA 254


>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
 gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
          Length = 473

 Score =  177 bits (450), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 97/227 (42%), Positives = 140/227 (61%), Gaps = 13/227 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F   +++TY++QEE + R R+F+ N++ A+  Q L+  +A +G+TKFSDLT  EFR 
Sbjct: 175 FKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLTEDEFRM 234

Query: 112 QFLG-LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  +  +  L  + + A         T +DWRDHGAV+ VK+QG CGSCW+FS TG +
Sbjct: 235 MYLNPMLSQWSLKKEMKPAIPASAPAPDT-WDWRDHGAVSPVKNQGMCGSCWAFSVTGNI 293

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  F  TG+L+SLSEQ+LVDCD           D  C GGL ++A+E I   GG+E E 
Sbjct: 294 EGQWFKKTGQLLSLSEQELVDCD---------KLDQACGGGLPSNAYEAIENLGGLETET 344

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY YTG    SC F   K+AA +++   +  DE ++AA L ++GP++
Sbjct: 345 DYSYTGHK-QSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVS 390


>gi|343417244|emb|CCD20093.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 454

 Score =  177 bits (449), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 98/233 (42%), Positives = 129/233 (55%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           E  F+ FK K+ ++Y T  E  +R RVF+ N+RR++     +P A  GVT FSDLTP EF
Sbjct: 31  EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 90

Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R ++    R         +  + +P    P   DW   GAVT VKDQG CGSCWSFSA G
Sbjct: 91  RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWGRKGAVTPVKDQGTCGSCWSFSAIG 150

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
            +EG    +   L SLSEQ LV CD +         D+GC GGLM++AFE+I+K  +G V
Sbjct: 151 NIEGQWAAAGNPLTSLSEQMLVSCDTK---------DNGCGGGLMDNAFEWIVKENSGKV 201

Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             EK YPY   G +   CK    K+ A ++    I  DED +A  L  +GP+A
Sbjct: 202 YTEKSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVA 254


>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
          Length = 321

 Score =  177 bits (449), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 101/244 (41%), Positives = 140/244 (57%), Gaps = 21/244 (8%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           +A   +  FK  + K YA +++   RF +FK NL RA++ QL D  TA +GVT+FSDLTP
Sbjct: 22  SARELYEQFKRDYGKVYANEDDQK-RFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTP 80

Query: 107 SEFRRQFLGLNRRLRLPADA-QKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            EF  ++L        P +  Q   + PT     P   DWR  GAVT V++QG+CGSCW+
Sbjct: 81  EEFAAKYLSA------PVNNDQVKRVRPTGLKAAPERIDWRAKGAVTAVENQGSCGSCWA 134

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS  G +EG  F+ TG+LVSLS+QQLVDCD   D         GCNGG   S++  I+  
Sbjct: 135 FSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD---------GCNGGWPASSYLEIMHM 185

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG+E + DYPY G     C  +K ++ A + +   +   ED  AA L +HGPL+  + +I
Sbjct: 186 GGLESQDDYPYAGVK-EQCFMEKERLLAKIDDSIALGPSEDDNAAYLAEHGPLSTLLNAI 244

Query: 284 ELPH 287
            L +
Sbjct: 245 TLQY 248


>gi|74229746|ref|YP_308950.1| cathepsin [Trichoplusia ni SNPV]
 gi|72259660|gb|AAZ67431.1| cathepsin [Trichoplusia ni SNPV]
          Length = 344

 Score =  177 bits (449), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 101/283 (35%), Positives = 153/283 (54%), Gaps = 21/283 (7%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           ++L    V+AS    N  +A+I  V  +   + + +L  A  +F  F++K+ K YA   E
Sbjct: 4   IILFFVFVVASGGLDNGVNAVIDYVAAAPHFKLQYNLERAPQYFETFQTKYKKVYADDNE 63

Query: 70  HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR---LRLPADA 126
            DYR+++FK NL     +   + +AV+ + KF+DLT +E   +F GL  +   L+   D 
Sbjct: 64  RDYRYKIFKTNLEIINLKNQQNDSAVYNINKFADLTKNEVIAKFTGLGVKSPNLKNFCD- 122

Query: 127 QKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
              P++   P+      FDWR    +T VKDQG CGSCW+FS    LE  + +   E + 
Sbjct: 123 ---PLIVDGPSKYTQETFDWRQFNKITSVKDQGFCGSCWAFSTIAGLESQYAIKYNEHID 179

Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
           LSEQQLVDCD         + D GC GGL+++A+E I+  GGVE E+DYPY     G C+
Sbjct: 180 LSEQQLVDCD---------TIDMGCAGGLLHTAYEEIMSMGGVEYEEDYPYRSVQ-GPCR 229

Query: 244 FDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIEL 285
            +  K   +V N +  I   ED++   L + GP+A  V +++L
Sbjct: 230 IENDKFQVSVDNCYRYILYSEDKLKDVLHEMGPIAVAVDAVDL 272


>gi|66814630|ref|XP_641494.1| cysteine protease [Dictyostelium discoideum AX4]
 gi|118121|sp|P04989.1|CYSP2_DICDI RecName: Full=Cysteine proteinase 2; AltName: Full=Prestalk
           cathepsin; Flags: Precursor
 gi|167860|gb|AAA33240.1| pst-cathepsin [Dictyostelium discoideum]
 gi|1834417|emb|CAA27050.1| cysteine proteinase 2 [Dictyostelium discoideum]
 gi|60469522|gb|EAL67513.1| cysteine protease [Dictyostelium discoideum AX4]
 gi|225484|prf||1304284A cathepsin,prestalk
          Length = 376

 Score =  177 bits (448), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 99/247 (40%), Positives = 140/247 (56%), Gaps = 16/247 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
           F+ +  KF++ Y++ E  + R+ +FK+N+          D   V G+  F+D+T  E+R+
Sbjct: 36  FTEWTLKFNRQYSSSEFSN-RYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEYRK 94

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
            +LG               +L   DL   P   DWR   AVT +KDQG CGSCWSFS TG
Sbjct: 95  TYLGTRVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTG 154

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           + EGAH L T +LVSLSEQ LVDC     PEE    + GC+GGLMN+AF+YI+K  G++ 
Sbjct: 155 STEGAHALKTKKLVSLSEQNLVDCS---GPEE----NFGCDGGLMNNAFDYIIKNKGIDT 207

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
           E  YPYT   G +C F+KS I A +  +  I++  +    N  +HGP++    +I+  H 
Sbjct: 208 ESSYPYTAETGSTCLFNKSDIGATIKGYVNITAGSEISLENGAQHGPVS---VAIDASHN 264

Query: 289 SFSFLFT 295
           SF  L+T
Sbjct: 265 SFQ-LYT 270


>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
          Length = 451

 Score =  177 bits (448), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 100/231 (43%), Positives = 134/231 (58%), Gaps = 22/231 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F + ++K+YA   E   R  +F  NL  A++ Q LD  +A +GVTKFSDLT  EFR 
Sbjct: 154 FKDFLTTYNKSYANATETQRRLGIFARNLELARKVQELDRGSAEYGVTKFSDLTEEEFRT 213

Query: 112 QFLG-----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            +L      L  R   P  A + P       P  +DWRDHGAVTGVK+QGACGSCW+FS 
Sbjct: 214 SYLNPLLSSLPGRALRPGPATRGPA------PASWDWRDHGAVTGVKNQGACGSCWAFSV 267

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TG +EG  FL  G L++LSEQ+LVDCD         + D  C GGL ++A+  I K GG+
Sbjct: 268 TGNVEGQWFLRRGALLALSEQELVDCD---------TLDQACGGGLPSNAYTAIEKLGGL 318

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           E EKDY Y G     C F   K    +++   +S DE+++A  L ++GP++
Sbjct: 319 ETEKDYSYEGRK-ERCSFSPDKARVYINSSVDLSRDEEELATWLAENGPVS 368


>gi|118156|sp|P14658.1|CYSP_TRYBB RecName: Full=Cysteine proteinase; Flags: Precursor
 gi|10393|emb|CAA34485.1| unnamed protein product [Trypanosoma brucei]
          Length = 450

 Score =  176 bits (447), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 109/288 (37%), Positives = 151/288 (52%), Gaps = 43/288 (14%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M R +   ++LL +++ LAS              V       E+ L   E  F+ FK K+
Sbjct: 6   MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
            K Y   +E  +RFR F+ N+ +AK +   +P A  GVT FSD+T  EFR +       F
Sbjct: 49  GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
               +RLR      K   + T   P   DWR+ GAVT VK QG CGSCW+FS  G +EG 
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQ 162

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
             ++   LVSLSEQ LV CD         + DSGCNGGLM++AF +I+ +  G V  E  
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEAS 213

Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   +G    C+ +  +I AA+++   +  DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261


>gi|340053969|emb|CCC48263.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
           Y486]
          Length = 259

 Score =  176 bits (447), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 96/230 (41%), Positives = 127/230 (55%), Gaps = 14/230 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F+ FK K+ ++Y T  E  +R RVF+ N+RR++     +P A  GVT FSDLTP EFR +
Sbjct: 34  FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93

Query: 113 FLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
           +    R         +  + +P    P   DWR  GAVT VKDQG CGSCWSFSA G +E
Sbjct: 94  YHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIGNIE 153

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGVERE 229
           G    +   L SLSEQ LV CD +         D+GC GG M++AFE+I+K  +G V  E
Sbjct: 154 GQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKVYTE 204

Query: 230 KDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           K YPY   DG    C     ++ A ++    I  DED +A  L  +GP+A
Sbjct: 205 KSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVA 254


>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
 gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
          Length = 340

 Score =  176 bits (447), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 106/255 (41%), Positives = 138/255 (54%), Gaps = 23/255 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           +  FK +  K Y  + E  +R ++F  N  + AK  QL     V    G+ K++D+   E
Sbjct: 28  WQTFKLEHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEVSFKMGLNKYADMLHHE 87

Query: 109 FRRQFLGLNRRLRLPADAQKAP------ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
           F     G N  L     A  A       I P +  LP   DWR+ GAVTGVKDQG CGSC
Sbjct: 88  FHETMNGFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQGHCGSC 147

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS+TGALEG HF  TG L+SLSEQ LVDC        +   ++GCNGGLM++AF YI 
Sbjct: 148 WAFSSTGALEGQHFRKTGTLISLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 200

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
             GG++ EK YPY G D  SC F+K  I A    F+ I   DE ++A  +   GP++   
Sbjct: 201 DNGGIDTEKSYPYEGID-DSCHFNKGTIGATDRGFTDIPQGDEKKLAQAVATIGPVS--- 256

Query: 281 ASIELPHISFSFLFT 295
            +I+  H SF F  T
Sbjct: 257 VAIDASHESFQFYST 271


>gi|340053968|emb|CCC48262.1| cysteine peptidase, Clan CA, family C1,Cathepsin L-like, fragment,
           partial [Trypanosoma vivax Y486]
          Length = 323

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 96/230 (41%), Positives = 127/230 (55%), Gaps = 14/230 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F+ FK K+ ++Y T  E  +R RVF+ N+RR++     +P A  GVT FSDLTP EFR +
Sbjct: 34  FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93

Query: 113 FLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
           +    R         +  + +P    P   DWR  GAVT VKDQG CGSCWSFSA G +E
Sbjct: 94  YHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGRCGSCWSFSAIGNIE 153

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGVERE 229
           G    +   L SLSEQ LV CD +         D+GC GG M++AFE+I+K  +G V  E
Sbjct: 154 GQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKVYTE 204

Query: 230 KDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           K YPY   DG    C     ++ A ++    I  DED +A  L  +GP+A
Sbjct: 205 KSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVA 254


>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
 gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
          Length = 337

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 107/279 (38%), Positives = 154/279 (55%), Gaps = 32/279 (11%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
           +I S+L+LL++      A+A        R     DG       L  ++ F  + +K  K+
Sbjct: 1   MIASTLILLVVVGATPFAIA--------RPAALEDGRA-----LEIKNMFEDWAAKHGKS 47

Query: 64  YATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR- 121
           Y++  E   R  +F   L   ++     + T   G+ KFSDLT +EFR   +G  +R R 
Sbjct: 48  YSSDWEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRY 107

Query: 122 ---LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
              LPA+ +   +   + LPT  DWR  GAVT +KDQG CGSCW+FSA  ++E AHFL+T
Sbjct: 108 QDRLPAEDEDVDV---SSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLAT 164

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
            ELVSLSEQQL+DCD         + D+GC+GGLM +AF++++K GGV  E  YPYTG+ 
Sbjct: 165 KELVSLSEQQLMDCD---------TVDAGCDGGLMETAFKFVVKNGGVTTEAAYPYTGSV 215

Query: 239 GGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPL 276
            GSC  +K+K   A ++ F V++ D        V   P+
Sbjct: 216 -GSCNANKAKNKVAEITGFKVVTEDSADALMKAVSKTPV 253


>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 102/250 (40%), Positives = 145/250 (58%), Gaps = 20/250 (8%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH---GVTKFSDLT 105
           AE H++ FKS   K+Y   +E   R  +F+ NL   +    ++ +      GV +F+D+T
Sbjct: 24  AEPHWNAFKSTHLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMT 83

Query: 106 PSEFRRQFLGLNRRLRLPADA--QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            +EF    LGL  R ++  D+  + + +    DLP + DW   G VT VK+QG CGSCW+
Sbjct: 84  NTEFSNMLLGLGGRNKIAGDSVFESSHV---QDLPAEVDWTQKGYVTEVKNQGQCGSCWA 140

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG+LEG  F  TG+LVSLSEQ LVDC        +   + GCNGGLM+ AF YI K 
Sbjct: 141 FSTTGSLEGQVFKKTGKLVSLSEQNLVDCS-------TSEGNQGCNGGLMDQAFTYIKKN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
           GG++ E  YPYTG+D G+C+F ++K+ A VS F  V S DE+ +   +   GP++    +
Sbjct: 194 GGIDTEAAYPYTGSD-GTCRFLENKVGATVSGFVDVKSGDENALKEAVATVGPIS---VA 249

Query: 283 IELPHISFSF 292
           I+   I F F
Sbjct: 250 IDASSIFFQF 259


>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 109/279 (39%), Positives = 144/279 (51%), Gaps = 39/279 (13%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           RL +  +L+  + S LA    V  D                    NA   +  FK K+ K
Sbjct: 2   RLFVCCVLVTTIWSALARTTQVEPD--------------------NARALYEEFKLKYKK 41

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           TY+  ++ + RF +FK NL RAKR Q ++  TA +GVT+FSDLT  EF+ ++L    R+R
Sbjct: 42  TYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMR 96

Query: 122 LPADAQKAPILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                    + P  D+  D   FDWR+HGAV  V DQG CGSCW+FS  G + G  F  T
Sbjct: 97  FDGPIVSEDLTPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKT 156

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L++LSEQ LVDCD+          D GC+GG        I K GG+E   DYPYTG  
Sbjct: 157 GHLLALSEQPLVDCDY---------LDGGCDGGYPPQTNTAIQKMGGLELASDYPYTGV- 206

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG C  DKSK  A ++  +++   E   A  L   GPL+
Sbjct: 207 GGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLS 245


>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
 gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
          Length = 343

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 107/282 (37%), Positives = 153/282 (54%), Gaps = 34/282 (12%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
            +I S+L+LL++      A+A        R     DG       L  ++ F  + +K  K
Sbjct: 4   NMIASTLILLVVVGATPFAIA--------RPAALEDGRA-----LEIKNMFEDWAAKHGK 50

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           +Y++  E   R  +F   L   ++     + T   G+ KFSDLT +EFR   +G  +R R
Sbjct: 51  SYSSDLEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPR 110

Query: 122 ----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
               LPA+ +   +   + LPT  DWR  GAVT +KDQG CGSCW+FSA  ++E AHFL+
Sbjct: 111 YQDRLPAEDEDVDV---SSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLA 167

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
           T ELVSLSEQQL+DCD         + D+GC+GGLM +AF++++K GGV  E  YPYTG+
Sbjct: 168 TKELVSLSEQQLMDCD---------TVDAGCDGGLMETAFKFVVKNGGVTTEASYPYTGS 218

Query: 238 DGGSCKFDKSKI---AAAVSNFSVISSDEDQMAANLVKHGPL 276
             GSC  +K  I    A ++ F V++ D        V   P+
Sbjct: 219 V-GSCNANKVAIINKVAEITGFKVVTEDSADALMKAVSKTPV 259


>gi|302794759|ref|XP_002979143.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
 gi|300152911|gb|EFJ19551.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
          Length = 227

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 87/156 (55%), Positives = 113/156 (72%), Gaps = 17/156 (10%)

Query: 129 APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
           AP+LPT++LP  FDWR+HGA+T VK+QG+CGSCW+FS+TGA+EGAHFL + EL+SL E+Q
Sbjct: 1   APLLPTDNLPKSFDWREHGAMTPVKNQGSCGSCWTFSSTGAVEGAHFLKSRELISLREEQ 60

Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS------- 241
           LVDCD           D GC GG M +A+EYI KA G+E E+DYPY   +          
Sbjct: 61  LVDCDRM---------DGGCKGGDMLNAYEYI-KAKGLEAEEDYPYQEENYKEYMFPHHR 110

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           C F  SK+AA ++N+S +S DEDQ+AANLVK+GPL+
Sbjct: 111 CHFRPSKVAATIANYSTVSEDEDQIAANLVKNGPLS 146


>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
          Length = 1032

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 100/260 (38%), Positives = 147/260 (56%), Gaps = 17/260 (6%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLT 105
           + +E  F  F + +++TYAT+EE + R  +F+ NL   +  R+    T  +GV +F+D++
Sbjct: 721 MRSERLFENFVNTYNRTYATEEERNLRLSIFRENLGIIRLLRKNEQGTGQYGVNQFADVS 780

Query: 106 PSEFRRQFLGLNRRLRLPADAQ-KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
             EF   +LGL   LR   +   +   +P  +LP  FDWR  GAVT VK+QG CGSCW+F
Sbjct: 781 TEEFHAFYLGLRPDLRTENNIPLRQAEIPDIELPNSFDWRQKGAVTPVKNQGMCGSCWAF 840

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           S TG +EG + +   +L+SLSEQ+LVDCD           D GCNGGL ++A+  I K G
Sbjct: 841 SVTGNVEGQYAIKHNKLLSLSEQELVDCD---------DLDEGCNGGLPDNAYRAIEKLG 891

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA----GNV 280
           G+E E DYPY   +   C F K+     V +   I+S+E Q+A  LV +GP++     N 
Sbjct: 892 GLELESDYPYEA-ENERCHFKKNMAKVQVGSAVNITSNETQIAQWLVANGPISIGINANA 950

Query: 281 ASIELPHISFSFLFTVSSPK 300
               +  +S  F F + +PK
Sbjct: 951 MQFYMGGVSHPFKF-LCNPK 969


>gi|432091081|gb|ELK24293.1| Cathepsin F, partial [Myotis davidii]
          Length = 410

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 99/242 (40%), Positives = 139/242 (57%), Gaps = 24/242 (9%)

Query: 43  EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
           +D  L     F  F + +++TY T+EE  +R  VF  N+ RA++ Q LD  TA +GVTKF
Sbjct: 103 QDFYLRMASLFKYFITTYNRTYETEEEAQWRMSVFINNMIRAQKIQALDRGTAQYGVTKF 162

Query: 102 SDLTPSEFRRQFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
           SDLT  EFR  +L       L +++RL            +  P ++DWR  GAVT VK+Q
Sbjct: 163 SDLTEEEFRTMYLNPLLKEELGKKMRLVK-------FVGDPAPPEWDWRKKGAVTKVKNQ 215

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
           G CGSCW+FS TG +EG  FL  G+L+SLSEQ+LVDCD           D  C GGL ++
Sbjct: 216 GMCGSCWAFSVTGNVEGQWFLKRGDLLSLSEQELVDCD---------KVDKACMGGLPSN 266

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
           A+  I   GG+E E DY Y+G    +C F   K    +++   +S +E ++AA L K+GP
Sbjct: 267 AYSAIKTLGGLETEDDYSYSG-HLQTCSFSAQKAKVYINDSVELSHNEQELAAWLAKNGP 325

Query: 276 LA 277
           ++
Sbjct: 326 IS 327


>gi|375073982|gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii]
          Length = 467

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 100/237 (42%), Positives = 129/237 (54%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK ++ + Y +  E  +R  VF+ NL  AK     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFADFKQRYGRVYKSAAEEAFRLSVFRKNLLDAKLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F    +R R+P D      +   D P   DWRD GAVT VKDQG CGSCW+F
Sbjct: 97  RHHSGAAHFAAGRKRARVPVD------VGVGDAPAAVDWRDRGAVTPVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK-- 222
           SA G +EG  FL+   L SLSEQ LV CD         + DSGC+GGLMNSAFE+I++  
Sbjct: 151 SAIGNVEGQWFLAGNALTSLSEQMLVSCD---------TMDSGCDGGLMNSAFEWIVEHH 201

Query: 223 AGGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E+ Y Y   DG +  C+     + A ++    +  DE +MA  L  +GPLA
Sbjct: 202 NGTVYTEESYRYASGDGIAQPCRTSGRTVGAVITGHVKLPPDEAKMATWLAANGPLA 258


>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like
           [Strongylocentrotus purpuratus]
          Length = 453

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 95/246 (38%), Positives = 137/246 (55%), Gaps = 23/246 (9%)

Query: 53  FSLFKSKFSKTYATQE---EHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSE 108
           F  F   F + Y   +   E++YR+ VF  N+   +   Q    TA +G TKF+D+T +E
Sbjct: 156 FDKFLMTFKREYRQNDGTNEYEYRYSVFVQNMLTVEMFNQFEQGTAKYGPTKFADMTEAE 215

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           FR+   G  ++  +    +K   +P   +P ++DWR HGAVT VK+QG CGSCW+FSA G
Sbjct: 216 FRKLQSGPLKKTGI----KKQAAIPQGPVPEEYDWRTHGAVTPVKNQGMCGSCWAFSAIG 271

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
            +EG   +  GEL+SLSEQ+LVDCD           D GC GG M+ A+E I+K GG   
Sbjct: 272 NMEGQWQIKKGELISLSEQELVDCD---------KVDGGCEGGEMSDAYEAIIKLGGAMS 322

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
           E+ YPY G +   CKF+ + +   ++ +  IS +E +MA  L  HGP+     SI +  +
Sbjct: 323 EEKYPYRG-ENEKCKFNMTDVRVKINGYVNISKNETEMAGWLAAHGPI-----SIGINAL 376

Query: 289 SFSFLF 294
              F F
Sbjct: 377 MMQFYF 382


>gi|72389859|ref|XP_845224.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359932|gb|AAX80357.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801759|gb|AAZ11665.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 450

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 108/288 (37%), Positives = 150/288 (52%), Gaps = 43/288 (14%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M R +   ++LL +++ LAS              V       E+ L   E  F+ FK K+
Sbjct: 6   MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
            K Y   +E  +RFR F+ N+ +AK +   +P A  GVT FSD+T  EFR +       F
Sbjct: 49  GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
               +RLR      K   + T   P   DWR+ GAVT VKDQG CGSCW+FS  G +EG 
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
             ++   LVSLSEQ LV CD         + D GC GGLM++AF +I+ +  G V  E  
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213

Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   +G    C+ +  +I AA+++   +  DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261


>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 107/246 (43%), Positives = 144/246 (58%), Gaps = 20/246 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
           F  FK+ F K Y + EE   RF +F  NL    R        +H    GV +F+DLT  E
Sbjct: 20  FDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEE 79

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +R+ +L       L  + Q+  +   N      DWR  GAVT +K+QG CGSCWSFS TG
Sbjct: 80  YRQLYLRPYPTELLGRERQEVWLDGPN--AGSVDWRQKGAVTPIKNQGQCGSCWSFSTTG 137

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVE 227
           ++EGAH ++TG LVSLSEQQLVDC        SGS  + GCNGGLM++AF+YI+  GG++
Sbjct: 138 SVEGAHAIATGNLVSLSEQQLVDC--------SGSFGNQGCNGGLMDNAFKYIISNGGLD 189

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELP 286
            E+DYPYT  DG   K  +SK A ++S +  V  ++EDQ+AA  V+ GP++    +IE  
Sbjct: 190 TEQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAA-AVEKGPVS---VAIEAD 245

Query: 287 HISFSF 292
             SF  
Sbjct: 246 QQSFQM 251


>gi|15485586|emb|CAC67416.1| cysteine protease [Trypanosoma brucei rhodesiense]
          Length = 450

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 108/288 (37%), Positives = 150/288 (52%), Gaps = 43/288 (14%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M R +   ++LL +++ LAS              V       E+ L   E  F+ FK K+
Sbjct: 6   MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
            K Y   +E  +RFR F+ N+ +AK +   +P A  GVT FSD+T  EFR +       F
Sbjct: 49  GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
               +RLR      K   + T   P   DWR+ GAVT VKDQG CGSCW+FS  G +EG 
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
             ++   LVSLSEQ LV CD         + D GC GGLM++AF +I+ +  G V  E  
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213

Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   +G    C+ +  +I AA+++   +  DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261


>gi|261328617|emb|CBH11595.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
 gi|261328620|emb|CBH11598.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
          Length = 450

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 108/288 (37%), Positives = 150/288 (52%), Gaps = 43/288 (14%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M R +   ++LL +++ LAS              V       E+ L   E  F+ FK K+
Sbjct: 6   MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
            K Y   +E  +RFR F+ N+ +AK +   +P A  GVT FSD+T  EFR +       F
Sbjct: 49  GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
               +RLR      K   + T   P   DWR+ GAVT VKDQG CGSCW+FS  G +EG 
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
             ++   LVSLSEQ LV CD         + D GC GGLM++AF +I+ +  G V  E  
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213

Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   +G    C+ +  +I AA+++   +  DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261


>gi|261328615|emb|CBH11593.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
          Length = 451

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 108/288 (37%), Positives = 150/288 (52%), Gaps = 43/288 (14%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M R +   ++LL +++ LAS              V       E+ L   E  F+ FK K+
Sbjct: 6   MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
            K Y   +E  +RFR F+ N+ +AK +   +P A  GVT FSD+T  EFR +       F
Sbjct: 49  GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
               +RLR      K   + T   P   DWR+ GAVT VKDQG CGSCW+FS  G +EG 
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
             ++   LVSLSEQ LV CD         + D GC GGLM++AF +I+ +  G V  E  
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213

Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   +G    C+ +  +I AA+++   +  DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261


>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 457

 Score =  174 bits (442), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 103/248 (41%), Positives = 148/248 (59%), Gaps = 26/248 (10%)

Query: 21  AVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN 80
            V  N D  +IR  +P+D    +D LL  +  F+ +  K  K Y+  EE  +RF V+K N
Sbjct: 21  GVVANGD--VIR--MPTD--VGKDQLLAGQ--FAAWAHKHGKVYSAAEERAHRFLVWKDN 72

Query: 81  LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTND 136
           L   +R    + +   G+TKF+DLT  EFRRQ+ G     +RRL+   +A  +     ++
Sbjct: 73  LEYIQRHSEKNLSYWLGLTKFADLTNEEFRRQYTGTRIDRSRRLKKGRNATGSFRYANSE 132

Query: 137 LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHEC 196
            P   DWR+ GAVT VKDQG+CGSCW+FSA G++EG + + TG+ +SLS Q+LVDCD + 
Sbjct: 133 APKSIDWREKGAVTSVKDQGSCGSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKK- 191

Query: 197 DPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF 256
                   + GCNGGLM+ AF+++++ GG++ EKDYPY G DG   + D +K+ A V   
Sbjct: 192 -------YNQGCNGGLMDYAFDFVIQNGGIDTEKDYPYQGYDG---RCDVNKMNARV--- 238

Query: 257 SVISSDED 264
             I S ED
Sbjct: 239 VTIDSYED 246


>gi|340053971|emb|CCC48265.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
           Y486]
          Length = 389

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 95/230 (41%), Positives = 126/230 (54%), Gaps = 14/230 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F+ FK K+ ++Y T  E  +R RVF+ N+RR++     +P A  GVT FSDLTP EFR +
Sbjct: 34  FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEFRTR 93

Query: 113 FLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
           +    R         +  + +P    P   DWR  GAVT VKDQG CGSCWSFSA G +E
Sbjct: 94  YHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIGNIE 153

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGVERE 229
           G    +   L SLSEQ LV CD +         D+GC GG M++AFE+I+K  +G V   
Sbjct: 154 GQWAAAGNPLTSLSEQMLVSCDFK---------DNGCGGGFMDNAFEWIVKENSGKVYTG 204

Query: 230 KDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           K YPY   DG    C     ++ A ++    I  DED +A  L  +GP+A
Sbjct: 205 KSYPYVSEDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVA 254


>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
          Length = 603

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 100/235 (42%), Positives = 131/235 (55%), Gaps = 21/235 (8%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           NA   +  FK K+ KTY   ++ +YRF VFK NL RA + Q ++  TA +GVT+F DLT 
Sbjct: 302 NARQLYEEFKQKYKKTYVNDDD-EYRFSVFKENLLRAHQLQTMEQGTAEYGVTQFFDLTS 360

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPIL-PTNDLPTD---FDWRDHGAVTGVKDQGACGSCW 162
            EF+ Q+LG         D Q    + P+  +  D   FDWRDHGAV  V DQG CGSCW
Sbjct: 361 QEFQIQYLGFKYE-----DMQDTEEMSPSTRVVMDEDSFDWRDHGAVGPVLDQGKCGSCW 415

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FS  G +EG  FL TGEL+SLSEQQL+DCD         + D GCNGG     +  ++K
Sbjct: 416 AFSTIGNIEGQWFLKTGELLSLSEQQLIDCD---------NVDEGCNGGYPPKTYGAVIK 466

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            GG+E   DYPY       C  D+ K+   +++  V   +E   A  L   GPL+
Sbjct: 467 MGGLELNSDYPYKAL-AEKCHMDRQKLKVYINDSVVFPRNEHLQAEALKLMGPLS 520



 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 66/139 (47%), Positives = 83/139 (59%), Gaps = 12/139 (8%)

Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
           +FDWR HGAV  V +QG CGSCW+FSA G +EG  FL +GEL+ LS QQ++DCDH     
Sbjct: 42  NFDWRQHGAVGPVWNQGPCGSCWAFSAVGNIEGQWFLKSGELLHLSVQQVLDCDH----- 96

Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
                D GCNGG     +  + + GG++ + DY Y     G C  D+SK  A V N SVI
Sbjct: 97  ----VDHGCNGGYPPQVYRQVNQMGGLQLDADYSYKAAV-GKCHTDRSKFRAYV-NSSVI 150

Query: 260 SSDEDQMAANLVKH-GPLA 277
            S  +Q  AN +K  GPLA
Sbjct: 151 LSQNEQFQANKLKTIGPLA 169


>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
 gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
          Length = 475

 Score =  174 bits (440), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 103/263 (39%), Positives = 151/263 (57%), Gaps = 19/263 (7%)

Query: 22  VAVNDDDAM-IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN 80
           + + DDD++ ++++  +   +  D+++   + F  F  +  K Y+ + E   RFR FK N
Sbjct: 142 IQLTDDDSITVQELRKAKIIRPRDYVI--WNSFLDFIDRHEKRYSNKREVLKRFRTFKKN 199

Query: 81  LRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRL----PADAQKAPI-LPT 134
            +  +  Q  +  TAV+G TKFSD+T  EF++  L       +     AD +K  I +  
Sbjct: 200 AKAIRELQKNEQGTAVYGFTKFSDMTTMEFKQTMLPYQWEQPVYPMDQADFEKEGITISE 259

Query: 135 NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDH 194
            DLP  FDWRD GAVT VK+QG CGSCW+FS TG +EGA FL+  +LVSLSEQ+LVDCD 
Sbjct: 260 EDLPESFDWRDKGAVTQVKNQGNCGSCWAFSTTGNVEGAWFLAKNKLVSLSEQELVDCD- 318

Query: 195 ECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS 254
                     D GCNGGL ++A++ I++ GG+E E  YPY G  G +C   +  IA  ++
Sbjct: 319 --------GVDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGK-GETCHLVRKDIAVYIN 369

Query: 255 NFSVISSDEDQMAANLVKHGPLA 277
               +  DE +M   LV  GP++
Sbjct: 370 GSIELPHDEVEMQKWLVTKGPIS 392


>gi|10391|emb|CAA38238.1| unnamed protein product [Trypanosoma brucei]
          Length = 450

 Score =  174 bits (440), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 107/288 (37%), Positives = 150/288 (52%), Gaps = 43/288 (14%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M R +   ++LL +++ LAS              V       E+ L   E  F+ FK K+
Sbjct: 6   MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
            K Y   +E  +RFR F+ N+ +AK +   +P A  GVT FSD+T  EFR +       F
Sbjct: 49  GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
               +R+R      K   + T   P   DWR+ GAVT VKDQG CGSCW+FS  G +EG 
Sbjct: 109 AAAQKRVR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
             ++   LVSLSEQ LV CD         + D GC GGLM++AF +I+ +  G V  E  
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213

Query: 232 YPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   +G    C+ +  +I AA+++   +  DED +AA L ++GPLA
Sbjct: 214 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 261


>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score =  174 bits (440), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 104/297 (35%), Positives = 162/297 (54%), Gaps = 18/297 (6%)

Query: 1   MERLILSSLLLLLLSSVLASA-VAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSK 59
           M+ + +++L    L S++++  +++ + DA       S      D  +NA +   L K  
Sbjct: 1   MKLIPMATLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVK-- 58

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL--- 116
             KTY    E D RF++FK NLR        D T   G+ KF+DLT  E+R  + G+   
Sbjct: 59  HGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMTYTGIKTI 118

Query: 117 -NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
            +++      + +      + LP   DWR+ GAVT VKDQG+CGSCW+FS TG++EG + 
Sbjct: 119 DDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNK 178

Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
           + TG+L+S+SEQ+LV+CD         S + GCNGGLM+ AFE+I+K GG++ E+DYPYT
Sbjct: 179 IVTGDLISVSEQELVNCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYT 230

Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           G DG   K  K+     + ++  +  +++      V + P+A    +IE     F F
Sbjct: 231 GKDGKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVA---VAIEAGGRDFQF 284


>gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332642714|gb|AEE76235.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 290

 Score =  174 bits (440), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 96/242 (39%), Positives = 142/242 (58%), Gaps = 18/242 (7%)

Query: 62  KTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
           K Y    E + RF++FK NL+   +   + D T   G+T+F+DLT  EFR  +L   +++
Sbjct: 53  KNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYL--RKKM 110

Query: 121 RLPADAQKAP--ILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
               D+ K    +    D LP + DWR +GAV  VKDQG CGSCW+FSA GA+EG + ++
Sbjct: 111 ERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQIT 170

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
           TGEL+SLSEQ+LVDCD        G  ++GC+GG+MN AFE+I+K GG+E ++DYPY   
Sbjct: 171 TGELISLSEQELVDCDR-------GFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223

Query: 238 DGGSCKFDKSKIAAAVS--NFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
           D G C  DK+     V+   +  +  D+++     V H P++    +IE    +F    +
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVS---VAIEASSQAFQLYKS 280

Query: 296 VS 297
           V+
Sbjct: 281 VN 282


>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
 gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
          Length = 415

 Score =  173 bits (439), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 106/295 (35%), Positives = 157/295 (53%), Gaps = 26/295 (8%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD--------GEQSEDHLLNAEHHFSL 55
           L+ +++ LL+ +S L       DDD  +    P +         E  E+H  NA   F  
Sbjct: 67  LVAAAVSLLVFASFLIQWQG--DDDRGVFPPSPVEDHKTPVNIWEWKEEHFQNA---FGS 121

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
           F++ + K+YAT+EE   R+ +FK NL           +    +  F DL+  EFRR++LG
Sbjct: 122 FRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQGYSYSLKMNHFGDLSREEFRRKYLG 181

Query: 116 LNRRLRLPAD----AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
            N+   L ++    A +   +  +D+P+  DWR+ G VT VKDQ  CGSCW+FSATGALE
Sbjct: 182 YNKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCWAFSATGALE 241

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           GAH   TGEL+SLSEQ+LVDC            + GC+GG MN AF+Y++ +GG+  E+ 
Sbjct: 242 GAHCAKTGELLSLSEQELVDCS-------LAEGNQGCSGGEMNDAFQYVVDSGGLCSEEG 294

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELP 286
           YPY   D G CK    K+   +S F  +    +      + H P++  + + +LP
Sbjct: 295 YPYLARD-GECKRACKKV-VTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLP 347


>gi|343477619|emb|CCD11596.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  173 bits (439), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 96/236 (40%), Positives = 133/236 (56%), Gaps = 20/236 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFR+FK ++ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 97

Query: 110 RRQFLGLNRR----LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           R  +L   +     L+ P   +K   + T   P   DWR  GAVT VKDQ  CGSCW+FS
Sbjct: 98  RATYLNGAKYYAAALKRP---RKVVTVSTGKAPPAIDWRKKGAVTPVKDQRKCGSCWAFS 154

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
           A G +EG   ++  EL SLSEQ LV CD+          D GC GGLM+ A ++I+ +  
Sbjct: 155 AIGNIEGQWKVAGHELTSLSEQMLVSCDNM---------DDGCQGGLMDRALKWIVSSNK 205

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G V  E+ YPY  TDG     +KS   + A +S    +  DE+ +A  L K+GP+A
Sbjct: 206 GNVFTEESYPYDSTDGDVPPCNKSGKVVGAKISGLINLPKDENAIAEWLAKNGPIA 261


>gi|339244639|ref|XP_003378245.1| cathepsin F [Trichinella spiralis]
 gi|316972864|gb|EFV56510.1| cathepsin F [Trichinella spiralis]
          Length = 366

 Score =  173 bits (439), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 91/229 (39%), Positives = 138/229 (60%), Gaps = 15/229 (6%)

Query: 51  HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEF 109
            +F  F  +F+K Y T++    ++ +FK+N+  AKR Q  +  TA++G T F+D+TP EF
Sbjct: 64  ENFKQFMVEFNKWYETEKLTAEKYNIFKSNMVIAKRLQEEEQGTAIYGPTIFADMTPEEF 123

Query: 110 RRQFLGLN-RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R+  L  N   ++ P   ++   +P +++    DWR   AVT VKDQG CGSCW+F    
Sbjct: 124 RKTHLNFNPNNVKKP---KRMANIPKSNISERMDWRKFNAVTSVKDQGNCGSCWAFCTVA 180

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
            +EGA  + T +L+SLSEQQLVDCD           D GC GGL  +A+  I++ GG+E+
Sbjct: 181 NIEGAWAVKTAQLISLSEQQLVDCDR---------LDDGCEGGLPVNAYLEIIRLGGLEK 231

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           E+DY YT    G CKF+ +K A  +++  V+  DED +A  + ++GP+A
Sbjct: 232 EEDYKYTAR-SGKCKFNHTKSAVYINDTVVLPEDEDAIARYVSENGPVA 279


>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 335

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 99/248 (39%), Positives = 141/248 (56%), Gaps = 20/248 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSE 108
           ++ +K++  K Y + EE   R  +++ NL    +   +  L   T   G+ +F+DL   E
Sbjct: 28  WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLKNEE 87

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           F     G  R       A+ +  LP+N+   LP   DWR  G VT VKDQG CGSCW+FS
Sbjct: 88  FVAMMTGF-RVNGTSKAAKGSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCWAFS 146

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TG+LEG HF +TG+LVSLSEQ LVDC  +   E       GC+GGLM+ AF+YI+KAGG
Sbjct: 147 TTGSLEGQHFKATGKLVSLSEQNLVDCSGKEGNE-------GCDGGLMDQAFQYIIKAGG 199

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIE 284
           ++ E+ YPY   D G C F K+ I A V+ ++ ++SD +      V H GP++    +I+
Sbjct: 200 IDTEESYPYKAVD-GECHFKKANIGATVTGYTDVTSDSETALQKAVAHIGPIS---VAID 255

Query: 285 LPHISFSF 292
             H+SF  
Sbjct: 256 ASHMSFQL 263


>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
          Length = 461

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 92/230 (40%), Positives = 134/230 (58%), Gaps = 15/230 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F  KF + Y++ EE   RFR++  N+  AK+ Q  +  TA++G TKFSD+T  EF++
Sbjct: 159 FMTFIKKFKREYSSIEEQLDRFRIYLQNMNFAKKLQFEEKGTAIYGATKFSDMTAEEFQK 218

Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
             L      R+ ++     +    L   +LP+ FDWR  G VT VKDQG+CGSCW+FS T
Sbjct: 219 IMLPSIWWDRVESNGITFNLNDFNLSIYNLPSKFDWRTEGVVTPVKDQGSCGSCWAFSVT 278

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           G +E    + TG+L+SLSEQ+L+DCD           D GCNGGL  +AF  I + GG+E
Sbjct: 279 GNIESLWAIKTGKLISLSEQELIDCD---------VIDKGCNGGLPINAFREIKRMGGLE 329

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            E  YPY   + G+C   +++IA ++ +   I  +E  M A + + GPL+
Sbjct: 330 PEDQYPYEAKN-GTCHLVRAQIAVSIDDAVEIPRNETVMKAWIAQRGPLS 378


>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
 gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
          Length = 339

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 105/252 (41%), Positives = 139/252 (55%), Gaps = 23/252 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLL---DPTAVHGVTKFSDLTPSE 108
           +  FK +  KTY  + E  +R ++F  N  + AK  Q     + T    V K++D+   E
Sbjct: 27  WHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYADMLHHE 86

Query: 109 FRRQFLGLN----RRLRL--PADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
           FR    G N    + LR   P+      I P +  LP   DWR+ GAVT VKDQG CGSC
Sbjct: 87  FRETMNGFNYTLHKELRASDPSFTGITFISPAHVKLPKSVDWREKGAVTAVKDQGHCGSC 146

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS+TGALEG HF  TG LVSLSEQ LVDC        +   ++GCNGGLM++AF YI 
Sbjct: 147 WAFSSTGALEGQHFRKTGTLVSLSEQNLVDC-------SAKYGNNGCNGGLMDNAFRYIK 199

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
             GG++ EK YPY G D  SC F+K  + A    F+ I   +E +MA  +   GP++   
Sbjct: 200 DNGGIDTEKSYPYEGID-DSCHFNKDSVGATDRGFADIPQGNEKKMAEAVATIGPVS--- 255

Query: 281 ASIELPHISFSF 292
            +I+  H SF F
Sbjct: 256 VAIDASHESFQF 267


>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
          Length = 337

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 102/250 (40%), Positives = 135/250 (54%), Gaps = 21/250 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           +  FK +  K + ++ E  +R ++F  N  + AK  QL     V    G+ K+SD+   E
Sbjct: 27  WQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDMLYHE 86

Query: 109 FRRQFLGLNRRLRLPADAQK----APILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           F+    G N  +R    AQ       I P N  +P   DWR HGAVT VKDQG CGSCW+
Sbjct: 87  FKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHCGSCWA 146

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS+T ALEG HF   G LVSLSEQ LVDC        +   ++GCNGGLM++AF YI   
Sbjct: 147 FSSTAALEGQHFRKAGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDN 199

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
           GG++ EK YPY G D  SC F KS + A  + F  I   DE+ +   +   GP++    +
Sbjct: 200 GGIDTEKSYPYEGID-DSCHFTKSGVGATDTGFVDIPQGDEEALMKAVATMGPVS---VA 255

Query: 283 IELPHISFSF 292
           I+  H SF  
Sbjct: 256 IDASHESFQL 265


>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
           occidentalis]
          Length = 469

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 109/280 (38%), Positives = 145/280 (51%), Gaps = 40/280 (14%)

Query: 5   ILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTY 64
           +L+  + L ++SVLA  VAV  D                  L N EH    FK  F KTY
Sbjct: 140 VLTIEMRLYIASVLALVVAVGAD------------------LTNFEH----FKEHFGKTY 177

Query: 65  ATQEEHDYRFRVFKANLRRAKR---RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
              +EH  R  +F+ NL   ++    +        G+T+F+D++ +EFR+ +LGL     
Sbjct: 178 EG-DEHALRQGIFQRNLAHIEKFNAEKAASRGYTLGITQFADMSTAEFRQTYLGLRMNAS 236

Query: 122 LPADA---QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
             A     Q+  +    DLP   DWRD GAV+ VKDQG CGSCW+FS +GA+EG HFL  
Sbjct: 237 TIAKLRKLQREVVADDRDLPEAVDWRDKGAVSPVKDQGQCGSCWAFSTSGAIEGQHFLKN 296

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GEL+SLSEQQ+VDC            D GCNGG    A EY+   GG+E E  YPY G  
Sbjct: 297 GELLSLSEQQMVDCSW---------LDFGCNGGQPMLAMEYVRFNGGLELETAYPYKGV- 346

Query: 239 GGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLA 277
           GGSC  DK   AA ++ F +     E  +   + K GP++
Sbjct: 347 GGSCHSDKKSAAAKITGFWMAGFYSESALQKAVAKVGPIS 386


>gi|380025691|ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12163-like [Apis florea]
          Length = 881

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 107/260 (41%), Positives = 151/260 (58%), Gaps = 17/260 (6%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLT 105
           +  E  F  F  KF+KT+++  E   RF++FK NL+  K  Q  +  TA +GVT F+DLT
Sbjct: 570 IKYETLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIIKELQTFEQGTAEYGVTMFADLT 629

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           P EF+ ++LG    L+   +   A I  ++  LP  FDWRD+ AVT VKDQG CGSCW+F
Sbjct: 630 PKEFKTRYLGFRPELKQENEIPLAKIEVSDIFLPPKFDWRDYNAVTPVKDQGLCGSCWAF 689

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           S TG +EG + +   +L+SLSEQ+L+DCD         + D GCNGG M +A++ I K G
Sbjct: 690 SVTGNVEGQYAIKYKKLLSLSEQELLDCD---------TLDEGCNGGYMENAYKAIEKLG 740

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA----GNV 280
           G+E E DYPY G +   C F K      V     I+S+E +MA  L+K+GP++     N 
Sbjct: 741 GLELESDYPYDGRN-EKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANA 799

Query: 281 ASIELPHISFSFLFTVSSPK 300
               +  +S  F F + +PK
Sbjct: 800 MQFYIGGVSHPFHF-LCNPK 818


>gi|56755191|gb|AAW25775.1| SJCHGC00511 protein [Schistosoma japonicum]
          Length = 454

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 96/242 (39%), Positives = 145/242 (59%), Gaps = 19/242 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           N    ++ FK  + K Y  + +++ RF +FK+NL +A+  Q+L+  +AV+GVT +SDLT 
Sbjct: 152 NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTT 210

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            EF R  L    R    A +++  I P     D+P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 211 DEFSRTHLTAPWR----ASSKRNTISPRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWA 266

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG +E   F  TG+L+SLSEQQLVDCD         S D GCNGGL ++A+E I++ 
Sbjct: 267 FSTTGNIESQWFRKTGKLLSLSEQQLVDCD---------SLDDGCNGGLPSNAYESIIRM 317

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG+  E +YPY   +   C    + +AA +++   ++ DE ++A  L  H  ++  + ++
Sbjct: 318 GGLMLEDNYPYDAKN-EKCHLKVANVAAYINSSVNLTQDESELAIWLYHHSAISVGMNAL 376

Query: 284 EL 285
            L
Sbjct: 377 LL 378


>gi|343470378|emb|CCD16903.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 95/236 (40%), Positives = 133/236 (56%), Gaps = 20/236 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFR+FK ++ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 97

Query: 110 RRQFLGLNRR----LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           R  +L   +     L+ P   +K   + T   P   DWR  GAVT VKDQG CGSCW+FS
Sbjct: 98  RATYLNGAKYYAAALKRP---RKVVNVSTGKAPPAIDWRKKGAVTPVKDQGKCGSCWAFS 154

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
           A G +EG   ++  EL SLSEQ LV CD+          D GC GG ++ A ++I+ +  
Sbjct: 155 AIGNIEGQWKVAGHELTSLSEQMLVSCDNM---------DYGCRGGFLDRALKWIVSSNK 205

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G V  E+ YPY  TDG     +KS   + A +S    +  DE+ +A  L K+GP+A
Sbjct: 206 GNVFTEESYPYDSTDGDVPPCNKSGKVVGAKISGLINLPKDENAIAEWLAKNGPIA 261


>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
          Length = 477

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 100/259 (38%), Positives = 150/259 (57%), Gaps = 18/259 (6%)

Query: 25  NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
           +DD   ++++  +   +  D+++   + F  F  +  K Y+ + E   RFR FK N +  
Sbjct: 148 HDDSITVQELRKAKIIRPRDYVI--WNSFLDFIDRHEKRYSNKREVLKRFRTFKKNAKVI 205

Query: 85  KRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRL----PADAQKAPI-LPTNDLP 138
           +  Q  +  +AV+G TKFSD+T  EF++  L       +     AD +K  + +  +DLP
Sbjct: 206 RELQKNEQGSAVYGFTKFSDMTTMEFKQTMLPYQWEQPVYPMAEADFEKEGVTISEDDLP 265

Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
             FDWRDHGAVT VK+QG CGSCW+FS TG +EGA +L+  +LVSLSEQ+LVDCD     
Sbjct: 266 DSFDWRDHGAVTQVKNQGNCGSCWAFSTTGNVEGAWYLAKKKLVSLSEQELVDCD----- 320

Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
               S D GCNGGL ++A++ I++ GG+E E  YPY G  G +C   +  IA  ++    
Sbjct: 321 ----SVDQGCNGGLPSNAYKEIMRMGGLEPEDAYPYDGK-GETCHIVRKDIAVYINGSVE 375

Query: 259 ISSDEDQMAANLVKHGPLA 277
           +  DE ++   LV  GP++
Sbjct: 376 LPHDEVKIQKWLVTKGPIS 394


>gi|343471272|emb|CCD16264.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 447

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 95/236 (40%), Positives = 131/236 (55%), Gaps = 20/236 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFR+FK ++ RAK     +P A  GVT+FSD++P E 
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEL 97

Query: 110 RRQFLGLNRR----LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           R  +L   +     L+ P   +K   + T   P   DWR  GAVT VKDQ  CGSCW+FS
Sbjct: 98  RATYLNGAKYYAAALKRP---RKVVNVSTGKAPPAVDWRKKGAVTPVKDQRKCGSCWAFS 154

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
           ATG +EG   ++  EL SLSEQ LV CD+          D GC GGLM+ A ++I+ +  
Sbjct: 155 ATGNIEGQWKVAGHELTSLSEQMLVSCDNM---------DDGCQGGLMDRALKWIVSSNK 205

Query: 224 GGVEREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G V  E+ YPY  TDG    C      + A +S    +  DE+ +A  L K+GP+A
Sbjct: 206 GNVFTEESYPYDSTDGDVPPCNMSGKVVGAKISGHINLPKDENAIAEWLAKNGPVA 261


>gi|408009|gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense]
          Length = 444

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 96/233 (41%), Positives = 128/233 (54%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQG CGSCW+FSA G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD           D GC GGLM+ AF++I+ +  G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDTN---------DFGCEGGLMDDAFKWIVSSNKGNV 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY    G     DKS   + A + +   +  DE+ +A  L K+GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDHVDLPEDENAIAEWLAKNGPVA 261


>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
 gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
          Length = 1834

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 99/259 (38%), Positives = 141/259 (54%), Gaps = 35/259 (13%)

Query: 26   DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
            DDDA +R++                  F  F+    + YA+  EH+ RF +F+ NL + +
Sbjct: 1515 DDDAHVRRM------------------FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIE 1556

Query: 86   RRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD------AQKAPILPTNDLP 138
            +    +  TA +GVTKF+D+T +E+R    GL       A+      A +  +    DLP
Sbjct: 1557 QLNKFERGTAKYGVTKFADMTVAEYR-AHTGLVVPKHDRANHVGNRVASEEDVAGVGDLP 1615

Query: 139  TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
              FDWRDHGAVT VK+QG+CGSCW+FSA G +EG H + T +L S SEQ+L+DCD     
Sbjct: 1616 RSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCD----- 1670

Query: 199  EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
                  D+GC GG M+ AF+ I + GG+E E DYPY      SC F++S     V     
Sbjct: 1671 ----KVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVD 1726

Query: 259  ISSDEDQMAANLVKHGPLA 277
            +  +E  +A  L+K+GP+A
Sbjct: 1727 MPKNETYIAKYLIKNGPIA 1745


>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
 gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
          Length = 953

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 99/259 (38%), Positives = 141/259 (54%), Gaps = 35/259 (13%)

Query: 26  DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
           DDDA +R++                  F  F+    + YA+  EH+ RF +F+ NL + +
Sbjct: 634 DDDAHVRRM------------------FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIE 675

Query: 86  RRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD------AQKAPILPTNDLP 138
           +    +  TA +GVTKF+D+T +E+R    GL       A+      A +  +    DLP
Sbjct: 676 QLNKFERGTAKYGVTKFADMTVAEYRAH-TGLVVPKHDRANHVGNRVASEEDVAGVGDLP 734

Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
             FDWRDHGAVT VK+QG+CGSCW+FSA G +EG H + T +L S SEQ+L+DCD     
Sbjct: 735 RSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCD----- 789

Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
                 D+GC GG M+ AF+ I + GG+E E DYPY      SC F++S     V     
Sbjct: 790 ----KVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVD 845

Query: 259 ISSDEDQMAANLVKHGPLA 277
           +  +E  +A  L+K+GP+A
Sbjct: 846 MPKNETYIAKYLIKNGPIA 864


>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
          Length = 325

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 96/247 (38%), Positives = 140/247 (56%), Gaps = 16/247 (6%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           NA   +  FK  + K YA +++   RF +FK NL RA++ Q  +  TA +GVT+FSDLT 
Sbjct: 27  NARELYEQFKRDYGKAYANEDDQK-RFAIFKDNLVRAQQYQTQEQGTAKYGVTQFSDLTN 85

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            EF   +LG     R+     +  +      P   DWR+ GAV  V+ QG+CGSCW+FS 
Sbjct: 86  EEFAAMYLGS----RIDERVDRVQLNDLQTAPASVDWREKGAVGPVEHQGSCGSCWAFSV 141

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           T  +EG  FL TG LVSLS+QQLVDCD           D GC+GG     ++ I + GG+
Sbjct: 142 TANVEGQWFLKTGRLVSLSKQQLVDCDR---------LDHGCSGGYPPYTYKEIKRMGGL 192

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELP 286
           E +  YPYTG +  +C+ D+SK+ A + +  V+  +E++ AA L +HGP++  + +  L 
Sbjct: 193 ELQSAYPYTGWE-QACRLDRSKLFAKIDDSIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQ 251

Query: 287 HISFSFL 293
              +  L
Sbjct: 252 FYRYGIL 258


>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
 gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
          Length = 1810

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 99/259 (38%), Positives = 141/259 (54%), Gaps = 35/259 (13%)

Query: 26   DDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
            DDDA +R++                  F  F+    + YA+  EH+ RF +F+ NL + +
Sbjct: 1491 DDDAHVRRM------------------FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIE 1532

Query: 86   RRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD------AQKAPILPTNDLP 138
            +    +  TA +GVTKF+D+T +E+R    GL       A+      A +  +    DLP
Sbjct: 1533 QLNKFERGTAKYGVTKFADMTVAEYR-AHTGLVVPKHDRANHVGNRVASEEDVAGVGDLP 1591

Query: 139  TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
              FDWRDHGAVT VK+QG+CGSCW+FSA G +EG H + T +L S SEQ+L+DCD     
Sbjct: 1592 RSFDWRDHGAVTEVKNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCD----- 1646

Query: 199  EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSV 258
                  D+GC GG M+ AF+ I + GG+E E DYPY      SC F++S     V     
Sbjct: 1647 ----KVDNGCGGGYMDDAFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVD 1702

Query: 259  ISSDEDQMAANLVKHGPLA 277
            +  +E  +A  L+K+GP+A
Sbjct: 1703 MPKNETYIAKYLIKNGPIA 1721


>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
 gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
          Length = 299

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 90/225 (40%), Positives = 129/225 (57%), Gaps = 14/225 (6%)

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
           K  K+Y    E D RF +FK NL+       L+ T   G+T+F+DLT  E+R +FLG   
Sbjct: 61  KHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKI 120

Query: 117 --NRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
             NRR++    ++     P   + LP   DWR  GAV GVKDQ +CGSCW+FSA  A+EG
Sbjct: 121 DPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEG 180

Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
            + + TG+L+SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG++ E DY
Sbjct: 181 INKIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIISNGGIDSEDDY 232

Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           PY   DG   +  K+     + ++  + + ++      V + P+A
Sbjct: 233 PYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIA 277


>gi|343472324|emb|CCD15484.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 93/233 (39%), Positives = 130/233 (55%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQG CGSCW+FSA G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVTVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV     CDP E       C GG M++AF +I+ +  G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQTLVS----CDPTE-----YACEGGFMDNAFRWIISSNKGKV 208

Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY+  G +  +C      + A +S++  +  DE+ +A  L K+GP++
Sbjct: 209 FTEQSYPYSSGGRNVPACNMSGKVVGANISDYVDLPQDENAIAEWLAKNGPVS 261


>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
 gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 96/249 (38%), Positives = 144/249 (57%), Gaps = 23/249 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           ++ + +K SKTY    E + RF +FK NLR   +     + T   G+T+F+DLT  E+R 
Sbjct: 48  YNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRA 107

Query: 112 QFLGLN----RRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           +FLG      RRL    + +Q+      + LP   DWR  GAV+ +KDQG+CGSCW+FS 
Sbjct: 108 KFLGTKSDPKRRLMKSKNPSQRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFST 167

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
             A+EG + + TGEL+SLSEQ+LVDCD         S ++GCNGGLM++AF++I+  GG+
Sbjct: 168 IAAVEGVNKIVTGELISLSEQELVDCDR--------SYNAGCNGGLMDNAFQFIINNGGI 219

Query: 227 EREKDYPYTGTDGGSCKFDKSKI---AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           + +KDYPY   DG   K D +K+   A  +  F  + + ++      V H P++    +I
Sbjct: 220 DTDKDYPYQAVDG---KCDTTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQPVS---VAI 273

Query: 284 ELPHISFSF 292
           E   ++  F
Sbjct: 274 EASGMALQF 282


>gi|343477445|emb|CCD11724.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
          Length = 380

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 96/233 (41%), Positives = 128/233 (54%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQG CGSCW+FSA G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD           D GC GGLM+ AF++I+ +  G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDTN---------DFGCEGGLMDDAFKWIVSSNKGNV 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY    G     DKS   + A + +   +  DE+ +A  L K+GP+A
Sbjct: 209 FTEQSYPYASGGGNVPACDKSGKVVGAKIRDHVDLPEDENAIAEWLAKNGPVA 261


>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
          Length = 461

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 92/239 (38%), Positives = 134/239 (56%), Gaps = 15/239 (6%)

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--L 116
           K  K+Y    E + RF++FK NLR          T   G+ +F+DLT  E+R  +LG   
Sbjct: 52  KHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAESRTYKVGLNRFADLTNDEYRSMYLGART 111

Query: 117 NRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
             R RL    +    +P     LP   DWR+ GAV GVKDQG+CGSCW+FS   A+EG +
Sbjct: 112 GSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVVGVKDQGSCGSCWAFSTIAAVEGIN 171

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
            + TG+L+SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+K GG++ E+DYPY
Sbjct: 172 QIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPY 223

Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFL 293
              DG   ++ K+     + ++  +  + +Q     V + P++    +IE   ++F F 
Sbjct: 224 NARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVS---VAIEASGMAFQFY 279


>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 447

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 93/233 (39%), Positives = 127/233 (54%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQG CGSCW+FSA G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVTVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG--GV 226
            +EG   ++   L SLSEQ LV CD E         D GC GGLM++AF++I+ +    V
Sbjct: 158 NIEGQWKVTGHNLTSLSEQMLVSCDTE---------DLGCAGGLMDNAFKWIVSSNRHNV 208

Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY    G    C+     + A + +   +  DE+ +A  L K+GP+A
Sbjct: 209 FTEESYPYASKGGNVPPCRMSGKVVGAKIRDHVDLPKDENAIAEWLAKNGPVA 261


>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
           Precursor
 gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 362

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 92/222 (41%), Positives = 134/222 (60%), Gaps = 15/222 (6%)

Query: 62  KTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
           K Y    E + RF++FK NL+   +   + D T   G+T+F+DLT  EFR  +L   +++
Sbjct: 53  KNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYL--RKKM 110

Query: 121 RLPADAQKAP--ILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
               D+ K    +    D LP + DWR +GAV  VKDQG CGSCW+FSA GA+EG + ++
Sbjct: 111 ERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQIT 170

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
           TGEL+SLSEQ+LVDCD        G  ++GC+GG+MN AFE+I+K GG+E ++DYPY   
Sbjct: 171 TGELISLSEQELVDCDR-------GFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223

Query: 238 DGGSCKFDKSKIAAAVS--NFSVISSDEDQMAANLVKHGPLA 277
           D G C  DK+     V+   +  +  D+++     V H P++
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVS 265


>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
          Length = 362

 Score =  171 bits (433), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 92/222 (41%), Positives = 134/222 (60%), Gaps = 15/222 (6%)

Query: 62  KTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
           K Y    E + RF++FK NL+   +   + D T   G+T+F+DLT  EFR  +L   +++
Sbjct: 53  KNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYL--RKKM 110

Query: 121 RLPADAQKAP--ILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
               D+ K    +    D LP + DWR +GAV  VKDQG CGSCW+FSA GA+EG + ++
Sbjct: 111 ERNKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQIT 170

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
           TGEL+SLSEQ+LVDCD        G  ++GC+GG+MN AFE+I+K GG+E ++DYPY   
Sbjct: 171 TGELISLSEQELVDCDR-------GFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223

Query: 238 DGGSCKFDKSKIAAAVS--NFSVISSDEDQMAANLVKHGPLA 277
           D G C  DK+     V+   +  +  D+++     V H P++
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVS 265


>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
          Length = 1165

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 100/253 (39%), Positives = 141/253 (55%), Gaps = 29/253 (11%)

Query: 37   SDGE----QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP 92
            SDGE    + EDH   A H F  FK K S+ Y +  EH+ RFR+FK NL + ++    + 
Sbjct: 841  SDGEGHYSKGEDH---ARHLFEKFKLKHSREYQSTLEHEMRFRIFKNNLFKIEQLNKYEQ 897

Query: 93   -TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ-------KAPILPTNDLPTDFDWR 144
             TA +G+T F+D+T +E+R Q  GL     +P D         KA I    +LP  FDWR
Sbjct: 898  GTAKYGITHFADMTSAEYR-QRTGL----VIPRDEDRNHVGNPKAEIDENMELPESFDWR 952

Query: 145  DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
            + GAV+ VK+QG CGSCW+FS  G +EG H + T  L   SEQ+L+DCD         + 
Sbjct: 953  ELGAVSPVKNQGNCGSCWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCD---------AV 1003

Query: 205  DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDED 264
            DS C GG M+ A++ I K GG+E E +YPY      +C F+ +++   V     +  +E 
Sbjct: 1004 DSACQGGYMDDAYKAIEKIGGLELESEYPYLAKKQKTCHFNSTEVHVRVKGAVDLPKNET 1063

Query: 265  QMAANLVKHGPLA 277
             MA  LV +GP++
Sbjct: 1064 AMAQYLVANGPIS 1076


>gi|52546916|gb|AAU81591.1| cysteine proteinase, partial [Petunia x hybrida]
          Length = 190

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 82/120 (68%), Positives = 98/120 (81%), Gaps = 4/120 (3%)

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
           ELVSLSEQQLVDCDHECDPEE  SCDSGCNGGLMNSAFEY LKAGG+ RE+DYPYTGTD 
Sbjct: 3   ELVSLSEQQLVDCDHECDPEEKDSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDR 62

Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
             CKFD +K+AA V+NFSV+S DE+Q+AANLVK+GPLA  + ++ +     +++  VS P
Sbjct: 63  AKCKFDNTKVAAKVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQ----TYVGGVSCP 118


>gi|21218381|gb|AAM44058.1|AF510740_1 cathepsin L1 [Schistosoma japonicum]
          Length = 317

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 95/234 (40%), Positives = 140/234 (59%), Gaps = 19/234 (8%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
           N    ++ FK  + K Y  + +++ RF +FK+NL +A+  Q+L+  +AV+GVT +SDLT 
Sbjct: 15  NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTT 73

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            EF R  L    R    A +++  I P     D+P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 74  DEFSRTHLTAPWR----ASSKRNTIPPRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWA 129

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG +E   F  TG+L+SLSEQQLVDCD         S D GCNGGL ++A+E I++ 
Sbjct: 130 FSTTGNIESQWFRKTGKLLSLSEQQLVDCD---------SLDDGCNGGLPSNAYESIIRM 180

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG+  E +YPY   +   C      +AA +++   ++ DE ++A  L  H  ++
Sbjct: 181 GGLMLEDNYPYDAKN-EKCHLKVGNVAAYINSSVNLTQDESELAIWLYHHSAIS 233


>gi|238683695|gb|ACR54126.1| cathepsin L [Palaemonetes varians]
          Length = 248

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 102/247 (41%), Positives = 134/247 (54%), Gaps = 20/247 (8%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTKFSDL 104
           A   +  FK    K Y+  +E  YR  +F+ NLR  +    R    + T    + +F D+
Sbjct: 15  ASESWDSFKLTHGKAYSNAKEELYRKTIFENNLRFVEEHNARFHNGEVTFNVAMNRFGDM 74

Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           T  EF  Q  GL +        Q     P      D DWR  GAVTGVKDQG CGSCWSF
Sbjct: 75  TTEEFVAQMTGLTKLEDTVG--QVFAHFPDAPRAADVDWRSKGAVTGVKDQGQCGSCWSF 132

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           SATGALEGAHF+ TG L SLSEQQLVDC  E         +SGCNGG++  A++Y+   G
Sbjct: 133 SATGALEGAHFIKTGSLPSLSEQQLVDCSTE---------NSGCNGGVVQWAYDYLKSCG 183

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASI 283
           G + E  YPY   D  +C+FD SK+AA V  ++ I  +DE   A+ +   GP++     +
Sbjct: 184 GSQTESSYPYEAAD-RTCRFDSSKVAATVRGYTNIPYADEQTQASAVHDKGPVS---VCV 239

Query: 284 ELPHISF 290
           +  H+SF
Sbjct: 240 DAGHLSF 246


>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
 gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
          Length = 477

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 101/268 (37%), Positives = 148/268 (55%), Gaps = 36/268 (13%)

Query: 25  NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA 84
           +DD   ++++  +   +  D+++   + F  F  +  K Y  + E   RFRVFK N +  
Sbjct: 148 HDDSITVQELRKAKIIRPRDYVI--WNSFLDFVDRHEKKYTNKREVLKRFRVFKKNAKVI 205

Query: 85  KRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN-------- 135
           +  Q  +  TAV+G TKFSD+T  EF++        + LP   ++ P+ P          
Sbjct: 206 RELQKNEQGTAVYGFTKFSDMTTMEFKK--------IMLPYQWEQ-PVYPMEQANFEKHD 256

Query: 136 ------DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
                 DLP  FDWR+ GAVT VK+QG CGSCW+FS TG +EGA F++  +LVSLSEQ+L
Sbjct: 257 VTINEEDLPESFDWREKGAVTQVKNQGNCGSCWAFSTTGNVEGAWFIAKNKLVSLSEQEL 316

Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
           VDCD         S D GCNGGL ++A++ I++ GG+E E  YPY G  G +C   +  I
Sbjct: 317 VDCD---------SMDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGR-GETCHLVRKDI 366

Query: 250 AAAVSNFSVISSDEDQMAANLVKHGPLA 277
           A  ++    +  DE +M   LV  GP++
Sbjct: 367 AVYINGSVELPHDEVEMQKWLVTKGPIS 394


>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 333

 Score =  171 bits (432), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 98/249 (39%), Positives = 134/249 (53%), Gaps = 19/249 (7%)

Query: 51  HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTP 106
            H+ L+K   +K Y+  EEH  R   ++ NL++ +   L     VH    G+ K++D+T 
Sbjct: 26  QHWKLWKEANNKRYSDAEEH-VRRATWEGNLQKVQEHNLQADLGVHTYWLGMNKYADMTV 84

Query: 107 SEFRRQFLGLNRRLR--LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +EF +   G N  +R     D           LP   DWRD G VT VKDQG CGSCW+F
Sbjct: 85  TEFVKVMNGYNATMRGQRTQDRHTFSFNSKIALPDTVDWRDKGYVTDVKDQGQCGSCWAF 144

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           S TGALEG HF  TG+LVSLSEQ LVDC  +         + GCNGGLM+ AFEYI +  
Sbjct: 145 STTGALEGQHFKQTGKLVSLSEQNLVDCSGK-------QGNMGCNGGLMDQAFEYIKENN 197

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASI 283
           G++ E  YPY   D   C+F  + + A  + F+ I+S DE  +   +   GP++    +I
Sbjct: 198 GIDTEDSYPYEAVD-NQCRFKAANVGATDTGFTDITSKDESALQQAVATVGPIS---VAI 253

Query: 284 ELPHISFSF 292
           +  H SF  
Sbjct: 254 DAGHTSFQL 262


>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
 gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
          Length = 341

 Score =  171 bits (432), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 107/252 (42%), Positives = 137/252 (54%), Gaps = 23/252 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           +  FK +  K Y    E  +R ++F  N  + AK  Q      V     V K++DL   E
Sbjct: 29  WHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADLLHHE 88

Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
           FR+   G N    ++LR   D+ K    I P +  LP   DWR  GAVT VKDQG CGSC
Sbjct: 89  FRQLMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 148

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS+TGALEG HF  +G LVSLSEQ LVDC        +   ++GCNGGLM++AF YI 
Sbjct: 149 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 201

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
             GG++ EK YPY   D  SC F+K  I A    F+ I   DE +MA  +   GP+A   
Sbjct: 202 DNGGIDTEKSYPYEAID-DSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPVA--- 257

Query: 281 ASIELPHISFSF 292
            +I+  H SF F
Sbjct: 258 VAIDASHESFQF 269


>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
 gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
          Length = 341

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 106/252 (42%), Positives = 137/252 (54%), Gaps = 23/252 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           +  FK +  K Y    E  +R ++F  N  + AK  Q      V     V K++DL   E
Sbjct: 29  WHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 88

Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
           FR+   G N    ++LR   D+ K    I P +  LP   DWR  GAVT VKDQG CGSC
Sbjct: 89  FRQLMNGFNYTLHKQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQGHCGSC 148

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS+TGALEG HF  +G LVSLSEQ LVDC        +   ++GCNGGLM++AF YI 
Sbjct: 149 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 201

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
             GG++ EK YPY   D  SC F+K  I A    F+ I   DE +MA  +   GP++   
Sbjct: 202 DNGGIDTEKSYPYEAID-DSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVS--- 257

Query: 281 ASIELPHISFSF 292
            +I+  H SF F
Sbjct: 258 VAIDASHESFQF 269


>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 454

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 88/226 (38%), Positives = 129/226 (57%), Gaps = 8/226 (3%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F  +  K  K Y++ EEH +R+ V+K NL   +R    + +   G+TKF+D+T  EFRR
Sbjct: 45  QFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEKNRSYWLGLTKFADITNDEFRR 104

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
           Q+ G        +  +       ++ P   DWR  GAVT VKDQG+CGSCW+FSA G++E
Sbjct: 105 QYTGTRIDRSKRSKRKTGFRYADSEAPESVDWRKKGAVTTVKDQGSCGSCWAFSAIGSVE 164

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           G + + TGE VSLSEQ+LVDCD E         + GCNGGLM+ AF++IL+ GG++ E D
Sbjct: 165 GINAIRTGEAVSLSEQELVDCDLE--------YNQGCNGGLMDYAFDFILENGGIDTEND 216

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY G DG      K+     +  +  +  ++++     V   P++
Sbjct: 217 YPYKGLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVS 262


>gi|343412631|emb|CCD21595.1| hypothetical protein, conserved in T. vivax [Trypanosoma vivax
           Y486]
          Length = 257

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 96/233 (41%), Positives = 125/233 (53%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           E  F+ FK K+ ++Y T  E  +R RVF+ N+RR++     +P A  GVT FSDLTP EF
Sbjct: 11  EPLFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYAAANPHATFGVTPFSDLTPEEF 70

Query: 110 RRQFLGLNRRLRLPADAQKAPI-LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R ++    R         +  + +P    P   DWR  GAVT VKDQG+CGSCWSFSA G
Sbjct: 71  RTRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIG 130

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG    +   L SLSEQ LV CD +         D GC GG M++AF  I+K   G  
Sbjct: 131 NIEGQWAAAGNPLTSLSEQMLVSCDTK---------DKGCGGGFMDNAFYSIVKENIGKE 181

Query: 227 EREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             EK YPY   G +   CK    K+ A ++    I  DED +A  L  +GP+A
Sbjct: 182 YTEKSYPYVSGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVA 234


>gi|23397070|gb|AAN31820.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
          Length = 358

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 110/280 (39%), Positives = 152/280 (54%), Gaps = 22/280 (7%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSL 55
           + ILSS++L++L +  A+A    D+   IR V  SDG    E+S   +L    H   F+ 
Sbjct: 4   KTILSSVVLVVLFAASAAANIGFDESNPIRMV--SDGLREVEESVSQILGQSRHVLSFAR 61

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
           F  ++ K Y   EE   RF +FK NL   +       +   GV +F+DLT  EF+R  LG
Sbjct: 62  FTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLG 121

Query: 116 LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
             +     A  + +  +    LP   DWR+ G V+ VKDQG CGSCW+FS TGALE A+ 
Sbjct: 122 AAQNC--SATLKGSHKVTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYH 179

Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
            + G+ +SLSEQQLVDC    +       + GCNGGL + AFEYI   GG++ EK YPYT
Sbjct: 180 QAFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYT 232

Query: 236 GTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
           G D  +CKF    +   V    N ++ + DE + A  LV+
Sbjct: 233 GKD-ETCKFSAENVGVQVLNSVNITLGAEDELKHAVGLVR 271


>gi|226468424|emb|CAX69889.1| Temporarily Assigned Gene name [Schistosoma japonicum]
          Length = 454

 Score =  170 bits (431), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 95/242 (39%), Positives = 144/242 (59%), Gaps = 19/242 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           N    ++ FK  + K Y  + +++ RF +FK+NL +A+  Q+L+  +AV+GVT +SDLT 
Sbjct: 152 NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTT 210

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            EF R  L    R    A +++  I P     D+P +FDWR  GAVT VK+QG CGSCW+
Sbjct: 211 DEFSRTHLTAPWR----ASSKRNTISPRREVGDIPNNFDWRKKGAVTEVKNQGMCGSCWA 266

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG +E   F  TG+L+SLSEQQLVDCD         + D GCNGGL ++A+E I++ 
Sbjct: 267 FSTTGNIESQWFRKTGKLLSLSEQQLVDCD---------NLDDGCNGGLPSNAYESIIRM 317

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG+  E +YPY   +   C    + +AA +++   ++ DE ++A  L  H  ++  + ++
Sbjct: 318 GGLMLEDNYPYDAKN-EKCHLKVANVAAYINSSVNLTQDESELAIWLYHHSAISVGMNAL 376

Query: 284 EL 285
            L
Sbjct: 377 LL 378


>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
          Length = 294

 Score =  170 bits (430), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 98/228 (42%), Positives = 136/228 (59%), Gaps = 19/228 (8%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRR 111
           FKS +SK+Y ++     R   F+ANL    +        +H    GV +F+DLT  EF  
Sbjct: 1   FKSDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMA 60

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
            ++       +P +    P    + +    DWR  GAVT +K+QG CGSCWSFS TG+ E
Sbjct: 61  LYVPSKFNRTMPYNTVYLPATSEDSV----DWRTKGAVTPIKNQGQCGSCWSFSTTGSTE 116

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVEREK 230
           GAH ++TG LVSLSEQQLVDC        SGS  + GCNGGLM+ AF+YI+   G++ E+
Sbjct: 117 GAHAIATGNLVSLSEQQLVDC--------SGSFGNQGCNGGLMDDAFKYIISNKGLDTEE 168

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLA 277
           DYPYT  DG   K  ++K AA +S++S V  ++EDQ+AA + K GP++
Sbjct: 169 DYPYTAQDGTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAK-GPVS 215


>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
          Length = 358

 Score =  170 bits (430), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 103/268 (38%), Positives = 148/268 (55%), Gaps = 36/268 (13%)

Query: 46  LLNAEHH--FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
           L+N  ++  ++ FK K +K+Y T++E   RF+VF +N +  ++  +      H     + 
Sbjct: 34  LINHPYYPVWTNFKLKHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLN 93

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPAD---AQKAPI--------LPTN-DLPTDFDWRDHG 147
           KF+D+T +EFR++  G     +LPA    A+  P+        +P N  +P   DWR  G
Sbjct: 94  KFADMTNAEFRQRMNGF----KLPAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEG 149

Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
            VT VKDQG+CGSCW+FSATG+LEG H+  TG+LVSLSEQ LVDCD   D       D G
Sbjct: 150 YVTKVKDQGSCGSCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGD-------DEG 202

Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQM 266
           CNGG M+ AF+Y+    G++ E  YPY G D G C+F    + A  + F  I   +E  +
Sbjct: 203 CNGGYMDGAFQYVETNKGIDTEASYPYKGRD-GRCRFKSEDVGATDTGFVDIPEGNETLL 261

Query: 267 AANLVKHGPLAGNVASIELPHISFSFLF 294
            A +   GP+     S+ +   SF F F
Sbjct: 262 EAAIATVGPV-----SVAIDAASFKFQF 284


>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
 gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
          Length = 341

 Score =  170 bits (430), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 106/252 (42%), Positives = 137/252 (54%), Gaps = 23/252 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           +  FK +  K Y    E  +R ++F  N  + AK  Q      V     V K++DL   E
Sbjct: 29  WHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 88

Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
           FR+   G N    ++LR   ++ K    I P +  LP   DWR  GAVT VKDQG CGSC
Sbjct: 89  FRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 148

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS+TGALEG HF  +G LVSLSEQ LVDC        +   ++GCNGGLM++AF YI 
Sbjct: 149 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 201

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
             GG++ EK YPY   D  SC F+K  I A    F+ I   DE +MA  +   GP+A   
Sbjct: 202 DNGGIDTEKSYPYEAID-DSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVA--- 257

Query: 281 ASIELPHISFSF 292
            +I+  H SF F
Sbjct: 258 VAIDASHESFQF 269


>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
          Length = 322

 Score =  170 bits (430), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 98/230 (42%), Positives = 131/230 (56%), Gaps = 17/230 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV----HGVTKFSDLTPSE 108
           F  FK K  KTY  Q E   RF +FK NLR  ++  +L    +     G+ +F+D+T  E
Sbjct: 25  FQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQEE 84

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           FR  FL L+   + P       +L    +P   DWR  G VTGVKDQG CGSCW+FS TG
Sbjct: 85  FR-AFLTLSSSKK-PHFNTTEHVLTGLAVPDSIDWRTKGQVTGVKDQGNCGSCWAFSVTG 142

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           + E A++   G+LVSLSEQQLVDC        S   ++GCNGG ++  F Y+ K+ G+E 
Sbjct: 143 STEAAYYRKAGKLVSLSEQQLVDC--------STDINAGCNGGYLDETFTYV-KSKGLEA 193

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
           E  YPY GTD GSCK+  SK+   VS   S+ S DE+ +   +   GP++
Sbjct: 194 ESTYPYKGTD-GSCKYSASKVVTKVSGHKSLKSEDENALLDAVGNVGPVS 242


>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
          Length = 394

 Score =  170 bits (430), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 103/280 (36%), Positives = 148/280 (52%), Gaps = 21/280 (7%)

Query: 19  ASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFK 78
           AS  ++ D DA      P    + +DH   ++  F  F+   +K YAT+EE   R+ +FK
Sbjct: 61  ASPSSITDGDAKY----PEKIWEWKDHHFQSQ--FYQFQRDHNKFYATEEERLKRYAIFK 114

Query: 79  ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR-RLRLPADAQKAPI--LPTN 135
            NL       +   + V  + KF DLT  EFR+++LG  +  LR P       +  +  N
Sbjct: 115 NNLTYIHNHNMQGYSYVLKMNKFGDLTLEEFRQRYLGYKKPDLRTPPREVDTTLESVEDN 174

Query: 136 DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
           D+PT  DWR  G VT VKDQG CGSCW+FSATGA+EG +   TG+LV+LS+QQLVDC   
Sbjct: 175 DIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATGAMEGVYCAKTGKLVNLSQQQLVDCSRF 234

Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
                    + GC+GG M  AFEY+++ GG+   ++YPY   D G CK  +    A ++ 
Sbjct: 235 LG-------NQGCDGGRMEEAFEYVVENGGICSGENYPYMRKD-GVCKSSQCTSVATITG 286

Query: 256 F-SVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLF 294
           + SV    E  M   L    P++    +I+    +F F +
Sbjct: 287 YRSVPRRSEKSMKTALALRSPVS---VAIQANQAAFQFYY 323


>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
          Length = 358

 Score =  169 bits (429), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 111/283 (39%), Positives = 156/283 (55%), Gaps = 28/283 (9%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSL 55
           + IL S++L++L +  A+A    D+   IR V  SDG    E+S   +L    H   F+ 
Sbjct: 4   KTILPSVVLVILIAASAAADIGFDESNPIRMV--SDGLREIEESVVQILGQSRHVLSFAR 61

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  ++ K Y   EE   RF +FK NL   R   +++L   +   GV +F+DLT  EF+R 
Sbjct: 62  FTHRYGKKYQNAEEIKLRFSIFKENLDLIRSTNKKRL---SYKLGVNQFADLTWQEFQRN 118

Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
            LG  +     A  + +  L    LP   DWR+ G V+ VKDQG CGSCW+FS TGALE 
Sbjct: 119 KLGAAQNC--SATLKGSHKLTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEA 176

Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
           A+  + G+ +SLSEQQLVDC    +       + GCNGGL + AFEYI   GG++ E+ Y
Sbjct: 177 AYHQAFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEEAY 229

Query: 233 PYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
           PYTG D G+CK+    +   V    N ++ + DE + A  LV+
Sbjct: 230 PYTGKD-GTCKYSAENVGVQVLDSVNITLGAEDELKHAVGLVR 271


>gi|195111686|ref|XP_002000409.1| GI10216 [Drosophila mojavensis]
 gi|193917003|gb|EDW15870.1| GI10216 [Drosophila mojavensis]
          Length = 605

 Score =  169 bits (429), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 94/238 (39%), Positives = 140/238 (58%), Gaps = 19/238 (7%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP---TAVHGVTKFS 102
           L   +H F +F+ K+ + YA   EH  R R+F+ NLR  +  +L D    +A +G+T+F+
Sbjct: 292 LNKVDHLFHVFQIKYKRRYANSMEHQMRLRIFRQNLRTIQ--ELNDNEQGSAKYGITEFA 349

Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGS 160
           D+T SE+  Q  GL +R        K  ++P    +LP +FDWR+  AVT VK+QG+CGS
Sbjct: 350 DMTSSEYT-QRAGLWQRSANKPTGGKPAVVPAYKGELPKEFDWREKNAVTQVKNQGSCGS 408

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS TG +EG + + TGEL   SEQ+L+DCD         S DS CNGGLM++A++ I
Sbjct: 409 CWAFSVTGNIEGLYAIKTGELREFSEQELLDCD---------STDSACNGGLMDNAYKAI 459

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
              GG+E E +YPY       C F+K+     V++F  +   +E  M   L+ +GP++
Sbjct: 460 KDIGGLEYESEYPYLAKK-KQCHFNKTLSHVQVADFVDLPKGNETAMQEWLLANGPIS 516


>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
 gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
          Length = 333

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 100/241 (41%), Positives = 137/241 (56%), Gaps = 17/241 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  +  K  + Y+ +E  D R++ FK N+    +    +   V G+TKF+DLT  E+++ 
Sbjct: 33  FIGWMRKHDRAYSHEEFTD-RYQAFKENMDFIHKWNSQESDTVLGLTKFADLTNEEYKKH 91

Query: 113 FLGL--NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
           +LG+  N +  L A AQK         P   DWR+ GAV+ VKDQG CGSCWSFS TGA+
Sbjct: 92  YLGIKVNVKKNLNA-AQKGLKFFKFTGPDSIDWREKGAVSQVKDQGQCGSCWSFSTTGAV 150

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EGAH + +G +VSLSEQ LVDC  +         + GC GGLM +AFEYI+  GG+  E 
Sbjct: 151 EGAHQIKSGNMVSLSEQNLVDCSGQYG-------NQGCEGGLMVNAFEYIIDNGGIATES 203

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHIS 289
            YPYT    G CKF KS   A +  +  I   +ED + A L K  P++    +I+  H+S
Sbjct: 204 SYPYTAAQ-GRCKFTKSMNGANIIGYKEIPQGEEDSLTAALAKQ-PVS---VAIDASHMS 258

Query: 290 F 290
           F
Sbjct: 259 F 259


>gi|328870281|gb|EGG18656.1| hypothetical protein DFA_04151 [Dictyostelium fasciculatum]
          Length = 347

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 105/289 (36%), Positives = 153/289 (52%), Gaps = 52/289 (17%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           +++LI++ LLL+ L+S   S ++                          E  F  F+ K+
Sbjct: 2   IKKLIVAILLLVALASARTSNLSF------------------------EETQFREFQLKY 37

Query: 61  SKTYATQEEHDY--RFRVFKANLRRAK------RRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           +K Y   E H++  +   FK +L+R +      +R  +D     GV KF+DL+  EF   
Sbjct: 38  NKHY---ESHEFAQKLATFKNSLKRIQELNDMAKRAKVDTE--FGVNKFADLSKEEFANY 92

Query: 113 FLGLNRRLRLPADAQK-APILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L  N+      D++  AP       ++LPT FDWR  GAVT VKDQG CGSCWSFS TG
Sbjct: 93  YL--NKGGMESTDSETYAPDYSDKEISNLPTSFDWRTQGAVTPVKDQGQCGSCWSFSTTG 150

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
            +EG  FL+  +L  LSEQ LVDC  + D         GCNGGLM  A++YI++  G++ 
Sbjct: 151 NVEGQWFLAGNDLTGLSEQNLVDCSTKND---------GCNGGLMPLAYDYIVENNGIDT 201

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           E  YPY      +C+F+ + I A +  +  +SS+E QM  NLV +GPL+
Sbjct: 202 EASYPYLAIQQKNCQFNPANIGAKIDGYYNVSSNETQMQINLVNNGPLS 250


>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
          Length = 336

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 103/253 (40%), Positives = 140/253 (55%), Gaps = 20/253 (7%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           + H++L+K   SK Y  +EE  +R  V++ NL++ +   L      H    G+  F D+T
Sbjct: 25  DEHWNLWKDWHSKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGKHTYSLGMNHFGDMT 83

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
             EFR+   G   +L+     + +  +  N L  P   DWRD G VT VKDQG CGSCW+
Sbjct: 84  HEEFRQIMNGY--KLKSQRKLRGSLFMEPNFLEAPRSVDWRDKGYVTPVKDQGQCGSCWA 141

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TGA+EG HF  TG LVSLSEQ LVDC     PE     + GCNGGLM+ AF+YI   
Sbjct: 142 FSTTGAMEGQHFRKTGTLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDN 194

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
           GG++ E+ YPY GTD G C +D S  +A  + F  V S  E  +   +   GP++    +
Sbjct: 195 GGLDSEESYPYLGTDEGPCHYDPSYNSANDTGFVDVPSGSERALMKAVASVGPVS---VA 251

Query: 283 IELPHISFSFLFT 295
           I+  H SF F  +
Sbjct: 252 IDAGHESFQFYHS 264


>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
          Length = 1785

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 92/234 (39%), Positives = 136/234 (58%), Gaps = 22/234 (9%)

Query: 52   HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
             F  FK    + YA+  EH+ R+ +F+ NL +  +    +  T  +GVTKF+D+T +E+R
Sbjct: 1477 QFEKFKLHHQRQYASSFEHEMRYNIFRNNLYKIDQLNRHERGTGKYGVTKFADMTTAEYR 1536

Query: 111  RQFLGLNRRLRLP---ADAQKAPILPTN----DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
                  +  L +P   ++  + PI   +     LPT FDWRDHGAVTGVK+QG CGSCW+
Sbjct: 1537 -----AHTGLIVPKQHSNHIRNPIATVSTERTSLPTSFDWRDHGAVTGVKNQGNCGSCWA 1591

Query: 164  FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
            FSA G +EG H + T +L + SEQ+L+DCD         + D+GCNGG M+ AF+ I K 
Sbjct: 1592 FSAIGNIEGLHQIKTKKLEAYSEQELIDCD---------TVDNGCNGGYMDDAFKAIEKL 1642

Query: 224  GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            GG+E E +YPY      +C F+K+     V     +  +E  +A  L+++GP+A
Sbjct: 1643 GGLELEDEYPYQAKAQKTCHFNKTLSHVRVKGAVDMPKNETFIAQYLIENGPIA 1696


>gi|300121328|emb|CBK21708.2| unnamed protein product [Blastocystis hominis]
          Length = 318

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 100/229 (43%), Positives = 130/229 (56%), Gaps = 13/229 (5%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDL 104
           +L N+E  F+ + SK+ KTYA  EE  YR RVF  NL + K     +     GV KF+D+
Sbjct: 16  NLRNSE--FTSYMSKYGKTYAAPEEARYRLRVFNDNLLKIKEHNAKNLPWTLGVNKFADV 73

Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +  EF  +F G  +  +     Q   +    D+P   DWR+ GAVT VK+QG CGSCW+F
Sbjct: 74  SAEEFAYKFCGCAKDPKTRGTRQTTLV---GDVPARVDWREQGAVTPVKNQGMCGSCWAF 130

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           S TG  EGA+FL TG LVSLSEQQLVDC    DPE     + GC+GG   SA +Y+ K  
Sbjct: 131 STTGTTEGAYFLKTGNLVSLSEQQLVDCAR--DPEYE---NFGCSGGWPWSAVDYVTKH- 184

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAA-AVSNFSVISSDEDQMAANLVK 272
           G+  E+DYPY G D   CK    K+A  +V    +   DED +A  + K
Sbjct: 185 GLCTEEDYPYKGVD-AECKESSCKVAVQSVDKVQLPVGDEDSLAVAVSK 232


>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 474

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 96/239 (40%), Positives = 146/239 (61%), Gaps = 18/239 (7%)

Query: 8   SLLLLLLSSVLASAVAVNDDDAMIR--QVVPSDG-EQSEDHLLNAEHHFSLFKSKFSKTY 64
           S+L   L +V+++A A  +D ++I   Q  P+ G  +SED +   +  F  +  K  K+Y
Sbjct: 5   SILFTFLFAVVSAAAAAAEDMSIITYDQQHPAKGLVRSEDEV---KEMFESWLVKHGKSY 61

Query: 65  ATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRRQFLGLNR---RL 120
              +E D RF++F+ NL+    +  L+  +   G+ +F+D+T  E+R  +LG  R   R 
Sbjct: 62  NAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADITNEEYRTGYLGAKRDASRN 121

Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
            + + + +   +  + LP   DWR+ GAVTGVKDQG+CGSCW+FS   A+EG + L+TG 
Sbjct: 122 MVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGSCWAFSTIAAVEGVNQLATGN 181

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
           L+SLSEQ+LVDCD +         + GCNGG M  AF++I+K GG++ E+DYPYTG DG
Sbjct: 182 LISLSEQELVDCDRK--------INQGCNGGDMGYAFQFIIKNGGIDSEEDYPYTGKDG 232


>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
          Length = 433

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 97/274 (35%), Positives = 152/274 (55%), Gaps = 20/274 (7%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           +L L    ++SAV ++      +  V + G +SE  +++    + L K   +++  +  E
Sbjct: 10  ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAW-LVKHGKAQSQNSLVE 68

Query: 70  HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------RRLRLP 123
            D RF +FK NLR        + +   G+T+F+DLT  E+R ++LG        RR  L 
Sbjct: 69  KDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128

Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
            +A+       ++LP   DWR  GAV  VKDQG CGSCW+FS  GA+EG + + TG+L++
Sbjct: 129 YEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183

Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
           LSEQ+LVDCD         S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG   +
Sbjct: 184 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQ 235

Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             K+     + ++  + +  ++     V H P++
Sbjct: 236 IRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPIS 269


>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 325

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 96/247 (38%), Positives = 137/247 (55%), Gaps = 15/247 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F+ +K+  ++ YA+ +E   R  ++ +NL            +   G+ +F DL   EF  
Sbjct: 21  FAEWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAA 80

Query: 112 QFLGLN-RRLRLPADAQKAPILP-TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
           ++LG+    +        +  LP    LP   DWR  G VT VK+QG CGSCWSFS TG+
Sbjct: 81  KYLGVRFNGVNATKSFASSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGS 140

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG H   TG LVSLSEQ LVDC  +   E       GCNGGLM+ AFEYI+K GG++ E
Sbjct: 141 VEGQHARKTGTLVSLSEQNLVDCSSQEGNE-------GCNGGLMDDAFEYIIKNGGIDTE 193

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHI 288
             YPYT T  G+CKF+ + I A V+++  +I+  E  +   +   GP++    +I+  HI
Sbjct: 194 ASYPYTATT-GTCKFNAANIGATVASYQDIITGSESDLQNAVATVGPVS---VAIDASHI 249

Query: 289 SFSFLFT 295
           +F F FT
Sbjct: 250 NFQFYFT 256


>gi|228244|prf||1801240B Cys protease 2
          Length = 323

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 101/253 (39%), Positives = 132/253 (52%), Gaps = 17/253 (6%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
           L  A   +  FK K+ + Y   EE  YR  +F+ N +      K+ +  + T    + KF
Sbjct: 13  LAAASPSWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKF 72

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
            D+T  EF     G   R   P      P   T    T+ DWR  GAVT VKDQG CGSC
Sbjct: 73  GDMTLEEFNAVMKGNIPRRSAPVSV-FYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSC 131

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS TG+LEG HFL TG L+SL+EQQLVDC     P+       GCNGG MN AF+YI 
Sbjct: 132 WAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQ-------GCNGGWMNDAFDYIK 184

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNV 280
              G++ E  YPY   D GSC+FD + +AA  S  + I+S  +      V+  GP++   
Sbjct: 185 ANNGIDTEASYPYEARD-GSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPIS--- 240

Query: 281 ASIELPHISFSFL 293
            +I+  H SF F 
Sbjct: 241 VTIDAAHSSFQFY 253


>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
 gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
          Length = 462

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 97/274 (35%), Positives = 152/274 (55%), Gaps = 20/274 (7%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           +L L    ++SAV ++      +  V + G +SE  +++    + L K   +++  +  E
Sbjct: 10  ILFLAMVTVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAW-LVKHGKAQSQNSLVE 68

Query: 70  HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------RRLRLP 123
            D RF +FK NLR        + +   G+T+F+DLT  E+R ++LG        RR  L 
Sbjct: 69  KDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128

Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
            +A+       ++LP   DWR  GAV  VKDQG CGSCW+FS  GA+EG + + TG+L++
Sbjct: 129 YEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183

Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
           LSEQ+LVDCD         S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG   +
Sbjct: 184 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQ 235

Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             K+     + ++  + +  ++     V H P++
Sbjct: 236 IRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPIS 269


>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
           [Vitis vinifera]
          Length = 374

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 92/242 (38%), Positives = 136/242 (56%), Gaps = 16/242 (6%)

Query: 41  QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
           +SE+ ++     +  + +K  K Y    E + RF +FK NL+        + T   G+ +
Sbjct: 37  RSEEEVMGM---YQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQNRTYKVGLNR 93

Query: 101 FSDLTPSEFRRQFLGL----NRRL-RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
           F+DLT  E+R  +LG      RR  +L   + +  ++P   LP   DWR+ GAV  VKDQ
Sbjct: 94  FADLTNEEYRAIYLGTRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQ 153

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
            +CGSCW+FS   A+EG + + TGEL+SLSEQ+LVDCD E         D GCNGGLM+ 
Sbjct: 154 RSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTE--------YDMGCNGGLMDY 205

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
           AF++I+K GG++ EKDYPYTG DG      KS    ++  +  +   +++     V H P
Sbjct: 206 AFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQP 265

Query: 276 LA 277
           ++
Sbjct: 266 VS 267


>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
          Length = 472

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 91/227 (40%), Positives = 133/227 (58%), Gaps = 15/227 (6%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFL 114
           F  KF + Y++  E   RF+ +  NL   ++ Q  +  TA++GVT+FSD++P EF++  L
Sbjct: 173 FIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSDMSPEEFQKTML 232

Query: 115 GLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
                 R+ ++  +  +    L  N+LP  FDWR  G VT VK+QG+CGSCW+FS TG +
Sbjct: 233 PSLWWDRVVSNGVEYDLKKFNLTFNNLPEQFDWRTKGVVTPVKNQGSCGSCWAFSVTGNI 292

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG   + TG+L+SLSEQ+L+DCD           D GCNGGL  +AF  I + GG+E E 
Sbjct: 293 EGLWAIKTGKLISLSEQELIDCDR---------IDKGCNGGLPINAFREIQRMGGLEPED 343

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            YPY   + G+C   +S IA  + +   I  +E  M A +V+ GPL+
Sbjct: 344 QYPYKARN-GTCHLIRSAIAVTIDDAVEIPRNETVMKAWIVQRGPLS 389


>gi|328788558|ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163-like [Apis
           mellifera]
          Length = 881

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 105/257 (40%), Positives = 148/257 (57%), Gaps = 17/257 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSE 108
           E  F  F  KF+KT+++  E   RF++FK NL+     Q  +  TA +GVT F+DLTP E
Sbjct: 573 EMLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIINELQTFEQGTAEYGVTMFADLTPKE 632

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           F+ ++LG    L+   +   A I  ++  LP  FDWRD+  VT VKDQG CGSCW+FS T
Sbjct: 633 FKTRYLGFRPELKQENEIPLAKIEVSDIFLPLKFDWRDYNVVTPVKDQGLCGSCWAFSVT 692

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           G +EG + +   +L+SLSEQ+L+DCD         + D GCNGG M +A++ I K GG+E
Sbjct: 693 GNVEGQYAIKYKKLLSLSEQELLDCD---------TLDEGCNGGYMENAYKAIEKLGGLE 743

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA----GNVASI 283
            E DYPY G +   C F K      V     I+S+E +MA  L+K+GP++     N    
Sbjct: 744 LESDYPYDGRN-EKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANAMQF 802

Query: 284 ELPHISFSFLFTVSSPK 300
            +  +S  F F + +PK
Sbjct: 803 YIGGVSHPFHF-LCNPK 818


>gi|1163075|emb|CAA81061.1| cysteine proteinase [Trypanosoma congolense]
          Length = 442

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 96/236 (40%), Positives = 130/236 (55%), Gaps = 20/236 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 33  QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 92

Query: 110 RRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           R  +   +      A A K P     + T   P   DWR  GAVT VKDQGACGSCW+FS
Sbjct: 93  RATY---HNGAEYYAAALKRPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGACGSCWAFS 149

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
           A G +EG   ++  EL SLSEQ LV CD         + D GC GGLM+ + ++I+ +  
Sbjct: 150 AIGNIEGQWKVAGHELTSLSEQMLVSCD---------TTDYGCRGGLMDKSLQWIVSSNK 200

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G V   + YPY    G     +KS   + A +S    +  DE+ +A  L K+GP+A
Sbjct: 201 GNVFTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDENAIAEWLAKNGPVA 256


>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 325

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 95/245 (38%), Positives = 136/245 (55%), Gaps = 16/245 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           E  + ++K   +K Y+ + E + R+ ++K N+ R           +  +  F D+T +EF
Sbjct: 24  ESSWYVWKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSKSKNVILRMNHFGDMTNTEF 83

Query: 110 RRQFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R +  GL   L          ++P++   P   DWR  G VT VK+QG CGSCW+FS+TG
Sbjct: 84  RAKMNGL---LLHKHQNGSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTG 140

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           ALEG HF  TG LVSLSEQ LVDC  +         ++GCNGGLM++AF YI   GG++ 
Sbjct: 141 ALEGQHFKKTGRLVSLSEQNLVDCSTDYG-------NNGCNGGLMDNAFSYIKANGGIDT 193

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPH 287
           E  YPY G D G+C++ KS I A  + F  I   DED +   +   GP++    +I+  H
Sbjct: 194 ETGYPYEGQD-GTCRYSKSSIGADDTGFVDIPEGDEDALKQAVATVGPVS---VAIDASH 249

Query: 288 ISFSF 292
           +SF F
Sbjct: 250 MSFQF 254


>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
 gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
           Precursor
 gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
 gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
 gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
          Length = 462

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 97/274 (35%), Positives = 152/274 (55%), Gaps = 20/274 (7%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           +L L    ++SAV ++      +  V + G +SE  +++    + L K   +++  +  E
Sbjct: 10  ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAW-LVKHGKAQSQNSLVE 68

Query: 70  HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------RRLRLP 123
            D RF +FK NLR        + +   G+T+F+DLT  E+R ++LG        RR  L 
Sbjct: 69  KDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128

Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
            +A+       ++LP   DWR  GAV  VKDQG CGSCW+FS  GA+EG + + TG+L++
Sbjct: 129 YEARVG-----DELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLIT 183

Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
           LSEQ+LVDCD         S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG   +
Sbjct: 184 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQ 235

Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             K+     + ++  + +  ++     V H P++
Sbjct: 236 IRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPIS 269


>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
 gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
           Contains: RecName: Full=Cathepsin L heavy chain;
           Contains: RecName: Full=Cathepsin L light chain; Flags:
           Precursor
 gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
          Length = 371

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 104/252 (41%), Positives = 138/252 (54%), Gaps = 23/252 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           +  FK +  K Y  + E  +R ++F  N  + AK  Q      V     V K++DL   E
Sbjct: 59  WHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 118

Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
           FR+   G N    ++LR   ++ K    I P +  LP   DWR  GAVT VKDQG CGSC
Sbjct: 119 FRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 178

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS+TGALEG HF  +G LVSLSEQ LVDC        +   ++GCNGGLM++AF YI 
Sbjct: 179 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDCS-------TKYGNNGCNGGLMDNAFRYIK 231

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
             GG++ EK YPY   D  SC F+K  + A    F+ I   DE +MA  +   GP++   
Sbjct: 232 DNGGIDTEKSYPYEAID-DSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVS--- 287

Query: 281 ASIELPHISFSF 292
            +I+  H SF F
Sbjct: 288 VAIDASHESFQF 299


>gi|343477225|emb|CCD11889.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 447

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 96/236 (40%), Positives = 130/236 (55%), Gaps = 20/236 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           R  +   +      A A K P     + T   P   DWR  GAVT VKDQGACGSCW+FS
Sbjct: 98  RATY---HNGAEYYAAALKRPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGACGSCWAFS 154

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
           A G +EG   ++  EL SLSEQ LV CD         + D GC GGLM+ + ++I+ +  
Sbjct: 155 AIGNIEGQWKVAGHELTSLSEQMLVSCD---------TTDYGCRGGLMDKSLQWIVSSNK 205

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G V   + YPY    G     +KS   + A +S    +  DE+ +A  L K+GP+A
Sbjct: 206 GNVFTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDENAIAEWLAKNGPVA 261


>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
          Length = 375

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 104/252 (41%), Positives = 138/252 (54%), Gaps = 23/252 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           +  FK +  K Y  + E  +R ++F  N  + AK  Q      V     V K++DL   E
Sbjct: 63  WHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 122

Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
           FR+   G N    ++LR   ++ K    I P +  LP   DWR  GAVT VKDQG CGSC
Sbjct: 123 FRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 182

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS+TGALEG HF  +G LVSLSEQ LVDC        +   ++GCNGGLM++AF YI 
Sbjct: 183 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 235

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
             GG++ EK YPY   D  SC F+K  + A    F+ I   DE +MA  +   GP++   
Sbjct: 236 DNGGIDTEKSYPYEAID-DSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVS--- 291

Query: 281 ASIELPHISFSF 292
            +I+  H SF F
Sbjct: 292 VAIDASHESFQF 303


>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
 gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
          Length = 341

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 105/252 (41%), Positives = 137/252 (54%), Gaps = 23/252 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           +  FK +  K Y    E  +R ++F  N  + AK  Q      V     V K++DL   E
Sbjct: 29  WHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 88

Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
           FR+   G N    ++LR   ++ K    I P +  LP   DWR  GAVT VKDQG CGSC
Sbjct: 89  FRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 148

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS+TGALEG HF  +G LVSLSEQ LVDC        +   ++GCNGGLM++AF YI 
Sbjct: 149 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 201

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
             GG++ EK YPY   D  SC F+K  I A    F+ I   DE +MA  +   GP++   
Sbjct: 202 DNGGIDTEKSYPYEAID-DSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVS--- 257

Query: 281 ASIELPHISFSF 292
            +I+  H SF F
Sbjct: 258 VAIDASHESFQF 269


>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
 gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
 gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
 gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
          Length = 341

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 104/252 (41%), Positives = 138/252 (54%), Gaps = 23/252 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           +  FK +  K Y  + E  +R ++F  N  + AK  Q      V     V K++DL   E
Sbjct: 29  WHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHE 88

Query: 109 FRRQFLGLN----RRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
           FR+   G N    ++LR   ++ K    I P +  LP   DWR  GAVT VKDQG CGSC
Sbjct: 89  FRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSC 148

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS+TGALEG HF  +G LVSLSEQ LVDC        +   ++GCNGGLM++AF YI 
Sbjct: 149 WAFSSTGALEGQHFRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 201

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
             GG++ EK YPY   D  SC F+K  + A    F+ I   DE +MA  +   GP++   
Sbjct: 202 DNGGIDTEKSYPYEAID-DSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVS--- 257

Query: 281 ASIELPHISFSF 292
            +I+  H SF F
Sbjct: 258 VAIDASHESFQF 269


>gi|343471318|emb|CCD16236.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 91/233 (39%), Positives = 129/233 (55%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFR+FK ++ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +L G           +K   + T   P   DWR  GAVT VKDQG+CGSCW+F+ATG
Sbjct: 98  RATYLNGAKYYAAALERPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGSCGSCWAFAATG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD         + +  C GG  + AF++I+ +  G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TTEDNCRGGFADRAFKWIVSSNKGNV 208

Query: 227 EREKDYPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY  TDG    C      + A +S    +  DE+ +A  L ++GP+A
Sbjct: 209 FTEESYPYASTDGYVPPCNKSGKVVGAKISGHINLPKDENAIAEWLARNGPVA 261


>gi|343473370|emb|CCD14732.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 94/233 (40%), Positives = 128/233 (54%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQGACGSCW+FSA G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGACGSCWAFSAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD         + D GC GGLM+ + ++I+ +  G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCD---------TTDYGCRGGLMDKSLQWIVSSNKGNV 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
              + YPY    G     +KS   + A +S    +  DE+ +A  L K+GP+A
Sbjct: 209 FTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDENAIAEWLAKNGPVA 261


>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
          Length = 437

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 96/245 (39%), Positives = 143/245 (58%), Gaps = 19/245 (7%)

Query: 38  DGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVH 96
           +G+++E  L N+   F  F  KF + Y++  E   RF+ +  NL   ++ Q  +  TA++
Sbjct: 124 EGKKTE-MLWNS---FLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIY 179

Query: 97  GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGV 152
           GVT+FSD++P EF++  L      R+ ++  +  +    L  N+LP  FDWR  G VT V
Sbjct: 180 GVTQFSDMSPEEFQKTMLPSLWWDRVVSNGVEYDLKKFNLTFNNLPEQFDWRTKGVVTPV 239

Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
           K+QG+CGSCW+FS TG +EG   + TG+L+SLSEQ+L+DCD           D GCNGGL
Sbjct: 240 KNQGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDCDR---------IDKGCNGGL 290

Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
             +AF  I + GG+E E  YPY   + G+C   +S IA  + +   I  +E  M A +V+
Sbjct: 291 PINAFREIQRMGGLEPEDQYPYKARN-GTCHLIRSAIAVTIDDAVEIPRNETVMKAWIVQ 349

Query: 273 HGPLA 277
            GPL+
Sbjct: 350 RGPLS 354


>gi|118123|sp|P25782.1|CYSP2_HOMAM RecName: Full=Digestive cysteine proteinase 2; Flags: Precursor
 gi|11053|emb|CAA45128.1| cysteine proteinase preproenzyme [Homarus americanus]
          Length = 323

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 101/253 (39%), Positives = 132/253 (52%), Gaps = 17/253 (6%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
           L  A   +  FK K+ + Y   EE  YR  +F+ N +      K+ +  + T    + KF
Sbjct: 13  LAAASPSWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKF 72

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
            D+T  EF     G   R   P      P   T    T+ DWR  GAVT VKDQG CGSC
Sbjct: 73  GDMTLEEFNAVMKGNIPRRSAPVSV-FYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSC 131

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS TG+LEG HFL TG L+SL+EQQLVDC     P+       GCNGG MN AF+YI 
Sbjct: 132 WAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQ-------GCNGGWMNDAFDYIK 184

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNV 280
              G++ E  YPY   D GSC+FD + +AA  S  + I+S  +      V+  GP++   
Sbjct: 185 ANNGIDTEAAYPYEARD-GSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPIS--- 240

Query: 281 ASIELPHISFSFL 293
            +I+  H SF F 
Sbjct: 241 VTIDAAHSSFQFY 253


>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
          Length = 322

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 93/229 (40%), Positives = 133/229 (58%), Gaps = 17/229 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV----HGVTKFSDLTPSE 108
           F  FK +  K+Y  Q E   RF +F+AN+   ++   L    +      + +F+DLT  E
Sbjct: 26  FETFKVENGKSYRNQVEEVQRFNIFRANVLEIEQHNALYEQGLVSYKKAINQFTDLTQEE 85

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           F+  +LGL+ +  L    Q    L   ++PT  DWR  G VTGVK+QG+CGSCWSF+ TG
Sbjct: 86  FKA-YLGLHVKPVLNNTIQYE--LKGLEVPTSVDWRSAGQVTGVKNQGSCGSCWSFALTG 142

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           + EGA++    +LVSLSEQQLVDC        S S + GCNGG +++ F YI +  G++ 
Sbjct: 143 STEGAYYRKHKQLVSLSEQQLVDC--------STSINYGCNGGFLDATFPYIEQY-GLQT 193

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           E  YPYTG D GSCK+D SK+   +SN+  +   E ++   +   GP+A
Sbjct: 194 ESSYPYTGVD-GSCKYDSSKVVTKISNYVSLHGSESKVLEPVGSIGPVA 241


>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 455

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 91/224 (40%), Positives = 134/224 (59%), Gaps = 15/224 (6%)

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
           K  K Y    E D RF++FK NLR   ++   + T   G+ +F+DLT  E+R ++LG   
Sbjct: 46  KHGKLYNALGEKDKRFQIFKDNLRFIDQQNAENRTYKLGLNRFADLTNEEYRARYLGTKI 105

Query: 117 --NRRL-RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
             NRRL R P++     +  T  LP   DWR  GAV  VKDQ +CGSCW+FSA GA+EG 
Sbjct: 106 DPNRRLGRTPSNRYAPRVGET--LPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGI 163

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
           + + TG+L+SLSEQ+LVDCD           + GCNGGLM+ AFE+I+K GG++ E+DYP
Sbjct: 164 NKIVTGDLISLSEQELVDCDT--------GYNMGCNGGLMDYAFEFIIKNGGIDSEEDYP 215

Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           Y G DG   ++ K+    ++  +  +++ ++      V + P++
Sbjct: 216 YKGVDGRCDEYRKNAKVVSIDGYEDVNTYDELALKKAVANQPVS 259


>gi|14041143|emb|CAA71554.1| cathepsin [Geodia cydonium]
          Length = 322

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 97/244 (39%), Positives = 133/244 (54%), Gaps = 18/244 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           +  +K K++K Y++QEE   R RV+ +NL+  +            + +F+DL P EF   
Sbjct: 19  WEQWKLKYNKQYSSQEEDYLRQRVWLSNLKFVEEFDSEREGYTVAMNEFADLDPREFVSH 78

Query: 113 FLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
           + GL RR   P  +   P     D   LPT  DWR  G VTGVK+QG CGSCW+FSATG+
Sbjct: 79  YNGLRRR---PHTSSGEPCTLGEDVSALPTTVDWRTKGYVTGVKNQGQCGSCWAFSATGS 135

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           LEG HF +TG+LVSLSEQ LVDC        S   + GCNGGL + AF+Y++K GG++ E
Sbjct: 136 LEGQHFNATGKLVSLSEQNLVDC-------SSAEGNEGCNGGLPDDAFKYVIKNGGIDTE 188

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHI 288
             YPY   D   C +  + I +  S++  I S  E Q+       GP+      I+  H+
Sbjct: 189 ASYPYVARD-EKCHYSSANIGSTCSSYVDIESKSEAQLQVASATVGPIP---VGIDASHL 244

Query: 289 SFSF 292
            F  
Sbjct: 245 GFQL 248


>gi|344271892|ref|XP_003407771.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
          Length = 334

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 95/256 (37%), Positives = 134/256 (52%), Gaps = 17/256 (6%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT-- 99
           ++ H  + +  +  +KS + K YA  EE D+R  V++ N++  +R         HG T  
Sbjct: 18  AQKHDESLDEQWYQWKSLYKKPYAANEE-DWRRAVWEKNMKMIERHNQEYSQGKHGFTMT 76

Query: 100 --KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
              F D+T  EFR+   G   + R+       P+     +P   DW   G VT VKDQG 
Sbjct: 77  MNAFGDMTNEEFRQVMNGFQNQKRIQGKLLYEPVF--GHIPKSVDWTQKGYVTPVKDQGQ 134

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FSATGALEG  F  TG+LVSLSEQ LVDC            + GCNGGLM++AF
Sbjct: 135 CGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRR-------EGNEGCNGGLMDNAF 187

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +YI   GG++ E+ YPYT  D   C+++    AA  + F  I   E  +   +   GP++
Sbjct: 188 QYIKDNGGLDSEESYPYTAMDKQDCRYNPKYSAANDTGFVDIPPQEKALMKAVATVGPIS 247

Query: 278 GNVASIELPHISFSFL 293
               +++  H SF F 
Sbjct: 248 ---VAVDAGHESFQFY 260


>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 105/255 (41%), Positives = 146/255 (57%), Gaps = 31/255 (12%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLT 105
           + H+ LFK + +KTY  Q++   R  +F+AN+++     LL      +   G+  F+D+T
Sbjct: 23  DEHWELFKRQHNKTY-LQKQDVGRRAIFEANIKKINAHNLLYDLGRSSYRLGLNGFADMT 81

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND-----LPTDFDWRDHGAVTGVKDQGACGS 160
           P EF +      R  R  A+  +   L   D     +P   DWR  G VT VK+QG CGS
Sbjct: 82  PDEFEKY-----RGTRFEANEARVSKLQHRDNRSMHVPDTVDWRTEGYVTPVKNQGVCGS 136

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS TGALEG HF  +G+LVSLSEQ LVDC        +   ++GCNGGLM++AF +I
Sbjct: 137 CWAFSTTGALEGQHFRRSGDLVSLSEQMLVDC-------SAVYGNAGCNGGLMDNAFRFI 189

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQM--AANLVKHGPLA 277
             AGG+E EK YPYTG D G+C FD   I A ++ F  V S DE+ +  AA +V  GP++
Sbjct: 190 KDAGGLETEKSYPYTGKD-GTCHFDARGIGAKLTGFVDVPSRDEEALKEAAGVV--GPVS 246

Query: 278 GNVASIELPHISFSF 292
               +I+    +F F
Sbjct: 247 ---VAIDASGQNFQF 258


>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
 gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
          Length = 457

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 100/285 (35%), Positives = 151/285 (52%), Gaps = 16/285 (5%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAE--HHFSLFKS 58
           M+   LS  + L++  +++S       D  I     +  ++S     N E    +  +  
Sbjct: 1   MDSNTLSPAMKLMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLV 60

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
           K  K+Y    E D RF +FK NL+       L+ T   G+T+F+DLT  E+R +FLG   
Sbjct: 61  KHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKI 120

Query: 117 --NRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
             NRR++    ++     P   + LP   DWR  GAV GVKDQ +CGSCW+FSA  A+EG
Sbjct: 121 DPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEG 180

Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
            + + TG+L+SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG++ E DY
Sbjct: 181 INKIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIISNGGIDSEDDY 232

Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           PY   DG   +  K+     + ++  + + ++      V + P+A
Sbjct: 233 PYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIA 277


>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
 gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
 gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 100/285 (35%), Positives = 151/285 (52%), Gaps = 16/285 (5%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAE--HHFSLFKS 58
           M+   LS  + L++  +++S       D  I     +  ++S     N E    +  +  
Sbjct: 1   MDSNTLSPAMKLMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLV 60

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
           K  K+Y    E D RF +FK NL+       L+ T   G+T+F+DLT  E+R +FLG   
Sbjct: 61  KHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKI 120

Query: 117 --NRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
             NRR++    ++     P   + LP   DWR  GAV GVKDQ +CGSCW+FSA  A+EG
Sbjct: 121 DPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEG 180

Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
            + + TG+L+SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG++ E DY
Sbjct: 181 INKIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIISNGGIDSEDDY 232

Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           PY   DG   +  K+     + ++  + + ++      V + P+A
Sbjct: 233 PYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIA 277


>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1454

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 95/245 (38%), Positives = 141/245 (57%), Gaps = 24/245 (9%)

Query: 41   QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVT 99
            +SEDH   + H F  FK++ ++TY +  EH+ RFR+FK NL + ++    +  TA +G+T
Sbjct: 1137 KSEDH---SRHLFDKFKTRHNRTYQSSLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGIT 1193

Query: 100  KFSDLTPSEFR-RQFLGLNRR------LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
             F+D+T +E+R R  L + R       +R P     A I    +LP  FDWR+ GAV+ V
Sbjct: 1194 HFADMTSAEYRARTGLVVPREGDEVNHIRNPM----AEIDEHMELPDAFDWRELGAVSEV 1249

Query: 153  KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
            K+QG CGSCW+FS  G +EG H + T +L   SEQ+L+DCD         + DS CNGG 
Sbjct: 1250 KNQGNCGSCWAFSVVGNIEGLHQVKTKKLEEYSEQELLDCD---------TVDSACNGGF 1300

Query: 213  MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
            M+ A++ I K GG+E E +YPY      +C F+K+     V     +  +E  +A  LV 
Sbjct: 1301 MDDAYKAIEKIGGLELESEYPYLAKKQKTCHFNKTMAHVRVKGAVDLPKNETAIAQFLVA 1360

Query: 273  HGPLA 277
            +GP++
Sbjct: 1361 NGPVS 1365


>gi|297793593|ref|XP_002864681.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310516|gb|EFH40940.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 108/280 (38%), Positives = 152/280 (54%), Gaps = 22/280 (7%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSL 55
           + +LSS++L++L +  A+A    D+   IR V  SDG    E++   +L    H   F+ 
Sbjct: 4   KTVLSSVVLVILIAASAAADIGFDELNPIRMV--SDGLREVEETVSQILGQSRHVLTFAR 61

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
           F  ++ K Y   EE   RF +FK NL   +       +   GV +F+DLT  EF+R  LG
Sbjct: 62  FTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLG 121

Query: 116 LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
             +     A  + +  L    LP   DWR+ G V+ VKDQG CGSCW+FS TGALE A+ 
Sbjct: 122 AAQNC--SATLKGSHKLTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYH 179

Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
            + G+ +SLSEQQLVDC    +       + GCNGGL + AFEYI   GG++ E+ YPY 
Sbjct: 180 QAFGKGISLSEQQLVDCAGAYN-------NYGCNGGLPSQAFEYIKSNGGLDTEEAYPYI 232

Query: 236 GTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
           G D G+CKF    +   V    N ++ + DE + A  LV+
Sbjct: 233 GKD-GTCKFSAENVGVQVLDSVNITLGAEDELKHAVGLVR 271


>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
          Length = 369

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 89/232 (38%), Positives = 125/232 (53%), Gaps = 16/232 (6%)

Query: 68  EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
           EEH  RF +FK N++        D     G+ KF+DL+  EF+  ++G    LR   + Q
Sbjct: 62  EEHAERFEIFKENVKYIDSVNKKDSPYKLGLNKFADLSNEEFKAIYMGTKMDLRGDREVQ 121

Query: 128 KAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
               +  N   LP   DWR  GAV  VK+QG CGSCW+FS   ++EG ++++TG LVSLS
Sbjct: 122 SGSFMYQNSEPLPASIDWRQKGAVAAVKNQGHCGSCWAFSTVASVEGINYITTGNLVSLS 181

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT--GTDGGSCK 243
           EQQLVDC  E         +SGCNGGLM++AF+YI+  GG+  E +YPYT   T+  S K
Sbjct: 182 EQQLVDCSTE---------NSGCNGGLMDTAFQYIINNGGIVTEDNYPYTAEATECSSTK 232

Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
            +       +  F  + ++ +Q     V H P++    +IE     F F  T
Sbjct: 233 INSQTTRVVIDGFEDVPANNEQALKEAVAHQPVS---VAIEASGQDFQFYST 281


>gi|402584107|gb|EJW78049.1| hypothetical protein WUBG_11042, partial [Wuchereria bancrofti]
          Length = 213

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 90/222 (40%), Positives = 133/222 (59%), Gaps = 22/222 (9%)

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNR 118
           +++ Y +++E   RFR++K NLR AK  Q  +  TA++G T +SD+T  EFR+  L    
Sbjct: 1   YNRKYRSKKEFLKRFRIYKRNLRLAKLIQNKEEGTAIYGETPYSDMTQEEFRKIMLPY-- 58

Query: 119 RLRLPADAQKAPIL-------PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
             + P +  K  ++         +++P  FDWRD G VT VK+QG+CGSCW+FS TG +E
Sbjct: 59  --KWPLNENKKQMIDLAEYGITDDEIPESFDWRDKGVVTEVKNQGSCGSCWAFSVTGNIE 116

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           GA  +  G+L+SLSEQ+LVDCD           D GC GGL  +A++ I++ GG+E EKD
Sbjct: 117 GAWAIKKGKLISLSEQELVDCD---------VIDQGCKGGLPLNAYKEIIRMGGLESEKD 167

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
           YPY G  G  C   +  IA  +++   + +DE ++AA L K 
Sbjct: 168 YPYDGY-GEKCHLVRRDIAVYINDSVQLPADEFKIAAWLTKK 208


>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
          Length = 471

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 99/283 (34%), Positives = 157/283 (55%), Gaps = 28/283 (9%)

Query: 5   ILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD----GEQSEDHLLNAEHHFSLFKSKF 60
           I  S L ++ S  LAS   ++ D       +P+D     E++E H++    H+ +   K 
Sbjct: 10  IAISFLFMVFSLSLASMSIIDYD-------LPADPLQSTERTEAHMMKMYEHWLV---KH 59

Query: 61  SKTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--LN 117
            K Y    E + RF +FK NLR   ++  +   T   G+TKF+DLT  E+R  +LG  + 
Sbjct: 60  GKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKME 119

Query: 118 RRLRLPADAQKAPILPT---NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
           ++ +L  +  +  +      +DLP+  DWR+ GAVT VKDQG CGSCW+FS  G++EG +
Sbjct: 120 KKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGIN 179

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
            + TG+L+SLSEQ+LVDCD         + + GCNGGLM+ AFE+I+K GG++ E DYPY
Sbjct: 180 QIVTGDLISLSEQELVDCDK--------AYNQGCNGGLMDYAFEFIIKNGGIDSEADYPY 231

Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             +D       K+     +  +  +  ++++     V + P++
Sbjct: 232 RASDNMCDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVS 274


>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
          Length = 336

 Score =  167 bits (424), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 102/248 (41%), Positives = 137/248 (55%), Gaps = 20/248 (8%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+ L+KS  SK Y  +EE  +R  V++ NL++ +   L      H    G+  F D+T  
Sbjct: 27  HWELWKSWHSKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGTHSYRLGMNHFGDMTHE 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EFR+   G  R+      A+ +  L  N L  P   DWRD+G VT VKDQG CGSCW+FS
Sbjct: 86  EFRQLMNGYKRKAE--TKARGSLFLEPNFLEAPKSVDWRDNGYVTPVKDQGQCGSCWAFS 143

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGALEG HF  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+Y+    G
Sbjct: 144 TTGALEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDNQG 196

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIE 284
           ++ E  YPY GTD   C +D +  +   + F  I S +++     V   GP++    +I+
Sbjct: 197 LDSEDSYPYLGTDDQPCHYDPTYNSVNDTGFVDIPSGKERALMKAVAAVGPVS---VAID 253

Query: 285 LPHISFSF 292
             H SF F
Sbjct: 254 AGHESFQF 261


>gi|256077197|ref|XP_002574894.1| cathepsin F (C01 family) [Schistosoma mansoni]
 gi|353230780|emb|CCD77197.1| cathepsin F (C01 family) [Schistosoma mansoni]
          Length = 419

 Score =  167 bits (424), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 92/242 (38%), Positives = 140/242 (57%), Gaps = 17/242 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
           N +  +  FK K+ K Y   E+ + RF +FK+N+ +A+  Q+ +  +A++GVT +SDLT 
Sbjct: 115 NVDEKYVQFKLKYRKQYHETED-EIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTT 173

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            EF R  L       +P+     P       N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 174 DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 231

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG +E   F  TG+L+SLSEQQLVDCD           D GCNGGL ++A+E I+K 
Sbjct: 232 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 282

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG+  E +YPY   +   C      +A  +++   ++ DE ++AA L  +  ++  + ++
Sbjct: 283 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNAL 341

Query: 284 EL 285
            L
Sbjct: 342 LL 343


>gi|407838603|gb|EKG00105.1| cysteine peptidase, putative,cysteine peptidase, clan CA, family
           C1, cathepsin L-like, putative, partial [Trypanosoma
           cruzi]
          Length = 326

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 97/236 (41%), Positives = 123/236 (52%), Gaps = 26/236 (11%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F+ FK K  + Y +  E  +R  VF+ANL  A+     +P A  GVT FSDLT  EFR +
Sbjct: 71  FAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTPFSDLTREEFRSR 130

Query: 113 -------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
                  F     R R+P D +          P   DWR  GAVT VKDQG CGSCW+FS
Sbjct: 131 YHNGAAHFAAAQERARVPVDVEVV------GAPAAKDWRARGAVTAVKDQGQCGSCWAFS 184

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA-- 223
           A G +E   FL+   L +LSEQ LV CD           DSGC GGLMN+AFE+I++   
Sbjct: 185 AIGNVECQWFLAGHPLTNLSEQMLVSCD---------KTDSGCGGGLMNNAFEWIVQENN 235

Query: 224 GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 236 GAVYTEGSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 291


>gi|256077193|ref|XP_002574892.1| cathepsin F (C01 family) [Schistosoma mansoni]
 gi|353230781|emb|CCD77198.1| cathepsin F (C01 family) [Schistosoma mansoni]
          Length = 457

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 92/242 (38%), Positives = 140/242 (57%), Gaps = 17/242 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
           N +  +  FK K+ K Y   E+ + RF +FK+N+ +A+  Q+ +  +A++GVT +SDLT 
Sbjct: 153 NVDEKYVQFKLKYRKQYHETED-EIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTT 211

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            EF R  L       +P+     P       N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 212 DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 269

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG +E   F  TG+L+SLSEQQLVDCD           D GCNGGL ++A+E I+K 
Sbjct: 270 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 320

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG+  E +YPY   +   C      +A  +++   ++ DE ++AA L  +  ++  + ++
Sbjct: 321 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNAL 379

Query: 284 EL 285
            L
Sbjct: 380 LL 381


>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
 gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
          Length = 341

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 112/299 (37%), Positives = 153/299 (51%), Gaps = 42/299 (14%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           + + L+L L +++A A AV+  + +          Q E H    EH          K Y 
Sbjct: 1   MRTALILPLLALVAVAQAVSYAEVI----------QEEWHTFKLEHR---------KNYQ 41

Query: 66  TQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLN---- 117
            + E  +R ++F  N  + AK  QL    AV     V K++D+   EF     G N    
Sbjct: 42  DETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLH 101

Query: 118 RRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
           ++LR   ++ K     + +   LP   DWR  GAVT VKDQG CGSCW+FS+TGALEG H
Sbjct: 102 KQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQH 161

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
           +  +G LVSLSEQ LVDC        +   ++GCNGGLM++AF YI   GG++ EK YPY
Sbjct: 162 YRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 214

Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
              D  SC F+K  I A    F  I   +E +MA  +   GP+A    +I+  H SF F
Sbjct: 215 EAID-DSCHFNKGSIGATDRGFVDIPQGNEKKMAEAVATIGPVA---VAIDASHESFQF 269


>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
           Neff]
          Length = 326

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 95/242 (39%), Positives = 137/242 (56%), Gaps = 18/242 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F+ +  +  K+YA  EE  YR+ V++ N    +     + +    + KF DLT +EF + 
Sbjct: 30  FADWMQEHQKSYA-NEEFVYRWNVWRENYLYIEAHNHQNKSFHLAMNKFGDLTNAEFNKL 88

Query: 113 FLGLNRRLRLPAD--AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
           F GL+    + AD   Q++ I P   LP DFDWR  GAVT VK+QG CGSCWSFS TG+ 
Sbjct: 89  FKGLS----ITADQAKQESDIAPAPGLPADFDWRQKGAVTHVKNQGQCGSCWSFSTTGST 144

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EGA+FL  G L SLSEQ LVDC        +   + GCNGGLM+ AFEYI++  G++ E+
Sbjct: 145 EGANFLKHGRLTSLSEQNLVDC-------STSYGNHGCNGGLMDYAFEYIIRNKGIDTEE 197

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
            YPY  +  G+C+++K      + +++ + S  +    N V   P +    +I+  H SF
Sbjct: 198 SYPYHASQ-GTCRYNKQHSGGELVSYTNVPSGNEGALLNAVATQPTS---VAIDASHSSF 253

Query: 291 SF 292
            F
Sbjct: 254 QF 255


>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
 gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
          Length = 417

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 100/243 (41%), Positives = 134/243 (55%), Gaps = 23/243 (9%)

Query: 62  KTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLN 117
           K Y  + E  +R ++F  N  + AK  QL     V     V K++D+   EFR+   G N
Sbjct: 114 KNYLDETEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMNGFN 173

Query: 118 ----RRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
               + LR   ++ K     + +   LP   DWRD GAVTGVKDQG CGSCW+FS+TGAL
Sbjct: 174 YTLHKELRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSSTGAL 233

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG H+  +G LVSLSEQ LVDC        +   ++GCNGGLM++AF YI   GG++ EK
Sbjct: 234 EGQHYRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNGGIDTEK 286

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHIS 289
            YPY   D  SC F+K  I A    F  I   +E ++A  +   GP++    +I+  H S
Sbjct: 287 SYPYEALD-DSCHFNKGTIGATDRGFVDIPQGNEKKLAEAVATIGPVS---VAIDASHES 342

Query: 290 FSF 292
           F F
Sbjct: 343 FQF 345


>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
          Length = 364

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 94/240 (39%), Positives = 132/240 (55%), Gaps = 23/240 (9%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLT 105
             A +HF+ F  +  K Y  + E   RF +FK NL   +  Q  D  TA++G+ +F+DL+
Sbjct: 58  FGAWNHFTSFIERHDKVYRNESEALKRFGIFKRNLEIIRSAQENDKGTAIYGINQFADLS 117

Query: 106 PSEFRRQFLG--------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
           P EF++  L          NR + L A+     + P   LP  FDWR+HGAVT VK +G 
Sbjct: 118 PEEFKKTHLPHTWKQPDHPNRIVDLAAEG----VDPKEPLPESFDWREHGAVTKVKTEGH 173

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           C +CW+FS TG +EG  FL+  +LVSLS QQL+DCD           D GCNGG    A+
Sbjct: 174 CAACWAFSVTGNIEGQWFLAKKKLVSLSAQQLLDCD---------VVDEGCNGGFPLDAY 224

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           + I++ GG+E E  YPY       C+   S IA  ++    +  DE++M A LVK GP++
Sbjct: 225 KEIVRMGGLEPEDKYPYE-AKAEQCRLVPSDIAVYINGSVELPHDEEKMRAWLVKKGPIS 283


>gi|256077195|ref|XP_002574893.1| cathepsin F (C01 family) [Schistosoma mansoni]
 gi|353230782|emb|CCD77199.1| cathepsin F (C01 family) [Schistosoma mansoni]
          Length = 456

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 92/242 (38%), Positives = 139/242 (57%), Gaps = 18/242 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
           N +  +  FK K+ K Y   E  + RF +FK+N+ +A+  Q+ +  +A++GVT +SDLT 
Sbjct: 153 NVDEKYVQFKLKYRKQY--HETDEIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTT 210

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            EF R  L       +P+     P       N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 211 DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 268

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG +E   F  TG+L+SLSEQQLVDCD           D GCNGGL ++A+E I+K 
Sbjct: 269 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 319

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG+  E +YPY   +   C      +A  +++   ++ DE ++AA L  +  ++  + ++
Sbjct: 320 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNAL 378

Query: 284 EL 285
            L
Sbjct: 379 LL 380


>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
 gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
          Length = 341

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 112/299 (37%), Positives = 153/299 (51%), Gaps = 42/299 (14%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           + + L+L L +++A A AV+  + +          Q E H    EH          K Y 
Sbjct: 1   MRTALILPLLALVAVAQAVSYAEVI----------QEEWHTFKLEHR---------KNYQ 41

Query: 66  TQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLN---- 117
            + E  +R ++F  N  + AK  QL    AV     V K++D+   EF     G N    
Sbjct: 42  DETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLH 101

Query: 118 RRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
           ++LR   ++ K     + +   LP   DWR  GAVT VKDQG CGSCW+FS+TGALEG H
Sbjct: 102 KQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQH 161

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
           +  +G LVSLSEQ LVDC        +   ++GCNGGLM++AF YI   GG++ EK YPY
Sbjct: 162 YRKSGVLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 214

Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
              D  SC F+K  I A    F  I   +E +MA  +   GP+A    +I+  H SF F
Sbjct: 215 EAID-DSCHFNKGTIGATDRGFVDIPQGNEKKMAEAVATIGPVA---VAIDASHESFQF 269


>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 288

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 97/257 (37%), Positives = 143/257 (55%), Gaps = 22/257 (8%)

Query: 44  DHLLNAEHHFSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
           +HL N +    LF+S   + SK Y + EE  +RF VF+ NL    +R     +   G+ +
Sbjct: 39  EHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNE 98

Query: 101 FSDLTPSEFRRQFLGLNR----RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQG 156
           F+DLT  EF+ ++LGL +    R R P+   +   +   DLP   DWR  GAV  VKDQG
Sbjct: 99  FADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQG 156

Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
            CGSCW+FS   A+EG + ++TG L SLSEQ+L+DCD         + +SGCNGGLM+ A
Sbjct: 157 QCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCD--------TTFNSGCNGGLMDYA 208

Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA-AAVSNFSVISSDEDQMAANLVKHGP 275
           F+YI+  GG+ +E DYPY   + G C+  K  +    +S +  +  ++D+     + H P
Sbjct: 209 FQYIISTGGLHKEDDYPYL-MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQP 267

Query: 276 LAGNVASIELPHISFSF 292
           ++    +IE     F F
Sbjct: 268 VS---VAIEASGRDFQF 281


>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
          Length = 567

 Score =  167 bits (423), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 97/231 (41%), Positives = 130/231 (56%), Gaps = 22/231 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F + ++K+YA   E   R  +F  NL  A + Q LD  +A +GVTKFSDLT  EFR 
Sbjct: 270 FKDFLTTYNKSYANATETQRRLGIFARNLELAHKLQELDQGSAQYGVTKFSDLTEEEFRM 329

Query: 112 QFLG-----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            +L      L  R   PA   + P       P  +DWRDHGA+T  K+QG CGSCW+FS 
Sbjct: 330 FYLNPLLSSLPGRALRPAPRARGPA------PASWDWRDHGALTAAKNQGMCGSCWAFSV 383

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TG +EG  FL  G L++LSEQ+LVDCD         + D  C GGL ++A+  I   GG+
Sbjct: 384 TGNVEGQWFLRRGALLTLSEQELVDCD---------TLDQACGGGLPSNAYTAIETLGGL 434

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           E EKDY Y G     C F   K  A +++   +S DE ++AA L ++GP++
Sbjct: 435 ETEKDYSYEGRK-ERCSFSPDKARAYINSSVDLSRDEQEIAAWLAENGPVS 484


>gi|343476708|emb|CCD12273.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 363

 Score =  167 bits (422), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 95/249 (38%), Positives = 135/249 (54%), Gaps = 14/249 (5%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT V+D+  C S W+FSA G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVTVSTGKAPDAVDWRKKGAVTPVRDERLCDSSWAFSAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ L+ CD   D         GC GGLM+ AF++I+ +  G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLLSCDTRED---------GCGGGLMDRAFQWIVSSNKGNV 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
             E+ YPY  TDG   + +KS   + A +S++  +  DE+ +A  L K+GP+A  V +  
Sbjct: 209 FTEQSYPYASTDGDVPRCNKSGKVVGAKISDYVDLPQDENAIAEWLAKNGPVAIAVEATS 268

Query: 285 LPHISFSFL 293
           L   +   L
Sbjct: 269 LQRYTGGVL 277


>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 364

 Score =  167 bits (422), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 94/282 (33%), Positives = 146/282 (51%), Gaps = 30/282 (10%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M  +++ +LLLL  +   A+A+++ +               SE+ +++    + +   K 
Sbjct: 1   MPSMLIPTLLLLSFTFSHATAMSIIN--------------YSENEVMDMYEEWLV---KH 43

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN--- 117
            K Y   +E + RF+VFK NL   +     + T   G+ KF+D+T  E+R  +LG     
Sbjct: 44  RKVYNGLDEKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNEEYRAMYLGTRTDA 103

Query: 118 --RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
             R ++      +      + LP   DWR  GAV  +KDQG CGSCW+FS   A+EG + 
Sbjct: 104 KRRVMKTQNTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINN 163

Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
           + TGE VSLSEQ+LVDCD E         D GCNGGLM+ AF++I++ GG++ E+DYPY 
Sbjct: 164 IVTGEFVSLSEQELVDCDRE--------YDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQ 215

Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G DG   +  K      +  +  + S+ +      V H P++
Sbjct: 216 GIDGTCDQTKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVS 257


>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
 gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
          Length = 455

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 94/270 (34%), Positives = 151/270 (55%), Gaps = 10/270 (3%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           ++L L    +ASAV ++      +  V + G +S+  +++    + L K   ++   +  
Sbjct: 2   VILFLAMVAVASAVDMSIISYDEKHGVSTTGGRSDAEVMSIYEAW-LVKHGKAQNQNSLV 60

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA-DAQ 127
           E D RF +FK NLR        + +   G+T+F+DLT  E+R ++LG     +     +Q
Sbjct: 61  EKDRRFEIFKDNLRFIDDHNKKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSQ 120

Query: 128 KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
           +      ++LP   DWR  GAV  VKDQG+CGSCW+FS  GA+EG + + TG+L++LSEQ
Sbjct: 121 RYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVTGDLITLSEQ 180

Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
           +LVDCD         S + GCNGGLM+ AFE+I+K GG++ +KDYPY G DG   +  K+
Sbjct: 181 ELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKN 232

Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
                + ++  + +  ++     V H P++
Sbjct: 233 AKVVTIDSYEDVPTYSEESLKKAVAHQPVS 262


>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
          Length = 362

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 104/243 (42%), Positives = 134/243 (55%), Gaps = 21/243 (8%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           FK+ F K Y T EE   RF +F+  L R     ++  +   +   GV +FSD++  E+ R
Sbjct: 57  FKTLFGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFSDMSHDEYLR 116

Query: 112 QFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
              GL R  R  +  +       +   L    DWRD G VT VK+QG CGSCWSFS TG+
Sbjct: 117 HN-GLRRGNRKYSKGEGCDSYTKSGKQLDDKVDWRDKGYVTPVKNQGQCGSCWSFSTTGS 175

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVER 228
           LEG HF  TG+L+SLSEQQLVDC        SG+  + GCNGGLM++AFEYI   GG+E 
Sbjct: 176 LEGQHFRQTGKLISLSEQQLVDC--------SGTFGNEGCNGGLMDNAFEYIKSIGGLEG 227

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPH 287
           E DYPYT    G C   KS   A  +  + V S DED +   L   GP++    +I+  H
Sbjct: 228 EDDYPYTAKQ-GKCHLKKSLFKANDTGCTDVESGDEDALKDALASVGPIS---VAIDASH 283

Query: 288 ISF 290
            SF
Sbjct: 284 ASF 286


>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
 gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
          Length = 364

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 94/282 (33%), Positives = 146/282 (51%), Gaps = 30/282 (10%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M  +++ +LLLL  +   A+A+++ +               SE+ +++    + +   K 
Sbjct: 1   MPSMLIPTLLLLSFTFSHATAMSIIN--------------YSENEVMDMYEEWLV---KH 43

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN--- 117
            K Y   +E + RF+VFK NL   +     + T   G+ KF+D+T  E+R  +LG     
Sbjct: 44  RKVYNGLDEKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNKEYRAMYLGTRTDA 103

Query: 118 --RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
             R ++      +      + LP   DWR  GAV  +KDQG CGSCW+FS   A+EG + 
Sbjct: 104 KRRVMKTQNTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINN 163

Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
           + TGE VSLSEQ+LVDCD E         D GCNGGLM+ AF++I++ GG++ E+DYPY 
Sbjct: 164 IVTGEFVSLSEQELVDCDRE--------YDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQ 215

Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G DG   +  K      +  +  + S+ +      V H P++
Sbjct: 216 GIDGTCDETKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVS 257


>gi|340380717|ref|XP_003388868.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
          Length = 337

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 100/280 (35%), Positives = 149/280 (53%), Gaps = 47/280 (16%)

Query: 8   SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
           +L  LL S  +A   + +DD+ M                      F+++  K+ KTY+T 
Sbjct: 9   ALFFLLASFTVALPFSPSDDEVMAES-------------------FNMWMKKYEKTYSTM 49

Query: 68  EEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFL-------GLNRR 119
           EE++ R RV+ +N    ++  +   P   + + +FSDLT +EF++ +L         N  
Sbjct: 50  EEYNERLRVYTSNYYYIEQLNKEHGPHTEYELNQFSDLTFAEFKKIYLTEPQHCSATNGN 109

Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
            + P +A+          P   DWR+   +T VKDQG CGSCW+FS TG LE  H + TG
Sbjct: 110 FQKPVNARD---------PVAVDWREKNVITPVKDQGKCGSCWTFSTTGCLEAHHAIKTG 160

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           +L+SLSEQQLVDC        +G+ ++ GCNGGL + AFEYI   GG+E E +Y YT  D
Sbjct: 161 QLISLSEQQLVDC--------AGAFNNHGCNGGLPSQAFEYIKYNGGIESESNYNYTAKD 212

Query: 239 GGSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLA 277
            G C+F+ S +AA VS+   I+ D E  +   +   GP++
Sbjct: 213 -GVCRFNSSLVAATVSDVVNITKDAEGDIGTAVANVGPVS 251


>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  166 bits (421), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 97/245 (39%), Positives = 136/245 (55%), Gaps = 22/245 (8%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           +K++  K Y + EE   R  +++ NL    R   +  L   T   G+ +F+DL   EF  
Sbjct: 31  WKNEHGKRYLSDEEEASRRLIWQKNLDIVIRHNLKYDLGHFTYDLGMNQFADLQNKEFVA 90

Query: 112 QFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
              G  R       A+ +  LP N+   LP   DWR  G VT VKDQG CGSCW+FSATG
Sbjct: 91  MMTGF-RVNGTSKAAKGSTFLPPNNVGKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSATG 149

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           +LEG HF  TG+LVSLSEQ LVDC  +         + GCNGGLM+ AF+YI+ AGG++ 
Sbjct: 150 SLEGQHFKKTGKLVSLSEQNLVDCSDK---------NYGCNGGLMDRAFQYIIDAGGIDT 200

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIELPH 287
           E+ YPY   D G+C F  + + A V+ ++ ++S  ++     V H GP++    +I+  H
Sbjct: 201 EESYPYIAMD-GNCHFKTANVGATVTGYTDVTSGSEKALQKAVAHIGPIS---VAIDASH 256

Query: 288 ISFSF 292
            SF  
Sbjct: 257 FSFQL 261


>gi|3023456|sp|Q26534.1|CATL_SCHMA RecName: Full=Cathepsin L; AltName: Full=SMCL1; Flags: Precursor
 gi|555663|gb|AAC46485.1| preprocathepsin L [Schistosoma mansoni]
 gi|1094710|prf||2106314A cathepsin L
          Length = 319

 Score =  166 bits (421), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 91/234 (38%), Positives = 136/234 (58%), Gaps = 17/234 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL-LDPTAVHGVTKFSDLTP 106
           N +  +  FK K+ K Y   E+ + RF +FK+N+ +A+  Q+ +  +A++GVT +SDLT 
Sbjct: 15  NVDEKYVQFKLKYRKQYHETED-EIRFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTT 73

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPI---LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            EF R  L       +P+     P       N++P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 74  DEFARTHL--TASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 131

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG +E   F  TG+L+SLSEQQLVDCD           D GCNGGL ++A+E I+K 
Sbjct: 132 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD---------GLDDGCNGGLPSNAYESIIKM 182

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG+  E +YPY   +   C      +A  +++   ++ DE ++AA L  +  ++
Sbjct: 183 GGLMLEDNYPYDAKN-EKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTIS 235


>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 359

 Score =  166 bits (421), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 114/293 (38%), Positives = 156/293 (53%), Gaps = 22/293 (7%)

Query: 1   MERLILSSLLLLLLSSVL-ASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH--- 52
           M R+  +S LL+L++ V  ASA +   D   I+QVV SDG    E S   ++    H   
Sbjct: 1   MARVSPASFLLILIACVAGASAGSSFADQNPIKQVV-SDGLRELEASVLQVIGQTRHSLA 59

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F+ F  ++ K+Y T EE   RF +F  +L+  +       +   GV +F+DLT  EFR+ 
Sbjct: 60  FARFAHRYGKSYETAEEMKRRFSIFVDSLKMIRSHNKKGLSYTLGVNEFADLTWEEFRKH 119

Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
            LG  +     A  +    L    LP   DWR+ G VT VK+QG CGSCW+FS TGALE 
Sbjct: 120 RLGAAQNC--SATLKGNHKLTNGLLPLKKDWREVGIVTPVKNQGHCGSCWTFSTTGALEA 177

Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
           A+  + G+ + LSEQQLVDC    +       + GCNGGL + AFEYI   GG++ E+ Y
Sbjct: 178 AYVQAFGKAIFLSEQQLVDCARAYN-------NFGCNGGLPSQAFEYIKANGGLDTEEAY 230

Query: 233 PYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAGNVAS 282
           PYTG D G CKF    I   V    N ++ + DE + A   V+   +A  V S
Sbjct: 231 PYTGVD-GVCKFSSENIGVQVLDSVNITLGAEDELKDAVAFVRPVSVAFEVVS 282


>gi|343477207|emb|CCD11901.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 93/233 (39%), Positives = 124/233 (53%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQG C S W+FSA G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD +         D GC GG  + AF++IL +  G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTD---------DFGCRGGFSDPAFKWILWSNKGNV 208

Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY    G   +CK     + A +SN   +  DED +   L + GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCKMSGKVVGAKISNRLYLPEDEDMITEWLARKGPVA 261


>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
          Length = 324

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 101/243 (41%), Positives = 135/243 (55%), Gaps = 17/243 (6%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVH---GVTKFSDLTPSEFRR 111
           +K K+ K+Y  + E   R RV+++NL+  ++  +L D    +   G+  ++DL   EF  
Sbjct: 22  WKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEEFMA 81

Query: 112 -QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +  G   + +  +  Q    L    LP+  DWR+ G VT VKDQG CGSCW+FSATG+L
Sbjct: 82  LKGSGGLLQAKDKSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWTFSATGSL 141

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG HF  TG L+SLSEQQLVDC            + GCNGGLM SA++YI   GGVE E 
Sbjct: 142 EGQHFAKTGNLLSLSEQQLVDCAGRYG-------NYGCNGGLMESAYDYIKGVGGVELES 194

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIELPHIS 289
            YPYT  D G CKFD+SK+ A    + VI   DE  +   +   GP+A    SI+    S
Sbjct: 195 AYPYTARD-GRCKFDRSKVVATCKGYVVIPVGDEQALMQAVGTIGPVA---VSIDASGYS 250

Query: 290 FSF 292
           F  
Sbjct: 251 FQL 253


>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
          Length = 469

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 86/248 (34%), Positives = 142/248 (57%), Gaps = 20/248 (8%)

Query: 39  GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-- 96
           GE+S+D +      +  +K++ +++Y   +E + R  +F+ NLR   +         +  
Sbjct: 36  GERSDDEV---HRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSF 92

Query: 97  --GVTKFSDLTPSEFRRQFLGLN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAV 149
             G+T+F+DLT  E+R  +LG+      RR      + +     ++DLP   DWRD GAV
Sbjct: 93  RLGLTRFADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAV 152

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
             VKDQG+CGSCW+FS   A+EG + + TG+L+SLSEQ+LVDCD           + GCN
Sbjct: 153 VDVKDQGSCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDT--------YYNQGCN 204

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
           GGLM+ AFE+I+  GG++ ++DYPYTG DG   ++ K+     + ++  +  ++++    
Sbjct: 205 GGLMDYAFEFIISNGGIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQK 264

Query: 270 LVKHGPLA 277
            V + P++
Sbjct: 265 AVANQPVS 272


>gi|1136312|gb|AAB41118.1| cruzipain [Trypanosoma cruzi]
          Length = 383

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 96/237 (40%), Positives = 123/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ANL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTAFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P + +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC GGLMN+AFE+I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258


>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
           Precursor
 gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
 gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
 gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 355

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 97/257 (37%), Positives = 143/257 (55%), Gaps = 22/257 (8%)

Query: 44  DHLLNAEHHFSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
           +HL N +    LF+S   + SK Y + EE  +RF VF+ NL    +R     +   G+ +
Sbjct: 39  EHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNE 98

Query: 101 FSDLTPSEFRRQFLGLNR----RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQG 156
           F+DLT  EF+ ++LGL +    R R P+   +   +   DLP   DWR  GAV  VKDQG
Sbjct: 99  FADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQG 156

Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
            CGSCW+FS   A+EG + ++TG L SLSEQ+L+DCD         + +SGCNGGLM+ A
Sbjct: 157 QCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDT--------TFNSGCNGGLMDYA 208

Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIA-AAVSNFSVISSDEDQMAANLVKHGP 275
           F+YI+  GG+ +E DYPY   + G C+  K  +    +S +  +  ++D+     + H P
Sbjct: 209 FQYIISTGGLHKEDDYPYL-MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQP 267

Query: 276 LAGNVASIELPHISFSF 292
           ++    +IE     F F
Sbjct: 268 VS---VAIEASGRDFQF 281


>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
 gi|1582620|prf||2119193A cathepsin L-related Cys protease
          Length = 324

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 104/254 (40%), Positives = 133/254 (52%), Gaps = 24/254 (9%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
           L  A   +  FK KF + Y   EE  YR  VF  NL+      K+ +  + T    + +F
Sbjct: 13  LAAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLAINQF 72

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP--TDFDWRDHGAVTGVKDQGACG 159
           SDLT  EF     G    LR P     A    T+  P  T+ DWR  G VT VKDQG CG
Sbjct: 73  SDLTNDEFNSMMKGYKTSLR-PKPV--AVFTSTDAAPETTEVDWRTKGCVTHVKDQGQCG 129

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC--DSGCNGGLMNSAF 217
           SCW+FSATG+LEG HFL  GELVSL+EQQLVDC        +G    + GCNGG +N AF
Sbjct: 130 SCWAFSATGSLEGQHFLKYGELVSLAEQQLVDC--------AGGIYYNQGCNGGWVNQAF 181

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPL 276
           +YI   GG++ E  YPY   D  +C+F+ + +AA  S F S+    E          GP+
Sbjct: 182 KYIKANGGIDTESSYPYEARD-NTCRFNSNSVAATCSGFVSIAQGSESPEVRRTTNTGPI 240

Query: 277 AGNVASIELPHISF 290
           +    +I+  H SF
Sbjct: 241 S---VAIDAAHRSF 251


>gi|195054270|ref|XP_001994049.1| GH22731 [Drosophila grimshawi]
 gi|193895919|gb|EDV94785.1| GH22731 [Drosophila grimshawi]
          Length = 617

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 91/238 (38%), Positives = 136/238 (57%), Gaps = 16/238 (6%)

Query: 45  HLLNA-EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFS 102
           H LN  EH F  F+ K+ + YA   EH  R R+F+ NLR  +     +  +A +G+T+F+
Sbjct: 302 HTLNKIEHLFHKFQLKYKRQYANTAEHQMRLRIFRQNLRTIEELNANERGSAKYGITQFA 361

Query: 103 DLTPSEFRRQFLGLNRRLRLPADAQKAPILP--TNDLPTDFDWRDHGAVTGVKDQGACGS 160
           D+T +E++    GL +R         A ++P    ++P +FDWR   AVT VK+QG CGS
Sbjct: 362 DMTSTEYKLH-AGLWQRSEDKPTGGAAAVVPPYAGEMPKEFDWRQKKAVTHVKNQGQCGS 420

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS TG +EG + + TGEL   SEQ+L+DCD         S DS CNGGLM++A++ I
Sbjct: 421 CWAFSVTGNIEGLYAIKTGELEEFSEQELLDCD---------STDSACNGGLMDNAYKAI 471

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
              GG+E E +YPY       C F+++     +S F  +   +E  M   L+ +GP++
Sbjct: 472 KDIGGLEYESEYPYAAKK-MQCHFNRTMSHVQLSGFVDLPKGNETAMQEWLLSNGPIS 528


>gi|29841177|gb|AAP06190.1| similar to GenBank Accession Number U07345 preprocathepsin L in
           Schistosoma mansoni [Schistosoma japonicum]
          Length = 356

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 92/220 (41%), Positives = 134/220 (60%), Gaps = 19/220 (8%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTP 106
           N    ++ FK  + K Y  + +++ RF +FK+NL +A+  Q+L+  +AV+GVT +SDLT 
Sbjct: 152 NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTT 210

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            EF R  L    R    A +++  I P     D+P +FDWR+ GAVT VK+QG CGSCW+
Sbjct: 211 DEFSRTHLTAPWR----ASSKRNTISPRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWA 266

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG +E   F  TG+L+SLSEQQLVDCD         S D GCNGGL ++A+E I++ 
Sbjct: 267 FSTTGNIESQWFRKTGKLLSLSEQQLVDCD---------SLDDGCNGGLPSNAYESIIRM 317

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDE 263
           GG+  E +YPY   +   C    + +AA +++   ++ DE
Sbjct: 318 GGLMLEDNYPYDAKN-EKCHLKVANVAAYINSSVNLTQDE 356


>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
          Length = 505

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 89/244 (36%), Positives = 139/244 (56%), Gaps = 14/244 (5%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           ++ F  +  +F K Y   E    RF +FK+N+         +   V G+   +DLT  E+
Sbjct: 178 KNEFENWIDRFEKKYDVSE-FKKRFSIFKSNMDFVHSWNSKNSQTVLGLNHLADLTNLEY 236

Query: 110 RRQFLGLNRR--LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           R+ +LG +++  L  P + + + +          DWR  GAV+ +KDQG CGSCWSFS T
Sbjct: 237 RQFYLGTHKKAVLGTPGNHEVSNLQSVFGDSATVDWRQKGAVSPIKDQGQCGSCWSFSTT 296

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           G++EGAH + +G +V LSEQ LVDC        +   + GCNGGLM+ AFEYI+   G++
Sbjct: 297 GSVEGAHQIKSGNMVELSEQNLVDC-------STSEGNMGCNGGLMDYAFEYIITNNGID 349

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIELP 286
            E  YPYT + G +CK++K+   A +S++  I++  +   A+ VK+ GP++    +I+  
Sbjct: 350 TESSYPYTASSGTTCKYNKANSGATISSYKNITAGSESDLADAVKNAGPVS---VAIDAS 406

Query: 287 HISF 290
           H SF
Sbjct: 407 HNSF 410


>gi|195497262|ref|XP_002096026.1| GE25302 [Drosophila yakuba]
 gi|194182127|gb|EDW95738.1| GE25302 [Drosophila yakuba]
          Length = 615

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 88/242 (36%), Positives = 140/242 (57%), Gaps = 15/242 (6%)

Query: 40  EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGV 98
           + S   L  A+H F  F+ +F + Y +  E   R R+F+ NL+  ++  + +  +A +G+
Sbjct: 296 KHSHRALDKADHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAKYGI 355

Query: 99  TKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQG 156
           T+F+D+T SE++ +  GL +R    A      ++P    +LP +FDWR   AVT VK+QG
Sbjct: 356 TEFADMTSSEYKER-TGLWQRNEAKATGGSVAVVPAYHGELPKEFDWRQKNAVTQVKNQG 414

Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
           +CGSCW+FS TG +EG H + TG+L   SEQ+L+DCD         + DS CNGGLM++A
Sbjct: 415 SCGSCWAFSVTGNIEGLHAVKTGDLKEFSEQELLDCD---------TTDSACNGGLMDNA 465

Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGP 275
           ++ I   GG+E E +YPY       C F+++     V+ F  +   +E  M   L+ +GP
Sbjct: 466 YKAIKDIGGLEYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGP 524

Query: 276 LA 277
           ++
Sbjct: 525 IS 526


>gi|38683931|gb|AAR27011.1| cysteine protease [Periserrula leucophryna]
          Length = 283

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 94/208 (45%), Positives = 124/208 (59%), Gaps = 16/208 (7%)

Query: 73  RFRVFKANLRRAKR---RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA 129
           RF++F+ N+++       +L D  A +GVT+FSDL   EFRR +L     L    D  +A
Sbjct: 2   RFKIFRENMKKINTLNDNELGD--AEYGVTQFSDLAEEEFRRYYLTPKWDLSHRPDLVRA 59

Query: 130 PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
            I P  D P  FDWRDH AVT VK+QG CGSCW+FS T  +EG   +   +LVSLSEQ+L
Sbjct: 60  KI-PDVDPPASFDWRDHNAVTPVKNQGMCGSCWAFSTTENIEGQWAIHRNKLVSLSEQEL 118

Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
           VDCD           D GC GGL  +A+E I++ GG+E EK YPY   D   CKF    +
Sbjct: 119 VDCD---------KLDDGCEGGLPVNAYEEIIRLGGLESEKKYPYDAED-EKCKFTVGDV 168

Query: 250 AAAVSNFSVISSDEDQMAANLVKHGPLA 277
           A  +++   ISS+E  MAA L K+GP++
Sbjct: 169 AVYINSSVNISSNEADMAAWLYKNGPIS 196


>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
 gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
          Length = 349

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 97/281 (34%), Positives = 149/281 (53%), Gaps = 22/281 (7%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M   +L +  L   +S+   +V  +D   +           S +HL + +    LF+S  
Sbjct: 1   MALSVLKTSFLTFFASLFVCSVLAHDFSIV---------GYSPEHLTSVDKLVELFESWI 51

Query: 61  S---KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
           S   K Y + EE  +RF VFK NL+   +R     +   G+ +F+DL+  EF+ +FLGL 
Sbjct: 52  SGHGKAYNSLEEKLHRFEVFKENLKHIDQRNKEVTSYWLGLNEFADLSHEEFKSKFLGLY 111

Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
                   ++        DLP   DWR  GAVT VK+QG+CGSCW+FS   A+EG + + 
Sbjct: 112 PEFPRKKSSEDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIV 171

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
            G L SLSEQQL+DCD         S ++GCNGGLM+ AFE+I+  GG+ +E+DYPY   
Sbjct: 172 AGNLTSLSEQQLIDCDT--------SFNNGCNGGLMDYAFEFIVNNGGLHKEEDYPYL-M 222

Query: 238 DGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
           + G+C   + ++    +S +  +  +++Q     + H PL+
Sbjct: 223 EEGTCDEKREEMEVVTISGYHDVPRNDEQSLLKALAHQPLS 263


>gi|194746631|ref|XP_001955780.1| GF16067 [Drosophila ananassae]
 gi|190628817|gb|EDV44341.1| GF16067 [Drosophila ananassae]
          Length = 620

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 89/236 (37%), Positives = 135/236 (57%), Gaps = 15/236 (6%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDL 104
           L   EH F  F+ +F + Y +  E   R R+F+ NL+  +     +  +A +G+T+F+D+
Sbjct: 307 LDKVEHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADM 366

Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILP--TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           T +E++ +  GL +R    A      ++P  + +LP +FDWR   AVTGVK+QG CGSCW
Sbjct: 367 TSTEYKER-TGLWQRDEAKATGGSPAVVPAYSGELPKEFDWRSKNAVTGVKNQGQCGSCW 425

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FS TG +EG + L  GEL   SEQ+L+DCD         + DS CNGGLM++A++ I  
Sbjct: 426 AFSVTGNIEGLYALKYGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 476

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
            GG+E E +YPY       C F+K+     V +F  +   +E  M   LV +GP++
Sbjct: 477 IGGLEYEAEYPYEAKK-KQCHFNKTMSHVQVKDFVDLPKGNETAMQEWLVSNGPIS 531


>gi|195453400|ref|XP_002073772.1| GK14287 [Drosophila willistoni]
 gi|194169857|gb|EDW84758.1| GK14287 [Drosophila willistoni]
          Length = 610

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/245 (36%), Positives = 139/245 (56%), Gaps = 16/245 (6%)

Query: 39  GEQSEDH--LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAV 95
           G +  +H  L   EH F  F+ KF + Y    E   R R+F+ NLR  ++    +  +A 
Sbjct: 287 GHKKHNHHSLDKVEHLFHKFQIKFERRYVNSVERQMRLRIFRQNLRIIEQLNANEMGSAK 346

Query: 96  HGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA--PILPTNDLPTDFDWRDHGAVTGVK 153
           +G+T+F+D+T +E++ +     R    P   QKA  P  P  +LP +FDWR  GAV+ VK
Sbjct: 347 YGITEFADMTSTEYKERTGLWQRTEGQPTGGQKAVVPSYPGGELPKEFDWRQKGAVSSVK 406

Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
           +QG+CGSCW+FS  G +EG + + TG+L   SEQ+L+DCD         + DS CNGGL 
Sbjct: 407 NQGSCGSCWAFSTIGNIEGLNAVKTGQLKEFSEQELLDCD---------TKDSACNGGLP 457

Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVK 272
           ++A++ I + GG+E E +YPY       C F+K+     V+ F  +  ++E  M   L+ 
Sbjct: 458 DNAYKAIQEIGGLEYESEYPYKARK-EQCHFNKTLAHVQVTGFVDLPKNNETAMQEWLIA 516

Query: 273 HGPLA 277
           +GP++
Sbjct: 517 NGPIS 521


>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
 gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
          Length = 471

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 87/228 (38%), Positives = 130/228 (57%), Gaps = 11/228 (4%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           + ++  +  K Y    E + RF +FK NLR       +D +   G+ +F+DLT  E++  
Sbjct: 51  YEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSVDRSYKVGLNRFADLTNEEYKAM 110

Query: 113 FLG--LNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
           FLG  + R+ R L   +Q+      +DLP + DWR+ GAV  VKDQG CGSCW+FS  GA
Sbjct: 111 FLGTKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGSCWAFSTVGA 170

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG + + TGEL+SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG++ E
Sbjct: 171 VEGINQIVTGELISLSEQELVDCDK--------SYNQGCNGGLMDYAFEFIINNGGIDTE 222

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +DYPY  +D       K+     +  +  +  +++      V H P++
Sbjct: 223 EDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVS 270


>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
 gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
          Length = 496

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 87/236 (36%), Positives = 139/236 (58%), Gaps = 15/236 (6%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFR 110
            F  F   F K Y +++E   R+ +FK N++  +  Q  +  TAV+GVT F+DLTP EFR
Sbjct: 195 QFKEFLKTFKKWYLSEKELLKRYDIFKVNMKTVEMLQKNEQGTAVYGVTFFADLTPEEFR 254

Query: 111 RQFLGLN-RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
           + +L    +R +LP   Q+   +P   +   +DWR+H AVT VK+QG CGSCW+F+    
Sbjct: 255 KFYLSPQWKRDQLP---QRKASIPKGKIEDRWDWREHNAVTEVKNQGMCGSCWAFATIAN 311

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG   +  GELVSLSEQ+LVDCD         + D GC+GG  ++A++ I++ GG+  E
Sbjct: 312 VEGVWAVKKGELVSLSEQELVDCD---------TLDQGCSGGYPSNAYKEIIRLGGLTTE 362

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
            +Y Y G + G+C+F        +++   +  DE ++AA + ++GP+A  + +  +
Sbjct: 363 TNYSYDG-NQGTCRFKTQNAKVYINDSVSLPEDETEIAAYIRENGPVAVGINAFAM 417


>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
          Length = 329

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 94/234 (40%), Positives = 136/234 (58%), Gaps = 15/234 (6%)

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
           SK+Y + EE  +R+ V++ N +  +     + T+   + KF DLT +EF + F GL    
Sbjct: 38  SKSY-SNEEFVFRWNVWRENQQLIEEHNRSNKTSFLAMNKFGDLTNAEFNKLFKGLAFDY 96

Query: 121 RLPADAQKA-PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
              A+   A   +P   L  DFDWR  GAVT VK+QG CGSCWSFS TG+ EGA+FL TG
Sbjct: 97  SFHANKAAAEKAVPAPGLSADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTG 156

Query: 180 ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
            L SLSEQ L+DC        SGS  ++GCNGGLM+ AFEYI+   G++ E  YPY  T 
Sbjct: 157 RLTSLSEQNLIDC--------SGSYGNNGCNGGLMDYAFEYIINNKGIDTEASYPYQ-TA 207

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
             +C+++ +    ++++++ +SS ++    N V   P +    +I+  H SF F
Sbjct: 208 QYTCQYNPANSGGSLTSYTDVSSGDENALLNAVATEPTS---VAIDASHNSFQF 258


>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 334

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 99/242 (40%), Positives = 133/242 (54%), Gaps = 27/242 (11%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP---TAVH----GVT 99
           LN E  F  +K  F K+Y+   E   R  V++AN      + L+D      +H    G+ 
Sbjct: 26  LNME--FEAWKRTFGKSYSDAVEEINRRAVWEAN------KMLVDAHNGAGIHSYTLGMN 77

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQG 156
            F+DLT  EF+R +LG    L  P     +  +PT +   LP   DWR  G VT VKDQG
Sbjct: 78  IFADLTHEEFKRFYLGTKVDLNRPRSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQG 137

Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
            CGSCWSFS TG++EG H   TG+LVSLSEQ LVDC            + GCNGGLM+ A
Sbjct: 138 QCGSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSK-------AQGNQGCNGGLMDDA 190

Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GP 275
           F+YI+   G++ E  YPYT  D G+CKF+ + + A +S+F  I+   +    N V   GP
Sbjct: 191 FQYIITNKGIDTEASYPYTAKD-GTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGP 249

Query: 276 LA 277
           ++
Sbjct: 250 VS 251


>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 385

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 100/251 (39%), Positives = 135/251 (53%), Gaps = 19/251 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN---LRRAKRRQLLDPTAVHGVTKFSDL 104
           N   H+  FK++ +K Y +  E   R  +F+ N   +     ++  D     G+  F DL
Sbjct: 76  NLNQHWENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKEFD--FYLGMNHFGDL 133

Query: 105 TPSEFRRQFLGLNRRLRLPADAQK--APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           T  E+R ++LG  R    P+ A    +      D+P   DWRD G VT VK+QG CGSCW
Sbjct: 134 TNKEYRERYLGYRRPENTPSKASYIFSRAEKIEDVPDQIDWRDQGFVTPVKNQGQCGSCW 193

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA G+LEG HF STG+LVSLSEQ LVDC     PE     +SGCNGG M+ AFEY+  
Sbjct: 194 AFSAVGSLEGQHFKSTGKLVSLSEQNLVDCS---TPE----GNSGCNGGWMDQAFEYVKD 246

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVA 281
             G++ E  YPY GTD GSC F    I A +  F  V   DE+ +   +   GP++    
Sbjct: 247 NHGIDTEDSYPYVGTD-GSCHFKNKSIGATLKGFMDVKEGDEEALRQAVGVAGPVS---V 302

Query: 282 SIELPHISFSF 292
           +I+   + F F
Sbjct: 303 AIDASSMLFQF 313


>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 496

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 99/279 (35%), Positives = 154/279 (55%), Gaps = 20/279 (7%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
           + ++++LLL     ++SA+   D   +      +   +S++ L++    + +   K  K 
Sbjct: 36  MAMATILLLFTVFAVSSAL---DMSIISYDNAHAATSRSDEELMSMYEQWLV---KHGKV 89

Query: 64  YATQEEHDYRFRVFKANLRRAK-RRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NR 118
           Y    E + RF++FK NLR         D T   G+ +F+DLT  E+R ++LG     NR
Sbjct: 90  YNALGEKEKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTNEEYRAKYLGTKIDPNR 149

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
           RL      + AP +  + LP   DWR  GAV  VKDQG CGSCW+FSA GA+EG + + T
Sbjct: 150 RLGKTPSNRYAPRV-GDKLPESVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVT 208

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GEL+SLSEQ+LVDCD           + GCNGGLM+ AFE+I+  GG++ E+DYPY G D
Sbjct: 209 GELISLSEQELVDCDT--------GYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRGVD 260

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G    + K+    ++ ++  + + ++      V + P++
Sbjct: 261 GRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVS 299


>gi|13507095|gb|AAK28439.1| cysteine protease 3 precursor [Clonorchis sinensis]
          Length = 320

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 100/241 (41%), Positives = 142/241 (58%), Gaps = 13/241 (5%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
           NA   +  FK K+ K+Y+  ++ +YRFRVFK NL R K+ Q ++  TA +GVT+FSDLT 
Sbjct: 26  NARQLYEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 84

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            EF+ ++L  ++   +P D +  P +  +    +FDWR+HGAV  V DQG CGSCW+FSA
Sbjct: 85  QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 143

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
            G +EG  F  T  L+ LSEQQL+DCD           D GCNGG    AF  IL  GG+
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCD---------GVDEGCNGGTPQQAFRQILGMGGL 194

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELP 286
           + + DYPY G + G C+   SK+   ++   ++  DE   A  L + GPL+  + ++ L 
Sbjct: 195 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFLQ 253

Query: 287 H 287
           H
Sbjct: 254 H 254


>gi|89272015|emb|CAJ83143.1| cathepsin L2 [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 92/249 (36%), Positives = 139/249 (55%), Gaps = 18/249 (7%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           ++H++L+K+   K+YA +EE  +R  +++ NLR  +   L      H    G+ +F D+T
Sbjct: 26  DNHWNLWKNWHKKSYAPKEE-GWRRVLWEKNLRMIEFHNLEHSLGKHSHSLGMNQFGDMT 84

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
             EFR+   G   + ++      AP     + P   DWR  G VT VKDQG CGSCW+FS
Sbjct: 85  NEEFRQLMNGYKNQKKIRGSTFLAP--NNFESPKSVDWRKKGYVTPVKDQGQCGSCWAFS 142

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGALEG H+ +TG+++SLSEQ LVDC            + GCNGGLM+ AF+Y+   GG
Sbjct: 143 TTGALEGQHYRNTGKMISLSEQNLVDC-------SRAQGNQGCNGGLMDQAFQYVKDNGG 195

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIE 284
           ++ E  YPYT  D   C +D +  +A  + F  ++S+ ++   N V   GP++    +++
Sbjct: 196 IDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSESEKDLMNAVASVGPVS---VAVD 252

Query: 285 LPHISFSFL 293
             H SF F 
Sbjct: 253 AGHQSFQFY 261


>gi|118395092|ref|XP_001029901.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89284178|gb|EAR82238.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 344

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 85/239 (35%), Positives = 129/239 (53%), Gaps = 23/239 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FKSKF+K Y  + EH   F  +K +     + Q+ +P A  G TKFSD++P EF  +
Sbjct: 33  FEEFKSKFNKYYHNEHEHHSSFHNYKTSREHIVKHQMENPNAKFGHTKFSDMSPEEFENK 92

Query: 113 FLGLN---------RRLRLPADAQKAPI-----LPTNDLPTDFDWRDHGAVTGVKDQGAC 158
            L  +         + ++L A+  K  +     +  +DLP  FDWRD G +T  K Q  C
Sbjct: 93  MLNFDFSLFKKAKSQGIKLKAEPMKGYLRQGENVDNSDLPESFDWRDKGIITPAKFQNTC 152

Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
           GSCW+F+ TG +E  + L  GEL+  SEQ L+DCD         + + GC GGLM  A++
Sbjct: 153 GSCWTFATTGVIESQYALKYGELLHFSEQMLLDCD---------NINQGCRGGLMTDAYQ 203

Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           ++ ++GG++    Y         C FDK+K+ A V ++  I  +E+ +   LVK+GP+A
Sbjct: 204 FLQQSGGIQTADTYGDYKNKKDICNFDKAKVKAKVVDWYQIPENEETIRRELVKNGPVA 262


>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 89/246 (36%), Positives = 140/246 (56%), Gaps = 18/246 (7%)

Query: 54  SLFKSKF---SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEF 109
           +LF+S      K+Y    E + RF++FK NLR    + L++      G+ KF+DLT  E+
Sbjct: 43  TLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEY 102

Query: 110 RRQFLGLNR---RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           R ++ G+     R ++ A + +   L    LP   DWR+ GAV  VKDQG+CGSCW+FS 
Sbjct: 103 RSKYTGIKSKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQGSCGSCWAFST 162

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
             A+EG + ++TG+L++LSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG+
Sbjct: 163 ISAVEGINQIATGKLITLSEQELVDCDR--------SYNEGCNGGLMDYAFEFIINNGGI 214

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELP 286
           + + DYPYTG DG   ++ K+     + ++  + + ++        + P++    +IE  
Sbjct: 215 DTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPIS---VAIEAS 271

Query: 287 HISFSF 292
              F F
Sbjct: 272 GRDFQF 277


>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 463

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 100/246 (40%), Positives = 138/246 (56%), Gaps = 35/246 (14%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQ--SEDHLLNAEHHFSLFKSKF 60
           R +  SL+LL++      A+    D      +V  +G Q  S+D +L+  H +       
Sbjct: 6   RALGLSLVLLVI------AIGQQADAGRANAIVDYEGNQLHSDDAILDVFHQWL---ETH 56

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG---LN 117
           S+ Y +  E  +RF++FK N            +   G+ KFSDLT  EFR Q+LG   +N
Sbjct: 57  SRVYRSLSEKHHRFQIFKENFLYIHAHNKQQKSYWLGLNKFSDLTHQEFRAQYLGTKPVN 116

Query: 118 RRLR----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
           R+ +    +  D +  P +         DWR  GAVT VKDQGACGSCW+FSA G++EG 
Sbjct: 117 RQRKEANFMYEDVEAEPKV---------DWRLKGAVTDVKDQGACGSCWAFSAVGSVEGV 167

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
           + + TGELVSLSEQ+LVDCD +         + GCNGGLM+ AFE+I+K GG++ EKDYP
Sbjct: 168 NAIKTGELVSLSEQELVDCDRK--------QNQGCNGGLMDYAFEFIIKNGGIDTEKDYP 219

Query: 234 YTGTDG 239
           Y   DG
Sbjct: 220 YKARDG 225


>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
          Length = 360

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 113/298 (37%), Positives = 155/298 (52%), Gaps = 32/298 (10%)

Query: 11  LLLLSSVLASA--VAVNDDDAMIRQVVPSDGEQSEDHLLNA---------EHHFSLFKSK 59
           +L + SVLA A    V   +     +  +     + H+L A         E  +  FK  
Sbjct: 3   VLWIVSVLAVARGATVQTGNVQWFDLEAAQKHPEQLHILKAKAGINYQPYEQAWKEFKIL 62

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLTPSEFRRQFLG 115
             KTY   EE   RF +F+ N+++ +    L      +   GV +FSDL   EF + + G
Sbjct: 63  HDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHEEFVK-YNG 121

Query: 116 LNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
           L ++  L  D   +  L  N+L  P   DWR  G VT VK+QG CGSCWSFS TG+LEG 
Sbjct: 122 L-KKTSLK-DGGCSSYLAANNLVEPDSVDWRKKGYVTDVKNQGQCGSCWSFSTTGSLEGQ 179

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
           HF  +G+LVSLSE QLVDC      E       GCNGGLM++AF+YI   GG+E E+DYP
Sbjct: 180 HFRKSGKLVSLSESQLVDCSQSFGNE-------GCNGGLMDNAFKYIKSVGGLESEEDYP 232

Query: 234 YTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
           Y     G+CKFD +K+AA  +    V S  E  +   + + GP++    +I+  H SF
Sbjct: 233 YKPKQ-GTCKFDDTKVAATDTGCVDVESGSESALKKAVSEVGPVS---VAIDASHSSF 286


>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 431

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 84/219 (38%), Positives = 126/219 (57%), Gaps = 8/219 (3%)

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
           K  K   +  E D RF +FK NLR        + +   G+TKF+DLT  E+R  +LG   
Sbjct: 48  KHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRL 107

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
           + +    + +      + +P   DWR  GAV  VKDQG+CGSCW+FS  GA+EG + + T
Sbjct: 108 KRKATKTSLRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVT 167

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+L+SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+K GG++ E+DYPY G D
Sbjct: 168 GDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGVD 219

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G   +  K+     + ++  + ++ ++     + H P++
Sbjct: 220 GRCDQTRKNAKVVTIDSYEDVPANSEESLKKALSHQPIS 258


>gi|343474734|emb|CCD13687.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 524

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 89/233 (38%), Positives = 127/233 (54%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFR+FK ++ RAK     +P A  GVT+FSD++P EF
Sbjct: 117 QQQFAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKEEAAANPYATFGVTQFSDMSPEEF 176

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +L G           +K   + T   P   DWR  GAVT VKDQG+CGSCW+F+A G
Sbjct: 177 RATYLNGAKYYAAALKRPRKVVNVSTGKAPPAVDWRKKGAVTPVKDQGSCGSCWAFAAIG 236

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD         + +  C GG  + AF++I+ +  G V
Sbjct: 237 NIEGQWKIAGHELTSLSEQMLVSCD---------TTEDNCGGGFADRAFKWIVSSNKGNV 287

Query: 227 EREKDYPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY   DG    C      + A +S    +  DE+ +A  L ++GP+A
Sbjct: 288 FTERSYPYASIDGYVPPCNKSGKVVGAKISGHINLPKDENAIAEWLARNGPVA 340


>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
 gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
          Length = 356

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 98/258 (37%), Positives = 140/258 (54%), Gaps = 21/258 (8%)

Query: 41  QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
           +S+D +++    +  +  K  K Y    E   RF +FK NLR        + T   G+TK
Sbjct: 19  RSDDEVMSI---YKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQNRTYKVGLTK 75

Query: 101 FSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQ 155
           F+DLT  E+R  FLG      RRL    +  +       D LP   DWR  GAV  +KDQ
Sbjct: 76  FADLTNQEYRAMFLGTRSDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAVNPIKDQ 135

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
           G+CGSCW+FS   A+EG + + TGEL+SLSEQ+LVDCD           ++GCNGGLM+ 
Sbjct: 136 GSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDR--------FYNAGCNGGLMDY 187

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHG 274
           AF++I+  GG++ EKDYPY G D  +C  DK K  A ++  F  +   +++     V H 
Sbjct: 188 AFQFIINNGGLDTEKDYPYLGND-DTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQ 246

Query: 275 PLAGNVASIELPHISFSF 292
           P++    +IE   ++  F
Sbjct: 247 PVS---VAIEASGMALQF 261


>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
 gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
          Length = 330

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 96/254 (37%), Positives = 135/254 (53%), Gaps = 20/254 (7%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFS 102
           L+ E  +  FK K  K Y+ +EE+  R  +F+ NL+  +       T  H    GV +F+
Sbjct: 18  LSFESQWEAFKIKHDKVYSEKEEYARRL-IFQDNLKTIESHNQEADTGKHSYWLGVNQFA 76

Query: 103 DLTPSEFRRQFLG---LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
           D+T +E+  Q +G   +   L           +P   +    DWRD G VT +KDQG CG
Sbjct: 77  DMTHAEYLNQVIGGCLITSNLTKTGSRATYRYMPNMQVNDTVDWRDKGLVTDIKDQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FS TG+LEG H  +TG LVSLSEQ LVDC  +         + GC GG M+  F+Y
Sbjct: 137 SCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQ-------EGNKGCEGGDMDQGFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAG 278
           I++  G++ E+ YPY   +   CKFD S I A +S+F+ V S DED +       GP++ 
Sbjct: 190 IIQNKGIDTEQCYPYKAKN-HRCKFDNSCIGATMSSFTDVTSGDEDALKQACANIGPIS- 247

Query: 279 NVASIELPHISFSF 292
               I+  H SF F
Sbjct: 248 --VGIDASHQSFQF 259


>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 359

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 106/279 (37%), Positives = 151/279 (54%), Gaps = 20/279 (7%)

Query: 3   RLIL-SSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLF 56
           R IL S++LL+L+++  A ++   D+   IR V     + E+S   +L    H   F+ F
Sbjct: 5   RTILPSAVLLILIAASTAESIGF-DESNPIRMVSDRLREVEESVVQILGQSRHVISFARF 63

Query: 57  KSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL 116
             ++ K Y   EE   RF +FK NL   +       +   GV +F+D+T  EF+R  LG 
Sbjct: 64  AHRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADMTWQEFQRTKLGA 123

Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
            +     A  +    L    LP   DWR+ G V+ VKDQG CGSCW+FS TGALE A+  
Sbjct: 124 AQNC--SATLKGTHKLTGEALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQ 181

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
           + G+ +SLSEQQLVDC    +       + GCNGGL + AFEYI   GG++ E+ YPYTG
Sbjct: 182 AFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 234

Query: 237 TDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
            D G+CK+    +   V    N ++ + DE + A  LV+
Sbjct: 235 ED-GTCKYSAENVGVEVLDSVNITLGAEDELKHAVGLVR 272


>gi|344271925|ref|XP_003407787.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
          Length = 333

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 96/254 (37%), Positives = 134/254 (52%), Gaps = 19/254 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
           D  L+A+  ++ ++S + K YA  EE D+R  V++ N++  +R         HG T    
Sbjct: 22  DQSLDAQ--WNQWRSTYKKVYAVNEE-DWRRAVWEKNMKMIERHNQEYSQGKHGFTMAMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D T  EFR+   G   +          P+     +PT  DW   G VT VKDQG CG
Sbjct: 79  AFGDKTNEEFRQLMNGFQSQKHKKGKLFYEPVF--GHIPTSVDWTQKGYVTPVKDQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSATGALEG  F  TG+LVSLSEQ LVDC            + GCNGGLM++AF+Y
Sbjct: 137 SCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWR-------EGNEGCNGGLMDNAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           +   GG++ E+ YPYT TD   C+++    AA  + F  I   E  +   +   GP++  
Sbjct: 190 VKDNGGLDSEESYPYTATDTQDCRYNPKYSAANDTGFVDIPPQEKALMKAVATVGPIS-- 247

Query: 280 VASIELPHISFSFL 293
             +I+   +SF F 
Sbjct: 248 -VAIDAGQVSFQFY 260


>gi|328868405|gb|EGG16783.1| cysteine protease 4 [Dictyostelium fasciculatum]
          Length = 454

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 89/240 (37%), Positives = 128/240 (53%), Gaps = 13/240 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F+ +  K  + Y++ E    R+ VFK N+   +         V G+  F+D++  E++R 
Sbjct: 30  FTSWMQKQGRVYSSHE-FGARYNVFKKNMDYVQEWNSKGSETVLGLNVFADISNEEYQRI 88

Query: 113 FLG--LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
           +LG  ++   RL A A               DWR  GAVT +K+QG CGSCWSFS TG+ 
Sbjct: 89  YLGTKVDGTARLAAAASTTMDRIYEVQAATVDWRQQGAVTAIKNQGQCGSCWSFSTTGST 148

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EGAHFLST  LVSLSEQ L+DC        +   + GCNGGLM  AF YI+K GG++ E 
Sbjct: 149 EGAHFLSTKNLVSLSEQNLIDCS-------TAEGNQGCNGGLMTQAFTYIIKNGGIDTEA 201

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
            YPY    G  C ++ +  AA +S ++ ++S  +   A      P++    +I+  H SF
Sbjct: 202 SYPYKAVQGKKCLYNTANKAATISKYTEVTSGSEAALATAANAAPIS---VAIDASHNSF 258


>gi|195395906|ref|XP_002056575.1| GJ11017 [Drosophila virilis]
 gi|194143284|gb|EDW59687.1| GJ11017 [Drosophila virilis]
          Length = 599

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 90/236 (38%), Positives = 136/236 (57%), Gaps = 15/236 (6%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDL 104
           L   +H F  F+ K+ + YA   EH  R R+F+ +L+  +     +  +A +G+T+F+D+
Sbjct: 286 LNKVDHLFHKFQVKYKRRYANSAEHQMRLRIFRQSLKTIQELNANEQGSAKYGITEFADM 345

Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           T +E+  Q  GL +R         A ++P    +LP +FDWR   AVT VK+QG CGSCW
Sbjct: 346 TSTEYA-QRAGLWQRSEGKPTGGAAAVVPAYAGELPKEFDWRQKNAVTHVKNQGQCGSCW 404

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FS TG +EGA+ + TG+L   SEQ+L+DCD         S DS CNGGLM++A++ I  
Sbjct: 405 AFSVTGNIEGAYAIKTGDLQEFSEQELLDCD---------SKDSACNGGLMDNAYKAIKD 455

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
            GG+E E +YPY G     C F+++     VS F  +   +E  M   L+ +GP++
Sbjct: 456 IGGLEYESEYPYEGKK-KQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTNGPIS 510


>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
 gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
 gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
          Length = 344

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 89/234 (38%), Positives = 128/234 (54%), Gaps = 16/234 (6%)

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           K+Y T EE   R+ +FKAN+   ++        V G+  F+D+T  E+R  +LG      
Sbjct: 39  KSY-TSEEFGARYNIFKANMDYVQQWNSKGSETVLGLNNFADITNEEYRNTYLGTKFDAS 97

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
                Q+  +  T+   +  DWR  GAVT VK+QG CG CWSFS TG+ EGAHF S GEL
Sbjct: 98  SLIGTQEEKVFTTSSAASK-DWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGEL 156

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQ L+DC  E         +SGC+GGLM  AFEYI+   G++ E  YPY   + G 
Sbjct: 157 VSLSEQNLIDCSTE---------NSGCDGGLMTYAFEYIINNNGIDTESSYPYKA-ENGK 206

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
           C++      A +S++  +++  +    + V   P++    +I+  H SF  L+T
Sbjct: 207 CEYKSENSGATLSSYKTVTAGSESSLESAVNVNPVS---VAIDASHQSFQ-LYT 256


>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
          Length = 344

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 91/248 (36%), Positives = 137/248 (55%), Gaps = 16/248 (6%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+    A   ++ + +   +TY    E + RF VF+ NLR            
Sbjct: 31  IVSYGERSEEE---ARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAG 87

Query: 95  VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAV 149
           VH    G+ +F+DLT  E+R  +LG+  R +         +   N DLP   DWR  GAV
Sbjct: 88  VHSFRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAV 147

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
             VKDQG+CGSCW+FS   A+EG + + TG+++SLSEQ+LVDCD         S + GCN
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDT--------SYNQGCN 199

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
           GGLM+ AFE+I+  GG++ E+DYPY GTDG      K+     + ++  + ++ ++    
Sbjct: 200 GGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQK 259

Query: 270 LVKHGPLA 277
            V + P++
Sbjct: 260 AVANQPIS 267


>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 323

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 102/253 (40%), Positives = 142/253 (56%), Gaps = 24/253 (9%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTKFSDL 104
           A  ++ L+K    K+Y   EEH +R ++F  ++ +      R  L   T   G+ KF+D+
Sbjct: 15  ASANWDLYKKVHGKSYGHDEEH-FRRQLFYKSVAKINAHNLRHDLGLTTYRMGLNKFTDM 73

Query: 105 TPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
           T  EFR  F GL     +  R     QK   L    LPT  DWR+ G VT VK+QG CGS
Sbjct: 74  TSEEFR-NFKGLKFDATKTKRNGTRFQKE--LLGEALPTQVDWREKGYVTPVKNQGQCGS 130

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS TG+LEG HF +TG+LVSLSEQ LVDC            ++GCNGGLM++ F YI
Sbjct: 131 CWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSRV-------EGNNGCNGGLMDNGFTYI 183

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGN 279
            + GG++ E+ YPYTG D G C F+++ + A V  F  V   DE  + A +   GP++  
Sbjct: 184 QQNGGIDTEESYPYTGKD-GDCAFNENSVGARVKGFVDVPQRDEAALQAAVASVGPVS-- 240

Query: 280 VASIELPHISFSF 292
             +I+  + SF +
Sbjct: 241 -VAIDASNDSFQY 252


>gi|281207557|gb|EFA81740.1| hypothetical protein PPL_05734 [Polysphondylium pallidum PN500]
          Length = 387

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 90/232 (38%), Positives = 130/232 (56%), Gaps = 20/232 (8%)

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           T +E ++R+ VFK NL    +      + V G+  F+DLT +E++R +LG         +
Sbjct: 46  TTQEFNHRYGVFKKNLNFVNQWNAKGSSTVLGMNVFADLTNAEYQRIYLGSKIDTSSMMN 105

Query: 126 AQKAPIL----PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
           A  A +         L    DWR  GAVT +K+Q  CGSCWSFS TG++EGAH ++TG L
Sbjct: 106 ANAARLFDRTYNVKALSPTVDWRQKGAVTHIKNQQQCGSCWSFSTTGSIEGAHEIATGNL 165

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQ L+DC        +   + GCNGGLM +AFEY++K GG++ E  YPY+ T    
Sbjct: 166 VSLSEQNLIDC-------STAEGNQGCNGGLMTNAFEYVIKNGGIDTEASYPYSATGPNK 218

Query: 242 CKFDKSKIAAAVS---NFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
           C+++ +   A +S   N +V S      AAN+   GP++    +I+  H SF
Sbjct: 219 CRYNPANSGATISSYVNVTVGSETALMAAANI---GPVS---VAIDASHNSF 264


>gi|332375406|gb|AEE62844.1| unknown [Dendroctonus ponderosae]
          Length = 320

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 99/232 (42%), Positives = 133/232 (57%), Gaps = 21/232 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANL-----RRAKRRQLLDPTAVHGVTKFSDLTPS 107
           F  FK K +KTY T  E   R+ +F+A L       ++  Q L+ T   GV KFSD T  
Sbjct: 23  FQAFKLKQNKTYKTPVEETTRYGIFQAKLLEIEEHNSRFEQGLE-TYKKGVNKFSDWTQD 81

Query: 108 EFRRQFLGLNRRLRLPADAQKA-PILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF   +LGL+ +   PA   K  P + T   +P   DWR  G VTGVK+QG CGSCW+FS
Sbjct: 82  EFN-AYLGLHPK---PAKLGKGIPYVKTGVSVPASVDWRTEGYVTGVKNQGDCGSCWAFS 137

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TG++EGA F STG+LVSLSEQQLVDC +       G+ + GC+GG +   F YI +  G
Sbjct: 138 LTGSVEGALFKSTGKLVSLSEQQLVDCTY-------GTVNFGCDGGYLEETFPYIQET-G 189

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +E E  YPY   D G+CKFD SK+   ++++     DE+ +       GP++
Sbjct: 190 LEAEASYPYKARD-GTCKFDASKVVTKINDYVYWYGDEEALLEATATIGPIS 240


>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 94/246 (38%), Positives = 137/246 (55%), Gaps = 22/246 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSE 108
           ++ +K++  K Y + EE   R  +++ NL    +   +  L   T   G+ +F+DL   E
Sbjct: 28  WNEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGINQFTDLQNEE 87

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           F     G  R       A+ +  LP N+   LP   DWR  G VT VKDQG CGSCW+FS
Sbjct: 88  FVAMMTGF-RVSGTSKAAKGSTFLPPNNVGELPKTVDWRTKGYVTPVKDQGQCGSCWAFS 146

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TG++EG HF +TG+LVSLSEQ LVDC            D+GC+GG M+ AF+YI+ AGG
Sbjct: 147 TTGSVEGQHFKATGKLVSLSEQNLVDCSGR---------DAGCDGGFMDRAFQYIIDAGG 197

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIE 284
           ++ E  YPY   D G C F K+ + A V+ ++ ++S  ++     V H GP++    +I+
Sbjct: 198 IDTEASYPYKAVD-GKCHFKKANVGATVTGYTDVTSGSEKALQKAVAHVGPIS---VAID 253

Query: 285 LPHISF 290
             H+SF
Sbjct: 254 ASHMSF 259


>gi|356560855|ref|XP_003548702.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
          Length = 357

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 88/231 (38%), Positives = 131/231 (56%), Gaps = 26/231 (11%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLR-----RAKRRQLLDPTA-VHGVTKFSDLTP 106
           F L++ +    Y   +E   RF +F +NL       AKR     P+  + G+  F+D +P
Sbjct: 52  FQLWRKEHGLVYKDLKEMAKRFEIFLSNLNYIIEFNAKRS---SPSGYLLGLNNFADWSP 108

Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           SEF+  +L     L +P D+      P+L     P   DWR+  AVT +K+QG+CGSCW+
Sbjct: 109 SEFQEIYL---HSLDMPTDSAPKLNGPLLSC-IAPASLDWRNKVAVTAIKNQGSCGSCWA 164

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSA GA+EG H ++TGEL+SLSEQ+LV+CD             GCNGG +N AF++++  
Sbjct: 165 FSAAGAIEGIHAITTGELISLSEQELVNCDR---------VSKGCNGGWVNKAFDWVISN 215

Query: 224 GGVEREKDYPYTGTDGGSCKFDKS-KIAAAVSNFSVISSDEDQMAANLVKH 273
           GG+  E +YPYTG DGG+C  DK   I A +  +  +   ++ +  ++VK 
Sbjct: 216 GGITLEAEYPYTGKDGGNCNSDKQVPIKATIDGYEQVEQSDNGLLCSIVKQ 266


>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
 gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
          Length = 323

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 97/231 (41%), Positives = 123/231 (53%), Gaps = 23/231 (9%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           FK+KF K YA  EE  +R  VF   L+      +R    + T    +  FSDLT  E   
Sbjct: 23  FKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEVLA 82

Query: 112 QFLGLNRRLR----LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
              G+ RR      LP  A      PT  +  D DWR+ GAVT VKDQG CGSCW+FSA 
Sbjct: 83  TKTGMTRRRHPLSVLPKSA------PTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFSAV 136

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
            ALEGAHFL TG+LVSLSEQ LVDC        S   + GCNGG    A++YI+   G++
Sbjct: 137 AALEGAHFLKTGDLVSLSEQNLVDC-------SSSYGNQGCNGGWPYQAYQYIIANRGID 189

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
            E  YPY   D  +C++D   I A VS++    S DE  +   +   GP++
Sbjct: 190 TESSYPYKAID-DNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVS 239


>gi|321460289|gb|EFX71333.1| hypothetical protein DAPPUDRAFT_189155 [Daphnia pulex]
          Length = 266

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 83/185 (44%), Positives = 116/185 (62%), Gaps = 10/185 (5%)

Query: 93  TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
           TAV+G T FSD + +E++    G N  LR      +   +P  DLP +FDWR+H  VT V
Sbjct: 3   TAVYGDTPFSDWSAAEYKAHLAGFNPSLRQSNARLRQAAIPEIDLPDEFDWRNHSVVTPV 62

Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
           KDQG+CGSCW+FS TG +EG + +  G+L+SLSEQ+LVDCD           DSGCNGGL
Sbjct: 63  KDQGSCGSCWAFSVTGNVEGIYAVRNGDLLSLSEQELVDCD---------KLDSGCNGGL 113

Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
             +A++ I   GG+E E DYPY G +   CKF+ +     V+    IS++E +MA  L++
Sbjct: 114 PENAYKAIHDIGGLETESDYPYNGHE-NKCKFNSNITRVQVTGGVEISTNETEMAQWLIQ 172

Query: 273 HGPLA 277
           +GP++
Sbjct: 173 NGPIS 177


>gi|388509526|gb|AFK42829.1| unknown [Lotus japonicus]
          Length = 333

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 93/248 (37%), Positives = 137/248 (55%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H++LFK+ F K Y+T EE   R   ++AN+   ++  L     +H    G+  ++DLT +
Sbjct: 27  HWALFKTTFGKQYSTAEEITRRL-AWEANVAIIRQHNLEHDLGLHTYTLGLNNYADLTNA 85

Query: 108 EFRRQFLGLNRRLRLPADA-QKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF +   GL         A ++  + P   +LPT  DWR  G VT +KDQG CGSCW+FS
Sbjct: 86  EFNQVMNGLRVNASQTKSANRRTYVAPVGVELPTSVDWRTKGYVTPIKDQGQCGSCWAFS 145

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           +TG+LEG HF  TG+LVSLSEQ L DC  +         + GCNGGLM+ AF YI +  G
Sbjct: 146 STGSLEGQHFAKTGQLVSLSEQNLTDCSQK-------QGNMGCNGGLMDQAFTYIKENNG 198

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIE 284
           ++ E  YPY   D   C F  + + A  + ++ I+  DE+ + + +   GP++    +I+
Sbjct: 199 IDTESSYPYKAVD-EKCHFKAADVGATDTGYTDIAQQDENALQSAIATVGPIS---VAID 254

Query: 285 LPHISFSF 292
             H SF  
Sbjct: 255 ASHSSFQL 262


>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
          Length = 376

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 81/230 (35%), Positives = 133/230 (57%), Gaps = 13/230 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           ++ + +K  K Y    E + RF +FK NL+        + +   G+ +F+DLT  E+R  
Sbjct: 47  YAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSENRSYKVGLNRFADLTNEEYRSM 106

Query: 113 FLGLN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           FLG       R ++  + +++  +  ++ LP   DWR+ GAV  +KDQG+CGSCW+FS  
Sbjct: 107 FLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKDQGSCGSCWAFSTV 166

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
            A+EG + ++TGE++ LSEQ+LVDCD         + D+GCNGGLM+ AFE+I+  GG++
Sbjct: 167 AAVEGVNQIATGEMIQLSEQELVDCDR--------TYDAGCNGGLMDYAFEFIINNGGID 218

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            E+DYPY G DG      K+    +++++  +   ++      V H P++
Sbjct: 219 TEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVS 268


>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 330

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 96/234 (41%), Positives = 135/234 (57%), Gaps = 23/234 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  F   ++K Y+++E ++ R  +FK NLRR +     D  A HG+T+F+DLT  EF   
Sbjct: 30  FKKFTQTYNKKYSSEEHYNARLSIFKENLRRIELFNKNDE-AQHGITQFADLTHEEFADM 88

Query: 113 FLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
           +LG   +LR   ++Q    L +     PT  DW   GAVT VK+QG+CGSCW+FS TG++
Sbjct: 89  YLGYKPQLR---NSQAKVSLSSTPFTAPTAIDWTTKGAVTPVKNQGSCGSCWAFSTTGSI 145

Query: 171 EGAHFLSTGE-LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           EG + L   + L S SEQQLVDCD +         D GCNGGLM++AF Y L++  +E E
Sbjct: 146 EGQYVLQLKQNLTSFSEQQLVDCDTK--------EDQGCNGGLMDNAFTY-LESAKLETE 196

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNF------SVISSDEDQMAANLVKHGPLA 277
             YPYT  D GSCK+++S     V++F        ++  E+ M   L   GPL+
Sbjct: 197 SAYPYTAVD-GSCKYNQSLGVVGVASFVDIEQGKTVADTENTMGVALDNIGPLS 249


>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
          Length = 372

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 139/238 (58%), Gaps = 26/238 (10%)

Query: 53  FSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSE 108
             L+KS   +  K Y    E + RF +FK NLR        + T    G+ KF+DLT  E
Sbjct: 43  MGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQE 102

Query: 109 FRRQFLGLN----RRL---RLPAD--AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
           +R +FLG      RRL   ++P+   A +A     ++LP   +WRDHGAV+ VKDQG+CG
Sbjct: 103 YRAKFLGTRTDPRRRLMKSKIPSSRYAHRA----GDNLPDSVNWRDHGAVSRVKDQGSCG 158

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA  A+EG + + +GEL+SLSEQ+LVDCD         S D+GCNGGLM+ AF++
Sbjct: 159 SCWAFSAIAAVEGINKIVSGELISLSEQELVDCDR--------SYDAGCNGGLMDYAFQF 210

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           I+  GG++ EKDYPY G +       K+    ++  +  + ++E+ +    V H P++
Sbjct: 211 IIDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPNNENALKK-AVAHQPVS 267


>gi|1136308|gb|AAB41119.1| cruzipain [Trypanosoma cruzi]
          Length = 467

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYGSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTAFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P + +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC GGLMN+AFE+I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258


>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 355

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 106/295 (35%), Positives = 157/295 (53%), Gaps = 32/295 (10%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSK 62
            S L+ +  S++L SA+A    D  I    P       + L + E    LF+S   + SK
Sbjct: 11  FSLLVAISASALLCSALA---RDFSIVGYTP-------EQLTSTEKLLELFESWMSEHSK 60

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---- 118
            Y + EE  +RF VF+ NL    +R     +   G+ +F+DLT  EF+ ++LGL +    
Sbjct: 61  VYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKPQFS 120

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
           R R P+   +   +   DLP   DWR  GAV  VKDQG CGSCW+FS   A+EG + ++T
Sbjct: 121 RKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITT 178

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L SLSEQ+L+DCD         + +SGCNGGLM+ AF+YI+  GG+ +E DYPY   +
Sbjct: 179 GNLSSLSEQELIDCDT--------TFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYL-ME 229

Query: 239 GGSCKFDKSKIA-AAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
            G C+  K  +    +S +  +  ++D+     + H P++    +IE     F F
Sbjct: 230 EGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVS---VAIEASGRDFQF 281


>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
          Length = 389

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 88/208 (42%), Positives = 127/208 (61%), Gaps = 25/208 (12%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL-----RRAKRRQLLDPTAVH 96
           SE+ +L     F  +K K  K Y   EE + RF  FK NL     R AKR+       V 
Sbjct: 41  SEERVLEI---FQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHV- 96

Query: 97  GVTKFSDLTPSEFRRQFLG-----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTG 151
           G+ KF+D++  EFR+ +L      +N+ + L  + ++   + + D P+  DWR++G VT 
Sbjct: 97  GLNKFADMSNEEFRKAYLSKVKKPINKGITLSRNMRRK--VQSCDAPSSLDWRNYGVVTA 154

Query: 152 VKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGG 211
           VKDQG+CGSCW+FS+TGA+EG + L TG+L+SLSEQ+LV+CD         + + GC GG
Sbjct: 155 VKDQGSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECD---------TSNYGCEGG 205

Query: 212 LMNSAFEYILKAGGVEREKDYPYTGTDG 239
            M+ AFE+++  GG++ E DYPYTG DG
Sbjct: 206 YMDYAFEWVINNGGIDSESDYPYTGVDG 233


>gi|375073978|gb|AFA34856.1| cathepsin L-like protein [Trypanosoma cruzi]
          Length = 467

 Score =  164 bits (415), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 96/240 (40%), Positives = 124/240 (51%), Gaps = 26/240 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P + +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC+GGLMN+AFE+I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A  V
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVGV 261


>gi|375073976|gb|AFA34855.1| cathepsin L-like protein [Trypanosoma cruzi]
          Length = 467

 Score =  164 bits (415), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYGSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P + +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC GGLMN+AFE+I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 202 NGAVYTEGSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258


>gi|343475823|emb|CCD12886.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  164 bits (415), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 93/233 (39%), Positives = 123/233 (52%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQG C S W+FSA G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD           D GC  GL + AF++IL +  G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTN---------DLGCELGLKDPAFQWILWSNKGNV 208

Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY    G   +C      + A +SN   +  DED +A  L + GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCDMSGKVVGAKISNMRYLPLDEDTIAEWLARKGPVA 261


>gi|52345644|ref|NP_001004869.1| cathepsin L2 precursor [Xenopus (Silurana) tropicalis]
 gi|49522051|gb|AAH74718.1| MGC69486 protein [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  164 bits (415), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 92/249 (36%), Positives = 136/249 (54%), Gaps = 18/249 (7%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           ++H++L+K+   K+YA +EE  +R  +++ NLR  +   L      H    G+ +F D+T
Sbjct: 26  DNHWNLWKNWHKKSYAPKEE-GWRRVLWEKNLRMIEFHNLEHSLGKHSHSLGMNQFGDMT 84

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
             EFR+   G   + ++      AP     + P   DWR  G VT VKDQG CGSCW+FS
Sbjct: 85  NEEFRQLMNGYKNQKKIRGSTFLAP--NNFESPKSVDWRKKGYVTPVKDQGQCGSCWAFS 142

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGALEG H+ +TG+++SLSEQ LVDC            + GCNGGLM+ AF+Y+   GG
Sbjct: 143 TTGALEGQHYRNTGKMISLSEQNLVDC-------SRAQGNQGCNGGLMDQAFQYVKDNGG 195

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E  YPYT  D   C +D +  +A  + F  V S  E  +   +   GP++    +++
Sbjct: 196 IDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSGSEKDLMNAVASVGPVS---VAVD 252

Query: 285 LPHISFSFL 293
             H SF F 
Sbjct: 253 AGHQSFQFY 261


>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 485

 Score =  164 bits (415), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 91/278 (32%), Positives = 147/278 (52%), Gaps = 18/278 (6%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQ---SEDHLLNAEHHFSLFKSK 59
           +L+ S+ ++L L+ ++ S+       AM   ++  D      S          +  +  K
Sbjct: 2   KLLNSATVILFLTMIVVSS-------AMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVK 54

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
             K   +  E D RF +FK NLR        + +   G+TKF+DLT  E+R  +LG   +
Sbjct: 55  HGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRLK 114

Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
            +    + +  +   + +P   DWR  GAV  VKDQG+CGSCW+FS  GA+EG + + TG
Sbjct: 115 RKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTG 174

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
           +L++LSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG++ E+DYPY G DG
Sbjct: 175 DLITLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDG 226

Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
              +  K+     +  +  + ++ ++     + H P++
Sbjct: 227 RCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPIS 264


>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
          Length = 468

 Score =  164 bits (414), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 100/275 (36%), Positives = 149/275 (54%), Gaps = 29/275 (10%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+S++    A   ++ + +   +TY    E + R++VF+ NLR            
Sbjct: 31  IVSYGERSDEE---ARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAG 87

Query: 95  VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
           VH    G+ +F+DLT  E+R  +LG      R  +L A    A      DLP   DWR  
Sbjct: 88  VHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAAD---NEDLPESVDWRAK 144

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAV  VKDQG+CGSCW+FS   A+EG + + TG+L+SLSEQ+LVDCD         S + 
Sbjct: 145 GAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQ 196

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLM+ AFE+I+  GG++ EKDYPY GTDG      K+     + ++  + +++++ 
Sbjct: 197 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 256

Query: 267 AANLVKHGPLAGNVASIELPHISF----SFLFTVS 297
               V + P++    +IE    +F    S +FT S
Sbjct: 257 LQKAVANQPVS---VAIEAAGTAFQLYSSGIFTGS 288


>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
 gi|1582621|prf||2119193B cathepsin L-related Cys protease
          Length = 313

 Score =  164 bits (414), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 99/253 (39%), Positives = 131/253 (51%), Gaps = 19/253 (7%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN--LRRAKRRQLLDPTAVHGV--TKF 101
           L  A   +  FK+++ + Y   +E  YR RVF+ N  L  A  ++  +      V   +F
Sbjct: 5   LATASPSWEHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQF 64

Query: 102 SDLTPSEFRRQFLGLNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
            D+T  EF     G  +  R  P     A   P   +  D DWR  GAVT VKDQG CGS
Sbjct: 65  GDMTNEEFNAVMKGYKKGSRGEPTTVFTAEGRP---MAADVDWRTKGAVTPVKDQGQCGS 121

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FSATG+LEG HFL   ELVSLSEQ+LVDC  E         + GC GG M SAF+YI
Sbjct: 122 CWAFSATGSLEGQHFLKNNELVSLSEQELVDCSTEYG-------NDGCGGGWMTSAFDYI 174

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
              GG++ E  YPY   D  SC+FD + I A  + F  +   E+ +   +   GP++   
Sbjct: 175 KDNGGIDTESSYPYEAQD-RSCRFDANSIGATCTGFVEVQHTEEALHEAVSDIGPIS--- 230

Query: 281 ASIELPHISFSFL 293
            +I+  H SF F 
Sbjct: 231 VAIDASHFSFQFY 243


>gi|281211531|gb|EFA85693.1| cysteine protease [Polysphondylium pallidum PN500]
          Length = 366

 Score =  164 bits (414), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 92/244 (37%), Positives = 135/244 (55%), Gaps = 18/244 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F+ +  KF + Y+  E    ++  FK+N+         +   V  +   +D +P E+++ 
Sbjct: 27  FTDWTHKFQRLYSNNEFLK-KYHTFKSNMDYVHSWNAKNSDTVLELNHLADHSPEEYKKF 85

Query: 113 FLGLNRRLRLPADAQKAPI---LPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           +LG  R   +  + Q   I   L T   D     DWR  GAV+ +KDQG CGSCWSFS T
Sbjct: 86  YLGT-RVKHIHFNVQGTHINTQLSTVFEDSGATVDWRKKGAVSPIKDQGQCGSCWSFSTT 144

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           G++EGAH + TG +V LSEQ LVDC        S   + GCNGGLMN+AF+YI+   G++
Sbjct: 145 GSVEGAHQIKTGNMVELSEQNLVDC-------SSAEGNMGCNGGLMNNAFDYIISNHGID 197

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIELP 286
            E+ YPYT   G  CKF+K+ + A +S++  I+   +   AN VK  GP++    +I+  
Sbjct: 198 TEQSYPYTANTGSVCKFNKTNVGATISSYKSITPGSETDLANAVKTAGPVS---VAIDAS 254

Query: 287 HISF 290
           H SF
Sbjct: 255 HRSF 258


>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
 gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
          Length = 352

 Score =  164 bits (414), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 90/245 (36%), Positives = 133/245 (54%), Gaps = 16/245 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           +  + +K  K Y    E   RF +FK NLR        + T   G+TKF+DLT  E+R  
Sbjct: 4   YKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQNHTYKVGLTKFADLTNEEYRAM 63

Query: 113 FLGLN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           FLG       R ++  + +++      + LP   DWR  GAV  +KDQG+CGSCW+FS  
Sbjct: 64  FLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWAFSTV 123

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
            A+EG + + TGEL+SLSEQ+LVDCD         + ++GCNGGLM+ AF++I+  GG++
Sbjct: 124 AAVEGINQIVTGELISLSEQELVDCDR--------TYNAGCNGGLMDYAFQFIINNGGLD 175

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPH 287
            EKDYPY G D    K      A ++  F  +   +++     V H P++    +IE   
Sbjct: 176 TEKDYPYVGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVS---VAIEASG 232

Query: 288 ISFSF 292
           ++  F
Sbjct: 233 MALQF 237


>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
          Length = 371

 Score =  164 bits (414), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 137/238 (57%), Gaps = 26/238 (10%)

Query: 53  FSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSE 108
             L+KS   +  K Y    E + RF +FK NLR        + T    G+ KF+DLT  E
Sbjct: 42  MGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQE 101

Query: 109 FRRQFLGLN----RRL---RLPAD--AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
           +R +FLG      RRL   ++P+   A +A     ++LP   DWRDHGAV+ VKDQG+CG
Sbjct: 102 YRAKFLGTRTDPRRRLMKSKIPSSRYAHRA----GDNLPDSVDWRDHGAVSPVKDQGSCG 157

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FS    +EG + + +GELVSLSEQ+LVDCD         S D+GCNGGLM+ AF++
Sbjct: 158 SCWAFSTIATVEGINKIVSGELVSLSEQELVDCDR--------SYDAGCNGGLMDYAFQF 209

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           I+  GG++ EKDYPY G +       K+    ++  +  + ++E+ +    V H P++
Sbjct: 210 IMDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPNNENALKK-AVAHQPVS 266


>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
 gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
          Length = 339

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 100/252 (39%), Positives = 135/252 (53%), Gaps = 23/252 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           +  FK +  K Y  + E  +R ++F  N  + AK  Q      V     V K++D+   E
Sbjct: 27  WQTFKLEHRKNYVDETEERFRLKIFNENKHKIAKHNQRYASGEVSFKMAVNKYADMLHHE 86

Query: 109 FRRQFLGLN----RRLRL--PADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
           F     G N    ++LR   P+      I P +  +P   DWR  GAVT VKDQG CGSC
Sbjct: 87  FHTTMNGFNYTLHKQLRASDPSFVGVTFISPEHVKIPKSVDWRSKGAVTEVKDQGHCGSC 146

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS+TGALEG HF   G L+SLSEQ LVDC        +   ++GCNGGLM++AF YI 
Sbjct: 147 WAFSSTGALEGQHFRKAGTLISLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYIK 199

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAGNV 280
             GG++ EK YPY G D  SC F+K+ I A    +  +   DE +MA  +   GP++   
Sbjct: 200 DNGGIDTEKSYPYEGID-DSCHFNKATIGATDRGSVDIPQGDEKKMAEAVATIGPVS--- 255

Query: 281 ASIELPHISFSF 292
            +I+  H SF F
Sbjct: 256 VAIDASHESFQF 267


>gi|71666430|ref|XP_820174.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70885508|gb|EAN98323.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 467

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P + +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC GGLMN+AFE+I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258


>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
          Length = 335

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 96/252 (38%), Positives = 134/252 (53%), Gaps = 20/252 (7%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           E  F  +K KF ++Y T  E   R +++  N +      +L    +     G+T+F+D+ 
Sbjct: 24  EMEFHAWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMD 83

Query: 106 PSEFRRQF-LGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSC 161
             E++    LG  R     A  + +      +   LPT  DWRD G VTGVKDQ  CGSC
Sbjct: 84  NEEYKSLISLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSC 143

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FSATG+LEG +F  TG+LVSLSEQQLVDC  +         + GCNGGLM+ AF+YI 
Sbjct: 144 WAFSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYG-------NMGCNGGLMDYAFKYIQ 196

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNV 280
           + GG++ EK YPY   D G C+F    + A  + +  V   DED +   +   GP++   
Sbjct: 197 ENGGIDTEKSYPYEAED-GQCRFKPENVGAKCTGYVDVTVGDEDALKEAVATIGPVS--- 252

Query: 281 ASIELPHISFSF 292
             I+  H SF  
Sbjct: 253 VGIDASHSSFQL 264


>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 476

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 92/242 (38%), Positives = 138/242 (57%), Gaps = 17/242 (7%)

Query: 41  QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK-RRQLLDPTAVHGVT 99
           ++E+ L++    + +   K  K Y    E + RF++FK NLR         D T   G+ 
Sbjct: 50  RTEEELMSMYEQWLV---KHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLN 106

Query: 100 KFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
           +F+DLT  E+R ++LG     NRRL      + AP +  + LP   DWR  GAV  VKDQ
Sbjct: 107 RFADLTNEEYRAKYLGTKIDPNRRLGKTPSNRYAPRV-GDKLPDSVDWRKEGAVPPVKDQ 165

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
           G CGSCW+FSA GA+EG + + TGEL+SLSEQ+LVDCD           + GCNGGLM+ 
Sbjct: 166 GGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDT--------GYNQGCNGGLMDY 217

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
           AFE+I+  GG++ ++DYPY G DG    + K+    ++ ++  + + ++      V + P
Sbjct: 218 AFEFIINNGGIDSDEDYPYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQP 277

Query: 276 LA 277
           ++
Sbjct: 278 VS 279


>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
          Length = 333

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 102/250 (40%), Positives = 132/250 (52%), Gaps = 22/250 (8%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
            +  FK+   K+Y +  E   RF++F  N L  A+  +      V    G+ +F DL P 
Sbjct: 26  QWEAFKATHKKSYQSNMEELLRFKIFSENSLLVARHNEKYARGLVSYKLGMNQFGDLLPH 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN----DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           EF R F G   R    A      + P N     LP   DWR+ GAVT VK+QG CGSCW+
Sbjct: 86  EFARMFNGY--RGARTAGRGSTFLPPANVNYSSLPQSMDWREKGAVTPVKNQGQCGSCWA 143

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG+LEG HFL TG LVSLSEQ LVDC      E  G  + GC GGLM++AF+YI   
Sbjct: 144 FSTTGSLEGQHFLKTGVLVSLSEQNLVDC-----SETFG--NHGCEGGLMDNAFQYIKAN 196

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
           GG++ EK YPY   D G C+F K  + A  + F  I    ED +   +   GP++    +
Sbjct: 197 GGIDTEKSYPYEAED-GECRFKKQNVGATDTGFVDIEQGSEDDLKKAVATVGPVS---VA 252

Query: 283 IELPHISFSF 292
           I+  H SF  
Sbjct: 253 IDASHSSFQL 262


>gi|118429521|gb|ABK91808.1| cysteine proteinase prozyme precursor [Clonorchis sinensis]
          Length = 316

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 99/239 (41%), Positives = 142/239 (59%), Gaps = 13/239 (5%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
           NA   +  FK K+ K+Y+  ++ +YRFRVFK NL R K+ Q ++  TA +GVT+FSDLT 
Sbjct: 15  NARQLYEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 73

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            EF+ ++L  ++   +P D +  P +  +    +FDWR+HGAV  V DQG CGSCW+FSA
Sbjct: 74  QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 132

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
            G +EG  F  T  L+ LSEQQL+DCD           D GCNGG    AF+ IL  GG+
Sbjct: 133 VGNIEGQWFRKTDNLLQLSEQQLLDCDE---------VDEGCNGGTPQQAFKQILGMGGL 183

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
           + + DYPY G + G C+   SK+   ++   ++  DE   A  L + GPL+  + ++ L
Sbjct: 184 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFL 241


>gi|30575716|gb|AAP33050.1| cysteine proteinase 3 [Clonorchis sinensis]
 gi|358339353|dbj|GAA47433.1| cathepsin F [Clonorchis sinensis]
          Length = 327

 Score =  164 bits (414), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 99/239 (41%), Positives = 142/239 (59%), Gaps = 13/239 (5%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
           NA   +  FK K+ K+Y+  ++ +YRFRVFK NL R K+ Q ++  TA +GVT+FSDLT 
Sbjct: 26  NARQLYEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 84

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            EF+ ++L  ++   +P D +  P +  +    +FDWR+HGAV  V DQG CGSCW+FSA
Sbjct: 85  QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 143

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
            G +EG  F  T  L+ LSEQQL+DCD           D GCNGG    AF+ IL  GG+
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCDE---------VDEGCNGGTPQQAFKQILGMGGL 194

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
           + + DYPY G + G C+   SK+   ++   ++  DE   A  L + GPL+  + ++ L
Sbjct: 195 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFL 252


>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
          Length = 338

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 112/295 (37%), Positives = 152/295 (51%), Gaps = 43/295 (14%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           L L+L++V+ S  AV+  D +  Q                   +S FK + SK Y ++ E
Sbjct: 3   LFLILAAVVISCQAVSFYDLVQEQ-------------------WSSFKMQHSKNYDSETE 43

Query: 70  HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNR---RLRL 122
             +R ++F  N  + AK  +L     V    G+ K++D+   EF     G N+    +  
Sbjct: 44  ERFRMKIFMENAHKVAKHNKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILK 103

Query: 123 PADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
            +D   A   I P N  LP   DWRD GAVT VKDQG CGSCWSFSATG+LEG HF  TG
Sbjct: 104 GSDLNDAVRFISPANVKLPDTVDWRDKGAVTEVKDQGHCGSCWSFSATGSLEGQHFRKTG 163

Query: 180 ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           +LVSLSEQ LVDC        SG   ++GCNGGLM++AF YI   GG++ EK YPY   D
Sbjct: 164 KLVSLSEQNLVDC--------SGRYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYLAED 215

Query: 239 GGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
              C +      A    F  I  ++ED + A +   GP++    +I+  H +F  
Sbjct: 216 -EKCHYKAQNSGATDKGFVDIEEANEDDLKAAVATVGPVS---IAIDASHETFQL 266


>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
 gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
          Length = 327

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 92/245 (37%), Positives = 129/245 (52%), Gaps = 27/245 (11%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH--------GVTKFSDLTPS 107
           +K +  K Y +  E   R  +++AN      R+ +D    H        G+ +F+DL  S
Sbjct: 25  WKKEHGKVYNSDREELTRHIIWQAN------RKYVDEHNAHAEKFGFTVGMNQFADLESS 78

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           EF R + G N +  +     K       DLPT  DWR  G VT +K+QG CGSCW+FSA 
Sbjct: 79  EFGRLYNGYNNKPSMKKAQSKVFSTKVGDLPTSVDWRTKGFVTAIKNQGQCGSCWAFSAV 138

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
             LEG HF +TG LVSLSEQ LVDC        +   + GCNGGLM++AF+Y++K GG++
Sbjct: 139 AGLEGQHFNATGTLVSLSEQNLVDCS-------TAEGNQGCNGGLMDNAFQYVIKNGGID 191

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI--SSDEDQMAANLVKHGPLAGNVASIEL 285
            E  YPY   D   CKF+ + + +  S FS I     E  +   +   GP++    +I+ 
Sbjct: 192 TEASYPYKAVD-QKCKFNAANVGSTCSGFSDILPHKSEAALQVAVAVVGPIS---VAIDA 247

Query: 286 PHISF 290
            H SF
Sbjct: 248 SHTSF 252


>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
          Length = 333

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 101/258 (39%), Positives = 135/258 (52%), Gaps = 20/258 (7%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---G 97
           S   +L  E  +  FKS+ +K Y++  E   RF++F  N L  AK         V     
Sbjct: 18  SSQEILRTE--WEAFKSQHNKAYSSHVEELLRFKIFTENTLLVAKHNAKYAKGLVSYKLA 75

Query: 98  VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQ 155
           + KF DL P EF +   G   +          P    ND  LPT  DWR  GAVT VK+Q
Sbjct: 76  MNKFGDLLPHEFAKMVNGYRGKQNKEQRPTFIPPANLNDSSLPTTVDWRKKGAVTPVKNQ 135

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
           G CGSCW+FS TG+LEG HF  TG+LVSLSEQ LVDC  +         + GCNGGLM++
Sbjct: 136 GQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFG-------NQGCNGGLMDN 188

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHG 274
            F+YI   GG++ E+ +PYT  D G CKF K+ + A  + F  +    ED +   +   G
Sbjct: 189 GFQYIKANGGIDTEESHPYTAQD-GDCKFKKADVGATDAGFVDIQQGSEDDLKKAVATVG 247

Query: 275 PLAGNVASIELPHISFSF 292
           P++    +I+  H SF  
Sbjct: 248 PVS---VAIDASHGSFQL 262


>gi|281204396|gb|EFA78592.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
          Length = 330

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 99/261 (37%), Positives = 145/261 (55%), Gaps = 24/261 (9%)

Query: 39  GEQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV 95
           G  S + L + +H+   F+ +  +  + Y   E  D R+  FK NL    +      + V
Sbjct: 12  GIASANRLFSEQHYQNQFTNWMVRLDRAYDVFEFQD-RYNAFKNNLDLIHKWNSQGHSTV 70

Query: 96  HGVTKFSDLTPSEFRRQFLGLNRRL-RLPADAQKAPILPTNDL----PTDFDWRDHGAVT 150
            GV   +DL+  E+R  +LG+     RLP   Q+A  +  N +        DWR  GAV 
Sbjct: 71  LGVNHLADLSNEEYRNLYLGVKVDASRLP---QQAASIKLNKVFAPVAASLDWRSSGAVG 127

Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
            VKDQG CGSCWSFS TG++EGA+ ++TG   SLSEQQL+DC  +   E       GCNG
Sbjct: 128 RVKDQGQCGSCWSFSTTGSIEGANQIATGNFASLSEQQLMDCSRDYGNE-------GCNG 180

Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAAN 269
           GLM++A +Y++  GG++ E+ YPYT +D  +CKF+ + I A +S++  V    E  +AA 
Sbjct: 181 GLMDAAMKYVIAQGGLDTEESYPYTMSDSYTCKFNPANIGAKISSYIDVQRGSETDLAAK 240

Query: 270 LVKHGPLAGNVASIELPHISF 290
           L K GP++    +I+  H SF
Sbjct: 241 LNK-GPVS---VAIDASHSSF 257


>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
          Length = 324

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 100/282 (35%), Positives = 142/282 (50%), Gaps = 43/282 (15%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M+  IL+SLL++ +S+ L                +  DG            HF  FK K 
Sbjct: 1   MKSFILASLLVVAVSATL----------------LKEDGV-----------HFQSFKLKH 33

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGL 116
            KTY  Q E   RF +F+ NLR+ +         +H    G+ KF+D+T +EF+   L  
Sbjct: 34  GKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAEFK-AMLAT 92

Query: 117 NRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
             + +    A K   L     +P   DWR    VT +KDQ  CGSCWSF+  G+ EGA+ 
Sbjct: 93  QVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAYA 152

Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
           LSTG+L   SEQQLVDC        +   + GC+GG ++  F YI +  G+E E DYPYT
Sbjct: 153 LSTGKLTRFSEQQLVDC--------TTDLNYGCDGGYLDDTFPYI-QTNGLELESDYPYT 203

Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G D GSC +D SK+   VS++  + ++E  +   +   GP+A
Sbjct: 204 GYD-GSCSYDSSKVVTKVSSYVSVPANEQALLEAVGTAGPVA 244


>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
          Length = 469

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 91/248 (36%), Positives = 137/248 (55%), Gaps = 16/248 (6%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+    A   ++ + +   +TY    E + RF VF+ NLR            
Sbjct: 31  IVSYGERSEEE---ARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAG 87

Query: 95  VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAV 149
           VH    G+ +F+DLT  E+R  +LG+  R +         +   N DLP   DWR  GAV
Sbjct: 88  VHSFRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAV 147

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
             VKDQG+CGSCW+FS   A+EG + + TG+++SLSEQ+LVDCD         S + GCN
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDT--------SYNQGCN 199

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
           GGLM+ AFE+I+  GG++ E+DYPY GTDG      K+     + ++  + ++ ++    
Sbjct: 200 GGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQK 259

Query: 270 LVKHGPLA 277
            V + P++
Sbjct: 260 AVANQPIS 267


>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 99/250 (39%), Positives = 136/250 (54%), Gaps = 20/250 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           + H+ L+KS  +K Y  +EE  +R  V++ NL++ +   L      H    G+  F D+T
Sbjct: 25  DEHWDLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMT 83

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
             EFR+   G  R+       + +  +  N L  P   DWRD+G VT VKDQG CGSCW+
Sbjct: 84  HEEFRQIMYGYKRKSE--RKFKGSLFMEPNFLEAPRSVDWRDNGYVTPVKDQGQCGSCWA 141

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TGA+EG HF  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+YI   
Sbjct: 142 FSTTGAMEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDN 194

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
            G++ E  YPY GTD   C +D    +A  + F  + S  E  +   +   GP++    +
Sbjct: 195 QGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVS---VA 251

Query: 283 IELPHISFSF 292
           I+  H SF F
Sbjct: 252 IDAGHESFQF 261


>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
          Length = 441

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 82/219 (37%), Positives = 125/219 (57%), Gaps = 8/219 (3%)

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
           K  K   +  E D RF +FK NLR        + +   G+TKF+DLT  E+R  +LG   
Sbjct: 48  KHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRL 107

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
           + +    + +  +   + +P   DWR  GAV  VKDQG+CGSCW+FS  GA+EG + + T
Sbjct: 108 KRKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVT 167

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+L++LSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG++ E+DYPY G D
Sbjct: 168 GDLITLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVD 219

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G   +  K+     +  +  + ++ ++     + H P++
Sbjct: 220 GRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPIS 258


>gi|116242322|gb|ABJ89818.1| cysteine proteinase 3 [Clonorchis sinensis]
          Length = 327

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 99/239 (41%), Positives = 142/239 (59%), Gaps = 13/239 (5%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
           NA   +  FK K+ K+Y+  ++ +YRFRVFK NL R K+ Q ++  TA +GVT+FSDLT 
Sbjct: 26  NARQLYEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 84

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            EF+ ++L  ++   +P D +  P +  +    +FDWR+HGAV  V DQG CGSCW+FSA
Sbjct: 85  QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSA 143

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
            G +EG  F  T  L+ LSEQQL+DCD           D GCNGG    AF+ IL  GG+
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCD---------GVDEGCNGGTPQQAFKQILGMGGL 194

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
           + + DYPY G + G C+   SK+   ++   ++  DE   A  L + GPL+  + ++ L
Sbjct: 195 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFL 252


>gi|19747207|gb|AAL96762.1|AC104496_8 Tcc1l8.8 [Trypanosoma cruzi]
          Length = 500

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 70  QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 129

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P   +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 130 RYHNGAVHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 183

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC+GGLMN+AFE+I++  
Sbjct: 184 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 234

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 235 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 291


>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
          Length = 333

 Score =  163 bits (413), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 93/244 (38%), Positives = 128/244 (52%), Gaps = 18/244 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSE 108
           +S +K+   K Y   EE  +R  V+K N++  ++         H  T     F D+T  E
Sbjct: 29  WSQWKATHGKLYGMDEE-GWRREVWKKNMKMIRQHNWEHSQGKHSFTVAMNGFGDMTNEE 87

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           F++   GL  +        +AP+     +P+  DWR+ G VT VKDQG CGSCW+FSATG
Sbjct: 88  FKQVMNGLQMQKHKKGKMFQAPLFAK--IPSSVDWREKGYVTPVKDQGPCGSCWAFSATG 145

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           ALEG  F  TG+LVSLSEQ LVDC            + GCNGGLMN+AF+Y+   GG++ 
Sbjct: 146 ALEGQMFRKTGKLVSLSEQNLVDCSQ-------AEGNEGCNGGLMNNAFQYVKDNGGLDS 198

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
           E+ YPY   D  SCK+     AA  + F  I   E  +   +   GP++     I+  H 
Sbjct: 199 EESYPYHAQD-ESCKYKPQDSAANDTGFFDIPQQEKALMVAVATKGPIS---VGIDASHF 254

Query: 289 SFSF 292
           +F F
Sbjct: 255 TFQF 258


>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
            castaneum]
          Length = 1726

 Score =  163 bits (412), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 106/271 (39%), Positives = 149/271 (54%), Gaps = 29/271 (10%)

Query: 44   DHLL---NAEHHFSLFKS--KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHG 97
            D+LL   + E+H SLF    K       ++E+ YRF VF  NL + +     +  TA +G
Sbjct: 1408 DNLLGCDDREYHLSLFTDFLKKYNKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQGTATYG 1467

Query: 98   VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVK 153
            +T+F+D+T  EF R  LGL   LR   +  + P     +P  +LP +FDWR    VT VK
Sbjct: 1468 ITRFADMTQKEFSRS-LGLRTDLR---NENETPFAQAKIPNIELPKEFDWRKKNVVTEVK 1523

Query: 154  DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
            +Q  CGSCW+FS TG +EG + L  G+L+  SEQ+LVDCD +         D GCNGGLM
Sbjct: 1524 NQEQCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTD---------DQGCNGGLM 1574

Query: 214  NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
            ++A+  I K GG+E E+DYPY   D   C F+++     V+    IS +E  MA  LV +
Sbjct: 1575 DTAYRSIEKIGGLETEQDYPYDAED-EKCHFNRTLARVQVTGALNISHNETDMAKWLVAN 1633

Query: 274  GPLA----GNVASIELPHISFSFLFTVSSPK 300
            GP++     N     +  +S  F F + SPK
Sbjct: 1634 GPISIAINANAMQFYMGGVSHPFKF-LCSPK 1663


>gi|194898683|ref|XP_001978897.1| GG11133 [Drosophila erecta]
 gi|190650600|gb|EDV47855.1| GG11133 [Drosophila erecta]
          Length = 615

 Score =  163 bits (412), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 89/242 (36%), Positives = 137/242 (56%), Gaps = 15/242 (6%)

Query: 40  EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGV 98
           + S   L   +H F  F+ +F + Y +  E   R R+F+ NL+  +     +  +A +G+
Sbjct: 296 KHSHRGLDKVDHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGI 355

Query: 99  TKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQG 156
           T+F+DLT SE++ +  GL +R    A    A ++P    +LP +FDWR   AVT VK+QG
Sbjct: 356 TEFADLTSSEYKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKNAVTPVKNQG 414

Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
           +CGSCW+FS TG +EG + + TGEL   SEQ+L+DCD         + DS CNGGLM++A
Sbjct: 415 SCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNA 465

Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGP 275
           ++ I   GG+E E +YPY       C F+++     V+ F  +   +E  M   L+  GP
Sbjct: 466 YKAIKDIGGLEYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTKGP 524

Query: 276 LA 277
           ++
Sbjct: 525 IS 526


>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
          Length = 1761

 Score =  163 bits (412), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 106/271 (39%), Positives = 149/271 (54%), Gaps = 29/271 (10%)

Query: 44   DHLL---NAEHHFSLFKS--KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHG 97
            D+LL   + E+H SLF    K       ++E+ YRF VF  NL + +     +  TA +G
Sbjct: 1443 DNLLGCDDREYHLSLFTDFLKKYNKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQGTATYG 1502

Query: 98   VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVK 153
            +T+F+D+T  EF R  LGL   LR   +  + P     +P  +LP +FDWR    VT VK
Sbjct: 1503 ITRFADMTQKEFSRS-LGLRTDLR---NENETPFAQAKIPNIELPKEFDWRKKNVVTEVK 1558

Query: 154  DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
            +Q  CGSCW+FS TG +EG + L  G+L+  SEQ+LVDCD +         D GCNGGLM
Sbjct: 1559 NQEQCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTD---------DQGCNGGLM 1609

Query: 214  NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
            ++A+  I K GG+E E+DYPY   D   C F+++     V+    IS +E  MA  LV +
Sbjct: 1610 DTAYRSIEKIGGLETEQDYPYDAED-EKCHFNRTLARVQVTGALNISHNETDMAKWLVAN 1668

Query: 274  GPLA----GNVASIELPHISFSFLFTVSSPK 300
            GP++     N     +  +S  F F + SPK
Sbjct: 1669 GPISIAINANAMQFYMGGVSHPFKF-LCSPK 1698


>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
 gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
          Length = 469

 Score =  163 bits (412), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 90/248 (36%), Positives = 137/248 (55%), Gaps = 16/248 (6%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+    A   ++ + +   +TY    E + RF VF+ NLR            
Sbjct: 31  IVSYGERSEEE---ARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAG 87

Query: 95  VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAV 149
           VH    G+ +F+DLT  E+R  +LG+  R +         +   N DLP   DWR  GAV
Sbjct: 88  VHSFRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAV 147

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
             +KDQG+CGSCW+FS   A+EG + + TG+++SLSEQ+LVDCD         S + GCN
Sbjct: 148 AEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDT--------SYNQGCN 199

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
           GGLM+ AFE+I+  GG++ E+DYPY GTDG      K+     + ++  + ++ ++    
Sbjct: 200 GGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQK 259

Query: 270 LVKHGPLA 277
            V + P++
Sbjct: 260 AVANQPIS 267


>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
 gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
          Length = 463

 Score =  163 bits (412), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 100/275 (36%), Positives = 148/275 (53%), Gaps = 29/275 (10%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+S +    A   ++ + +   +TY    E + R++VF+ NLR            
Sbjct: 26  IVSYGERSXEE---ARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAG 82

Query: 95  VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
           VH    G+ +F+DLT  E+R  +LG      R  +L A    A      DLP   DWR  
Sbjct: 83  VHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAAD---NEDLPESVDWRAK 139

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAV  VKDQG+CGSCW+FS   A+EG + + TG+L+SLSEQ+LVDCD         S + 
Sbjct: 140 GAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQ 191

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLM+ AFE+I+  GG++ EKDYPY GTDG      K+     + ++  + +++++ 
Sbjct: 192 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 251

Query: 267 AANLVKHGPLAGNVASIELPHISF----SFLFTVS 297
               V + P++    +IE    +F    S +FT S
Sbjct: 252 LQKAVANQPVS---VAIEAAGTAFQLYSSGIFTGS 283


>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
          Length = 335

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 99/261 (37%), Positives = 138/261 (52%), Gaps = 22/261 (8%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAV---HG 97
           S   +L AE  +S FK+K  K+Y ++ E  +R +++  N  + AK  +      V     
Sbjct: 18  SYQEVLGAE--WSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMA 75

Query: 98  VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN----DLPTDFDWRDHGAVTGVK 153
           + +F D+   EF     G  R  +         + P N     LP   DWR  GAVT VK
Sbjct: 76  MNEFGDMLHHEFVSTRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVK 135

Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
           +QG CGSCW+FSATG+LEG HF  +G +VSLSEQ LVDC  +         ++GC GGLM
Sbjct: 136 NQGQCGSCWAFSATGSLEGQHFRKSGSMVSLSEQNLVDCSTDFG-------NNGCEGGLM 188

Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVK 272
           ++AF+YI    G++ EK YPY GTD G+C F KS + A  S F  +    E Q+   +  
Sbjct: 189 DNAFKYIRANKGIDTEKSYPYNGTD-GTCHFKKSTVGATDSGFVDIKEGSETQLKKAVAT 247

Query: 273 HGPLAGNVASIELPHISFSFL 293
            GP++    +I+  H SF F 
Sbjct: 248 VGPIS---VAIDASHESFQFY 265


>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
          Length = 329

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 95/231 (41%), Positives = 128/231 (55%), Gaps = 14/231 (6%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            +  FK+KF ++Y  +EE   R  VF  N++          T   GV +F+DLT  EF +
Sbjct: 18  QWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHTYTLGVNQFADLTVEEFSK 77

Query: 112 QFLGLNRRLRLPADAQKAP--ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
            ++G  +  +   DA      +     LPT  DW   GAVT VK+QG CGSCWSFS TG+
Sbjct: 78  TYMGFKKPAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGSCWSFSTTGS 137

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           LEGA+ +STG+LVSLSEQQ VDC            + GCNGGLM+SAF+Y  +A  +  E
Sbjct: 138 LEGANEISTGKLVSLSEQQFVDCAGTYG-------NQGCNGGLMDSAFKYA-EANALCTE 189

Query: 230 KDYPYTGTDGGSCKFDKSKIAAA---VSNFSVISSDEDQMAANLVKHGPLA 277
           + YPY GTD GSC+        A   VS +  +SSD +Q   + V   P++
Sbjct: 190 QSYPYKGTD-GSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVS 239


>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
          Length = 351

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 97/251 (38%), Positives = 136/251 (54%), Gaps = 18/251 (7%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA----VHGVTKF 101
           +L+AE  +  FK + +K Y   EE   R  +F  N +  K    L  T       GV +F
Sbjct: 34  VLDAEVAWHKFKLEHNKVYVGIEEESLRKTIFATNYKFIKDHNALHATGEKSFTVGVNEF 93

Query: 102 SDLTPSEFRRQFLGLN-RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
           +D+T  EF +   GL     R+      +P +    LP + DWR  G V+ VK+QG+CGS
Sbjct: 94  ADMTVHEFAQMMNGLKPDSTRVSGSTYLSPNIDA-PLPVEVDWRTKGLVSEVKNQGSCGS 152

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS TG+LEG H   TG +V LSEQ LVDC        +   + GCNGGLM +AF+YI
Sbjct: 153 CWAFSTTGSLEGQHMRKTGTMVDLSEQNLVDC-------STSYGNDGCNGGLMTNAFKYI 205

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGN 279
               G++ E+ YPY G D G CKF K+K+ A V+ F  I + +E ++   L   GP++  
Sbjct: 206 KDNKGIDTEEAYPYAGRD-GDCKFKKNKVGATVTGFVEIPAGNEKKLQEALATVGPVS-- 262

Query: 280 VASIELPHISF 290
             +I+  H SF
Sbjct: 263 -VAIDANHQSF 272


>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
          Length = 344

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 96/244 (39%), Positives = 138/244 (56%), Gaps = 19/244 (7%)

Query: 56  FKSKFSKTYA-TQEEH-DYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQF 113
           + S+  + YA  QE+H + RF VFK N+ R +       T    + +F+DLT  EFR  +
Sbjct: 40  WMSQHGRVYADEQEDHKNKRFNVFKENVERIEEFND-GKTFKLAINQFADLTNEEFRASY 98

Query: 114 LGLNRRLRLPADAQK-APILPTN---DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
            G    + L +   K  P    N    LP   DWR  GAVT VK+QG CG CW+FSA  A
Sbjct: 99  NGFKGPMVLSSQITKPTPFRYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAA 158

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG   +STG+L+SLSEQ+LVDCD       +   D GC GGLM++AFE+I+  GG+  E
Sbjct: 159 IEGITQISTGKLISLSEQELVDCD-------TKGIDHGCEGGLMDTAFEFIINNGGLTTE 211

Query: 230 KDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
            +YPY G D G+C F+K+  IA +++ +  + ++++Q     V H P++    +IE    
Sbjct: 212 SNYPYKGED-GTCNFNKTNPIAVSITGYEDVPANDEQALMKAVAHQPVS---VAIEAGGS 267

Query: 289 SFSF 292
            F F
Sbjct: 268 DFQF 271


>gi|118157|sp|P25779.1|CYSP_TRYCR RecName: Full=Cruzipain; AltName: Full=Cruzaine; AltName:
           Full=Major cysteine proteinase; Flags: Precursor
 gi|162048|gb|AAA30181.1| cruzain [Trypanosoma cruzi]
 gi|29409382|gb|AAM33131.1| cysteine proteinase precursor [Trypanosoma cruzi]
          Length = 467

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P   +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC+GGLMN+AFE+I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258


>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 93/235 (39%), Positives = 137/235 (58%), Gaps = 12/235 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
           F  +  +  KTY ++EE   R ++FK N     +  L+ + T    +  F+DLT  EF+ 
Sbjct: 32  FDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91

Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
             LGL+        A K   L  N  +P   DWR  GAVT VKDQG+CG+CWSFSATGA+
Sbjct: 92  SRLGLSVSASSLIMASKGQSLGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAM 151

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG + + TG+L+SLSEQ+L+DCD         S ++GCNGGLM+ AFE+++K  G++ EK
Sbjct: 152 EGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFEFVIKNHGIDTEK 203

Query: 231 DYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
           DYPY   D G+CK DK K     + +++ + S++++     V   P++  +   E
Sbjct: 204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSE 257


>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
          Length = 457

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 98/276 (35%), Positives = 147/276 (53%), Gaps = 23/276 (8%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQS----EDHLLNAEHHFSLFKSKFSKTY 64
            +LL  +S L+SA      D  I     S G +S    +D ++     + +   K  K Y
Sbjct: 2   FMLLFFASTLSSA-----SDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLV---KHGKAY 53

Query: 65  ATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL---NRRLR 121
            +  E + RF VFK NLR        + T   G+ +F+DLT  E+R  +LG     RR +
Sbjct: 54  NSLGEKERRFEVFKDNLRFIDEHNSENRTYRVGLNRFADLTNEEYRSMYLGALSGIRRNK 113

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
           L   + +      + LP   DWR  GAV GVKDQG+CGSCW+FSA  A+EG + + TG+L
Sbjct: 114 LRKISDRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDL 173

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           +SLSEQ+LVDCD+        S + GCNGGLM+  FE+I+  GG++ E+DYPY   DG  
Sbjct: 174 ISLSEQELVDCDN--------SYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRC 225

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             + K+    ++ ++  +  + +      V + P++
Sbjct: 226 DTYRKNARVVSIDSYEDVPVNNEAALQKAVANQPVS 261


>gi|71663163|ref|XP_818578.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70883837|gb|EAN96727.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 467

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P   +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAVHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC+GGLMN+AFE+I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258


>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
 gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
          Length = 325

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 94/250 (37%), Positives = 136/250 (54%), Gaps = 14/250 (5%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           L+ +  +  +K    KTY T EE D R  ++  NL   K+    + +    +  F+DLT 
Sbjct: 21  LSQDRQWHAWKDFHGKTY-TGEEEDLRRAIWNDNLEIVKKHNAENHSYKLDMNHFADLTV 79

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           +EF+++F+G          +   P L    LP + DWRD G VT VK+QG CGSCW+FS+
Sbjct: 80  TEFKQRFMGYRAASNSTGGSTFLP-LSNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSS 138

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TG+LEG HF  TG+LVSLSEQ LVDC  +         ++GC GGLM+ AF+YI    G+
Sbjct: 139 TGSLEGQHFRKTGKLVSLSEQNLVDCSKKYG-------NNGCEGGLMDYAFKYIKNNDGI 191

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIEL 285
           + E+ YPYT  D G C F    + A V+ ++ V    E  + + +   GP++    +I+ 
Sbjct: 192 DTEQSYPYTARD-GQCHFKPGSVGATVTGYTDVQRGSEGDLQSAVATVGPIS---VAIDA 247

Query: 286 PHISFSFLFT 295
            H SF    T
Sbjct: 248 GHSSFQLYKT 257


>gi|11464864|gb|AAG35357.1|AF314929_1 cruzipain [Trypanosoma cruzi]
          Length = 467

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P   +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC+GGLMN+AFE+I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258


>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
           [Arabidopsis thaliana]
          Length = 416

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 93/235 (39%), Positives = 137/235 (58%), Gaps = 12/235 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
           F  +  K  KTY ++EE   R ++FK N     +  L+ + T    +  F+DLT  EF+ 
Sbjct: 30  FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 89

Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
             LGL+        A K   L  +  +P   DWR  GAVT VKDQG+CG+CWSFSATGA+
Sbjct: 90  SRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAM 149

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG + + TG+L+SLSEQ+L+DCD         S ++GCNGGLM+ AFE+++K  G++ EK
Sbjct: 150 EGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFEFVIKNHGIDTEK 201

Query: 231 DYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
           DYPY   D G+CK DK K     + +++ + S++++     V   P++  +   E
Sbjct: 202 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 255


>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 89/256 (34%), Positives = 143/256 (55%), Gaps = 17/256 (6%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVHGVTK 100
           S D ++ A +   L K    K+Y    E + RF++FK N L   ++    D +   G+ +
Sbjct: 35  STDDVIMAAYESWLVK--HGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNR 92

Query: 101 FSDLTPSEFRRQFLGL---NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
           F+DLT  E+R ++ G+   + R ++   +Q+   L    LP   DWR+HGAV  VKDQG 
Sbjct: 93  FADLTNEEYRSKYTGIRTKDSRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQ 152

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FS   A+EG + ++TG+L++LSEQ+LVDCD         S + GCNGGLM+ AF
Sbjct: 153 CGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDR--------SYNEGCNGGLMDDAF 204

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           ++I+  GG++ + DYPYTG DG   ++ K+     + ++  +   +++       + P++
Sbjct: 205 QFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPIS 264

Query: 278 GNVASIELPHISFSFL 293
               +IE     F F 
Sbjct: 265 ---VAIEASGRDFQFY 277


>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 101/248 (40%), Positives = 128/248 (51%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
            +  FK+   KTY +  E   RF++F  N L  AK         V    G+ +F DL   
Sbjct: 26  QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF R F G +   R    +   P    ND  LP   DWR  GAVT VKDQG CGSCW+FS
Sbjct: 86  EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATG+LEG HFL  GELVSLSEQ LVDC            ++GC GGLM  AF+YI    G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
           ++ EK YPY   D G C+F K  + A  + +  I +  ED +   +   GP++    +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPIS---VAID 253

Query: 285 LPHISFSF 292
             H SF  
Sbjct: 254 ASHSSFQL 261


>gi|11464866|gb|AAG35358.1|AF314930_1 cruzipain [Trypanosoma cruzi]
          Length = 467

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P   +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC+GGLMN+AFE+I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVGLPQDEAQIAAWLAVNGPVA 258


>gi|290999038|ref|XP_002682087.1| predicted protein [Naegleria gruberi]
 gi|284095713|gb|EFC49343.1| predicted protein [Naegleria gruberi]
          Length = 349

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 99/276 (35%), Positives = 136/276 (49%), Gaps = 56/276 (20%)

Query: 39  GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-------- 90
           G   E   LN   +F  FK  + K YAT+EEH  R+++F  N+    +  ++        
Sbjct: 4   GAYDEKEALN---YFQHFKKLYLKRYATEEEHHRRWKIFYDNINLVNQLNIMHKPNEIAG 60

Query: 91  DPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK----APILP-----TNDLPTDF 141
            P A +G+T+F D++P+EF R  L       LP   QK     P  P      + LP  F
Sbjct: 61  KPVAQYGITQFMDMSPNEFARVKL-------LPPTKQKDINHTPTAPKEKYQIDALPESF 113

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWR+HGAVT VKDQ +CGSCW+FS    +EGA+FL+   L   S QQLVDCD        
Sbjct: 114 DWREHGAVTAVKDQASCGSCWAFSTVENIEGAYFLAGHNLTKFSPQQLVDCD-------- 165

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG--------------------S 241
            + + GC GG    A +YI K GG+  E  YPY     G                    +
Sbjct: 166 -NLNCGCFGGFPFIAMQYIQKRGGLATESSYPYCIPPLGNCFPCNTNKTYCPSGEYCNRT 224

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           C     ++ A V+ +  +S +ED +AA LVK+GPL+
Sbjct: 225 CSVQNYQLVAKVAGYENVSQNEDDIAAYLVKNGPLS 260


>gi|118350314|ref|XP_001008438.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89290205|gb|EAR88193.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 389

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 100/284 (35%), Positives = 152/284 (53%), Gaps = 34/284 (11%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGVTKFSD 103
           +L   +  FS FK++  K Y   EE   RF +F+ NL   ++  Q+ + TA +G+T+FSD
Sbjct: 32  NLTQVKQLFSKFKAEHKKFYNFLEEQR-RFEIFRQNLDIISELNQVEEGTAEYGITQFSD 90

Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILP-TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           +T  EF+ Q L  +   R    ++       + D PT +DWRDHGAVT VK+QG  G+CW
Sbjct: 91  MTTEEFKSQILIPSTYARNFTGSRYHGFQKISQDAPTSYDWRDHGAVTPVKNQGTVGTCW 150

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FS TG +EG  FL+   LVSLSE+Q+VDCD   +P  +G  D G  GG    AF+Y++ 
Sbjct: 151 TFSTTGNIEGQWFLAGNPLVSLSEEQIVDCDGSQEP-STGHADCGVFGGWPYLAFDYVIN 209

Query: 223 AGGVEREKDYPYTGTDGGS--------------------------CKFDKSKIAAAVSNF 256
           AGG+  E+ YPY   +GG                           C+  +  IAA + ++
Sbjct: 210 AGGLPSEETYPYCVGNGGCYPCPAPGYNETLCGPAVPYCNATAYPCRQGQVPIAAKIEDW 269

Query: 257 SVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSPK 300
             +S DED +   L + GPL+    +++  ++ F +   +S+PK
Sbjct: 270 KALSKDEDSIKQQLFEIGPLS---VALDASYLQF-YKKGISAPK 309


>gi|77379397|gb|ABA71355.1| cysteine protease [Brassica napus]
          Length = 359

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 105/279 (37%), Positives = 149/279 (53%), Gaps = 20/279 (7%)

Query: 3   RLILSSLLLLLLSSV-LASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLF 56
           R IL S+ LL+L +V  A ++   + +  IR V     + E+S   +L    H   F+ F
Sbjct: 5   RTILPSVALLILIAVSTAESIGFYESNP-IRMVFDRLLEVEESVVQILGQTRHVLSFARF 63

Query: 57  KSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL 116
             ++ K Y   EE   RF +FK NL   +       +   GV +F+D+T  EF+R  LG 
Sbjct: 64  THRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFTDMTWQEFQRTKLGA 123

Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
            +     A  +    L    LP   DWR+ G V+ VKDQG CGSCW+FS TGALE A+  
Sbjct: 124 AQNC--SATLKGTHKLTGEALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQ 181

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
           + G+ +SLSEQQLVDC    +       + GCNGGL + AFEYI   GG++ E+ YPYTG
Sbjct: 182 AFGKGISLSEQQLVDCAGAFN-------NYGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 234

Query: 237 TDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
            D G+CK+    +   V    N ++ + DE + A  L++
Sbjct: 235 ED-GTCKYSAENVGVQVLDSVNITLGAEDELKHAVGLLR 272


>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
          Length = 324

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 98/248 (39%), Positives = 135/248 (54%), Gaps = 24/248 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDP--TAVHGVTKFSDLTPSE 108
           F  FK ++ + YAT +E  YR  V+  N+    A   Q  +   T +  + +F D+T  E
Sbjct: 22  FHQFKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGDMTNEE 81

Query: 109 FRRQFLGLNRRLRLPA-DAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
                 GL     LPA +++   +L   D  LP + DWR  GAVT VKDQ ACGSCW+FS
Sbjct: 82  INAVMNGL-----LPASESRGVAVLGGRDDTLPAEVDWRTKGAVTPVKDQKACGSCWAFS 136

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATG+LEG HFL  G+LVSLSEQ LVDC        +   D GC GGLM+ AF YI   GG
Sbjct: 137 ATGSLEGQHFLKDGKLVSLSEQNLVDC-------STKQGDHGCGGGLMDFAFTYIKDNGG 189

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLAGNVASIE 284
           ++ E  YPY  TD G C+++ +   A V+ +  +  D ED +   +   GP++    +I+
Sbjct: 190 IDTEASYPYEATD-GKCQYNPANSGATVTGYVDVEHDSEDALQKAVATIGPIS---VAID 245

Query: 285 LPHISFSF 292
               +F F
Sbjct: 246 ASRSTFHF 253


>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 101/248 (40%), Positives = 129/248 (52%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
            +  FK+   KTY +  E   RF++F  N L  AK         V    G+ +F DL   
Sbjct: 26  QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF R F G +   R    +   P    ND  LP   DWR  GAVT VKDQG CGSCW+FS
Sbjct: 86  EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATG+LEG HFL  GELVSLSEQ LVDC            ++GC GGLM  AF+YI +  G
Sbjct: 145 ATGSLEGRHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKENDG 197

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
           ++ EK YPY   D G C+F K  + A  + +  I +  ED +   +   GP++    +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPIS---VAID 253

Query: 285 LPHISFSF 292
             H SF  
Sbjct: 254 ASHSSFQL 261


>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 105/292 (35%), Positives = 158/292 (54%), Gaps = 23/292 (7%)

Query: 11  LLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEH 70
           L L ++ L+ +VA + D +++    P D E S D L+     F  + S F K Y T EE 
Sbjct: 14  LALSAATLSLSVAASHDYSIV-GYSPEDLE-SHDKLIEL---FENWISNFEKAYETVEEK 68

Query: 71  DYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
             RF VFK NL+          +   G+ +F+DL+  EF++ +LGL   +    + +   
Sbjct: 69  LLRFEVFKDNLKHIDETNKKVKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYA 128

Query: 131 ILPTNDL---PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
                D+   P   DWR  GAV  VK+QG+CGSCW+FS   A+EG + + TG L +LSEQ
Sbjct: 129 EFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQ 188

Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF--D 245
           +L+DCD         + ++GCNGGLM+ AFEYI+K GG+ +E+DYPY+  + G+C+   D
Sbjct: 189 ELIDCDT--------TYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPYS-MEEGTCEMQKD 239

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVS 297
           +S+      +  V ++DE  +   L  H PL+    +I+     F F   VS
Sbjct: 240 ESETVTIDGHQDVPTNDEKSLLKALA-HQPLS---VAIDASGREFQFYSGVS 287


>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
 gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
 gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
          Length = 466

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 96/278 (34%), Positives = 156/278 (56%), Gaps = 26/278 (9%)

Query: 8   SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSKTY 64
           S+LL+L+ S L+SA     D ++I        +++  H    +   +L++S   +  K+Y
Sbjct: 11  SILLMLIFSTLSSA----SDMSIISY------DETHIHRRTDDEVSALYESWLIEHGKSY 60

Query: 65  ATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RL 120
               E D RF++FK NLR   ++  + + +   G+TKF+DLT  E+R  +LG      R 
Sbjct: 61  NALGEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRK 120

Query: 121 RLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
           +L  +     +    D LP   DWR+ G + GVKDQG+CGSCW+FSA  A+E  + + TG
Sbjct: 121 KLSKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTG 180

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
            L+SLSEQ+LVDCD         S + GC+GGLM+ AFE+++K GG++ E+DYPY   +G
Sbjct: 181 NLISLSEQELVDCDR--------SYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNG 232

Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
              ++ K+     + ++  +  + ++     V H P++
Sbjct: 233 VCDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVS 270


>gi|71406896|ref|XP_805951.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70869552|gb|EAN84100.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 426

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 95/237 (40%), Positives = 122/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P   +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC+GGLMN+AFE+I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNNAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258


>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
 gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
 gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
 gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
          Length = 437

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 93/235 (39%), Positives = 137/235 (58%), Gaps = 12/235 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
           F  +  K  KTY ++EE   R ++FK N     +  L+ + T    +  F+DLT  EF+ 
Sbjct: 32  FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91

Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
             LGL+        A K   L  +  +P   DWR  GAVT VKDQG+CG+CWSFSATGA+
Sbjct: 92  SRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAM 151

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG + + TG+L+SLSEQ+L+DCD         S ++GCNGGLM+ AFE+++K  G++ EK
Sbjct: 152 EGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFEFVIKNHGIDTEK 203

Query: 231 DYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
           DYPY   D G+CK DK K     + +++ + S++++     V   P++  +   E
Sbjct: 204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257


>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 93/235 (39%), Positives = 137/235 (58%), Gaps = 12/235 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
           F  +  K  KTY ++EE   R ++FK N     +  L+ + T    +  F+DLT  EF+ 
Sbjct: 32  FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91

Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
             LGL+        A K   L  +  +P   DWR  GAVT VKDQG+CG+CWSFSATGA+
Sbjct: 92  SRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAM 151

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG + + TG+L+SLSEQ+L+DCD         S ++GCNGGLM+ AFE+++K  G++ EK
Sbjct: 152 EGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFEFVIKNHGIDTEK 203

Query: 231 DYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
           DYPY   D G+CK DK K     + +++ + S++++     V   P++  +   E
Sbjct: 204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257


>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
          Length = 337

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 100/253 (39%), Positives = 138/253 (54%), Gaps = 21/253 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLT 105
           +  +  FK   +K Y ++ E  +R ++F  N    AK  +L     V    G+ K++D+ 
Sbjct: 24  QEQWGAFKMTHNKQYQSETEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADML 83

Query: 106 PSEFRRQFLGLNRR---LRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGS 160
             EF +   G NR    LR          LP  +  LP   DWRD GAVT VKDQG CGS
Sbjct: 84  HHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGS 143

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CWSFSATG+LEG HF  +G+LVSLSEQ LVDC      E+ G  ++GCNGGLM++AF YI
Sbjct: 144 CWSFSATGSLEGQHFRQSGKLVSLSEQNLVDC-----SEKFG--NNGCNGGLMDNAFRYI 196

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
              GG++ E+ YPY   D   C +  K+K A       + S +ED++ + +   GP++  
Sbjct: 197 KANGGIDTEQAYPYKAED-EKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVS-- 253

Query: 280 VASIELPHISFSF 292
             +I+  H SF  
Sbjct: 254 -VAIDASHQSFQL 265


>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 95/284 (33%), Positives = 150/284 (52%), Gaps = 22/284 (7%)

Query: 12  LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSKTYATQE 68
           L+LS+ L    A+  D +++          S +HL + +    LF+S   K SKTY + E
Sbjct: 11  LILSATLFITYAIAHDFSIVGY--------SPEHLASMDKTIELFESWMSKHSKTYRSIE 62

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK 128
           E  +RF +F  NL+          +   G+ +F+DL+  EF+ ++LGL         ++ 
Sbjct: 63  EKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFPRKRSSRG 122

Query: 129 APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
                  DLP   DWR  GAVT VK+QG+CGSCW+FS   A+EG + + TG L SLSEQ+
Sbjct: 123 FSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQE 182

Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK 248
           L+DCD         S ++GC GGLM+ AF+YI+   G+ +E+DYPY   +G   +  +  
Sbjct: 183 LIDCDR--------SFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQF 234

Query: 249 IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
               +S +  + ++++Q     + H P++    +IE    +F F
Sbjct: 235 EVVTISGYEDVPANDEQSLLKALSHQPVS---VAIEASSRNFQF 275


>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 324

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 99/248 (39%), Positives = 138/248 (55%), Gaps = 22/248 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  +K+    +YAT  E   R  +++ANL   ++      +    V KF+DLT  EF  +
Sbjct: 22  FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHSYKLAVNKFADLTYPEFAAK 81

Query: 113 FLGLNRRLRLPA-DAQKA----PILP-TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           +LGL    R  A +A K+      LP    LP   DWR  G VT +KDQG CGSCWSFS 
Sbjct: 82  YLGL----RFDATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFST 137

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TG++EG H   TG+LVSLSEQ LVDC        S   ++GCNGGLM+ AF+YI+   G+
Sbjct: 138 TGSVEGQHARKTGQLVSLSEQNLVDC-------SSAQGNAGCNGGLMDQAFQYIISNNGI 190

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIEL 285
           + E  YPYT  D G+C+F+ + + A V+++  I+S  +    N V   GP++    +I+ 
Sbjct: 191 DTESSYPYTAQD-GTCQFNSANVGATVASYQDIASGSESDLQNAVATVGPIS---VAIDA 246

Query: 286 PHISFSFL 293
              SF F 
Sbjct: 247 SQPSFQFY 254


>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 463

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 99/286 (34%), Positives = 147/286 (51%), Gaps = 25/286 (8%)

Query: 1   MERLILSSLLLLL----LSSVLASAVAVNDDDAMIRQVVP-SDGEQSEDHLLNAEHHFSL 55
           M  L LS ++LLL    +S  +  ++   D++  I  V   SD E         E  +  
Sbjct: 1   MGFLKLSPMILLLAMIGVSYAIDMSIISYDENHHISTVSSRSDAE--------VERIYEA 52

Query: 56  FKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           +  +  K    Q     E D RF +FK NLR        + +   G+T+F+DLT  E+R 
Sbjct: 53  WMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTKNLSYKLGLTRFADLTNDEYRS 112

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
            +LG     R+   + +      + LP   DWR  GAV  VKDQG+CGSCW+FS  GA+E
Sbjct: 113 MYLGAKPVKRVLKTSDRYEARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVE 172

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           G + + TG+L+SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+K GG++ E D
Sbjct: 173 GINKIVTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEAD 224

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   DG   +  K+     + ++  +  + +      + H P++
Sbjct: 225 YPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPIS 270


>gi|195343593|ref|XP_002038380.1| GM10654 [Drosophila sechellia]
 gi|194133401|gb|EDW54917.1| GM10654 [Drosophila sechellia]
          Length = 615

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 86/232 (37%), Positives = 135/232 (58%), Gaps = 15/232 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSE 108
           +H F  F+ +F + Y +  E   R R+F+ NL+  +     +  +A +G+T+F+D+T SE
Sbjct: 306 DHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSE 365

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           ++ +  GL +R    A    A ++P    +LP +FDWR   AVT VK+QG+CGSCW+FS 
Sbjct: 366 YKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSV 424

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TG +EG + + TGEL   SEQ+L+DCD         + DS CNGGLM++A++ I   GG+
Sbjct: 425 TGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGL 475

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
           E E +YPY       C F+++     V+ F  +   +E  M   L+ +GP++
Sbjct: 476 EYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGPIS 526


>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 99/250 (39%), Positives = 136/250 (54%), Gaps = 20/250 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           + H+ L+KS  +K Y  +EE  +R  V++ NL++ +   L      H    G+  F D+T
Sbjct: 25  DEHWDLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMT 83

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
             EFR+   G  R+       + +  +  N L  P   DWRD+G VT VKDQG CGSCW+
Sbjct: 84  HEEFRQIMNGYKRKSE--RKFKGSLFMEPNFLEAPRSVDWRDNGYVTPVKDQGQCGSCWA 141

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TGA+EG HF  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+YI   
Sbjct: 142 FSTTGAMEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDN 194

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
            G++ E  YPY GTD   C +D    +A  + F  + S  E  +   +   GP++    +
Sbjct: 195 QGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVS---VA 251

Query: 283 IELPHISFSF 292
           I+  H SF F
Sbjct: 252 IDAGHESFQF 261


>gi|8468607|gb|AAF75547.1| cruzipain [Trypanosoma cruzi]
          Length = 467

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 95/237 (40%), Positives = 121/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EF  
Sbjct: 37  QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFWS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P + +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC GGLMN+AFE+I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    I  DE Q+AA L  +GP+A
Sbjct: 202 NGAVYTEGSYPYASGEGISPPCTTSGHTVGATITGHVEIPQDEAQIAAWLAVNGPVA 258


>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
 gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
          Length = 339

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 110/296 (37%), Positives = 155/296 (52%), Gaps = 44/296 (14%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           +L+LL + +A+A AV+     + ++V  +              ++ FK +  K Y ++ E
Sbjct: 3   ILILLMAFVAAANAVS-----LYELVKEE--------------WNAFKLQHRKNYDSETE 43

Query: 70  HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNR---RLRL 122
              R +++  N  + AK  Q  D         V K++DL   EF +   G NR   +  L
Sbjct: 44  ERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVNGFNRTDSKKSL 103

Query: 123 PADAQKAPIL---PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
                + P+    P N ++PT  DWR  GAVT VKDQG CGSCWSFSATGALEG HF  T
Sbjct: 104 KGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKT 163

Query: 179 GELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
           G+LVSLSEQ LVDC        SG   ++GCNGG+M+ AF+YI   GG++ EK YPY   
Sbjct: 164 GKLVSLSEQNLVDC--------SGKYGNNGCNGGMMDYAFQYIKDNGGIDTEKSYPYEAI 215

Query: 238 DGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           D  +C F+   + A    +  I   DE+ +   L   GP++    +I+  H SF F
Sbjct: 216 D-DTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVS---IAIDASHESFQF 267


>gi|24644155|ref|NP_730901.1| CG12163, isoform A [Drosophila melanogaster]
 gi|32699625|sp|Q9VN93.2|CPR1_DROME RecName: Full=Putative cysteine proteinase CG12163; Flags:
           Precursor
 gi|23170427|gb|AAF52055.2| CG12163, isoform A [Drosophila melanogaster]
 gi|27819876|gb|AAO24986.1| LP08529p [Drosophila melanogaster]
          Length = 614

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 86/232 (37%), Positives = 135/232 (58%), Gaps = 15/232 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSE 108
           +H F  F+ +F + Y +  E   R R+F+ NL+  +     +  +A +G+T+F+D+T SE
Sbjct: 305 DHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSE 364

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           ++ +  GL +R    A    A ++P    +LP +FDWR   AVT VK+QG+CGSCW+FS 
Sbjct: 365 YKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSV 423

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TG +EG + + TGEL   SEQ+L+DCD         + DS CNGGLM++A++ I   GG+
Sbjct: 424 TGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGL 474

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
           E E +YPY       C F+++     V+ F  +   +E  M   L+ +GP++
Sbjct: 475 EYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPIS 525


>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
 gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 452

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 97/248 (39%), Positives = 138/248 (55%), Gaps = 20/248 (8%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M   I S  L LL+ SVL  ++++         V  ++  ++E     A   +  +  + 
Sbjct: 1   MATSIKSITLALLIFSVLLISLSLG-------SVTATETTRNEAE---ARRMYERWLVEN 50

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFL-GLNR 118
            K Y    E + RF +FK NL+  +    + + T   G+T+F+DLT  EFR  +L     
Sbjct: 51  RKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKME 110

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
           R R+P   +K      + LP   DWR  GAV  VKDQG+CGSCW+FSA GA+EG + + T
Sbjct: 111 RTRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKT 170

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GEL+SLSEQ+LVDCD         S + GC GGLM+ AF++I++ GG++ E+DYPY  TD
Sbjct: 171 GELISLSEQELVDCDT--------SYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATD 222

Query: 239 GGSCKFDK 246
              C  DK
Sbjct: 223 VNVCNSDK 230


>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
          Length = 343

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 108/261 (41%), Positives = 142/261 (54%), Gaps = 30/261 (11%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRR---QLLDPTAVHGVTKF 101
           L+N E  +  FK +  K Y  + E   R +++  N L+ A+     +L   T    + K+
Sbjct: 23  LVNQE--WINFKMEHKKCYKHEAEERLRMKIYMKNKLQIAQHNCDYELKKVTYRLKINKY 80

Query: 102 SDLTPSEFRRQFLGLNRRL-------RLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVK 153
            D+   EF+    G NR +       RLP  A  A I P N +LP   DWR  GAVT VK
Sbjct: 81  GDMLNHEFKNMLNGYNRTINHTLRNERLPVGA--AFIEPCNVELPKMVDWRKCGAVTEVK 138

Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGL 212
           DQG CGSCW+FSATG+LEG HF  TG LVSLSEQ L+DC        SGS  ++GCNGGL
Sbjct: 139 DQGHCGSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDC--------SGSYGNNGCNGGL 190

Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLV 271
           M+ AF YI    G++ EK YPY G D   C++DK    A+   F  I   DE ++ A + 
Sbjct: 191 MDQAFSYIKDNKGLDTEKTYPYEGED-DKCRYDKRSSGASDVGFVDIPVGDEQKLKAAVA 249

Query: 272 KHGPLAGNVASIELPHISFSF 292
             GP++    +I+  H SF F
Sbjct: 250 TVGPVS---VAIDASHQSFQF 267


>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 97/287 (33%), Positives = 155/287 (54%), Gaps = 18/287 (6%)

Query: 7   SSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT 66
           S  L L  S  L +++AV  D +++     S+  +S D L+     F  + S+  K Y +
Sbjct: 6   SKALFLACSFCLFASLAVAGDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYQS 60

Query: 67  QEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA 126
            EE  +RF +FK NL+    R  +      G+ +F+DL+  EF+ ++LGL        ++
Sbjct: 61  IEEKLHRFDIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRRES 120

Query: 127 QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
            +       +LP   DWR  GAVT VK+QG+CGSCW+FS   A+EG + + TG L SLSE
Sbjct: 121 PEEFTYKDFELPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSE 180

Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
           Q+L+DCD         + ++GCNGGLM+ AF +I++ GG+ +E+DYPY   + G+C+  K
Sbjct: 181 QELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGTCEMTK 231

Query: 247 SKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
            +     +S +  +  + +Q     + + PL+    +IE     F F
Sbjct: 232 EETEVVTISGYHDVPQNNEQSLLKALVNQPLS---VAIEASGRDFQF 275


>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
          Length = 437

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 87/201 (43%), Positives = 124/201 (61%), Gaps = 18/201 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFR 110
           ++ +  K  K+Y    E + RF++FK NLR        DP   +  G+ +F+DLT  E+R
Sbjct: 49  YNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNA-DPDRSYELGLNRFADLTNEEYR 107

Query: 111 RQFLGLNRRLRLPADAQK-----APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
            ++LG   R   P  ++      AP+    +LP   DWR+ GAV  VKDQG+CGSCW+FS
Sbjct: 108 AKYLGTKSRESRPKLSKGPSDRYAPV-EGEELPDSIDWREKGAVAAVKDQGSCGSCWAFS 166

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           A GA+EG + ++TGEL++LSEQ+LVDCD         S + GC GGLM+ AF +I+K GG
Sbjct: 167 AIGAVEGINQITTGELITLSEQELVDCDR--------SYNEGCEGGLMDYAFNFIIKNGG 218

Query: 226 VEREKDYPYTGTDGGSCKFDK 246
           ++ + DYPYTG D G+C  +K
Sbjct: 219 IDSDLDYPYTGRD-GTCNQNK 238


>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
          Length = 339

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 102/253 (40%), Positives = 137/253 (54%), Gaps = 25/253 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           ++ FK +  K Y ++ E   R +++  N  + AK  Q  D         V K++DL   E
Sbjct: 27  WNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEE 86

Query: 109 FRRQFLGLNR---RLRLPADAQKAPIL---PTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
           F +   G NR   +  L     + P+    P N ++PT  DWR  GAVT VKDQG CGSC
Sbjct: 87  FVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSC 146

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYI 220
           WSFSATGALEG HF  TG+LVSLSEQ LVDC        SG   ++GCNGG+M+ AF+YI
Sbjct: 147 WSFSATGALEGQHFRKTGKLVSLSEQNLVDC--------SGKYGNNGCNGGMMDYAFQYI 198

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGN 279
              GG++ EK YPY   D  +C F+   + A    +  I   DE+ +   L   GP++  
Sbjct: 199 KDNGGIDTEKSYPYEAID-DTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVS-- 255

Query: 280 VASIELPHISFSF 292
             +I+  H SF F
Sbjct: 256 -IAIDASHESFQF 267


>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 460

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 82/186 (44%), Positives = 115/186 (61%), Gaps = 11/186 (5%)

Query: 68  EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
           EE D RF +FK NLR        + +   G+T+F+DLT  E+R  +LG   + R+   + 
Sbjct: 68  EEKDQRFEIFKDNLRFIDEHNNKNLSYKLGLTRFADLTNEEYRSIYLGAKSKKRVLKTSD 127

Query: 128 KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
           +      + +P   DWR  GAV  VKDQG+CGSCW+FS  GA+EG + + TG+L+SLSEQ
Sbjct: 128 RYQPRVGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQ 187

Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
           +LVDCD         S + GCNGGLM+ AFE+I+K GG++ E+DYPY   DG   + D++
Sbjct: 188 ELVDCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADG---RCDQT 236

Query: 248 KIAAAV 253
           +  A V
Sbjct: 237 RKNAKV 242


>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
 gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
          Length = 356

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 113/288 (39%), Positives = 148/288 (51%), Gaps = 20/288 (6%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
           RL   S LLL+LS  +A +V   DD   IR V     + E     +L    H   F+ F 
Sbjct: 4   RLFFVSSLLLVLSCAVAGSVF--DDSNPIRMVSDRLRELELEVVRVLGQVPHALRFARFA 61

Query: 58  SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
            ++ K Y T EE   RF +F  +L   K       +   GV +F+D T  EFR+  LG  
Sbjct: 62  HRYGKKYETAEEMKLRFGIFLESLELIKSTNKQGLSYKLGVNQFADWTWEEFRKHRLGAA 121

Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
           +     A  + +  L    LP   DWR  G V+ VKDQG CGSCW+FS TGALE A+  +
Sbjct: 122 QNC--SATTKGSHKLTDTALPESKDWRKDGIVSPVKDQGHCGSCWTFSTTGALEAAYAQA 179

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
            G+ +SLSEQQLVDC         G  + GCNGGL + AFEYI   GG++ E+ YPYTG 
Sbjct: 180 HGKGISLSEQQLVDCGR-------GFNNFGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGV 232

Query: 238 DGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAGNVAS 282
           D GSCKF    +   V    N ++ + DE + A   V+   +A  V S
Sbjct: 233 D-GSCKFVPENVGVQVIDSVNITLGAEDELKHAVAFVRPVSVAFEVVS 279


>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 101/248 (40%), Positives = 128/248 (51%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
            +  FK+   KTY +  E   RF++F  N L  AK         V    G+ +F DL   
Sbjct: 26  QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF R F G +R  R    +   P    ND  LP   DWR  GAVT VKDQG CGSCW+FS
Sbjct: 86  EFARIFNG-HRGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATG+LEG HFL  GELVSLSEQ LVDC            ++GC GGLM  AF+YI    G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
           ++ EK YPY   D G C+F K  + A  + +  I +  E  +   +   GP++    +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPIS---VAID 253

Query: 285 LPHISFSF 292
             H SF  
Sbjct: 254 ASHSSFQL 261


>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
          Length = 333

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 98/256 (38%), Positives = 133/256 (51%), Gaps = 17/256 (6%)

Query: 29  AMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRR 87
           A++  +V +    +   +L  E  +  FKS   KTY +  E   RF++F  N L  AK  
Sbjct: 5   ALLCAIVAAATAATSQEILRTE--WEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHN 62

Query: 88  QLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFD 142
                  V    G+ +F+DL P EF +   G   +      +   P    ND  LP   D
Sbjct: 63  VKYAKGLVSYKLGINQFADLLPHEFVKMMNGYQGKRLAGRGSTYLPPANLNDSSLPKTVD 122

Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
           WR  GAVT VKDQG CGSCW+FS+TG+LEG HFL TG+LVSLSEQ LVDC        S 
Sbjct: 123 WRKKGAVTPVKDQGQCGSCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDC-------SSA 175

Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISS 261
             + GCNGGLM+++F YI   GG++ E  YPY   D G C++ K  + A  + F  +   
Sbjct: 176 YGNQGCNGGLMDNSFNYIKANGGIDTEDSYPYEAED-GDCRYKKEDVGATDTGFVDIKEG 234

Query: 262 DEDQMAANLVKHGPLA 277
            E  +   +   GP++
Sbjct: 235 SEKDLQKAVATVGPVS 250


>gi|1222694|gb|AAA92018.1| CP5 [Dictyostelium discoideum]
          Length = 344

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 89/234 (38%), Positives = 126/234 (53%), Gaps = 16/234 (6%)

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           K+Y T EE   R+ +F AN+   ++        V G+  F+D+T  E+R  +LG      
Sbjct: 39  KSY-TSEEFGARYNIFTANMDYVQQWNSKGSETVLGLNNFADITNEEYRNTYLGTKFDAS 97

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
                Q+  +  TN      DWR  GAVT VK+QG CG CWSFS TG+ EGAHF S GEL
Sbjct: 98  SLIGTQEEKV-HTNSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGEL 156

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQ L+DC  E         +SGC+GGLM  AFEYI+   G++ E  YPY   + G 
Sbjct: 157 VSLSEQNLIDCSTE---------NSGCDGGLMTYAFEYIINNNGIDTESSYPYKA-ENGK 206

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
           C++      A +S++  +++  +    + V   P++    +I+  H SF  L+T
Sbjct: 207 CEYKSENSGATLSSYKTVTAGSESSLESAVNVNPVS---VAIDASHQSFQ-LYT 256


>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 463

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 82/209 (39%), Positives = 119/209 (56%), Gaps = 8/209 (3%)

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK 128
           E D RF +FK NLR        + +   G+T+F+DLT  E+R  +LG     R+   + +
Sbjct: 70  EKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEYRSMYLGAKPTKRVLKTSDR 129

Query: 129 APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
                 + LP   DWR  GAV  VKDQG+CGSCW+FS  GA+EG + + TG+L+SLSEQ+
Sbjct: 130 YQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQE 189

Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK 248
           LVDCD         S + GCNGGLM+ AFE+I+K GG++ E DYPY   DG   +  K+ 
Sbjct: 190 LVDCDT--------SYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQNRKNA 241

Query: 249 IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
               + ++  +  + +      + H P++
Sbjct: 242 KVVTIDSYEDVPENSEASLKKALAHQPIS 270


>gi|351693703|gb|AEQ59229.1| cysteine protease precursor [Clonorchis sinensis]
          Length = 327

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 97/239 (40%), Positives = 141/239 (58%), Gaps = 13/239 (5%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTP 106
           NA   +  FK K+ K+Y+  ++ +YRFRVFK NL R K+ Q ++  TA +GVT+FSDLT 
Sbjct: 26  NARQLYEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTA 84

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            EF+ ++L  ++   +P D +  P +  +    +FDWR+HGAV  V D+G CGSCW+FSA
Sbjct: 85  QEFKVRYL-RSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDKGDCGSCWAFSA 143

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
            G +EG  F  T  L+ LSEQQL+DCD           D GCNGG    AF+ IL  GG+
Sbjct: 144 VGNIEGQWFRKTDNLLQLSEQQLLDCDE---------VDEGCNGGTPQQAFKQILGMGGL 194

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
           + + DYPY G + G C+   SK+   ++   ++  DE   A  L + GP +  + ++ L
Sbjct: 195 QLDSDYPYEGRE-GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPFSSALNALSL 252


>gi|225579644|gb|ACN93991.1| cathepsin L [Dicentrarchus labrax]
          Length = 316

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 98/248 (39%), Positives = 136/248 (54%), Gaps = 20/248 (8%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+ L+KS  +K Y  +EE  +R  V++ NL++ +   L      H    G+  F D+T  
Sbjct: 23  HWDLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGKHTYRLGMNHFGDMTHE 81

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EFR+   G   +L+       +  +  N L  P   DWRD+G VT VKDQG CGSCW+FS
Sbjct: 82  EFRQLMNGY--KLKAARKFSGSLFMEPNFLEAPRSVDWRDNGYVTPVKDQGQCGSCWAFS 139

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGALEG HF  +G+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+Y+    G
Sbjct: 140 TTGALEGQHFRKSGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDNQG 192

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E  YPY GTD   C +D +  +A  + F  + S  E  +   +   GP++    +I+
Sbjct: 193 LDSEDSYPYLGTDDQPCHYDPNYNSANDTGFVDIPSGKEHALMKAVAAVGPVS---VAID 249

Query: 285 LPHISFSF 292
             H SF F
Sbjct: 250 AGHESFQF 257


>gi|8468605|gb|AAF75546.1| cruzipain [Trypanosoma cruzi]
          Length = 467

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 94/237 (39%), Positives = 121/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P + +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVNVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           DSGC GGLMN+AF +I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DSGCGGGLMNNAFGWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 202 NGAVYTENSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258


>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
          Length = 344

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 115/300 (38%), Positives = 151/300 (50%), Gaps = 47/300 (15%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           L LLL S LA+A AV+     I  +V  +              ++ FK +  K Y ++ E
Sbjct: 3   LFLLLVSFLAAANAVS-----IFNLVKEE--------------WNAFKLQHRKKYDSESE 43

Query: 70  HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNRRL----R 121
              R +++  N  + AK  Q  D         V K++DL   EF     G NR      +
Sbjct: 44  ERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSAAAGSK 103

Query: 122 LPADAQ----KAPIL---PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
           L    Q    + PI    P N D+PT  DWR+ GAVT VKDQG CGSCWSFSATGALEG 
Sbjct: 104 LLGREQLMTIEEPITWIEPANVDVPTTIDWREKGAVTPVKDQGHCGSCWSFSATGALEGQ 163

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
           HF  TG+LVSLSEQ LVDC        +   ++GCNGGLM++AF+Y+    G++ EK YP
Sbjct: 164 HFRKTGKLVSLSEQNLVDCS-------TKYGNNGCNGGLMDNAFQYVKDNKGIDTEKAYP 216

Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           Y   D   C ++   I A    F  I   DE  +   L   GP++    +I+  H SF F
Sbjct: 217 YEAID-DECHYNPKAIGATDKGFVDIPQGDEKALKKALATVGPVS---VAIDASHESFQF 272


>gi|111226635|ref|XP_641720.2| cysteine proteinase [Dictyostelium discoideum AX4]
 gi|38372247|sp|Q94504.1|CYSP7_DICDI RecName: Full=Cysteine proteinase 7; AltName: Full=Proteinase 1;
           Flags: Precursor
 gi|1644502|gb|AAC47482.1| cysteine proteinase [Dictyostelium discoideum]
 gi|90970688|gb|EAL67742.2| cysteine proteinase [Dictyostelium discoideum AX4]
          Length = 460

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 90/217 (41%), Positives = 122/217 (56%), Gaps = 22/217 (10%)

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           + EE + R+ +FKAN+             V G+  F+D++  E+R  +LG       P D
Sbjct: 42  SSEEFNGRYNIFKANMDYVNEWNTKGSETVLGLNVFADISNEEYRATYLGT------PFD 95

Query: 126 AQKAPILPTN---DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE-- 180
           A    +  ++   D     DWR  GAVT +K+QG CG CWSFS TGA EGA +L+ G+  
Sbjct: 96  ASSLEMTESDKIFDASAQVDWRTQGAVTPIKNQGQCGGCWSFSTTGATEGAQYLANGKKN 155

Query: 181 LVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
           LVSLSEQ L+DC        SGS  ++GC GGLM  AFEYI+   G++ E  YPYT  DG
Sbjct: 156 LVSLSEQNLIDC--------SGSYGNNGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDG 207

Query: 240 GSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGP 275
             CKF+   +AA +S++ +V S  E  +AA  V  GP
Sbjct: 208 KKCKFNPKNVAAQLSSYVNVTSGSESDLAAK-VTQGP 243


>gi|24644153|ref|NP_649521.1| CG12163, isoform B [Drosophila melanogaster]
 gi|23170426|gb|AAN13266.1| CG12163, isoform B [Drosophila melanogaster]
 gi|378548248|gb|AFC17498.1| FI18603p1 [Drosophila melanogaster]
          Length = 475

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 86/232 (37%), Positives = 135/232 (58%), Gaps = 15/232 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSE 108
           +H F  F+ +F + Y +  E   R R+F+ NL+  +     +  +A +G+T+F+D+T SE
Sbjct: 166 DHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSE 225

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           ++ +  GL +R    A    A ++P    +LP +FDWR   AVT VK+QG+CGSCW+FS 
Sbjct: 226 YKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSV 284

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TG +EG + + TGEL   SEQ+L+DCD         + DS CNGGLM++A++ I   GG+
Sbjct: 285 TGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKDIGGL 335

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
           E E +YPY       C F+++     V+ F  +   +E  M   L+ +GP++
Sbjct: 336 EYEAEYPYKAKK-NQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPIS 386


>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
          Length = 286

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 86/241 (35%), Positives = 132/241 (54%), Gaps = 13/241 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  + S+  K Y + EE   RF +FK NL+       +      G+ +F+DL+  EF++Q
Sbjct: 8   FESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVSNYWLGLNEFADLSHHEFKKQ 67

Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
           +LGL        ++ +       DLP   DWR  GAVT +K+QG+CGSCW+FS   A+EG
Sbjct: 68  YLGLKVDFSTRRESSEEFTYRDVDLPKSVDWRKKGAVTNIKNQGSCGSCWAFSTVAAVEG 127

Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
            + + TG L SLSEQ+L+DCD         + +SGCNGGLM+ AF +I++ GG+ +E DY
Sbjct: 128 INQIVTGNLTSLSEQELIDCDR--------TYNSGCNGGLMDYAFSFIVENGGLHKEDDY 179

Query: 233 PYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFS 291
           PY   + G+C+  K +     +S +  +  + +Q     + + PL+    +IE     F 
Sbjct: 180 PYI-MEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLS---VAIEASGRDFQ 235

Query: 292 F 292
           F
Sbjct: 236 F 236


>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
 gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
          Length = 456

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 94/270 (34%), Positives = 140/270 (51%), Gaps = 12/270 (4%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           +L+LL  V A + A +       Q   +      D  + A +   L K    K Y    E
Sbjct: 1   MLMLLFLVFALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKH--GKNYNALGE 58

Query: 70  HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL--NRRLRLPADAQ 127
            + RF +FK NL    +    + T   G+ +F+DLT  EFR  +LG     + RLP  + 
Sbjct: 59  KEKRFEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSD 118

Query: 128 KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
           +      + LP   DWR  GAV  VKDQG CGSCW+FS   A+EG + + TG+L++LSEQ
Sbjct: 119 RYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQ 178

Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
           +LVDCD         S + GCNGGLM+ AFE+I+  GG++ E DYPY G DG    + K+
Sbjct: 179 ELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKN 230

Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
               ++ ++  +  +++      V + P++
Sbjct: 231 AKVVSIDSYEDVPENDETALKKAVANQPVS 260


>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
 gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
          Length = 326

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 88/225 (39%), Positives = 123/225 (54%), Gaps = 12/225 (5%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
           +KS   K+Y+   E   R  +++ NL + KR    D +    +    DLT  EFR  +LG
Sbjct: 30  WKSYHGKSYSDVHEERTRMAIWQQNLEKIKRHNAEDHSYKMAMNHLGDLTEDEFRYFYLG 89

Query: 116 LNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
           +              + P+N  +P+  DW   G VTGVK+QG CGSCW+FS TG++EG H
Sbjct: 90  VRAHHNSTKRGWATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQH 149

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYP 233
           F  TG LVSLSEQ L+DC        SGS  ++GC GGLM++AF YI   GG++ E  YP
Sbjct: 150 FRKTGSLVSLSEQNLIDC--------SGSYGNNGCQGGLMDNAFRYIESNGGIDTESSYP 201

Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQ-MAANLVKHGPLA 277
           Y G   GSC F  S + A V+ +  I    +Q + + +   GP++
Sbjct: 202 YLGQQ-GSCHFSSSHVGARVTGYQDIPQGSEQALQSAVATVGPVS 245


>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
          Length = 361

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 107/278 (38%), Positives = 150/278 (53%), Gaps = 20/278 (7%)

Query: 5   ILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFK 57
           ++SS++LLL  +  ASA A + DD+   ++V SDG    E S   ++    H   F+ F 
Sbjct: 7   LVSSVILLLCCAAAASASASSFDDSNPIRLVSSDGLRDFETSVLQVIGQARHALSFARFA 66

Query: 58  SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
            ++ K Y + EE   RF  F  NL   +       +   G+ KF+D +  EF+R  LG  
Sbjct: 67  RRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLSYRLGLNKFADWSWEEFQRHRLGAA 126

Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
           +     A  +    L  + LP   DWR+ G V+ VKDQG CGSCW+FS TG+LE A+  +
Sbjct: 127 QNC--SATTKGNHKLTADVLPETKDWRESGIVSPVKDQGHCGSCWTFSTTGSLEAAYHQA 184

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
            G+ +SLSEQQLVDC    +       + GCNGGL + AFEYI   GG++ E+ YPYTG 
Sbjct: 185 FGKGISLSEQQLVDCAQAFN-------NQGCNGGLPSQAFEYIKYNGGLDTEEAYPYTGK 237

Query: 238 DGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
           D G CKF    +   V    N ++ + DE Q A  LV+
Sbjct: 238 D-GVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR 274


>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
          Length = 344

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 89/244 (36%), Positives = 135/244 (55%), Gaps = 20/244 (8%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTKFSDLTPSEFRRQF 113
           + ++  + YA   E + R+ VFK N+ R +R   +    T    V +F+DLT  EFR  +
Sbjct: 41  WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 100

Query: 114 LGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
            G      L +  +        + ++ LP   DWR  GAVT +KDQG CGSCW+FSA  A
Sbjct: 101 TGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAA 160

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG   +  G+L+SLSEQ+LVDCD         + D GC GGLM++AF Y +  GG+  E
Sbjct: 161 IEGVAQIKKGKLISLSEQELVDCD---------TNDGGCMGGLMDTAFNYTITIGGLTSE 211

Query: 230 KDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
            +YPY  T+ G+C F+K+K IA ++  F  + +++++     V H P++  +A  +   I
Sbjct: 212 SNYPYKSTN-GTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGD---I 267

Query: 289 SFSF 292
            F F
Sbjct: 268 GFQF 271


>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 469

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 84/229 (36%), Positives = 128/229 (55%), Gaps = 12/229 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           +  +  K  K+Y    E + RF +FK NLR  +    ++ T   G+ +F+DLT  E+R +
Sbjct: 54  YEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVGLNRFADLTNEEYRSR 113

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +LG      R LR    + +       DLP   DWR+ GAV  VKDQG CGSCW+FS   
Sbjct: 114 YLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTIA 173

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           A+EG + ++TG+L+SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG++ 
Sbjct: 174 AVEGINQIATGDLISLSEQELVDCDK--------SYNQGCNGGLMDYAFEFIINNGGIDS 225

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           E+DYPY   D       K+    ++  +  +  ++++     V + P++
Sbjct: 226 EEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVS 274


>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
          Length = 337

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 100/248 (40%), Positives = 135/248 (54%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+ L+KS  SK Y  +EE  +R  V++ NL++ +   L      H    G+  F D+T  
Sbjct: 27  HWDLWKSWHSKKYHEKEE-GWRRMVWEKNLKKIELHNLEHSMGKHPYRLGMNHFGDMTHE 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EFR+   G  +R +     + +  +  N L  P   DWRD G VT VKDQG CGSCW+FS
Sbjct: 86  EFRQIMNGYKQR-KTERKFKGSLFMEPNFLEAPRALDWRDKGYVTPVKDQGQCGSCWAFS 144

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGALEG  F  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+Y+    G
Sbjct: 145 TTGALEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDNQG 197

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E  YPY GTD   C +D +  +A  + F  V S  E  +   +   GP++    +I+
Sbjct: 198 LDSEDSYPYLGTDDQPCHYDPNYNSANDTGFVDVPSGKERALMKAVAAVGPVS---VAID 254

Query: 285 LPHISFSF 292
             H SF F
Sbjct: 255 AGHESFQF 262


>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 479

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 94/250 (37%), Positives = 135/250 (54%), Gaps = 13/250 (5%)

Query: 31  IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
           I ++  SD  Q  D  + A +   L      K Y    E + RF +FK NLR        
Sbjct: 42  IPEIPHSDAHQRPDEEVAALYESWLVH--HGKAYNAIGEKERRFEIFKDNLRFIDEHNRE 99

Query: 91  DPTAVHGVTKFSDLTPSEFRRQFLG--LNRRLRL-PADAQKAPILPTNDLPTDFDWRDHG 147
             T   G+T+F+DLT  E+R +FLG   +R+ RL  A + +      +DLP D DWR  G
Sbjct: 100 SRTYKVGLTRFADLTNEEYRARFLGGRFSRKPRLSAAKSGRYAAALGDDLPDDVDWRKKG 159

Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
           AV  VKDQG CGSCW+FS+  A+EG + + TGEL+ LSEQ+LVDCD         S + G
Sbjct: 160 AVATVKDQGQCGSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDK--------SFNMG 211

Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
           CNGGLM+ AF++I+  GG++ E+DYPY G D       K+     +  +  +  +++   
Sbjct: 212 CNGGLMDYAFQFIIGNGGIDTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSL 271

Query: 268 ANLVKHGPLA 277
              V + P++
Sbjct: 272 KKAVANQPVS 281


>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
 gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
 gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
 gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 92/293 (31%), Positives = 153/293 (52%), Gaps = 21/293 (7%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M   IL++ + +LL       +A   +        P   +Q    +   +  F  +  + 
Sbjct: 1   MTSTILTTTIFILLMLCNTCVIASESE-------CPPTHKQKSSDVEAMKKRFDGWVKRH 53

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
            + Y   +E + RF +++AN++  + +     +      KF+DLT  EF+  ++GL+ RL
Sbjct: 54  GRKYKHNDEREVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQSTYMGLSTRL 113

Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
           R      +       DLP   DWR  GAVT + DQG CG CW+F+A  A+EG + + +G+
Sbjct: 114 RSHNTGFRYD--EHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGK 171

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           L+SLSEQ+L+DCD +       S + GC GGLM +A+ +I++ GG+  E+DYPY G D G
Sbjct: 172 LISLSEQELIDCDVK-------SGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVD-G 223

Query: 241 SCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           +CK +K+   AA++S +  + +D +        H P++    +I+    SF F
Sbjct: 224 TCKMEKAAHYAASISGYEEVPADNEAKLKAAAAHQPVS---VAIDAGGYSFQF 273


>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
 gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
          Length = 341

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 98/252 (38%), Positives = 136/252 (53%), Gaps = 23/252 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           ++ FK +  K YA   E  +R ++F  N    AK  Q      V     + K++D+   E
Sbjct: 29  WNTFKLEHRKNYADSTEETFRMKIFNENKHHIAKHNQRYATGEVSYKLALNKYADMLHHE 88

Query: 109 FRRQFLGLN----RRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSC 161
           FR    G N    ++LR   ++       + +   LPT  DWR  GAVT VKDQG CGSC
Sbjct: 89  FRETMNGFNYTLHKQLRSTDESFTGVTFISPEHVKLPTAVDWRTKGAVTEVKDQGHCGSC 148

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS+TGA+EG HF  +G LVSLSEQ LVDC        +   ++GCNGGLM++AF Y+ 
Sbjct: 149 WAFSSTGAIEGQHFRKSGTLVSLSEQNLVDC-------STKYGNNGCNGGLMDNAFRYVK 201

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
             GG++ EK Y Y G D  SC FDK+ I A    F+ I   +E ++A  +   GP++   
Sbjct: 202 DNGGIDTEKSYAYEGID-DSCHFDKNSIGATDRGFADIPQGNEKKLAQAVATIGPVS--- 257

Query: 281 ASIELPHISFSF 292
            +I+    SF F
Sbjct: 258 VAIDASQQSFQF 269


>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 83/228 (36%), Positives = 128/228 (56%), Gaps = 11/228 (4%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           +  + +   K Y    E + RF +FK NLR       +  +   G+ +F+DLT  E+R  
Sbjct: 47  YEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHNAVAGSYRVGLNRFADLTNEEYRSM 106

Query: 113 FLGLNRRLRLPADAQKA---PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
           FLG N  ++  + + K+        + LP   DWR+ GAV+ VKDQG CGSCW+FS   A
Sbjct: 107 FLGGNMEMKERSASTKSDRYAFRAGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISA 166

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG + + TGEL+SLSEQ+LVDCD         S + GCNGGLM+  F++I+  GG++ E
Sbjct: 167 VEGINQIVTGELISLSEQELVDCDK--------SYNMGCNGGLMDYGFQFIINNGGIDTE 218

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +DYPY   DG   +F K+    +++ +  +  D++      V + P++
Sbjct: 219 EDYPYRAVDGTCDQFRKNARVVSINGYEDVPEDDENSLKKAVANQPVS 266


>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
 gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
 gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
          Length = 498

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 85/192 (44%), Positives = 115/192 (59%), Gaps = 15/192 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLR-----RAKRRQLLDPTAVHGVTKFSDLTPS 107
           F L+K K  K Y   EE + R   FK NL+       KR+  L+     G+ KF+DL+  
Sbjct: 50  FKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKV--GLNKFADLSNE 107

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           EFR  +L   ++     + +K   L T D P+  DWR+ G VT VKDQG CGSCWSFS T
Sbjct: 108 EFREMYLSKVKKPITIEEKRKHRHLQTCDAPSSLDWRNKGVVTAVKDQGDCGSCWSFSTT 167

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           GA+E  + + TG+L+SLSEQ+LVDCD         + + GC GG M+SAF++++  GG++
Sbjct: 168 GAIEAINAIVTGDLISLSEQELVDCDT--------TNNYGCEGGDMDSAFQWVIGNGGID 219

Query: 228 REKDYPYTGTDG 239
            E DYPYTG DG
Sbjct: 220 TEADYPYTGVDG 231


>gi|358339045|dbj|GAA32724.2| cathepsin F, partial [Clonorchis sinensis]
          Length = 271

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 93/221 (42%), Positives = 124/221 (56%), Gaps = 18/221 (8%)

Query: 80  NLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
            L  AKR Q ++  TA +GVT+FSDLT  EF+ ++L    R+R         + P  D+ 
Sbjct: 1   QLAAAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYL----RMRFDGPIVSEDLTPEEDVT 56

Query: 139 TD---FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHE 195
            D   FDWR+HGAV  V DQG CGSCW+FS  G +EG  F  TG+L++LSEQQLVDCDH 
Sbjct: 57  MDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDH- 115

Query: 196 CDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN 255
                    D GCNGG     +  I K GG+E   DYPYTG D G C  ++SK  A V++
Sbjct: 116 --------LDKGCNGGYPPKTYGEIEKMGGLELASDYPYTGVD-GICYMNQSKFVAYVND 166

Query: 256 FSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTV 296
            +V+   E   A  L + GPL+  + ++ L       +F +
Sbjct: 167 STVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 207


>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
          Length = 337

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 100/253 (39%), Positives = 137/253 (54%), Gaps = 21/253 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLT 105
           +  +  FK   +K Y +  E  +R ++F  N    AK  +L     V    G+ K++D+ 
Sbjct: 24  QEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADML 83

Query: 106 PSEFRRQFLGLNRR---LRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGS 160
             EF +   G NR    LR          LP  +  LP   DWRD GAVT VKDQG CGS
Sbjct: 84  HHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGS 143

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CWSFSATG+LEG HF  +G+LVSLSEQ LVDC      E+ G  ++GCNGGLM++AF YI
Sbjct: 144 CWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC-----SEKFG--NNGCNGGLMDNAFRYI 196

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
              GG++ E+ YPY   D   C +  K+K A       + S +ED++ + +   GP++  
Sbjct: 197 KANGGIDTEQAYPYKAED-EKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVS-- 253

Query: 280 VASIELPHISFSF 292
             +I+  H SF  
Sbjct: 254 -VAIDASHQSFQL 265


>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
 gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
          Length = 337

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 98/250 (39%), Positives = 133/250 (53%), Gaps = 24/250 (9%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+  +K   SK Y   EE  +R  +++ NL++ +   L     +H    G+  F D+T  
Sbjct: 28  HWDQWKKWHSKKYHATEE-GWRRVIWEKNLKKIEMHNLEHSMGIHTYRLGMNHFGDMTHE 86

Query: 108 EFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           EFR+   G     +RR R     +   I    ++P   DWR+ G VT VKDQG CGSCW+
Sbjct: 87  EFRQVMNGFKHKKDRRFRGSLFMEPNFI----EVPNKLDWREKGYVTPVKDQGECGSCWA 142

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TGALEG  F  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+Y+   
Sbjct: 143 FSTTGALEGQMFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDQ 195

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
            G++ E+ YPY GTD   C FD    AA  + F  + S  E  +   +   GP++    +
Sbjct: 196 NGLDSEESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSGKERALMKAIAAVGPVS---VA 252

Query: 283 IELPHISFSF 292
           I+  H SF F
Sbjct: 253 IDAGHESFQF 262


>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
           supertexta]
          Length = 347

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 101/242 (41%), Positives = 136/242 (56%), Gaps = 19/242 (7%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           FK +  + Y   EE + RF +FK NL+      K+  L   +   G+ +F+D+   EFR 
Sbjct: 45  FKKQHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFR- 103

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
            + GL R      + Q +  L    L  P + DWR  G VT VK+QG CGSCWSFS TG+
Sbjct: 104 MYNGLRRDYNYSREVQCSNHLTPEYLVAPDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGS 163

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           LEG HF  +G+LVSLSEQQLVDC  +   E       GCNGGLM+ AFEYI+  GG+E E
Sbjct: 164 LEGQHFHKSGKLVSLSEQQLVDCSGKFGNE-------GCNGGLMDQAFEYIITNGGIETE 216

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
           ++YPY       C F KS++AA  S    V S DE  +  ++ + GP++    +I+  H 
Sbjct: 217 EEYPYDARQ-ERCHFKKSEVAATASGCVDVKSGDETDLKNSVAEVGPVS---IAIDASHQ 272

Query: 289 SF 290
           SF
Sbjct: 273 SF 274


>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
          Length = 358

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 104/299 (34%), Positives = 152/299 (50%), Gaps = 22/299 (7%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           R + + + L L++ V  + +  N+   ++  +  +    +   L+ AE  +S FK+   K
Sbjct: 2   RPLEALIRLFLVTHVPLNGIWKNEGFVVLGCLFVTAAAITHQELVGAE--WSAFKALHGK 59

Query: 63  TYATQEEHDYRFRVFKANLRRAKR--RQLLDPTAVH--GVTKFSDLTPSEFRRQFLGLNR 118
            Y ++ E  YR +++  N  +  R   +  +  A +   + +F DL   EF     G  R
Sbjct: 60  EYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDLLHHEFVSTRNGFKR 119

Query: 119 RLRLPADAQKAPILPT----NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
             R         I P       LP   DWR  GAVT VK+QG CGSCW+FS TG+LEG H
Sbjct: 120 NYRSTPREGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQH 179

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
           F  TG +VSLSEQ LVDC  +         ++GC GGLM++AF+YI   GG++ E  YPY
Sbjct: 180 FRKTGRMVSLSEQNLVDCSGKFG-------NNGCEGGLMDNAFKYIKANGGIDTELSYPY 232

Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIELPHISFSF 292
            GTD G C F+KS + A  + F  I    +Q+    V   GP++    +I+  H SF F
Sbjct: 233 NGTD-GICHFEKSDVGATDTGFVDIPEGNEQLLKKAVATVGPVS---VAIDASHESFQF 287


>gi|146215994|gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]
          Length = 358

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 105/282 (37%), Positives = 148/282 (52%), Gaps = 21/282 (7%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLL----NAEH--HFS 54
           M R   S L++L+     AS+ +  DD+  IR VV     + E  +L    ++ H   F+
Sbjct: 1   MARTSFSLLIILIACVAGASSASTFDDENPIRTVVSDALREFETSILSVLGDSRHALSFA 60

Query: 55  LFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
            F  ++ K Y T EE   RF +F  NL+  +       +   GV  F+D T  EFRR  L
Sbjct: 61  RFAHRYGKRYETAEETKLRFAIFSENLKLIRSHNKKGLSYTLGVNHFADWTWEEFRRHRL 120

Query: 115 GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
           G  +     A  +    L    LP   DWR  G V+ VKDQG CGSCW+FS TGALE A+
Sbjct: 121 GAAQNC--SATTKGNHKLTEEALPEMKDWRVSGIVSPVKDQGHCGSCWTFSTTGALEAAY 178

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYP 233
             + G+ +SLSEQQLVDC        +G+ ++ GC+GGL + AFEY+   GG++ E+ YP
Sbjct: 179 KQAFGKGISLSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYVKYNGGLDTEEAYP 230

Query: 234 YTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
           YTG + G CKF    +   V    N ++ + DE + A   V+
Sbjct: 231 YTGKN-GECKFSSENVGVQVLDSVNITLGAEDELKHAVAFVR 271


>gi|312377879|gb|EFR24605.1| hypothetical protein AND_10691 [Anopheles darlingi]
          Length = 375

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 100/259 (38%), Positives = 136/259 (52%), Gaps = 24/259 (9%)

Query: 41  QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
           +SE+HL +    FS FK K  KTYA+  EH++R  VF+ NLR        +      V  
Sbjct: 64  RSEEHLHDE---FSRFKGKHQKTYASDREHEHRLNVFRQNLRFIHSHNRANRGFTVAVNH 120

Query: 101 FSDLTPSEFR--RQFLGLNRRLRLPADAQKAPILPT---NDLPTDFDWRDHGAVTGVKDQ 155
            +D T  E +  R F    R   +    Q  P  P    +DLP  +DWR  GAVT VKDQ
Sbjct: 121 LADRTEDEMKSLRGF----RSSNVYNGGQAFPYKPAAHMDDLPDSWDWRISGAVTPVKDQ 176

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
             CGSCWSF   G +EGA+F  T +LV  S+Q LVDC         G  ++GC+GG    
Sbjct: 177 SVCGSCWSFGTIGHIEGAYFRKTQKLVRFSQQALVDC-------SWGYGNNGCDGGEDFR 229

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHG 274
           A+++I++ GGV  E +Y Y G D G C+ +   + A ++ + +V S D D     L KHG
Sbjct: 230 AYQWIMQVGGVPMEDEYEYLGQD-GYCRVENVTLYAPITGWVNVTSGDPDAFKVALFKHG 288

Query: 275 PLAGNVASIELPHISFSFL 293
           PL+    +I+  H SFSF 
Sbjct: 289 PLS---IAIDAGHKSFSFY 304


>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 96/248 (38%), Positives = 137/248 (55%), Gaps = 20/248 (8%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M   I S  L LL+ S+L  ++++         V  +D  ++E     A   +  +  + 
Sbjct: 1   MATPIKSITLALLIFSMLLISLSLG-------SVTAADTTRNEAE---ARRMYEQWLVEN 50

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFL-GLNR 118
            K Y    E + RF +F  NL+  +    + + T   G+T+F+DLT  EFR  +L     
Sbjct: 51  RKNYNGLGEKETRFEIFTDNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKME 110

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
           R R+P   ++      + LP   DWR  GAV  VKDQG CGSCW+FSA GA+EG + + T
Sbjct: 111 RTRVPVKGERYLYKVGDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKT 170

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GEL+SLSEQ+LVDCD         S + GC GGLM+ AF++I++ GG++ E+DYPYT TD
Sbjct: 171 GELISLSEQELVDCDT--------SYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATD 222

Query: 239 GGSCKFDK 246
              C  DK
Sbjct: 223 DNICNSDK 230


>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
          Length = 362

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 98/285 (34%), Positives = 149/285 (52%), Gaps = 22/285 (7%)

Query: 2   ERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD--GEQSEDHLLNAEHHFSLFKSK 59
           + + +++L+LLL    +A  +      A+  +V PS   G  +          +  + ++
Sbjct: 9   KHITMTTLMLLLCVIAIADCIC---QAAVAARVEPSTTVGRTTGGDEAMMMARYKKWMAQ 65

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA-VHGVTKFSDLTPSEFRRQFLGLNR 118
           + + Y    E  +RF+VFKAN     R         V G  +F+DLT  EF   + GL +
Sbjct: 66  YRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAAMYTGLRK 125

Query: 119 RLRLPADAQKAPI------LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
              +P+ A++ P           D     DWR  GAVT VK+QG CG CW+FSA GA+EG
Sbjct: 126 PAAVPSGAKQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEG 185

Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
              ++TG LVSLSEQQ++DCD     E  G  + GCNGG M++AF+Y++  GGV  E  Y
Sbjct: 186 LIMITTGNLVSLSEQQILDCD-----ESDG--NQGCNGGYMDNAFQYVVNNGGVTTEDAY 238

Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           PY+    G+C+    + AA +S F  + S ++   AN V + P++
Sbjct: 239 PYSAVQ-GTCQ--NVQPAATISGFQDLPSGDENALANAVANQPVS 280


>gi|344271616|ref|XP_003407633.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
          Length = 334

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 92/245 (37%), Positives = 128/245 (52%), Gaps = 17/245 (6%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPS 107
            ++ ++S + K YA  EE D+R  V++ N++  +R         HG T     F D+T  
Sbjct: 28  QWNQWRSTYKKPYAVNEE-DWRRAVWEKNVKMIERHNQEYSQGKHGFTMAMNAFGDMTNE 86

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           EFR+   G   +          P+     +PT  DW   G VT VK+QG CGSCW+FSAT
Sbjct: 87  EFRQVMNGFQNQKHKKGKLFYEPVF--GHIPTSVDWTQKGYVTPVKNQGQCGSCWAFSAT 144

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           GALEG  F  TG+LVSLSEQ LVDC            + GCNGGLM++AF+Y+   GG++
Sbjct: 145 GALEGQMFRKTGKLVSLSEQNLVDCSRR-------EGNEGCNGGLMDNAFQYVQDNGGLD 197

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPH 287
            E+ YPY  TD  +C +     AA  + F  I   E  +   +   GP++    +I+  H
Sbjct: 198 SEESYPYLATDTHTCNYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAIDAGH 254

Query: 288 ISFSF 292
            SF F
Sbjct: 255 ESFQF 259


>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 99/292 (33%), Positives = 156/292 (53%), Gaps = 26/292 (8%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
            S  L+L  S  L +++A   D +++     S+  +S D L+     F  + SK  K Y 
Sbjct: 5   FSKALVLACSFCLFASLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSKHGKIYQ 59

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLR 121
           + EE   RF +FK NL+    R  +      G+ +F+DL+  EF+ ++LGL    +RR  
Sbjct: 60  SIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRRE 119

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
            P +     +    +LP   DWR  GAV  VK+QG+CGSCW+FS   A+EG + + TG L
Sbjct: 120 SPEEFTYKDV----ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 175

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
            SLSEQ+L+DCD         + ++GCNGGLM+ AF +I++ GG+ +E+DYPY   + G+
Sbjct: 176 TSLSEQELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGT 226

Query: 242 CKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           C+  K +     +S +  +  + +Q     + + PL+    +IE     F F
Sbjct: 227 CEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLS---VAIEASGRDFQF 275


>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 104/297 (35%), Positives = 151/297 (50%), Gaps = 50/297 (16%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           +LLL+L +V++ A A          V+P + E            + ++K +  K Y T+ 
Sbjct: 1   MLLLILGAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
           E   R  +F+ N  +     +     +H  T    KF D+   EF ++ +G   ++    
Sbjct: 40  EEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96

Query: 125 DAQKAPILPT----ND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
              K P+L +    ND    LP   DWR+   V+ VKDQG CGSCW+FS TG+LEG H  
Sbjct: 97  ---KKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSN 153

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            TG+LV LSEQQLVDC  +         + GC GGLM+ AF+YI   GG++ E+ YPYT 
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTA 206

Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           TD   CKFD S + A +  +  V SS+E  +   +   GP++    +I+  H SF F
Sbjct: 207 TDDKPCKFDNSSVGATLIGYKDVKSSNEHALKRAVATVGPVS---VAIDAGHESFQF 260


>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 97/248 (39%), Positives = 133/248 (53%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+  +KS   K+Y  +EE  +R  V++ +LR  +   L      H    G+  F D+   
Sbjct: 28  HWEQWKSWHGKSYEQKEE-TWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNE 86

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EFR+   G   + +     Q +  L  N  ++P   DWRD G VT VKDQG CGSCW+FS
Sbjct: 87  EFRQLMNGYKYK-QTHKKLQGSHFLEPNFQEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGALEG HF  TG+LVSLSEQ LV+C     PE     + GCNGGLM+ AF+Y+   GG
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECS---KPE----GNEGCNGGLMDQAFQYVKDNGG 198

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E  YPY GTD   C ++    AA  + F  + S  E  +   +   GP++    +I+
Sbjct: 199 IDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVS---VAID 255

Query: 285 LPHISFSF 292
             H SF F
Sbjct: 256 AGHTSFQF 263


>gi|15593249|gb|AAL02221.1|AF410881_1 cysteine protease CP10 precursor [Frankliniella occidentalis]
          Length = 334

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 98/261 (37%), Positives = 135/261 (51%), Gaps = 23/261 (8%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPT 93
           +PSD        +  + H+  FK+  +KTYA   E  YR +VFK N +R AK   L    
Sbjct: 18  IPSD--------MEIQAHWESFKATHAKTYANTVEEAYRAKVFKENAIRIAKHNDLFASG 69

Query: 94  AVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
            V    G ++++D+   E   +  G    L+  +         +       DWR  GAVT
Sbjct: 70  EVTFKVGYSQYADMHTHEVTEKLNGYRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAVT 129

Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
            +KDQG CGSCWSFSATG+LEG  FL    LVSLSEQ LVDC  +   E       GCNG
Sbjct: 130 PIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNG 182

Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAAN 269
           GLM+SAFEY+   GG++ E+ YPYT  DG SC +  +  A   + +  V +  E  +   
Sbjct: 183 GLMDSAFEYVESNGGIDTEESYPYTAVDGDSCLYKAANNAGVNTGYKDVQAKSESALRDA 242

Query: 270 LVKHGPLAGNVASIELPHISF 290
           + K GP++    +I+  + SF
Sbjct: 243 VEKAGPVS---VAIDASNWSF 260


>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
          Length = 337

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 100/254 (39%), Positives = 137/254 (53%), Gaps = 21/254 (8%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDL 104
            +  +  FK   +K Y +  E  +R ++F  N    AK  +L     V    G+ K++D+
Sbjct: 23  VQEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADM 82

Query: 105 TPSEFRRQFLGLNRR---LRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACG 159
              EF +   G NR    LR          LP  +  LP   DWRD GAVT VKDQG CG
Sbjct: 83  LHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCG 142

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCWSFSATG+LEG HF  +G+LVSLSEQ LVDC      E+ G  ++GCNGGLM++AF Y
Sbjct: 143 SCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC-----SEKFG--NNGCNGGLMDNAFRY 195

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAG 278
           I   GG++ E+ YPY   D   C +  K+K A       + S +ED++ + +   GP++ 
Sbjct: 196 IKANGGIDTEQAYPYKAED-EKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVS- 253

Query: 279 NVASIELPHISFSF 292
              +I+  H SF  
Sbjct: 254 --VAIDASHQSFQL 265


>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
          Length = 465

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 84/221 (38%), Positives = 123/221 (55%), Gaps = 10/221 (4%)

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
           K  K Y    E + RF +FK NL    +    + T   G+ +F+DLT  EFR  +LG   
Sbjct: 57  KHGKNYNALGEKEKRFEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRT 116

Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
             + RLP  + +      + LP   DWR  GAV  VKDQG CGSCW+FS   A+EG + +
Sbjct: 117 GHKKRLPKTSDRYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKI 176

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            TG+L++LSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG++ E DYPY G
Sbjct: 177 VTGDLIALSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLG 228

Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            DG    + K+    ++ ++  +  +++      V + P++
Sbjct: 229 RDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQPVS 269


>gi|261328618|emb|CBH11596.1| cysteine peptidase precursor, (fragment) [Trypanosoma brucei
           gambiense DAL972]
          Length = 404

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 91/229 (39%), Positives = 127/229 (55%), Gaps = 26/229 (11%)

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ------- 112
           + K Y   +E  +RFR F+ N+ +AK +   +P A  GVT FSD+T  EFR +       
Sbjct: 2   YGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASY 61

Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
           F    +RLR      K   + T   P   DWR+ GAVT +KDQG CGSCW+F + G +EG
Sbjct: 62  FAAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPMKDQGQCGSCWAFYSIGNIEG 115

Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREK 230
              ++   LVSLSEQ LV CD         + D GC GGLM++AF +I+ +  G V  E 
Sbjct: 116 QWQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEA 166

Query: 231 DYPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            YPY   +G    C+ +  +I AA+++   +  DED +AA L ++GPLA
Sbjct: 167 SYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLA 215


>gi|328869030|gb|EGG17408.1| cysteine protease [Dictyostelium fasciculatum]
          Length = 379

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 97/247 (39%), Positives = 143/247 (57%), Gaps = 29/247 (11%)

Query: 59  KFSKTYATQEEHDY--RFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG- 115
           +F K+Y   E  D+  RF VFK N+             V  + +F+D+T  E+RR +LG 
Sbjct: 45  RFEKSY---ESFDFLQRFAVFKTNMDYVHEWNSKKLPTVLELNQFADITNQEYRRLYLGT 101

Query: 116 -LNRR--LRLPADAQKA----PILPTNDLPTD---FDWRDHGAVTGVKDQGACGSCWSFS 165
            +N R  L  P   + +     +   +D  +     DWR  GAV+ +K+QG CGSCWSFS
Sbjct: 102 RINARHLLGTPGTHEMSNNFGKVFGDDDSDSSGATVDWRAKGAVSPIKNQGQCGSCWSFS 161

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAG 224
            TG++EGAH++STG++V LSEQ LVDC        SGS  + GC GGLMN AF+YI+K  
Sbjct: 162 TTGSVEGAHYISTGKMVPLSEQNLVDC--------SGSEGNMGCQGGLMNLAFDYIIKNE 213

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASI 283
           G++ E  YPY+   G  C F+K+ + A +S++  I+S ++   A+ VK+ GP++    +I
Sbjct: 214 GIDTEDSYPYSAETGKKCLFNKTNVGATISSYKNITSGDESNLADAVKNAGPVS---VAI 270

Query: 284 ELPHISF 290
           +  H SF
Sbjct: 271 DASHNSF 277


>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
          Length = 384

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 109/301 (36%), Positives = 156/301 (51%), Gaps = 34/301 (11%)

Query: 11  LLLLSSVLA--SAVAVNDDDAMIRQVVPSDGEQSEDHLLNA---------EHHFSLFKSK 59
           +L + SVLA  S   V +++     +  +     + H+L A         E  +  FK  
Sbjct: 26  VLWIVSVLAVVSGANVQNENVQWFDLESAQKHPEQLHILKAQTGINYQPYEQAWKEFKIL 85

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLTPSEFRRQFLG 115
             K+Y   EE   RF +F+ N+ R ++   L      +   GV +F+DL  +EF   F G
Sbjct: 86  HDKSYEDHEEESRRFEIFRENVLRIEKHNKLFHLGKKSYYLGVNQFTDLEYAEFV-NFNG 144

Query: 116 LNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
           L  ++    + + +  L  N++  P   DWR  G VT VK+QGACGSCW+FSATG+LEG 
Sbjct: 145 L--KMTNLNNTKCSSHLSANNIVVPDSVDWRSKGYVTKVKNQGACGSCWAFSATGSLEGQ 202

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDY 232
           +F   G+LV LSE QLVDC        SGS  + GCNGG M +AF+Y+   GG+E E DY
Sbjct: 203 YFRKNGKLVPLSESQLVDC--------SGSFGNEGCNGGFMENAFKYVKSVGGIESESDY 254

Query: 233 PYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIELPHISFS 291
           PY      +C FDK+K+ A VS    V S  E  +   + + GP++    +I+  H SF 
Sbjct: 255 PYKARQ-RTCAFDKTKVIATVSGCVDVESGSESSLKEVVSEVGPVS---VAIDAGHSSFQ 310

Query: 292 F 292
            
Sbjct: 311 L 311


>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
          Length = 330

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 87/241 (36%), Positives = 131/241 (54%), Gaps = 17/241 (7%)

Query: 60  FSKTYATQEEHDYRFR------VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQF 113
           F+K      + +YRF       +++ N+ R +     + +    + +F DLT +EF R F
Sbjct: 30  FAKWMRENTKSNYRFVYSNEEFIYRWNVWRDEEHNRQNKSYFLAMNQFGDLTNAEFNRLF 89

Query: 114 LGLNRRLRLPADAQKA-PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
            GL       A    A P  P   +P++FDWR  GAVT VK+QG CGSCWSFS TG+ EG
Sbjct: 90  KGLAFDYSKHAKIHTAAPEAPATGIPSEFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEG 149

Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
           A+FL TG LVSLSEQ L+DC            ++GCNGGLM+ AFEYI+   G++ E  Y
Sbjct: 150 ANFLKTGRLVSLSEQNLIDCSVSYG-------NNGCNGGLMDYAFEYIINNRGIDTEASY 202

Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           PY      +C+++ +    +++ ++ ++S ++    N     P++    +I+  H SF F
Sbjct: 203 PYQTAGPLTCQYNAANKGGSLTGYTDVTSGDENALLNAAVKEPVS---VAIDASHNSFQF 259

Query: 293 L 293
            
Sbjct: 260 Y 260


>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
 gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
 gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 97/248 (39%), Positives = 133/248 (53%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+  +KS   K+Y  +EE  +R  V++ +LR  +   L      H    G+  F D+   
Sbjct: 28  HWEQWKSWHGKSYEQKEE-TWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNE 86

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EFR+   G   + +     Q +  L  N  ++P   DWRD G VT VKDQG CGSCW+FS
Sbjct: 87  EFRQLMNGYKYK-QTHKKLQGSHFLEPNFLEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGALEG HF  TG+LVSLSEQ LV+C     PE     + GCNGGLM+ AF+Y+   GG
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECS---KPE----GNEGCNGGLMDQAFQYVKDNGG 198

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E  YPY GTD   C ++    AA  + F  + S  E  +   +   GP++    +I+
Sbjct: 199 IDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVS---VAID 255

Query: 285 LPHISFSF 292
             H SF F
Sbjct: 256 AGHTSFQF 263


>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 461

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 89/202 (44%), Positives = 120/202 (59%), Gaps = 12/202 (5%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ +  K  K Y   E+  +RF V+K NL   +  +  + T   G+TKF+DLT  EFRR
Sbjct: 53  QFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHSET-NRTYSLGLTKFADLTNEEFRR 111

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
            + G        A  +       ++ P   DWR +GAVT VKDQG+CGSCW+FSA G++E
Sbjct: 112 MYTGTRIDRSRRAKRRTGFRYADSEAPESVDWRKNGAVTSVKDQGSCGSCWAFSAVGSVE 171

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           G + +  GE VSLSEQ+LVDCD E         + GCNGGLM+ AF++I++ GG++ EKD
Sbjct: 172 GINAIRNGEAVSLSEQELVDCDLE--------YNQGCNGGLMDYAFDFIIQNGGIDTEKD 223

Query: 232 YPYTGTDGGSCKFDKSKIAAAV 253
           YPY G DG   + D SK  A V
Sbjct: 224 YPYKGFDG---RCDNSKKNAHV 242


>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
 gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
          Length = 376

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 91/255 (35%), Positives = 138/255 (54%), Gaps = 22/255 (8%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLL 90
           +PSDG+   D  + +   +  + ++  KT         + D RF +FK NLR        
Sbjct: 33  LPSDGKWRTDEEVRS--IYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEN 90

Query: 91  DPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFD 142
           +  A +  G+TKF+DLT  E+R+ +LG      RR+    +  +      N  ++P   D
Sbjct: 91  NKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVD 150

Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
           WR  GAV  +KDQG CGSCW+FS T A+EG + + TGEL+SLSEQ+LVDCD         
Sbjct: 151 WRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK-------- 202

Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
           S + GCNGGLM+ AF++I+K GG+  EKDYPY G  G    F K+    ++  +  + + 
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTK 262

Query: 263 EDQMAANLVKHGPLA 277
           ++      + + P++
Sbjct: 263 DETALKKAISYQPVS 277


>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 97/248 (39%), Positives = 133/248 (53%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+  +KS   K+Y  +EE  +R  V++ +LR  +   L      H    G+  F D+   
Sbjct: 28  HWEQWKSWHGKSYEQKEE-TWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNE 86

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EFR+   G   + +     Q +  L  N  ++P   DWRD G VT VKDQG CGSCW+FS
Sbjct: 87  EFRQLMNGYKYK-QTHKKLQGSHFLEPNFLEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGALEG HF  TG+LVSLSEQ LV+C     PE     + GCNGGLM+ AF+Y+   GG
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECS---KPE----GNEGCNGGLMDQAFQYVKDNGG 198

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E  YPY GTD   C ++    AA  + F  + S  E  +   +   GP++    +I+
Sbjct: 199 IDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVS---VAID 255

Query: 285 LPHISFSF 292
             H SF F
Sbjct: 256 AGHTSFQF 263


>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 97/248 (39%), Positives = 133/248 (53%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+  +KS   K+Y  +EE  +R  V++ +LR  +   L      H    G+  F D+   
Sbjct: 28  HWEQWKSWHGKSYEQKEE-TWRRMVWEEHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNE 86

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EFR+   G   + +     Q +  L  N  ++P   DWRD G VT VKDQG CGSCW+FS
Sbjct: 87  EFRQLMNGYKYK-QTHKKLQGSHFLEPNFLEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGALEG HF  TG+LVSLSEQ LV+C     PE     + GCNGGLM+ AF+Y+   GG
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECS---KPE----GNEGCNGGLMDQAFQYVKDNGG 198

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E  YPY GTD   C ++    AA  + F  + S  E  +   +   GP++    +I+
Sbjct: 199 IDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVS---VAID 255

Query: 285 LPHISFSF 292
             H SF F
Sbjct: 256 AGHTSFQF 263


>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 100/248 (40%), Positives = 127/248 (51%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
            +  FK+   KTY +  E   RF++F  N L  AK         V    G+ +F DL   
Sbjct: 26  QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF R F G +   R    +   P    ND  LP   DWR  GAVT VKDQG CGSCW+FS
Sbjct: 86  EFARIFNGYHGS-RKSGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TG+LEG HFL  GELVSLSEQ LVDC            ++GC GGLM  AF+YI    G
Sbjct: 145 TTGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLAGNVASIE 284
           ++ EK YPY   D G C+F K  + A  + +  I +  ED +   +   GP++    +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGCEDDLKKAVATVGPIS---VAID 253

Query: 285 LPHISFSF 292
             H SF  
Sbjct: 254 ASHSSFQL 261


>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
          Length = 501

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 79/193 (40%), Positives = 114/193 (59%), Gaps = 15/193 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F L+K +  + Y   EE   RF +FK NL+    R         G+ KF+D++  EF+ +
Sbjct: 46  FHLWKERHKRVYKHAEETAKRFEIFKENLKYVIERNSKGHRHTLGMNKFADMSNEEFKEK 105

Query: 113 FLGLNRRLR------LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           +L   ++        L    Q+     + + P+  DWR  G VTG+KDQG CGSCW+FS+
Sbjct: 106 YLSKIKKPINKKNNYLRRSMQQKKGTASCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSS 165

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TGA+EG + + TG+L+SLSEQ+LVDCD         + + GC GG M+ AFE+++  GG+
Sbjct: 166 TGAMEGINAIVTGDLISLSEQELVDCD---------TTNYGCEGGYMDYAFEWVISNGGI 216

Query: 227 EREKDYPYTGTDG 239
           + E DYPYTGTDG
Sbjct: 217 DSESDYPYTGTDG 229


>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 333

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 89/244 (36%), Positives = 135/244 (55%), Gaps = 20/244 (8%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTKFSDLTPSEFRRQF 113
           + ++  + YA   E + R+ VFK N+ R +R   +    T    V +F+DLT  EFR  +
Sbjct: 35  WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 94

Query: 114 LGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
            G      L +  +        + ++ LP   DWR  GAVT +KDQG CGSCW+FSA  A
Sbjct: 95  TGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAA 154

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG   +  G+L+SLSEQ+LVDCD         + D GC GGLM++AF Y +  GG+  E
Sbjct: 155 IEGVAQIKKGKLISLSEQELVDCD---------TNDGGCMGGLMDTAFNYTITIGGLTSE 205

Query: 230 KDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
            +YPY  T+ G+C F+K+K IA ++  F  + +++++     V H P++  +A  +   I
Sbjct: 206 SNYPYKSTN-GTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGD---I 261

Query: 289 SFSF 292
            F F
Sbjct: 262 GFQF 265


>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
          Length = 312

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 99/249 (39%), Positives = 134/249 (53%), Gaps = 21/249 (8%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
            +  FK+   K+Y ++ E   R+++F  N L  AK         V    G+ +F DL P 
Sbjct: 6   QWEAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPH 65

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF + F G +   R    +   P    ND  LP   DWR  GAVT VKDQG CGSCW+FS
Sbjct: 66  EFAKMFNGYHGE-RKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFS 124

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAG 224
           ATG+LEG HFL +G+LVSLSEQ L+DC        SGS  + GC GGLM++AF+YI    
Sbjct: 125 ATGSLEGQHFLKSGKLVSLSEQNLIDC--------SGSFGNEGCGGGLMDNAFKYIKAND 176

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASI 283
           G++ E+ YPY   D G C+F K  + A  + F  +    ED +   +   GP++    +I
Sbjct: 177 GIDTEESYPYEAMD-GDCRFKKEDVGATDTGFVDIQQGSEDDLQKAVATVGPIS---VAI 232

Query: 284 ELPHISFSF 292
           +  H SF  
Sbjct: 233 DASHSSFQL 241


>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 99/248 (39%), Positives = 130/248 (52%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVF-KANLRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
            +  FK+   KTY +  E   RF++F +++L  A+         V    G+ +F DL   
Sbjct: 26  QWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIARHNAKYAKGLVSYKLGMNQFGDLLAH 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF R F G +   R    +   P    ND  LP   DWR  GAVT VKDQG CGSCW+FS
Sbjct: 86  EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFS 144

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATG+LEG HFL  GELVSLSEQ LVDC            ++GC GGLM  AF+YI    G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
           ++ EK YPY   D G C+F K  + A  + +  I +  ED +   +   GP++    +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPIS---VAID 253

Query: 285 LPHISFSF 292
             H SF  
Sbjct: 254 ASHSSFQL 261


>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
           Precursor
 gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
 gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  160 bits (406), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 91/255 (35%), Positives = 138/255 (54%), Gaps = 22/255 (8%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLL 90
           +PSDG+   D  + +   +  + ++  KT         + D RF +FK NLR        
Sbjct: 33  LPSDGKWRTDEEVRS--IYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNED 90

Query: 91  DPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFD 142
           +  A +  G+TKF+DLT  E+R+ +LG      RR+    +  +      N  ++P   D
Sbjct: 91  NKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVD 150

Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
           WR  GAV  +KDQG CGSCW+FS T A+EG + + TGEL+SLSEQ+LVDCD         
Sbjct: 151 WRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK-------- 202

Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
           S + GCNGGLM+ AF++I+K GG+  EKDYPY G  G    F K+    ++  +  + + 
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTK 262

Query: 263 EDQMAANLVKHGPLA 277
           ++      + + P++
Sbjct: 263 DETALKKAISYQPVS 277


>gi|301769893|ref|XP_002920368.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
          Length = 503

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 91/251 (36%), Positives = 139/251 (55%), Gaps = 18/251 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSD 103
           N +  ++ +K+   K Y  ++E  +R  V++ N++   +         H     +  F D
Sbjct: 24  NLDARWTRWKAANGKLY-NKDEEVWRRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGD 82

Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           LT  EF++   GL  +++ P +     +LP  + P+  DWR+ G VT VKDQG CGSCW+
Sbjct: 83  LTNEEFKQVMNGL--KIQNPREGNMFQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWA 140

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSATGALEG  F  TG+LVSLSEQ LVDC            ++GCNGGLM++AF Y+   
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AEGNAGCNGGLMDNAFRYVKDN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG++ E+ YPY   D G CK+   + AA  + F+ I  DE+ +  ++   GP++    +I
Sbjct: 194 GGLDSEESYPYLAQD-GRCKYKPEQSAANDTGFADIHQDEESLMLSVATVGPIS---VAI 249

Query: 284 ELPHISFSFLF 294
           +    +F F +
Sbjct: 250 DASLDTFRFYY 260


>gi|281346354|gb|EFB21938.1| hypothetical protein PANDA_009085 [Ailuropoda melanoleuca]
          Length = 333

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 91/251 (36%), Positives = 139/251 (55%), Gaps = 18/251 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSD 103
           N +  ++ +K+   K Y  ++E  +R  V++ N++   +         H     +  F D
Sbjct: 24  NLDARWTRWKAANGKLY-NKDEEVWRRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGD 82

Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           LT  EF++   GL  +++ P +     +LP  + P+  DWR+ G VT VKDQG CGSCW+
Sbjct: 83  LTNEEFKQVMNGL--KIQNPREGNMFQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWA 140

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSATGALEG  F  TG+LVSLSEQ LVDC            ++GCNGGLM++AF Y+   
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AEGNAGCNGGLMDNAFRYVKDN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG++ E+ YPY   D G CK+   + AA  + F+ I  DE+ +  ++   GP++    +I
Sbjct: 194 GGLDSEESYPYLAQD-GRCKYKPEQSAANDTGFADIHQDEESLMLSVATVGPIS---VAI 249

Query: 284 ELPHISFSFLF 294
           +    +F F +
Sbjct: 250 DASLDTFRFYY 260


>gi|50657027|emb|CAH04631.1| cathepsin H [Suberites domuncula]
          Length = 335

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 102/273 (37%), Positives = 142/273 (52%), Gaps = 23/273 (8%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           +S++ L L   VL    +     A I  V+P  G          E +F  ++ K  K Y+
Sbjct: 1   MSAMKLFLGLCVLVHVCS-----AFIPLVLPIPGLY--------EDYFKEWQEKHGKVYS 47

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           T+EE   R +VF  N+           +    V +++D+T  EF+ Q+L   +       
Sbjct: 48  TEEESQSRLKVFMKNVIYIDNHNKQGHSYELEVNEYADMTLDEFKDQYLMEPQHCSATHS 107

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
            +  P     D P   DWR  GAVT VK+QG CGSCW+FS TG LE  HFL TG+LVSLS
Sbjct: 108 LKSDPP-KYRDPPKAIDWRSKGAVTPVKNQGQCGSCWTFSTTGCLESHHFLKTGQLVSLS 166

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQQLVDC    +       ++GCNGGL + AFEYI   GG++ E+ YPY   D   C F 
Sbjct: 167 EQQLVDCAQAFN-------NNGCNGGLPSQAFEYIHYNGGLDSEESYPYRAHD-EKCHFV 218

Query: 246 KSKIAAAVSN-FSVISSDEDQMAANLVKHGPLA 277
            S+++A VSN  ++ S DE Q+   +   GP++
Sbjct: 219 PSEVSATVSNVVNITSKDEMQLYNAVGTVGPVS 251


>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 91/254 (35%), Positives = 137/254 (53%), Gaps = 22/254 (8%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE----EHDYRFRVFKANLRRAKRRQLL 90
           +PSDG+   D  + +   +  + ++  KT         + D RF +FK NLR        
Sbjct: 33  LPSDGKWRTDEEVRS--IYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEN 90

Query: 91  DPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFD 142
           +  A +  G+TKF+DLT  E+R+ +LG      RR+    +  +      N  ++P   D
Sbjct: 91  NKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVD 150

Query: 143 WRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESG 202
           WR  GAV  +KDQG CGSCW+FS T A+EG + + TGEL+SLSEQ+LVDCD         
Sbjct: 151 WRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDK-------- 202

Query: 203 SCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD 262
           S + GCNGGLM+ AF++I+K GG+  EKDYPY G  G    F K+    ++  +  + + 
Sbjct: 203 SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTK 262

Query: 263 EDQMAANLVKHGPL 276
           ++      + + P+
Sbjct: 263 DETALKKAISYQPV 276


>gi|443696723|gb|ELT97360.1| hypothetical protein CAPTEDRAFT_147978 [Capitella teleta]
          Length = 274

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 94/223 (42%), Positives = 130/223 (58%), Gaps = 19/223 (8%)

Query: 82  RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF 141
           RR + ++  D  A +G + F+DLT  EFR+ +L     +      + A I P    P  F
Sbjct: 5   RRIQEKEQGD--ATYGASPFADLTAEEFRKNYLSPVWNVTHDPFLKPASI-PIETPPDAF 61

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWRDH AVT VK+QG+CGSCW+FS TG +EG   +   +L+SLSEQ+LVDCD        
Sbjct: 62  DWRDHDAVTPVKNQGSCGSCWAFSVTGNVEGQWAIQKKKLLSLSEQELVDCDK------- 114

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
              D GCNGGL   A++ I++ GG+E EKDYPY G  G  C F+K+++   ++    ISS
Sbjct: 115 --VDLGCNGGLPLQAYKEIMRIGGLETEKDYPYEGK-GDKCVFEKAEVEVNITGAVNISS 171

Query: 262 DEDQMAANLVKHGPLA----GNVASIELPHIS--FSFLFTVSS 298
           +ED M A L K+GP++     N     +  +S  FSFL + SS
Sbjct: 172 NEDDMKAWLWKNGPISIGLNANAMQFYMGGVSHPFSFLCSPSS 214


>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
          Length = 337

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 101/294 (34%), Positives = 152/294 (51%), Gaps = 37/294 (12%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           + S++ LL  +VLA    V                 S + +L+AE  + +FK   +K Y 
Sbjct: 1   MKSVVALLFLAVLAMGQTV-----------------SFNKILDAE--WFIFKLHHNKVYK 41

Query: 66  TQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
           +  E  YR +++  N R+     ++ +L + T   G+ K+ D+   EF     G N+ + 
Sbjct: 42  SPVEEGYRMKIYMDNKRKIAEHNRKYELNEVTYKLGMNKYGDMLHHEFVNTLNGFNKSVT 101

Query: 122 LPADAQKAPIL-PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
              + +    + P N  LP + DW   GAVT VKDQG CGSCW+FS+TGALEG HF STG
Sbjct: 102 AGIETEGVTFISPANVKLPDEVDWTKQGAVTAVKDQGHCGSCWAFSSTGALEGQHFRSTG 161

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
            LVSLSEQ L+DC  +         ++GCNGGLM+ AF+YI    G++ EK YPY   + 
Sbjct: 162 YLVSLSEQNLIDCSGKYG-------NNGCNGGLMDYAFQYIKDNKGLDTEKTYPYE-AEN 213

Query: 240 GSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
             C+++     A    +  I   DE+++ A +   GP++    +I+  H SF  
Sbjct: 214 DRCRYNPRNSGATDKGYVDIPQGDEEKLKAAVATIGPIS---VAIDASHESFQL 264


>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
          Length = 337

 Score =  160 bits (405), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 97/250 (38%), Positives = 134/250 (53%), Gaps = 20/250 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           + H+ L+KS  SK Y  ++E  +R  V++ NL++ +   L      H    G+  F D+T
Sbjct: 26  DEHWDLWKSWHSKNYQHEKEEGWRRMVWEKNLKKIEMHNLEHSLGKHSYSLGMNHFGDMT 85

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
             EFR+   G   + R     + +  L  N++  P   DWR+ G VT VKDQG CGSCW+
Sbjct: 86  NEEFRQVMNGYKLQQR---KFKGSLFLEPNNMEAPKQVDWREEGYVTPVKDQGQCGSCWA 142

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TGA+EG  F  T +LVSLSEQ LVDC     PE     + GCNGGLM+ AF+YI   
Sbjct: 143 FSTTGAMEGQMFRKTQKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 195

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
            G++ E+ YPY GTD   C +     AA  + F  I S  E  +   +   GP++    +
Sbjct: 196 SGLDSEEAYPYLGTDDQPCNYKAEFSAANDTGFMDIPSGKEHALMKAIASVGPVS---VA 252

Query: 283 IELPHISFSF 292
           I+  H SF F
Sbjct: 253 IDAGHESFQF 262


>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
          Length = 474

 Score =  160 bits (405), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 95/250 (38%), Positives = 138/250 (55%), Gaps = 23/250 (9%)

Query: 43  EDHLLNA------EHHFSLFKS---KFSKTYATQEEHDYRFRVFKANLRRAKRRQ-LLDP 92
           E H LN+      +   SL++S   K  K Y    E + RF +FK N+    R   + + 
Sbjct: 41  ETHGLNSPPLRTHDQLLSLYESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQ 100

Query: 93  TAVHGVTKFSDLTPSEFRRQFLG--LNRRLRLPADAQKAPILPTND---LPTDFDWRDHG 147
           +   G+ KF+DLT  E+R  +L   + +R R   D  ++      D   LP   DWRD G
Sbjct: 101 SYKLGLNKFADLTNDEYRSLYLSGKMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRG 160

Query: 148 AVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSG 207
           AV  VKDQG CGSCW+FS  GA+EG + + TGEL+SLSEQ+LVDCD+          + G
Sbjct: 161 AVAPVKDQGQCGSCWAFSTVGAVEGINKIVTGELISLSEQELVDCDN--------GYNQG 212

Query: 208 CNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
           CNGGLM+ AFE+I+K GG++ E DYPY G DG   +  K+     ++ +  +  ++++  
Sbjct: 213 CNGGLMDYAFEFIVKNGGIDTEDDYPYKGVDGLCDQNRKNAKVVTINGYEDVPHNDEKSL 272

Query: 268 ANLVKHGPLA 277
              V H P++
Sbjct: 273 KKAVAHQPVS 282


>gi|328866326|gb|EGG14711.1| hypothetical protein DFA_10969 [Dictyostelium fasciculatum]
          Length = 369

 Score =  160 bits (405), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 100/262 (38%), Positives = 146/262 (55%), Gaps = 28/262 (10%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDY--RFRVFKANLRRAKRRQLLDPTAVHGV--TKF 101
           L + E + + FK    +     E H++  RF +FK N+   K     D +  H +     
Sbjct: 33  LFSHEQYTTEFKGWVGQFEKNYESHEFLNRFDIFKKNMDYIKTWN--DKSVDHKLELNTL 90

Query: 102 SDLTPSEFRRQFLG--LNRRLRLP---ADAQ-----KAPILPTNDLPTDFDWRDHGAVTG 151
           +DLT  E++R +LG  +N  LR+    AD +     K+      D P + DWR  GAV+ 
Sbjct: 91  ADLTDKEYQRLYLGTKVNGALRVGLNHADERDFGHIKSVFSNVKDNP-NVDWRKQGAVSH 149

Query: 152 VKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGG 211
           VK+QG CGSCWSFS+TGA+EGAH + TGE++SLSEQQLVDC            ++GCNGG
Sbjct: 150 VKNQGQCGSCWSFSSTGAIEGAHAIKTGEMISLSEQQLVDCSKRYG-------NNGCNGG 202

Query: 212 LMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANL 270
           LM  AF+Y++ AGG+E E+ YPYT TD  +C F+ +    ++S+   I + +E  +   L
Sbjct: 203 LMTLAFDYVIDAGGLESEEAYPYTTTDTSACMFNSTNAVTSISDHQNIRAGNEKHLETVL 262

Query: 271 VKHGPLAGNVASIELPHISFSF 292
              GP++    +I+    SF F
Sbjct: 263 RNVGPVS---VAIDASPRSFRF 281


>gi|71663165|ref|XP_818579.1| cruzipain precursor [Trypanosoma cruzi strain CL Brener]
 gi|70883838|gb|EAN96728.1| cruzipain precursor, putative [Trypanosoma cruzi]
          Length = 467

 Score =  160 bits (405), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 94/237 (39%), Positives = 121/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P   +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVKVEVV------GAPAAVDWRARGAVTAVKDQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           SA G +E   FL+   L +LSEQ LV CD           D GC+GGLMN+AFE+I++  
Sbjct: 151 SAIGNVECQWFLAGHPLTNLSEQMLVSCDKT---------DFGCSGGLMNNAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 202 NGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVA 258


>gi|281207567|gb|EFA81750.1| cysteine protease 4 [Polysphondylium pallidum PN500]
          Length = 432

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 86/227 (37%), Positives = 128/227 (56%), Gaps = 14/227 (6%)

Query: 68  EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
           +E ++R+ VFK N+   ++      + V G+  F+DLT +E++R +LG         +  
Sbjct: 48  KEFNHRYGVFKKNMDYVQQWNAKGSSTVLGMNIFADLTNAEYQRIYLGTKIDASGLLNVA 107

Query: 128 KAPILPTN----DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
            A     N     L    DWR  GAVT +K+Q  CGSCWSFS TG++EGAH +STG LV+
Sbjct: 108 AARAFDRNFNIKALNPTVDWRAKGAVTPIKNQAQCGSCWSFSTTGSVEGAHEISTGNLVA 167

Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
           LSEQ L+DC     PE     + GCNGGLM +A EYI+K GG++ E  YPYT T    C+
Sbjct: 168 LSEQNLIDCSV---PEG----NQGCNGGLMWAAMEYIIKNGGIDTESSYPYTATGPNKCR 220

Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
           ++ +   A +S++  ++S  +   A+     P++    +I+  H SF
Sbjct: 221 YNSANSGAKISSYVNVTSGSETSLASAANVNPVS---VAIDASHNSF 264


>gi|110349475|gb|ABG73218.1| cathepsin L 2 precursor [Diaprepes abbreviatus]
          Length = 348

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 99/253 (39%), Positives = 129/253 (50%), Gaps = 33/253 (13%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDL 104
            +  +  FK +  K Y ++ E++YR  VF  NL +      L    +      +    DL
Sbjct: 24  VQEQWEQFKLEHGKVYESESENEYRQSVFMENLFQINEHNKLYEMGLSSYQMAMNHLGDL 83

Query: 105 TPSEFRRQFLGLNR--------------RLRLPADAQK--APILPTN----DLPTDFDWR 144
           T  EF R +  +N                L LP D Q      LPTN    DLPTD DWR
Sbjct: 84  TKDEFMRIYT-VNMPQLPQSENLSDSEPWLDLPQDLQGFVTYALPTNLDEVDLPTDIDWR 142

Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
             GAVT VK+Q  CGSCWSFSATGALE   F  T +L+SLSEQQLVDC            
Sbjct: 143 QKGAVTPVKNQRNCGSCWSFSATGALEAQWFKKTNKLISLSEQQLVDCSGRYG------- 195

Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDED 264
           + GC+GG M+ AF YI + GG++ E+ YPYT  D G C +     AA VS   ++   E+
Sbjct: 196 NHGCHGGWMHWAFGYIKENGGIDTEQSYPYTAKD-GRCAYKPGNKAATVSQVIMVPRGEN 254

Query: 265 QMAANLVKHGPLA 277
           Q+AA +   GP++
Sbjct: 255 QLAAKVSSVGPIS 267


>gi|313224805|emb|CBY20597.1| unnamed protein product [Oikopleura dioica]
          Length = 343

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 91/227 (40%), Positives = 128/227 (56%), Gaps = 12/227 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           F  ++ +FSK Y T EE   R + F  N        Q  D T   G+   +DLT SEF+ 
Sbjct: 42  FRQYEVEFSKMYETAEERRIRAQTFSKNFEMITSHNQREDVTWTMGLNFDADLTFSEFQS 101

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
           ++L +++     A + +   +    LP +FDWR+HG V+ VK+QG CGSCW+FS TG LE
Sbjct: 102 RYLMVSQDC--SATSTRDLDIDILSLPENFDWREHGGVSPVKNQGHCGSCWTFSTTGCLE 159

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
            AH +   +  +LSEQQLVDC  + D       + GCNGGL + AFEYI   GG+E E+D
Sbjct: 160 SAHLIHHKKAYNLSEQQLVDCAQDFD-------NHGCNGGLPSHAFEYIHYVGGLEEEQD 212

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLA 277
           Y Y   + G C+FD +K A  V   F++  +DEDQ+   L    P++
Sbjct: 213 YSYHAEE-GLCEFDPTKTAGTVREVFNITETDEDQLTIALAYFNPVS 258


>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
          Length = 324

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 93/245 (37%), Positives = 132/245 (53%), Gaps = 24/245 (9%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRR 111
           +K++  K+Y   +E   R   ++AN +            V G T    +F DL  SEF+ 
Sbjct: 25  WKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHN--QHAGVFGYTLKMNQFGDLENSEFKS 82

Query: 112 QFLGLNRRLRLPADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
            + G  R    P   +  P +P     DLP   DW   G VT VK+QG CGSCWSFSATG
Sbjct: 83  LYNGY-RMSNAPRKGK--PFVPAARVQDLPASVDWSKKGWVTPVKNQGQCGSCWSFSATG 139

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           ++EG HF +TG L+SLSEQ LVDC        +   + GCNGGLM+ AFEY++K  G++ 
Sbjct: 140 SMEGQHFNATGTLMSLSEQNLVDC-------SAAEGNHGCNGGLMDDAFEYVIKNNGIDT 192

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLAGNVASIELPH 287
           E  YPY   D  +CKF+ + + A +S +  ++ D E  +   +   GP++    +I+  H
Sbjct: 193 EASYPYRAVD-STCKFNTADVGATISGYVDVTKDSESDLQVAVATIGPVS---VAIDASH 248

Query: 288 ISFSF 292
           ISF F
Sbjct: 249 ISFQF 253


>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
          Length = 324

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 94/232 (40%), Positives = 132/232 (56%), Gaps = 22/232 (9%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVH---GVTKFSDLTPSEFRR 111
           +K K+ K+Y  + E   R RV+++NL+  ++  +L D    +   G+  ++DL    +  
Sbjct: 22  WKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADL----YNE 77

Query: 112 QFLGLNR-----RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           +F+ L       + +  +  Q    L    LP+  DWR+ G VT VKDQG CGSCWSFSA
Sbjct: 78  EFMALKGSSGILQAKDQSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWSFSA 137

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TG+LEG HF  TG LVSLSEQQLVDC            + GC+GGLM SA++YI  AGGV
Sbjct: 138 TGSLEGQHFAKTGTLVSLSEQQLVDCSWSYG-------NYGCSGGLMESAYDYIRDAGGV 190

Query: 227 EREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           + E  YPYT  + G C FD+SK +A    + ++ S DE  +   +   GP+A
Sbjct: 191 QLESAYPYTAQN-GRCHFDQSKAVATCTGHVAIPSGDEQSLMQAVGTVGPVA 241


>gi|144228217|gb|ABO93617.1| papain-like cysteine proteinase [Vitis vinifera]
          Length = 161

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 75/112 (66%), Positives = 92/112 (82%), Gaps = 4/112 (3%)

Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS 247
           QLVDCDHECDPEE G+CD GCNGGLM SAFEYILKAGGVERE+ YPY G+D GSCKF+KS
Sbjct: 1   QLVDCDHECDPEEYGACDQGCNGGLMTSAFEYILKAGGVEREETYPYIGSDRGSCKFNKS 60

Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVSSP 299
           +I A+VSNFSV+S DEDQ+AAN+VK+GPLA  + ++ +     +++  VS P
Sbjct: 61  QIVASVSNFSVVSLDEDQIAANMVKNGPLAVGINAVFMQ----TYMKGVSCP 108


>gi|116788286|gb|ABK24823.1| unknown [Picea sitchensis]
          Length = 294

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 102/243 (41%), Positives = 140/243 (57%), Gaps = 37/243 (15%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           R+IL  ++LLL+ S + +A+  N  D             SE+ LL+    F  + +   K
Sbjct: 5   RMILKLVMLLLVFSSV-TAITYNPRDL------------SENGLLSL---FDRWCNHHGK 48

Query: 63  TYATQEEHDYRFRVFKANL-RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----- 116
           TY T ++   RF+VFK NL   ++     + T   G+  FSDLT  EFR Q +GL     
Sbjct: 49  TY-TAKQRPLRFQVFKENLFYISEHNSRGNHTFWLGLNAFSDLTSDEFRTQQMGLRGHPP 107

Query: 117 --NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
               R R P    K+ +L   ++P+  DWRD  AVTGVKDQGACG CW+FSATGA+EG +
Sbjct: 108 SLKSRRREP----KSGLLELYNIPSSLDWRDKDAVTGVKDQGACGDCWAFSATGAIEGIN 163

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
            + TG LVSLSEQ+L DCD         S +SGC+GGLM+ AF++++  GG++ E DYPY
Sbjct: 164 KIVTGSLVSLSEQELCDCDT--------SYNSGCDGGLMDYAFQWVIVNGGIDTEVDYPY 215

Query: 235 TGT 237
            G 
Sbjct: 216 KGV 218


>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
          Length = 324

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 97/249 (38%), Positives = 138/249 (55%), Gaps = 20/249 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP----TAVHGVTKFSDLT 105
           E ++++FK+K +KTY+  E+   R+ +++ NL++ +    L      T   G  K++D+T
Sbjct: 19  EANWAIFKAKHNKTYSGDEDIIRRY-IWQTNLQKIEAHNELYAKGLSTYFLGENKYADMT 77

Query: 106 PSEFRRQFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
             EFRR   GL     L P D      +  + LPT  DWR  G VT VKDQG CGSCW+F
Sbjct: 78  NEEFRRTLSGLRVDKELTPGDFVSG--MFKDSLPTAVDWRKEGYVTEVKDQGQCGSCWAF 135

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           S TG+LEG HF +T +LVSLSE  LVDC  +         + GCNGGLM++AF+YI    
Sbjct: 136 STTGSLEGQHFKATKQLVSLSESNLVDCSKKWG-------NQGCNGGLMDNAFKYIADNK 188

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASI 283
           G++ EK YPY   D   C F K+ + A    +  I+S  ED +   +   GP++    +I
Sbjct: 189 GIDTEKSYPYKPED-RKCNFKKANVGATDKLYKDITSGSEDALQEAVATIGPIS---VAI 244

Query: 284 ELPHISFSF 292
           +  H SF  
Sbjct: 245 DASHDSFQL 253


>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
          Length = 360

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 109/278 (39%), Positives = 150/278 (53%), Gaps = 26/278 (9%)

Query: 9   LLLLLLSSVLASAVA---VNDDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFKS 58
           L L++   + ASA+A      D+  IRQVV SDG    E +   ++    H   F+ F  
Sbjct: 8   LALVVAGGLFASALAGPATFADENPIRQVV-SDGLHELENAILQVVGKTRHALSFARFAH 66

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR 118
           ++ K Y + EE   RF VF  NL+  +       +   GV +F+DLT  EFRR  LG  +
Sbjct: 67  RYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQ 126

Query: 119 RLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
                +   K  +  TN  LP   DWR+ G V+ VK+QG CGSCW+FS TGALE A+  +
Sbjct: 127 NC---SATTKGNLKVTNVVLPETKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYSQA 183

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
            G+ +SLSEQQLVDC    +       + GCNGGL + AFEYI   GG++ E+ YPYTG 
Sbjct: 184 FGKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGK 236

Query: 238 DGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
           + G CKF    +   V    N ++ + DE + A  LV+
Sbjct: 237 N-GLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVR 273


>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
           Short=CP-2; AltName: Full=Major excreted protein;
           Short=MEP; Contains: RecName: Full=Procathepsin L;
           Contains: RecName: Full=Cathepsin L1 heavy chain;
           Contains: RecName: Full=Cathepsin L1 light chain; Flags:
           Precursor
 gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
 gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
 gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
          Length = 334

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 94/253 (37%), Positives = 133/253 (52%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
           D   NA+ H   +KS   + Y T EE ++R  V++ N+R  +          HG T    
Sbjct: 22  DQTFNAQWH--QWKSTHRRLYGTNEE-EWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    +P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLML--QIPKTVDWREKGCVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA+G LEG  FL TG+L+SLSEQ LVDC H+         + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHD-------QGNQGCNGGLMDFAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           I + GG++ E+ YPY   D GSCK+      A  + F  I   E  +   +   GP++  
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEYAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246

Query: 280 VASIELPHISFSF 292
             +++  H S  F
Sbjct: 247 -VAMDASHPSLQF 258


>gi|42564163|gb|AAS20593.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 324

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 95/238 (39%), Positives = 134/238 (56%), Gaps = 21/238 (8%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR------RQLLDPTAVHGVTK 100
           L+ +  +  FK  FSK+Y    E   RF +F +NL R +       R L   T   GV K
Sbjct: 17  LSDKEKWQNFKINFSKSYQNVVEEKRRFNIFLSNLLRIEEHNQNFSRGL--STYEMGVNK 74

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
           F+DLTP EF  +F  L R+ +    +++A      DLP + DW   GAVT VK QG+CGS
Sbjct: 75  FADLTPEEFMERFRPL-RKTKPKFLSEQAKFNFDGDLPAEVDWTKQGAVTEVKSQGSCGS 133

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS TG++E  +F+ TG+L+SLSEQQLVDC            +SGC GG M+ A EYI
Sbjct: 134 CWAFSTTGSVESHNFIKTGKLISLSEQQLVDCVKN---------NSGCAGGWMDIALEYI 184

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
            +A G+  E DYPY   +  +C+F+ SK A  + ++  I  +DE  +   +   GP++
Sbjct: 185 -EADGIMSEDDYPYEERN-TTCRFNNSKAAVQIKSYKAIKKNDEIDLQKAVALEGPVS 240


>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
          Length = 315

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 96/255 (37%), Positives = 139/255 (54%), Gaps = 19/255 (7%)

Query: 28  DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
           D  I    P D E S D L+     F  + S F K Y T EE   RF VFK NL+     
Sbjct: 30  DYSIVGYSPEDLE-SHDKLIEL---FENWISNFEKAYETVEEKFLRFEVFKDNLKHIDET 85

Query: 88  QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWR 144
                +   G+ +F+DL+  EF++ +LGL   +    + +        D+   P   DWR
Sbjct: 86  NKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWR 145

Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
             GAV  VK+QG+CGSCW+FS   A+EG + + TG L +LSEQ+L+DCD         + 
Sbjct: 146 KKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDT--------TY 197

Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF--DKSKIAAAVSNFSVISSD 262
           ++GCNGGLM+ AFEYI+K GG+ +E+DYPY+  + G+C+   D+S+      +  V ++D
Sbjct: 198 NNGCNGGLMDYAFEYIVKNGGLRKEEDYPYS-MEEGTCEMQKDESETVTINGHQDVPTND 256

Query: 263 EDQMAANLVKHGPLA 277
           E  +   L  H PL+
Sbjct: 257 EKSLLKALA-HQPLS 270


>gi|343472970|emb|CCD15012.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 382

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 92/233 (39%), Positives = 125/233 (53%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQG C S W+FSA G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVNVSTGKAPPAIDWRKKGAVTPVKDQGQCDSSWAFSAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD         + D GC GG  + AF++I+ +  G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCD---------TNDFGCGGGFSDPAFKWIVSSNKGNV 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY    G     DKS   + A + +   +  DE+ +A  L K+GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDRVDLPRDENAIAEWLAKNGPVA 261


>gi|326520659|dbj|BAJ92693.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 289

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 93/271 (34%), Positives = 142/271 (52%), Gaps = 25/271 (9%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+ +      ++ + ++   TY    E + RF  F+ NLR   +        
Sbjct: 28  IVSYGERSEEEV---RRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAG 84

Query: 95  VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
           VH    G+ +F+DLT  E+R  +LG     +R  +L A  Q A     ++LP   DWR  
Sbjct: 85  VHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAAD---NDELPESVDWRKK 141

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAV  VKDQG CGSCW+FSA  A+EG + + TG+++ LSEQ+LVDCD         S + 
Sbjct: 142 GAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQ 193

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLM+ AFE+I+  GG++ E+DYPY   D       K+     +  +  +  + ++ 
Sbjct: 194 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKS 253

Query: 267 AANLVKHGPLAGNVASIELPHISFSFLFTVS 297
               V + P++    +IE    +F    +VS
Sbjct: 254 LQKAVANQPIS---VAIEAGGRAFQLYKSVS 281


>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 332

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 108/291 (37%), Positives = 143/291 (49%), Gaps = 38/291 (13%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           L L LL +++A  VA N  + +  Q                   +  FK+   K+Y +  
Sbjct: 2   LRLSLLCAIVAVTVAANSHEILRTQ-------------------WEAFKTTHKKSYESHM 42

Query: 69  EHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPA 124
           E   RF++F  N L  AK         V    G+ +F DL   EF + F G  R  R   
Sbjct: 43  EELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGY-RGQRTSR 101

Query: 125 DAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
            +   P    ND  LP+  DWR  GAVT VKDQG CGSCW+FSATG+LEG HFL  GELV
Sbjct: 102 GSTFMPPANVNDSSLPSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELV 161

Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
           SLSEQ LVDC            ++GC GGLM++AF+YI    G++ E+ YPY   D   C
Sbjct: 162 SLSEQNLVDCSQSFG-------NNGCEGGLMDNAFKYIKANDGIDAEESYPYEAMD-DKC 213

Query: 243 KFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           +F K  + A  + F  I    ED +   +   GP++    +I+  H SF  
Sbjct: 214 RFKKEDVGATDTGFVDIEGGSEDDLKKAVATVGPIS---VAIDAGHSSFQL 261


>gi|79331505|ref|NP_001032106.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|332009931|gb|AED97314.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 357

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 102/257 (39%), Positives = 137/257 (53%), Gaps = 22/257 (8%)

Query: 26  DDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFK 78
           D+   IR V  SDG    E+S   +L    H   F+ F  ++ K Y   EE   RF +FK
Sbjct: 27  DESNPIRMV--SDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84

Query: 79  ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
            NL   +       +   GV +F+DLT  EF+R  LG  +     A  + +  +    LP
Sbjct: 85  ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNC--SATLKGSHKVTEAALP 142

Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
              DWR+ G V+ VKDQG CGSCW+FS TGALE A+  + G+ +SLSEQQLVDC    + 
Sbjct: 143 ETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFN- 201

Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
                 + GCNGGL + AFEYI   GG++ EK YPYTG D  +CKF    +   V    N
Sbjct: 202 ------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD-ETCKFSAENVGVQVLNSVN 254

Query: 256 FSVISSDEDQMAANLVK 272
            ++ + DE + A  LV+
Sbjct: 255 ITLGAEDELKHAVGLVR 271


>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
          Length = 324

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 93/239 (38%), Positives = 131/239 (54%), Gaps = 17/239 (7%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
           +K   +K Y+   E   R+ ++K N RR +   L     +  + +F D+T SEF+     
Sbjct: 30  WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFILKMNQFGDMTNSEFK----A 85

Query: 116 LNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
            N  L          + P N + P   DWR+ G VT VKDQG CGSCW+FS TG+LEG H
Sbjct: 86  FNGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQH 145

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
           F  TG+LVSLSEQ LVDC        +   ++GC+GGLM++AF YI +  G++ E  YPY
Sbjct: 146 FKKTGKLVSLSEQNLVDC-------STAYGNNGCDGGLMDNAFTYIKENKGIDSEASYPY 198

Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           T  D G C F KS +AA  + F  I   +E+++   +   GP++    +I+  H SF F
Sbjct: 199 TAED-GKCVFKKSSVAATDTGFVDIPEGNENKLKEAVASVGPIS---VAIDASHESFQF 253


>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
          Length = 336

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 107/291 (36%), Positives = 148/291 (50%), Gaps = 38/291 (13%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           L LL+L++ L+S ++    DA + +                  H+ L+KS  SK Y  +E
Sbjct: 2   LPLLVLTACLSSVLSAPVLDAQLNE------------------HWDLWKSWHSKKYHEKE 43

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLG--LNRRLRL 122
           E  +R  V++ NL++ +   L      H    G+  F D+T  EFR+   G  L  + + 
Sbjct: 44  E-GWRRMVWEKNLQKIELHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLKTQRKF 102

Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
                  P   T   P+  DWR+ G VT VKDQG CGSCW+FS TGALEG  F  TG+LV
Sbjct: 103 TGSLFMEPNFMT--APSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLV 160

Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
           SLSEQ LVDC     PE     + GC GGLM+ AF+Y+    G++ E  YPYTGTD   C
Sbjct: 161 SLSEQNLVDCSR---PE----GNEGCGGGLMDQAFQYVTDNQGLDSEDSYPYTGTDDQPC 213

Query: 243 KFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
            +D    +A  + F  V S  E  +   +   GP++    +I+  H SF F
Sbjct: 214 HYDPLYNSANDTGFVDVPSGKEHALMKAVASVGPVS---VAIDAGHESFQF 261


>gi|6978723|ref|NP_037288.1| cathepsin L1 preproprotein [Rattus norvegicus]
 gi|55888|emb|CAA68691.1| prepro-cathepsin L [Rattus norvegicus]
          Length = 334

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 94/253 (37%), Positives = 133/253 (52%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
           D   NA+ H   +KS   + Y T EE ++R  V++ N+R  +          HG T    
Sbjct: 22  DQTFNAQWH--QWKSTHRRLYGTNEE-EWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    +P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLML--QIPKTVDWREKGCVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA+G LEG  FL TG+L+SLSEQ LVDC H+         + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHD-------QGNQGCNGGLMDFAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           I + GG++ E+ YPY   D GSCK+      A  + F  I   E  +   +   GP++  
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEYAVANDTGFVDIPQQEKALMKPVATVGPIS-- 246

Query: 280 VASIELPHISFSF 292
             +++  H S  F
Sbjct: 247 -VAMDASHPSLQF 258


>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
 gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
          Length = 336

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 100/251 (39%), Positives = 134/251 (53%), Gaps = 20/251 (7%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           + H+ L+K   SK Y  +EE  +R  V++ NLR+ +   L      H    G+  F D+T
Sbjct: 25  DQHWQLWKGWHSKNYHEKEE-GWRRLVWEKNLRKIELHNLEHSMGKHSYRLGMNHFGDMT 83

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
             EFR+   G  RR +       +  +  N L  P   DWRD G VT VKDQG CGSCW+
Sbjct: 84  HEEFRQIMNGYKRREQRKYSG--SLFMEPNFLEAPRAVDWRDKGYVTPVKDQGQCGSCWA 141

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TGALEG  F  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+Y+   
Sbjct: 142 FSTTGALEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDN 194

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVAS 282
            G++ E  YPY GTD   C+++    A   + F  I S +++     V   GP++    +
Sbjct: 195 QGLDSEDFYPYKGTDDQPCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVS---VA 251

Query: 283 IELPHISFSFL 293
           I+  H SF F 
Sbjct: 252 IDAGHESFQFY 262


>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
 gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
          Length = 300

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 128/229 (55%), Gaps = 19/229 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
           F  + +K  K+Y++  E   R  +F   L   ++   L + T   G+ KFSDLT +EFR 
Sbjct: 2   FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61

Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
            ++G   + + P    + P     +  + LPT  DWR  GAVT +KDQG CGSCW+FSA 
Sbjct: 62  NYVG---KFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
            ++E AHFL+T ELVSLSEQQL+DCD         + D GC GG    AF+++++ GGV 
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPEDAFKFVVENGGVT 169

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
            E+ YPYTG   GSC  +K+K+   ++ +  ++ D        V   P+
Sbjct: 170 TEEAYPYTGF-AGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPV 216


>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
          Length = 335

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 98/261 (37%), Positives = 136/261 (52%), Gaps = 22/261 (8%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAV---HG 97
           S   +L AE  +S FK+K  K+Y ++ E  +R +++  N  + AK  +      V     
Sbjct: 18  SYQEVLGAE--WSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMA 75

Query: 98  VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN----DLPTDFDWRDHGAVTGVK 153
           + +F D+   EF     G  R  +         + P N     LP   DWR  GAVT VK
Sbjct: 76  MNEFGDMLHHEFVSTRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVK 135

Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
           +QG CGSCW+FSATG+LEG HF  +G +VSLSEQ LV C  +         ++GC GGLM
Sbjct: 136 NQGQCGSCWAFSATGSLEGQHFRKSGSMVSLSEQNLVGCSTDFG-------NNGCEGGLM 188

Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVK 272
           + AF+YI    G++ EK YPY GTD G+C F KS + A  S F  +    E Q+   +  
Sbjct: 189 DDAFKYIRANKGIDTEKSYPYNGTD-GTCHFKKSTVGATDSGFVDIKEGSETQLKKAVAT 247

Query: 273 HGPLAGNVASIELPHISFSFL 293
            GP++    +I+  H SF F 
Sbjct: 248 VGPIS---VAIDASHESFQFY 265


>gi|18424347|ref|NP_568921.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|71152227|sp|Q8H166.2|ALEU_ARATH RecName: Full=Thiol protease aleurain; Short=AtALEU; AltName:
           Full=Senescence-associated gene product 2; Flags:
           Precursor
 gi|7230640|gb|AAF43041.1|AF233883_1 AALP protein [Arabidopsis thaliana]
 gi|13430722|gb|AAK25983.1|AF360273_1 putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|9757740|dbj|BAB08221.1| AALP protein [Arabidopsis thaliana]
 gi|21617934|gb|AAM66984.1| cysteine proteinase AALP [Arabidopsis thaliana]
 gi|23397068|gb|AAN31819.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|23397074|gb|AAN31822.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|24417304|gb|AAN60262.1| unknown [Arabidopsis thaliana]
 gi|222423506|dbj|BAH19723.1| AT5G60360 [Arabidopsis thaliana]
 gi|222424411|dbj|BAH20161.1| AT5G60360 [Arabidopsis thaliana]
 gi|332009930|gb|AED97313.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 358

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 102/257 (39%), Positives = 137/257 (53%), Gaps = 22/257 (8%)

Query: 26  DDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFK 78
           D+   IR V  SDG    E+S   +L    H   F+ F  ++ K Y   EE   RF +FK
Sbjct: 27  DESNPIRMV--SDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84

Query: 79  ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
            NL   +       +   GV +F+DLT  EF+R  LG  +     A  + +  +    LP
Sbjct: 85  ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNC--SATLKGSHKVTEAALP 142

Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
              DWR+ G V+ VKDQG CGSCW+FS TGALE A+  + G+ +SLSEQQLVDC    + 
Sbjct: 143 ETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFN- 201

Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
                 + GCNGGL + AFEYI   GG++ EK YPYTG D  +CKF    +   V    N
Sbjct: 202 ------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD-ETCKFSAENVGVQVLNSVN 254

Query: 256 FSVISSDEDQMAANLVK 272
            ++ + DE + A  LV+
Sbjct: 255 ITLGAEDELKHAVGLVR 271


>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
 gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
          Length = 300

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 128/229 (55%), Gaps = 19/229 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
           F  + +K  K+Y++  E   R  +F   L   ++   L + T   G+ KFSDLT +EFR 
Sbjct: 2   FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61

Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
            ++G   + + P    + P     +  + LPT  DWR  GAVT +KDQG CGSCW+FSA 
Sbjct: 62  NYVG---KFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
            ++E AHFL+T ELVSLSEQQL+DCD         + D GC GG    AF+++++ GGV 
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPEDAFKFVVENGGVT 169

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
            E+ YPYTG   GSC  +K+K+   ++ +  ++ D        V   P+
Sbjct: 170 TEEAYPYTGF-AGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPV 216


>gi|2804264|dbj|BAA24443.1| cysteine proteinase [Sitophilus zeamais]
          Length = 331

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 107/280 (38%), Positives = 145/280 (51%), Gaps = 40/280 (14%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           LLL+L++V+ S  AV+  D +  Q                   +S FK + SK Y ++ E
Sbjct: 3   LLLILAAVVISCQAVSFYDLVQEQ-------------------WSSFKMQHSKNYDSETE 43

Query: 70  HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNR---RLRL 122
             +R ++F  N  + AK  +L     V    G+ K++D+   EF     G N+    +  
Sbjct: 44  ERFRMKIFMENAHKVAKHSKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILK 103

Query: 123 PADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
            +D   A   I P N  LP   DWRD GAVT VKDQG CGSCWSFS +G+LEG HF  TG
Sbjct: 104 GSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHCGSCWSFSGSGSLEGQHFRKTG 163

Query: 180 ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           +LVSLSEQ LVDC        SG   ++GCNGGLM++AF YI   GG++ E+ YPY   D
Sbjct: 164 KLVSLSEQNLVDC--------SGRYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYLAED 215

Query: 239 GGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
              C +      A    F  I   +ED + A +   GP++
Sbjct: 216 -EKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPIS 254


>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 99/289 (34%), Positives = 152/289 (52%), Gaps = 20/289 (6%)

Query: 7   SSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT 66
           S  L+L  S  L  ++A   D +++     S+  +S D L+     F  + S+  K Y T
Sbjct: 6   SKTLVLTCSLCLFLSLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYET 60

Query: 67  QEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL--RLPA 124
            EE   RF VFK NL+    R  +      G+ +F+DL+  EF+ ++LGL   L  R  +
Sbjct: 61  IEEKLLRFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVNLSQRRES 120

Query: 125 DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
             ++       DLP   DWR  GAVT VK+QG CGSCW+FS   A+EG + + TG L SL
Sbjct: 121 SNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSL 180

Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
           SEQ+L+DCD         + ++GCNGGLM+ AF +I++ GG+ +E DYPY   +  +C+ 
Sbjct: 181 SEQELIDCDT--------TYNNGCNGGLMDYAFSFIVQNGGLHKEDDYPYI-MEESTCEM 231

Query: 245 DKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
            K +     ++ +  +  + +Q     + + PL+    +IE     F F
Sbjct: 232 KKEETQVVTINGYHDVPQNNEQSLLKALANQPLS---VAIEASSRDFQF 277


>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
          Length = 338

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 97/256 (37%), Positives = 137/256 (53%), Gaps = 21/256 (8%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTK 100
           +LL  E H  LFK+   K Y +Q E   R +++  N  +  +  +L    + +    + K
Sbjct: 25  NLLADEWH--LFKATHKKEYPSQLEEKLRMKIYLENKHKVAKHNILYEKGEKSYQVAMNK 82

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
           F DL   EFR    G   + +  + A+       P N ++P   DWR+ GA+T VKDQG 
Sbjct: 83  FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQ 142

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FS+TGALEG  F  TG+LVSLSEQ L+DC  +   E       GCNGGLM+ AF
Sbjct: 143 CGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 195

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
           +YI    G++ E  YPY   D G C+++     A    F  I S +ED++ A +   GP+
Sbjct: 196 QYIKDNKGIDTENTYPYEAED-GVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPV 254

Query: 277 AGNVASIELPHISFSF 292
           +    +I+  H SF F
Sbjct: 255 S---VAIDASHESFQF 267


>gi|146335576|gb|ABQ23397.1| cathepsin L [Trypanosoma carassii]
          Length = 456

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 91/230 (39%), Positives = 129/230 (56%), Gaps = 13/230 (5%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK++  K+Y +  E  YR RVF+ +++ A+     +P A  GVTKFSDLT  EF+ 
Sbjct: 35  QFAAFKAEHGKSYTSAAEEGYRMRVFEESMKAAQAHAAANPHAKFGVTKFSDLTHEEFKT 94

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
            +          A   + P+  T   P ++DWR  GAVT VKDQG CGSCW+FS TG +E
Sbjct: 95  LYANGAAHFAAAAKRARRPVSVTGTAPDEWDWRKKGAVTPVKDQGHCGSCWTFSTTGNIE 154

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVERE 229
           G   ++  EL +LSEQ LV CD           D GC+GGLM++AFE+I+    G V  E
Sbjct: 155 GQWAVAGNELTNLSEQMLVSCDAR---------DYGCSGGLMDNAFEWIVNQNDGFVFTE 205

Query: 230 KDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           + YPY    G +  C     K+ A +     + +DE++MAA L  +GP++
Sbjct: 206 ESYPYASGSGDAPLCDVGGRKVGATIKGHVGLPNDEEKMAAWLAANGPIS 255


>gi|344295816|ref|XP_003419606.1| PREDICTED: cathepsin F [Loxodonta africana]
          Length = 473

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 98/227 (43%), Positives = 136/227 (59%), Gaps = 14/227 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F + +++TY T+EE  +R  VF  N+ RA++ Q LD  TA +G+TKFSDLT  EFR 
Sbjct: 176 FKNFVTTYNRTYETKEETKWRMSVFANNMIRAQKLQALDQGTAQYGITKFSDLTEEEFRT 235

Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  N  LR  P    +    P   +P D+DWR  GAVT VKDQG CGSCW+FS TG +
Sbjct: 236 IYL--NPLLREDPGQKMRLGKAPKGPVPPDWDWRTKGAVTKVKDQGMCGSCWAFSVTGNV 293

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL+ G L+SLSEQ+L+DCD           D  C GG+ ++A+  I   GG+E E+
Sbjct: 294 EGQWFLNRGTLLSLSEQELLDCD---------KVDKACMGGVPSNAYSAIKTLGGLETEE 344

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY Y G    +C F   K    +++   +S +E ++AA L K+GP++
Sbjct: 345 DYSYHG-HLQACSFSAEKAKVYINDSVELSQNEYKLAAWLAKNGPIS 390


>gi|4581057|gb|AAD24589.1|AF139913_1 cysteine protease [Trypanosoma congolense]
          Length = 440

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 92/233 (39%), Positives = 123/233 (52%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQG C S W+FSA G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVNVSTGKAPPAIDWRKKGAVTPVKDQGQCHSSWAFSAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD           D GC GG  + AF++I+ +  G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTN---------DFGCGGGFSDPAFKWIVSSNKGNV 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY    G     DKS   + A + +   +  DE+ +A  L K GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDRVDLPRDENAIAEWLAKKGPVA 261


>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 94/249 (37%), Positives = 138/249 (55%), Gaps = 24/249 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSE 108
           ++ +K++  K Y + EE   R  +++ NL    +   +  L   T   G+ +F+DL   E
Sbjct: 28  WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLQNEE 87

Query: 109 FRRQFLGLNRRLRLPADAQK-APILPTND---LPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           F     G   R+   + A K +  LP+N+   LP   DWR  G VT VKDQG CGSCW+F
Sbjct: 88  FVAMMTGF--RVNGTSKAAKGSTFLPSNNVDKLPKTVDWRTKGYVTPVKDQGQCGSCWAF 145

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           SATG+LEG  F  TG+LVSLSEQ LVDC +          + GC+GG M+ AF+YI+ AG
Sbjct: 146 SATGSLEGQQFKKTGKLVSLSEQNLVDCSYR---------NYGCHGGFMDRAFQYIIDAG 196

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASI 283
           G++ E  Y Y   D G+C F K+ + A V+ ++ ++S  ++     V H GP++    +I
Sbjct: 197 GIDTEATYSYRAVD-GNCHFKKANVGATVTGYTDVTSGSEKALQKAVAHIGPIS---VAI 252

Query: 284 ELPHISFSF 292
           +  H  F F
Sbjct: 253 DASHKFFKF 261


>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
 gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
          Length = 458

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 85/209 (40%), Positives = 120/209 (57%), Gaps = 16/209 (7%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+    A   ++ +K++  K+Y    E + R+  F+ NLR            
Sbjct: 25  IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81

Query: 95  VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
           VH    G+ +F+DLT  E+R  +LGL  + R         +   N+ LP   DWR  GAV
Sbjct: 82  VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
             +KDQG CGSCW+FSA  A+EG + + TG+L+SLSEQ+LVDCD         S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GGLM+ AF++I+  GG++ E DYPY G D
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKD 222


>gi|330842502|ref|XP_003293216.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
 gi|325076482|gb|EGC30264.1| hypothetical protein DICPUDRAFT_95775 [Dictyostelium purpureum]
          Length = 376

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 93/244 (38%), Positives = 134/244 (54%), Gaps = 17/244 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F+ +  K  K Y  QE    R+ +FK N+             V G+  F+DLT  E+++ 
Sbjct: 34  FTEWTIKHGKQYENQE-FGRRYGIFKDNMDYVHDWNSKGSETVLGLNIFADLTNLEYQKY 92

Query: 113 FLG--LNRRLRLPADAQK-APILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           +LG  +N  L    D +    I  ++D   PT  DW   GAVT +KDQG CGSCWSFS T
Sbjct: 93  YLGTHVNSLLHRGYDGRALEEIFGSDDGRNPTSVDWNKKGAVTPIKDQGQCGSCWSFSTT 152

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           G++EGAH + TG+LVSLSEQ LVDC            + GC+GGLM++AF YI++  G++
Sbjct: 153 GSVEGAHQIKTGKLVSLSEQNLVDC-------SGAEGNLGCDGGLMDNAFIYIIQNKGID 205

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELP 286
            E  YPY    G  C F  + I A +S + ++ +  E Q+   + K+GP++    +I+  
Sbjct: 206 TESSYPYKAQSGTKCLFKPTSIGATLSGYVNITAGSESQLETAVAKNGPVS---VAIDAS 262

Query: 287 HISF 290
           H SF
Sbjct: 263 HNSF 266


>gi|145334857|ref|NP_001078774.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|332009932|gb|AED97315.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 361

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 102/257 (39%), Positives = 137/257 (53%), Gaps = 22/257 (8%)

Query: 26  DDDAMIRQVVPSDG----EQSEDHLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFK 78
           D+   IR V  SDG    E+S   +L    H   F+ F  ++ K Y   EE   RF +FK
Sbjct: 27  DESNPIRMV--SDGLREVEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFK 84

Query: 79  ANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP 138
            NL   +       +   GV +F+DLT  EF+R  LG  +     A  + +  +    LP
Sbjct: 85  ENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNC--SATLKGSHKVTEAALP 142

Query: 139 TDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDP 198
              DWR+ G V+ VKDQG CGSCW+FS TGALE A+  + G+ +SLSEQQLVDC    + 
Sbjct: 143 ETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFN- 201

Query: 199 EESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
                 + GCNGGL + AFEYI   GG++ EK YPYTG D  +CKF    +   V    N
Sbjct: 202 ------NYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD-ETCKFSAENVGVQVLNSVN 254

Query: 256 FSVISSDEDQMAANLVK 272
            ++ + DE + A  LV+
Sbjct: 255 ITLGAEDELKHAVGLVR 271


>gi|296218871|ref|XP_002755611.1| PREDICTED: cathepsin F [Callithrix jacchus]
          Length = 489

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 99/226 (43%), Positives = 132/226 (58%), Gaps = 13/226 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
           F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 193 FRNFVITYNRTYESKEEAQWRLSVFVHNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 252

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
            +L  N  LR P    K      +  P ++DWR  GAVT VKDQG CGSCW+FS TG +E
Sbjct: 253 TYL--NPLLREPGKKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVE 310

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           G  FL+ G L+SLSEQ+L+DCD           D  C GGL +SA+  I   GG+E E D
Sbjct: 311 GQWFLNQGTLLSLSEQELLDCDK---------IDKACMGGLPSSAYSAIKNLGGLETEDD 361

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           Y Y G    +C F   K    +++   +S +E ++AA L K GP++
Sbjct: 362 YSYRG-HMQACNFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 406


>gi|335281454|ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]
 gi|350579927|ref|XP_003480717.1| PREDICTED: cathepsin F-like [Sus scrofa]
          Length = 490

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 99/242 (40%), Positives = 138/242 (57%), Gaps = 24/242 (9%)

Query: 43  EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
           +D  +     F  F + +++TY T+EE  +R  VF  N+ RA++ Q LD  TA +GVTKF
Sbjct: 183 QDFSVKMASIFKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKF 242

Query: 102 SDLTPSEFRRQFLGL------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
           SDLT  EFR  +L         R++RL       P       P ++DWR  GAVT VKDQ
Sbjct: 243 SDLTEEEFRTIYLNPLLQEEPGRKMRLAKSVSSLP-------PPEWDWRKKGAVTKVKDQ 295

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
           G CGSCW+FS TG +EG  FL  G L+SLSEQ+L+DCD           D GC GGL ++
Sbjct: 296 GMCGSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCDK---------VDKGCMGGLPSN 346

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
           A+  I   GG+E E+DY Y G    +C F+  K    +++   +S +E ++AA L + GP
Sbjct: 347 AYSAIKTLGGLETEEDYSYRG-HLQTCSFNAEKAKVYINDSVELSQNEQKLAAWLAEKGP 405

Query: 276 LA 277
           ++
Sbjct: 406 IS 407


>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 102/297 (34%), Positives = 150/297 (50%), Gaps = 50/297 (16%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           ++LL+L +V++ A A          V+P + E            + ++K +  K Y T+ 
Sbjct: 1   MMLLILGAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
           E   R  +F+ N  +     +     +H  T    KF D+   EF ++ +G   ++    
Sbjct: 40  EEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96

Query: 125 DAQKAPILPT----ND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
              K P+L +    ND    LP   DWR+   V+ VKDQG CGSCW+FS TG+LEG H  
Sbjct: 97  ---KKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSN 153

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            TG+LV LSEQQLVDC  +         + GC GGLM+ AF+YI   GG++ E+ YPYT 
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYITANGGLDTEESYPYTA 206

Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           TD   CKFD S + A +  +  V S +E  +   +   GP++    +I+  H SF F
Sbjct: 207 TDDEPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVS---VAIDAGHESFQF 260


>gi|375073980|gb|AFA34857.1| cathepsin L-like protein [Trypanosoma cruzi marinkellei]
          Length = 467

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 93/231 (40%), Positives = 122/231 (52%), Gaps = 14/231 (6%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYKSAAEEAFRLSVFRENLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 QFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
           ++  G          A+    +    +P   DWR  GAVT VKDQG CGSCW+FSA G +
Sbjct: 97  RYHNGAAHFAAAQERARVPVNVEVVGVPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNV 156

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVER 228
           E   FL+   L +LSEQ LV CD           DSGC+GGLMN AFE+I++   G V  
Sbjct: 157 ESQWFLAGHPLTNLSEQMLVSCDKT---------DSGCSGGLMNDAFEWIVQENDGAVYT 207

Query: 229 EKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           E+ YPY   +G S  C      + A ++    +  DE Q+AA L  +GP+A
Sbjct: 208 EESYPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAANGPVA 258


>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
 gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 94/284 (33%), Positives = 148/284 (52%), Gaps = 22/284 (7%)

Query: 12  LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSKTYATQE 68
           L+LS+ L    A   D +++          S +HL + +    LF+S   K SK Y + E
Sbjct: 11  LILSATLFITYATAHDFSIVGY--------SPEHLASMDKTIELFESWMSKHSKAYRSIE 62

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQK 128
           E  +RF +F  NL+          +   G+ +F+DL+  EF+ ++LGL         ++ 
Sbjct: 63  EKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFPRKRSSRG 122

Query: 129 APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
                  DLP   DWR  GAVT VK+QG+CGSCW+FS   A+EG + + TG L SLSEQ+
Sbjct: 123 FSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQE 182

Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK 248
           L+DCD         S ++GC GGLM+ AF+YI+   G+ +E+DYPY   +G   +  +  
Sbjct: 183 LIDCDR--------SFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQF 234

Query: 249 IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
               +S +  + ++++Q     + H P++    +IE    +F F
Sbjct: 235 EVVTISGYEDVPANDEQSLLKALSHQPVS---VAIEASSRNFQF 275


>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 100/248 (40%), Positives = 127/248 (51%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
            +  FK+   KTY +  E   RF++F  N L  AK         V    G+ +F DL   
Sbjct: 26  QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF R F G +   R    +   P    ND  LP   DWR  GAVT VKDQG CGSCW+FS
Sbjct: 86  EFARIFNG-HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATG+LEG HFL  GELVSLSEQ LVDC            ++GC GGLM  AF+YI    G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
           ++ EK YPY   D G C+F K  + A  + +  I +  E  +   +   GP++    +I+
Sbjct: 198 IDTEKSYPYKAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPIS---VAID 253

Query: 285 LPHISFSF 292
             H SF  
Sbjct: 254 ASHSSFQL 261


>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
          Length = 459

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 85/209 (40%), Positives = 120/209 (57%), Gaps = 16/209 (7%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+    A   ++ +K++  K+Y    E + R+  F+ NLR            
Sbjct: 26  IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 82

Query: 95  VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
           VH    G+ +F+DLT  E+R  +LGL  + R         +   N+ LP   DWR  GAV
Sbjct: 83  VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 142

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
             +KDQG CGSCW+FSA  A+EG + + TG+L+SLSEQ+LVDCD         S + GCN
Sbjct: 143 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 194

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GGLM+ AF++I+  GG++ E DYPY G D
Sbjct: 195 GGLMDYAFDFIINNGGIDTEDDYPYKGKD 223


>gi|115495381|ref|NP_001068884.1| cathepsin F precursor [Bos taurus]
 gi|111304901|gb|AAI20004.1| Cathepsin F [Bos taurus]
 gi|296471599|tpg|DAA13714.1| TPA: cathepsin F [Bos taurus]
          Length = 460

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 100/237 (42%), Positives = 137/237 (57%), Gaps = 14/237 (5%)

Query: 43  EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
           +D  +     F  F + +++TY +QEE  +R  VF  N+ RA++ Q LD  TA +GVTKF
Sbjct: 153 QDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKF 212

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPT-DFDWRDHGAVTGVKDQGACGS 160
           SDLT  EFR  +L  N  L+        P  P  D+P   +DWR+ GAVT VKDQG CGS
Sbjct: 213 SDLTEEEFRTIYL--NPLLKDAPGRNMRPAQPVTDVPPPQWDWRNKGAVTNVKDQGMCGS 270

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS TG +EG  FL  G L+SLSEQ+L+DCD           D  C GGL ++A+  I
Sbjct: 271 CWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKT---------DKACLGGLPSNAYSAI 321

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
              GG+E E DY Y G    +C F   K    +++   +S +E ++AA L K+GP++
Sbjct: 322 RTLGGLETEDDYSYRGRL-QTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKNGPVS 377


>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
 gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 100/248 (40%), Positives = 127/248 (51%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
            +  FK+   KTY +  E   RF++F  N L  AK         V    G+ +F DL   
Sbjct: 26  QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF R F G +   R    +   P    ND  LP   DWR  GAVT VKDQG CGSCW+FS
Sbjct: 86  EFARIFNG-HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATG+LEG HFL  GELVSLSEQ LVDC            ++GC GGLM  AF+YI    G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
           ++ EK YPY   D G C+F K  + A  + +  I +  E  +   +   GP++    +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPIS---VAID 253

Query: 285 LPHISFSF 292
             H SF  
Sbjct: 254 ASHSSFQL 261


>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
          Length = 324

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 97/282 (34%), Positives = 142/282 (50%), Gaps = 43/282 (15%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M+  IL+SLL++ +S+ L     + +D A                      HF  FK K 
Sbjct: 1   MKSFILASLLVVAVSATL-----LKEDGA----------------------HFQSFKLKH 33

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGL 116
            KTY  Q E   RF +F+ NLR+ +         +H    G+ KF+D+T +EF+   L  
Sbjct: 34  GKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAEFK-AMLAT 92

Query: 117 NRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
             + +    A K   L     +P   DWR    VT +KDQ  CGSCW+F+  G+ EGA+ 
Sbjct: 93  QVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWAFAVVGSTEGAYA 152

Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
           LSTG+L   SEQQLVDC        +   + GC+GG ++  F YI +  G+E E DYPYT
Sbjct: 153 LSTGKLTRFSEQQLVDC--------TTDLNYGCDGGYLDDTFPYI-QTNGLELESDYPYT 203

Query: 236 GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G D G C ++ SK+   VS++  + ++E  +   +   GP+A
Sbjct: 204 GYD-GYCSYESSKVVTKVSSYVSVPANEQALLEAVGTAGPVA 244


>gi|444510192|gb|ELV09527.1| Cathepsin F [Tupaia chinensis]
          Length = 597

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 100/238 (42%), Positives = 139/238 (58%), Gaps = 14/238 (5%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
           S+D  +     F  F + +++TY T+EE  +R  VF +N+ RA++ Q LD  TA +GVTK
Sbjct: 289 SQDFSVKMASIFKNFVTTYNRTYQTKEEAQWRLSVFASNMVRAQKIQALDHGTAQYGVTK 348

Query: 101 FSDLTPSEFRRQFLGLNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
           FSDLT  EFR  +L  N  LR +P           +  P ++DWR +GAVT VKDQG CG
Sbjct: 349 FSDLTEEEFRTIYL--NPLLREVPGKKMHLAKSIGDPAPPEWDWRKNGAVTKVKDQGMCG 406

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FS TG +EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  
Sbjct: 407 SCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCD---------KMDKACMGGLPSNAYSA 457

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           I   GG+E E DY Y G    +C F   K    +++   +S +E ++AA L K GP++
Sbjct: 458 IKNLGGLETEDDYSYQG-HMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPIS 514


>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
          Length = 337

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 97/255 (38%), Positives = 136/255 (53%), Gaps = 28/255 (10%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           ++H+  +K+   K Y  +EE  +R  V++ NL++ +   L      H    G+ +F D+T
Sbjct: 26  DNHWEQWKNWHGKKYHEKEE-GWRRMVWEKNLQKIELHNLEHSMGTHTYRLGMNRFGDMT 84

Query: 106 PSEFRRQFLGLN----RRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACG 159
             EFR+   G      RR R       +  +  N  ++P   DWR+ G VT VKDQG CG
Sbjct: 85  HEEFRQVMNGYKHKKERRFR------GSLFMEPNFLEVPNSLDWREKGYVTPVKDQGECG 138

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FS TGA+EG  F  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+Y
Sbjct: 139 SCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQY 191

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAG 278
           I    G++ E+ YPY GTD   C +D    AA  + F  + S  E  +   +   GP++ 
Sbjct: 192 IKDQNGLDSEESYPYVGTDDQPCHYDPKYSAANDTGFVDIPSGKEHALMKAIAAVGPVS- 250

Query: 279 NVASIELPHISFSFL 293
              +I+  H SF F 
Sbjct: 251 --VAIDAGHESFQFY 263


>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 145/285 (50%), Gaps = 41/285 (14%)

Query: 11  LLLLSSVLASAVAV-NDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           LLLL   LA  +    +DD+ IR                       +K   +K Y+   E
Sbjct: 7   LLLLGVTLAYIIERPTEDDSWIR-----------------------WKMAHNKAYSHDGE 43

Query: 70  HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA 129
              R+ ++K N RR +   L     +  + +F D+T +EF+      N  L     +   
Sbjct: 44  ETVRYTIWKDNERRIREHNLQGGDFLLEMNQFGDMTNNEFKD----FNGYLSHKHVSGST 99

Query: 130 PILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
            + P + + P   DWR+ G VT VKDQG CGSCW+FS TG+LEG +F  TG+LVSLSEQ 
Sbjct: 100 FLTPNSFVAPDSVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQN 159

Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK 248
           LVDC        +   ++GCNGGLM++AF YI +  G++ E  YPYT  D G C F K  
Sbjct: 160 LVDC-------STAYGNNGCNGGLMDNAFTYIKENNGIDSEASYPYTAKD-GKCAFTKPN 211

Query: 249 IAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           +AA  + F  I S DE+++   +   GP++    +I+  H SF F
Sbjct: 212 VAATDTGFVDIPSGDENKLKEAVASVGPIS---VAIDASHFSFQF 253


>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 100/248 (40%), Positives = 127/248 (51%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
            +  FK+   KTY +  E   RF++F  N L  AK         V    G+ +F DL   
Sbjct: 26  QWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF R F G +   R    +   P    ND  LP   DWR  GAVT VKDQG CGSCW+FS
Sbjct: 86  EFARIFNG-HHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATG+LEG HFL  GELVSLSEQ LVDC            ++GC GGLM  AF+YI    G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
           ++ EK YPY   D G C+F K  + A  + +  I +  E  +   +   GP++    +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPIS---VAID 253

Query: 285 LPHISFSF 292
             H SF  
Sbjct: 254 ASHSSFQL 261


>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 102/297 (34%), Positives = 150/297 (50%), Gaps = 50/297 (16%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           ++LL+L +V++ A A          V+P + E            + ++K +  K Y T+ 
Sbjct: 1   MMLLILGAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
           E   R  +F+ N  +     +     +H  T    KF D+   EF ++ +G   ++    
Sbjct: 40  EEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96

Query: 125 DAQKAPILPT----ND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
              K P+L +    ND    LP   DWR+   V+ VKDQG CGSCW+FS TG+LEG H  
Sbjct: 97  ---KKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSS 153

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            TG+LV LSEQQLVDC  +         + GC GGLM+ AF+YI   GG++ E+ YPYT 
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTA 206

Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           TD   CKFD S + A +  +  V S +E  +   +   GP++    +I+  H SF F
Sbjct: 207 TDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVS---VAIDAGHESFQF 260


>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
 gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
          Length = 425

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 98/265 (36%), Positives = 147/265 (55%), Gaps = 23/265 (8%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEE-HDYRFRVFKANLRRAKRRQLLDPTAVH-GVT 99
           S D  L+ E  ++ + +KF K  A+     D+RF  FK N R  +        +   G+ 
Sbjct: 4   SSDSDLSGE--YASWCAKFGKECASSNSLGDHRFETFKENFRYIEEHNRAGKHSYRLGLN 61

Query: 100 KFSDLTPSEFRRQFLGL------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVK 153
           +FSDLT  EFR++FLGL      +  L++P D+         DLP   DWR HGAVT  K
Sbjct: 62  QFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRQHGAVTAPK 121

Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
           DQG+CG CW+F+ TGA+EG + + TG+LVSLSEQ+L+DCD +         D GC+GGLM
Sbjct: 122 DQGSCGGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKK--------ADKGCDGGLM 173

Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVK 272
            +A+++I++ GG++ E DYPY  ++   C   K +    A+  +  I   ++Q     V 
Sbjct: 174 ENAYQFIVENGGLDTETDYPYHASE-SHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVA 232

Query: 273 HGPLAGNV--ASIELPHISFSFLFT 295
             P++  +  AS +  H + S +FT
Sbjct: 233 KQPVSVAIEGASKDFQHYA-SGVFT 256


>gi|330794859|ref|XP_003285494.1| hypothetical protein DICPUDRAFT_149375 [Dictyostelium purpureum]
 gi|325084585|gb|EGC38010.1| hypothetical protein DICPUDRAFT_149375 [Dictyostelium purpureum]
          Length = 421

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 92/249 (36%), Positives = 131/249 (52%), Gaps = 23/249 (9%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           L   + F+ +  +  + YA+ EE   R+ +FKAN+   +         + G+  F+D+T 
Sbjct: 24  LQYRNAFTNWMIQNQRHYAS-EEFATRYNIFKANMDYVQEWNSKGSETILGLNAFADITN 82

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDL----PTDFDWRDHGAVTGVKDQGACGSCW 162
            E+R  +LG       P DA       T  +        DWR  GAVT +K+Q  CG CW
Sbjct: 83  QEYRANYLGT------PFDASSIVGTETEKIFAAPAATVDWRTKGAVTPIKNQQQCGGCW 136

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYIL 221
           SFS TG+ EGAH +STG LVSLSEQ L+DC        SGS  + GCNGGLM  AFEYI+
Sbjct: 137 SFSTTGSTEGAHQISTGNLVSLSEQNLIDC--------SGSYGNDGCNGGLMTLAFEYII 188

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
              G++ E  YPYT   G  CKF  + I A +S+++ ++S  +    +     P++    
Sbjct: 189 NNKGIDTESSYPYTAETGTVCKFKTANIGATLSSYNNVTSGSESSLESAANVNPVS---V 245

Query: 282 SIELPHISF 290
           +I+  H SF
Sbjct: 246 AIDASHNSF 254


>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
 gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
           Precursor
 gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
 gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
 gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
          Length = 356

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 96/255 (37%), Positives = 139/255 (54%), Gaps = 19/255 (7%)

Query: 28  DAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRR 87
           D  I    P D E S D L+     F  + S F K Y T EE   RF VFK NL+     
Sbjct: 30  DYSIVGYSPEDLE-SHDKLIEL---FENWISNFEKAYETVEEKFLRFEVFKDNLKHIDET 85

Query: 88  QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL---PTDFDWR 144
                +   G+ +F+DL+  EF++ +LGL   +    + +        D+   P   DWR
Sbjct: 86  NKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWR 145

Query: 145 DHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC 204
             GAV  VK+QG+CGSCW+FS   A+EG + + TG L +LSEQ+L+DCD         + 
Sbjct: 146 KKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDT--------TY 197

Query: 205 DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF--DKSKIAAAVSNFSVISSD 262
           ++GCNGGLM+ AFEYI+K GG+ +E+DYPY+  + G+C+   D+S+      +  V ++D
Sbjct: 198 NNGCNGGLMDYAFEYIVKNGGLRKEEDYPYS-MEEGTCEMQKDESETVTINGHQDVPTND 256

Query: 263 EDQMAANLVKHGPLA 277
           E  +   L  H PL+
Sbjct: 257 EKSLLKALA-HQPLS 270


>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
          Length = 338

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 96/248 (38%), Positives = 132/248 (53%), Gaps = 20/248 (8%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H++L+KS  +K Y  +EE  +R  V++ NL++ +   L      H    G+  F D+T  
Sbjct: 29  HWNLWKSWHTKKYHEKEE-GWRRMVWEKNLKKIELHNLDHSMGKHTYRLGMNHFGDMTNE 87

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EFR+   G   +         +  L  N L  P   DWRD G VT VKDQG CGSCW+FS
Sbjct: 88  EFRQLMNGYKHKAERKVKG--SLFLEPNFLEAPRSLDWRDKGYVTPVKDQGQCGSCWAFS 145

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATGALEG  F  TG++V LSEQ LV+C     PE     + GCNGGLM+ AF+Y+    G
Sbjct: 146 ATGALEGQQFRKTGKMVQLSEQNLVECSR---PE----GNEGCNGGLMDQAFQYVKDNQG 198

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E+ YPY GTD   C +D    A   + F  + S  E  +   +   GP++    +I+
Sbjct: 199 LDSEESYPYLGTDDQKCHYDPRYNAVNDTGFVDIKSGSEHALMKAVTAVGPIS---VAID 255

Query: 285 LPHISFSF 292
             H SF F
Sbjct: 256 AGHESFQF 263


>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
 gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 92/241 (38%), Positives = 136/241 (56%), Gaps = 15/241 (6%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTP 106
           N    F ++ ++  K+Y++ EE  YR  VF  N         LD ++    +  ++DLT 
Sbjct: 24  NVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTH 83

Query: 107 SEFRRQFLGLNRRLR--LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
            EF+   LG +  LR   P   Q+ P LP  D+P   DWR  GAVT VKDQG+CG+CWSF
Sbjct: 84  HEFKVSRLGFSPALRNFRPVLPQE-PSLP-RDVPDSLDWRKKGAVTAVKDQGSCGACWSF 141

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           SATGA+EG + + TG L+SLSEQ+L+DCD         S +SGC GGLM+ A+++++   
Sbjct: 142 SATGAMEGINQIMTGSLISLSEQELIDCDR--------SYNSGCGGGLMDYAYQFVISNH 193

Query: 225 GVEREKDYPYTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           G++ E DYPY   D GSC+ DK  +    +  ++ I S+++      V   P++  +   
Sbjct: 194 GIDTENDYPYQARD-GSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGS 252

Query: 284 E 284
           E
Sbjct: 253 E 253


>gi|79314271|ref|NP_001030812.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
 gi|332644501|gb|AEE78022.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
          Length = 357

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 106/279 (37%), Positives = 152/279 (54%), Gaps = 20/279 (7%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
           +L LSS +LL+L +  AS     D+   I+ V  +  + E +   +L    H   FS F 
Sbjct: 4   KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63

Query: 58  SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
            ++ K Y + EE   RF VFK NL   +       +    + +F+DLT  EF+R  LG  
Sbjct: 64  HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123

Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
           +     A  + +  +    +P   DWR+ G V+ VK+QG CGSCW+FS TGALE A+  +
Sbjct: 124 QNC--SATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQA 181

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            G+ +SLSEQQLVDC        +G+ ++ GC+GGL + AFEYI   GG++ E+ YPYTG
Sbjct: 182 FGKGISLSEQQLVDC--------AGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTG 233

Query: 237 TDGGSCKFDKSKIAAAVS---NFSVISSDEDQMAANLVK 272
            DGG CKF    I   V    N ++ + DE + A  LV+
Sbjct: 234 KDGG-CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR 271


>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
          Length = 458

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 85/209 (40%), Positives = 119/209 (56%), Gaps = 16/209 (7%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+    A   ++ +K++  K Y    E + R+  F+ NLR            
Sbjct: 25  IVSYGERSEEE---ARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81

Query: 95  VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
           VH    G+ +F+DLT  E+R  +LGL  + R         +   N+ LP   DWR  GAV
Sbjct: 82  VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
             +KDQG CGSCW+FSA  A+EG + + TG+L+SLSEQ+LVDCD         S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GGLM+ AF++I+  GG++ E DYPY G D
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKD 222


>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
          Length = 344

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 93/256 (36%), Positives = 133/256 (51%), Gaps = 19/256 (7%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTK 100
           ++  A   F+ FKS++ K Y +     YR +V+K N +  +    R +  + T    +  
Sbjct: 15  YIAEAASEFTRFKSQYRKDYPSDSVERYRKKVYKQNEKFVREHNERYERGEVTYKMALNH 74

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKA-PILPTND--LPTDFDWRDHGAVTGVKDQGA 157
            +D+ P EF   FLG NR LR      +  P     D  +  + DWR  GA++ VKDQG 
Sbjct: 75  LADMHPREFMATFLGFNRSLRATNKVPEGIPFRHNKDAVIQKEVDWRQKGAISPVKDQGH 134

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FS+TGALE   FL  G  VSLSEQ L+DC            ++GC GGLM  AF
Sbjct: 135 CGSCWAFSSTGALEAHTFLKKGRRVSLSEQNLIDCSLNYG-------NNGCEGGLMEQAF 187

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
           +Y+    G++ E+ YPY G D   C+F K+ + A  + F  I S DE  +   +   GPL
Sbjct: 188 QYVRDNDGIDTEEAYPYEGED-SECRFKKNNVGATDAGFVTIPSGDEQALMEAVATQGPL 246

Query: 277 AGNVASIELPHISFSF 292
           +    +I+  + SF F
Sbjct: 247 S---IAIDASNPSFQF 259


>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
          Length = 470

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 85/209 (40%), Positives = 119/209 (56%), Gaps = 16/209 (7%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+    A   ++ +K++  K Y    E + R+  F+ NLR            
Sbjct: 25  IVSYGERSEEE---ARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81

Query: 95  VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
           VH    G+ +F+DLT  E+R  +LGL  + R         +   N+ LP   DWR  GAV
Sbjct: 82  VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
             +KDQG CGSCW+FSA  A+EG + + TG+L+SLSEQ+LVDCD         S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GGLM+ AF++I+  GG++ E DYPY G D
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKD 222


>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 102/297 (34%), Positives = 150/297 (50%), Gaps = 50/297 (16%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           ++LL+L +V++ A A          V+P + E            + ++K +  K Y T+ 
Sbjct: 1   MMLLILGAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
           E   R  +F+ N  +     +     +H  T    KF D+   EF ++ +G   ++    
Sbjct: 40  EEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96

Query: 125 DAQKAPILPT----ND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
              K P+L +    ND    LP   DWR+   V+ VKDQG CGSCW+FS TG+LEG H  
Sbjct: 97  ---KKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSN 153

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            TG+LV LSEQQLVDC  +         + GC GGLM+ AF+YI   GG++ E+ YPYT 
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTA 206

Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           TD   CKFD S + A +  +  V S +E  +   +   GP++    +I+  H SF F
Sbjct: 207 TDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVS---VAIDAGHESFQF 260


>gi|18407961|ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
 gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol protease aleurain-like; Flags: Precursor
 gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70 [Arabidopsis thaliana]
 gi|332644500|gb|AEE78021.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
          Length = 358

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 106/279 (37%), Positives = 152/279 (54%), Gaps = 20/279 (7%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
           +L LSS +LL+L +  AS     D+   I+ V  +  + E +   +L    H   FS F 
Sbjct: 4   KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63

Query: 58  SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
            ++ K Y + EE   RF VFK NL   +       +    + +F+DLT  EF+R  LG  
Sbjct: 64  HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123

Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
           +     A  + +  +    +P   DWR+ G V+ VK+QG CGSCW+FS TGALE A+  +
Sbjct: 124 QNC--SATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQA 181

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            G+ +SLSEQQLVDC        +G+ ++ GC+GGL + AFEYI   GG++ E+ YPYTG
Sbjct: 182 FGKGISLSEQQLVDC--------AGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTG 233

Query: 237 TDGGSCKFDKSKIAAAVS---NFSVISSDEDQMAANLVK 272
            DGG CKF    I   V    N ++ + DE + A  LV+
Sbjct: 234 KDGG-CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR 271


>gi|357619727|gb|EHJ72186.1| cathepsin [Danaus plexippus]
          Length = 336

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 95/277 (34%), Positives = 148/277 (53%), Gaps = 20/277 (7%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQV-VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
           +++ +L ++  +A A  +D + + +V  P      E  +L     F  F  +++K Y ++
Sbjct: 1   MIVFVLCAISFTAAAPQNDVSDVEKVRKPVFYSMDEAPIL-----FENFIREYNKKYDSK 55

Query: 68  EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
           E+ + RF++F  NL+R          AVHG+ KF+DL+  EF++ + G         D  
Sbjct: 56  EKEE-RFKIFVNNLKRINDLNHKSTNAVHGINKFTDLSKEEFKKFYTGFKPDKSFLDDNI 114

Query: 128 KAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
           K P   + ++  P  FDWRD G VT VK+QG CGSCW+FS  G +E  + +  G LV LS
Sbjct: 115 KKPSQLSFNITAPPAFDWRDKGVVTRVKNQGTCGSCWAFSTIGNVESVNAIKHGNLVELS 174

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQQLVDCD         S D  C+ GL ++A +Y++  G +  E+ YPY G    +C +D
Sbjct: 175 EQQLVDCD---------SKDEACDSGLPDNAQQYLVSHGAIS-EQSYPYKGY-AANCTYD 223

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVAS 282
            S++   +SNF  +   E QMA  L    PL+  +A+
Sbjct: 224 SSQVVVRLSNFEKVVLSECQMAEKLYSTAPLSIVIAA 260


>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
          Length = 353

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 97/247 (39%), Positives = 132/247 (53%), Gaps = 17/247 (6%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+ L+KS  SK Y  +EE  +R  V++ NL+  +   L      H    G+ +F D+T  
Sbjct: 43  HWQLWKSWHSKDYHEREE-SWRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAE 101

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           EFR+   G   +           + P+  + P   DWR+ G VT VKDQG CGSCW+FS 
Sbjct: 102 EFRQLMNGYKHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFST 161

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TGALEG HF  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+Y+   GG+
Sbjct: 162 TGALEGQHFRKTGKLVSLSEQNLVDCSR---PEG----NQGCNGGLMDQAFQYVQDNGGI 214

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIEL 285
           + E+ YPYT  D   C++     AA  + F  I    E  +   +   GP++    +I+ 
Sbjct: 215 DSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVS---VAIDA 271

Query: 286 PHISFSF 292
            H SF F
Sbjct: 272 GHSSFQF 278


>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
          Length = 443

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 104/274 (37%), Positives = 142/274 (51%), Gaps = 26/274 (9%)

Query: 33  QVVPSDGEQSEDHLL-------NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK 85
           QV+P   E S + L          + H+ L+KS   K Y  +EE  +R  V++ NL+  +
Sbjct: 107 QVIPVTKENSTETLHCRWQVDPELDGHWQLWKSWHRKDYHEREE-GWRRVVWEKNLKMIE 165

Query: 86  RRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PT 139
              L      H    G+ +F D+T  EFR+   G   + +     + +  L  N L  P 
Sbjct: 166 IHNLDHALGKHSYKLGMNQFGDMTTEEFRQLMNGYVHK-KSERKYRGSQFLEPNFLEAPR 224

Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
             DWR+ G VT VKDQG CGSCW+FS TGALEG HF  TG+LVSLSEQ LVDC     PE
Sbjct: 225 SVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSR---PE 281

Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
                + GCNGGLM+ AF+Y+   GG++ E+ YPYT  D   C++     AA  + F  I
Sbjct: 282 ----GNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDI 337

Query: 260 -SSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
               E  +   +   GP++    +I+  H SF F
Sbjct: 338 PQGHERALMKAVAAVGPVS---VAIDAGHSSFQF 368


>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
          Length = 339

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 98/254 (38%), Positives = 134/254 (52%), Gaps = 23/254 (9%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDL 104
            +  +  FK +  K Y +  E  +R ++F  N  + AK  +L +   V     + K++D+
Sbjct: 23  VQEQWGTFKLQHKKQYKSDTEEKFRMKIFMENSHKVAKXNKLYEMGLVSYKLKINKYADM 82

Query: 105 TPSEFRRQFLGLNRRLRLP-----ADAQKAP-ILPTN-DLPTDFDWRDHGAVTGVKDQGA 157
              EF     G NR    P      D Q A  I P N   P + DWR+HGAVT VKDQG 
Sbjct: 83  LHHEFVHTVNGFNRTKNTPLLGTSEDEQGATFIAPANVKFPENVDWREHGAVTXVKDQGH 142

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCWSFSATGALEG HF  T +LVSLSEQ LVDC        +   + GCNGGLM++AF
Sbjct: 143 CGSCWSFSATGALEGQHFRKTNKLVSLSEQNLVDC-------STKFGNDGCNGGLMDNAF 195

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
           +Y+    G++ E  YPY   D   C ++     A    F  I + DE+++ A +   GP+
Sbjct: 196 KYVKYNHGIDTEASYPYHADD-EKCHYNPKTSGATDRGFVDIPTGDEEKLMAAVATVGPV 254

Query: 277 AGNVASIELPHISF 290
           +    +I+  H SF
Sbjct: 255 S---VAIDASHESF 265


>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 308

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 87/219 (39%), Positives = 124/219 (56%), Gaps = 21/219 (9%)

Query: 62  KTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL 120
           K Y    E + R ++FK NL+   +   L + T   G+T+F+DLT  E  + F+  +R L
Sbjct: 11  KNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDE-PKDFMKADRYL 69

Query: 121 RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
               D           LP + DWR  GAV  VKDQG CGSCW+FSA GA+EG + + TGE
Sbjct: 70  YKEGDI----------LPDEIDWRAKGAVVPVKDQGNCGSCWAFSAVGAVEGINQIKTGE 119

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           L+SLS+Q+L+DCD        G  ++GC GG+MN AFE+I+  GG+E ++DYPYT TD G
Sbjct: 120 LISLSDQELIDCDR-------GFVNAGCEGGVMNYAFEFIINNGGIESDQDYPYTATDLG 172

Query: 241 SCKFDKSKIAAAVS--NFSVISSDEDQMAANLVKHGPLA 277
            C  DK      V    +  ++ ++++     V H P+ 
Sbjct: 173 VCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVG 211


>gi|154336052|ref|XP_001564262.1| cysteine peptidase A (CPA) [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134061296|emb|CAM38321.1| cysteine peptidase A (CPA) [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 479

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 97/236 (41%), Positives = 131/236 (55%), Gaps = 17/236 (7%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPS 107
           A  HF  FK +  K++  +    +RF  FK N++ A      +P A + V+ KF+ LTP 
Sbjct: 38  ASAHFMHFKKQHGKSFGEEAVEGHRFNAFKENMQTAVYLNAQNPHAHYDVSGKFAALTPQ 97

Query: 108 EFRRQFLGLNRRLR-LPADAQKAPILP-TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF +Q+L  +   R L A  ++A +        +  DWR+ GAVT VKDQG CGSCW+FS
Sbjct: 98  EFAKQYLNPDYYTRQLKAHKERAHVYEGVRGGLSAVDWREKGAVTEVKDQGLCGSCWAFS 157

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--A 223
           A G +EG   LS   LVSLSEQ LV CD         + D GCNGGLM+ A+ +I+K  +
Sbjct: 158 AIGNIEGQWALSGNTLVSLSEQMLVSCD---------TVDMGCNGGLMDQAWAWIIKNHS 208

Query: 224 GGVEREKDYPYTGTDG--GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G V  E  YPYT  DG   SC     K+ A +S    +  DED + A L K+GP++
Sbjct: 209 GAVYTEVSYPYTSGDGSTASC-LSTGKVGARISGQVSLPQDEDAIEAWLEKNGPIS 263


>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
           [Zea mays]
 gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
           mays]
          Length = 465

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 99/275 (36%), Positives = 147/275 (53%), Gaps = 29/275 (10%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+S++    A   ++ + +   +TY    E + R++VF+ NLR            
Sbjct: 29  IVSYGERSDEE---ARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAG 85

Query: 95  VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
           VH    G+ +F+DLT  E+R  +LG      R  +L A    A      DLP   DWR  
Sbjct: 86  VHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAAD---NEDLPESVDWRAK 142

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAV  VKDQG+ GSCW+FS   A+EG + + TG+L+SLSEQ+LVDCD         S + 
Sbjct: 143 GAVAEVKDQGSYGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQ 194

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLM+ AFE+I+  GG++ EKDYPY GTDG      K+     + ++  + +++++ 
Sbjct: 195 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 254

Query: 267 AANLVKHGPLAGNVASIELPHISF----SFLFTVS 297
               V + P++    +IE     F    S +FT S
Sbjct: 255 LQKAVANQPVS---VAIEAAGTQFQLYSSGIFTGS 286


>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
 gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
          Length = 336

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 97/248 (39%), Positives = 134/248 (54%), Gaps = 21/248 (8%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+  +K   +K Y  +EE  +R  V++ NL++ +   L      H     +  F D+   
Sbjct: 28  HWQQWKEWHNKDYHEKEE-GWRRMVWEKNLKKIELHNLEHSLGKHSYRLAMNHFGDMPHE 86

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EFR+   G   ++R     + +  +  N L  P+  DWR+ G VT VKDQG CGSCW+FS
Sbjct: 87  EFRQVMNGYKHKVR---KIRGSLFMEPNFLEAPSKLDWREKGYVTPVKDQGQCGSCWAFS 143

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGA+EG  F  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+YI   GG
Sbjct: 144 TTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PEG----NEGCNGGLMDQAFQYIKDNGG 196

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ EK YPY GTD   C +D S  AA  + F  + S  E  +   +   GP++    +I+
Sbjct: 197 LDTEKFYPYLGTDDQPCHYDPSYSAANDTGFVDIPSGKEHALMKAVTAVGPVS---VAID 253

Query: 285 LPHISFSF 292
             H SF F
Sbjct: 254 AGHESFQF 261


>gi|229366214|gb|ACQ58087.1| Cathepsin L precursor [Anoplopoma fimbria]
          Length = 334

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 99/258 (38%), Positives = 134/258 (51%), Gaps = 38/258 (14%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
           F  +K +F ++Y +  E   R  ++ +N R      ++    +     G+T F+D+   E
Sbjct: 26  FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85

Query: 109 FRRQF----LG-----LNRR----LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
           ++RQ     LG     L RR    LRLP  A         DLP   DWR+ G VT VKDQ
Sbjct: 86  YKRQISQGCLGSFNASLPRRGSAYLRLPEGA---------DLPNSVDWREKGYVTDVKDQ 136

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
             CGSCW+FS TG+LEG  F  TG+LVSLSEQQLVDC  +   E       GC GGLM+S
Sbjct: 137 KQCGSCWAFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNE-------GCMGGLMDS 189

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHG 274
           AF YI   GG++ E  YPY   D G C+++ + I A  + +  V   DED +   L   G
Sbjct: 190 AFRYIQANGGIDTEDSYPYEAED-GQCRYNSANIGATCTGYVDVKQGDEDALKEALATIG 248

Query: 275 PLAGNVASIELPHISFSF 292
           P++    +I+  H SF  
Sbjct: 249 PVS---VAIDASHSSFQL 263


>gi|2804266|dbj|BAA24444.1| cysteine proteinase [Sitophilus zeamais]
          Length = 331

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 107/280 (38%), Positives = 145/280 (51%), Gaps = 40/280 (14%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           LLL+L++V+ S  AV+  D +  Q                   +S FK + SK Y ++ E
Sbjct: 3   LLLILAAVVISCQAVSFYDLVQEQ-------------------WSSFKMQHSKNYDSETE 43

Query: 70  HDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNR---RLRL 122
             +R ++F  N  + AK  +L     V    G+ K++D+   EF     G N+    +  
Sbjct: 44  ERFRMKIFMENDHKVAKHSKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILK 103

Query: 123 PADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
            +D   A   I P N  LP   DWRD GAVT VKDQG CGSCWSFS +G+LEG HF  TG
Sbjct: 104 GSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHCGSCWSFSGSGSLEGQHFRKTG 163

Query: 180 ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           +LVSLSEQ LVDC        SG   ++GCNGGLM++AF YI   GG++ E+ YPY   D
Sbjct: 164 KLVSLSEQNLVDC--------SGRYGNTGCNGGLMDNAFRYIKDNGGIDTEQSYPYLAED 215

Query: 239 GGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
              C +      A    F  I   +ED + A +   GP++
Sbjct: 216 -EKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPVS 254


>gi|6967097|emb|CAB72480.1| cysteine protease-like protein [Arabidopsis thaliana]
          Length = 377

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 106/279 (37%), Positives = 152/279 (54%), Gaps = 20/279 (7%)

Query: 3   RLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH---FSLFK 57
           +L LSS +LL+L +  AS     D+   I+ V  +  + E +   +L    H   FS F 
Sbjct: 4   KLNLSSSILLILFAAAASKEIGFDESNPIKMVSDNLHELEDTVVQILGQSRHVLSFSRFT 63

Query: 58  SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
            ++ K Y + EE   RF VFK NL   +       +    + +F+DLT  EF+R  LG  
Sbjct: 64  HRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAA 123

Query: 118 RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
           +     A  + +  +    +P   DWR+ G V+ VK+QG CGSCW+FS TGALE A+  +
Sbjct: 124 QNC--SATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQA 181

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            G+ +SLSEQQLVDC        +G+ ++ GC+GGL + AFEYI   GG++ E+ YPYTG
Sbjct: 182 FGKGISLSEQQLVDC--------AGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTG 233

Query: 237 TDGGSCKFDKSKIAAAVS---NFSVISSDEDQMAANLVK 272
            DGG CKF    I   V    N ++ + DE + A  LV+
Sbjct: 234 KDGG-CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVR 271


>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
 gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
          Length = 343

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 86/233 (36%), Positives = 129/233 (55%), Gaps = 17/233 (7%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTKFSDLTPSEFRRQF 113
           + ++  + YA   E + R+ VFK N+   +R   +    T    V +F+DLT  EFR  +
Sbjct: 40  WMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMY 99

Query: 114 LGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
            G      L +  +        + ++ LP   DWR  GAVT +KDQG+CGSCW+FSA  A
Sbjct: 100 TGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAA 159

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG   +  G+L+SLSEQ+LVDCD         + D GC GG MNSAF Y +  GG+  E
Sbjct: 160 IEGVAQIKKGKLISLSEQELVDCD---------TNDDGCMGGYMNSAFNYTMTTGGLTSE 210

Query: 230 KDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
            +YPY  TD G+C  +K+K IA ++  F  + +++++     V H P++  +A
Sbjct: 211 SNYPYKSTD-GTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIA 262


>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
          Length = 350

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 97/290 (33%), Positives = 156/290 (53%), Gaps = 26/290 (8%)

Query: 8   SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
           +L+L+  S  L +++A   D +++     S+  +S D L+     F  + S+  K Y   
Sbjct: 8   ALVLIACSFCLFASLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYENI 62

Query: 68  EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLRLP 123
           EE   RF +FK NL+    R  +      G+++F+DL+  EF  ++LGL    +RR   P
Sbjct: 63  EEKLLRFEIFKDNLKHIDERNKVVSNYWLGLSEFADLSHREFNNKYLGLKVDYSRRRESP 122

Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
            +     +    +LP   DWR  GAV  VK+QG+CGSCW+FS   A+EG + + TG L S
Sbjct: 123 EEFTYKDV----ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 178

Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
           LSEQ+L+DCD         + ++GCNGGLM+ AF +I++ GG+ +E+DYPY   + G+C+
Sbjct: 179 LSEQELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGACE 229

Query: 244 FDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
             K +     +S +  +  + +Q     + + PL+    +IE     F F
Sbjct: 230 MTKEETQVVTISGYHDVPQNNEQSLLKALANQPLS---VAIEASGRDFQF 276


>gi|198432217|ref|XP_002130230.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
          Length = 327

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 96/245 (39%), Positives = 131/245 (53%), Gaps = 17/245 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
           ++ +K+   K+YA+ EE   R  +++ NLR   +        +H     +TKF+DL   E
Sbjct: 23  WNEWKNTHGKSYASHEELK-RQLIWEKNLRVVTQHNYEYDEGLHTYTMAMTKFADLENDE 81

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           F   +L   R+          P+    + PT  DWR  G VT VK+Q  CGSCW+FS TG
Sbjct: 82  FAAMYLPRMRKDSRNGFCSAQPVGGFVENPTSIDWRTRGYVTPVKNQLQCGSCWAFSTTG 141

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           +LEG HF  T  LVSLSEQQL+DC  +         D GC GG+M+ AF+YI  AGGVE 
Sbjct: 142 SLEGQHFAKTKNLVSLSEQQLMDCSFK-------EGDEGCGGGIMDYAFDYIFLAGGVES 194

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIELPH 287
           E DYPY   +   C+FD S IAA ++    V S  E Q+   +   GP++    +I+  H
Sbjct: 195 EADYPYEARN-DHCRFDNSSIAATLTGCVDVTSGSETQLEKAVGSIGPVS---VAIDASH 250

Query: 288 ISFSF 292
           ISF  
Sbjct: 251 ISFQL 255


>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
          Length = 588

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 92/249 (36%), Positives = 131/249 (52%), Gaps = 18/249 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSD 103
           N +  +  +K+   + Y T EE  +R  V++ N++  +          HG T     F D
Sbjct: 24  NLDTQWYQWKATHRRLYGTNEE-GWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGD 82

Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           +T  EFR+  +    +        + P+L   +LP   DWR  G VT VK+Q  CGSCW+
Sbjct: 83  MTNEEFRQVMVCFRNQKHKNRKVFRGPLL--LNLPKSVDWRKKGYVTPVKNQKQCGSCWA 140

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSATGALEG  F  TG+LVSLSEQ LVDC H          + GCNGG MN+AF+Y+ + 
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSHP-------QGNQGCNGGFMNNAFQYVKEN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG++ E  YPY   D GSCK+      A  + F VI + E ++   +   GP++    ++
Sbjct: 194 GGLDSEASYPYVAKD-GSCKYKPENSVANDTGFVVIPAHEKELMKAVATVGPIS---VAV 249

Query: 284 ELPHISFSF 292
           +  H SF F
Sbjct: 250 DASHSSFQF 258


>gi|285002340|ref|YP_003422404.1| cathepsin [Pseudaletia unipuncta granulovirus]
 gi|197343600|gb|ACH69415.1| cathepsin [Pseudaletia unipuncta granulovirus]
          Length = 338

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 89/241 (36%), Positives = 129/241 (53%), Gaps = 21/241 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           L N+E  F  F +K+ K YA   E   RF VFKANL     R   + +A  G+  +SDL+
Sbjct: 30  LSNSEVLFDEFVTKYGKVYANDAERKSRFDVFKANLAIINERNAQEESATFGINFYSDLS 89

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---------LPTDFDWRDHGAVTGVKDQG 156
            +E  R+  G   +  L  D +K     T           LP  F+WRD  AVT VK Q 
Sbjct: 90  SNELLRKQTGF--KTALHNDNEKKSKYCTRRVITGPSTRLLPEAFNWRDSDAVTSVKQQR 147

Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
            CGSCW+FSA   +E  +++   + V LSEQQ+VDCD           ++GCNGGLM+ A
Sbjct: 148 DCGSCWAFSAVANIESQYYIKNKQYVDLSEQQIVDCD---------PINNGCNGGLMSWA 198

Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
            EY++++GGV+ E+DY Y G + G CK + + +       S    +E+++   LV +GP+
Sbjct: 199 MEYVMRSGGVQLEEDYQYVGNE-GVCKNNSANVVQISGCVSYDLRNEERLRELLVSNGPI 257

Query: 277 A 277
           +
Sbjct: 258 S 258


>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 102/297 (34%), Positives = 150/297 (50%), Gaps = 50/297 (16%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           ++LL+L +V++ A A          V+P + E            + ++K +  K Y T+ 
Sbjct: 1   MMLLILVAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
           E   R  +F+ N  +     +     +H  T    KF D+   EF ++ +G   ++    
Sbjct: 40  EEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96

Query: 125 DAQKAPILPT----ND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
              K P+L +    ND    LP   DWR+   V+ VKDQG CGSCW+FS TG+LEG H  
Sbjct: 97  ---KKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSN 153

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            TG+LV LSEQQLVDC  +         + GC GGLM+ AF+YI   GG++ E+ YPYT 
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTA 206

Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           TD   CKFD S + A +  +  V S +E  +   +   GP++    +I+  H SF F
Sbjct: 207 TDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVS---VAIDAGHESFQF 260


>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
          Length = 469

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 127/228 (55%), Gaps = 11/228 (4%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           +  + +K  K+Y    E + RF++FK NLR        + T   G+ +F+DLT  E+R  
Sbjct: 53  YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSM 112

Query: 113 FLGLN---RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
           +LG     +R      + +      + LP   DWR  GAV  VKDQG+CGSCW+FS   A
Sbjct: 113 YLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAA 172

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG + + TG L+SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG++ E
Sbjct: 173 VEGINKIVTGGLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDSE 224

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +DYPY  +DG   ++ K+     +  +  +  ++++     V + P++
Sbjct: 225 EDYPYKASDGRCDQYRKNAXVVTIDGYEDVPENDEKSLEKAVANQPVS 272


>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
          Length = 328

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 94/253 (37%), Positives = 134/253 (52%), Gaps = 23/253 (9%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           +HH++L+K  + K Y  + E   R  +++ NL+      L     +H    G+    D+T
Sbjct: 22  DHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMT 81

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
             E     + L   LR+P+   +     +N    LP   DWR+ G VT VK QGACG+CW
Sbjct: 82  SEEV----ISLMSSLRVPSQWPRNVTYKSNSNQKLPDSVDWREKGCVTKVKYQGACGACW 137

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA GALE    L TG+LVSLS Q LVD    C  E+ G  + GCNGG M  AF+YI+ 
Sbjct: 138 AFSAVGALEAQLKLKTGKLVSLSAQNLVD----CSTEKYG--NKGCNGGFMTEAFQYIID 191

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVA 281
             G++ E  YPY  TD G C++D    AA  S ++ + S  ED +   +   GP++    
Sbjct: 192 NNGIDSEASYPYKATD-GKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVS---V 247

Query: 282 SIELPHISFSFLF 294
           +I+  H SF FL+
Sbjct: 248 AIDARHSSF-FLY 259


>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
          Length = 351

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 99/289 (34%), Positives = 152/289 (52%), Gaps = 19/289 (6%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
            S+  LL +S  + +  A   D +++      D   S D L +    F  + SK  K+Y 
Sbjct: 6   FSNFFLLFISMAVFAYSAFARDFSIVG--YSPDDLTSMDKLTDL---FESWMSKHGKSYR 60

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           + EE  +RF VF+ NL+          +   G+ +F+DL+  EF+R++LGL   L    D
Sbjct: 61  SFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGLKIELPKRRD 120

Query: 126 A-QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
           + ++       DLP   DWR  GAV  VK+QGACGSCW+FS   A+EG + + TG L +L
Sbjct: 121 SPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTAL 180

Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
           SEQ+L+DCD           ++GCNGGLM+ AF +I+  GG+ +E+DYPY   + G+C  
Sbjct: 181 SEQELIDCDK--------PFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYV-MEEGTCGE 231

Query: 245 DKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
            K ++    +S +  +  D +Q     + + PL+    +IE     F F
Sbjct: 232 KKEELEVVTISGYHDVPEDNEQSFLKALANQPLS---VAIEASSRGFQF 277


>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
 gi|194706676|gb|ACF87422.1| unknown [Zea mays]
 gi|413920745|gb|AFW60677.1| vignain [Zea mays]
          Length = 363

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 88/234 (37%), Positives = 129/234 (55%), Gaps = 18/234 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA-VHGVTKFSDLTPSEFR 110
            +  + +++ + Y    E  +RF+VFKAN     R         V G  +F+DLT  EF 
Sbjct: 58  RYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFA 117

Query: 111 RQFLGLNRRLRLPADAQKAPILPTN-------DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
             + GL +   +P+ A++ P   +        D     DWR  GAVT VK+QG CG CW+
Sbjct: 118 AMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWA 177

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSA GA+EG   ++TG LVSLSEQQ++DCD     E  G  + GCNGG M++AF+Y++  
Sbjct: 178 FSAVGAMEGLIMITTGNLVSLSEQQILDCD-----ESDG--NQGCNGGYMDNAFQYVINN 230

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GGV  E  YPY+    G+C+    + AA +S F  + S ++   AN V + P++
Sbjct: 231 GGVTTEDAYPYSAVQ-GTCQ--NVQPAATISGFQDLPSGDENALANAVANQPVS 281


>gi|195152617|ref|XP_002017233.1| GL22196 [Drosophila persimilis]
 gi|194112290|gb|EDW34333.1| GL22196 [Drosophila persimilis]
          Length = 627

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 85/236 (36%), Positives = 132/236 (55%), Gaps = 15/236 (6%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDL 104
           L   +H F  F+ +F + Y    E   R R+F+ NL+  +     +  +A +G+T+F+D+
Sbjct: 314 LDKVDHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADM 373

Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           T +E++ +  GL +R           ++P    + P +FDWR   AVT VK+QG+CGSCW
Sbjct: 374 TSTEYKER-TGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSCW 432

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FS TG +EG + + TGEL   SEQ+L+DCD         + DS CNGGLM++A++ I  
Sbjct: 433 AFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 483

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
            GG+E E +YPY       C F+++     VS F  +   +E  M   L+ HGP++
Sbjct: 484 IGGLEYEAEYPYEAKK-QQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPIS 538


>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
          Length = 332

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 86/233 (36%), Positives = 129/233 (55%), Gaps = 17/233 (7%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTKFSDLTPSEFRRQF 113
           + ++  + YA   E + R+ VFK N+   +R   +    T    V +F+DLT  EFR  +
Sbjct: 34  WMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMY 93

Query: 114 LGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
            G      L +  +        + ++ LP   DWR  GAVT +KDQG+CGSCW+FSA  A
Sbjct: 94  TGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAA 153

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG   +  G+L+SLSEQ+LVDCD         + D GC GG MNSAF Y +  GG+  E
Sbjct: 154 IEGVAQIKKGKLISLSEQELVDCD---------TNDDGCMGGYMNSAFNYTMTTGGLTSE 204

Query: 230 KDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
            +YPY  TD G+C  +K+K IA ++  F  + +++++     V H P++  +A
Sbjct: 205 SNYPYKSTD-GTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIA 256


>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
          Length = 442

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 89/236 (37%), Positives = 133/236 (56%), Gaps = 20/236 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK+  ++ YA+ +E   RF +F AN+++A      +P A  G  +F+D++  EF+ +
Sbjct: 25  FRDFKTTHARNYASADEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEFQTR 84

Query: 113 FLGLNRRLRLPADAQKAPILPTND-----LPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
                    + A   K     T +     +    DWR  GAVT VK+QG+CGSCWSFS T
Sbjct: 85  HNAARHYAAVMARPPKNTKTFTEEEINAAVGQKVDWRLKGAVTPVKNQGSCGSCWSFSTT 144

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GG 225
           G +EG H ++TG+LVSLSEQ+LV CD         + D GC+GGLM++AF ++L A  G 
Sbjct: 145 GNIEGQHAIATGQLVSLSEQELVSCD---------TVDDGCSGGLMDNAFGWLLSAHNGQ 195

Query: 226 VEREKDYPYTGTDG--GSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +  E  YPY   +G   +C F+ +   + A +++F  I   E  MAA + K+GPL+
Sbjct: 196 ITTEASYPYVSGNGIVPACTFNSNSNPVGATITSFHDIPKTERDMAAFVFKYGPLS 251


>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 97/290 (33%), Positives = 155/290 (53%), Gaps = 26/290 (8%)

Query: 8   SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
           +L+L+  S  L +++A   D +++     S+  +S D L+     F  + S+  K Y   
Sbjct: 8   ALVLIACSFCLFASLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYENI 62

Query: 68  EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLRLP 123
           EE   RF +FK NL+    R  +      G+ +F+DL+  EF  ++LGL    +RR   P
Sbjct: 63  EEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHREFNNKYLGLKVDYSRRRESP 122

Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
            +     +    +LP   DWR  GAV  VK+QG+CGSCW+FS   A+EG + + TG L S
Sbjct: 123 EEFTYKDV----ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 178

Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
           LSEQ+L+DCD         + ++GCNGGLM+ AF +I++ GG+ +E+DYPY   + G+C+
Sbjct: 179 LSEQELIDCDR--------TYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGTCE 229

Query: 244 FDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
             K +     +S +  +  + +Q     + + PL+    +IE     F F
Sbjct: 230 MTKEETQVVTISGYHDVPQNNEQSLLKALANQPLS---VAIEASGRDFQF 276


>gi|332030000|gb|EGI69825.1| Cathepsin L [Acromyrmex echinatior]
          Length = 328

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 94/256 (36%), Positives = 138/256 (53%), Gaps = 20/256 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVT 99
           + +L+AE  + +FK+   K Y +  E  YR ++F  N R+     ++ +L +     G+ 
Sbjct: 22  NKILDAE--WFIFKTHHKKIYKSSVEEGYRMKIFLDNKRKIAEHNRKYELNEVPYKLGMN 79

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL-PTN-DLPTDFDWRDHGAVTGVKDQGA 157
           K+ D+   EF     G N+  +       A  + P N +LP + DWR HGAVT VKDQG 
Sbjct: 80  KYGDMLHHEFVNTLNGFNKSEKAQKQFMGATFISPANVELPKEVDWRKHGAVTEVKDQGH 139

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FS TG+LEG HF  TG LVSLSEQ L+DC      E       GCNGGLM++AF
Sbjct: 140 CGSCWAFSTTGSLEGQHFRQTGILVSLSEQNLIDCSGNYGNE-------GCNGGLMDNAF 192

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
           +Y+    G++ EK YPY   +   C+++     A  + F  I   +E ++ A +   GP+
Sbjct: 193 KYVRDNKGLDTEKSYPYE-AENDKCRYNPRNSGAIDTGFVDIPRGNEHKLKAAVATIGPV 251

Query: 277 AGNVASIELPHISFSF 292
           +    +I+  H SF  
Sbjct: 252 S---VAIDASHESFQL 264


>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
 gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
          Length = 452

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 86/244 (35%), Positives = 134/244 (54%), Gaps = 16/244 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           + L+ ++  + Y   +E   RF VFK N          + +   G+ +F+DL+  EF+  
Sbjct: 42  YELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHNQGNRSYKLGLNQFADLSHEEFKAT 101

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +LG      +RL  P  +++       DLP   DWR+ GAVT VKDQG+CGSCW+FS   
Sbjct: 102 YLGAKLDTKKRLSRPP-SRRYQYSDGEDLPESIDWREKGAVTSVKDQGSCGSCWAFSTVA 160

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           A+EG + + TG+L+SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG++ 
Sbjct: 161 AVEGINQIVTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAFEFIINNGGLDS 212

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
           E+DYPYT  DG    + K+     + ++  +  ++++       + P++    +IE    
Sbjct: 213 EEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPIS---VAIEASGR 269

Query: 289 SFSF 292
            F F
Sbjct: 270 EFQF 273


>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
 gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
          Length = 338

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 97/251 (38%), Positives = 133/251 (52%), Gaps = 20/251 (7%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           E H+ L+K+  SK+Y   EE  +R  V++ NL++ +   L      H    G+  F D+T
Sbjct: 27  EDHWHLWKNWHSKSYHESEE-GWRRMVWEKNLKKIEMHNLEHTMGKHSYRLGMNHFGDMT 85

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
             EFR+   G  +        + +  +  N L  P   DWR+ G VT VKDQG+CGSCW+
Sbjct: 86  NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TGA+EG  F  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+YI   
Sbjct: 144 FSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
            G++ E+ YPY GTD   C +      A  + F  + S  E  M   +   GP++    +
Sbjct: 197 AGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPSGKEHAMMKAVAAVGPVS---VA 253

Query: 283 IELPHISFSFL 293
           I+  H SF F 
Sbjct: 254 IDAGHESFQFY 264


>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
 gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
          Length = 344

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 99/256 (38%), Positives = 131/256 (51%), Gaps = 27/256 (10%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           ++ FK +  K Y ++ E   R +++  N  + AK  Q  D         V K++DL   E
Sbjct: 28  WTAFKLQHRKKYDSETEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLHEE 87

Query: 109 FRRQFLGLNRRLR----------LPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGA 157
           F     G NR +            P +     I P N D+PT  DWR  GAVT VKDQG 
Sbjct: 88  FVHTLNGFNRSVSGKGQLLRGELKPIEEPVTWIEPANVDVPTAMDWRTKGAVTQVKDQGH 147

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCWSFSATGALEG HF  TG+LVSLSEQ LVDC  +         ++GCNGG+M+ AF
Sbjct: 148 CGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSQKYG-------NNGCNGGMMDFAF 200

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
           +YI    G++ EK YPY   D   C ++   + A    F  I   +E  +   L   GP+
Sbjct: 201 QYIKDNKGIDTEKSYPYEAID-DECHYNPKAVGATDKGFVDIPQGNEKALMKALATVGPV 259

Query: 277 AGNVASIELPHISFSF 292
           +    +I+  H SF F
Sbjct: 260 S---VAIDASHESFQF 272


>gi|343474209|emb|CCD14094.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
          Length = 307

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 88/233 (37%), Positives = 124/233 (53%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQG C S W+F+A G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVNVSTGKAPKTVDWRKKGAVTPVKDQGKCDSSWAFAAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD         + D GC  G +++AF++I+ +  G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TNDLGCRAGFLDTAFKWIVSSNNGNV 208

Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY    G   +C      + A + +   I  +E+ +A  L K GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCNKSGKVVGANIDDHVHILDNENAIAEWLAKKGPVA 261


>gi|198453932|ref|XP_002137768.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
 gi|198132577|gb|EDY68326.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
          Length = 629

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 85/236 (36%), Positives = 132/236 (55%), Gaps = 15/236 (6%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDL 104
           L   +H F  F+ +F + Y    E   R R+F+ NL+  +     +  +A +G+T+F+D+
Sbjct: 316 LDKVDHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADM 375

Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           T +E++ +  GL +R           ++P    + P +FDWR   AVT VK+QG+CGSCW
Sbjct: 376 TSTEYKER-TGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSCW 434

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FS TG +EG + + TGEL   SEQ+L+DCD         + DS CNGGLM++A++ I  
Sbjct: 435 AFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 485

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
            GG+E E +YPY       C F+++     VS F  +   +E  M   L+ HGP++
Sbjct: 486 IGGLEYEAEYPYEAKK-QQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPIS 540


>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
 gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
          Length = 335

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 91/246 (36%), Positives = 130/246 (52%), Gaps = 18/246 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+ L+K+   K+Y  +EE  +R  +++ NLR  +   L      H    G+ +F D+T  
Sbjct: 28  HWHLWKNWHKKSYLPKEE-GWRRVLWEKNLRTIEFHNLDHSLGKHSYRLGMNQFGDMTNE 86

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           EFR+   G   +  +      AP     + P   DWR+ G VT VKDQG CGSCW+FS T
Sbjct: 87  EFRQLMNGYKNQKMIKGSTFLAP--NNFEAPKTVDWREKGYVTPVKDQGQCGSCWAFSTT 144

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           GALEG H+   G+L+SLSEQ LVDC            + GCNGGLM+ AF+Y+   GG++
Sbjct: 145 GALEGQHYRKAGKLISLSEQNLVDC-------SRAQGNQGCNGGLMDQAFQYVKDNGGID 197

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELP 286
            E  YPYT  D   C +D +  +A  + F  V S  E  +   +   GP++    +++  
Sbjct: 198 SEDSYPYTAKDDQECHYDPNYNSANDTGFVDVPSGSEKDLMKAVASVGPVS---VAVDAG 254

Query: 287 HISFSF 292
           H SF F
Sbjct: 255 HKSFQF 260


>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
          Length = 340

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 94/253 (37%), Positives = 134/253 (52%), Gaps = 23/253 (9%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           +HH++L+K  + K Y  + E   R  +++ NL+      L     +H    G+    D+T
Sbjct: 34  DHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMT 93

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
             E     + L   LR+P+   +     +N    LP   DWR+ G VT VK QGACG+CW
Sbjct: 94  SEEV----ISLMSSLRVPSQWPRNVTYKSNSNQKLPDSVDWREKGCVTKVKYQGACGACW 149

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA GALE    L TG+LVSLS Q LVD    C  E+ G  + GCNGG M  AF+YI+ 
Sbjct: 150 AFSAVGALEAQLKLKTGKLVSLSAQNLVD----CSTEKYG--NKGCNGGFMTEAFQYIID 203

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVA 281
             G++ E  YPY  TD G C++D    AA  S ++ + S  ED +   +   GP++    
Sbjct: 204 NNGIDSEASYPYKATD-GKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVS---V 259

Query: 282 SIELPHISFSFLF 294
           +I+  H SF FL+
Sbjct: 260 AIDARHSSF-FLY 271


>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
          Length = 314

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 91/228 (39%), Positives = 125/228 (54%), Gaps = 17/228 (7%)

Query: 56  FKSKFSKTYATQEEHDYRFRVF-----KANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
           +K+K+ KTY + E    R  ++     K     A+  Q L    + G+  F+D+   EFR
Sbjct: 30  YKAKYGKTYESNENEAARRTIYFMAKEKVMEHNARFEQGLVSYKL-GLNSFADMHNGEFR 88

Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
           +   G  R    P ++    +     LP   DWR  GAVT +K+QG CGSCW+FS TG+L
Sbjct: 89  KMMNGYRRGT--PRNSVVVHVESNITLPASVDWRTKGAVTPIKNQGQCGSCWAFSTTGSL 146

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG H L  G+LVSLSEQ+LVDC        +   + GC+GGLM+ AF YI K  G++ E+
Sbjct: 147 EGQHALKKGKLVSLSEQELVDC-------SAAEGNDGCDGGLMDDAFTYIKKNNGIDTEQ 199

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
            YPYTG D G+C F KS +AA V+ F  V S  E  +       GP++
Sbjct: 200 SYPYTGED-GTCSFKKSDVAATVTGFVDVTSGSESGLQDASATIGPIS 246


>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 467

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 127/228 (55%), Gaps = 11/228 (4%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           +  + +K  K+Y    E + RF++FK NLR        + T   G+ +F+DLT  E+R  
Sbjct: 51  YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSM 110

Query: 113 FLGLN---RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
           +LG     +R      + +      + LP   DWR  GAV  VKDQG+CGSCW+FS   A
Sbjct: 111 YLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAA 170

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG + + TG L+SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG++ E
Sbjct: 171 VEGINKIVTGGLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDSE 222

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +DYPY  +DG   ++ K+     +  +  +  ++++     V + P++
Sbjct: 223 EDYPYKASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVS 270


>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
           cycling base population CrGC5, Peptide, 328 aa]
          Length = 328

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 85/217 (39%), Positives = 119/217 (54%), Gaps = 16/217 (7%)

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRL 122
           + D RF +FK NLR        +  A +  G+T F++LT  E+R  +LG      RR+  
Sbjct: 24  QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITK 83

Query: 123 PADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
             +         ND+  P   DWR  GAV  +KDQG CGSCW+FS   A+EG + + TGE
Sbjct: 84  AKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGE 143

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           LVSLSEQ+LVDCD         S + GCNGGLM+ AF++I+K GG+  EKDYPY GT+G 
Sbjct: 144 LVSLSEQELVDCDK--------SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGK 195

Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
                K+     +  +  + S ++      V + P++
Sbjct: 196 CNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVS 232


>gi|119964630|ref|YP_950826.1| cathepsin [Maruca vitrata MNPV]
 gi|119514473|gb|ABL76048.1| cathepsin [Maruca vitrata MNPV]
          Length = 324

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 87/237 (36%), Positives = 133/237 (56%), Gaps = 20/237 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           LL A ++F  F  +F+K Y ++ E   RF++F+ NL     +   D  A + + KFSDL+
Sbjct: 21  LLKAPNYFEEFVLQFNKNYGSEIEKLRRFKIFQHNLNEIINKNQNDSAAKYEINKFSDLS 80

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL+    LP   Q   K  +L  P    P +FDWR    VT VK+QG CG+
Sbjct: 81  KDETIAKYTGLS----LPIQTQNFCKVIVLDQPPGKGPFEFDWRRLNKVTNVKNQGVCGA 136

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+A  +LE    +   +L+ LSEQQ++DCD         S D+GCNGGL+++AFE +
Sbjct: 137 CWAFAALASLESQFAMKHNQLIDLSEQQMIDCD---------SVDAGCNGGLLHTAFEAV 187

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
           +K GGV+ EKDYPY   +  +C+ + +K    V + +  I   E+++   L   GP+
Sbjct: 188 IKMGGVQLEKDYPYEAAN-NNCRMNSNKFLVKVKDCYRYIIVYEEKLKDLLRSVGPI 243


>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
          Length = 466

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 95/278 (34%), Positives = 155/278 (55%), Gaps = 26/278 (9%)

Query: 8   SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKS---KFSKTY 64
           SLLL+L+ S L+SA     D ++I        +++  H  + +   +L++S   +  K+Y
Sbjct: 11  SLLLMLIFSTLSSA----SDMSIISY------DETHIHHRSDDEVSALYESWLIEHGKSY 60

Query: 65  ATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---RL 120
               E D RF++FK NL+   ++  + + +   G+TKF+DLT  E+R  +LG      R 
Sbjct: 61  NALGEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRR 120

Query: 121 RLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
           +L  +     +    D LP   DWRD G + GVKDQG+CGSCW+FSA  A+E  + + TG
Sbjct: 121 KLSKNKSDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTG 180

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
            L+SLSEQ+LVDCD         S + GC+GGLM+ AFE+++  GG++ E+DYPY   + 
Sbjct: 181 NLISLSEQELVDCDK--------SYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERND 232

Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
              ++ K+     + ++  +  + ++     V H P++
Sbjct: 233 VCDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVS 270


>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
 gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
          Length = 331

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 93/253 (36%), Positives = 136/253 (53%), Gaps = 23/253 (9%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           +HH++L+K  +SK Y  + E   R  +++ NL+      L     +H    G+    D+T
Sbjct: 25  DHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMT 84

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
             E     + L   LR+P+  Q+     +N    LP   DWR+ G VT VK QG+CG+CW
Sbjct: 85  GEEV----ISLMGSLRVPSQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACW 140

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA GALE    L TG+LVSLS Q LVD    C  E+ G  + GCNGG M +AF+YI+ 
Sbjct: 141 AFSAVGALEAQLKLKTGKLVSLSAQNLVD----CSTEKYG--NKGCNGGFMTTAFQYIID 194

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVA 281
             G++ E  YPY   + G C++D  K AA  S ++ +    ED +   +   GP++    
Sbjct: 195 NNGIDSEASYPYKAMN-GKCRYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVS---V 250

Query: 282 SIELPHISFSFLF 294
           +I+  H SF FL+
Sbjct: 251 AIDASHYSF-FLY 262


>gi|12597541|ref|NP_075125.1| cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
 gi|15426394|ref|NP_203611.1| cathepsin [Helicoverpa armigera NPV]
 gi|12483807|gb|AAG53799.1|AF271059_56 cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
 gi|15384470|gb|AAK96381.1|AF303045_123 cathepsin [Helicoverpa armigera NPV]
 gi|18027090|gb|AAL55725.1|AF268612_1 cathepsin [Helicoverpa armigera NPV]
          Length = 365

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 95/254 (37%), Positives = 137/254 (53%), Gaps = 28/254 (11%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR--AKRRQL----------LDPTAVH 96
           +E +F  F  +++K+Y   +E+ YR+ VFK NL +  ++ R+           L  +A  
Sbjct: 51  SEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQF 110

Query: 97  GVTKFSDLTPSEFRRQ----FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
           GV KFSD TP E        FL L++   L  + +     P   LP  +DWRD   VT +
Sbjct: 111 GVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAPNIRLPDYYDWRDTNKVTPI 169

Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
           KDQG CGSCW+F A G +E  + +   +L+ LSEQQL+DCD           D GCNGGL
Sbjct: 170 KDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGCNGGL 220

Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLV 271
           M+ AF+ +L  GGVE E DYPY G++   C  D  KIA  + S F     DE+++   + 
Sbjct: 221 MHLAFQELLLMGGVETEADYPYQGSE-QMCTLDNRKIAVKLNSCFKYDIRDENKLKELVY 279

Query: 272 KHGPLAGNVASIEL 285
             GP+A  V ++++
Sbjct: 280 TTGPVAIAVDAMDI 293


>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
          Length = 339

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 93/253 (36%), Positives = 136/253 (53%), Gaps = 23/253 (9%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           +HH++L+K  +SK Y  + E   R  +++ NL+      L     +H    G+    D+T
Sbjct: 33  DHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMT 92

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
             E     + L   LR+P+  Q+     +N    LP   DWR+ G VT VK QG+CG+CW
Sbjct: 93  GEEV----ISLMGSLRVPSQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACW 148

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA GALE    L TG+LVSLS Q LVD    C  E+ G  + GCNGG M +AF+YI+ 
Sbjct: 149 AFSAVGALEAQLKLKTGKLVSLSAQNLVD----CSTEKYG--NKGCNGGFMTTAFQYIID 202

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVA 281
             G++ E  YPY   + G C++D  K AA  S ++ +    ED +   +   GP++    
Sbjct: 203 NNGIDSEASYPYKAMN-GKCRYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVS---V 258

Query: 282 SIELPHISFSFLF 294
           +I+  H SF FL+
Sbjct: 259 AIDASHYSF-FLY 270


>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
 gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
          Length = 300

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 128/229 (55%), Gaps = 19/229 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
           F  + +K  K+Y++  E   R  VF   L   ++     + T   G+ KFSDLT +EFR 
Sbjct: 2   FEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61

Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
            ++G   + + P    + P     +  + LPT  DWR  GAVT +KDQG CGSCW+FSA 
Sbjct: 62  NYVG---KFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
            ++E AHFL+T ELVSLSEQQL+DCD         + D GC GG  + AF+++++ GGV 
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPDDAFKFVVENGGVT 169

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
            E+ YPYTG   GSC  +K+K+   ++ +  ++ D        V   P+
Sbjct: 170 TEEAYPYTGF-AGSCNTNKNKV-VEITGYKDVTKDSADALMKAVSKTPV 216


>gi|15593255|gb|AAL02223.1|AF410883_1 cysteine protease CP19 precursor [Frankliniella occidentalis]
          Length = 334

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 97/261 (37%), Positives = 133/261 (50%), Gaps = 23/261 (8%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPT 93
           +PSD        +  + H+  FK+  +KTYA   E  YR +VFK N +R AK   L    
Sbjct: 18  IPSD--------MEIQAHWESFKATHAKTYANAVEEAYRAKVFKENAIRIAKHNDLFASG 69

Query: 94  AVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
            V    G  +++D+   E   +  G    L+  +         +       DWR  GA T
Sbjct: 70  EVTFKVGYNQYADMHTHEVTEKLNGYRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAAT 129

Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
            +KDQG CGSCWSFSATG+LEG  FL    LVSLSEQ LVDC  +   E       GCNG
Sbjct: 130 PIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNG 182

Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAAN 269
           GLM+SAFEY+   GG++ E+ YPYT  DG SC +  +  A   + +  V +  E  +   
Sbjct: 183 GLMDSAFEYVKSNGGIDTEESYPYTAVDGDSCLYRAANNAGVNTGYKDVQAKSESALRDA 242

Query: 270 LVKHGPLAGNVASIELPHISF 290
           + K GP++    +I+  + SF
Sbjct: 243 VEKVGPVS---VAIDASNWSF 260


>gi|344310882|gb|AEN03980.1| cathepsin-like cysteine proteinase [Helicoverpa armigera NPV strain
           Australia]
          Length = 367

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 95/254 (37%), Positives = 137/254 (53%), Gaps = 28/254 (11%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR--AKRRQL----------LDPTAVH 96
           +E +F  F  +++K+Y   +E+ YR+ VFK NL +  ++ R+           L  +A  
Sbjct: 53  SEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQF 112

Query: 97  GVTKFSDLTPSEFRRQ----FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
           GV KFSD TP E        FL L++   L  + +     P   LP  +DWRD   VT +
Sbjct: 113 GVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAPNIRLPDYYDWRDTNKVTPI 171

Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
           KDQG CGSCW+F A G +E  + +   +L+ LSEQQL+DCD           D GCNGGL
Sbjct: 172 KDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGCNGGL 222

Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLV 271
           M+ AF+ +L  GGVE E DYPY G++   C  D  KIA  + S F     DE+++   + 
Sbjct: 223 MHLAFQELLLMGGVETEADYPYQGSE-QMCTLDNRKIAVKLNSCFKYDIRDENKLKELVY 281

Query: 272 KHGPLAGNVASIEL 285
             GP+A  V ++++
Sbjct: 282 TTGPVAIAVDAMDI 295


>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
 gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
          Length = 299

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 87/229 (37%), Positives = 127/229 (55%), Gaps = 19/229 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRR 111
           F  + +K  K+Y++  E   R  +F   L   ++     + T   G+ KFSDLT +EFR 
Sbjct: 2   FEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61

Query: 112 QFLGLNRRLRLPADAQKAPI----LPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
            ++G   + + P    + P     +  + LPT  DWR  GAVT +KDQG CGSCW+FSA 
Sbjct: 62  NYVG---KFKSPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAI 118

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
            ++E AHFL+T ELVSLSEQQL+DCD         + D GC GG    AF+++++ GGV 
Sbjct: 119 ASIESAHFLATKELVSLSEQQLIDCD---------TVDQGCQGGFPEDAFKFVVENGGVT 169

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
            E+ YPYTG   GSC  +K+K+   ++ +  ++ D        V   P+
Sbjct: 170 TEEAYPYTGF-AGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPV 216


>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
          Length = 422

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 128/224 (57%), Gaps = 13/224 (5%)

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL-- 116
           K  K Y    E D RF +FK NLR        + T   G+ +F+DLT  E+R ++LG   
Sbjct: 10  KHGKAYNALGEKDKRFDIFKDNLRFIDDHNADNRTYKLGLNRFADLTNEEYRARYLGTRI 69

Query: 117 --NRR-LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
             NRR ++    + +      ++LP   DWR+  AV  VKDQG CGSCW+FS  GA+EG 
Sbjct: 70  DPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWAFSTIGAVEGI 129

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
           + + TG+L+SLSEQ+LVDCD         S + GCNGGLM+ A+E+I+  GG++ E+DYP
Sbjct: 130 NKIVTGDLISLSEQELVDCDT--------SYNQGCNGGLMDYAYEFIINNGGIDSEEDYP 181

Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           Y   DG   ++ K+     + ++  + ++++      V + P++
Sbjct: 182 YRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVS 225


>gi|281200606|gb|EFA74824.1| cysteine proteinase 5 precursor [Polysphondylium pallidum PN500]
          Length = 307

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 130/229 (56%), Gaps = 16/229 (6%)

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN---RRLRL 122
           T +E   RF +FK N+    +      + V G+   +D++  E++R +LG +    + R 
Sbjct: 9   TAQEFGTRFNIFKKNMDFVHKWNAKGSSTVLGLNSMADISNEEYQRVYLGTHIDASQFRQ 68

Query: 123 PADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
            A + K  +  T  +   + DWR  GAVT +K+QG CGSCWSFS TG+ EGAHF+ TG L
Sbjct: 69  QAASHK--LGRTFKVQAANVDWRAKGAVTPIKNQGQCGSCWSFSTTGSTEGAHFIKTGNL 126

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQ L+DC     PE     + GCNGGLM +AFEYI+K  G++ E  YPY   DG  
Sbjct: 127 VSLSEQNLMDCS---KPEG----NQGCNGGLMTAAFEYIIKNNGIDTESSYPYKAEDGKK 179

Query: 242 CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
           C ++ +  AA +S++  +++  +   A     GP++    +I+  H SF
Sbjct: 180 CLYNPANSAATLSSYVNVTTGSESDLAVKSGLGPVS---VAIDASHNSF 225


>gi|66815893|ref|XP_641963.1| cysteine protease 4 [Dictyostelium discoideum AX4]
 gi|166201984|sp|P54639.2|CYSP4_DICDI RecName: Full=Cysteine proteinase 4; Flags: Precursor
 gi|60469981|gb|EAL67962.1| cysteine protease 4 [Dictyostelium discoideum AX4]
          Length = 442

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 85/234 (36%), Positives = 128/234 (54%), Gaps = 13/234 (5%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           L   + F+ +     +TY++ EE + R+++FK+N+    +        V G+  F+D+T 
Sbjct: 24  LQYRNAFTNWMQAHQRTYSS-EEFNARYQIFKSNMDYVHQWNSKGGETVLGLNVFADITN 82

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            E+R  +LG           ++  I  T   PT  DWR  GAVT +K+QG CG CWSFS 
Sbjct: 83  QEYRTTYLGTPFDGSALIGTEEEKIFST-PAPT-VDWRAQGAVTPIKNQGQCGGCWSFST 140

Query: 167 TGALEGAHFLSTG---ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           TG+ EGAHF+++G   +LVSLSEQ L+DC            ++GC GGLM  AFEYI+  
Sbjct: 141 TGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYG-------NNGCEGGLMTLAFEYIINN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G++ E  YPYT  DG  CKF  S I A + ++  ++S  +    +   + P++
Sbjct: 194 KGIDTESSYPYTAEDGKECKFKTSNIGAQIVSYQNVTSGSEASLQSASNNAPVS 247


>gi|397517049|ref|XP_003828732.1| PREDICTED: cathepsin F [Pan paniscus]
          Length = 379

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 100/227 (44%), Positives = 134/227 (59%), Gaps = 14/227 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
           F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 82  FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 141

Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  N  LR  P +  K      +  P ++DWR  GAVT VKDQG CGSCW+FS TG +
Sbjct: 142 IYL--NPLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 199

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I   GG+E E 
Sbjct: 200 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 250

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY Y G    SC F   K    +++  V+S +E ++AA L K GP++
Sbjct: 251 DYSYQG-HMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPIS 296


>gi|343472974|emb|CCD15016.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 93/236 (39%), Positives = 124/236 (52%), Gaps = 14/236 (5%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQG C S W+FSATG
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSATG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD +         D GC  G  + AF +I+ +  G V
Sbjct: 158 NIEGQWKVAGHELTSLSEQMLVSCDTD---------DLGCRDGFPDIAFNWIVSSNKGNV 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
             E+ YPY    G     DKS   + A + +   ++ DED +A  L + GP A  V
Sbjct: 209 FTEQSYPYASGGGNVPTCDKSGKVVGAKIRDHVDLARDEDMIAEWLARKGPAAITV 264


>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
 gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
          Length = 338

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 98/251 (39%), Positives = 133/251 (52%), Gaps = 20/251 (7%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           E H+ L+K+  SK+Y   EE  +R  V++ NL++ +   L      H    G+  F D+T
Sbjct: 27  EDHWHLWKNWHSKSYHESEE-GWRRMVWEKNLKKIEMHNLEHTMGKHSYRLGMNHFGDMT 85

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
             EFR+   G  +        + +  +  N L  P   DWR+ G VT VKDQG+CGSCW+
Sbjct: 86  NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TGA+EG  F  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+YI   
Sbjct: 144 FSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
            G++ E+ YPY GTD   C +      A  + F  I S  E  M   +   GP++    +
Sbjct: 197 AGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPSGKEHAMMKAVAAVGPVS---VA 253

Query: 283 IELPHISFSFL 293
           I+  H SF F 
Sbjct: 254 IDAGHESFQFY 264


>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
 gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 95/272 (34%), Positives = 146/272 (53%), Gaps = 20/272 (7%)

Query: 11  LLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEH 70
              LS  LA  +++ D +    QV     E++E   L     + ++  K+ K Y    E 
Sbjct: 14  FYFLSVCLAIDMSIIDYNLKHGQVP----ERTEAETLRL---YEMWLVKYGKAYNALGEK 66

Query: 71  DYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFLG--LNRRLRLPADAQ 127
           + RF +FK NL+   +   + +P+   G+ KF+DL+  E+R  +LG  ++ + RL    +
Sbjct: 67  ERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRRLLGGPK 126

Query: 128 KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
            A  L    +DLP   DWR+ GAV  VKDQG CGSCW+FS  GA+EG + + TG L SLS
Sbjct: 127 SARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLS 186

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQ+LVDCD           + GCNGGLM+ AFE+I+K GG++ E+DYPY   D       
Sbjct: 187 EQELVDCDK--------VYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDSMCDPNR 238

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           K+     +  +  +  ++++     V + P++
Sbjct: 239 KNARVVTIDGYEDVPQNDEKSLRKAVANQPVS 270


>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 92/239 (38%), Positives = 130/239 (54%), Gaps = 17/239 (7%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
           +K   +K Y+   E   R+ ++K N RR +   L     +  + +F D+T SEF+     
Sbjct: 30  WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFLLKMNQFGDMTNSEFK----A 85

Query: 116 LNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
            N  L          + P N + P   DWR+ G VT VKDQG CGSCW+FS TG+LEG H
Sbjct: 86  FNGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQH 145

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
           F  TG+LVSLSEQ LVDC        +   ++GCNGGLM++AF YI +  G++ E  YPY
Sbjct: 146 FKKTGKLVSLSEQNLVDC-------STAYGNNGCNGGLMDNAFTYIKENKGIDSEASYPY 198

Query: 235 TGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           T  D G C F K  +AA  + F  +   +E+++   +   GP++    +I+  H SF F
Sbjct: 199 TAED-GKCVFKKPSVAATDTGFVDLPEGNENKLKEAVASVGPIS---VAIDASHESFQF 253


>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
          Length = 338

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 109/287 (37%), Positives = 146/287 (50%), Gaps = 42/287 (14%)

Query: 13  LLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDY 72
           + SS +A+AV V    A   +V P       D+++     F+ FK+K+ K Y    E   
Sbjct: 1   MKSSCIAAAVLV----AAGHEVPP------PDYMM----MFNNFKTKYGKVYNGINEDAV 46

Query: 73  RFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA-PI 131
           RF +FKAN+         + T   GV +F+DLT  E    + GL      PA      P 
Sbjct: 47  RFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEELAASYTGLK-----PASLWSGLPR 101

Query: 132 LPTND-----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSE 186
           L T++     L +  DW   G VT VK+QG CGSCWSFS TGALEGA  LSTG LVSLSE
Sbjct: 102 LSTHEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFSTTGALEGAWALSTGNLVSLSE 161

Query: 187 QQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK 246
           QQ VDCD         + DSGCNGG M++AF +  K   +  E  YPYT TD G+C    
Sbjct: 162 QQFVDCD---------TTDSGCNGGWMDNAFSFA-KKNSICTEGSYPYTATD-GTCNLSG 210

Query: 247 SKIA---AAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
            ++      V  ++ +S+D +Q   + V   P++    +IE    SF
Sbjct: 211 CQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVS---IAIEADQYSF 254


>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
          Length = 334

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 96/251 (38%), Positives = 132/251 (52%), Gaps = 22/251 (8%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
            F  ++ KF +TY++  E   R + +  N +      +L    +     G+T F+D+   
Sbjct: 25  EFHAWRLKFGRTYSSPTEEAQRRQTWLNNRKLVLVHNILADQGIKSYRLGMTYFADMENE 84

Query: 108 EFRRQF----LGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCW 162
           E++R      LG +    LP        LP N DLP   DWRD G VT VKDQ  CGSCW
Sbjct: 85  EYKRLISQGCLG-SFNASLPRRGSTFFRLPENKDLPAAVDWRDKGYVTDVKDQKQCGSCW 143

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSATG+LEG  F  TG+LVSLSEQQLVDC  +         + GC GGLM+ AF YI  
Sbjct: 144 AFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGDYG-------NMGCGGGLMDDAFRYIQA 196

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVA 281
            GG++ E+ YPY   D G C++    + A  + +  +SS DED +   +   GP++    
Sbjct: 197 TGGIDTEESYPYEAED-GECRYKPDAVGATCTGYVDVSSGDEDALQEAVATIGPIS---V 252

Query: 282 SIELPHISFSF 292
            I+  HISF  
Sbjct: 253 GIDASHISFQL 263


>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
 gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
          Length = 338

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 99/251 (39%), Positives = 133/251 (52%), Gaps = 20/251 (7%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           E H+ L+K+  SK Y   EE  +R  V++ NL++ +   L      H    G+  F D+T
Sbjct: 27  EDHWHLWKNWHSKHYHESEE-GWRRMVWEKNLKKIEIHNLEHTMGKHSYRLGMNHFGDMT 85

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
             EFR+   G  +        + +  +  N L  P   DWR+ G VT VKDQG+CGSCW+
Sbjct: 86  NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TGA+EG  F  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+YI   
Sbjct: 144 FSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
            G++ E+ YPY GTD   C +     AA  + F  I S  E  M   +   GP++    +
Sbjct: 197 AGLDTEESYPYVGTDEDPCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVS---VA 253

Query: 283 IELPHISFSFL 293
           I+  H SF F 
Sbjct: 254 IDAGHESFQFY 264


>gi|281204231|gb|EFA78427.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
          Length = 329

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 95/256 (37%), Positives = 134/256 (52%), Gaps = 31/256 (12%)

Query: 47  LNAEHHFSLFKSKFSKTYATQE------EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
           L AE H+   +++F+     Q+      E   R+  FK NL    R   ++     G T 
Sbjct: 19  LFAEKHY---QNQFTNWMVVQDRQYDAYEFRTRYSAFKDNLDFIHRWNAVNKETELGATV 75

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN------DLPTDFDWRDHGAVTGVKD 154
           F+DLT  E+R  +LG+N       DA      P         + +  DWR++GAV  VKD
Sbjct: 76  FADLTNEEYRAVYLGMN------VDASNFAAQPATLDQVYQPVRSTLDWRNNGAVGRVKD 129

Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
           QG CGSCW+FS TGA+EGAH ++TG  VSLSEQQL+DC            + GC GGLM+
Sbjct: 130 QGQCGSCWAFSTTGAVEGAHQIATGNFVSLSEQQLMDCSRSYG-------NHGCQGGLMD 182

Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
           SA  YI+K GG+  E+ YPY   D  +CK++ +   A +S +S I    +   A  +  G
Sbjct: 183 SAMSYIVKQGGINTEESYPYEMRDSYTCKYNPANNGAKLSGYSNIKRGSEADLAAKLNIG 242

Query: 275 PLAGNVASIELPHISF 290
           P+A    +++  H SF
Sbjct: 243 PVA---IALDASHSSF 255


>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
          Length = 338

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 98/251 (39%), Positives = 133/251 (52%), Gaps = 20/251 (7%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           E H+ L+K+  SK Y   EE  +R  V++ NL++ +   L      H    G+  F D+T
Sbjct: 27  EDHWHLWKNWHSKNYHASEE-GWRRMVWEKNLKKIEIHNLEHTMGKHSHRLGMNHFGDMT 85

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
             EFR+   G  +        + +  +  N L  P   DWR+ G VT VKDQG+CGSCW+
Sbjct: 86  NEEFRQTMNGYKQTTE--RKFKGSLFMEPNYLQAPKAVDWREKGYVTPVKDQGSCGSCWA 143

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TGA+EG  F  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+YI   
Sbjct: 144 FSTTGAMEGQPFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIQDN 196

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
            G++ E+ YPY GTD   C +     AA  + F  + S  E  M   +   GP++    +
Sbjct: 197 AGLDTEESYPYVGTDEDPCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVS---VA 253

Query: 283 IELPHISFSFL 293
           I+  H SF F 
Sbjct: 254 IDAGHESFQFY 264


>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
          Length = 338

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 95/256 (37%), Positives = 137/256 (53%), Gaps = 21/256 (8%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTK 100
           +LL  E H  LFK+   K Y +Q E  +R +++  N  +  +  +L    + +    + K
Sbjct: 25  NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNK 82

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
           F DL   EFR    G   + +  + A+       P N ++P   DWR+ GA+T VKDQG 
Sbjct: 83  FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQ 142

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FS+TGALEG  F  TG+L+SLSEQ L+DC  +   E       GCNGGLM+ AF
Sbjct: 143 CGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 195

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
           +YI    G++ E  YPY   D   C+++     A    F  I S +ED++ A +   GP+
Sbjct: 196 QYIKDNKGIDTENTYPYEAED-DVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPV 254

Query: 277 AGNVASIELPHISFSF 292
           +    +I+  H SF F
Sbjct: 255 S---VAIDASHESFQF 267


>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
 gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
          Length = 469

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 85/228 (37%), Positives = 127/228 (55%), Gaps = 14/228 (6%)

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN---RRLRLPAD 125
           E + RF+VFK NLR        + +   G+ +F+DLT  E+R  +LG     +R RL   
Sbjct: 70  EKERRFQVFKDNLRFIDEHNSENRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRS 129

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
           + +      + LP   DWR  GAV  VKDQG+CGSCW+FS   A+EG + + TG+L+SLS
Sbjct: 130 SNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLS 189

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQ+LVDCD         S + GCNGGLM+ AF++I+  GG++ E+DYPY   DG    + 
Sbjct: 190 EQELVDCDR--------SYNEGCNGGLMDYAFQFIINNGGIDSEEDYPYLARDGTCDTYR 241

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFL 293
           K+     + N+  +  ++++     V + P++    +IE     F F 
Sbjct: 242 KNAKVVTIDNYEDVPVNDEKALQKAVANQPVS---VAIEAGGREFQFY 286


>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
          Length = 332

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 99/248 (39%), Positives = 127/248 (51%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKFSDLTPS 107
            +  FK+   K+Y +  E   RF++F  N L  AK         V    G+ +F DL   
Sbjct: 26  QWEAFKTTHKKSYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF R F G +   R    +   P    ND  LP   DWR  GAVT VKDQG CGSCW+FS
Sbjct: 86  EFARIFNG-HHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFS 144

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATG+LEG HFL  GELVSLSEQ LVDC            ++GC GGLM  AF+YI    G
Sbjct: 145 ATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG-------NNGCEGGLMEDAFKYIKANDG 197

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
           ++ EK YPY   D G C+F K  + A  + +  I +  E  +   +   GP++    +I+
Sbjct: 198 IDTEKSYPYEAVD-GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPIS---VAID 253

Query: 285 LPHISFSF 292
             H SF  
Sbjct: 254 ASHSSFQL 261


>gi|20069912|ref|NP_613116.1| cathepsin [Mamestra configurata NPV-A]
 gi|37077373|sp|Q8QLK1.1|CATV_NPVMC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|20043306|gb|AAM09141.1| cathepsin [Mamestra configurata NPV-A]
 gi|33331744|gb|AAQ11052.1| putative cysteine proteinase [Mamestra configurata NPV-A]
          Length = 337

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 96/290 (33%), Positives = 158/290 (54%), Gaps = 35/290 (12%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           ++ +L+LLL   L SAV  + D     QVV    + +  ++ +A  +F  F S+++K Y+
Sbjct: 1   MNKILILLL---LVSAVLTSHD-----QVVAVTIKPNLYNINSAPLYFEKFISQYNKQYS 52

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           +++E  YR+ +F+ N+     +   + +AV+ + +F+D+T +E       +NR   L + 
Sbjct: 53  SEDEKKYRYNIFRHNIESINAKNSRNDSAVYKINRFADMTKNEV------VNRHTGLASG 106

Query: 126 AQKAPILPT--------NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS 177
              A    T           P +FDWR++  VT VKDQG CG+CW+F+  GALE  + + 
Sbjct: 107 DIGANFCETIVVDGPGQRQRPANFDWRNYNKVTSVKDQGMCGACWAFAGLGALESQYAIK 166

Query: 178 TGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
              L+ L+EQQLVDCD           D GC+GGL+++A+E I+  GGVE+E DYPY   
Sbjct: 167 YDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMHIGGVEQEYDYPYKAV 217

Query: 238 DGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKH-GPLAGNVASIEL 285
               C     K A  V N +  +   E+++  +L++H GP+A  V +++L
Sbjct: 218 R-LPCAVKPHKFAVGVRNCYRYVLLSEERL-EDLLRHVGPIAIAVDAVDL 265


>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 360

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 19/234 (8%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGV-------TKFSDLTPS 107
           + ++  +TYA  EE   R  +F+AN  R        D  A   V        +F+DLT  
Sbjct: 46  WMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATNRFADLTDE 105

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD----FDWRDHGAVTGVKDQGACGSCWS 163
           EFR    GL R   +              L  D     DWR  GAVTGVKDQG+CG CW+
Sbjct: 106 EFRAARTGLRRPAAVAGAVGGGFRYENFSLQADAAGSMDWRAMGAVTGVKDQGSCGCCWA 165

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSA  A+EG   + TG LVSLSEQQLVDCD   D       D GC GGLM++AF+YI + 
Sbjct: 166 FSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGD-------DQGCEGGLMDNAFQYISRQ 218

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG+  E  YPY+G DGGSC+  +++ AA++     + ++ +      V H P++
Sbjct: 219 GGLASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVS 272


>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
 gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
          Length = 345

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 91/235 (38%), Positives = 130/235 (55%), Gaps = 29/235 (12%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANL--------RRAKRRQLLDPTAVHGVTKFSDL 104
           +  +  +  K Y +  E+  RF++FK N+        RR     L       G+ KF+DL
Sbjct: 38  YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSL-------GLNKFADL 90

Query: 105 TPSEFRRQFLGLNRRLRLPADAQK-APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           T SEFR  ++G   RL+ PA   +   I    D  T  DWR  G VT +KDQG CGSCW+
Sbjct: 91  TNSEFRGLYVG---RLQRPAPFHEVGDIALVADTATSVDWRKKGGVTEIKDQGDCGSCWA 147

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSA  A+EG  FLSTG LVSLSEQ+LVDCD         + + GC+GG+M+ AF+Y+++ 
Sbjct: 148 FSAVAAVEGLTFLSTGTLVSLSEQELVDCDT--------TVNQGCDGGIMDYAFQYMIRN 199

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG+  + +YPY     G+C  DK K  AA ++ F  I    +++    V + P++
Sbjct: 200 GGITSQSNYPYRALR-GACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVS 253


>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
          Length = 343

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 100/295 (33%), Positives = 150/295 (50%), Gaps = 41/295 (13%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           L LLL+ ++LA+A A+                 S   L+N E  ++ FK + +K Y    
Sbjct: 3   LFLLLIVAILATAQAI-----------------SFFELVNQE--WTTFKMEHNKVYKNDI 43

Query: 69  EHDYRFRVFKANLRRAKRR----QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA 124
           E  +R ++F  N  +  +     ++   +    + K+ D+   EF     G N+ +    
Sbjct: 44  EERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQL 103

Query: 125 DAQKAPI-----LPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
            +++ PI      P N  LP   DWR+HGAVT VKDQG CGSCWSFSATGALEG HF  T
Sbjct: 104 RSERLPIGASFIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRT 163

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L+ LSEQ L+DC  +         ++GCNGGLM+ AF+YI    G++ E  YPY   +
Sbjct: 164 GILIPLSEQNLIDCSGKYG-------NNGCNGGLMDQAFQYIKDNKGLDTEVTYPYE-AE 215

Query: 239 GGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
              C+++ +   A  V    +   +E ++ A +   GP++    +I+  H SF F
Sbjct: 216 NDKCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVS---VAIDASHQSFQF 267


>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 325

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 90/245 (36%), Positives = 128/245 (52%), Gaps = 14/245 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           +  +KS   K Y  Q E D+R  VF  N++          T    + +FSDLT  EF + 
Sbjct: 25  WEAWKSFHGKKYHNQGEDDFRHYVFLQNIKTIAAHNA-KSTFKMAINEFSDLTRKEFVKT 83

Query: 113 FLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
           + G    ++   +     + P N ++PT+ DWR  G VT +K+QG CGSCW+FS TG+LE
Sbjct: 84  YNGYRLSMKKSTNKPSTFMAPLNTNMPTEVDWRKEGYVTPIKNQGRCGSCWAFSTTGSLE 143

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           G HF  TG+LVSLSEQ L+DC        +   + GC GG M+ AFEYI    G++ E  
Sbjct: 144 GQHFRKTGKLVSLSEQNLIDC-------SAAEGNDGCGGGFMDDAFEYIKLNNGIDTEAS 196

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASIELPHISF 290
           YPY G D   C++ K+   A  + +  I    ED + A +   GP++    +I+  H SF
Sbjct: 197 YPYEGRD-DICRYKKTNKGAIDTGYMDIKQYSEDDLKAAVATVGPIS---VAIDASHKSF 252

Query: 291 SFLFT 295
               T
Sbjct: 253 HMYHT 257


>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
 gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
          Length = 341

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 95/254 (37%), Positives = 138/254 (54%), Gaps = 25/254 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           +S FK +    Y ++ E ++R +++  +    AK  Q  +   V    G+ K+ D+   E
Sbjct: 27  WSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHE 86

Query: 109 FRRQFLGLNR------RLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACG 159
           F +   G N+       L +   + +    I P N  LP   DWR HGAVT +KDQG CG
Sbjct: 87  FVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCG 146

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCWSFS TGALEG HF  +G LVSLSEQ L+DC      E+ G  ++GCNGGLM++AF+Y
Sbjct: 147 SCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC-----SEQYG--NNGCNGGLMDNAFKY 199

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAG 278
           I   GG++ E+ YPY G D   C+++ K+  A  V    +   DE ++   +   GP++ 
Sbjct: 200 IKDNGGIDTEQTYPYEGVD-DKCRYNPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVS- 257

Query: 279 NVASIELPHISFSF 292
              +I+  H SF  
Sbjct: 258 --VAIDASHTSFQL 269


>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
 gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
          Length = 446

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 97/265 (36%), Positives = 146/265 (55%), Gaps = 23/265 (8%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEE-HDYRFRVFKANLRRAKRRQLLDPTAVH-GVT 99
           S D  L+ E  ++ + +KF K  A+     D RF  FK N R  +        +   G+ 
Sbjct: 4   SSDSDLSGE--YASWCAKFGKECASSNSLGDRRFETFKENFRYIEEHNRAGKHSYRLGLN 61

Query: 100 KFSDLTPSEFRRQFLGL------NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVK 153
           +FSDLT  EFR++FLGL      +  L++P D+         DLP   DWR HGAVT  K
Sbjct: 62  QFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRKHGAVTAPK 121

Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
           DQG+CG CW+F+ TGA+EG + + TG+L+SLSEQ+L+DCD +         D GC+GGLM
Sbjct: 122 DQGSCGGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKK--------ADKGCDGGLM 173

Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVK 272
            +A+++I++ GG++ E DYPY  ++   C   K +    A+  +  I   ++Q     V 
Sbjct: 174 ENAYQFIVENGGLDTETDYPYHASE-SHCNMKKLNSRVVAIDGYEAIPDGDEQALLRAVA 232

Query: 273 HGPLAGNV--ASIELPHISFSFLFT 295
             P++  +  AS +  H + S +FT
Sbjct: 233 KQPVSVAIEGASKDFQHYA-SGVFT 256


>gi|390178852|ref|XP_003736743.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
 gi|388859612|gb|EIM52816.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
          Length = 477

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 85/236 (36%), Positives = 132/236 (55%), Gaps = 15/236 (6%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDL 104
           L   +H F  F+ +F + Y    E   R R+F+ NL+  +     +  +A +G+T+F+D+
Sbjct: 164 LDKVDHLFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADM 223

Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           T +E++ +  GL +R           ++P    + P +FDWR   AVT VK+QG+CGSCW
Sbjct: 224 TSTEYKER-TGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSCW 282

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FS TG +EG + + TGEL   SEQ+L+DCD         + DS CNGGLM++A++ I  
Sbjct: 283 AFSVTGNIEGLYAVKTGELKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 333

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
            GG+E E +YPY       C F+++     VS F  +   +E  M   L+ HGP++
Sbjct: 334 IGGLEYEAEYPYEAKK-QQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPIS 388


>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
           [Tribolium castaneum]
 gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
          Length = 337

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 99/253 (39%), Positives = 134/253 (52%), Gaps = 23/253 (9%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDL 104
            +  +  FK    K Y ++ E  +R ++F  N  + AK  +L     V    GV K+SD+
Sbjct: 23  VQEQWGAFKVTHKKQYESETEERFRMKIFMENAHKVAKHNKLYAQGLVSFKLGVNKYSDM 82

Query: 105 TPSEFRRQFLGLNRRLRLPA-----DAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGAC 158
              EF     G NR  + P      D     I P N +LP   DWR  GAVT VKDQG C
Sbjct: 83  LNHEFVHTLNGYNRS-KTPLRSGELDESITFIPPANVELPKQIDWRKLGAVTPVKDQGQC 141

Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
           GSCWSFS TG+LEG HF  + +LVSLSEQ L+DC      E+ G  ++GCNGGLM++AF 
Sbjct: 142 GSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCS-----EKYG--NNGCNGGLMDNAFR 194

Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
           YI   GG++ E+ YPY   D   C +      A    F  I S DE+++ A +   GP++
Sbjct: 195 YIKDNGGIDTEQSYPYKAED-EKCHYKPRNKGATDRGFVDIESGDEEKLKAAVATVGPIS 253

Query: 278 GNVASIELPHISF 290
               +I+  H +F
Sbjct: 254 ---VAIDASHPTF 263


>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
          Length = 467

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 102/289 (35%), Positives = 146/289 (50%), Gaps = 22/289 (7%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIR--QVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT 66
            +LL LS  L+SA     D ++I   Q   +      D  + A +   L K    K Y  
Sbjct: 12  FVLLFLSFTLSSA----SDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQ--GKVYNA 65

Query: 67  QEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN---RRLRLP 123
             E + RF+VFK NLR        + T   G+  F+DLT  E+R  +LG     +R RL 
Sbjct: 66  LGEREKRFQVFKDNLRFIDEHNSENRTYKLGLNGFADLTNEEYRSTYLGARGGMKRNRLR 125

Query: 124 ADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
             + +        LP   DWR  GAV  VKDQG+CGSCW+FS   A+EG + + TG+L+S
Sbjct: 126 KTSDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLIS 185

Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
           LSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG++ E+DYPY   DG    
Sbjct: 186 LSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDT 237

Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           + K+     + ++  +  + +      V + P++    +IE     F F
Sbjct: 238 YRKNAKVVTIDDYEDVPVNSETALQKAVANQPVS---VAIEAGGRDFQF 283


>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 92/253 (36%), Positives = 133/253 (52%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
           D   +AE H   +KS   + Y T EE ++R  +++ N+R  +          HG    + 
Sbjct: 22  DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    +P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA+G LEG  FL TG+L+SLSEQ LVDC H          + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           I + GG++ E+ YPY   D GSCK+      A  + F  I   E+ +   +   GP++  
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEEALMKAVATVGPIS-- 246

Query: 280 VASIELPHISFSF 292
             +++  H S  F
Sbjct: 247 -VAMDASHPSLQF 258


>gi|343470212|emb|CCD17026.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 88/233 (37%), Positives = 122/233 (52%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQG C S W+F+  G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD           D GC  G M++AF++I+ +  G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCDTN---------DLGCRAGFMDTAFKWIVSSNNGNV 208

Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY    G   +C      + A + +   I  +E+ +A  L K GP+A
Sbjct: 209 FTEQSYPYASGGGNVPTCNKSGKVVGANIDDHVHILDNENAIAEWLAKKGPVA 261


>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
          Length = 326

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 94/243 (38%), Positives = 129/243 (53%), Gaps = 19/243 (7%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTKFSDLTPSEFRR 111
           +K  + K Y TQ+E   R  ++  NL+  +    +      T    + +F DLT  E+R 
Sbjct: 25  WKRTYGKEY-TQKEEALRHMIWNVNLKMIQMHNEKYMSGKSTYTQNMNQFGDLTNEEYRE 83

Query: 112 QFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
              G  +  +         +LP+N   P   DWR  G VT VKDQGACGSCW+FS+TG+L
Sbjct: 84  LMCGYKKSNKTVISKPSTFLLPSNYRAPASIDWRTQGYVTDVKDQGACGSCWAFSSTGSL 143

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  F  TG+LV LSEQQLVDC  +         + GC GG M+ AF YI K  G E E 
Sbjct: 144 EGQTFKKTGKLVPLSEQQLVDCSGDYG-------NMGCGGGWMDQAFSYI-KDKGEESED 195

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASIELPHIS 289
            YPYTGTD  +C +D SK+ A  + ++ I   DE+ +   +   GP++    +I+  H S
Sbjct: 196 GYPYTGTD-DTCVYDASKVVATDTGYTDIPEMDENALQQAVATVGPIS---VAIDATHSS 251

Query: 290 FSF 292
           F F
Sbjct: 252 FQF 254


>gi|288548564|gb|ADC52430.1| cathepsin L1 cysteine protease [Pinctada fucata]
          Length = 331

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 85/206 (41%), Positives = 116/206 (56%), Gaps = 13/206 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           +  ++++K  F+K Y   EE   R  V++ N+   ++         H    G  +++D+T
Sbjct: 25  DQEWAIYKDMFAKNYVADEERMRRL-VWEDNIDYIEKHNRRADRGEHKFWLGTNEYADMT 83

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
             EF+    G   +     D   +P     DLP   DWRD G VT VK+QG CGSCWSFS
Sbjct: 84  IDEFKAIMNGFIMQNGTKGDTYMSPS-NIGDLPDKVDWRDKGYVTPVKNQGHCGSCWSFS 142

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATG+LEG HF STG+LVSLSEQ L+DC  +         + GC GGLM+ AFEYI K  G
Sbjct: 143 ATGSLEGQHFKSTGKLVSLSEQNLIDCSKK-------EGNHGCKGGLMDFAFEYIQKNDG 195

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAA 251
           ++ E+ YPYT  DG  C+F K+ + A
Sbjct: 196 IDTEQSYPYTAKDGIECRFKKADVGA 221


>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 356

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 107/296 (36%), Positives = 158/296 (53%), Gaps = 30/296 (10%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFK---SKF 60
           + LS  LLLL    + + VA N D +++          SE+ L + E    LF+   +K 
Sbjct: 8   MKLSGALLLL---CVGACVARNSDFSIVGY--------SEEDLSSNERLVELFEKWLAKH 56

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR- 119
            K YA+ EE  +RF VFK NL+   +      +   G+ +F+DLT  EF+  +LGL+   
Sbjct: 57  QKAYASFEEKLHRFEVFKDNLKHIDKINREVTSYWLGLNEFADLTHDEFKAAYLGLDAAP 116

Query: 120 -LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
             R  + + +   +  +DLP   DWR  GAVT VK+QG CGSCW+FS   A+EG + + T
Sbjct: 117 ARRGSSRSFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVT 176

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L +LSEQ+L+DC        S   +SGCNGGLM+ AF YI  +GG+  E+ YPY   +
Sbjct: 177 GNLTALSEQELIDC--------SVDGNSGCNGGLMDYAFSYIASSGGLHTEEAYPYL-ME 227

Query: 239 GGSCKFDKSKIAAAV--SNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
            GSC   K   + AV  S +  + ++++Q     + H P++    +IE     F F
Sbjct: 228 EGSCGDGKKAESEAVTISGYEDVPANDEQALIKALAHQPVS---VAIEASGRHFQF 280


>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
          Length = 358

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 108/294 (36%), Positives = 152/294 (51%), Gaps = 26/294 (8%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNA------EHHFS 54
           M R +    L+ L S++LA A    D+  +I Q V    +  E  LL          HF+
Sbjct: 1   MARFLAFLALVFLSSAILARANHAFDEANLI-QSVTERIDSLETSLLGVLGQTRNALHFA 59

Query: 55  LFKSKFSKTYATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F  ++ K Y + EE   RF +F  NL   R   RR L  P  + G+ +++D++  EFR 
Sbjct: 60  RFAHRYGKRYQSVEEMKLRFAIFMENLELIRSTNRRGL--PYKL-GINRYADMSWEEFRA 116

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
             LG  +     A  +    +    LP   DWR+ G V+ VKDQG+CGSCW+FS TGALE
Sbjct: 117 SRLGAAQNC--SATLKGNHKMTDELLPKTKDWREDGIVSPVKDQGSCGSCWTFSTTGALE 174

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
            A+  +TG+ +SLSEQQLVDC +  +       + GCNGGL + AFEYI   GG++ E+ 
Sbjct: 175 AAYTQATGKGISLSEQQLVDCAYAFN-------NFGCNGGLPSQAFEYIKYNGGLDTEES 227

Query: 232 YPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVKHGPLAGNVAS 282
           YPY G + G C F    +   V    N ++ + DE   A  LV+   +A  V S
Sbjct: 228 YPYAGVN-GFCHFKPENVGVKVVESVNITLGAEDELLHAVGLVRPVSIAFEVVS 280


>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
 gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
          Length = 341

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 97/254 (38%), Positives = 135/254 (53%), Gaps = 25/254 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVHGV---TKFSDLTPSE 108
           +S FK + SK Y ++ E  +R +++  N  R AK  Q  +  AV       K++D+   E
Sbjct: 27  WSAFKLEHSKRYDSEVEDKFRMKIYLENKHRIAKHNQRFEQGAVSYKLRPNKYADMLSHE 86

Query: 109 FRRQFLGLNRRLRLPADAQKAP--------ILPTN-DLPTDFDWRDHGAVTGVKDQGACG 159
           F     G N+ L+ P               I P +   P   DWR  GAVT VKDQG CG
Sbjct: 87  FVHVMNGFNKTLKHPKAVHGKGRESRPATFIAPAHVTYPDHVDWRKKGAVTEVKDQGKCG 146

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FS TGALEG HF  TG LVSLSEQ L+DC        +   ++GCNGGLM++AF+Y
Sbjct: 147 SCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDC-------SAAYGNNGCNGGLMDNAFKY 199

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAG 278
           I   GG++ EK YPY G D   C+++ K+  A  V    +   DE+++   +   GP++ 
Sbjct: 200 IKDNGGIDTEKAYPYEGVD-DKCRYNAKNSGADDVGFVDIPQGDEEKLMQAVATVGPVS- 257

Query: 279 NVASIELPHISFSF 292
              +I+    SF F
Sbjct: 258 --VAIDASQESFQF 269


>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
 gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
          Length = 365

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 86/246 (34%), Positives = 135/246 (54%), Gaps = 21/246 (8%)

Query: 37  SDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH 96
           SDGE  E         + L+ +K  K Y   +E + RF++FK NL+        + T   
Sbjct: 27  SDGEVRE--------IYDLWLAKHGKAYNGIDEREKRFQIFKENLKFIDDHNSENRTYKV 78

Query: 97  GVTKFSDLTPSEFRRQFLGLN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTG 151
           G+  F+DLT  E+R  +LG       R ++    +++  +   + LP   DWR  GAV  
Sbjct: 79  GLNMFADLTNEEYRALYLGTRSPPARRVMKAKTASRRYAVNNLDRLPESMDWRTRGAVAP 138

Query: 152 VKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGG 211
           VK+QG+CGSCW+FS   A+EG + + TGEL+SLSEQ+LV CD +         +SGCNGG
Sbjct: 139 VKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKK--------YNSGCNGG 190

Query: 212 LMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLV 271
           LM+ AF++I+  GG++ E+DYPY   DG      K+    ++  +  + +++++     V
Sbjct: 191 LMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDAYEDVPANDEESLKKAV 250

Query: 272 KHGPLA 277
            H P++
Sbjct: 251 AHQPVS 256


>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
          Length = 319

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 95/248 (38%), Positives = 132/248 (53%), Gaps = 17/248 (6%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+ L+KS  +K Y  +EE  +R  V++ NL+  +   L      H    G+ +F D+T  
Sbjct: 9   HWQLWKSWHNKDYHEREE-SWRRVVWEKNLKMIELHNLDHTLGKHSYKLGMNQFGDMTTE 67

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           EFR+   G   +           + P+  + P   DWR+ G VT VKDQG CGSCW+FS 
Sbjct: 68  EFRQLMNGYAHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFST 127

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TGALEG HF  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+Y+   GG+
Sbjct: 128 TGALEGQHFRKTGKLVSLSEQNLVDCSR---PE----GNQGCNGGLMDQAFQYVQDNGGI 180

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIEL 285
           + E+ YPYT  D   C++     AA  + F  +    E  +   +   GP++    +I+ 
Sbjct: 181 DSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVS---VAIDA 237

Query: 286 PHISFSFL 293
            H SF F 
Sbjct: 238 GHSSFQFY 245


>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
          Length = 339

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 93/252 (36%), Positives = 138/252 (54%), Gaps = 24/252 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLTPSE 108
           ++ FK    K Y ++ E  +R ++F  N  +     ++ +L + +   G+ K+ D+   E
Sbjct: 28  WNTFKVTHRKAYDSKIEESFRMKIFMENWHKIALHNQKYELNEVSYKLGMNKYGDMLHHE 87

Query: 109 FRRQFLGLNRRLRLPADAQKAPI-----LPTN-DLPTDFDWRDHGAVTGVKDQGACGSCW 162
           F     G N+ +     AQ+ PI      P N ++P+  DWR HGAVT +KDQG CGSCW
Sbjct: 88  FINTLNGFNKSVSAQLRAQRRPIGSRFIEPANVEIPSSVDWRTHGAVTPIKDQGHCGSCW 147

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYIL 221
           SFSATGALEG H+  TG+LVSLSEQ L+DC        SG   ++GCNGGLM+ AF+YI 
Sbjct: 148 SFSATGALEGQHYRITGKLVSLSEQNLIDC--------SGRYGNNGCNGGLMDQAFQYIK 199

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNV 280
              G++ E  YPY   +   C+++     A  S +  I   +E ++ A +   GP++   
Sbjct: 200 DNHGLDTEISYPYE-AENDKCRYNPRNNGATDSGYVDIPEGNEKKLKAAVATIGPVS--- 255

Query: 281 ASIELPHISFSF 292
            +I+    SF F
Sbjct: 256 VAIDASAESFQF 267


>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
          Length = 343

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 100/295 (33%), Positives = 149/295 (50%), Gaps = 41/295 (13%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           L L L+ +VLA+A A+                 S   L+N E  ++ FK + +K Y    
Sbjct: 3   LFLFLIVAVLATAQAI-----------------SFFELVNQE--WTTFKMEHNKVYKNDV 43

Query: 69  EHDYRFRVFKANLRRAKRR----QLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA 124
           E  +R ++F  N  +  +     ++   +    + K+ D+   EF     G N+ +    
Sbjct: 44  EERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQL 103

Query: 125 DAQKAPIL-----PTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
            +++ PI      P N  LP   DWR+HGAVT VKDQG CGSCWSFSATGALEG HF  T
Sbjct: 104 RSERLPIAASFIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRT 163

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G L+ LSEQ L+DC  +         ++GCNGGLM+ AF+YI    G++ E  YPY   +
Sbjct: 164 GILIPLSEQNLIDCSGKYG-------NNGCNGGLMDQAFQYIKDNKGLDTEVTYPYE-AE 215

Query: 239 GGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
              C+++ +   A  V    +   +E ++ A +   GP++    +I+  H SF F
Sbjct: 216 NDKCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVS---VAIDASHQSFQF 267


>gi|577617|gb|AAC37213.1| cysteine proteinase [Trypanosoma cruzi]
          Length = 467

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 92/237 (38%), Positives = 121/237 (51%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ANL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P D +          P   DWR+ GAVT VK+QG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAEERARVPVDVEVV------GAPAAKDWREEGAVTAVKNQGICGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA- 223
           +A G +EG  FL+   L  LSEQ LV CD+          +SGC GGL + AFE+I++  
Sbjct: 151 AAIGNIEGQWFLAGNPLTRLSEQMLVSCDNT---------NSGCGGGLSSKAFEWIVQEN 201

Query: 224 -GGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  YPY    G    CK     + A ++    +  DE Q+AA+    GPL+
Sbjct: 202 NGAVYTEDSYPYHSCIGIKLPCKDSDRTVGATITGHVELPQDEAQIAASGAVKGPLS 258


>gi|18138384|ref|NP_542680.1| cathepsin [Helicoverpa zea SNPV]
 gi|209401110|ref|YP_002273979.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
 gi|37077430|sp|Q8V5U0.1|CATV_NPVHZ RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|18028766|gb|AAL56202.1|AF334030_127 ORF57 [Helicoverpa zea SNPV]
 gi|209364362|dbj|BAG74621.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
          Length = 367

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 95/254 (37%), Positives = 137/254 (53%), Gaps = 28/254 (11%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR--AKRRQL----------LDPTAVH 96
           +E +F  F  +++K+Y   +E+ YR+ VFK NL +  ++ R+           L  +A  
Sbjct: 53  SEIYFKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQF 112

Query: 97  GVTKFSDLTPSEFRRQ----FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGV 152
           GV KFSD TP E        FL L++   L  + +     P   LP  +DWRD   VT +
Sbjct: 113 GVNKFSDKTPDEVLHSNTGFFLNLSQHYTL-CENRIVKGAPDIRLPDYYDWRDTNKVTPI 171

Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
           KDQG CGSCW+F A G +E  + +   +L+ LSEQQL+DCD           D GCNGGL
Sbjct: 172 KDQGVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCD---------EVDLGCNGGL 222

Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLV 271
           M+ AF+ +L  GGVE E DYPY G++   C  D  KIA  + S F     DE+++   + 
Sbjct: 223 MHLAFQELLLMGGVETEADYPYQGSE-QMCTLDNRKIAVKLNSCFKYDIRDENKLKELVY 281

Query: 272 KHGPLAGNVASIEL 285
             GP+A  V ++++
Sbjct: 282 TTGPVAIAVDAMDI 295


>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
          Length = 345

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 93/265 (35%), Positives = 145/265 (54%), Gaps = 23/265 (8%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
            S  L+L  S  L +++A   D +++     S+  +S D L+     F  + SK  K Y 
Sbjct: 5   FSKALVLACSFCLFASLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSKHGKIYQ 59

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLR 121
           + EE   RF +FK NL+    R  +      G+ +F+DL+  EF+ ++LGL    +RR  
Sbjct: 60  SIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRRE 119

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
            P +     +    +LP   DWR  GAV  VK+QG+CGSCW+FS   A+EG + + TG L
Sbjct: 120 SPEEFTYKDV----ELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 175

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
            SLSEQ+L+DCD         +  +GCNGGLM+ AF +I++ GG+ +E+DYPY   + G+
Sbjct: 176 TSLSEQELIDCDR--------TYSNGCNGGLMDYAFSFIVENGGLHKEEDYPYI-MEEGT 226

Query: 242 CKFDKSKI-AAAVSNFSVISSDEDQ 265
           C+  K +     +S +  +  + +Q
Sbjct: 227 CEMTKEETEVVTISGYHDVPQNNEQ 251


>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
          Length = 374

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 93/286 (32%), Positives = 150/286 (52%), Gaps = 26/286 (9%)

Query: 1   MERLILSSLLLLLLSS----VLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLF 56
           M + I+++LL  L SS    +  S +   ++    +  + SD    ED + N    + ++
Sbjct: 1   MAKTIITTLLFALFSSLSYAIDMSIIDYKNNHYARKWTLQSD----EDQVKN---RYEMW 53

Query: 57  KSKFSKTYATQEEHDYRFRVFKANLRRAK-RRQLLDPTAVHGVTKFSDLTPSEFRRQFLG 115
            ++  + Y    E + RF +FK NLR  +      + T   G+ +F+DLT  E+R  +LG
Sbjct: 54  LAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADLTNEEYRTMYLG 113

Query: 116 LN-----RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
                  R ++    +Q+    P   +P   DWR  GAV  +K+QG+CGSCW+FS   A+
Sbjct: 114 TKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFSTVAAV 173

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG + + TGE+++LSEQ+LVDCD           +SGCNGGLM+ AFE+I+  GG++ EK
Sbjct: 174 EGINQIVTGEMITLSEQELVDCDR--------VQNSGCNGGLMDYAFEFIISNGGMDTEK 225

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
            YPY G +G      K+    ++  +  +  +E  +    V H P+
Sbjct: 226 HYPYRGVEGRCDPVRKNYKVVSIDGYEDVPRNERAL-QKAVAHQPV 270


>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
 gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 83/203 (40%), Positives = 120/203 (59%), Gaps = 16/203 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFR 110
           F  ++ +  K Y   EE + RF  FK NL+    +   + T  H  G+ KF+DL+  EF+
Sbjct: 43  FQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLRHRVGLNKFADLSNEEFK 102

Query: 111 RQFLGLNRR----LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           + +L   ++     R+ A+ +    L + D P+  DWR  G VT VKDQG CGSCWSFS 
Sbjct: 103 QLYLSKVKKPINKTRIDAEDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDCGSCWSFST 162

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TGA+EG + + T +L+SLSEQ+LVDCD         + + GC GG M+ AFE+++  GG+
Sbjct: 163 TGAIEGINAIVTSDLISLSEQELVDCD---------TTNYGCEGGYMDYAFEWVINNGGI 213

Query: 227 EREKDYPYTGTDGGSCKFDKSKI 249
           + E +YPYTG D G+C   K +I
Sbjct: 214 DTEANYPYTGVD-GTCNTAKEEI 235


>gi|229367042|gb|ACQ58501.1| Cathepsin L precursor [Anoplopoma fimbria]
          Length = 334

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 98/258 (37%), Positives = 134/258 (51%), Gaps = 38/258 (14%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
           F  +K +F ++Y +  E   R  ++ +N R      ++    +     G+T F+D+   E
Sbjct: 26  FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85

Query: 109 FRRQF----LG-----LNRR----LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
           ++RQ     LG     L RR    LRLP  A         DLP   DWR+ G VT VKDQ
Sbjct: 86  YKRQISQGCLGSFNASLPRRGSAYLRLPEGA---------DLPNSVDWREKGYVTEVKDQ 136

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
             CGSCW+FS TG+LEG  F  TG+LVSLSEQQLVDC  +   E       GC GGLM+S
Sbjct: 137 KQCGSCWAFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNE-------GCMGGLMDS 189

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHG 274
           AF YI   GG++ E  YPY   D G C+++ + I A  + +  V   DED +   +   G
Sbjct: 190 AFRYIQANGGIDTEDSYPYEAED-GQCRYNSANIGATCTGYVDVKQGDEDALKEAVATIG 248

Query: 275 PLAGNVASIELPHISFSF 292
           P++    +I+  H SF  
Sbjct: 249 PVS---VAIDASHSSFQL 263


>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
          Length = 328

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 83/217 (38%), Positives = 122/217 (56%), Gaps = 16/217 (7%)

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRL 122
           + D RF +FK NLR        +  A +  G+T F++LT  E+R  +LG      RR+  
Sbjct: 24  QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRRITK 83

Query: 123 PADA--QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
             +   + +  +  +++P   DWR  GAV  +KDQG CGSCW+FS   A+EG + + TGE
Sbjct: 84  AKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGE 143

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           LVSLSEQ+LVDCD         S + GCNGGLM+ AF++I+K GG+  EKDYPY GT+G 
Sbjct: 144 LVSLSEQELVDCDK--------SYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGK 195

Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
                K+     +  +  + S ++      V + P++
Sbjct: 196 CNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVS 232


>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
 gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
          Length = 340

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 97/253 (38%), Positives = 134/253 (52%), Gaps = 24/253 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           ++ +K +  K Y ++ E   R +++  N  + AK  Q  +         V K++DL   E
Sbjct: 27  WNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLHEE 86

Query: 109 FRRQFLGLNR-RLRLPA------DAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGS 160
           F +   G NR   + P       D     I P N ++P   DWR+ GAVT VKDQG CGS
Sbjct: 87  FVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQGHCGS 146

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CWSFSATGALEG HF  TG+LVSLSEQ LVDC        +   ++GCNGG+M+ AF+YI
Sbjct: 147 CWSFSATGALEGQHFRKTGKLVSLSEQNLVDCS-------TKYGNNGCNGGMMDFAFQYI 199

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGN 279
              GG++ EK YPY   D  +C ++   + A    F  I   DE  +   +   GP++  
Sbjct: 200 KDNGGIDTEKAYPYEAID-DTCHYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVS-- 256

Query: 280 VASIELPHISFSF 292
             +I+  H SF F
Sbjct: 257 -VAIDASHESFQF 268


>gi|426219849|ref|XP_004004130.1| PREDICTED: cathepsin L1 isoform 1 [Ovis aries]
 gi|426219851|ref|XP_004004131.1| PREDICTED: cathepsin L1 isoform 2 [Ovis aries]
          Length = 334

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 94/249 (37%), Positives = 130/249 (52%), Gaps = 17/249 (6%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSD 103
           N + H+  +K+   + Y   EE  +R  V++ N +             HG    +  F D
Sbjct: 24  NLDAHWHQWKATHRRLYGMNEE-GWRRAVWEKNKKIIDLHNQEYSQGKHGFSMAMNAFGD 82

Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           +T  EFR+   G   + R      + P+L   D+P   DW   G VT VK+QG CGSCW+
Sbjct: 83  MTNEEFRQVMNGFQNQKRKKGKLFREPLLI--DVPKSVDWTKKGYVTPVKNQGQCGSCWA 140

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSATGALEG  F  TG+LVSLSEQ LVDC     P+     + GCNGGLM++AF+YI + 
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR---PQG----NQGCNGGLMDNAFQYIKEN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG++ E+ YPY  TD  SC +     AA  + F  I   E  +   +   GP++    +I
Sbjct: 194 GGLDSEESYPYLATDTSSCNYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAI 250

Query: 284 ELPHISFSF 292
           +  H SF F
Sbjct: 251 DAGHASFQF 259


>gi|348542776|ref|XP_003458860.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 334

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 97/250 (38%), Positives = 131/250 (52%), Gaps = 22/250 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
           F  ++ KF K+Y +  E  +R +++  N +      +L          G+T F+D+   E
Sbjct: 26  FHAWRLKFGKSYDSPSEESHRKQIWLTNRKHVLMHNILADQGFKSYRLGMTYFADMENEE 85

Query: 109 FR----RQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           ++    R  LG +    LP        LP   DLP   DWR+ G VTGVKDQ  CGSCW+
Sbjct: 86  YKKLVSRGCLG-SFNASLPRRGSTFLRLPEGIDLPDAVDWREQGYVTGVKDQKQCGSCWA 144

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSATGALEG HF  TG LVSLSEQQLVDC      E       GCNGG M+SAF YI   
Sbjct: 145 FSATGALEGQHFRKTGILVSLSEQQLVDCSGAYGNE-------GCNGGWMDSAFRYIEAN 197

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
           GG++ E  YPY   D   C+++ + + A  S +  V   DE+ +   +   GP++    +
Sbjct: 198 GGIDTEASYPYEAED-WLCRYNPASVGATCSGYVDVNKYDEEALKEAVATIGPVS---VA 253

Query: 283 IELPHISFSF 292
           I+  H SF F
Sbjct: 254 IDASHASFQF 263


>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
          Length = 427

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 84/232 (36%), Positives = 127/232 (54%), Gaps = 14/232 (6%)

Query: 51  HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-----GVTKFSDLT 105
           +H   +  K  K Y    E + RF +F+ NL    +    +          G+ KF+DLT
Sbjct: 3   YHLQSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLT 62

Query: 106 PSEFRRQFLGLNRRLRLPA-DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
             EFRR + G+ R  +  +  + +  +   ++LP   DWR  GAV+ VKDQG CGSCW+F
Sbjct: 63  NDEFRRIYFGVKRPEKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAF 122

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           SA GA+EG + + TG+L++LSEQ+LVDCD         S +SGC+GGLM+ AF +I+  G
Sbjct: 123 SAIGAVEGINKIVTGDLITLSEQELVDCDT--------SYNSGCDGGLMDYAFRFIINNG 174

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
           G++ +KDYPY  TDG      K+     +     + ++ ++     V H P+
Sbjct: 175 GIDTDKDYPYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPV 226


>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
 gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
          Length = 336

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 99/293 (33%), Positives = 144/293 (49%), Gaps = 40/293 (13%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           ++LL+L +V+  A A          V+P + E            + ++K +  K Y T+ 
Sbjct: 1   MMLLILGAVITMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
           E   R   F+ N  +     +     +H  T    KF D+   EF ++ +G   ++    
Sbjct: 40  EEYSRRFTFEKNTIKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIVKVN 99

Query: 125 DAQKAPILPTND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
                  +  ND    LP   DWR+   V+ VKDQG CGSCW+FS TG+LEG H   TG+
Sbjct: 100 KPLLGSEVGDNDDNGTLPKSVDWRNSAMVSEVKDQGECGSCWAFSTTGSLEGQHANKTGK 159

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           LV LSEQQLVDC  +         + GC GGLM+ AF+YI   GG++ E+ YPYT TD  
Sbjct: 160 LVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDK 212

Query: 241 SCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
            CKFD S + A +  +  V S +E  +   +   GP++    +I+  H SF F
Sbjct: 213 PCKFDNSSVGATLIGYKDVKSGNEHALKRAVATVGPIS---VAIDAGHESFQF 262


>gi|38048171|gb|AAR09988.1| similar to Drosophila melanogaster CG12163, partial [Drosophila
           yakuba]
          Length = 213

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 78/192 (40%), Positives = 119/192 (61%), Gaps = 13/192 (6%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDL 104
           L  A+H F  F+ +F + Y +  E   R R+F+ NL+  ++  + +  +A +G+T+F+D+
Sbjct: 30  LDKADHLFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAKYGITEFADM 89

Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           T SE++ +  GL +R    A      ++P    +LP +FDWR   AVT VK+QG+CGSCW
Sbjct: 90  TSSEYKER-TGLWQRNEAKATGGSVAVVPAYHGELPKEFDWRQKNAVTQVKNQGSCGSCW 148

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FS TG +EG H + TG+L   SEQ+L+DCD         + DS CNGGLM++A++ I  
Sbjct: 149 AFSVTGNIEGLHAVKTGDLKEFSEQELLDCD---------TTDSACNGGLMDNAYKAIKD 199

Query: 223 AGGVEREKDYPY 234
            GG+E E +YPY
Sbjct: 200 IGGLEYEAEYPY 211


>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 99/289 (34%), Positives = 152/289 (52%), Gaps = 20/289 (6%)

Query: 7   SSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYAT 66
           S  L+L  S  L  ++A   D +++     S+  +S D L+     F  + S+  K Y T
Sbjct: 6   SKTLVLTCSLCLFLSLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYET 60

Query: 67  QEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRL--RLPA 124
            EE   RF VFK NL+    R  +      G+ +F+DL+  EF+ ++LGL   L  R  +
Sbjct: 61  IEEKLLRFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVDLSQRRES 120

Query: 125 DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
             ++       DLP   DWR  GAVT VK+QG CGSCW+FS   A+EG + + TG L SL
Sbjct: 121 SNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSL 180

Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
           SEQ+L+DCD         + ++GCNGGLM+ AF +I + GG+ +E+DYPY   +  +C+ 
Sbjct: 181 SEQELIDCDT--------TYNNGCNGGLMDYAFSFIGQNGGLHKEEDYPYI-MEESTCEM 231

Query: 245 DKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
            K +     ++ +  +  + +Q     + + PL+    +IE     F F
Sbjct: 232 KKEETQVVTINGYHDVPQNNEQSLLKALANQPLS---VAIEASSRDFQF 277


>gi|345783063|ref|XP_533219.3| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Canis lupus
           familiaris]
          Length = 490

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 95/226 (42%), Positives = 133/226 (58%), Gaps = 11/226 (4%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F + +++TY T+EE ++R  VF  N+ RA++ Q LD  TA +G+TKFSDLT  EFR 
Sbjct: 192 FKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRT 251

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
            +L    R       + A  +  +  P ++DWR  GAVT VKDQG CGSCW+FS TG +E
Sbjct: 252 IYLNPLLRENRGKKMRLAKSISDHAPPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVE 311

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           G  FL  G L+SLSEQ+L+DCD           D  C GGL ++A+  I+  GG+E E D
Sbjct: 312 GQWFLKEGTLLSLSEQELLDCD---------KVDKACLGGLPSNAYSAIMTLGGLETEDD 362

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           Y Y G    +C F   K    +++   +S +E ++AA L K GP++
Sbjct: 363 YSYQG-HLQACSFSAKKARVYINDSMELSQNEQKLAAWLAKKGPIS 407


>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
           D   +AE H   +KS   + Y T EE ++R  +++ N+R  +          HG    + 
Sbjct: 22  DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    +P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA+G LEG  FL TG+L+SLSEQ LVDC H          + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDYAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           I + GG++ E+ YPY   D GSCK+      A  + F  I   E  +   +   GP++  
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246

Query: 280 VASIELPHISFSF 292
             +++  H S  F
Sbjct: 247 -VAMDASHPSLQF 258


>gi|291385469|ref|XP_002709277.1| PREDICTED: cathepsin F [Oryctolagus cuniculus]
          Length = 460

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 101/256 (39%), Positives = 143/256 (55%), Gaps = 16/256 (6%)

Query: 26  DDDAMIRQVVPSDGEQS--EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRR 83
           D +  ++  +P+    S  +D  +     F  F   +++TY ++EE  +R  VF +N+ R
Sbjct: 134 DRNETLKSTLPALNRDSLPQDFSVKMASIFKKFVRTYNRTYESKEEAQWRLSVFASNMVR 193

Query: 84  AKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDF 141
           A++ Q LD  TA +G+TKFSDLT  EFR  +L  N  LR     +     P  D  P  +
Sbjct: 194 AQKIQSLDRGTAQYGITKFSDLTEEEFRTIYL--NPLLRSEPGKKMQLAKPVEDPAPPQW 251

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           DWR  GAVT VKDQG CGSCW+FS TG +EG  FL  G L+SLSEQ+L+DCD        
Sbjct: 252 DWRSKGAVTNVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDK------- 304

Query: 202 GSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS 261
              D  C GGL ++A+  I   GG+E E+DY Y G    +C F   K    +++   +S 
Sbjct: 305 --LDKACLGGLPSNAYSAIKNLGGLETEEDYTYQG-HMQACNFSAQKAKVYINDSVELSQ 361

Query: 262 DEDQMAANLVKHGPLA 277
           +E ++AA L K GP++
Sbjct: 362 NEQKLAAWLAKRGPIS 377


>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
           D   +AE H   +KS   + Y T EE ++R  +++ N+R  +          HG    + 
Sbjct: 22  DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    +P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLML--KIPKSVDWREKGCVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA+G LEG  FL TG+L+SLSEQ LVDC H          + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           I + GG++ E+ YPY   D GSCK+      A  + F  I   E  +   +   GP++  
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANGTGFVDIPQQEKALMKAVATVGPIS-- 246

Query: 280 VASIELPHISFSF 292
             +++  H S  F
Sbjct: 247 -VAMDASHPSLQF 258


>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
          Length = 350

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 103/267 (38%), Positives = 150/267 (56%), Gaps = 24/267 (8%)

Query: 8   SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTY 64
           SLL++L     A+A     D   IR V  SD E+    ++    H   F+ F +++ K Y
Sbjct: 5   SLLIVLFCVASAAAGFSFHDSNPIRMV--SDVEEQLLQVIGESRHAVSFARFANRYGKRY 62

Query: 65  ATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
            + +E   RF++F  NL   R + +R+L   +   GV  F+D T  EFR   LG  +   
Sbjct: 63  DSVDEMKLRFKIFSENLELIRSSNKRRL---SYKLGVNHFADWTWEEFRSHRLGAAQNC- 118

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
             A  +    +   +LP + DWR  G V+GVKDQG+CGSCW+FS TGALE A+  + G+ 
Sbjct: 119 -SATLKGNHKITDANLPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALESAYAQAFGKN 177

Query: 182 VSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           +SLSEQQLVDC        +G+ ++ GC+GGL + AFEYI   GG+E E+ YPYTG++ G
Sbjct: 178 ISLSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYIKYNGGLETEEAYPYTGSN-G 228

Query: 241 SCKFDKSKIAAAV-SNFSVISSDEDQM 266
            CKF    +A  V  + ++    ED++
Sbjct: 229 LCKFRSEHVAVKVLGSVNITLGAEDEL 255


>gi|111036374|dbj|BAF02516.1| cathepsin L-like proteinase [Echinococcus multilocularis]
          Length = 338

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 99/231 (42%), Positives = 127/231 (54%), Gaps = 18/231 (7%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKAN---LRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
           +K   +KTYAT  E   R R+F  N   +R    R  L   T    +  F+DLT  EF  
Sbjct: 33  WKVANNKTYATLREEHLRMRIFINNYLFVRWHNERYYLGLETYSTALNAFADLTLEEFAE 92

Query: 112 QFLGLNRRLR--LPADAQKAPI-LPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           ++L L +     +  D     +  PT  L P   DWR  G VT +KDQG CGSCW+FSAT
Sbjct: 93  KYLTLKQTPMEGIWQDMSTQYVERPTRMLVPDSIDWRKKGLVTPIKDQGDCGSCWAFSAT 152

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           GALEG     TG+L+SLSEQQLVDC        + + + GCNGG MN AF Y ++  G E
Sbjct: 153 GALEGQLKRKTGKLISLSEQQLVDC-------STYTGNEGCNGGDMNDAFRYWMR-NGAE 204

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLA 277
            E DYPYT  D G CKF+ SK+   VS F  V    EDQ+  ++ + GP++
Sbjct: 205 SESDYPYTAMD-GKCKFNSSKVVTKVSKFVKVPKKREDQLKLSVAQVGPVS 254


>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 101/297 (34%), Positives = 149/297 (50%), Gaps = 50/297 (16%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           ++LL+L +V++ A A          V+P + E            + ++K +  K Y T+ 
Sbjct: 1   MMLLILGAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
           E   R  + + N  +     +     +H  T    KF D+   EF ++ +G   ++    
Sbjct: 40  EEYSRRFILEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96

Query: 125 DAQKAPILPT----ND----LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
              K P+L +    ND    LP   DWR+   V+ VKDQG CGSCW+FS TG+LEG H  
Sbjct: 97  ---KKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSN 153

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            TG+LV LSEQQLVDC  +         + GC GGLM+ AF+YI   GG++ E+ YPYT 
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIKANGGLDTEESYPYTA 206

Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           TD   CKFD S + A +  +  V S +E  +   +   GP++    +I+  H SF F
Sbjct: 207 TDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVS---VAIDAGHESFQF 260


>gi|66816665|ref|XP_642342.1| hypothetical protein DDB_G0278401 [Dictyostelium discoideum AX4]
 gi|60470393|gb|EAL68373.1| hypothetical protein DDB_G0278401 [Dictyostelium discoideum AX4]
          Length = 337

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 99/272 (36%), Positives = 142/272 (52%), Gaps = 29/272 (10%)

Query: 29  AMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQ 88
           A++  V  +  E SE    +A   F+ +     K+Y++ E    R+ +FK N    +   
Sbjct: 9   ALLITVATAKQELSESQYRDA---FTDWMISNQKSYSSSE-FITRYNIFKTNFDYIEEWN 64

Query: 89  LLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ-----KAPILPTNDLPTDFDW 143
                 V G+ K +D+T  E+R  +LG       P DA      K  IL +N   +  DW
Sbjct: 65  SKGSETVLGLNKMADITNEEYRSLYLGK------PFDASSLIGTKEEILFSNKFSSTVDW 118

Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLS---TGELVSLSEQQLVDCDHECDPEE 200
           R  GAVT VK+Q +C  CWSFSATGA EGAH L+   T ELVSLSEQ L+DC        
Sbjct: 119 RKKGAVTHVKNQQSCSGCWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPFG--- 175

Query: 201 SGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS 260
               ++GCNGG++  AFEYI+  GG++ EK YP+ GTD G+C++      A +S++  ++
Sbjct: 176 ----NTGCNGGVITYAFEYIISNGGIDTEKSYPFEGTD-GTCRYKSENSGATISSYVNVT 230

Query: 261 SDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
              +    + V   P+A    SI+  H SF F
Sbjct: 231 FGSESSLESAVNVNPVA---CSIDASHSSFLF 259


>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
          Length = 334

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 97/252 (38%), Positives = 137/252 (54%), Gaps = 27/252 (10%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKF-------SDLT 105
           F+LFK    K Y  + E  YR ++F  N +R ++    +     G   F       +D+ 
Sbjct: 27  FTLFKKFHRKEYDNELEESYRKKIFLENKKRIEKH---NSRYKQGKVSFKLKLNHLADML 83

Query: 106 PSEFRRQFLGLNRRLRLPADA-QKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCW 162
             E+   +LG N+  +   +  Q    +P     L  + DWR  GAVT VK+QG CGSCW
Sbjct: 84  IHEYSDVYLGFNKSSKANNNKLQSYTFIPPAHVTLNKEVDWRTKGAVTPVKNQGHCGSCW 143

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYIL 221
           +FS TGALEG +F  TG+LVSLSEQ LVDC        SGS  ++GC GGLM++AF+YI 
Sbjct: 144 AFSTTGALEGQNFRKTGKLVSLSEQNLVDC--------SGSYGNNGCEGGLMDNAFQYIK 195

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNV 280
           +  G++ EK YPY G D  +C+F K+ I A  S F  +   DE+ +   +   GP++   
Sbjct: 196 ENHGIDTEKSYPYEGED-ETCRFRKTSIGATDSGFVDITQGDEEALMQAVATIGPIS--- 251

Query: 281 ASIELPHISFSF 292
            +I+  H SF F
Sbjct: 252 VAIDASHQSFQF 263


>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
 gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 92/242 (38%), Positives = 129/242 (53%), Gaps = 19/242 (7%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEFRRQFL 114
           + +K  K Y   EE   RF++FK N+   +      + + + G+ +F+DLT  EFR  + 
Sbjct: 42  WMAKHGKVYKDDEEKLRRFQIFKNNVEFIESSNAAGNNSYMLGINRFADLTNEEFRASWN 101

Query: 115 GLNRRLRLPADAQK--APILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
           G  R    P DA +   P    N   LP   DWR  GAVT +KDQ  CGSCW+FSA  A 
Sbjct: 102 GYKR----PLDASRIVTPFKYENVTALPYSMDWRRKGAVTSIKDQRECGSCWAFSAVAAT 157

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG H L TG+LVSLSEQ+LVDCD + +       D GC GGLM  AF++I + GG+  E 
Sbjct: 158 EGVHKLRTGKLVSLSEQELVDCDVKGE-------DKGCQGGLMEDAFKFIKRNGGITTEA 210

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISF 290
           +Y Y G DG      ++   A ++ + V+  + +      V H P++    SI+   +SF
Sbjct: 211 NYAYRGRDGKCDTKKEASHVAKITGYQVVPENSEAALLKAVAHQPVS---VSIDAGSMSF 267

Query: 291 SF 292
            F
Sbjct: 268 QF 269


>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
 gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
 gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
 gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
 gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
          Length = 422

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 104/296 (35%), Positives = 151/296 (51%), Gaps = 27/296 (9%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMI-------RQVVPSDGEQSEDHLLNAEHHFSLF 56
           L+ +++ LL+ +S L       DD A+         Q   +  E  E H  +A   FS F
Sbjct: 65  LVAAAVSLLVFASFLIQWQG-EDDRAVFPPSPVEDHQPPANIWEWKEAHFQDA---FSSF 120

Query: 57  KSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL 116
           ++ ++K+YAT+EE   R+ +FK NL           +    +  F DL+  EFRR++LG 
Sbjct: 121 QAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFRRKYLGF 180

Query: 117 NRRLRLPAD-----AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
            +   L +       +   +LP+ +LP   DWR  G VT VKDQ  CGSCW+FS TGALE
Sbjct: 181 KKSRNLKSHHLGVATELLNVLPS-ELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALE 239

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           GAH   TG+LVSLSEQ+L+DC            +  C+GG MN AF+Y+L +GG+  E  
Sbjct: 240 GAHCAKTGKLVSLSEQELMDCSR-------AEGNQSCSGGEMNDAFQYVLDSGGICSEDA 292

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELP 286
           YPY   D   C+    +    +  F  V    E  M A L K  P++  + + ++P
Sbjct: 293 YPYLARD-EECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK-SPVSIAIEADQMP 346


>gi|330793420|ref|XP_003284782.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
 gi|325085276|gb|EGC38686.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
          Length = 347

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 90/247 (36%), Positives = 135/247 (54%), Gaps = 19/247 (7%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           L   + F+ +  +  + YA+ EE   R+ +FKAN+   +         V G+  F+D+T 
Sbjct: 24  LQYRNAFTNWMIQNQRHYAS-EEFAARYNIFKANMDYVQEWNSKGSETVLGLNTFADITN 82

Query: 107 SEFRRQFLG--LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
            EFR  +LG   +    +  + +K    P   +    DWR  GAVT +K+Q  CG CWSF
Sbjct: 83  QEFRSIYLGTPFDGSSIINTETEKIFAAPAASI----DWRTKGAVTPIKNQQQCGGCWSF 138

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKA 223
           S TG+ EGA  ++ G L SLSEQ L+DC        SGS  ++GCNGGLM  AFEYI+  
Sbjct: 139 STTGSTEGATAIAKGNLPSLSEQNLIDC--------SGSYGNNGCNGGLMTLAFEYIINN 190

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
            G++ E  YPYT  DG +CK++ + I A +S++S ++S  +    +    GP++    +I
Sbjct: 191 KGIDTESSYPYTAKDGKTCKYNPANIGATLSSYSNVTSGSEPSLESAANIGPVS---VAI 247

Query: 284 ELPHISF 290
           +  H SF
Sbjct: 248 DASHNSF 254


>gi|119594953|gb|EAW74547.1| cathepsin F, isoform CRA_a [Homo sapiens]
 gi|119594954|gb|EAW74548.1| cathepsin F, isoform CRA_a [Homo sapiens]
          Length = 392

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 99/237 (41%), Positives = 136/237 (57%), Gaps = 12/237 (5%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
           S+D  +     F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTK
Sbjct: 84  SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 143

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
           FSDLT  EFR  +L    R + P +  K      +  P ++DWR  GAVT VKDQG CGS
Sbjct: 144 FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 202

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS TG +EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I
Sbjct: 203 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 253

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
              GG+E E DY Y G    SC F   K    +++   +S +E ++AA L K GP++
Sbjct: 254 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 309


>gi|410045434|ref|XP_003313198.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pan troglodytes]
          Length = 548

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 100/227 (44%), Positives = 134/227 (59%), Gaps = 14/227 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 251 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 310

Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  N  LR  P +  K      +  P ++DWR  GAVT VKDQG CGSCW+FS TG +
Sbjct: 311 IYL--NPLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 368

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I   GG+E E 
Sbjct: 369 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 419

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY Y G    SC F   K    +++  V+S +E ++AA L K GP++
Sbjct: 420 DYSYQG-HMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPIS 465


>gi|229596051|ref|XP_001013456.3| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|225565626|gb|EAR93211.3| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 315

 Score =  156 bits (395), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 87/197 (44%), Positives = 116/197 (58%), Gaps = 19/197 (9%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPS 107
           N +  +S FK+K++K YA  +   YR  +F  NL+  +       T  +G+T+F D+T  
Sbjct: 35  NIQALWSAFKTKYNKKYADPDFERYRIEIFTENLKVVESN-----TKNYGITQFMDITRE 89

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           EF++ +L L  +  L A    +P    ND   + DW   GAVT VKDQG CGSCWSFS T
Sbjct: 90  EFKQTYLTLKMKNGLKA----SPFAKFNDAGVEIDWTTKGAVTPVKDQGQCGSCWSFSTT 145

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           GA+EGA FLST +L SLSEQ LVDC        S   + GCNGGLM++AF++I +  G+ 
Sbjct: 146 GAVEGALFLSTKKLTSLSEQYLVDC--------SKDGNEGCNGGLMDTAFDFISQH-GIP 196

Query: 228 REKDYPYTGTDGGSCKF 244
            E  YPY   D G+CK 
Sbjct: 197 TEAAYPYKAVD-GTCKM 212


>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
 gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
 gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
          Length = 350

 Score =  156 bits (395), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 98/285 (34%), Positives = 151/285 (52%), Gaps = 19/285 (6%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           L+L  S  L  ++A   D +++     S+  +S D L+     F  + S+  K Y T EE
Sbjct: 9   LVLTCSLCLFLSLAFGRDFSIVG--YSSEDLKSMDKLIEL---FESWMSRHGKIYETIEE 63

Query: 70  HDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKA 129
              RF VFK NL+    R  +      G+ +F+DL+  EF+ ++LGL   L    ++ + 
Sbjct: 64  KLLRFEVFKDNLKHIDDRNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDLSQRRESSEE 123

Query: 130 PILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQ 188
                + DLP   DWR  GAVT VK+QG CGSCW+FS   A+EG + + TG L SLSEQ+
Sbjct: 124 EFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQE 183

Query: 189 LVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKS- 247
           L+DCD         + ++GCNGGLM+ AF +I+K GG+ +E+DYPY   +  +C+  K  
Sbjct: 184 LIDCDT--------TYNNGCNGGLMDYAFSFIVKNGGLHKEEDYPYI-MEESTCEMKKEV 234

Query: 248 KIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
                ++ +  +  + +Q     + + PL+    +IE     F F
Sbjct: 235 SEVVTINGYHDVPQNNEQSLLKALANQPLS---VAIEASGRDFQF 276


>gi|328870624|gb|EGG18997.1| cysteine proteinase [Dictyostelium fasciculatum]
          Length = 521

 Score =  156 bits (395), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 92/248 (37%), Positives = 129/248 (52%), Gaps = 27/248 (10%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F+ +  K  ++Y + E  + RF VFK N+             V  +T F+D++  E++R 
Sbjct: 31  FTNWMIKNDRSYQSAEFGN-RFNVFKKNMDYVNEWNSKGSETVLDLTIFADISNEEYQRI 89

Query: 113 FLGLN----------RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           +LG             R+ +  +   AP+          DWR  GAVT +K+QG CGSCW
Sbjct: 90  YLGTKIDATQKLIDAARITMNNNFAAAPVFNAT-----VDWRQKGAVTPIKNQGQCGSCW 144

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           SFS TG+ EGAHFLSTG LVSLSEQ LVDC     PE     + GCNGGLM+ AF YI+K
Sbjct: 145 SFSTTGSTEGAHFLSTGNLVSLSEQNLVDCS---GPEG----NDGCNGGLMDQAFTYIIK 197

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVAS 282
             G++ E  YPY     G C F+   I A ++ ++ + S  +         GP++    +
Sbjct: 198 NKGIDTESSYPYKAVQ-GKCAFNPKNIGATLTGYTDVKSGSESDLEAKANTGPVS---VA 253

Query: 283 IELPHISF 290
           I+  H SF
Sbjct: 254 IDASHNSF 261


>gi|3916212|gb|AAC78838.1| cathepsin F [Homo sapiens]
          Length = 338

 Score =  156 bits (395), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 99/237 (41%), Positives = 136/237 (57%), Gaps = 12/237 (5%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
           S+D  +     F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTK
Sbjct: 30  SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 89

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
           FSDLT  EFR  +L    R + P +  K      +  P ++DWR  GAVT VKDQG CGS
Sbjct: 90  FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 148

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS TG +EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I
Sbjct: 149 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 199

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
              GG+E E DY Y G    SC F   K    +++   +S +E ++AA L K GP++
Sbjct: 200 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 255


>gi|33242884|gb|AAQ01146.1| cathepsin [Petromyzon marinus]
          Length = 333

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 89/234 (38%), Positives = 130/234 (55%), Gaps = 16/234 (6%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL---DPTAVH-GVTKFSDLTPS 107
            +  +KS + K Y +++E  +R  VF+ NL+R  +  LL      + H G+ K+SDL   
Sbjct: 26  QWDTWKSTYGKHYGSEQEDAHRRDVFEQNLKRVLQHNLLADEGNVSFHLGINKYSDLELH 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAP--ILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           E+  + +G    LR     + AP  +   ++LP   DWR  G VT VK+QG CGS W+FS
Sbjct: 86  EYHEKVVGRFWNLRNGTRRRGAPFPLRSMDNLPEQVDWRLKGYVTPVKEQGLCGSSWAFS 145

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATG+LEG HF +TG L SLSEQQLVDC            ++GCNGG    A +YI+   G
Sbjct: 146 ATGSLEGQHFAATGNLTSLSEQQLVDC-------TKSYYNNGCNGGRSERALQYIIDNNG 198

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI--SSDEDQMAANLVKHGPLA 277
           ++ E  YPY   D G C+F  + +A   S++  +  SS+E+ +   +   GP+A
Sbjct: 199 IDSELSYPYEHAD-GKCRFKPANVATKCSSYQFVEPSSNEEVLRQAVASVGPIA 251


>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
          Length = 318

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 90/228 (39%), Positives = 125/228 (54%), Gaps = 20/228 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV----HGVTKF 101
           L N    F  FK K SK+Y+ Q E   R  +F  NLR  +    L    +      V +F
Sbjct: 18  LENVGSTFQSFKLKHSKSYSNQVEEAKRLAIFTENLRDIEEHNALYAAGLVSYNKSVNQF 77

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGS 160
           +DLT  EF+  +L L+ +  L       P + T   +PT  DWR  G VTGVKDQG CGS
Sbjct: 78  TDLTIDEFK-AYLTLHSKPTL----NTVPYVRTGLQVPTTLDWRSQGYVTGVKDQGDCGS 132

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS  G+ EGA++ STG+LVSLSEQQL+DC        + + + GC+GG +   F Y+
Sbjct: 133 CWAFSVVGSTEGAYYKSTGKLVSLSEQQLIDC--------TTNVNDGCDGGYLEETFPYV 184

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAA 268
            + G V  E  YPYTG D G+C+  +S +   VS + ++  + D + A
Sbjct: 185 QQTGLVS-ESSYPYTGRD-GNCRISESDVVTKVSKYVLLGGEADLLEA 230


>gi|225444726|ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
 gi|147826441|emb|CAN62278.1| hypothetical protein VITISV_031382 [Vitis vinifera]
 gi|297738562|emb|CBI27807.3| unnamed protein product [Vitis vinifera]
          Length = 362

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 110/290 (37%), Positives = 158/290 (54%), Gaps = 33/290 (11%)

Query: 1   MERLILSSLLLLLLSSVLASAV-----AVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH- 52
           M RL + + +L+LL +V +        +  D++  IR V  S  D E S   L+    H 
Sbjct: 1   MARLSVVAAVLILLCAVASGEADHHFRSSFDEENPIRLVSDSIRDLESSVLRLIGDTRHA 60

Query: 53  --FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSE 108
             F+ F  ++ K+Y T +E   RF +F  NL+  R+  R+ L  T    V +F+D T  E
Sbjct: 61  HSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNRKGLPYTLA--VNQFADWTWEE 118

Query: 109 FRRQFLGL--NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           FRR  LG   N    L  + +   ++    LP   DWR+ G V+ +KDQG CGSCW+FS 
Sbjct: 119 FRRHRLGAAQNCSATLKGNHKLTDVI----LPETKDWREDGIVSPIKDQGHCGSCWTFST 174

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGG 225
           TGALE A+  + G+ +SLSEQQLVDC        +G+ ++ GC+GGL + AFEYI   GG
Sbjct: 175 TGALEAAYAQAFGKGISLSEQQLVDC--------AGAFNNFGCHGGLPSQAFEYIKYNGG 226

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
           ++ E+ YPYTG D G+CKF    I   V    N ++ + DE + A   V+
Sbjct: 227 LDTEEAYPYTGLD-GTCKFSSENIGVQVLDSVNITLGAEDELKHAVAFVR 275


>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 94/262 (35%), Positives = 144/262 (54%), Gaps = 25/262 (9%)

Query: 39  GEQSEDHLLNAEHHFSLFKSKFS---KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAV 95
           G  SED L + +    LF+S  S   K Y + EE  +RF +FK NL+    R  +     
Sbjct: 32  GYSSED-LKSMDKLIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVSNYW 90

Query: 96  HGVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTG 151
            G+ +F+DL+  EF+ ++LGL    +RR   P +     +    +LP   DWR  GAVT 
Sbjct: 91  LGLNEFADLSHQEFKNKYLGLKVDYSRRRESPEEFTYKDV----ELPKSVDWRKKGAVTQ 146

Query: 152 VKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGG 211
           VK+QG+CGSCW+FS   A+EG + + TG L SLSEQ+L+DCD         + ++GCNGG
Sbjct: 147 VKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDR--------TYNNGCNGG 198

Query: 212 LMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANL 270
           LM+ AF +I++  G+ +E+DYPY   + G+C+  K +     +S +  +  + +Q     
Sbjct: 199 LMDYAFSFIVENDGLHKEEDYPYI-MEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKA 257

Query: 271 VKHGPLAGNVASIELPHISFSF 292
           + + PL+    +IE     F F
Sbjct: 258 LANQPLS---VAIEASGRDFQF 276


>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
          Length = 338

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 99/247 (40%), Positives = 129/247 (52%), Gaps = 28/247 (11%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F+ FK+K+ K Y    E   RF +FKAN+         + T   GV +F+DLT  EF   
Sbjct: 27  FNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEEFAAS 86

Query: 113 FLGLNRRLRLPADAQKA-PILPTND-----LPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           + GL      PA      P L T++     L +  DW   G VT VK+QG CGSCWSFS 
Sbjct: 87  YTGLK-----PASLWSGLPRLSTHEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFST 141

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TGALEGA  LSTG LVSLSEQQ  DCD         + DSGCNGG M++AF +  K   +
Sbjct: 142 TGALEGAWALSTGNLVSLSEQQFEDCD---------TTDSGCNGGWMDNAFSFA-KKNSI 191

Query: 227 EREKDYPYTGTDGGSCKFDKSKIA---AAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
             E  YPYT TD G+C     ++      V  ++ +S+D +Q   + V   P++    +I
Sbjct: 192 CTEGSYPYTATD-GTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVS---IAI 247

Query: 284 ELPHISF 290
           E    SF
Sbjct: 248 EADQYSF 254


>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
          Length = 333

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 94/253 (37%), Positives = 135/253 (53%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVT 99
           DH LN +  + L+K+   K Y   EE  +R  V+K N++  +          H     + 
Sbjct: 22  DHSLNTQ--WELWKAVHRKPYDLNEE-GWRKAVWKKNMKMIELHNQEYSQGKHSFSMAMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F DLT  EFR+   G  R+           I  +  +P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDLTSEEFRQMMNGFQRQENKKGKVFHETIFAS--IPPSVDWREKGYVTPVKNQGKCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FS TGALEG  F  TG+LVSLSEQ LVDC     PE     + GC+GGLM++AF+Y
Sbjct: 137 SCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSQ---PEG----NRGCHGGLMDNAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           +L  GG++ E+ YPYTG   G+C ++    AA  + F  +   E+ +   +   GP++  
Sbjct: 190 VLDVGGLDSEESYPYTGLV-GTCNYNPKNSAANETGFVDLPKQENALMKAVATLGPIS-- 246

Query: 280 VASIELPHISFSF 292
             +++  + SF F
Sbjct: 247 -VAVDASNPSFQF 258


>gi|11359985|pir||T46294 hypothetical protein DKFZp434F0610.1 - human (fragment)
 gi|6808322|emb|CAB70900.1| hypothetical protein [Homo sapiens]
          Length = 308

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 99/237 (41%), Positives = 136/237 (57%), Gaps = 12/237 (5%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTK 100
           S+D  +     F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTK
Sbjct: 16  SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 75

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
           FSDLT  EFR  +L    R + P +  K      +  P ++DWR  GAVT VKDQG CGS
Sbjct: 76  FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 134

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS TG +EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I
Sbjct: 135 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD---------KMDKACMGGLPSNAYSAI 185

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
              GG+E E DY Y G    SC F   K    +++   +S +E ++AA L K GP++
Sbjct: 186 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 241


>gi|74213650|dbj|BAE35627.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
           D   +AE H   +KS   + Y T EE ++R  +++ N+R  +          HG    + 
Sbjct: 22  DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    +P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA+G LEG  FL TG+L+SLSEQ LVDC H          + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           I + GG++ E+ YPY   D GSCK+      A  + F  I   E  +   +   GP++  
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246

Query: 280 VASIELPHISFSF 292
             +++  H S  F
Sbjct: 247 -VAMDASHPSLQF 258


>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
          Length = 360

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 101/257 (39%), Positives = 141/257 (54%), Gaps = 23/257 (8%)

Query: 27  DDAMIRQVVPSDGEQSEDHLLNA---EHH---FSLFKSKFSKTYATQEEHDYRFRVFKAN 80
           D+  IRQ+V     + E+ +L       H   F+ F  ++ K Y T EE   RF VF  N
Sbjct: 29  DENPIRQIVSDGLHELENGILQVVGKTRHALLFARFAHRYGKRYETVEEIKQRFEVFLDN 88

Query: 81  LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPT 139
           L+  +       +   GV +F+D+T  EFRR  LG  +     +   K  +  TN  LP 
Sbjct: 89  LKMIRSHNKKGLSYKLGVNEFTDITWDEFRRDRLGAAQNC---SATTKGNLKLTNVVLPE 145

Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
             DWR+ G V+ VK+QG CGSCW+FS TGALE A+  + G+ +SLSEQQLVDC       
Sbjct: 146 TKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYGQAFGKGISLSEQQLVDC------- 198

Query: 200 ESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SN 255
            +G+ ++ GCNGGL + AFEYI   GG++ E+ YPYTG + G CKF    +   V    N
Sbjct: 199 -AGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKN-GLCKFSSENVGVKVIDSVN 256

Query: 256 FSVISSDEDQMAANLVK 272
            ++ + DE + A  LV+
Sbjct: 257 ITLGAEDELKYAVALVR 273


>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
          Length = 330

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 93/255 (36%), Positives = 136/255 (53%), Gaps = 25/255 (9%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           ++ +++FK +++K Y  +EE   R  V+++NL       L      H    G+ ++ D+T
Sbjct: 24  DNEWNIFKKQYNKLYQNEEEARRRL-VWESNLDFITLHNLAADRGEHTFWVGMNEYGDMT 82

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPI-LPTN---DLPTDFDWRDHGAVTGVKDQGACGSC 161
             EF +   G     R+      AP+ +P N   DLP   DWR  G VT +K+QG CGSC
Sbjct: 83  NEEFTKTMNGY----RMRNKTSNAPVFMPPNNMGDLPDTVDWRPKGYVTPIKNQGQCGSC 138

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           WSFSATG+LEG  F  TG+LVSLSEQ LVDC  +         + GC GGLM+ AF YI 
Sbjct: 139 WSFSATGSLEGQTFKKTGKLVSLSEQNLVDCSKK-------QGNHGCEGGLMDDAFTYIK 191

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNV 280
              G++ E  YPY   D G C+F  + + A  + F  + + DE+ +   +   GP++   
Sbjct: 192 ANNGIDTEASYPYKARD-GKCEFKSADVGATDTGFVDIKTKDEEALKQAVATVGPIS--- 247

Query: 281 ASIELPHISFSFLFT 295
            +I+  H+SF    T
Sbjct: 248 VAIDASHMSFQLYRT 262


>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
          Length = 347

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 91/261 (34%), Positives = 139/261 (53%), Gaps = 24/261 (9%)

Query: 43  EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTK 100
           +D+ L  +     + +K  + YA  +E + R+ VFK N+ R +R   +    T    V +
Sbjct: 29  DDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQ 88

Query: 101 FSDLTPSEFRRQFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKD 154
           F+DLT  EFR  + G      L+ +      + +   + +  LP   DWR  GAVT +K+
Sbjct: 89  FADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQNVSSGALPVSVDWRKKGAVTPIKN 148

Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
           QG CG CW+FSA  A+EGA  +  G+L+SLSEQQLVDCD           D GC+GGLM+
Sbjct: 149 QGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQLVDCDTN---------DFGCSGGLMD 199

Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKH 273
           +AFE+I+  GG+  E +YPY G D  +CK   +K  A +++ +  +  ++++     V H
Sbjct: 200 TAFEHIMATGGLTTESNYPYKGKD-ATCKIKNTKPTATSITGYEDVPVNDEKALMKAVAH 258

Query: 274 GPLAGNVASIELPHISFSFLF 294
            P+     SI +    F F F
Sbjct: 259 QPV-----SIGIEGGGFDFQF 274


>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 456

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 85/212 (40%), Positives = 122/212 (57%), Gaps = 22/212 (10%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+ +      +  + ++  +TY    E + RF VF+ NLR   +        
Sbjct: 27  IVSYGERSEEEV---RRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAG 83

Query: 95  VH----GVTKFSDLTPSEFRRQFLGLN----RRLRLPADAQKAPILPTNDLPTDFDWRDH 146
           +H    G+ +F+DLT  E+R  +LG+     R  RL    Q A      +LP   DWR+ 
Sbjct: 84  LHSFRLGLNRFADLTNEEYRDTYLGVRTKPVRERRLSGRYQAAD---NEELPESVDWREK 140

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAV  VKDQG CGSCW+FSA  A+EG + + TG++++LSEQ+LVDCD         S + 
Sbjct: 141 GAVAKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDT--------SYNQ 192

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GCNGGLM+ AFE+I+  GG++ E+DYPY   D
Sbjct: 193 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERD 224


>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
          Length = 334

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 97/256 (37%), Positives = 136/256 (53%), Gaps = 21/256 (8%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL---DPTAVH-GVTK 100
           +LL  E H  LFK+   K Y +Q E  +R +++  N  +  +  +L      + H  + K
Sbjct: 21  NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYHVAMNK 78

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
           F DL   EFR    G   + +  + A+       P N  +P   DWR+ GA+T VKDQG 
Sbjct: 79  FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVTVPESVDWREKGAITPVKDQGQ 138

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FS+TGALEG  F  TG+LVSLSEQ L+DC  +   E       GCNGGLM+ AF
Sbjct: 139 CGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 191

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
           +YI    G++ E  YPY   D   C+++     A    F  I S +ED++ A +   GP+
Sbjct: 192 QYIKDNKGIDTENTYPYEAED-DVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPV 250

Query: 277 AGNVASIELPHISFSF 292
           +    +I+  H SF F
Sbjct: 251 S---VAIDASHESFQF 263


>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
 gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
           Short=MEP; AltName: Full=p39 cysteine proteinase;
           Contains: RecName: Full=Cathepsin L1 heavy chain;
           Contains: RecName: Full=Cathepsin L1 light chain; Flags:
           Precursor
 gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
 gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
 gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
 gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
 gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
 gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
 gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
 gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
 gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
 gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
 gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
 gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
 gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
          Length = 334

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
           D   +AE H   +KS   + Y T EE ++R  +++ N+R  +          HG    + 
Sbjct: 22  DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    +P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA+G LEG  FL TG+L+SLSEQ LVDC H          + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           I + GG++ E+ YPY   D GSCK+      A  + F  I   E  +   +   GP++  
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246

Query: 280 VASIELPHISFSF 292
             +++  H S  F
Sbjct: 247 -VAMDASHPSLQF 258


>gi|146335580|gb|ABQ23399.1| cathepsin L isotype 2 [Trypanoplasma borreli]
          Length = 443

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 89/248 (35%), Positives = 131/248 (52%), Gaps = 44/248 (17%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           FS FK+  ++ Y +  E   RF +F AN+++A      +P A  G  +F+D++  EF+ +
Sbjct: 25  FSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEFQTR 84

Query: 113 F-----------------LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
                                 +     AD QK             DWR  GAVT VK+Q
Sbjct: 85  HNAARHYAAAKARRAKHTKSFTKEEIKAADGQK------------IDWRLKGAVTSVKNQ 132

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
           G+CGSCWSFS TG +EG + ++TG LVSLSEQ+LV CD         + D+GCNGGLM++
Sbjct: 133 GSCGSCWSFSTTGNIEGQNAIATGNLVSLSEQELVSCD---------TTDNGCNGGLMDN 183

Query: 216 AFEYIL--KAGGVEREKDYPYTGTDG--GSCKF--DKSKIAAAVSNFSVISSDEDQMAAN 269
           AF +++  + G +  E  YPY   +G   +C +  D   + A +SNF  I+  E+ MAA 
Sbjct: 184 AFGWLISTRGGQIATEASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAF 243

Query: 270 LVKHGPLA 277
           +  +GPL+
Sbjct: 244 VFNYGPLS 251


>gi|74222595|dbj|BAE38161.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
           D   +AE H   +KS   + Y T EE ++R  +++ N+R  +          HG    + 
Sbjct: 22  DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    +P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA+G LEG  FL TG+L+SLSEQ LVDC H          + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           I + GG++ E+ YPY   D GSCK+      A  + F  I   E  +   +   GP++  
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246

Query: 280 VASIELPHISFSF 292
             +++  H S  F
Sbjct: 247 -VAMDASHPSLQF 258


>gi|1222695|gb|AAA92019.1| CP4 [Dictyostelium discoideum]
          Length = 442

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 84/234 (35%), Positives = 127/234 (54%), Gaps = 13/234 (5%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           L   + F+ +     +TY++ EE + R+++FK+N+    +        V G+  F+D+T 
Sbjct: 24  LQYRNAFTNWMQAHQRTYSS-EEFNARYQIFKSNMDYVHQWNSKGGETVLGLNVFADITN 82

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            E+R  +LG           ++  I  T   PT  DWR  GAVT +K+QG CG CWSFS 
Sbjct: 83  QEYRTTYLGTPFDGSALIGTEEEKIFST-PAPT-VDWRAQGAVTPIKNQGQCGGCWSFST 140

Query: 167 TGALEGAHFLSTG---ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           TG+ EGAHF+++G   +LVSLSEQ L+DC            ++GC GGLM   FEYI+  
Sbjct: 141 TGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYG-------NNGCEGGLMTLGFEYIINN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G++ E  YPYT  DG  CKF  S I A + ++  ++S  +    +   + P++
Sbjct: 194 KGIDTESSYPYTAEDGKECKFKTSNIGAQIVSYQNVTSGSEASLQSASNNAPVS 247


>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 436

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 88/251 (35%), Positives = 134/251 (53%), Gaps = 22/251 (8%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+ +      ++ + ++   TY    E + RF  F+ NLR   +        
Sbjct: 28  IVSYGERSEEEV---RRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAG 84

Query: 95  VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
           VH    G+ +F+DLT  E+R  +LG     +R  +L A  Q A     ++LP   DWR  
Sbjct: 85  VHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAAD---NDELPESVDWRKK 141

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAV  VKDQG CGSCW+FSA  A+EG + + TG+++ LSEQ+LVDCD         S + 
Sbjct: 142 GAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQ 193

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLM+ AFE+I+  GG++ E+DYPY   D       K+     +  +  +  + ++ 
Sbjct: 194 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKS 253

Query: 267 AANLVKHGPLA 277
               V + P++
Sbjct: 254 LQKAVANQPIS 264


>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
 gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
          Length = 335

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 89/239 (37%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLD-PTAVHGVTKF 101
           +L  A  +F  F   ++K Y +  E + R+ +FK NL    AK     D PTA +G+ KF
Sbjct: 27  NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYGINKF 86

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACG 159
           SDL+ SE   +F GL+   R  ++  K  +L  P +  P  FDWR+   VT +K+QGACG
Sbjct: 87  SDLSKSELIAKFTGLSIPQR-ASNFCKTIVLNQPPDKGPLHFDWREQNKVTSIKNQGACG 145

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           +CW+F+   ++E    +    LV LSEQQL+DCD         S D GCNGGL+++AFE 
Sbjct: 146 ACWAFATLASVESQFAMRHNRLVDLSEQQLIDCD---------SVDMGCNGGLLHTAFEE 196

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPL 276
           I++ GGV+ E DYP+ G D   C  D+ +  + + V  +  +  +E+++   L   GP+
Sbjct: 197 IIRMGGVQAELDYPFVGRD-RRCGVDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPI 254


>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
          Length = 443

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 90/242 (37%), Positives = 132/242 (54%), Gaps = 32/242 (13%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK+  ++ YA+ +E   RF +F  N+++A      +P A  G  +F+D+T  EF+ +
Sbjct: 25  FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNPMATFGPNEFADMTSEEFQTR 84

Query: 113 F----LGLNRRLRLPADAQ-------KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
                     + R P + +       KA +          DWR  GAVT VK+QGACGSC
Sbjct: 85  HNAARHYAAAKARPPKNTKTFTAEEIKAAV------GQQIDWRLKGAVTPVKNQGACGSC 138

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           WSFS TG +EG H ++TG+LV++SEQ+LV CD           D GCNGGLM++AF +++
Sbjct: 139 WSFSTTGNIEGQHAIATGQLVAVSEQELVSCD---------PIDDGCNGGLMDNAFGWLI 189

Query: 222 KA--GGVEREKDYPYTGTDG----GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
            A  G +  E +YPY   +G     S   +   + A +S F  I+  E+ MAA + KHGP
Sbjct: 190 SAHKGQIATEANYPYVSGNGIVPACSSSPESKPVGATISAFQDIARTEEDMAAFVFKHGP 249

Query: 276 LA 277
           L+
Sbjct: 250 LS 251


>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
 gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
          Length = 480

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 96/275 (34%), Positives = 147/275 (53%), Gaps = 29/275 (10%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE++++    A   ++ + +   +TY      + R++VF+ NLR            
Sbjct: 29  IVSYGERTDEE---ARRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAG 85

Query: 95  VH----GVTKFSDLTPSEFRRQFLGLNRR----LRLPADAQKAPILPTNDLPTDFDWRDH 146
           VH    G+ +F+DLT  E+   +LG   R     +L A    A      DLP   DWR  
Sbjct: 86  VHSFRLGLNRFADLTNDEYPATYLGARTRPQRDRKLGARYHAAD---NEDLPESVDWRAK 142

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAV  VKDQG+CG+CW+FS   A+EG + + TG+L+SLSEQ+LVDCD         S + 
Sbjct: 143 GAVAEVKDQGSCGTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNQ 194

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLM+ AFE+I+  GG++ EKDYPY GTDG      K+     + ++  + +++++ 
Sbjct: 195 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 254

Query: 267 AANLVKHGPLAGNVASIELPHISF----SFLFTVS 297
               V + P++    +IE    +F    S +FT S
Sbjct: 255 LQKAVANQPVS---VAIEAAGTAFQLYSSGIFTGS 286


>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
          Length = 458

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 87/248 (35%), Positives = 134/248 (54%), Gaps = 16/248 (6%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+    A   ++ +K++  K+Y    E + R+  F+ NLR            
Sbjct: 25  IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81

Query: 95  VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
           VH    G+ +F+DLT  E+R  +LGL  + R         +   N+ LP   DWR  GAV
Sbjct: 82  VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
             +KDQG CGSCW+FSA  A+E  + + TG+L+SLSEQ+LVDCD         S + GCN
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
           GGLM+ AF++I+  GG++ E DYPY G D       K+     + ++  ++ + +     
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQK 253

Query: 270 LVKHGPLA 277
            V++ P++
Sbjct: 254 AVRNQPVS 261


>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
          Length = 301

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 98/240 (40%), Positives = 130/240 (54%), Gaps = 20/240 (8%)

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGL 116
           SK Y  +EE  +R  V++ NL++ +   L      H    G+  F D+T  EFR+   G 
Sbjct: 1   SKKYHEKEE-GWRRMVWEKNLKKIEMHNLEHSMGTHSYRLGMNHFGDMTHEEFRQIMNGY 59

Query: 117 NRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
            R+ +       +  +  N L  P   DWRD+G VT VKDQG CGSCW+FS TGALEG H
Sbjct: 60  KRKPQRKFTG--SLFMEPNFLEAPRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQH 117

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
           F  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+YI    G++ E  YPY
Sbjct: 118 FRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDNQGLDSEDSYPY 170

Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIELPHISFSFL 293
            GTD   C +D    +A  + F  I S +++     V   GP++    +I+  H SF F 
Sbjct: 171 LGTDDQPCHYDPKYNSANDTGFVDIPSGKERALMKAVAAVGPVS---VAIDAGHESFQFY 227


>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
          Length = 707

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 85/227 (37%), Positives = 125/227 (55%), Gaps = 11/227 (4%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  + SK  K Y + EE  +RF VF+ NL     R     +   G+ +F+DL+  EF+ +
Sbjct: 404 FESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSSYWLGLNEFADLSHEEFKSK 463

Query: 113 FLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
           +LGL        D + +       DLP   DWR  GAVT VK+QGACGSCW+FS   A+E
Sbjct: 464 YLGLRAEFPRSRDYSGEFRYRDVADLPESVDWRKKGAVTHVKNQGACGSCWAFSTVAAVE 523

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           G + + TG L +LSEQ+L+DCD         + +SGCNGGLM+ AF +I   GG+ +E D
Sbjct: 524 GINQIVTGNLTTLSEQELIDCDT--------TFNSGCNGGLMDYAFAFIASNGGLHKEDD 575

Query: 232 YPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   + G+C+  K  +    +S +  +   +++     + H PL+
Sbjct: 576 YPYL-MEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLS 621


>gi|33242882|gb|AAQ01145.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 100/297 (33%), Positives = 147/297 (49%), Gaps = 50/297 (16%)

Query: 9   LLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQE 68
           +LLL+L +V++ A A          V+P + E            + ++K +  K Y T+ 
Sbjct: 1   MLLLILGAVISMATA---------GVLPHNKE------------WEMWKLQHGKQYETEA 39

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSEFRRQFLGLNRRLRLPA 124
           E   R  +F+ N  +     +     +H  T    KF D+   EF ++ +G   ++    
Sbjct: 40  EEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIV--- 96

Query: 125 DAQKAPILPT--------NDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
              K P+L +          LP   DWR+   V+ VKDQG CG CW+FS TG+LEG H  
Sbjct: 97  ---KKPLLGSEVGDSDDNGTLPKSVDWRNSHMVSEVKDQGECGPCWAFSTTGSLEGQHSN 153

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            TG+LV LSEQQLVDC  +         + GC GGLM+ AF+YI   GG++ E+ YPYT 
Sbjct: 154 KTGKLVDLSEQQLVDCSKDFG-------NQGCGGGLMDQAFQYIPANGGLDTEESYPYTA 206

Query: 237 TDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           TD   CKFD S + A +  +  V S +E  +   +   GP++    +I+  H SF F
Sbjct: 207 TDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVS---VAIDAGHESFQF 260


>gi|407844577|gb|EKG02025.1| cysteine peptidase, putative,cysteine peptidase, clan CA, family
           C1, cathepsin L-like, putative, partial [Trypanosoma
           cruzi]
          Length = 308

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 97/256 (37%), Positives = 128/256 (50%), Gaps = 27/256 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ANL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 65  QFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHANFGVTPFSDLTREEFRS 124

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P D +          P   DWR+ GAVT VK+QG CGSCW+F
Sbjct: 125 RYQNGAAHFAAAQERARVPVDVEVV------GAPAAKDWREEGAVTAVKNQGMCGSCWAF 178

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--K 222
           +A G +EG  FL+   L  LSEQ LV CD+          +SGC GG    AF++I+   
Sbjct: 179 AAIGNIEGQWFLAGNPLTRLSEQMLVSCDNT---------NSGCGGGSPFRAFKWIVDRN 229

Query: 223 AGGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
            G V  E  YPY    G    CK     + A +S +  I SDE ++AA L   GPL+  V
Sbjct: 230 NGAVYTEDSYPYHSCIGIKLPCKDSDRTVGATISGYVTIPSDEKRIAAVLAVKGPLSVAV 289

Query: 281 ASIELPHISFSFLFTV 296
            +    H +   +FT+
Sbjct: 290 DASSWMHYT-GGVFTI 304


>gi|343473977|emb|CCD14279.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 88/233 (37%), Positives = 123/233 (52%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQG C S W+F+  G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD         + D GC  G M++AF++I+    G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TNDLGCRAGFMDTAFKWIVSPNDGNV 208

Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY    G   +C      + A + +   I  +E+ +A  L K+GP+A
Sbjct: 209 FTEQSYPYASGGGNVPACNKSGKVVGANIRDHVHILDNENAIAEWLAKNGPVA 261


>gi|146335578|gb|ABQ23398.1| cathepsin L isotype 1 [Trypanoplasma borreli]
          Length = 443

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 89/248 (35%), Positives = 131/248 (52%), Gaps = 44/248 (17%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           FS FK+  ++ Y +  E   RF +F AN+++A      +P A  G  +F+D++  EF+ +
Sbjct: 25  FSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNRKNPMATFGPNEFADMSSEEFQTR 84

Query: 113 F-----------------LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
                                 +     AD QK             DWR  GAVT VK+Q
Sbjct: 85  HNAARHYAAAKARRAKHTKSFTKEEIKAADGQK------------IDWRLKGAVTSVKNQ 132

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
           G+CGSCWSFS TG +EG + ++TG LVSLSEQ+LV CD         + D+GCNGGLM++
Sbjct: 133 GSCGSCWSFSTTGNIEGQNAIATGNLVSLSEQELVSCD---------TTDNGCNGGLMDN 183

Query: 216 AFEYIL--KAGGVEREKDYPYTGTDG--GSCKF--DKSKIAAAVSNFSVISSDEDQMAAN 269
           AF +++  + G +  E  YPY   +G   +C +  D   + A +SNF  I+  E+ MAA 
Sbjct: 184 AFGWLISTRGGQIATEASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAF 243

Query: 270 LVKHGPLA 277
           +  +GPL+
Sbjct: 244 VFNYGPLS 251


>gi|66815417|ref|XP_641725.1| cysteine proteinase [Dictyostelium discoideum AX4]
 gi|74844418|sp|Q94503.1|CYSP6_DICDI RecName: Full=Cysteine proteinase 6; Flags: Precursor
 gi|1644500|gb|AAC47481.1| cysteine proteinase [Dictyostelium discoideum]
 gi|60469754|gb|EAL67741.1| cysteine proteinase [Dictyostelium discoideum AX4]
          Length = 434

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 90/235 (38%), Positives = 128/235 (54%), Gaps = 26/235 (11%)

Query: 66  TQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPAD 125
           + EE + RF +FKAN+             V G+  F+D+T  E+R  +LG       P D
Sbjct: 42  SSEEFNGRFNIFKANMDYINEWNTKGSETVLGLNVFADITNEEYRATYLGT------PFD 95

Query: 126 AQKAPILPTNDL-----PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG- 179
           A    + P+  +         DWR  GAVT +K+QG CG CWSFSATGA EGA +++ G 
Sbjct: 96  ASSLEMTPSEKVFGGVQANSVDWRAKGAVTPIKNQGECGGCWSFSATGATEGAQYIANGD 155

Query: 180 -ELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKAGGVEREKDYPYTGT 237
            +L S+SEQQL+DC        SGS  ++GC GGLM  AFEYI+  GG++ E  YP+T  
Sbjct: 156 SDLTSVSEQQLIDC--------SGSYGNNGCEGGLMTLAFEYIINNGGIDTESSYPFT-A 206

Query: 238 DGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           +   CK++ S I A +S++  ++S  +   A  V  GP +    +I+    SF F
Sbjct: 207 NTEKCKYNPSNIGAELSSYVNVTSGSESDLAAKVTQGPTS---VAIDASQPSFQF 258


>gi|6042196|ref|NP_003784.2| cathepsin F precursor [Homo sapiens]
 gi|12643325|sp|Q9UBX1.1|CATF_HUMAN RecName: Full=Cathepsin F; Short=CATSF; Flags: Precursor
 gi|4731642|gb|AAD26616.2|AF088886_1 cathepsin F precursor [Homo sapiens]
 gi|5305722|gb|AAD41790.1|AF132894_1 cathepsin F [Homo sapiens]
 gi|4826528|emb|CAB42883.1| cysteine proteinase [Homo sapiens]
 gi|15079738|gb|AAH11682.1| Cathepsin F [Homo sapiens]
 gi|22209085|gb|AAH36451.1| Cathepsin F [Homo sapiens]
 gi|61363874|gb|AAX42458.1| cathepsin F [synthetic construct]
 gi|123993139|gb|ABM84171.1| cathepsin F [synthetic construct]
 gi|189053904|dbj|BAG36411.1| unnamed protein product [Homo sapiens]
          Length = 484

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 101/238 (42%), Positives = 137/238 (57%), Gaps = 14/238 (5%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
           S+D  +     F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTK
Sbjct: 176 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 235

Query: 101 FSDLTPSEFRRQFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
           FSDLT  EFR  +L  N  LR  P +  K      +  P ++DWR  GAVT VKDQG CG
Sbjct: 236 FSDLTEEEFRTIYL--NTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCG 293

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FS TG +EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  
Sbjct: 294 SCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSA 344

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           I   GG+E E DY Y G    SC F   K    +++   +S +E ++AA L K GP++
Sbjct: 345 IKNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 401


>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
          Length = 345

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 88/252 (34%), Positives = 139/252 (55%), Gaps = 18/252 (7%)

Query: 41  QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
           +S+  L  +E H  L+ S+  + Y  + E   RF +FK N++  +   +  + +   G+ 
Sbjct: 28  RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86

Query: 100 KFSDLTPSEFRRQFLGLN------RRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVK 153
           +F+D+T  EF  +F GLN          +P+   K   L  +D+P++ DWR+ GAVT VK
Sbjct: 87  EFADITSEEFLAKFTGLNIPNSYLSPSPMPSTEFKINDLSDDDMPSNLDWRESGAVTQVK 146

Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
           +QG CG CW+FSA G+LEGA+ ++TG L+  SEQ+L+DC          + + GCNGG M
Sbjct: 147 NQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT---------TNNYGCNGGFM 197

Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
            +AF++I++ GG+ RE DY Y G    +C+      A  +SN+ V+   E  +   + K 
Sbjct: 198 TNAFDFIIENGGISRESDYEYLGQQ-YTCRSQGKTAAVQISNYQVVPEGETSLLQAVTKQ 256

Query: 274 GPLAGNVASIEL 285
               G  AS +L
Sbjct: 257 PVSIGIAASHDL 268


>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
 gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
          Length = 325

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 91/249 (36%), Positives = 135/249 (54%), Gaps = 19/249 (7%)

Query: 51  HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTP 106
           + +  FK+++ K Y + +E  YR  V++ N              +   T    +F D+T 
Sbjct: 20  NEWQQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTT 79

Query: 107 SEFRRQFLG-LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
            E      G L+   ++P      P++  ++LP   DWRD GAVT VKDQ ACGSCW+FS
Sbjct: 80  EEINAAMNGFLSAGKKVPRGTMYQPLV--DELPDTVDWRDKGAVTPVKDQKACGSCWAFS 137

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATG+LEG HFLSTG+LVSLSEQ LVDC  +         + GC GGLM++AF YI    G
Sbjct: 138 ATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYG-------NFGCGGGLMDNAFRYIKDNNG 190

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIE 284
           ++ E+ YPY   + G C+F+   + A +S++  I    ED +   + + GP++    +I+
Sbjct: 191 IDTEESYPYEAKN-GPCRFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVS---VAID 246

Query: 285 LPHISFSFL 293
               +F F 
Sbjct: 247 ASTSTFHFY 255


>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
           variegatum]
          Length = 337

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 96/257 (37%), Positives = 134/257 (52%), Gaps = 22/257 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKF 101
           L+ AE  +S FK+   K Y ++ E  YR +++  N +  A+  +      V     + ++
Sbjct: 24  LVGAE--WSAFKALHGKEYQSETEEYYRLKIYMENRMMIARHNEKYANNKVSYKLAMNEY 81

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT----NDLPTDFDWRDHGAVTGVKDQGA 157
            D+   EF     G  R  R         I P       LP   DWR  GAVT VK+QG 
Sbjct: 82  GDMLHHEFVSTRNGFRRDYRSKPRQGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQ 141

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FS TG+LEG HF  +G++VSLSEQ LVDC        +   ++GC GGLM++AF
Sbjct: 142 CGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDC-------STAFGNNGCEGGLMDNAF 194

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPL 276
           +YI   GG++ EK YPY GTD G+C F KS + A  + F  I    + +    V   GP+
Sbjct: 195 KYIKANGGIDTEKSYPYNGTD-GTCHFKKSDVGATDTGFVDIPEGNEHLLKKAVATVGPI 253

Query: 277 AGNVASIELPHISFSFL 293
           +    +I+  H SF F 
Sbjct: 254 S---VAIDASHQSFQFY 267


>gi|54696066|gb|AAV38405.1| cathepsin F [synthetic construct]
          Length = 485

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 99/237 (41%), Positives = 136/237 (57%), Gaps = 12/237 (5%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
           S+D  +     F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTK
Sbjct: 176 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 235

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
           FSDLT  EFR  +L    R + P +  K      +  P ++DWR  GAVT VKDQG CGS
Sbjct: 236 FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 294

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS TG +EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I
Sbjct: 295 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 345

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
              GG+E E DY Y G    SC F   K    +++   +S +E ++AA L K GP++
Sbjct: 346 KNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSMELSQNEQKLAAWLAKRGPIS 401


>gi|343477446|emb|CCD11725.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 88/233 (37%), Positives = 123/233 (52%), Gaps = 14/233 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F+ FK K+S++Y    E  +RFRVFK N+ RAK     +P A  GVT+FSD++P EF
Sbjct: 38  QQQFAAFKQKYSRSYRDATEEAFRFRVFKQNMERAKEEAAANPYATFGVTRFSDMSPEEF 97

Query: 110 RRQFL-GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           R  +  G           +K   + T   P   DWR  GAVT VKDQG C S W+F+  G
Sbjct: 98  RATYHNGAEYYAAALKRPRKVVNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGV 226
            +EG   ++  EL SLSEQ LV CD         + D GC  G M++AF++I+    G V
Sbjct: 158 NIEGQWKIAGHELTSLSEQMLVSCD---------TNDLGCRAGFMDTAFKWIVSPNDGNV 208

Query: 227 EREKDYPYTGTDGG--SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E+ YPY    G   +C      + A + +   I  +E+ +A  L K+GP+A
Sbjct: 209 FTEQSYPYASGGGNVPACNKSGKVVGANIDDHVHILDNENAIAEWLAKNGPVA 261


>gi|14422331|emb|CAC41636.1| early leaf senescence abundant cysteine protease [Pisum sativum]
          Length = 350

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 102/267 (38%), Positives = 150/267 (56%), Gaps = 24/267 (8%)

Query: 8   SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTY 64
           SLL++L     A+A     D   IR V  SD E+    ++    H   F+ F +++ K Y
Sbjct: 5   SLLIVLFCVASAAAGFSFHDSNPIRMV--SDVEEQLLQVIGESRHAVSFARFANRYGKRY 62

Query: 65  ATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
            + +E   RF++F  N+   R + +R+L   +   GV  F+D T  EFR   LG  +   
Sbjct: 63  DSVDEMKLRFKIFSENIELIRSSNKRRL---SYKLGVNHFADWTWEEFRSHRLGAAQNC- 118

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
             A  +    +   +LP + DWR  G V+GVKDQG+CGSCW+FS TGALE A+  + G+ 
Sbjct: 119 -SATLKGNHKITDANLPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALESAYAQAFGKN 177

Query: 182 VSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           +SLSEQQLVDC        +G+ ++ GC+GGL + AFEYI   GG+E E+ YPYTG++ G
Sbjct: 178 ISLSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYIKYNGGLETEEAYPYTGSN-G 228

Query: 241 SCKFDKSKIAAAV-SNFSVISSDEDQM 266
            CKF    +A  V  + ++    ED++
Sbjct: 229 LCKFRSEHVAVKVLGSVNITLGAEDEL 255


>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
           vulgaris gb|U52970 and is a member of the papain
           cysteine protease family PF|00112 [Arabidopsis thaliana]
 gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 343

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 88/223 (39%), Positives = 128/223 (57%), Gaps = 13/223 (5%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F  +    SK Y  ++E   RF ++++N++       L         +F+D+T SEF
Sbjct: 40  KQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEF 99

Query: 110 RRQFLGLNRR-LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +  FLGLN   LRL    Q+    P  ++P   DWR  GAVT +++QG CG CW+FSA  
Sbjct: 100 KAHFLGLNTSSLRLHKK-QRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVA 158

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           A+EG + + TG LVSLSEQQL+DCD        G+ + GC+GGLM +AFE+I   GG+  
Sbjct: 159 AIEGINKIKTGNLVSLSEQQLIDCD-------VGTYNKGCSGGLMETAFEFIKTNGGLAT 211

Query: 229 EKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDED--QMAA 268
           E DYPYTG + G+C  +KSK     +  +  ++ +E   Q+AA
Sbjct: 212 ETDYPYTGIE-GTCDQEKSKNKVVTIQGYQKVAQNEASLQIAA 253


>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
          Length = 338

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 95/256 (37%), Positives = 136/256 (53%), Gaps = 21/256 (8%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTK 100
           +LL  E H  LFK+   K Y +Q E  +R +++  N  +  +  +L    + +    + K
Sbjct: 25  NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNK 82

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
           F DL   EFR    G   + +  + A+       P N ++P   DWR  GA+T VKDQG 
Sbjct: 83  FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWRVKGAITPVKDQGQ 142

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FS+TGALEG  F  TG+L+SLSEQ L+DC  +   E       GCNGGLM+ AF
Sbjct: 143 CGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNE-------GCNGGLMDQAF 195

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
           +YI    G++ E  YPY   D   C+++     A    F  I S +ED++ A +   GP+
Sbjct: 196 QYIKDNKGIDTENTYPYEAED-NVCRYNPRNRGAIDRGFVHIPSGEEDKLKAAVATVGPV 254

Query: 277 AGNVASIELPHISFSF 292
           +    +I+  H SF F
Sbjct: 255 S---VAIDASHESFQF 267


>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
          Length = 360

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 101/253 (39%), Positives = 139/253 (54%), Gaps = 23/253 (9%)

Query: 31  IRQVVPSDGEQSEDHLLN----AEHHFSL--FKSKFSKTYATQEEHDYRFRVFKANLRRA 84
           IRQVV     + E+ +L     + H  S   F  ++ K Y + EE   RF VF  NL+  
Sbjct: 33  IRQVVSDGLHELENGILQVVGQSRHALSFVRFAHRYGKRYESVEEIKQRFEVFLDNLKMI 92

Query: 85  KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDW 143
           +       +   GV +F+DLT  EFRR  LG  +     +   K  +  TN  LP   DW
Sbjct: 93  RSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQNC---SATTKGNVKLTNAVLPETKDW 149

Query: 144 RDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS 203
           R+ G V+ VK+QG CGSCW+FS TGALE A+  + G+ +SLSEQQLVDC        +G+
Sbjct: 150 REDGIVSPVKNQGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDC--------AGA 201

Query: 204 CDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV---SNFSVI 259
            ++ GCNGGL + AFEYI   GG++ E+ YPYTG + G CKF    +   V    N ++ 
Sbjct: 202 FNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKN-GLCKFSSENVGVKVIDSVNITLG 260

Query: 260 SSDEDQMAANLVK 272
           + DE + A  LV+
Sbjct: 261 AEDELKYAVALVR 273


>gi|395851695|ref|XP_003798388.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Otolemur garnettii]
          Length = 491

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 98/243 (40%), Positives = 140/243 (57%), Gaps = 14/243 (5%)

Query: 37  SDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAV 95
           + G  S+D  +     F  F + +++TY ++EE  +R  +F  N+ RA++ Q LD  TA 
Sbjct: 178 NKGPLSKDFSMQMLSVFKNFLTTYNRTYESKEETQWRLSIFINNMVRAQKIQALDQGTAR 237

Query: 96  HGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKD 154
           +G+TKFSDLT  EFR  +L  N  LR     +     P  D  P ++DWR+ GAVT VK+
Sbjct: 238 YGITKFSDLTEEEFRTIYL--NPLLREDPGKKMRVAKPVGDPAPPEWDWRNKGAVTNVKN 295

Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
           QG CGSCW+FS TG +EG  FL  G L+SLSEQ+L+DCD           D  C GGL +
Sbjct: 296 QGMCGSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCDK---------MDKACLGGLPS 346

Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHG 274
           +A+  I   GG+E E+DY Y G    +C F   K    +++   +S +E ++AA L K G
Sbjct: 347 NAYSAIKNLGGLETEEDYSYQG-QMQACNFSAEKAKVYINDSVELSHNEQKLAAWLAKKG 405

Query: 275 PLA 277
           P++
Sbjct: 406 PIS 408


>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
          Length = 428

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 90/242 (37%), Positives = 132/242 (54%), Gaps = 32/242 (13%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK+  ++ YA+ +E   RF +F  N+++A      +P A  G  +F+D+T  EF+ +
Sbjct: 10  FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNPMATFGPNEFADMTSEEFQTR 69

Query: 113 F----LGLNRRLRLPADAQ-------KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
                     + R P + +       KA +          DWR  GAVT VK+QGACGSC
Sbjct: 70  HNAARHYAAAKARPPKNTKTFTAEEIKAAV------GQQIDWRLKGAVTPVKNQGACGSC 123

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           WSFS TG +EG H ++TG+LV++SEQ+LV CD           D GCNGGLM++AF +++
Sbjct: 124 WSFSTTGNIEGQHAIATGQLVAVSEQELVSCD---------PIDDGCNGGLMDNAFGWLI 174

Query: 222 KA--GGVEREKDYPYTGTDG----GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
            A  G +  E +YPY   +G     S   +   + A +S F  I+  E+ MAA + KHGP
Sbjct: 175 SAHKGQIATEANYPYVSGNGIVPACSSSPESKPVGATISAFQDIARTEEDMAAFVFKHGP 234

Query: 276 LA 277
           L+
Sbjct: 235 LS 236


>gi|426252094|ref|XP_004019753.1| PREDICTED: cathepsin F isoform 1 [Ovis aries]
          Length = 460

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 100/234 (42%), Positives = 133/234 (56%), Gaps = 28/234 (11%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F + +++TY +QEE  +R  VF  N+ RA++ Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 163 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 222

Query: 112 QFL--------GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            +L        G N RL  P          T+  P  +DWR+ GAVT VKDQG CGSCW+
Sbjct: 223 IYLNPLLKDAPGRNMRLAQPV---------TDVPPPQWDWRNKGAVTDVKDQGMCGSCWA 273

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TG +EG  FL  G L+SLSEQ+L+DCD           D  C GGL ++A+  I   
Sbjct: 274 FSVTGNVEGQWFLKRGTLLSLSEQELLDCDKT---------DKACLGGLPSNAYSAIRTL 324

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           GG+E E DY Y G    +C F   K    +++   +S +E ++AA L K GP++
Sbjct: 325 GGLETEDDYSYRG-HLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKKGPIS 377


>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 445

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 86/215 (40%), Positives = 121/215 (56%), Gaps = 11/215 (5%)

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRRQFL-GLNRR 119
           K Y    E D RF +F  NL+  +    +   +   G+T+F+DLT  EFR  +L     R
Sbjct: 46  KNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFADLTNEEFRAIYLRSKMER 105

Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
            R    +++      + LP + DWR  GAV  VKDQG+CGSCW+FSA GA+EG + + TG
Sbjct: 106 TRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPVKDQGSCGSCWAFSAIGAVEGINQIKTG 165

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
           ELVSLSEQ+LVDCD         S ++GC GGLM+ AF++I+  GG++ E+DYPYT TD 
Sbjct: 166 ELVSLSEQELVDCDT--------SYNNGCGGGLMDYAFQFIISNGGIDTEEDYPYTATDD 217

Query: 240 GSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKH 273
             C  DK       +  +  +  +E+ +   L   
Sbjct: 218 NICNTDKKNTRVVTIDGYEDVPENENSLKKALANQ 252


>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
          Length = 329

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 92/232 (39%), Positives = 127/232 (54%), Gaps = 17/232 (7%)

Query: 51  HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTP 106
           + + L+K+ + K+Y T EE  YR   ++ N    K       +  HG T     F DLT 
Sbjct: 25  NEWELWKATYGKSYLTLEEEKYRRDTWEENSLLIKTHNT--DSDKHGYTLEMNSFGDLTS 82

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           +EF   + G  + L        + +   N +P+  DWRD   VT VK+QG CGSCW+FS 
Sbjct: 83  AEFSSLYNGYRQNLETSGSVFSSSL--RNAMPSSLDWRDKKVVTDVKNQGKCGSCWAFST 140

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TG+LEG H L TG LVSLSEQQL+DC  +         ++GC+GG M SAF+YI  AGG 
Sbjct: 141 TGSLEGLHALKTGHLVSLSEQQLMDCSVKYG-------NNGCDGGNMRSAFQYIKDAGGD 193

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
           + E+ YPYT  +  SC+FD  K+ A    +  I S DE  +   L + GP++
Sbjct: 194 DTEESYPYTAKN-ESCRFDPKKVGATDEGYVRIPSGDEVSLMHALYEVGPIS 244


>gi|3916214|gb|AAC78839.1| cathepsin F [Homo sapiens]
          Length = 302

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 132/227 (58%), Gaps = 14/227 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
           F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 5   FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 64

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  N  LR     +        DL P ++DWR  GAVT VKDQG CGSCW+FS TG +
Sbjct: 65  IYL--NTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 122

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I   GG+E E 
Sbjct: 123 EGQWFLNQGTLLSLSEQELLDCD---------KMDKACMGGLPSNAYSAIKNLGGLETED 173

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY Y G    SC F   K    +++   +S +E ++AA L K GP++
Sbjct: 174 DYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 219


>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
 gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
 gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
 gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
          Length = 334

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
           D   +AE H   +KS   + Y T EE ++R  +++ N+R  +          HG    + 
Sbjct: 22  DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRIIQLHNGEYSNGQHGFSMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    +P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA+G LEG  FL TG+L+SLSEQ LVDC H          + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           I + GG++ E+ YPY   D GSCK+      A  + F  I   E  +   +   GP++  
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246

Query: 280 VASIELPHISFSF 292
             +++  H S  F
Sbjct: 247 -VAMDASHPSLQF 258


>gi|16076439|emb|CAC94444.1| cysteine proteinase [Betula pendula]
          Length = 133

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 72/85 (84%), Positives = 80/85 (94%)

Query: 193 DHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAA 252
           DHECDPEE G+CDSGC+GGLM +AFEY LKAGG+EREKDYPYTGTD GSCKFDKSKIAA+
Sbjct: 1   DHECDPEEYGACDSGCSGGLMTTAFEYTLKAGGLEREKDYPYTGTDRGSCKFDKSKIAAS 60

Query: 253 VSNFSVISSDEDQMAANLVKHGPLA 277
           VSNFSV+S DEDQ+AANLVK+GPLA
Sbjct: 61  VSNFSVVSIDEDQIAANLVKNGPLA 85


>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
          Length = 342

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 93/253 (36%), Positives = 127/253 (50%), Gaps = 23/253 (9%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           + H+ L+K  + K Y  + E   R  +++ NL+      L     +H    G+    D+T
Sbjct: 36  DRHWDLWKKTYGKQYKEKNEEGVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMT 95

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
             E       L   LR+P+  Q+       P   LP   DWRD G VT VK QG+CGSCW
Sbjct: 96  SEEVT----ALMSSLRVPSQWQRNVTYKSNPNQKLPDSVDWRDKGCVTDVKYQGSCGSCW 151

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA GALE    L TG+LVSLS Q LVDC            + GCNGG M  AF+YI+ 
Sbjct: 152 AFSAVGALEAQVKLKTGKLVSLSAQNLVDC------SVGKYSNRGCNGGFMTEAFQYIID 205

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD-EDQMAANLVKHGPLAGNVA 281
             G+E E  YPY   D G C++D    AA  S ++ +  D ED +   +   GP++    
Sbjct: 206 NNGIESEASYPYKAMD-GKCQYDSKYRAATCSRYTELPEDSEDALKEAVANKGPVS---V 261

Query: 282 SIELPHISFSFLF 294
           +I+  H SF FL+
Sbjct: 262 AIDASHPSF-FLY 273


>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 104/281 (37%), Positives = 143/281 (50%), Gaps = 23/281 (8%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
           + LS+L+L LL+    S+          RQ+   D   + D  + + H    + ++  +T
Sbjct: 1   MSLSTLILALLA---MSSAVAAPRALAARQL-AGDEAITVDAAMVSRHE--KWMAEHGRT 54

Query: 64  YATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---- 118
           YA +EE   R  VF+AN +         D T      +F+DLT  EFR    GL R    
Sbjct: 55  YANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLTDEEFRAARTGLRRPPAA 114

Query: 119 --RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
                  A   +       D     DWR  GAVTGVKDQG+CG CW+FSA  A+EG   +
Sbjct: 115 AAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLTKI 174

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            TG LVSLSEQQLVDCD   D       D GC GGLM++AFEY++  GG+  E  YPY G
Sbjct: 175 RTGRLVSLSEQQLVDCDVYGD-------DEGCAGGLMDNAFEYMINRGGLTTESSYPYRG 227

Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           TD GSC+  +S  AA++  +  + ++ +      V H P++
Sbjct: 228 TD-GSCR--RSASAASIRGYEDVPANNEAALMAAVAHQPVS 265


>gi|403300987|ref|XP_003941193.1| PREDICTED: cathepsin L2 [Saimiri boliviensis boliviensis]
          Length = 333

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 92/249 (36%), Positives = 130/249 (52%), Gaps = 18/249 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSD 103
           N +  +  +K+   + Y+T EE  +R  V++ N++  +          HG T     F D
Sbjct: 24  NLDTQWYQWKATHRRLYSTNEE-GWRRAVWEKNMKMIELHNGEYSRGKHGFTMAMNAFGD 82

Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           +T  EFR+  +    +        + P+L   DLP   DWR  G VT VK+Q  CGSCW+
Sbjct: 83  MTNEEFRQVMVCFRNQKHKNGKVFRGPLLL--DLPKSVDWRKKGYVTPVKNQKQCGSCWA 140

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSATGALEG  F  TG+LVSLSEQ LVDC     P+     + GCNGG MN AF Y+ + 
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR---PQG----NQGCNGGFMNYAFRYVKEN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG++ E  YPY   D G CK+      A  + F VI + E ++   +   GP++    ++
Sbjct: 194 GGLDSEASYPYEAKD-GICKYKPENSVANDTGFVVIPTHEKELMKAVATVGPIS---VAV 249

Query: 284 ELPHISFSF 292
           +  H SF F
Sbjct: 250 DASHSSFQF 258


>gi|426252096|ref|XP_004019754.1| PREDICTED: cathepsin F isoform 2 [Ovis aries]
          Length = 477

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 101/244 (41%), Positives = 136/244 (55%), Gaps = 28/244 (11%)

Query: 43  EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKF 101
           +D  +     F  F + +++TY +QEE  +R  VF  N+ RA++ Q LD  TA +GVTKF
Sbjct: 170 QDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKF 229

Query: 102 SDLTPSEFRRQFL--------GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVK 153
           SDLT  EFR  +L        G N RL  P          T+  P  +DWR+ GAVT VK
Sbjct: 230 SDLTEEEFRTIYLNPLLKDAPGRNMRLAQPV---------TDVPPPQWDWRNKGAVTDVK 280

Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
           DQG CGSCW+FS TG +EG  FL  G L+SLSEQ+L+DCD           D  C GGL 
Sbjct: 281 DQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKT---------DKACLGGLP 331

Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
           ++A+  I   GG+E E DY Y G    +C F   K    +++   +S +E ++AA L K 
Sbjct: 332 SNAYSAIRTLGGLETEDDYSYRG-HLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKK 390

Query: 274 GPLA 277
           GP++
Sbjct: 391 GPIS 394


>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
          Length = 338

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 97/248 (39%), Positives = 132/248 (53%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+  +KS  SK Y  +EE  +R  +++ NL+  +   L      H    G+  F D+T  
Sbjct: 27  HWLSWKSWHSKKYHEKEE-GWRRMIWEKNLKMIELHNLDHSLGKHSYRLGMNHFGDMTNE 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EFR+   G  ++ R     + +  L  N L  P   DWR+ G VT VKDQG CGSCW+FS
Sbjct: 86  EFRQVMNGF-KQSRSQRKYKGSQFLEPNFLQAPKSVDWREKGYVTPVKDQGQCGSCWAFS 144

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATGALEG HF  TG+LVSLSEQ L+DC     PE     + GCNGGLM+ AF+YI    G
Sbjct: 145 ATGALEGQHFRKTGKLVSLSEQNLIDCS---GPE----GNQGCNGGLMDQAFQYIKDNNG 197

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLAGNVASIE 284
           ++ E+ YPY G D   C +     +A  + F  I    ++     V   GP++    +I+
Sbjct: 198 IDSEESYPYIGKDDEDCLYKPEYNSANDTGFVDIPEGRERALMKAVAAVGPIS---VAID 254

Query: 285 LPHISFSF 292
             H SF F
Sbjct: 255 ASHTSFQF 262


>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 104/281 (37%), Positives = 143/281 (50%), Gaps = 23/281 (8%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
           + LS+L+L LL+    S+          RQ+   D   + D  + + H    + ++  +T
Sbjct: 1   MSLSTLILALLA---MSSAVAAPRALAARQL-AGDEAITVDSAMVSRHE--KWMAEHGRT 54

Query: 64  YATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNR---- 118
           YA +EE   R  VF+AN +         D T      +F+DLT  EFR    GL R    
Sbjct: 55  YANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLTDEEFRAARTGLRRPPAA 114

Query: 119 --RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
                  A   +       D     DWR  GAVTGVKDQG+CG CW+FSA  A+EG   +
Sbjct: 115 AAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLTKI 174

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            TG LVSLSEQQLVDCD   D       D GC GGLM++AFEY++  GG+  E  YPY G
Sbjct: 175 RTGRLVSLSEQQLVDCDVYGD-------DEGCAGGLMDNAFEYMINRGGLTTESSYPYRG 227

Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           TD GSC+  +S  AA++  +  + ++ +      V H P++
Sbjct: 228 TD-GSCR--RSASAASIRGYEDVPANNEAALMAAVAHQPVS 265


>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 327

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 89/248 (35%), Positives = 133/248 (53%), Gaps = 15/248 (6%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           +N    + L+K K+ KTY +  E + R +++  N         +D +    V +F+DLT 
Sbjct: 23  VNDAEEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYVNEHNSMDSSFQLEVNEFADLTA 82

Query: 107 SEFRRQFLGLNR-RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
            EF   + G  + R R   +           +P   DWR  G VT VK+Q  CGSCW+FS
Sbjct: 83  EEFSSIYNGYGKGRNRENHENTTIYRYTGGAIPDSVDWRTKGLVTPVKNQKQCGSCWAFS 142

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TG+LEGAH   TG+LVSLSEQ LVDCD +         D GC GGLM +AF+YI +  G
Sbjct: 143 TTGSLEGAHAKKTGKLVSLSEQNLVDCDKK---------DHGCQGGLMTTAFKYIEENKG 193

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVS-NFSVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E+ YPY   + G C+F K  I A V  + S++++D + +   + + GP++    +++
Sbjct: 194 IDTEESYPYKAKN-GRCEFKKDDIGATVERHVSILTTDCEALKKAVAEIGPIS---VAMD 249

Query: 285 LPHISFSF 292
             H SF  
Sbjct: 250 ASHSSFQL 257


>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
          Length = 351

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 102/282 (36%), Positives = 146/282 (51%), Gaps = 32/282 (11%)

Query: 28  DAMIRQVVPSDGEQSEDH------LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL 81
           DA+++Q + +D  +S  H      L+N E  +  FK +  K Y +  E  +R ++F  N 
Sbjct: 7   DAVVQQKLTND--ESRTHAVSFFELVNQE--WMTFKMEHKKVYKSDVEERFRMKIFMDNK 62

Query: 82  RR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAP-----IL 132
            + AK     +   V     + K+ D+   EF     G N+ +     +++ P     I 
Sbjct: 63  HKIAKHNSNYEMKKVSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERLPVGASFIE 122

Query: 133 PTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVD 191
           P N  LP   DWR  GAVT VKDQG CGSCWSFSATGALEG HF  TG LVSLSEQ L+D
Sbjct: 123 PANVVLPKKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLID 182

Query: 192 CDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAA 251
           C  +         ++GCNGGLM+ AF+YI    G++ E  YPY   +   C+++ +   A
Sbjct: 183 CSGKYG-------NNGCNGGLMDQAFQYIKDNKGLDTEASYPYE-AENDKCRYNPANSGA 234

Query: 252 A-VSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
             V    + + DE  + A +   GP++    +I+  H SF F
Sbjct: 235 IDVGYIDIPTGDEKLLKAAVATIGPVS---VAIDASHQSFQF 273


>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
          Length = 339

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 96/248 (38%), Positives = 130/248 (52%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+  +K+  SK Y  QEE  +R  +++ NL+  +   L      H    G+  F D+T  
Sbjct: 28  HWQAWKTWHSKKYHQQEE-GWRRMIWEKNLKMIQLHNLDHSLGKHSYRLGMNHFGDMTNE 86

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EFR+   G  +  +     + +  L  N L  P   DWR+ G VT VKDQG CGSCW+FS
Sbjct: 87  EFRQVMNGY-KHSKTEKKYRGSEFLEPNFLVVPKSVDWREKGYVTPVKDQGQCGSCWAFS 145

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TG+LEG HF  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AFEYI   GG
Sbjct: 146 TTGSLEGQHFRKTGKLVSLSEQNLVDCSR---PEG----NQGCNGGLMDQAFEYIADNGG 198

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E+ YPY   D   C +     AA  + F  V    E  +   +   GP++    +I+
Sbjct: 199 IDSEESYPYIAKDDEDCLYKSEFNAANDTGFVDVPEGHERALMKAVAAVGPVS---VAID 255

Query: 285 LPHISFSF 292
             H +F F
Sbjct: 256 ASHSTFQF 263


>gi|323451241|gb|EGB07119.1| hypothetical protein AURANDRAFT_54023 [Aureococcus anophagefferens]
          Length = 377

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 91/229 (39%), Positives = 128/229 (55%), Gaps = 7/229 (3%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGV-TKFSDLTPSEFRR 111
           F  F +KF KTY T EE  +R  VF  N +              G+  +F+D T  EF  
Sbjct: 65  FMTFMTKFEKTYETVEEWAHRLTVFAQNAKIVLEHDAKAEGFALGLDNQFADWTAEEFA- 123

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
            +  L+ R + P+ A     +     PT  DWR  G V  +K+QG+CGSCW+FS   ++E
Sbjct: 124 SYQKLHSRPK-PSQAGATHEVSDKAAPTAVDWRTEGVVADIKNQGSCGSCWTFSTVVSIE 182

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVERE 229
           GA    TG+LV+LSEQ LVDC  +   +    C  GC+GGLM++AF+YI+K   GG++ E
Sbjct: 183 GAAARKTGKLVTLSEQNLVDCVKKDQIDGGDECCMGCSGGLMDNAFDYIIKNQDGGIDTE 242

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLA 277
             Y YTG D G+C FDK+ + A +SN++ V   DE  +A  L   GP++
Sbjct: 243 ASYGYTGKD-GTCAFDKANVGATISNWTDVAVGDEVALADALANAGPVS 290


>gi|118401108|ref|XP_001032875.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89287220|gb|EAR85212.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 360

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 91/224 (40%), Positives = 118/224 (52%), Gaps = 11/224 (4%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           E  F  FK K++KTY    E  YRF VF  N     R       +  GV +F+DLT  EF
Sbjct: 42  ERAFKNFKVKYAKTYKDDTEEQYRFSVFTNNYVEIYRHNKFLVFSKVGVNQFADLTHEEF 101

Query: 110 RRQFLGLNRRLRLPADAQKA--PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           +  + G         D  K   P LPT++LP  FDWRD GA+T VK Q  CG CW+FS  
Sbjct: 102 KALYTGHKHSKDDDDDDNKNKQPHLPTDNLPASFDWRDKGAITPVKVQNGCGGCWAFSTV 161

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
            ++EG +FL TG+L SLS QQ++DC   C  +E     SGC GG    AF  I   GG+ 
Sbjct: 162 QSIEGLYFLKTGKLESLSTQQVIDC---CRIDE-----SGCLGGDPEPAFRCIQNNGGIM 213

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLV 271
            E +YPY      SCKFD+ K    +  +  + SD+ Q+ A L+
Sbjct: 214 TETEYPYIAKQ-QSCKFDEDKPTFQIGGYIDVPSDQSQVKAALL 256


>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
          Length = 360

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 106/277 (38%), Positives = 146/277 (52%), Gaps = 24/277 (8%)

Query: 9   LLLLLLSSVLASAVA---VNDDDAMIRQVVPSDGEQSEDHLLNA----EHHFS--LFKSK 59
           L L++   + ASA+A      D+  IRQVV     + E+ +L       H  S   F  +
Sbjct: 8   LALVVAGGLFASALAGPATFADENPIRQVVSDGLHELENAILQVVGKTRHALSSARFAHR 67

Query: 60  FSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRR 119
           + K Y + EE   RF VF  NL+  +       +   GV +F+DLT  EFRR  LG  + 
Sbjct: 68  YGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQN 127

Query: 120 LRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
               +   K  +  TN  LP    WR+ G V+ VK+QG CGSCW+FS TGALE A+  + 
Sbjct: 128 C---SATTKGNLKVTNVVLPETKGWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYSQAF 184

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+ +SLSEQQLVDC    +       + GCNGGL + AFEYI   GG++ E+ YPYTG +
Sbjct: 185 GKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKN 237

Query: 239 GGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
            G CKF    +   V    N ++ + DE + A  LV+
Sbjct: 238 -GLCKFSSENVGVKVIDSVNITLGAEDELKYAVALVR 273


>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
          Length = 316

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 91/244 (37%), Positives = 143/244 (58%), Gaps = 22/244 (9%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           E  F  F++K+ K Y + E  +YR +V   N+   ++    + +   G+T F+D+T +EF
Sbjct: 24  EKLFQTFEAKYGKNYLSSE-REYRKKVLAYNMDWIEKFNSDEHSFTLGMTPFADMTNTEF 82

Query: 110 RRQFLGLNRRLRLPADAQKAPILPTNDLPTD-FDWRDHGAVTGVKDQGACGSCWSFSATG 168
                 L   ++ P + ++A +L  N++  +  DWR+ GAVT VK+QG+CGSCW+FSATG
Sbjct: 83  ATS--KLCGCMKKPLNHKQARVL--NNMAVESIDWREKGAVTPVKNQGSCGSCWAFSATG 138

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           ALEG +F++TG+LVSLSEQQLVDCD E         D+GC GG M++AFEY++K  G+  
Sbjct: 139 ALEGGNFVATGKLVSLSEQQLVDCDTE---------DAGCGGGFMDTAFEYVMKK-GLCT 188

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
           E+DYPY   D   CK D+     +++ +  + +++       +   P+     S+ +   
Sbjct: 189 EEDYPYHAKD-EDCKDDQCTSVISITGYEDVPANDGVALKQALTKAPV-----SVAIQAD 242

Query: 289 SFSF 292
           SF F
Sbjct: 243 SFVF 246


>gi|1019667|gb|AAA79287.1| rangelipain, partial [Trypanosoma rangeli]
          Length = 263

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 95/236 (40%), Positives = 122/236 (51%), Gaps = 22/236 (9%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK +  K Y +  E  +R  VFK NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAAFKQRHGKVYGSAAEEAFRLGVFKENLLFARLHAAANPHASFGVTPFSDLTREEFRS 96

Query: 112 QFLGLNRRLRLPADAQKAPILPT------NDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           ++   +      A AQK   +P          P   DWR  GAVT +KDQG CGSCW+FS
Sbjct: 97  RY---HNAAAHFAAAQKRVRVPVEVEVEVGGAPAAVDWRARGAVTAIKDQGGCGSCWAFS 153

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KA 223
             G +EG   L+   L  LSEQ LV CD+          D+GC+GGLM+SAF++I+    
Sbjct: 154 TIGNIEGQWHLAGNPLTGLSEQMLVSCDN---------ADNGCDGGLMDSAFDWIVGQNN 204

Query: 224 GGVEREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G V  E  Y Y   G D  +C      + A +S    +  DED+MAA L  +GPLA
Sbjct: 205 GSVYTEASYSYVSGGGDSQTCNMSSHVVGAVISGHVDLPQDEDKMAAWLAVNGPLA 260


>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
 gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
          Length = 489

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 96/263 (36%), Positives = 143/263 (54%), Gaps = 22/263 (8%)

Query: 40  EQSEDHLLNAEHH----FSLFKSKFSKTYATQ-EEHDYRFRVFKANLRRAKRRQLLDPTA 94
           EQ E  LL+A+ +    F  +  +++K YA   +E + RF V+  NL           + 
Sbjct: 28  EQHEKLLLDAKANPMAAFQQWMMQYTKAYANDIKELETRFSVWLENLNYILAYNARTTSH 87

Query: 95  VHGVTKFSDLTPSEFRRQFLGLNRRLRLPADA-QKAPIL----PTNDLPTDFDWRDHGAV 149
              +  F+DLT  EFR + LG + + R  ++  Q +P +      N LPT+ DWR  GAV
Sbjct: 88  WLHLNAFADLTTDEFRNR-LGYDFKARQASNRLQSSPFIYDNVDANQLPTEIDWRKKGAV 146

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
           T VK+QG CGSCW+F+ TG++EG + + TGEL SLSEQ+LVDCD +         D GC+
Sbjct: 147 TEVKNQGQCGSCWAFATTGSVEGINAIVTGELASLSEQELVDCDTD--------EDRGCS 198

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
           GGLM+ A+++I+K GG++ E DYPYT  DG      K++    +  +  I  +++     
Sbjct: 199 GGLMDYAYQWIIKNGGLDTEDDYPYTAEDGVCVAAKKNRRVVTIDGYVDIPENDEVALKK 258

Query: 270 LVKHGPLAGNVASIELPHISFSF 292
              H P+A    +IE    SF  
Sbjct: 259 AAAHQPIA---VAIEADAKSFQL 278


>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 96/253 (37%), Positives = 130/253 (51%), Gaps = 19/253 (7%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDL 104
            +  ++ FK +  K Y ++ E  +R ++F  N  +  +   L    ++     + K+ DL
Sbjct: 23  VQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVAKHNKLFEQGLYPYKLAMNKYGDL 82

Query: 105 TPSEFRRQFLGLNRRL----RLPADAQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACG 159
              EF     G NR      R         I P + D+P   DWR  GAVT VKDQG CG
Sbjct: 83  LHHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVDIPDTVDWRQEGAVTPVKDQGHCG 142

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCWSFSATGALEG HF  T +LVSLSEQ LVDC        S   ++GCNGGLM++AF Y
Sbjct: 143 SCWSFSATGALEGQHFRQTKKLVSLSEQNLVDC-------SSRFGNNGCNGGLMDNAFRY 195

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           I   GG++ E  YPY G D       K++ A       + S DED++ A +   GP++  
Sbjct: 196 IKNNGGIDTEAAYPYMGEDEKFRYSAKNRGATDKGFVDIPSGDEDKLKAAVATVGPIS-- 253

Query: 280 VASIELPHISFSF 292
             +I+  H SF  
Sbjct: 254 -IAIDASHESFQL 265


>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
          Length = 358

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 102/256 (39%), Positives = 137/256 (53%), Gaps = 21/256 (8%)

Query: 27  DDAMIRQVVPSDGEQSED---HLLNAEHH---FSLFKSKFSKTYATQEEHDYRFRVFKAN 80
           D+  IRQVV     + E    H++    H   F+ F  ++ K Y + EE   RF +F  N
Sbjct: 27  DENPIRQVVSDSFHELESGILHVVGQTRHALSFARFARRYGKRYDSVEEIKQRFDIFLDN 86

Query: 81  LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTD 140
           L           +   GV +FSDLT  EFRR  LG  +     A  +    L    LP  
Sbjct: 87  LEMINSHNDKGLSYKLGVNEFSDLTWDEFRRDRLGAAQNC--SATTKGNLKLRDAVLPET 144

Query: 141 FDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 200
            DWR+ G V+ VK+QG CGSCW+FS TGALE A+    G+ +SLSEQQLVDC        
Sbjct: 145 KDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYTQKFGKGISLSEQQLVDC-------- 196

Query: 201 SGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVS---NF 256
           +G+ ++ GCNGGL + AFEYI   GG+E E+ YPYTG + G CKF    +   V+   N 
Sbjct: 197 AGAFNNFGCNGGLPSQAFEYIKSNGGLETEEAYPYTGKN-GLCKFSSQNVGVKVTDSVNI 255

Query: 257 SVISSDEDQMAANLVK 272
           ++ + DE + A  LV+
Sbjct: 256 TLGAEDELKYAVALVR 271


>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
          Length = 300

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 88/237 (37%), Positives = 132/237 (55%), Gaps = 14/237 (5%)

Query: 58  SKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
           SK  K+Y + EE  +RF VF+ NL+          +   G+ +F+DL+  EF+R++LGL 
Sbjct: 2   SKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGLK 61

Query: 118 RRLRLPADA-QKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
             L    D+ ++       DLP   DWR  GAV  VK+QGACGSCW+FS   A+EG + +
Sbjct: 62  IELPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQI 121

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            TG L +LSEQ+L+DCD           ++GCNGGLM+ AF +I+  GG+ +E+DYPY  
Sbjct: 122 VTGNLTALSEQELIDCDK--------PFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYV- 172

Query: 237 TDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
            + G+C   K ++    +S +  +  D +Q     + + PL+    +IE     F F
Sbjct: 173 MEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLS---VAIEASSRGFQF 226


>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
          Length = 335

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 92/250 (36%), Positives = 133/250 (53%), Gaps = 20/250 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           ++H+  +K    KTYA +EE  +R  +++ NL+  +   L      H    G+ +F D+T
Sbjct: 26  DNHWYSWKDWHKKTYAPKEE-GWRRVLWEKNLKMIEFHNLDHSLGKHSYRLGMNQFGDMT 84

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
             EF++   G   +  +      AP     + P   DWR  G VT VKDQG CGSCW+FS
Sbjct: 85  NEEFKQLMNGYKNQKMIRGSTFLAP--NNFEAPKSVDWRKKGYVTPVKDQGQCGSCWAFS 142

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGALEG H+  T +L+SLSEQ LVDC            + GCNGGLM+ AF+Y+   GG
Sbjct: 143 TTGALEGQHYRKTSKLISLSEQNLVDC-------SRAQGNEGCNGGLMDQAFQYVKDNGG 195

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS--DEDQMAANLVKHGPLAGNVASI 283
           ++ E  YPYT  D   C +D +  +A  + F  + S  ++D M A +   GP++    +I
Sbjct: 196 IDSEDSYPYTAKDDQECHYDPNNNSANDTGFVDVQSGCEKDLMKA-VASVGPVS---VAI 251

Query: 284 ELPHISFSFL 293
           +  H SF F 
Sbjct: 252 DAGHQSFQFY 261


>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
          Length = 359

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 99/258 (38%), Positives = 137/258 (53%), Gaps = 31/258 (12%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M++L+  SL L L+ +V A+    N+ D            +SE  L N    +  ++S  
Sbjct: 3   MKKLLFISLSLALIFTV-ANTFDFNEHDL-----------ESEKSLWNL---YERWRSHH 47

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL---N 117
           + T    E+H+ RF VFKAN+        LD      + KF D+T  EFRR +      +
Sbjct: 48  TVTRNLDEKHN-RFNVFKANVMHVHNTNKLDKPYKLKLNKFGDMTNYEFRRIYADSKISH 106

Query: 118 RRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHF 175
            R+      +    +  N  D+P+  DWR+ GAVTGVKDQG CGSCW+FS   A+EG + 
Sbjct: 107 HRMFRGMSHENGTFMYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQ 166

Query: 176 LSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYT 235
           + T +LVSLSEQQLVDCD E         + GCNGGLM  AFE+I K  G+  E +YPY 
Sbjct: 167 IKTQKLVSLSEQQLVDCDTE--------ENEGCNGGLMEYAFEFI-KQNGITTESNYPYA 217

Query: 236 GTDGGSCKFDKSKIAAAV 253
             D G+C  +K   A ++
Sbjct: 218 AKD-GTCDVEKEDKAVSI 234


>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
 gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
 gi|228243|prf||1801240A Cys protease 1
          Length = 322

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 99/258 (38%), Positives = 132/258 (51%), Gaps = 24/258 (9%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
           L  A   +  FK KF + Y   EE  YR  VF  NL+      K+ +  + T    + +F
Sbjct: 13  LAAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQF 72

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLP--TDFDWRDHGAVTGVKDQGACG 159
           SD+T  +F     G  +  R PA    A    T+  P  T+ DWR  GAVT VKDQG CG
Sbjct: 73  SDMTNEKFNAVMKGYKKGPR-PA----AVFTSTDAAPESTEVDWRTKGAVTPVKDQGQCG 127

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFE 218
           SCW+FS TG +EG HFL TG LVSLSEQQLVDC         GS  + GCNGG +  A  
Sbjct: 128 SCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDC-------AGGSYYNQGCNGGWVERAIM 180

Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH-GPLA 277
           Y+   GGV+ E  YPY   D  +C+F+ + I A  + +  I+   +       +  GP++
Sbjct: 181 YVRDNGGVDTESSYPYEARD-NTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPIS 239

Query: 278 GNVASIELPHISFSFLFT 295
               +I+  H SF   +T
Sbjct: 240 ---VAIDASHRSFQSYYT 254


>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
          Length = 467

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 92/284 (32%), Positives = 150/284 (52%), Gaps = 23/284 (8%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           + R  LS  LL++ ++  A  +++   D   ++       +++D ++     +  +  K 
Sbjct: 3   LHRSSLSLFLLMIFTASSAVDMSIVSYD---QRHADKSSWRTDDEVM---AMYEAWLVKH 56

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL---- 116
            K Y    E + RF +FK NLR        + T   G+ +F+DLT  E+R  +LG+    
Sbjct: 57  GKAYNALGEKEKRFGIFKDNLRFIDEHNSQNLTYRLGLNRFADLTNEEYRSMYLGVKPGA 116

Query: 117 ---NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
               R++   +D   A +   + LP   DWR  GAV GVKDQG+CGSCW+FS   A+EG 
Sbjct: 117 TRVTRKVSRKSDRFAARV--GDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGI 174

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
           + + TG+L+SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+  GG++ E+DYP
Sbjct: 175 NQIVTGDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINNGGIDSEEDYP 226

Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           Y   D    ++ K+    ++  +  +  +++      V   P++
Sbjct: 227 YRAADQKCDQYRKNANVVSIDGYEDVPENDEAALKKAVAKQPVS 270


>gi|28971813|dbj|BAC65418.1| cathepsin L [Pandalus borealis]
          Length = 318

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 97/243 (39%), Positives = 131/243 (53%), Gaps = 22/243 (9%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKR-----RQLLDPTAVHGVTKFSDLTPSEFR 110
           FK   +K Y   +E  YR  +F+ N +  +      RQ L  T    + +F D+T  EF 
Sbjct: 21  FKLTHAKVYTHGKEDLYRRSIFENNQKVVEEHNERFRQGL-VTFDLKMNRFGDMTTEEFV 79

Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            Q  GLN+  R     +     P  +     DWRD GAVT VKDQG CGSCW+FS TGAL
Sbjct: 80  SQMTGLNKVERTVG--KVFAHYPEVERADTVDWRDKGAVTPVKDQGQCGSCWAFSTTGAL 137

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EGAHFL  G+LVSLSEQ LVDC  E         +SGCNGG++  A++YI    G++ E 
Sbjct: 138 EGAHFLKHGDLVSLSEQNLVDCSTE---------NSGCNGGVVQWAYDYIKSNNGIDTES 188

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIELPHIS 289
            YPY   D  +C+FD + + A V+ ++ I  +DE   A+ +   GP++     I+  H S
Sbjct: 189 SYPYEAQD-LTCRFDAAHVGATVTGYADIPYADEVTQASAVHDDGPVS---VCIDAGHNS 244

Query: 290 FSF 292
           F  
Sbjct: 245 FQL 247


>gi|356530431|ref|XP_003533785.1| PREDICTED: cysteine proteinase [Glycine max]
          Length = 354

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 93/225 (41%), Positives = 126/225 (56%), Gaps = 17/225 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
           F+ F S+F K+Y ++EE   R+ +F  NLR  R+  ++ L  T    V  F+D T  EF+
Sbjct: 55  FARFVSRFGKSYQSEEEMKERYEIFSQNLRFIRSHNKKRLPYTL--SVNHFADWTWEEFK 112

Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
           R  LG  +      +      L    LP   DWR  G V+ VKDQG+CGSCW+FS TGAL
Sbjct: 113 RHRLGAAQNCSATLNGNHK--LTDAVLPPTKDWRKEGIVSSVKDQGSCGSCWTFSTTGAL 170

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           E A+  + G+ +SLSEQQLVDC    +       + GC+GGL + AFEYI   GG+E E+
Sbjct: 171 EAAYAQAFGKSISLSEQQLVDCAGPFN-------NFGCHGGLPSQAFEYIKYNGGLETEE 223

Query: 231 DYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
            YPYTG D G CKF    +A  V    N ++ + DE + A   V+
Sbjct: 224 AYPYTGKD-GVCKFSAENVAVQVLDSVNITLGAEDELKHAVAFVR 267


>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 343

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 87/224 (38%), Positives = 130/224 (58%), Gaps = 15/224 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           +  F  +    SK Y  ++E   RF ++++N++       L         +F+D+T SEF
Sbjct: 40  KQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEF 99

Query: 110 RRQFLGLNRR-LRLPADAQKAPIL-PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           +  FLGLN   LRL    ++ P+  P  ++P   DWR  GAVT +++QG CG CW+FSA 
Sbjct: 100 KAHFLGLNTSSLRL--HKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAV 157

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
            A+EG + + TG LVSLSEQQL+DCD        G+ + GC+GGLM +AFE+I   GG+ 
Sbjct: 158 AAIEGINKIKTGNLVSLSEQQLIDCD-------VGTYNKGCSGGLMETAFEFIKSNGGLT 210

Query: 228 REKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDED--QMAA 268
            E DYPYTG + G+C  +K+K     +  +  ++ +E   Q+AA
Sbjct: 211 TETDYPYTGIE-GTCDQEKAKNKVVTIQGYQKVAQNEASLQIAA 253


>gi|340380715|ref|XP_003388867.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
          Length = 347

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 101/283 (35%), Positives = 149/283 (52%), Gaps = 29/283 (10%)

Query: 9   LLLLLLSSVLASAVAVND-DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
           +LL LL+S    +++ +  DD ++ + V    E            F  +  K  KTYAT 
Sbjct: 10  ILLFLLASFTDVSLSFDPLDDFVMSESVQRAAE------------FERWTIKHKKTYATA 57

Query: 68  EEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN-RRLRLPAD 125
           EE+++R RV+ AN    KR  +   P     + +F+DLT +EF+R +L  + +  R    
Sbjct: 58  EEYNWRLRVYTANHYYVKRLNEGHGPATEFELNQFADLTFAEFKRIYLSSSSQHCRATTG 117

Query: 126 AQKAPILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
             + P+   N + P   DWR    +T V+DQG+CGSCW+FSAT  L     L TG+L+SL
Sbjct: 118 NFQMPVKKNNVEDPVAIDWRKRNVITPVRDQGSCGSCWAFSATSCLSAHLALKTGQLISL 177

Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
           S+QQL+DC    +       + GC GGL + AFEYI   GG+E E+DYPY   +   C F
Sbjct: 178 SKQQLLDCSRSFN-------NRGCKGGLPSQAFEYIRYNGGIESERDYPYKDRE-EKCHF 229

Query: 245 DKSKIAAAVS---NFSVISSDEDQMAANLVKHGPLAGNVASIE 284
             S +AA V+   NF+     ED +A  L   GP++  + S +
Sbjct: 230 KPSLVAATVTGVVNFT--QGAEDDIAVALANIGPVSIGIHSTK 270


>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
          Length = 465

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 86/232 (37%), Positives = 129/232 (55%), Gaps = 11/232 (4%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTP 106
           A   + L+ ++  ++Y    EH+ RFRVF  NLR   A   +  D     G+ +F+DLT 
Sbjct: 50  ARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTN 109

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
            EFR  FLG     R  A  ++       +LP   DWR+ GAV  VK+QG CGSCW+FSA
Sbjct: 110 EEFRATFLGAKVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 169

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
              +E  + L TGE+++LSEQ+LV+C        +   +SGCNGGLM+ AF++I+K GG+
Sbjct: 170 VSTVESINQLVTGEMITLSEQELVEC-------STNGQNSGCNGGLMDDAFDFIIKNGGI 222

Query: 227 EREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
           + E DYPY   D G C  ++      ++  F  +  ++++     V H P++
Sbjct: 223 DTEDDYPYKAVD-GKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVS 273


>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
          Length = 333

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 92/249 (36%), Positives = 131/249 (52%), Gaps = 18/249 (7%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSD 103
           N +  +  +K+   + Y   EE  +R  V++ N+R  +          HG T     + D
Sbjct: 24  NLDTQWYQWKATHKRLYGLNEE-GWRRAVWEKNMRMIELHNGEYSQGKHGFTMGMNAYGD 82

Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           +T  EFR+   G   +        + P+L     P   DWR+ G VT VK+QG CGSCW+
Sbjct: 83  MTNEEFRQVMNGFQNQKHKKGKMFRDPLLL--QYPKSVDWREKGYVTPVKNQGQCGSCWA 140

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSATGALEG  F  TG+L+SLSEQ LVDC H   P+     + GCNGGLM+ AF+Y+   
Sbjct: 141 FSATGALEGQMFQKTGKLISLSEQNLVDCSH---PQG----NQGCNGGLMDYAFQYVKDN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
            G++ E+ YPY G D G+CK+      A  + F  I   E  +   +   GP++   A+I
Sbjct: 194 SGLDSEESYPYEGMD-GTCKYKPECSVANDTGFVDIPGHEKALLRAVATVGPIS---AAI 249

Query: 284 ELPHISFSF 292
           +  H+SF F
Sbjct: 250 DAGHMSFQF 258


>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
 gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
          Length = 326

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 82/226 (36%), Positives = 129/226 (57%), Gaps = 15/226 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  +  K  K+Y T +E   R+ VF+ N+    +        + G+   +DLT  EF++ 
Sbjct: 32  FQNWMVKHQKSY-TNDEFGSRYSVFQDNMDIVAKWNQKGSNTILGLNVMADLTNEEFKKL 90

Query: 113 FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
           +LG    +      +K  ++  + LP   DWR +GAVT VK+QG CG C++FS TG++EG
Sbjct: 91  YLGTKANVTY----KKKTLVGVSGLPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEG 146

Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVEREKD 231
            H +++ +LV LSEQQ++DC        SGS  ++GC+GGLM ++FEYI+  GG++ E  
Sbjct: 147 IHEITSQQLVPLSEQQILDC--------SGSEGNNGCDGGLMTNSFEYIIAVGGLDTEAS 198

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPYTG + G CKF+K  I A ++ +  + S  +      V   P++
Sbjct: 199 YPYTG-EVGKCKFNKKNIGATITGYKNVESGSESDLQTAVAAQPVS 243


>gi|29789900|gb|AAF21457.2|U56958_1 cysteine proteinase [Paragonimus westermani]
          Length = 272

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 88/208 (42%), Positives = 120/208 (57%), Gaps = 18/208 (8%)

Query: 83  RAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPT 139
           RA++ QL D  TA +GVT+FSDLTP EF  ++L          + Q   + PT     P 
Sbjct: 2   RAQKLQLKDQGTARYGVTQFSDLTPEEFAAKYLSAPVN-----NDQVKRVRPTGLKAAPE 56

Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
             DWR  GAVT V++QG+CGSCW+FS  G +EG  F+ TG+LVSLS+QQLVDCD   D  
Sbjct: 57  RIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD-- 114

Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
                  GCNGG   S++  I+  GG+E + DYPY G     C  +K ++ A + +   +
Sbjct: 115 -------GCNGGWPASSYLEIMHMGGLESQDDYPYAGVK-EQCFMEKERLLAKIDDSIAL 166

Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPH 287
              ED  AA L +HGPL+  + +I L +
Sbjct: 167 XPSEDDNAAYLAEHGPLSTLLNAITLQY 194


>gi|56553473|gb|AAV97878.1| recombinant cysteine protease [Cloning vector pQ-CPB]
          Length = 335

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 89/234 (38%), Positives = 122/234 (52%), Gaps = 18/234 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + YAT +E   R   F+ NL   +  Q  +P A  G+TKF DL+  EF  +
Sbjct: 30  FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 89

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L       +  +  +   +      +  P   DWR+ GAVT VKDQG CGSCW+FSA G
Sbjct: 90  YLSGATHFAKAKKFASQYYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 149

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
            +E   +L+T  L+SLSEQ+LV CD           D GCNGGLM  AF+++L  + G V
Sbjct: 150 NIESKWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMGQAFDWLLNNRNGAV 200

Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
                YPY   +G   +  +S    I A +     I S+ED MAA L  +GP+A
Sbjct: 201 YTGASYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIA 254


>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 85/212 (40%), Positives = 121/212 (57%), Gaps = 22/212 (10%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+ +      ++ + ++   TY    E + RF  F+ NLR   +        
Sbjct: 28  IVSYGERSEEEV---RRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAG 84

Query: 95  VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
           VH    G+ +F+DLT  E+R  +LG     +R  +L A  Q A     ++LP   DWR  
Sbjct: 85  VHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAAD---NDELPESVDWRKK 141

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAV  VKDQG CGSCW+FSA  A+EG + + TG+++ LSEQ+LVDCD         S + 
Sbjct: 142 GAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQ 193

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GCNGGLM+ AFE+I+  GG++ E+DYPY   D
Sbjct: 194 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERD 225


>gi|42516556|gb|AAS17989.1| cysteine proteinase CP2 [Paragonimus westermani]
          Length = 272

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 88/208 (42%), Positives = 120/208 (57%), Gaps = 18/208 (8%)

Query: 83  RAKRRQLLDP-TAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--NDLPT 139
           RA++ QL D  TA +GVT+FSDLTP EF  ++L          + Q   + PT     P 
Sbjct: 2   RAQKLQLKDQGTARYGVTQFSDLTPEEFAAKYLSAPVN-----NDQVKRVRPTGLKAAPE 56

Query: 140 DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPE 199
             DWR  GAVT V++QG+CGSCW+FS  G +EG  F+ TG+LVSLS+QQLVDCD   D  
Sbjct: 57  RIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD-- 114

Query: 200 ESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI 259
                  GCNGG   S++  I+  GG+E + DYPY G     C  +K ++ A + +   +
Sbjct: 115 -------GCNGGWPASSYLEIMHMGGLESQDDYPYAGVK-EQCFMEKERLLAKIDDSIAL 166

Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPH 287
              ED  AA L +HGPL+  + +I L +
Sbjct: 167 GPSEDDNAAYLAEHGPLSTLLNAITLQY 194


>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
          Length = 421

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 95/253 (37%), Positives = 134/253 (52%), Gaps = 19/253 (7%)

Query: 40  EQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT 99
           E  E H  +A   FS F++ ++K+YAT+EE   R+ +FK NL           +    + 
Sbjct: 106 EWKEAHFQDA---FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMN 162

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPAD-----AQKAPILPTNDLPTDFDWRDHGAVTGVKD 154
            F DL+  EFRR++LG  +   L +       +   +LP+ +LP   DWR  G VT VKD
Sbjct: 163 HFGDLSRDEFRRKYLGFKKSRNLKSHHLGVATELLNVLPS-ELPAGVDWRSRGCVTPVKD 221

Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
           Q  CGSCW+FS TGALEGAH   TG+LVSLSEQ+L+DC            +  C+GG MN
Sbjct: 222 QRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSR-------AEGNQSCSGGEMN 274

Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKH 273
            AF+Y+L +GG+  E  YPY   D   C+    +    +  F  V    E  M A L K 
Sbjct: 275 DAFQYVLDSGGICSEDAYPYLARD-EECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK- 332

Query: 274 GPLAGNVASIELP 286
            P++  + + ++P
Sbjct: 333 SPVSIAIEADQMP 345


>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 342

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 95/252 (37%), Positives = 135/252 (53%), Gaps = 25/252 (9%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSEFRR 111
           FK +  K Y ++ E  +R +++  N  + AK  QL +   V    G  K++D+   EF +
Sbjct: 31  FKMQHDKKYDSEVEDRFRMKIYAENKHKIAKHNQLYEQGLVSYKLGPNKYTDMLHHEFIQ 90

Query: 112 QFLGLNRRLR-------LPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCW 162
              G NR  +          D + A  +P   +  P   DW   GAVT VKDQG CGSCW
Sbjct: 91  AMNGYNRTAKHNKGLYGKKHDVRGATFIPPAHVKYPDHVDWTKKGAVTEVKDQGKCGSCW 150

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FS TGALEG HF  +G LVSLSEQ L+DC        S   ++GCNGGLM++AF+YI  
Sbjct: 151 AFSTTGALEGQHFRKSGYLVSLSEQNLIDC-------SSTYGNNGCNGGLMDNAFKYIKD 203

Query: 223 AGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
            GG++ EK YPY G D   C+++ K+  A  V    + S DE+++   +   GP++    
Sbjct: 204 NGGIDTEKTYPYEGVD-DKCRYNPKNSGAEDVGFVDIPSGDEEKLMQAVATVGPVS---V 259

Query: 282 SIELPHISFSFL 293
           +I+    SF F 
Sbjct: 260 AIDASQNSFQFY 271


>gi|1581746|prf||2117247B Cys protease:ISOTYPE=2
          Length = 467

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 95/236 (40%), Positives = 122/236 (51%), Gaps = 22/236 (9%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK +  K Y +  E  +R  VFK NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAAFKQRHGKVYGSAAEEAFRLGVFKENLLFARLHAAANPHASFGVTPFSDLTREEFRS 96

Query: 112 QFLGLNRRLRLPADAQKAPILPT------NDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           ++   +      A AQK   +P          P   DWR  GAVT +KDQG CGSCW+FS
Sbjct: 97  RY---HNAAAHFAAAQKRVRVPVEVEVEVGGAPAAVDWRARGAVTAIKDQGGCGSCWAFS 153

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KA 223
             G +EG   L+   L  LSEQ LV CD+          D+GC+GGLM+SAF++I+    
Sbjct: 154 TIGNIEGQWHLAGNPLTGLSEQMLVSCDNA---------DNGCDGGLMDSAFDWIVGQNN 204

Query: 224 GGVEREKDYPYT--GTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G V  E  Y Y   G D  +C      + A +S    +  DED+MAA L  +GPLA
Sbjct: 205 GSVYTEASYSYVSGGGDSQTCNMSSHVVGAVISGHVDLPQDEDKMAAWLAVNGPLA 260


>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
 gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 82/207 (39%), Positives = 120/207 (57%), Gaps = 23/207 (11%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH---GVTKFSDLTPSEF 109
           F  ++ +  K Y    E + R+R FK NL+    +      A+    G+ KF+DL+  EF
Sbjct: 50  FQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKFADLSNEEF 109

Query: 110 RRQFLGLNRRLRLPADAQKAPI-------LPTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           +  +L    +++ P + +++         L T D P+  DWR  G VT VKDQG CGSCW
Sbjct: 110 KELYLS---KVKKPINIKRSTARDWRQRNLQTCDAPSSLDWRKKGVVTAVKDQGDCGSCW 166

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           SFS TGA+EG + + TG+L+SLSEQ+LVDCD         + + GC GG M+ AFE+++ 
Sbjct: 167 SFSTTGAIEGINAIVTGDLISLSEQELVDCD---------TTNYGCEGGYMDYAFEWVIN 217

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKI 249
            GG++ E +YPYTG D G+C   K +I
Sbjct: 218 NGGIDTEANYPYTGVD-GTCNTTKEEI 243


>gi|403293601|ref|XP_003937801.1| PREDICTED: cathepsin F [Saimiri boliviensis boliviensis]
          Length = 379

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 98/227 (43%), Positives = 132/227 (58%), Gaps = 14/227 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
           F  F   +++TY ++EE  +R  +F  N+ RA++ Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 82  FRNFVITYNRTYESKEEAQWRLSIFAHNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 141

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  N  LR     +        DL P ++DWR  GAVT VKDQG CGSCW+FS TG +
Sbjct: 142 IYL--NPLLREEPGKKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 199

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL+ G L+SLSEQ+L+DCD           D  C GGL +SA+  I   GG+E E 
Sbjct: 200 EGQWFLNQGTLLSLSEQELLDCDK---------IDKACMGGLPSSAYSAIKNLGGLETED 250

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY Y G    +C F   K    +++   +S +E ++AA L K GP++
Sbjct: 251 DYSYRG-HMQACSFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 296


>gi|237643659|ref|YP_002884349.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
 gi|229358205|gb|ACQ57300.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
          Length = 323

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 89/245 (36%), Positives = 137/245 (55%), Gaps = 23/245 (9%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           L A ++F  F  +F+K Y+++ E   RF++F+ NL     +   D +A + + KFSDL+ 
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80

Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
            E   ++ GL+    LP   Q   K  IL  P    P +FDWR    VT VK+QG CG+C
Sbjct: 81  DETIAKYTGLS----LPTQTQNFCKVIILDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+F+  G+LE    +   EL++LSEQQ++DCD           D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN---FSVISSDEDQMAANLVKHGPLAG 278
           K GGV+ E DYPY   D  +C+ + +K    V +   + ++  ++ +    LV   P+A 
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246

Query: 279 NVASI 283
           + A I
Sbjct: 247 DAADI 251


>gi|74142447|dbj|BAE31977.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 91/253 (35%), Positives = 132/253 (52%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VT 99
           D   +AE H   +KS   + Y T EE ++R  +++ N+R  +          HG    + 
Sbjct: 22  DQTFSAEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    +P   DWR+ G VT VK++G CG
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNKGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA+G LEG  FL TG+L+SLSEQ LVDC H          + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           I + GG++ E+ YPY   D GSCK+      A  + F  I   E  +   +   GP++  
Sbjct: 190 IKENGGLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS-- 246

Query: 280 VASIELPHISFSF 292
             +++  H S  F
Sbjct: 247 -VAMDASHPSLQF 258


>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
          Length = 326

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/248 (35%), Positives = 127/248 (51%), Gaps = 16/248 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           +  + +FK + +K Y   +E  YR  VF   +   ++  L     VH    G+ +++D+ 
Sbjct: 19  DREWGMFKVRHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGVHSFRVGINEYADMP 78

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
             EF R   G   + + P      P     DLP   DWR  G VT VK+QG CGSCW+FS
Sbjct: 79  NEEFVRVMNGYKMQEQRPKAPTYMPPSNVGDLPATVDWRTKGYVTEVKNQGQCGSCWAFS 138

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           +TG+LEG  F    +L+SLSEQ LVDC  E         + GC GGLM+ AF YI    G
Sbjct: 139 STGSLEGQTFKKYNKLISLSEQNLVDCSTE-------QGNMGCGGGLMDQAFTYIKVNDG 191

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E  YPY     G C+F+K+ + A  + ++ I S  E  + + +   GP+A    +I+
Sbjct: 192 IDTETSYPYEAAS-GKCRFNKANVGANDTGYTDIKSKSESDLQSAVATVGPIA---VAID 247

Query: 285 LPHISFSF 292
             H+SF  
Sbjct: 248 ASHMSFQL 255


>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
          Length = 337

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 92/248 (37%), Positives = 132/248 (53%), Gaps = 20/248 (8%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+  +K+   K Y  +EE  +R  +++ NLR+ +   L     +H    G+  F D+   
Sbjct: 28  HWEQWKTWHGKNYHEKEE-GWRRMIWEKNLRKIQFHNLEHSMGIHTYRLGMNHFGDMNHE 86

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EFR+   G   +       + +  +  N  ++P+  DWR+ G VT VKDQG CGSCW+FS
Sbjct: 87  EFRQVMNGYKHKTE--RKFKGSLFMEPNFLEVPSKLDWREKGYVTPVKDQGECGSCWAFS 144

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGA+EG  F   G+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+YI    G
Sbjct: 145 TTGAMEGQMFRKQGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDNNG 197

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E+ YPY GTD   C +D    AA  + F  + S  E  +   +   GP++    +I+
Sbjct: 198 LDSEEAYPYLGTDDQPCHYDPKYNAANDTGFVDIPSGKEHALMKAVASVGPVS---VAID 254

Query: 285 LPHISFSF 292
             H SF F
Sbjct: 255 AGHESFQF 262


>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
          Length = 462

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/191 (45%), Positives = 118/191 (61%), Gaps = 18/191 (9%)

Query: 69  EHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN----RRL-RL 122
           E D RF +FK NLR   ++    D +   G+ +F+DLT  E+R  +LG      RR+ + 
Sbjct: 66  EKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNRFADLTNEEYRSTYLGAKTDARRRIAKT 125

Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
            +D + AP      LP   DWR+ GAV  VKDQG+CGSCW+FS   A+EG + + TGEL+
Sbjct: 126 KSDRRYAP-KAGGSLPDSIDWREKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGELI 184

Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
           SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+K GG++ E DYPYTG  G   
Sbjct: 185 SLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIIKNGGIDTEADYPYTGRYG--- 233

Query: 243 KFDKSKIAAAV 253
           + D+++  A V
Sbjct: 234 RCDQTRKNAKV 244


>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
 gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 334

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 93/249 (37%), Positives = 133/249 (53%), Gaps = 20/249 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
           F  +K KF ++Y +  E  +R +++  N +      +L    +     G+T F+D+   E
Sbjct: 26  FHAWKLKFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADMENEE 85

Query: 109 FRR---QFLGLNRRLRLPADAQKAPILPT-NDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           ++R   Q    +    LP        LP   DLP   DWRD G VT VKDQ  CGSCW+F
Sbjct: 86  YKRVISQGCLHSFNASLPRRGSTFFRLPEGTDLPDAVDWRDKGYVTDVKDQKQCGSCWAF 145

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           SATG+LEG HF  TG LVSLSEQQLVDC  +         + GC GGLM+ AF+YI   G
Sbjct: 146 SATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYG-------NMGCMGGLMDYAFQYIQANG 198

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASI 283
           G++ E+ YPY   + G C+++   I A  + ++ +S  DED +   +   GP++     I
Sbjct: 199 GIDTEESYPYE-AENGKCRYNPDNIGATSTGYTEVSQGDEDALKEAVATIGPIS---VGI 254

Query: 284 ELPHISFSF 292
           +   +SF F
Sbjct: 255 DASQMSFQF 263


>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 471

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 83/200 (41%), Positives = 118/200 (59%), Gaps = 12/200 (6%)

Query: 41  QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTK 100
            S+D +L+  H +     + S+ Y +  E   RF++FK NL         + +   G+ K
Sbjct: 43  HSDDGMLDVFHQW---LERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQEKSYWLGLNK 99

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF-DWRDHGAVTGVKDQGACG 159
           FSDLT  EFR  +LG+    R          +  + +  +  DWR  GAV+ VKDQG+CG
Sbjct: 100 FSDLTHDEFRALYLGIRPAGRAHGLRNGDRFIYEDVVAEEMVDWRKKGAVSDVKDQGSCG 159

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA G++EG + + TGEL+SLSEQ+LVDCD           + GCNGGLM+ AF++
Sbjct: 160 SCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDR--------GQNQGCNGGLMDYAFDF 211

Query: 220 ILKAGGVEREKDYPYTGTDG 239
           I+K GG++ E+DYPY  TDG
Sbjct: 212 IIKNGGIDTEEDYPYKATDG 231


>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 338

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 14/239 (5%)

Query: 43  EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFS 102
           +D L+ A H    + +++ + Y+   E   R  VFKAN+   +     +        +F+
Sbjct: 25  DDWLIAARHE--QWMARYGRVYSDVAEKARRLEVFKANVGFIESVNAGNHKFWLEANQFA 82

Query: 103 DLTPSEFRRQFLGLNRRL---RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
           D+T  EFR    G   ++   +  A   +   +  +DLP   DWR +GAVT VKDQG CG
Sbjct: 83  DITKDEFRAMHKGYKMQVIGSKARATGFRYANVSIDDLPASVDWRANGAVTPVKDQGQCG 142

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
            CW+FS   ++EG   +STG+L+SLSEQ+LVDCD        G  + GC GGLM++AFE+
Sbjct: 143 CCWAFSTVASMEGIVKVSTGKLISLSEQELVDCD-------VGMQNKGCGGGLMDNAFEF 195

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           I+  GG++ E DYPYTG D G+C  +K S IAA++  +  + ++++      V   P++
Sbjct: 196 IVNNGGLDTEADYPYTGAD-GTCNSNKESNIAASIKGYEDVPANDEASLQKAVAAQPVS 253


>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
          Length = 340

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 91/253 (35%), Positives = 129/253 (50%), Gaps = 23/253 (9%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           +HH+ L+K  + K Y  + E   R  +++ NL+      L     +H    G+   +D+T
Sbjct: 34  DHHWDLWKKTYGKQYTEENEEVTRRFIWEKNLKYVMLHNLEHSMGMHSYDLGMNHLADMT 93

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
             E     + L   LR+P+  Q+       P   LP   DWRD G VT VK QG+CGSCW
Sbjct: 94  SEEV----MLLMSSLRVPSQWQRNVTFKSNPNQKLPDSMDWRDKGCVTEVKYQGSCGSCW 149

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA GALE    L TG+LVSLS Q LVDC            + GCNGG M  AF+YI+ 
Sbjct: 150 AFSAVGALEAQLKLKTGKLVSLSVQNLVDCS------TGKYSNKGCNGGFMTEAFQYIID 203

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVA 281
             G++ E  YPY   D G C++D    AA  S +  +   +E+ +   +   GP++    
Sbjct: 204 NNGIDSEASYPYKAMD-GKCQYDVKNRAATCSKYVELPFGNEEALKEAVANKGPVS---V 259

Query: 282 SIELPHISFSFLF 294
           +I+  H SF FL+
Sbjct: 260 AIDASHPSF-FLY 271


>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
 gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
          Length = 333

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 94/253 (37%), Positives = 131/253 (51%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
           +   NA+ H   +KS + + Y T EE ++R  V++ N++  +          HG T    
Sbjct: 22  NQTFNAQWH--KWKSTYRRLYGTNEE-EWRRAVWEKNMKMIELHNGEYSEGKHGYTMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    LP   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQLVNGYKHQKHRKGKVFQEPLML--QLPKSVDWREKGCVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA GALEG   L TG LVSLSEQ LVDC            + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSQ-------AEGNQGCNGGLMDFAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           +L   G++ E+ YPY   D G+CK+     AA  + +  I   E  +   +   GP+A  
Sbjct: 190 VLNNKGLDSEESYPYEAKD-GTCKYKPEFAAANDTGYVDIPQLEKALMKAVATVGPIA-- 246

Query: 280 VASIELPHISFSF 292
             +I+  H SF F
Sbjct: 247 -IAIDASHPSFQF 258


>gi|6467382|gb|AAF13146.1|AF136279_1 cathepsin F precursor [Homo sapiens]
          Length = 484

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 100/238 (42%), Positives = 137/238 (57%), Gaps = 14/238 (5%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
           S+D  +     F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTK
Sbjct: 176 SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 235

Query: 101 FSDLTPSEFRRQFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
           FSDLT  EFR  +L  N  LR  P +  K      +  P ++DWR  GAVT VKDQG CG
Sbjct: 236 FSDLTEEEFRTIYL--NTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCG 293

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FS TG ++G  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  
Sbjct: 294 SCWAFSVTGNVKGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSA 344

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           I   GG+E E DY Y G    SC F   K    +++   +S +E ++AA L K GP++
Sbjct: 345 IKNLGGLETEDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 401


>gi|258406688|gb|ACV72067.1| putative cysteine protease [Lathyrus sativus]
          Length = 350

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 104/275 (37%), Positives = 149/275 (54%), Gaps = 26/275 (9%)

Query: 8   SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTY 64
           SLL++L     A+A     D   IR V  SD E+    ++    H   F+ F +++ K Y
Sbjct: 5   SLLIVLFCVTTAAAGFSFHDSNPIRMV--SDAEEQLLQVIGESRHAVSFARFANRYGKLY 62

Query: 65  ATQEEHDYRFRVFKANL---RRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLR 121
            + +E   RF++F  NL   R   +R+L   +   GV  F+D T  EF+   LG  +   
Sbjct: 63  DSVDEMKLRFKIFSENLELIRSTNKRRL---SYKLGVNHFADWTWEEFKSHRLGAAQNC- 118

Query: 122 LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
             A  +    +   +LP + DWR  G V+ VKDQG CGSCW+FS TGALE A+  + G+ 
Sbjct: 119 -SATLKGNHKITDANLPDEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKN 177

Query: 182 VSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           +SLSEQQLVDC        +G+ ++ GC+GGL + AFEYI   GG+E E+ YPYTG++ G
Sbjct: 178 ISLSEQQLVDC--------AGAFNNFGCSGGLPSQAFEYIKYNGGLETEETYPYTGSN-G 228

Query: 241 SCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
            CKF    +A  V    N ++ S DE + A    +
Sbjct: 229 LCKFTSENVALKVLGSVNITLGSEDELKHAVAFAR 263


>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
          Length = 346

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 93/259 (35%), Positives = 139/259 (53%), Gaps = 23/259 (8%)

Query: 43  EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTK 100
           +D L+  + H   + ++  +TYA   E + R+ VFK N+ R +R   +    T    V +
Sbjct: 29  DDELIMQKKH-DEWMAEHGRTYADMNEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQ 87

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPI------LPTNDLPTDFDWRDHGAVTGVKD 154
           F+DLT  EFR  + G      L + +Q          +    LP   DWR  GAVT +K+
Sbjct: 88  FADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRYQNVFFGALPIAVDWRKKGAVTPIKN 147

Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
           QG+CG CW+FSA  A+EGA  +  G+L+SLSEQQLVDCD         + D GC+GGLM+
Sbjct: 148 QGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD---------TNDFGCSGGLMD 198

Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKH 273
           +AFE+I+  GG+  E +YPY G D  +CK   +K  AA+++ +  +  +++      V H
Sbjct: 199 TAFEHIMATGGLTTESNYPYKGED-ANCKIKSTKPSAASITGYEDVPVNDENALMKAVAH 257

Query: 274 GPLAGNVASIELPHISFSF 292
            P++     IE     F F
Sbjct: 258 QPVS---VGIEGGGFDFQF 273


>gi|426369382|ref|XP_004051670.1| PREDICTED: cathepsin F [Gorilla gorilla gorilla]
          Length = 517

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 133/227 (58%), Gaps = 14/227 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
           F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 220 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 279

Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  N  LR  P +  K      +  P ++DWR  GAVT VKDQG CGSCW+FS TG +
Sbjct: 280 IYL--NSLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 337

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I   GG+E E 
Sbjct: 338 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 388

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY Y G    SC F   K    +++   +S +E ++AA L K GP++
Sbjct: 389 DYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 434


>gi|52546920|gb|AAU81593.1| cysteine proteinase [Petunia x hybrida]
          Length = 210

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 84/214 (39%), Positives = 123/214 (57%), Gaps = 21/214 (9%)

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----- 116
           K Y + EE  +RF +FK NL+    R  +      G+ +FSDL+  EF++ +LGL     
Sbjct: 6   KIYESIEEKLHRFEIFKENLKHIDERNKIVSNYWLGLNEFSDLSHDEFKKMYLGLKVDHD 65

Query: 117 --NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
             N + +   D +    +   DLP   DWR  GAVT VK+QG CGSCW+FS   A+EG +
Sbjct: 66  LLNNKKQSQQDFEYRDFV---DLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGIN 122

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
            + TG L SLSEQ+L+DCD         + ++GCNGGLM+ AF++I+  GG+ +E DYPY
Sbjct: 123 QIKTGNLTSLSEQELIDCD--------TTYNNGCNGGLMDYAFQFIISNGGLHKEDDYPY 174

Query: 235 TGTDGGSC--KFDKSKIAAAVSNFSVISSDEDQM 266
              + G+C  K D+S++        V ++DE  +
Sbjct: 175 L-MEEGTCDEKRDESEVVTIDGYRDVPANDEQSL 207


>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
 gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 89/279 (31%), Positives = 145/279 (51%), Gaps = 17/279 (6%)

Query: 7   SSLLLLLLSSVLASAVAVNDDDAMIRQVVPSD--GEQSEDHLLNAEHHFSLFKSKFSKTY 64
           S +L++L+   L +A    D   +      SD    +S+  + N    + +   K +   
Sbjct: 8   SPMLVILIVFTLFTATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNNNI 67

Query: 65  ATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN------R 118
              E+ D RF +FK NL+        + T   G+ +F+DL+  E+R ++LG         
Sbjct: 68  DGSEK-DKRFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMM 126

Query: 119 RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
             R    + +      + LP   DWR  GAV  VKDQG+CGSCW+FS   A+EG + + T
Sbjct: 127 MARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVT 186

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GELVSLSEQ+LVDCD         + ++GC+GGLM  AFE+I+  GG++ ++DYPY G D
Sbjct: 187 GELVSLSEQELVDCDR--------TVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVD 238

Query: 239 GGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           G   ++ K+    ++ ++  + + ++      V + P++
Sbjct: 239 GKCDQYKKNARVVSIDDYEQVPAYDELALKKAVANQPIS 277


>gi|390994425|gb|AFM37362.1| cathepsin L2 [Dictyocaulus viviparus]
          Length = 352

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 98/247 (39%), Positives = 132/247 (53%), Gaps = 24/247 (9%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           +K K+ K Y  +EE+DY    F  N+          +L   T   G+   +DL  SE+R+
Sbjct: 48  YKIKYDKHYDPEEENDY-MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRK 106

Query: 112 QFLGLNRRLRLPADAQKAP----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
             L   R  RL  D+ +      ++P N  +P   DWR+H  VT VK+QG CGSCW+FSA
Sbjct: 107 --LNGYRHRRLFGDSMRKNGTKFLVPFNVKVPDSVDWREHNLVTPVKNQGMCGSCWAFSA 164

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TGALEG HF +TG+LVSLSEQ LVDC        +   + GCNGGLM+ AFEYI    G+
Sbjct: 165 TGALEGQHFRATGKLVSLSEQNLVDCS-------TKYGNHGCNGGLMDLAFEYIKDNHGI 217

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIEL 285
           + E+ YPY G +   C F K  I A    F  +   DED +   +   GP++    +I+ 
Sbjct: 218 DTEEGYPYVGKE-MRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGPIS---IAIDA 273

Query: 286 PHISFSF 292
            H SF  
Sbjct: 274 GHRSFQL 280


>gi|2499879|sp|Q40143.1|CYSP3_SOLLC RecName: Full=Cysteine proteinase 3; Flags: Precursor
 gi|1235545|emb|CAA88629.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
          Length = 356

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 108/285 (37%), Positives = 153/285 (53%), Gaps = 29/285 (10%)

Query: 1   MERLILSSLLLLLLSSVLASAVA---VNDDDAMIRQVVPSDGEQSEDHLLNAEHH----- 52
           M RL   SL+L+L++ + A+A+A      D   IRQVV  D  + E+ +L          
Sbjct: 1   MSRL---SLVLILVAGLFATALAGPATFADKNPIRQVVFPD--ELENGILQVVGQTRSAL 55

Query: 53  -FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ F  +  K Y + EE   RF +F  NL+  +       +   G+ +F+DLT  EFR+
Sbjct: 56  SFARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGINEFTDLTWDEFRK 115

Query: 112 QFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
             LG ++     +   K  +  TN  LP   DWR  G V+ VK QG CGSCW+FS TGAL
Sbjct: 116 HKLGASQNC---SATTKGNLKLTNVVLPETKDWRKDGIVSPVKAQGKCGSCWTFSTTGAL 172

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           E A+  + G+ +SLSEQQLVDC    +       + GCNGGL + AFEYI   GG++ E+
Sbjct: 173 EAAYAQAFGKGISLSEQQLVDCAGAFN-------NFGCNGGLPSQAFEYIKFNGGLDTEE 225

Query: 231 DYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
            YPYTG + G CKF ++ I   V    N ++ +  E + A  LV+
Sbjct: 226 AYPYTGKN-GICKFSQANIGVKVISSVNITLGAEYELKYAVALVR 269


>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
 gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 90/251 (35%), Positives = 129/251 (51%), Gaps = 11/251 (4%)

Query: 43  EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKF 101
           E H L        + +K  K Y   +E   RF++FK+N+   +      + + + G+ KF
Sbjct: 29  ELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIFKSNVVFIESFNTAGNKSYMLGINKF 88

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
           +DLT  EFR  + G  R L                LP+  DWR  GAVT +KDQG CGSC
Sbjct: 89  ADLTNEEFRAFWNGYKRPLGASRKITPFKYENVTALPSSIDWRSKGAVTPIKDQGVCGSC 148

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FSA  A EG H L TG+LVSLSEQ+LVDCD +         D GC GGLM  AF++I 
Sbjct: 149 WAFSAVAATEGIHKLRTGKLVSLSEQELVDCDVKGQ-------DKGCQGGLMVDAFKFIK 201

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
           + GG+  E +YPY G DG      ++  A  ++ +  +  + +      V + P++    
Sbjct: 202 RHGGMTSEANYPYQGRDGKCDTKKEASRAVKITGYQAVPKNSEAALLKAVANQPVS---V 258

Query: 282 SIELPHISFSF 292
           +I+   +SF F
Sbjct: 259 AIDAGSLSFQF 269


>gi|345493482|ref|XP_001602523.2| PREDICTED: cathepsin L-like [Nasonia vitripennis]
          Length = 514

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 93/255 (36%), Positives = 136/255 (53%), Gaps = 24/255 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-----AKRRQLLDPTAVHGVTKFSDLTPS 107
           +  FK +  K Y +  E  +R ++F  N  +     AK    L P  +  + K++D+   
Sbjct: 28  WKTFKVQHKKGYNSDIEEKFRMKIFMENKHKIAKHNAKYEMGLVPYKLQ-INKYADMLHH 86

Query: 108 EFRRQFLGLNRRLRLPADAQKAP-----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSC 161
           EF     G N+       + + P     I P + +LP   DWR  GAVT +KDQG CGSC
Sbjct: 87  EFVNTLNGFNKTKPGMLQSYQKPVGAKFIAPAHVELPKSVDWRQEGAVTPIKDQGHCGSC 146

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           WSFSATGALEG HF  TG+LVSLSEQ L+DC  +         ++GCNGGLM++AF+YI 
Sbjct: 147 WSFSATGALEGQHFRQTGKLVSLSEQNLIDCSGKYG-------NNGCNGGLMDNAFKYIR 199

Query: 222 KAGGVEREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
              G++ E  YPY   D   C+++ ++  A  V    +   DE+++ A +   GP++   
Sbjct: 200 DNKGLDTESTYPYEAED-DECRYNARNSGAEDVGFVDIPEGDEEKLKAAIATIGPVS--- 255

Query: 281 ASIELPHISFSFLFT 295
            +I+  H +F F  T
Sbjct: 256 VAIDASHQTFQFYST 270



 Score =  113 bits (283), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 66/155 (42%), Positives = 92/155 (59%), Gaps = 16/155 (10%)

Query: 142 DWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEES 201
           D+   GAVT VK+QG CGSCW+FSATG+LEG HF   G L+SLSEQ LVDC        S
Sbjct: 300 DYWKQGAVTPVKNQGNCGSCWAFSATGSLEGQHFRHNGSLISLSEQNLVDC--------S 351

Query: 202 GS-CDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVI 259
           G   + GC+GGLMN+AF Y+    G++ EK YPY   D   C+++    AA  + + ++ 
Sbjct: 352 GRFGNDGCDGGLMNNAFTYVKVNRGLDSEKSYPYEAED-DRCRYNPKNSAADDAGYVNIP 410

Query: 260 SSDEDQMAANLVKHGPLAGNVASIELPHISFSFLF 294
           +  E ++ A +   GP+     S+ +   S SF+F
Sbjct: 411 TGSESKLQAAVATVGPI-----SVAIDADSDSFMF 440


>gi|441611591|ref|XP_003273955.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Nomascus leucogenys]
          Length = 548

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 104/254 (40%), Positives = 144/254 (56%), Gaps = 22/254 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
           F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 257 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 316

Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  N  LR  P +  K      +  P ++DWR  GAVT VKDQG CGSCW+FS TG +
Sbjct: 317 IYL--NPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 374

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I   GG+E E 
Sbjct: 375 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 425

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL----- 285
           DY Y G    SC F   K    +++   +S +E ++AA L K GP++  + +  +     
Sbjct: 426 DYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQVRPX 484

Query: 286 PHISFSFLFTVSSP 299
           PH S    + ++SP
Sbjct: 485 PHCS---AWIINSP 495


>gi|3641698|dbj|BAA33398.1| preprocathepsin L [Bos taurus]
          Length = 301

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 92/249 (36%), Positives = 126/249 (50%), Gaps = 17/249 (6%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSD 103
           N + H+  +K+   + Y   EE ++R  V++ N +             HG    +  F D
Sbjct: 24  NLDAHWHQWKATHRRLYGMNEE-EWRRAVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGD 82

Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           +T  EFR+   G   +          P+L   D+P   DW   G VT VK+QG CGSCW+
Sbjct: 83  MTNEEFRQVMNGFQNQKHKKGKLFHEPLLV--DVPKSVDWTKKGYVTPVKNQGQCGSCWA 140

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSATGALEG  F  TG+LVSLSEQ LVDC            + GCNGGLM++AF+YI   
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AQGNQGCNGGLMDNAFQYIKDN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG++ E+ YPY  TD  SC +     AA  + F  I   E  +   +   GP++    +I
Sbjct: 194 GGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAI 250

Query: 284 ELPHISFSF 292
           +  H SF F
Sbjct: 251 DAGHTSFQF 259


>gi|109940313|sp|P25975.3|CATL1_BOVIN RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
           heavy chain; Contains: RecName: Full=Cathepsin L1 light
           chain; Flags: Precursor
 gi|74354943|gb|AAI02313.1| CTSL2 protein [Bos taurus]
 gi|154425700|gb|AAI51426.1| Cathepsin L2 [Bos taurus]
 gi|296484466|tpg|DAA26581.1| TPA: cathepsin L2 precursor [Bos taurus]
 gi|440898893|gb|ELR50299.1| Cathepsin L1 [Bos grunniens mutus]
          Length = 334

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 92/249 (36%), Positives = 126/249 (50%), Gaps = 17/249 (6%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSD 103
           N + H+  +K+   + Y   EE ++R  V++ N +             HG    +  F D
Sbjct: 24  NLDAHWHQWKATHRRLYGMNEE-EWRRAVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGD 82

Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           +T  EFR+   G   +          P+L   D+P   DW   G VT VK+QG CGSCW+
Sbjct: 83  MTNEEFRQVMNGFQNQKHKKGKLFHEPLLV--DVPKSVDWTKKGYVTPVKNQGQCGSCWA 140

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSATGALEG  F  TG+LVSLSEQ LVDC            + GCNGGLM++AF+YI   
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AQGNQGCNGGLMDNAFQYIKDN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG++ E+ YPY  TD  SC +     AA  + F  I   E  +   +   GP++    +I
Sbjct: 194 GGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAI 250

Query: 284 ELPHISFSF 292
           +  H SF F
Sbjct: 251 DAGHTSFQF 259


>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 333

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 95/257 (36%), Positives = 140/257 (54%), Gaps = 24/257 (9%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVT 99
           D  L+++ H   +K++  +TYA  E+  +R   ++ NL+  +   L      H    G+ 
Sbjct: 22  DQTLDSQWH--QWKAQHRRTYAANED-GWRRATWEKNLKMIEMHNLEYSAGKHSFQLGMN 78

Query: 100 KFSDLTPSEFRRQFLGLNR---RLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQG 156
           KF D+T  EF++   G N    + R      + P+L    LP   DWR+ G VT VK+QG
Sbjct: 79  KFGDMTTEEFKQVMNGYNSNGSQKRTKGSLYREPLLA--QLPKSVDWREKGYVTPVKNQG 136

Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
            CGSCW+FSATG+LEG  F  T +LVSLSEQ LVDC        +   ++GC+GGLM++A
Sbjct: 137 QCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCS-------TSEGNNGCSGGLMDNA 189

Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGP 275
           FEY+   GG++ E+ YPY G D   CK+      A V+ F  I S +E  +   +   GP
Sbjct: 190 FEYVKNNGGIDTEQAYPYLGQD-NECKYRAECSGANVTGFVDIPSMNERALMKAVANVGP 248

Query: 276 LAGNVASIELPHISFSF 292
           ++    +I+  + SF F
Sbjct: 249 IS---VAIDAGNPSFQF 262


>gi|21483188|gb|AAK77918.1| cathepsin L 1 [Dictyocaulus viviparus]
          Length = 347

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 98/247 (39%), Positives = 132/247 (53%), Gaps = 24/247 (9%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           +K K+ K Y  +EE+DY    F  N+          +L   T   G+   +DL  SE+R+
Sbjct: 43  YKIKYDKHYDPEEENDY-MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRK 101

Query: 112 QFLGLNRRLRLPADAQKAP----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
             L   R  RL  D+ +      ++P N  +P   DWR+H  VT VK+QG CGSCW+FSA
Sbjct: 102 --LNGYRHRRLFGDSMRKNGTKFLVPFNVKVPDSVDWREHNLVTPVKNQGMCGSCWAFSA 159

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TGALEG HF +TG+LVSLSEQ LVDC        +   + GCNGGLM+ AFEYI    G+
Sbjct: 160 TGALEGQHFRATGKLVSLSEQNLVDC-------STKYGNHGCNGGLMDLAFEYIKDNHGI 212

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIEL 285
           + E+ YPY G +   C F K  I A    F  +   DED +   +   GP++    +I+ 
Sbjct: 213 DTEEGYPYVGKE-MRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGPIS---IAIDA 268

Query: 286 PHISFSF 292
            H SF  
Sbjct: 269 GHRSFQL 275


>gi|118360450|ref|XP_001013459.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89295226|gb|EAR93214.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 320

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 89/200 (44%), Positives = 119/200 (59%), Gaps = 23/200 (11%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTP 106
           N +  +S FK+ ++K YA  +   YR  VF  NL+      ++D    + G+TKF DLT 
Sbjct: 38  NIKTLWSTFKNSYNKKYADPDFEQYRIEVFTENLK------IIDSNCQNFGITKFMDLTQ 91

Query: 107 SEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDF--DWRDHGAVTGVKDQGACGSCWSF 164
            EF++ +L L  +  +    ++ P    ND   D   DW   GAVT VKDQG CGSCWSF
Sbjct: 92  EEFKQTYLTLKTKKYI----EEIPETVFNDSNGDIEIDWTMKGAVTPVKDQGKCGSCWSF 147

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           S TGA+EGAHFLS+ ELVSLSEQ L+DC        S + + GCNGGLM++AF++I +  
Sbjct: 148 STTGAVEGAHFLSSNELVSLSEQYLIDC--------SKNGNEGCNGGLMDTAFDFIAQ-N 198

Query: 225 GVEREKDYPYTGTDGGSCKF 244
           G+  E  YPY   D G+CK 
Sbjct: 199 GIPTENAYPYKALD-GTCKM 217


>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 94/242 (38%), Positives = 131/242 (54%), Gaps = 19/242 (7%)

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRRQFLGLNRR- 119
           K Y    E + RF++FK N+ R +     +      G  KFSDLT  EFR    G  R  
Sbjct: 51  KVYKDLNEKEVRFQIFKENVERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSH 110

Query: 120 -LRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
              + +   K     TN  D+P   DWR  GAVT +KDQ  CG CW+FSA  A+EG H L
Sbjct: 111 PKVMTSSKGKTHFRYTNVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQL 170

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            TGEL+ LSEQ+LVDCD E +       D GC+GGL+++AF++ILK  G+  E +YPY G
Sbjct: 171 KTGELIPLSEQELVDCDVEGE-------DEGCSGGLLDTAFDFILKNKGLTTEVNYPYKG 223

Query: 237 TDGGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
            D G C   KS ++AA ++ +  + ++ ++     V + P+     S+ +   SF F F 
Sbjct: 224 ED-GVCNKKKSALSAAKITGYEDVPANSEKALLQAVANQPV-----SVAIDGSSFDFQFY 277

Query: 296 VS 297
            S
Sbjct: 278 SS 279


>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
 gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
          Length = 463

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 92/245 (37%), Positives = 130/245 (53%), Gaps = 28/245 (11%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR---------RAKRRQLLDPTAVHGVTK 100
           E  F  + ++  K YAT EE   R  VF  N            A       P+    +  
Sbjct: 38  EALFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNA 97

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPT--------NDLPTDFDWRDHGAVTGV 152
           F+DLT  EFR   LG   R+   A A ++P  P           +P   DWR++GAVT V
Sbjct: 98  FADLTHEEFRAARLG---RIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKV 154

Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
           KDQG+CG+CWSFSATGA+EG + + TG LVSLSEQ+L+DCD         S +SGC GGL
Sbjct: 155 KDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDR--------SYNSGCGGGL 206

Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
           M+ A+++++K GG++ E+DYPY   DG   K    K    +  +S + S+++ +    V 
Sbjct: 207 MDYAYKFVVKNGGIDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVA 266

Query: 273 HGPLA 277
             P++
Sbjct: 267 QQPVS 271


>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
          Length = 372

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 87/237 (36%), Positives = 126/237 (53%), Gaps = 22/237 (9%)

Query: 68  EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRRLRLP 123
           +EH  RF +FK N++        D     G+ KF+DL+  EF+   +      ++ LR  
Sbjct: 61  DEHARRFEIFKENVKHIDSVNKKDGPYKLGLNKFADLSNEEFKAMHMTTKMEKHKSLRGD 120

Query: 124 ADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
              +    +  N   LP   DWR  GAVT VK+QG CGSCW+FS   ++EG +++ TG+L
Sbjct: 121 RGVESGSFMYQNSKRLPASIDWRKKGAVTPVKNQGQCGSCWAFSTIASVEGINYIKTGKL 180

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQQLVDC  E         ++GCNGGLM++AF+YI+  GG+  E +YPYT  + G 
Sbjct: 181 VSLSEQQLVDCSKE---------NAGCNGGLMDNAFQYIIDNGGIVTEDEYPYT-AEAGE 230

Query: 242 C---KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFT 295
           C   K +   IA  +  F  + ++ +      V H P++    +IE     F F  T
Sbjct: 231 CSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVS---IAIEASGHDFQFYST 284


>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
 gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
          Length = 461

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 87/251 (34%), Positives = 134/251 (53%), Gaps = 22/251 (8%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+ +      ++ + S+  +TY    E + RF VF+ NLR   +        
Sbjct: 26  IVSYGERSEEEV---RRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAG 82

Query: 95  VH----GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDH 146
           +H    G+ +F+DLT  E+R  +LG     +R  +L A  Q        +LP   DWR  
Sbjct: 83  LHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQADD---NEELPETVDWRKK 139

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAV  +KDQG CGSCW+FSA  A+EG + + TG+++ LSEQ+LVDCD         S + 
Sbjct: 140 GAVAAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNE 191

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GCNGGLM+ AFE+I+  GG++ E+DYPY   D       K+     +  +  +  + ++ 
Sbjct: 192 GCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKS 251

Query: 267 AANLVKHGPLA 277
               V + P++
Sbjct: 252 LQKAVANQPIS 262


>gi|300175452|emb|CBK20763.2| unnamed protein product [Blastocystis hominis]
          Length = 313

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 95/248 (38%), Positives = 133/248 (53%), Gaps = 25/248 (10%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           L  E+ F+ F++++ K Y    E  +R +VF  N+  A++    D     G T F+D+T 
Sbjct: 17  LRYENTFNSFEARYGKNYINAAERAFRQKVFAYNMEWAQKINSEDHPYTVGATPFADMTN 76

Query: 107 SEFRRQFLG---LNRRLRLPADAQKAPIL-PTNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
           +EF    L    L  ++  PA     PI+ P  +     DWR+ GAVT VK+Q +CGSCW
Sbjct: 77  TEFAVSKLCGCMLKPKMTKPA----TPIMEPAAEA---VDWREKGAVTPVKNQASCGSCW 129

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSATGA+EG +F++ GEL+SLSEQQLVDCDH+          SGC GGLM  AFEY  K
Sbjct: 130 AFSATGAMEGRNFVANGELISLSEQQLVDCDHQ---------SSGCGGGLMTYAFEY-AK 179

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVAS 282
             G+ +E+DYPY   D   CK DK         +  +   +       V  GP++    +
Sbjct: 180 KKGMCKEEDYPYHAVD-EDCKDDKCTPVVFPKGYEEVPRFDGAALKQAVSQGPVS---VA 235

Query: 283 IELPHISF 290
           +E   I F
Sbjct: 236 VEADSIVF 243


>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
          Length = 323

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 93/242 (38%), Positives = 130/242 (53%), Gaps = 17/242 (7%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA--VHGVTKFSDLTPSEFRRQF 113
           +K++ +K Y+   E   R+++++ N +  +             G+ KF DL   EF   F
Sbjct: 25  WKNEHNKKYSDDLEELTRYKIWQGNQKIIEVHNANSDKFGFTLGMNKFGDLESHEFAEMF 84

Query: 114 LGLNRRLRLPADAQKAPIL-PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEG 172
            G   + R  +++ K  +  P        DWR  GAVTGVK+QG CGSCW+FS TG+LEG
Sbjct: 85  NGYMMQAR--SNSTKVFVADPNYKADPTVDWRTKGAVTGVKNQGQCGSCWAFSTTGSLEG 142

Query: 173 AHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDY 232
            HFL TG+LVSLSEQ LVDC  +   E       GCNGGLM+ AFEYI K GG++ E  Y
Sbjct: 143 QHFLKTGKLVSLSEQNLVDCSGKEGNE-------GCNGGLMDQAFEYIKKNGGIDTEASY 195

Query: 233 PYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIELPHISFS 291
           PY   D   C+F  S + A  + +  I   DE+ +   + K GP++    +I+  H SF 
Sbjct: 196 PYQAHD-ERCRFKASDVGATCTGYVDIKREDENALMQAVEKIGPVS---VAIDASHSSFQ 251

Query: 292 FL 293
             
Sbjct: 252 LY 253


>gi|395742406|ref|XP_003777749.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pongo abelii]
          Length = 490

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 98/227 (43%), Positives = 132/227 (58%), Gaps = 14/227 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
           F  F   +++TY ++EE  +R  +F  N+ RA++ Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 193 FKNFVITYNRTYESKEEARWRLSIFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 252

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  N  LR     +        DL P ++DWR  GAVT VKDQG CGSCW+FS TG +
Sbjct: 253 IYL--NPLLREEPSNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 310

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I   GG+E E 
Sbjct: 311 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 361

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY Y G    SC F   K    +++   +S +E ++AA L K GP++
Sbjct: 362 DYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 407


>gi|440792185|gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
          Length = 331

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 85/239 (35%), Positives = 129/239 (53%), Gaps = 14/239 (5%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL-LDPTAVHGVTKFSDLTPSEFR 110
            F+ F  ++ K+YA+ EE + RF +F  NL       +  +     G+TKF+D++  EF+
Sbjct: 33  QFNAFVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEGKTQFGITKFADMSQEEFQ 92

Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDH-GAVTGVKDQGACGSCWSFSATGA 169
            + L  N          + P       P+ FDWR+  G VT V DQG CGSCW+FSAT  
Sbjct: 93  SRVLMSNPPPPPTEKPYRGPKFEGFTAPSTFDWRNKPGVVTPVYDQGQCGSCWAFSATEN 152

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +E    L+  +L  LS QQ+VDC            D GC GG  + A++Y++ A G++  
Sbjct: 153 IESQWALAGHKLTGLSMQQIVDCSW---------WDDGCGGGFPSYAYDYVIDAPGLDAL 203

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD--EDQMAANLVKHGPLAGNVASIELP 286
            +YPYT   GGSC F +S++ A +S+++  ++D  E QMA  L +HGP++  V +   P
Sbjct: 204 ANYPYTAV-GGSCAFKESQVVAKISSWTYTTTDSNEHQMANYLAQHGPISVCVDAESWP 261


>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 466

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 90/251 (35%), Positives = 132/251 (52%), Gaps = 22/251 (8%)

Query: 51  HHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSEF 109
           H +  +  K  K Y    E + RF++FK NLR  +      D +   G+ KF+DLT  E+
Sbjct: 46  HVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADLTNEEY 105

Query: 110 RRQFLGLNRRLRLPADAQKAPILPTN--------DLPTDFDWRDHGAVTGVKDQGACGSC 161
           R  FLG   R R P +        T+        +LP   DWR+ GAVT +KDQG CGSC
Sbjct: 106 RAMFLGT--RTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKDQGQCGSC 163

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS  GA+EG + + TG L SLSEQ+LVDCD           + GCNGGLM+ AFE+I+
Sbjct: 164 WAFSTVGAVEGINQIVTGNLTSLSEQELVDCDR--------GYNMGCNGGLMDYAFEFIV 215

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVA 281
           + GG++ E+DYPY   D       K+     +  +  + +++++     V + P++    
Sbjct: 216 QNGGIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVS---V 272

Query: 282 SIELPHISFSF 292
           +IE   + F  
Sbjct: 273 AIEAGGMEFQL 283


>gi|74200292|dbj|BAE22939.1| unnamed protein product [Mus musculus]
          Length = 308

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 91/248 (36%), Positives = 130/248 (52%), Gaps = 20/248 (8%)

Query: 49  AEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSDL 104
           AE H   +KS   + Y T EE ++R  +++ N+R  +          HG    +  F D+
Sbjct: 1   AEWH--QWKSTHRRLYGTNEE-EWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDM 57

Query: 105 TPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           T  EFR+   G   +        + P++    +P   DWR+ G VT VK+QG CGSCW+F
Sbjct: 58  TNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQCGSCWAF 115

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           SA+G LEG  FL TG+L+SLSEQ LVDC H          + GCNGGLM+ AF+YI + G
Sbjct: 116 SASGCLEGQMFLKTGKLISLSEQNLVDCSH-------AQGNQGCNGGLMDFAFQYIKENG 168

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
           G++ E+ YPY   D GSCK+      A  + F  I   E  +   +   GP++    +++
Sbjct: 169 GLDSEESYPYEAKD-GSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPIS---VAMD 224

Query: 285 LPHISFSF 292
             H S  F
Sbjct: 225 ASHPSLQF 232


>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
          Length = 372

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 89/247 (36%), Positives = 134/247 (54%), Gaps = 19/247 (7%)

Query: 54  SLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKFSDLTPSEF 109
           S  ++   K Y +  E  YR ++F  N R+     ++ ++ +     G+ K+ D+   E 
Sbjct: 64  SCHRTHHKKVYKSPIEEGYRMKIFLDNKRKIVEHNRKYEMKEVNYKLGMNKYGDMLHHEL 123

Query: 110 RRQFLGLNRRLRLPADAQKAP--ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
                G N+ + +  +       I P N +LP   DWR  GAVT +KDQG CGSCW+FS+
Sbjct: 124 INTLNGFNKSVTVSEEQLIGATFIEPANVELPKSVDWRKKGAVTAIKDQGQCGSCWAFSS 183

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TGALEG HF  +G LVSLSEQ L+DC  +         ++GCNGGLM+ AF YI +  G+
Sbjct: 184 TGALEGQHFRQSGVLVSLSEQNLIDCSGKYG-------NNGCNGGLMDYAFRYIKENKGL 236

Query: 227 EREKDYPYTGTDGGSCKFD-KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
           + EK YPY   +   C+++ K+  A+ V    +   DED++ A +   GP++    +I+ 
Sbjct: 237 DTEKSYPYE-AENDQCRYNPKNSGASDVGFVDIPEGDEDKLKAAVATIGPIS---VAIDA 292

Query: 286 PHISFSF 292
            H SF F
Sbjct: 293 SHESFHF 299


>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
          Length = 463

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 99/287 (34%), Positives = 153/287 (53%), Gaps = 18/287 (6%)

Query: 11  LLLLSSVLA-SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           +LLL +VLA SA+A +   A    +     +  ED  +     + L+ ++  K Y    E
Sbjct: 3   ILLLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAI--MELYELWLAQHKKAYNGLGE 60

Query: 70  HDYRFRVFKAN-LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--LNRRLRLP-AD 125
              RF VFK N L   +     +P+   G+ +F+DL+  EF+  +LG  L+ + RL  + 
Sbjct: 61  KQNRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSNSP 120

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
           + +       DLP   DWR+ GAVT VKDQG+CGSCW+FS   A+EG + + TG L SLS
Sbjct: 121 SPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQ+LVDCD         S + GCNGGLM+ AF++I+  GG++ E DYPY   DG    + 
Sbjct: 181 EQELVDCDT--------SYNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDGSCDAYR 232

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           K+     + ++  +  ++++       + P++    +IE    +F F
Sbjct: 233 KNAHVVTIDDYEDVPENDEKSLKKAAANQPIS---VAIEASGRAFQF 276


>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 94/251 (37%), Positives = 132/251 (52%), Gaps = 22/251 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL----LDPTAVHGVTKFSDLT 105
           E  F  FKS F + Y + E   +R  +F+ANL+   R  +     D T    V  F+DL+
Sbjct: 30  EAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLS 89

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
             EFR  F G  R   L A +    +   ND   LP   DW   G VT +K+Q  CGSCW
Sbjct: 90  NEEFRATFNGYRR---LAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCW 146

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA  ++EG H L TG+LVSLSEQ LVDC        +   D GC+GG M+ AF+Y+++
Sbjct: 147 AFSAVASMEGQHALKTGKLVSLSEQNLVDC-------SAAEGDMGCSGGWMDYAFKYVIQ 199

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVA 281
             G++ E  YPY   D  SC+F ++ I A + +F  V + DE  +   +   GP++    
Sbjct: 200 NRGIDTEASYPYKAID-ESCEFKRNSIGATIHSFVDVKTGDESALQNAVASIGPIS---V 255

Query: 282 SIELPHISFSF 292
           +I+    SF F
Sbjct: 256 AIDASQPSFQF 266


>gi|417399134|gb|JAA46597.1| Putative cathepsin l1 [Desmodus rotundus]
          Length = 335

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 93/254 (36%), Positives = 133/254 (52%), Gaps = 20/254 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
           DH LNAE  +  +K+ + + Y   EE  +R  V++ N +  +          HG T    
Sbjct: 22  DHSLNAE--WYQWKATYRRLYGADEE-GWRRAVWEKNRKMIELHNREYSQRKHGFTMAMN 78

Query: 100 KFSDLTPSEFRRQFLG-LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGAC 158
            F D+T  EFR+   G L ++        + P+    ++P+  DWR  G VT VK+QG C
Sbjct: 79  AFGDMTNEEFRQVMNGFLKQKQHRNGRLFREPLFA--EIPSSVDWRQKGYVTPVKNQGQC 136

Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
           GSCW+FSA GALEG  F  TG+LVSLSEQ LVDC H          + GCNGGLM++AF+
Sbjct: 137 GSCWAFSANGALEGQMFRKTGKLVSLSEQNLVDCSHS-------QGNQGCNGGLMDNAFQ 189

Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAG 278
           Y+    G++ E+ YPY G +  +C +     AA  + F  I   E  +   +   GP++ 
Sbjct: 190 YVKDNKGLDSEESYPYLGRESNTCNYRPEYSAANDTGFVDIPQHERGLMKAVATVGPIS- 248

Query: 279 NVASIELPHISFSF 292
              +I+  H SF F
Sbjct: 249 --VAIDAGHSSFQF 260


>gi|410968392|ref|XP_003990691.1| PREDICTED: cathepsin S, partial [Felis catus]
          Length = 310

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 91/253 (35%), Positives = 132/253 (52%), Gaps = 23/253 (9%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           +HH++L+K  + K Y  + E   R  +++ NL+      L     +H    G+    D+T
Sbjct: 37  DHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMT 96

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
             E     + L   LR+P+  Q+     +N    LP   DWR+ G VT VK QG+CG+CW
Sbjct: 97  SEEV----ISLMGCLRVPSQWQRNVTYKSNSNQKLPDSVDWREKGCVTEVKYQGSCGACW 152

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA GALE    L TG LVSLS Q LVD    C  E+ G  + GCNGG M  AF+YI+ 
Sbjct: 153 AFSAVGALEAQLKLKTGNLVSLSAQNLVD----CSTEKYG--NKGCNGGFMTEAFQYIID 206

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVA 281
             G++ E  YPY   D G C++D    AA  S ++ +    E+ +   +   GP++    
Sbjct: 207 NNGIDSEASYPYKAMD-GKCQYDSKNRAATCSKYTELPFGSEEDLKETVANKGPVS---V 262

Query: 282 SIELPHISFSFLF 294
           +I+  H SF FL+
Sbjct: 263 AIDASHSSF-FLY 274


>gi|375073984|gb|AFA34859.1| cathepsin L-like protein [Trypanosoma rangeli]
          Length = 467

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 97/237 (40%), Positives = 125/237 (52%), Gaps = 24/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           HF+ FK +  K Y +  E  +R  VFK NL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  HFAAFKQRHGKVYRSAAEEAFRLGVFKENLLLARLHAAANPHASFGVTPFSDLTREEFRS 96

Query: 112 QF-------LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           ++           +R R+P + +          P   DWR  GAVT VKDQG CGSCW+F
Sbjct: 97  RYHNAAAHFAAAQKRARVPVEVEVE----VGGAPAAVDWRARGAVTAVKDQGECGSCWAF 152

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--K 222
           S  G +EG   L+   L SLSEQ LV CD+          D+GC+GGLM++AF++I+   
Sbjct: 153 STIGNIEGQWHLAGNPLTSLSEQMLVSCDNA---------DNGCDGGLMDNAFDWIVGKN 203

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E  Y Y    G S K D S   + A +S    +  DED+MAA L  +GPLA
Sbjct: 204 NGTVYTEASYSYVSGGGNSQKCDMSGHVVGAVISGHVDLPKDEDKMAAWLAANGPLA 260


>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
          Length = 340

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 95/253 (37%), Positives = 135/253 (53%), Gaps = 24/253 (9%)

Query: 43  EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTK 100
           +DH+      F  F S+FSK Y ++EE + R + +K+N+        Q    +   G   
Sbjct: 37  QDHI-----DFVHFMSRFSKAYKSKEEFEMRLQQYKSNIAFINNHNSQNDGTSFTLGPNH 91

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
            +D T  E+++  LG   R +   +    P L   D+P   DWR+ GAV  VKDQG CGS
Sbjct: 92  LADYTHDEYKK-MLGYKPRNKTGKEVYSTPNLK--DIPESIDWREKGAVNAVKDQGQCGS 148

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS   +LE  +F+ TG+L SLSEQQLVDC        S + + GCNGG M  A +YI
Sbjct: 149 CWAFSTIASLESRYFIETGKLQSLSEQQLVDC--------SKNGNEGCNGGDMGLAMDYI 200

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
             AGGVE EKDYPY G D  +C F+ SK +A    + +++      + A  +  GP++  
Sbjct: 201 ASAGGVETEKDYPYVGKD-QTCAFEASKEVATDKGHINIVPGKFATLQA-AIAEGPVS-- 256

Query: 280 VASIELPHISFSF 292
             +IE   + F F
Sbjct: 257 -VAIEADSLFFQF 268


>gi|359484377|ref|XP_003633102.1| PREDICTED: thiol protease aleurain-like isoform 2 [Vitis vinifera]
          Length = 318

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 112/300 (37%), Positives = 160/300 (53%), Gaps = 42/300 (14%)

Query: 1   MERLILSSLLLLLLSSVLASAV-----AVNDDDAMIRQVVPS--DGEQSEDHLLNAEHH- 52
           M RL + + +L+LL +V +        +  D++  IR V  S  D E S   L+    H 
Sbjct: 1   MARLSVVAAVLILLCAVASGEADHHFRSSFDEENPIRLVSDSIRDLESSVLRLIGDTRHA 60

Query: 53  --FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSE 108
             F+ F  ++ K+Y T +E   RF +F  NL+  R+  R+ L  T    V +F+D T  E
Sbjct: 61  HSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNRKGLPYTLA--VNQFADWTWEE 118

Query: 109 FRRQFLGL--NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           FRR  LG   N    L  + +   ++    LP   DWR+ G V+ +KDQG CGSCW+FS 
Sbjct: 119 FRRHRLGAAQNCSATLKGNHKLTDVI----LPETKDWREDGIVSPIKDQGHCGSCWTFST 174

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGG 225
           TGALE A+  + G+ +SLSEQQLVDC        +G+ ++ GC+GGL + AFEYI   GG
Sbjct: 175 TGALEAAYAQAFGKGISLSEQQLVDC--------AGAFNNFGCHGGLPSQAFEYIKYNGG 226

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSD------------EDQMAANLVKH 273
           ++ E+ YPYTG D G+CKF    I   V +   I+ D            ED +A  L+K+
Sbjct: 227 LDTEEAYPYTGLD-GTCKFSSENIGVQVLDSVNITLDVNHAVLAVGYGVEDGVAYWLIKN 285


>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
 gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
          Length = 375

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 82/217 (37%), Positives = 123/217 (56%), Gaps = 16/217 (7%)

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEFRRQFLGLN----RRLRL 122
           + D RF +FK NLR        +  A +  G+TKF+DLT  E+R  +LG      RR+  
Sbjct: 69  DQDKRFNIFKDNLRFIDLHNEKNKNATYKLGLTKFTDLTNEEYRSLYLGARTEPVRRIAK 128

Query: 123 PADAQK--APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
             +  +  +  +   ++P   DWR  GAV  +KDQG CGSCW+FS   A+EG + + TGE
Sbjct: 129 AKNVNQKYSAAVDGKEVPETVDWRLKGAVNPIKDQGTCGSCWAFSTAAAVEGINKIVTGE 188

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           L+SLSEQ+LVDCD+        S + GCNGGLM+ AF++I+K GG++ EKDYPY G  G 
Sbjct: 189 LISLSEQELVDCDN--------SYNQGCNGGLMDYAFQFIMKNGGLKTEKDYPYRGFGGK 240

Query: 241 SCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
              F K+    ++  +  + + ++      +   P++
Sbjct: 241 CNSFLKNAKVVSIDGYEDVPTKDETALKRAISLQPVS 277


>gi|49456321|emb|CAG46481.1| CTSF [Homo sapiens]
          Length = 338

 Score =  154 bits (388), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 98/237 (41%), Positives = 135/237 (56%), Gaps = 12/237 (5%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTK 100
           S+D  +     F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTK
Sbjct: 30  SQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTK 89

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
           FSDLT  EFR  +L    R + P +  K      +  P ++DWR  GAVT VKDQG CGS
Sbjct: 90  FSDLTEEEFRTIYLNTLLR-KEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGS 148

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS TG +EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I
Sbjct: 149 CWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAI 199

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
              GG+E   DY Y G    SC F   K    +++   +S +E ++AA L K GP++
Sbjct: 200 KNLGGLETVDDYSYQG-HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 255


>gi|2677828|gb|AAB97142.1| cysteine protease [Prunus armeniaca]
          Length = 358

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 106/274 (38%), Positives = 151/274 (55%), Gaps = 27/274 (9%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDG----EQSEDHLLNAEH---HFSLF 56
           L+LS+ L+L+  S  A+A +  D+   IR V  SDG    EQ    +L       HF+ F
Sbjct: 6   LVLSAALVLVAISCGAAASSF-DESNPIRLV--SDGLRELEQQVVQVLGNSRRALHFARF 62

Query: 57  KSKFSKTYATQEEHDYRFRVFKAN--LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
             ++ K Y + EE   R+ +F  N  L R+  ++ L  T    V +F+D +  EFRRQ L
Sbjct: 63  AHRYGKKYESVEEMKLRYEIFSENKKLIRSTNKKGLPYTL--AVNRFADWSWEEFRRQRL 120

Query: 115 GLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
           G  +     A  + +  L    LP   +WR+ G VT VKDQG CGSCW+FS TGALE A+
Sbjct: 121 GAAQNC--SATTKGSHELTDAVLPESKNWREEGIVTPVKDQGHCGSCWTFSTTGALEAAY 178

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYP 233
             +  + +SLSEQQLVDC        +G+ ++ GC+GGL + AFEYI   GG++ E  YP
Sbjct: 179 VQAFRKQISLSEQQLVDC--------AGAFNNFGCHGGLPSQAFEYIKYNGGLDTEAAYP 230

Query: 234 YTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQM 266
           Y GTD G+CKF    +   V  + ++   DE ++
Sbjct: 231 YVGTD-GACKFSAENVGVQVLDSVNITLGDEQEL 263


>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
          Length = 359

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 107/296 (36%), Positives = 152/296 (51%), Gaps = 34/296 (11%)

Query: 6   LSSLLLLLLSSVLA-SAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTY 64
           ++ L   LLS VL   +VA       + Q +P D    E  L + E  +SL++ K+   +
Sbjct: 1   MAKLSYALLSVVLVLGSVA-------LAQSIPFD----EKDLASEESLWSLYE-KWRAHH 48

Query: 65  ATQ---EEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGL---- 116
           A     ++ D RF VFK N++      Q  D T    + KF D+T  EFR  + G     
Sbjct: 49  AVSRDLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGDMTNQEFRSTYAGSKIDH 108

Query: 117 NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
           +  LR   DA +      +DLPT  DWR+ GAVTGVKDQG CGSCW+FS   A+EG + +
Sbjct: 109 HMTLRGVKDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGINQI 168

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
            T ELVSLSEQQLVDCD +         +SGCNGGLM+ AF++I   GG+  E  YPY  
Sbjct: 169 KTNELVSLSEQQLVDCDTK---------NSGCNGGLMDYAFDFIKNNGGLSSEDSYPYL- 218

Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
            +  SC  + +     +  +  +  + +      V + P++    +IE    +F F
Sbjct: 219 AEQKSCGSEANSAVVTIDGYQDVPRNNEAALMKAVANQPVS---VAIEASGYAFQF 271


>gi|16076437|emb|CAC94443.1| cysteine proteinase [Betula pendula]
          Length = 133

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 72/91 (79%), Positives = 82/91 (90%)

Query: 193 DHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAA 252
           DHECDPEE GSCDSGC+GGLMNSAFEY LKAGG+ RE+DYPYTGTD  +CKFDKSKIAA+
Sbjct: 1   DHECDPEEQGSCDSGCSGGLMNSAFEYTLKAGGLMREEDYPYTGTDRSTCKFDKSKIAAS 60

Query: 253 VSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           VSNFSVIS DEDQ+AANLVK+GPLA  + ++
Sbjct: 61  VSNFSVISLDEDQIAANLVKNGPLAVAINAV 91


>gi|66823853|ref|XP_645281.1| hypothetical protein DDB_G0272298 [Dictyostelium discoideum AX4]
 gi|60473355|gb|EAL71301.1| hypothetical protein DDB_G0272298 [Dictyostelium discoideum AX4]
          Length = 305

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 96/256 (37%), Positives = 135/256 (52%), Gaps = 33/256 (12%)

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN 117
           K++K Y   +E+  RF +F+ N       R          + ++SDLT  EF  +F    
Sbjct: 3   KYNKHYKNNKEYLKRFDIFQDNYNFILNHRNKNGENIEMDLNEYSDLTQKEFADKFF--- 59

Query: 118 RRLRLPADAQKAPILPTNDL-------------PTDFDWRDHGAVTGVKDQGACGSCWSF 164
              +L  + +  PI   ND+             P  FDWRDHGAV  VK+QG+C SCWSF
Sbjct: 60  --EKLVPEPRSGPI---NDIKATPFKHNVNATIPKSFDWRDHGAVGKVKNQGSCASCWSF 114

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           SA GALEG +++  GEL+ LSEQ LVDC     P+       GC  G M+ AF+YI+ +G
Sbjct: 115 SALGALEGHYYIKYGELLDLSEQNLVDCATPFGPK-------GCKTGWMHDAFKYIISSG 167

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNV--A 281
           GV  E  YPYTG D   CKF++S+  A VS F +I   DE  +   +  +GP+A  +  +
Sbjct: 168 GVNLESQYPYTGKD-EVCKFNQSEKEAKVSGFVMIPKFDESALMEAIALYGPVAVPIDTS 226

Query: 282 SIELPHISFSFLFTVS 297
           + E  H+S    ++ S
Sbjct: 227 TKEFQHLSGGIYYSDS 242


>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
          Length = 351

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 91/247 (36%), Positives = 132/247 (53%), Gaps = 23/247 (9%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLTPSEFRR 111
           FK+   + Y   EE   R  VF+ NL++ +    L      +   G+ +F+D+   EF  
Sbjct: 47  FKTVHERNYGETEEMQ-RKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEVKEFAS 105

Query: 112 QFLG--LNRRLRLPADAQK---APILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
              G  +N R ++         +P +P + LP + DWR  G VT +KDQG CGSCWSFS 
Sbjct: 106 VVNGFRMNNRTKVRDHLHSHYISPAIPVS-LPAEVDWRKEGYVTPIKDQGHCGSCWSFST 164

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TGALEG HF  TG+LVSLSEQ L+DC        +   ++GCNGG+M+ AF+YI    G 
Sbjct: 165 TGALEGQHFRKTGKLVSLSEQNLIDC-------STSYGNNGCNGGVMDYAFQYIKDNDGD 217

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVASIEL 285
           + E  YPY   D G C+F K  + A  + ++ +   DE++M   +   GP++    +I+ 
Sbjct: 218 DTEDSYPYEAAD-GPCRFKKEYVGATDTGYTDLPKGDEEKMKEAVAMVGPVS---VAIDA 273

Query: 286 PHISFSF 292
            H SF  
Sbjct: 274 SHTSFQM 280


>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
 gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
          Length = 330

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 89/246 (36%), Positives = 131/246 (53%), Gaps = 14/246 (5%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           E  +  +K   +K Y T  E   R  +++ NL++ ++      +    +    DLT  EF
Sbjct: 25  EQQWQAWKLFHTKKYTTVTEEGARKAIWRDNLKKIQKHNAEGHSFTLAMNHLGDLTQDEF 84

Query: 110 RRQFLGLNRRLRLPADAQKAPIL-PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           R  + G+          Q +  L P++  +P   DWR  G VT VK+QG CGSCW+FS T
Sbjct: 85  RYFYTGMRSHYSNYTKKQGSAFLAPSHVQVPDTVDWRKEGYVTPVKNQGQCGSCWAFSTT 144

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           G+LEG +F  TG+LVSLSEQ LVDC        +   ++GC GGLM+ AF+YI + GG++
Sbjct: 145 GSLEGQNFKKTGKLVSLSEQNLVDC-------STAYGNNGCQGGLMDYAFKYIKENGGID 197

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELP 286
            E+ YPY   +   C+F KS I A  + F  V   DE+ +       GP++    +I+  
Sbjct: 198 TEESYPYEARN-DRCRFQKSNIGAVDTGFVDVTHGDEEALKTAAGTVGPIS---VAIDAG 253

Query: 287 HISFSF 292
           H+SF F
Sbjct: 254 HMSFQF 259


>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
          Length = 334

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 93/256 (36%), Positives = 135/256 (52%), Gaps = 21/256 (8%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTK 100
           +LL  E H  LFK+   K Y +Q E  +R +++  N  +  +  +L    + +    + K
Sbjct: 21  NLLADEWH--LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILFEKGEKSYQVAMNK 78

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGA 157
           F DL   EFR    G   + +  + A+       P N ++P   DWR+ GA+T VKDQG 
Sbjct: 79  FGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQ 138

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CG CW+FS+TGALEG  F  TG+LVSL EQ L+DC  +   E       GCNGGLM+ AF
Sbjct: 139 CGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYGNE-------GCNGGLMDQAF 191

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPL 276
           +YI    G++ E  YPY   D   C+++     A    F  + S +ED++ A +   GP+
Sbjct: 192 QYIKDNKGIDTENTYPYEAED-DVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPV 250

Query: 277 AGNVASIELPHISFSF 292
           +    +I+  H SF F
Sbjct: 251 S---VAIDASHESFQF 263


>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
          Length = 368

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 80/224 (35%), Positives = 121/224 (54%), Gaps = 13/224 (5%)

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--- 115
           K  K Y    E D RF++FK NL         + T + G+ KF+D+T  E+R  +LG   
Sbjct: 45  KHQKVYNGLREKDQRFQIFKDNLNFIDEHNAQNYTYIVGLNKFADMTNEEYRDMYLGTRS 104

Query: 116 -LNRR-LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
            + RR ++      +      + LP   DWR  GA+T +KDQG+CGSCW+FS    +E  
Sbjct: 105 DIKRRIMKNKITGHRYAYNSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAI 164

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
           + + TG+LVSLSEQ+LVDCD         + + GCNGGLM+ AFE+I+  GG++ ++ YP
Sbjct: 165 NKIVTGKLVSLSEQELVDCDR--------AFNEGCNGGLMDYAFEFIIGNGGIDTDQHYP 216

Query: 234 YTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           Y G +G      K     ++  +  + S+ +      V H P++
Sbjct: 217 YKGFEGRCDPTRKKAKIVSIDGYEDVPSNNENALKKAVAHQPVS 260


>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
          Length = 338

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 92/253 (36%), Positives = 131/253 (51%), Gaps = 23/253 (9%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           +HH++L+K  + + Y  + E   R  +++ NL+      L     +H    G+   +D+T
Sbjct: 33  DHHWNLWKKTYGRQYQEKNEEVARRLIWEKNLKSVMLHNLEYSMGMHSYDLGMNHLADMT 92

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
             E       L   LR+P+  Q      +N    LP   DWR+ G VT VK QGACG+CW
Sbjct: 93  SEEVSS----LMSSLRVPSQWQANVTYKSNSNQKLPDSVDWREKGCVTEVKYQGACGACW 148

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA GALE    L TG LVSLS Q LVD    C  E  G  + GCNGG M  AF+YI+ 
Sbjct: 149 AFSAVGALEAQLKLKTGNLVSLSAQNLVD----CSTERYG--NKGCNGGFMTKAFQYIID 202

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVA 281
             G++ E  YPY   D G+C++D    AA  S ++ +    ED +   +   GP++    
Sbjct: 203 NNGIDSEVSYPYKAMD-GNCRYDSKHRAATCSKYTELPFGSEDALKEAVANKGPVS---V 258

Query: 282 SIELPHISFSFLF 294
           +I+  H SF FL+
Sbjct: 259 AIDAKHSSF-FLY 270


>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
          Length = 443

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 92/234 (39%), Positives = 124/234 (52%), Gaps = 18/234 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + Y T  E   R   F+ NL   +  Q  +P A  G+TKF DL+ +EF  +
Sbjct: 38  FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L          R  A   +      + +P   DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98  YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
            +EG  +L+  ELVSLSEQQLV CD           D+GC+GGLM  AF+++L+   G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCD---------DMDNGCSGGLMLQAFDWLLQNTNGHL 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E  YPY   +G   +   S    + A +    +I S E  MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262


>gi|355751926|gb|EHH56046.1| Cathepsin F, partial [Macaca fascicularis]
          Length = 381

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 98/227 (43%), Positives = 132/227 (58%), Gaps = 14/227 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 84  FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 143

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  N  LR     +        DL P ++DWR  GAVT VKDQG CGSCW+FS TG +
Sbjct: 144 IYL--NPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 201

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I   GG+E E 
Sbjct: 202 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 252

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY Y G    +C F   K    +++   +S +E ++AA L K GP++
Sbjct: 253 DYSYRG-HMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPIS 298


>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
 gi|255645733|gb|ACU23360.1| unknown [Glycine max]
          Length = 362

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 90/247 (36%), Positives = 135/247 (54%), Gaps = 34/247 (13%)

Query: 44  DHLLNAEHHFSLFKS---KFSKTYATQEEHDYRFRVFKANLR-----RAKRRQLLDPTAV 95
           +   + E  F LF++   +  + Y  QEE   RF++F++NLR      AKR+    PT  
Sbjct: 33  EQFASEEEVFQLFQAWQKEHKREYGNQEEKAKRFQIFQSNLRYINEMNAKRK---SPTTQ 89

Query: 96  H--GVTKFSDLTPSEFRRQFLGLNRRLRLP-------ADAQKAPILPTNDLPTDFDWRDH 146
           H  G+ KF+D++P EF + +L   + + +P          QK      ++LP   DWRD 
Sbjct: 90  HRLGLNKFADMSPEEFMKTYL---KEIEMPYSNLESRKKLQKGDDADCDNLPHSVDWRDK 146

Query: 147 GAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDS 206
           GAVT V+DQG C S W+FS TGA+EG + + TG LVSLS QQ+VDCD             
Sbjct: 147 GAVTEVRDQGKCQSHWAFSVTGAIEGINKIVTGNLVSLSVQQVVDCD---------PASH 197

Query: 207 GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQM 266
           GC GG   +AF Y+++ GG++ E  YPYT  + G+CK + +K+  ++ N  V+   E+ +
Sbjct: 198 GCAGGFYFNAFGYVIENGGIDTEAHYPYTAQN-GTCKANANKV-VSIDNLLVVVGPEEAL 255

Query: 267 AANLVKH 273
              + K 
Sbjct: 256 LCRVSKQ 262


>gi|393660044|gb|AFN09033.1| V-Cath [Bombyx mori NPV]
          Length = 323

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 88/245 (35%), Positives = 137/245 (55%), Gaps = 23/245 (9%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           L A ++F  F  +F+K Y+++ E   RF++F+ NL     +   D +A + + KFSDL+ 
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80

Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
            E   ++ GL+    LP   Q   K  +L  P    P +FDWR    VT VK+QG CG+C
Sbjct: 81  DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+F+  G+LE    +   EL++LSEQQ++DCD           D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN---FSVISSDEDQMAANLVKHGPLAG 278
           K GGV+ E DYPY   D  +C+ + +K    V +   + ++  ++ +    LV   P+A 
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246

Query: 279 NVASI 283
           + A I
Sbjct: 247 DAADI 251


>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
 gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
          Length = 346

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 90/259 (34%), Positives = 137/259 (52%), Gaps = 23/259 (8%)

Query: 43  EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTK 100
           ++ L+  + H   + +K  + YA  +E   R+ VFK+N+ R +    +    T    V +
Sbjct: 29  DNELIMQKRHIE-WMTKHGRVYADVKEKSNRYVVFKSNVERIEHLNNIPAGRTFKLAVNQ 87

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPI------LPTNDLPTDFDWRDHGAVTGVKD 154
           F+DLT  EFR  + G      L + +Q          + +  LP   DWR  GAVT +K+
Sbjct: 88  FADLTNDEFRSMYTGFKGVSSLSSQSQTKTTSFRYQNVSSGALPISVDWRTKGAVTPIKN 147

Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
           QG+CG CW+FSA  A+EGA  +  G+L+SLSEQQLVDCD         + D GC GGLM+
Sbjct: 148 QGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD---------TNDFGCEGGLMD 198

Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKH 273
           +AFE+I+  GG+  E +YPY G D  +C   K+   A +++ +  +  +++Q     V H
Sbjct: 199 TAFEHIMATGGLTTESNYPYKGED-ATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAH 257

Query: 274 GPLAGNVASIELPHISFSF 292
            P++     IE     F F
Sbjct: 258 QPVS---VGIEGGGFDFQF 273


>gi|356582227|ref|NP_001239115.1| cathepsin L1 precursor [Canis lupus familiaris]
 gi|62899810|sp|Q9GL24.1|CATL1_CANFA RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
           heavy chain; Contains: RecName: Full=Cathepsin L1 light
           chain; Flags: Precursor
 gi|10185020|emb|CAC08809.1| cathepsin L [Canis lupus familiaris]
          Length = 333

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 91/253 (35%), Positives = 130/253 (51%), Gaps = 19/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
           D  LNA+ +   +K+   + Y   EE  +R  V++ N++  +          HG T    
Sbjct: 22  DQSLNAQWY--QWKATHRRLYGMNEE-GWRRAVWEKNMKMIELHNREYSQGKHGFTMAMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P+    ++P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQVMNGFQNQKHKKGKMFQEPLFA--EIPKSVDWREKGYVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSATGALEG  F  TG+LVSLSEQ LVDC            + GCNGGLM++AF Y
Sbjct: 137 SCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AQGNEGCNGGLMDNAFRY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           +   GG++ E+ YPY G D  +C +     AA  + F  +   E  +   +   GP++  
Sbjct: 190 VKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQREKALMKAVATLGPIS-- 247

Query: 280 VASIELPHISFSF 292
             +I+  H SF F
Sbjct: 248 -VAIDAGHQSFQF 259


>gi|154332645|ref|XP_001562139.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134059587|emb|CAM37169.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 441

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 89/234 (38%), Positives = 122/234 (52%), Gaps = 18/234 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + YAT +E   R   F+ NL   +  Q  +P A  G+TKF DL+  EF  +
Sbjct: 38  FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 97

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L       +  +  +   +      +  P   DWR+ GAVT VKDQG CGSCW+FSA G
Sbjct: 98  YLSGATHFAKAKKFASQYYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
            +E   +L+T  L+SLSEQ+LV CD           D GCNGGLM  AF+++L  + G V
Sbjct: 158 NIESKWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNRNGAV 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
                YPY   +G   +  +S    I A +     I S+ED MAA L  +GP+A
Sbjct: 209 YTGASYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIA 262


>gi|113195461|ref|YP_717598.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
 gi|66968272|gb|AAY59557.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
          Length = 325

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 85/235 (36%), Positives = 129/235 (54%), Gaps = 14/235 (5%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           LL A  +F  F + ++K Y   +E  YR+++FK NL     +  ++  AV  + KFSD++
Sbjct: 20  LLKAPDYFESFVANYNKMYNDTQEKAYRYKIFKHNLEEINIKNQVEDHAVFSINKFSDMS 79

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
            SE   ++ GL+    +  +  +A IL  P N  P +FDWR + AVT V+ QG CGSCW+
Sbjct: 80  KSEIISKYTGLSLPSLMQENFCRAIILDGPPNKAPINFDWRQYNAVTPVRVQGNCGSCWA 139

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS    +E  + +   + +SLS QQLVDCD         + + GC GGL+++A E I+ A
Sbjct: 140 FSTLAGIESQYSIKYNKQISLSVQQLVDCD---------TSNMGCAGGLLHTALEQIINA 190

Query: 224 -GGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPL 276
            GGV +E+DYPY G D   C    +  A  V   +  I  +E+++   L   GP+
Sbjct: 191 GGGVLQEEDYPYKGVD-KQCNLPHNNFAVQVLGCYRYIVMNEEKLKDVLRAVGPI 244


>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
          Length = 423

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 84/223 (37%), Positives = 127/223 (56%), Gaps = 14/223 (6%)

Query: 59  KFSKTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG-- 115
           K  K Y      + RF +FK NLR   +  + ++ +   G+ KF+DL+  E++  FLG  
Sbjct: 13  KHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLGGR 72

Query: 116 -LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAH 174
            +  R    +D  K  +   ++LP   DWR+ GAV  VKDQG CGSCW+FS   A+EG +
Sbjct: 73  MVRDRKGFESDRFKYGV--GDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVEGIN 130

Query: 175 FLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPY 234
            ++TG+L+SLSEQ+LVDCD           + GCNGG M+ AFE+I+K GG++ E DYPY
Sbjct: 131 QIATGDLISLSEQELVDCDK--------GFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPY 182

Query: 235 TGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G DG   +  K+     ++ F  +  ++++     V H P++
Sbjct: 183 KGVDGQCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVS 225


>gi|47199802|emb|CAF88807.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 261

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 93/233 (39%), Positives = 122/233 (52%), Gaps = 21/233 (9%)

Query: 69  EHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGLNRRLRLPA 124
           E  +R  V++ NL++ +   L      H    G+  F D+T  EFR+   G   +   P 
Sbjct: 1   EEGWRRMVWEKNLKKIELHNLEHSMGQHSYRLGMNHFGDMTHEEFRQIMNGYKHK---PQ 57

Query: 125 DAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGEL 181
              +  +    +    P   DWRD G VT VKDQG CGSCW+FS TGALEG HF  TG+L
Sbjct: 58  RKFRGSLFMEPNFLEAPRAVDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRQTGKL 117

Query: 182 VSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           VSLSEQ LVDC     PE     + GCNGGLM+ AF+YI   GG++ E  YPY  TD   
Sbjct: 118 VSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYIKDNGGLDSEASYPYLATDDQP 170

Query: 242 CKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFL 293
           C +D S  +A  + F  V S  E  +   +   GP++    +I+  H SF F 
Sbjct: 171 CHYDPSNNSANETGFVDVPSGSERALMKAVASVGPVS---VAIDAGHESFQFY 220


>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
 gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
 gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
 gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
          Length = 307

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 91/240 (37%), Positives = 131/240 (54%), Gaps = 16/240 (6%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
           + ++  + Y   +E + R+ +FK N+ R +      D     GV KF+DLT  EFR    
Sbjct: 8   WMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMHH 67

Query: 115 GLNRRL-RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
           G  R+  +L + + +   L    +PT  DWR  GAVT VKDQG CG CW+FSA  A+EG 
Sbjct: 68  GYKRQSSKLMSSSFRHENLSA--IPTSMDWRKAGAVTPVKDQGTCGCCWAFSAVAAIEGI 125

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
             L TG+L+SLSEQQLVDCD +         D GC GGLM++AF++IL+ GG+  E  YP
Sbjct: 126 IKLKTGKLISLSEQQLVDCDVK-------GVDQGCGGGLMDNAFQFILRNGGLTSEATYP 178

Query: 234 YTGTDGGSCKFDKS-KIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
           Y G D G+CK  K+  I A ++ +  +  + +      V   P++    ++E     F F
Sbjct: 179 YQGVD-GTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVS---VAVEGGGYDFQF 234


>gi|393717301|gb|AFN21222.1| V-Cath [Bombyx mori NPV]
          Length = 323

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 88/245 (35%), Positives = 137/245 (55%), Gaps = 23/245 (9%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           L A ++F  F  +F+K Y+++ E   RF++F+ NL     +   D +A + + KFSDL+ 
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80

Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
            E   ++ GL+    LP   Q   K  +L  P    P +FDWR    VT VK+QG CG+C
Sbjct: 81  DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+F+  G+LE    +   EL++LSEQQ++DCD           D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN---FSVISSDEDQMAANLVKHGPLAG 278
           K GGV+ E DYPY   D  +C+ + +K    V +   + ++  ++ +    LV   P+A 
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246

Query: 279 NVASI 283
           + A I
Sbjct: 247 DAADI 251


>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
 gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
          Length = 328

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 83/229 (36%), Positives = 129/229 (56%), Gaps = 19/229 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  +  K  K+Y T +E   R+ +F+ N+    +        + G+   +DLT  E++R 
Sbjct: 32  FQNWMVKHQKSY-TNDEFGSRYTIFQDNMDFVTKWNQKGSDTILGLNSMADLTNQEYQRI 90

Query: 113 FLGLNRRLRLPADAQKAPILPTNDL---PTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
           +LG    ++ P       I+   D+   P   DWR +GAVT VK+QG CG C+SFS TG+
Sbjct: 91  YLGTKTTVKKPN-----LIIGVTDVSKAPASVDWRANGAVTAVKNQGQCGGCYSFSTTGS 145

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVER 228
           +EG H +++ +LVSLSEQQ++DC        SGS  ++GC+GGLM ++FEYI+  GG++ 
Sbjct: 146 VEGIHEITSKQLVSLSEQQILDC--------SGSEGNNGCDGGLMTNSFEYIIAVGGLDT 197

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           E  YPY G   G CKF+K+ I A ++ +  + S  +      V   P++
Sbjct: 198 EASYPYEGVV-GKCKFNKANIGATITGYKNVKSGSESDLQTAVAAQPVS 245


>gi|393717160|gb|AFN21082.1| V-Cath [Bombyx mori NPV]
 gi|393717442|gb|AFN21362.1| V-Cath [Bombyx mori NPV]
          Length = 323

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 88/245 (35%), Positives = 137/245 (55%), Gaps = 23/245 (9%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           L A ++F  F  +F+K Y+++ E   RF++F+ NL     +   D +A + + KFSDL+ 
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80

Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
            E   ++ GL+    LP   Q   K  +L  P    P +FDWR    VT VK+QG CG+C
Sbjct: 81  DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+F+  G+LE    +   EL++LSEQQ++DCD           D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN---FSVISSDEDQMAANLVKHGPLAG 278
           K GGV+ E DYPY   D  +C+ + +K    V +   + ++  ++ +    LV   P+A 
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAI 246

Query: 279 NVASI 283
           + A I
Sbjct: 247 DAADI 251


>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
 gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
          Length = 331

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 88/244 (36%), Positives = 130/244 (53%), Gaps = 14/244 (5%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           ++  +KS   K Y  + E   R  +++ NL++         +    +    D+T  E  +
Sbjct: 28  NWKAWKSFHGKEYPNKNEETMRNFIWQNNLKKIVTHNEGKHSFKLAMNHLGDMTSLEISQ 87

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPT--DFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
             LGL  +    +  + A  LP  ++      DWR  G VT VK+QG CGSCW+FS TGA
Sbjct: 88  TLLGLKLKKHAESQPKGATFLPPANVKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTTGA 147

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           LEG HF  TG+LVSLSEQ LVDC  +         ++GC GGLM++AF+YI + GG++ E
Sbjct: 148 LEGQHFRKTGKLVSLSEQNLVDCSGKYG-------NNGCEGGLMDNAFQYIKENGGIDTE 200

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
           K YPY   D G C ++KS I A  + F  + + DE+ +   L   GP++    +I+    
Sbjct: 201 KSYPYLAKD-GVCHYNKSAIGAKDTGFVDIPTGDENALQQALASVGPIS---IAIDASQS 256

Query: 289 SFSF 292
           +F F
Sbjct: 257 TFHF 260


>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 93/251 (37%), Positives = 132/251 (52%), Gaps = 22/251 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQL----LDPTAVHGVTKFSDLT 105
           E  F  FKS F + Y + E   +R  +F+ANL+   R  +     D T    V  F+DL+
Sbjct: 30  EAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLS 89

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTND---LPTDFDWRDHGAVTGVKDQGACGSCW 162
             EFR  F G  R   L A +    +   ND   LP   DW   G VT +K+Q  CGSCW
Sbjct: 90  NEEFRATFNGYRR---LAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCW 146

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA  ++EG H L TG+LVSLSEQ LVDC        +   D GC+GG M+ AF+Y+++
Sbjct: 147 AFSAVASMEGQHALKTGKLVSLSEQNLVDC-------SAAEGDMGCSGGWMDYAFKYVIQ 199

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVA 281
             G++ E  YPY   D  SC+F ++ + A + +F  V + DE  +   +   GP++    
Sbjct: 200 NRGIDTEASYPYKAID-ESCEFKRNSVGATIHSFVDVKTGDESALQNAVASIGPIS---V 255

Query: 282 SIELPHISFSF 292
           +I+    SF F
Sbjct: 256 AIDAAQPSFQF 266


>gi|47224192|emb|CAG13112.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 327

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 92/231 (39%), Positives = 128/231 (55%), Gaps = 15/231 (6%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           E HF  + +  +K Y+ QE H  R ++F  N RR ++    + +   G+ +FSD+T +EF
Sbjct: 26  EQHFKSWMALHNKAYSVQEFHQ-RLQIFTENKRRIEKHNGGNHSFTMGLNQFSDMTFAEF 84

Query: 110 RRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHG-AVTGVKDQGACGSCWSFSAT 167
           R++FL    +      A K   + TN   P   DWR  G  VT VK+QGACGSCW+FS T
Sbjct: 85  RKRFLWSEPQ---NCSATKGSYMKTNSPQPESIDWRTKGNYVTPVKNQGACGSCWTFSTT 141

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           G LE    ++TG+LV LSEQQLVDC  + +       + GCNGGL + AFEYI    G+ 
Sbjct: 142 GCLESVTAINTGKLVPLSEQQLVDCAWDFN-------NHGCNGGLPSQAFEYIKYNKGLM 194

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLA 277
            E  YPYT  + G CK+     AA V N  ++ + DE  M   +  H P++
Sbjct: 195 TESGYPYTAFE-GKCKYKPELAAAFVKNVVNITAYDEKGMEDAVATHNPVS 244


>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
 gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
          Length = 356

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLD-PTAVHGVTKF 101
           +L  A  +F  F   ++K Y +  E + R+ +FK NL    AK     D PTA + + KF
Sbjct: 48  NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKF 107

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACG 159
           SDL+ SE   +F GL+   R+ ++  K  IL  P +  P  FDWR+   VT +K+QGACG
Sbjct: 108 SDLSKSELIAKFTGLSIPERV-SNFCKTIILNQPPDKGPLHFDWREQNKVTSIKNQGACG 166

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           +CW+F+   ++E    +    L+ LSEQQL+DCD         S D GCNGGL+++AFE 
Sbjct: 167 ACWAFATLASVESQFAMRHNRLIDLSEQQLIDCD---------SVDMGCNGGLLHTAFEE 217

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSK--IAAAVSNFSVISSDEDQMAANLVKHGPL 276
           I++ GGV+ E DYP+ G +   C  D+ +  + + V  +  +  +E+++   L   GP+
Sbjct: 218 IMRMGGVQTELDYPFVGRN-RRCGLDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPI 275


>gi|401430387|ref|XP_003886572.1| unnamed protein product, partial [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|356491640|emb|CBZ40951.1| unnamed protein product, partial [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 332

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 18/234 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + Y T  E   R   F+ NL   +  Q  +P A  G+TKF DL+ +EF  +
Sbjct: 38  FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L          R  A   +      + +P   DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98  YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
            +EG  +L+  ELVSLSEQQLV CD   D         GC+GGLM  AF+++L+   G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCSGGLMLQAFDWLLQNTNGHL 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E  YPY   +G   +   S    + A +    +I S E  MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262


>gi|301789679|ref|XP_002930256.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
 gi|281343339|gb|EFB18923.1| hypothetical protein PANDA_020645 [Ailuropoda melanoleuca]
          Length = 334

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 91/245 (37%), Positives = 126/245 (51%), Gaps = 17/245 (6%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPS 107
            +  +K+   + Y   EE  +R  V++ N++             HG T     F D+T  
Sbjct: 28  QWYQWKATHRRLYGMNEE-GWRRAVWEKNMKMIDLHNREYSQGQHGFTMAMNAFGDMTNE 86

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           EFR+   G   +        + P+    ++P   DW   G VT VK+QG CGSCW+FSAT
Sbjct: 87  EFRQVMNGFRNQKPRKGKVFQEPLFA--EIPKSVDWTLKGYVTPVKNQGQCGSCWAFSAT 144

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           GALEG  F  TG+LVSLSEQ LVDC      E       GCNGGLM++AF+Y+ + GG++
Sbjct: 145 GALEGQMFRKTGKLVSLSEQNLVDCSRSQGNE-------GCNGGLMDNAFQYVKENGGLD 197

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPH 287
            E+ YPY GTD  SCK+     AA  + F  I   E  +   +   GP++    +I+  H
Sbjct: 198 SEESYPYLGTDTDSCKYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAIDAGH 254

Query: 288 ISFSF 292
            SF F
Sbjct: 255 QSFQF 259


>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
          Length = 344

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 84/228 (36%), Positives = 129/228 (56%), Gaps = 16/228 (7%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEF---R 110
           + S++ K Y   +E + RF++FK N+   +     D T  +  G+ +F+DLT  EF   R
Sbjct: 42  WMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKLGINQFADLTNEEFIASR 101

Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +F G      +   + K      + +P+  DWR  GAVT VK+QG CG CW+FSA  A 
Sbjct: 102 NKFKGHMCSSIMRTTSFKYE--NVSGIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAAT 159

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG H LSTG+L+SLSEQ+LVDCD       +   D GC GGLM+ AF++I++  G+  E 
Sbjct: 160 EGIHKLSTGKLISLSEQELVDCD-------TKGVDQGCEGGLMDDAFKFIIQNHGLSTEA 212

Query: 231 DYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
            YPY G D G+C  +K+ + A  ++ +  + ++ +Q     V + P++
Sbjct: 213 QYPYEGVD-GTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPIS 259


>gi|351724281|ref|NP_001237820.1| cysteine protease-like precursor [Glycine max]
 gi|149393486|gb|ABR26679.1| putative cysteine protease [Glycine max]
          Length = 355

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 93/225 (41%), Positives = 124/225 (55%), Gaps = 17/225 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
           F+ F S+F K+Y ++EE   R+ +F  NLR  R+  +  L  T    V  F+D T  EF+
Sbjct: 55  FARFMSRFGKSYRSEEEMRERYEIFSQNLRFIRSHNKNRLPYTL--SVNHFADWTWEEFK 112

Query: 111 RQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
           R  LG  +      +      L    LP   DWR  G V+ VKDQG+CGSCW+FS TGAL
Sbjct: 113 RHRLGAAQNCSATLNGNHK--LTDAVLPPTKDWRKEGIVSDVKDQGSCGSCWTFSTTGAL 170

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           E A   + G+ +SLSEQQLVDC    +       + GCNGGL + AFEYI   GG+E E+
Sbjct: 171 EAACAQAFGKSISLSEQQLVDCAGRFN-------NFGCNGGLPSQAFEYIKYNGGLETEE 223

Query: 231 DYPYTGTDGGSCKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
            YPYTG D G CKF    +A  V    N ++ + +E + A   V+
Sbjct: 224 AYPYTGKD-GVCKFSAENVAVQVIDSVNITLGAENELKHAVAFVR 267


>gi|86355549|ref|YP_473217.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
 gi|86198154|dbj|BAE72318.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
          Length = 324

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 84/236 (35%), Positives = 130/236 (55%), Gaps = 18/236 (7%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           LL A  +F  F  KF+K Y+++ E   RF++F+ NL     +   D TA + + KFSDL+
Sbjct: 21  LLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDTTAQYEINKFSDLS 80

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL     LP   Q   +  +L  P +  P +FDWR    VT VK+QG CG+
Sbjct: 81  KDETISKYTGL----ALPLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGA 136

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+   +LE    +   +L++LSEQQL+DCD+          D+GCNGGL+++A+E +
Sbjct: 137 CWAFATLASLESQFAIKHNQLINLSEQQLIDCDY---------VDAGCNGGLLHTAYEAV 187

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
           ++ GGV+ E DYPY G+DG         +      +  I+  E+++   L   GP+
Sbjct: 188 MQMGGVQAENDYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPI 243


>gi|195428245|ref|XP_002062184.1| GK16790 [Drosophila willistoni]
 gi|194158269|gb|EDW73170.1| GK16790 [Drosophila willistoni]
          Length = 549

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 97/258 (37%), Positives = 137/258 (53%), Gaps = 26/258 (10%)

Query: 43  EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFS 102
           + H+ NA HHF   K K    Y + +EH++R  +F+ NLR    +     T    V   +
Sbjct: 238 DSHVDNAFHHF---KRKHGVAYRSDKEHEHRKNIFRQNLRYIHSKNRAKLTYTLAVNHLA 294

Query: 103 DLTPSEF--RRQFLG---LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
           D T  E   RR +      N     P D  K     T+D+P+ +DWR +GAVT VKDQ  
Sbjct: 295 DKTEEELKARRGYKSSGVYNTGKPFPYDVNKY----TDDIPSQYDWRLYGAVTPVKDQSV 350

Query: 158 CGSCWSFSATGALEGAHFLST-GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
           CGSCWSF   G LEGA FL   G LV LS+Q L+DC         G  ++GC+GG     
Sbjct: 351 CGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDC-------SWGFGNNGCDGGEDFRV 403

Query: 217 FEYILKAGGVEREKDY-PYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHG 274
           ++++LK+GGV  E++Y PY G D G C  +   + A ++ F +V S+D +     L+KHG
Sbjct: 404 YQWMLKSGGVPTEEEYGPYLGQD-GYCHVNNVTLVAPITGFVNVTSNDPNAFKIALLKHG 462

Query: 275 PLAGNVASIELPHISFSF 292
           PL+    +I+    +FSF
Sbjct: 463 PLS---VAIDASPKTFSF 477


>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
          Length = 337

 Score =  153 bits (386), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 96/250 (38%), Positives = 134/250 (53%), Gaps = 20/250 (8%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           + H++L+KS  SK Y  +EE  +R  V++ NL++ +   L      H    G+  F D+T
Sbjct: 26  DQHWNLWKSWHSKNYHQREE-GWRRLVWEKNLKKIELHNLEHSMGKHSYRLGMNHFGDMT 84

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWS 163
             EF++   G   +       + +  L  N L  P   DWR+ G VT VKDQG CGSCW+
Sbjct: 85  HEEFKQIMNGYKHKAE--RKFKGSLFLEPNFLEAPRSVDWREKGYVTPVKDQGECGSCWA 142

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TGALEG  F  TG+LVSLS Q LV+C     PE     + GCNGGLM+ AF+Y+   
Sbjct: 143 FSTTGALEGQEFTRTGKLVSLSGQNLVECSR---PE----GNEGCNGGLMDQAFQYVKDN 195

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVAS 282
            G++ E  YPY GTD   C +D    AA  + F  + S +E  +   +   GP++    +
Sbjct: 196 QGLDSEDSYPYLGTDDQPCHYDPKFSAANDTGFVDIPSGNERALMKAVASVGPVS---VA 252

Query: 283 IELPHISFSF 292
           I+  H SF F
Sbjct: 253 IDAGHESFQF 262


>gi|401416322|ref|XP_003872656.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322488880|emb|CBZ24130.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 366

 Score =  153 bits (386), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 18/234 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + Y T  E   R   F+ NL   +  Q  +P A  G+TKF DL+ +EF  +
Sbjct: 38  FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L          R  A   +      + +P   DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98  YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
            +EG  +L+  ELVSLSEQQLV CD   D         GC+GGLM  AF+++L+   G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCSGGLMLQAFDWLLQNTNGHL 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E  YPY   +G   +   S    + A +    +I S E  MAA L K+GP+A
Sbjct: 209 YTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262


>gi|9630927|ref|NP_047524.1| Cystein Protease [Bombyx mori NPV]
 gi|1168798|sp|P41721.1|CATV_NPVBM RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|540066|gb|AAB49542.1| cysteine protease [Bombyx mori NPV]
 gi|3745946|gb|AAC63793.1| Cystein Protease [Bombyx mori NPV]
          Length = 323

 Score =  153 bits (386), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 87/236 (36%), Positives = 133/236 (56%), Gaps = 21/236 (8%)

Query: 47  LNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTP 106
           L A ++F  F  +F+K Y+++ E   RF++F+ NL     +   D +A + + KFSDL+ 
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLSK 80

Query: 107 SEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGSC 161
            E   ++ GL+    LP   Q   K  +L  P    P +FDWR    VT VK+QG CG+C
Sbjct: 81  DETIAKYTGLS----LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGAC 136

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+F+  G+LE    +   EL++LSEQQ++DCD           D+GCNGGL+++AFE I+
Sbjct: 137 WAFATLGSLESQFAIKHNELINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAII 187

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
           K GGV+ E DYPY   D  +C+ + +K    V + +  I   E+++   L   GP+
Sbjct: 188 KMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLPLVGPI 242


>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
          Length = 462

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 87/247 (35%), Positives = 132/247 (53%), Gaps = 22/247 (8%)

Query: 39  GEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-- 96
           GE+SE+ +      ++ + ++   TY    E + RF  F+ NLR   +        VH  
Sbjct: 31  GERSEEEV---RRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSF 87

Query: 97  --GVTKFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
             G+ +F+DLT  E+R  +LG     +R  +L A  Q A     ++LP   DWR  GAV 
Sbjct: 88  RLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAAD---NDELPESVDWRKKGAVG 144

Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
            VKDQG CGSCW+FSA  A+EG + + TG+++ LSEQ+LVDCD         S + GCNG
Sbjct: 145 AVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT--------SYNQGCNG 196

Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANL 270
           GLM+ AFE+I+  GG++ E+DYPY   D       K+     +  +  +  + ++     
Sbjct: 197 GLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKA 256

Query: 271 VKHGPLA 277
           V + P++
Sbjct: 257 VANQPIS 263


>gi|21483190|gb|AAL14223.1| cathepsin L [Dictyocaulus viviparus]
          Length = 347

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 98/247 (39%), Positives = 131/247 (53%), Gaps = 24/247 (9%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           +K K+ K Y  +EE+DY    F  N+          +L   T   G+   +DL  SE+R+
Sbjct: 43  YKIKYDKHYDPEEENDY-MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRK 101

Query: 112 QFLGLNRRLRLPADAQKAP----ILPTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
             L   R  RL  D+ +      ++P N   P   DWR+H  VT VK+QG CGSCW+FSA
Sbjct: 102 --LNGYRHRRLFGDSMRKNGTKFLVPFNVKAPDSVDWREHNLVTPVKNQGMCGSCWAFSA 159

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TGALEG HF +TG+LVSLSEQ LVDC        +   + GCNGGLM+ AFEYI    G+
Sbjct: 160 TGALEGQHFRATGKLVSLSEQNLVDC-------STKYGNHGCNGGLMDLAFEYIKDNHGI 212

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIEL 285
           + E+ YPY G +   C F K  I A    F  +   DED +   +   GP++    +I+ 
Sbjct: 213 DTEEGYPYVGKE-MRCHFKKRDIGAEDRGFVDLPEGDEDALKVAVATQGPIS---IAIDA 268

Query: 286 PHISFSF 292
            H SF  
Sbjct: 269 GHRSFQL 275


>gi|13124026|sp|Q9WGE0.1|CATV_NPVHC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|4884631|gb|AAD31760.1|AF120926_1 cysteine proteinase [Hyphantria cunea nucleopolyhedrovirus]
          Length = 324

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 84/236 (35%), Positives = 130/236 (55%), Gaps = 18/236 (7%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           LL A  +F  F  KF+K Y+++ E   RF++F+ NL     +   D TA + + KFSDL+
Sbjct: 21  LLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDTTAQYEINKFSDLS 80

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL     LP   Q   +  +L  P +  P +FDWR    VT VK+QG CG+
Sbjct: 81  KDETISKYTGL----ALPLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGA 136

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+   +LE    +   +L++LSEQQL+DCD+          D+GCNGGL+++A+E +
Sbjct: 137 CWAFATLASLESQFAIKHNQLINLSEQQLIDCDY---------VDAGCNGGLLHTAYEAV 187

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
           ++ GGV+ E DYPY G+DG         +      +  I+  E+++   L   GP+
Sbjct: 188 MQMGGVQAENDYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPI 243


>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/243 (36%), Positives = 133/243 (54%), Gaps = 22/243 (9%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
           + +++ + Y   +E   R+++FK N+ R +   + +D +    + +F+DLT  EFR    
Sbjct: 42  WMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRAS-- 99

Query: 115 GLNRRLRLPA-----DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
               R R  A     +A          +P+  DWR  GAVT +KDQG CGSCW+FSA  A
Sbjct: 100 ----RNRFKAHICSTEATSFKYEHVAAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAA 155

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG   LSTG+L+SLSEQ+LVDCD       SG  D GCNGGLM+ AF++I +  G+  E
Sbjct: 156 MEGITQLSTGKLISLSEQELVDCD------TSGE-DQGCNGGLMDDAFKFIEQNHGLATE 208

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHIS 289
            +YPY GTDG   +   +  AA ++ +  + ++ ++     V H P+A    +I+     
Sbjct: 209 ANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIA---VAIDAGGFE 265

Query: 290 FSF 292
           F F
Sbjct: 266 FQF 268


>gi|255586666|ref|XP_002533962.1| cysteine protease, putative [Ricinus communis]
 gi|223526059|gb|EEF28418.1| cysteine protease, putative [Ricinus communis]
          Length = 417

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 84/231 (36%), Positives = 131/231 (56%), Gaps = 26/231 (11%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSE 108
           F  +K K  K Y   EE + R   F+ NL+    + ++++ L      G+ KF+D++  E
Sbjct: 49  FQQWKEKHRKVYKHVEEAEKRLENFRRNLKYVVEKNQKKKNLGSAHTVGLNKFADMSNVE 108

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTND-------LPTDFDWRDHGAVTGVKDQGACGSC 161
           FR+++L    +++ P   +   ++ +          P+  DWR  G VT VKDQG CGSC
Sbjct: 109 FRQKYLS---KVKKPIKKRNNNLMTSRQRNLQSCVAPSSLDWRKKGVVTPVKDQGDCGSC 165

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS+TGA+EG + + TG+LVSLSEQ+L+DCD         + + GC+GG M+ AFE+++
Sbjct: 166 WAFSSTGAIEGINAIVTGDLVSLSEQELMDCD---------TTNYGCDGGYMDYAFEWVI 216

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDK--SKIAAAVSNFSVISSDEDQMAANL 270
             GG++ E DYPYTG D G+C   K  +K+ +      V  SD   + A +
Sbjct: 217 NNGGIDTEIDYPYTGVD-GTCNIAKEETKVVSVDGYEDVAESDSALLCATV 266


>gi|94420703|gb|ABF18679.1| cysteine protease [Medicago sativa]
          Length = 350

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 102/274 (37%), Positives = 144/274 (52%), Gaps = 24/274 (8%)

Query: 8   SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHH---FSLFKSKFSKTY 64
           +LL++      A+A     D   IR V  SD E+    ++    H   F+ F +++ K Y
Sbjct: 5   TLLIVFFCVATAAAGLSFHDSNPIRMV--SDMEKQLLQVIGESRHAVSFARFANRYGKRY 62

Query: 65  ATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL--NRRLRL 122
            T +E   RF++F  NL+  +           GV  F+D T  EFR   LG   N    L
Sbjct: 63  DTVDEMKRRFKIFSENLQLIESTNKKRLGYTLGVNHFADWTWEEFRSHRLGAAQNCSATL 122

Query: 123 PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
             + +   ++    LP + DWR  G V+ VKDQG CGSCW+FS TGALE A+  + G+ +
Sbjct: 123 KGNHRITDVV----LPAEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAFGKNI 178

Query: 183 SLSEQQLVDCDHECDPEESGSCDS-GCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGS 241
           SLSEQQLVDC        +G+ ++ GCNGGL + AFEYI   GG+E E+ YPYTG + G 
Sbjct: 179 SLSEQQLVDC--------AGAFNNFGCNGGLPSQAFEYIKYNGGLETEEAYPYTGQN-GP 229

Query: 242 CKFDKSKIAAAV---SNFSVISSDEDQMAANLVK 272
           CKF    +A  V    N ++ + DE + A    +
Sbjct: 230 CKFTSEDVAVQVLGSVNITLGAEDELKHAVAFAR 263


>gi|2351557|gb|AAB68595.1| cathepsin [Choristoneura fumiferana MNPV]
          Length = 324

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 83/237 (35%), Positives = 133/237 (56%), Gaps = 20/237 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           LL A  +F  F   F+K Y+++ E  +RF++F+ NL     + L D +A + + KFSDL+
Sbjct: 21  LLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDTSAQYEINKFSDLS 80

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL+    LP   Q   +  +L  P +  P +FDWR    VT VK+QG CG+
Sbjct: 81  KDETISKYTGLS----LPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGA 136

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+  G+LE    +   +L++LSEQQL+DCD           D GC+GGL+++A+E +
Sbjct: 137 CWAFATLGSLESQFAIKHDQLINLSEQQLIDCDF---------VDMGCDGGLLHTAYEAV 187

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
           +  GG++ E DYPY   + G C+ + +K    V   +  I+  E+++   L   GP+
Sbjct: 188 MNMGGIQAENDYPYEANN-GDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPI 243


>gi|209170907|ref|YP_002268053.1| agip23 [Agrotis ipsilon multiple nucleopolyhedrovirus]
 gi|208436498|gb|ACI28725.1| viral cathepsin [Agrotis ipsilon multiple nucleopolyhedrovirus]
          Length = 364

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 92/287 (32%), Positives = 160/287 (55%), Gaps = 24/287 (8%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
           +I++  LL LL  ++++A+   +D      + P+       ++ +A  +F  F S+++K 
Sbjct: 25  IIMNKSLLFLL--LVSTALTRQNDAVHTPTIKPT-----LYNINSAPLYFEKFISQYNKH 77

Query: 64  YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
           Y  ++E  YR+ +F+ N+     +   + +AV+ + +F+D+T +E   +  GL     L 
Sbjct: 78  YKNEDEKKYRYNIFRHNIESINHKNSRNDSAVYKINRFADMTKNEVVIRHTGLASG-ELG 136

Query: 124 ADAQKAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGE 180
            +  +  ++        PT FDWR    VT VKDQG CG+CW+F+  GALE  + +    
Sbjct: 137 VNFCETIVVDGPGQRQRPTSFDWRTLNKVTSVKDQGMCGACWAFAGLGALESQYAIKYDR 196

Query: 181 LVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGG 240
           L+ LSEQQLVDCDH          D GC+GGL+++A+E I++ GGVE++ DYPY   +  
Sbjct: 197 LIDLSEQQLVDCDH---------VDMGCDGGLIHTAYEEIMRMGGVEQDFDYPYRA-ERQ 246

Query: 241 SCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKH-GPLAGNVASIEL 285
            C     K AA V S +  +  +E+++  +L++H GP+A  V ++++
Sbjct: 247 PCALKPHKFAAGVRSCYRYVLLNEERL-EDLLRHVGPIAIAVDAVDI 292


>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
          Length = 388

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 95/248 (38%), Positives = 130/248 (52%), Gaps = 19/248 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+ L+K+   K+Y   EE  +R  V++ NL+  +   L     +H    G+ +F DLT  
Sbjct: 78  HWELWKNWHQKSYHKAEE-GWRRMVWEENLKVIELHNLEQSLGLHTYQLGMNQFGDLTNE 136

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EF+ Q L   R          +  L  N   +PT  DWRDHG VT VK+QG CGSCW+FS
Sbjct: 137 EFQ-QMLISERHFSEGNRINGSAFLEVNYVQVPTSVDWRDHGYVTPVKNQGHCGSCWAFS 195

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGALEG  F  +G LVSLSEQ LVDC  +         + GCNGG+++ AF+YIL+  G
Sbjct: 196 TTGALEGQLFRKSGRLVSLSEQNLVDCSWQ-------QGNQGCNGGIVDFAFQYILENRG 248

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISS-DEDQMAANLVKHGPLAGNVASIE 284
           ++ E  YPYT  D   C F      A V+ F  I    E+ +   +   GP++    +I+
Sbjct: 249 IDSEDCYPYTAKDTAQCAFKPECATARVTGFVDIPPHSEEALMKAVATVGPVS---VAID 305

Query: 285 LPHISFSF 292
               SF F
Sbjct: 306 AHPTSFRF 313


>gi|300123574|emb|CBK24846.2| unnamed protein product [Blastocystis hominis]
          Length = 305

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 92/241 (38%), Positives = 131/241 (54%), Gaps = 24/241 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           FS F++++ K Y    E  +R +VF+ N+  A++    +     G+T F+D+T +EF   
Sbjct: 21  FSAFEARYGKNY-LPAERAFRAKVFEYNMEWARKMNAQNHPYTVGMTPFADMTNTEFANS 79

Query: 113 FLG---LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
            L    L  ++  PA     PI+   D     DWR+ GAVT VK+Q +CGSCW+FSATGA
Sbjct: 80  KLCGCMLKPKMTKPA----TPIMQRAD--ETVDWREKGAVTPVKNQASCGSCWAFSATGA 133

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG +F++ GEL+SLSEQQLVDCDH+          SGC GG M  AFEY +K  G+ +E
Sbjct: 134 MEGRNFVANGELISLSEQQLVDCDHQ---------SSGCGGGWMTYAFEYAMKK-GMCKE 183

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHIS 289
           +DYPY   D   CK DK         +  +   +       V  GP++    ++E   I 
Sbjct: 184 EDYPYHAVD-EDCKDDKCTPVVFPKGYEEVPMYDGAALKQAVSQGPVS---VAVEADSIV 239

Query: 290 F 290
           F
Sbjct: 240 F 240


>gi|402892718|ref|XP_003909556.1| PREDICTED: cathepsin F [Papio anubis]
          Length = 460

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 98/227 (43%), Positives = 132/227 (58%), Gaps = 14/227 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 163 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 222

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  N  LR     +        DL P ++DWR  GAVT VKDQG CGSCW+FS TG +
Sbjct: 223 IYL--NPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 280

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I   GG+E E 
Sbjct: 281 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 331

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY Y G    +C F   K    +++   +S +E ++AA L K GP++
Sbjct: 332 DYSYRG-HMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPIS 377


>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
          Length = 333

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 97/254 (38%), Positives = 138/254 (54%), Gaps = 22/254 (8%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
           D  LNA  H+  +K+K  K Y  +EE  +R  V++ N++  +          HG T    
Sbjct: 22  DGSLNA--HWYRWKAKHRKLYGMREE-GWRRAVWEKNMKMIEVHNQEYSQGKHGFTMAMN 78

Query: 100 KFSDLTPSEFRRQFLGL-NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGAC 158
            F D+T  EFR+   G  N++ +     Q+   L   ++P   DWR+ G VT VK+QG C
Sbjct: 79  AFGDMTNEEFRQVMNGFRNQKHKKGKVFQEPSFL---EVPKSVDWREKGYVTPVKNQGQC 135

Query: 159 GSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFE 218
           GSCW+FSATGALEG  F  TG+L+SLSEQ LVDC     P+     + GC+GGLM+ AF+
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSR---PQ----GNEGCDGGLMDYAFQ 188

Query: 219 YILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAG 278
           YI + GG++ E+ YPY   D  SCK+      A  + F  I  +E  +   +   GP++ 
Sbjct: 189 YIKENGGLDSEESYPYDAMD-ESCKYRPEYSVANDTGFVDIPKEEKALMKAVATVGPIS- 246

Query: 279 NVASIELPHISFSF 292
              +I+  H SF F
Sbjct: 247 --VAIDAGHESFQF 258


>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 93/263 (35%), Positives = 131/263 (49%), Gaps = 16/263 (6%)

Query: 31  IRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL 90
           I +V+  +  ++E  L+  E H   + +K+ K Y    E + RF +FK N+   +     
Sbjct: 22  ISRVISRELHETETSLI--ERH-EQWMAKYDKVYKDAAEKEKRFLIFKDNVEFIESFNAA 78

Query: 91  DPTAVH-GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAV 149
                  GV   +DLT  EF+    GL R                  +P   DWR  GAV
Sbjct: 79  GNKPYKLGVNHLADLTIEEFKASRNGLKRSYDYEVGTTSFKYENVTAIPASVDWRKKGAV 138

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
           T +KDQG CGSCW+FS   A EG H +STG+LVSLSEQ+LVDCD +         D GC 
Sbjct: 139 TPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRK-------GTDQGCE 191

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAAN 269
           GG M   FE+I+K GG+  E +YPY   D GSCK + +  AA +  +  +  + ++    
Sbjct: 192 GGYMEDGFEFIIKNGGITTEANYPYKAVD-GSCK-NATAPAAQIKGYEKVPVNSEKALLK 249

Query: 270 LVKHGPLAGNVASIELPHISFSF 292
            V + P++    SI+    SF F
Sbjct: 250 AVANQPVS---VSIDAADGSFMF 269


>gi|355566270|gb|EHH22649.1| Cathepsin F [Macaca mulatta]
          Length = 484

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 98/227 (43%), Positives = 133/227 (58%), Gaps = 14/227 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
           F  F   +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246

Query: 112 QFLGLNRRLRL-PADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  N  LR  P +  K      +  P ++DWR  GAVT VKDQG CGSCW+FS TG +
Sbjct: 247 IYL--NPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 304

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I   GG+E E 
Sbjct: 305 EGQWFLNQGTLLSLSEQELLDCDK---------MDKACMGGLPSNAYSAIKNLGGLETED 355

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY Y G    +C F   K    +++   +S +E ++AA L K GP++
Sbjct: 356 DYSYRG-HMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPIS 401


>gi|401419663|ref|XP_003874321.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|1706259|sp|P35591.2|CYSP1_LEIPI RecName: Full=Cysteine proteinase 1; AltName: Full=Amastigote
           cysteine proteinase A-1; Flags: Precursor
 gi|1220383|gb|AAA91859.1| cysteine proteinase [Leishmania pifanoi]
 gi|322490556|emb|CBZ25817.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 354

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 101/275 (36%), Positives = 143/275 (52%), Gaps = 27/275 (9%)

Query: 12  LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHD 71
           LL + V+     V    A+I Q  P       D+ + A  H+  FK +  K +    E  
Sbjct: 7   LLFAIVVTILFVVCYGSALIAQTPPP-----VDNFV-ASAHYGSFKKRHGKAFGGDAEEG 60

Query: 72  YRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
           +RF  FK N++ A      +P A + V+ KF+DLTP EF + +L  +   R   D  K  
Sbjct: 61  HRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKLYLNPDYYARHLKD-HKED 119

Query: 131 ILPTNDLPT---DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
           +   +  P+     DWRD GAVT VK+QG CGSCW+FSA G +EG    S   LVSLSEQ
Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179

Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPYTGTDGGSCK-- 243
            LV CD         + D GCNGGLM+ A  +I+++  G V  E  YPY  T GG  +  
Sbjct: 180 MLVSCD---------NIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPY--TSGGGTRPP 228

Query: 244 -FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             D+ ++ A ++ F  +  DE+++A  + K GP+A
Sbjct: 229 CHDEGEVGAKITGFLSLPHDEERIAEWVEKRGPVA 263


>gi|354502593|ref|XP_003513368.1| PREDICTED: cathepsin L1-like isoform 2 [Cricetulus griseus]
          Length = 330

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 91/240 (37%), Positives = 128/240 (53%), Gaps = 17/240 (7%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG---- 97
           + D  L+AE H   +K++  KTY   EE   R  V++ N +  +          HG    
Sbjct: 20  THDPSLDAEWH--EWKTQHGKTYVMDEEGQKR-AVWENNRKMIELHNEDYTKGKHGFHLE 76

Query: 98  VTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
           +  F DLT +EFR+   G         +  + P L   D+P   DWR HG VT VKDQG+
Sbjct: 77  MNAFGDLTNTEFRQLMTGFQSMGTTEMNVFQEPRL--GDVPKSVDWRKHGYVTPVKDQGS 134

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           C SCW+FSA G+LEG  F  TG+LV LSEQ LVDC            ++GC+GGL  SAF
Sbjct: 135 CVSCWAFSAVGSLEGQMFRKTGKLVPLSEQNLVDCSRS-------QHNNGCHGGLFTSAF 187

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +YI   GG++  + YPY   D G C++D    AA ++ F V+ S+E+ +   +   GP++
Sbjct: 188 QYIKDNGGLDTSESYPYEAQD-GPCRYDPKHSAANITGFVVVPSNEEALMKAVATVGPIS 246


>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
          Length = 337

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/249 (35%), Positives = 141/249 (56%), Gaps = 19/249 (7%)

Query: 41  QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
           +S+  L  +E H  L+ S+  + Y  + E   RF +FK N++  +   +  + +   G+ 
Sbjct: 28  RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLP-ADAQKAPI--LPTNDLPTDFDWRDHGAVTGVKDQG 156
           +F+D+T  EF  +F GLN    +P +    +PI  L  +D+P++ DWR+ GAVT VK+QG
Sbjct: 87  EFADITSQEFLAKFTGLN----IPNSYLSPSPINDLSDDDMPSNLDWRESGAVTQVKNQG 142

Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
            CG CW+FSA G+LEGA+ ++TG L+  SEQ+L+DC          + + GCNGG M +A
Sbjct: 143 QCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT---------TNNYGCNGGFMTNA 193

Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
           F++I + GG+ RE DY Y G    +C+  +   A  +S++ V+   E  +   + K    
Sbjct: 194 FDFIKENGGISRESDYEYLGQQ-YTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQPVS 252

Query: 277 AGNVASIEL 285
            G  AS +L
Sbjct: 253 IGIAASQDL 261


>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
 gi|255636658|gb|ACU18666.1| unknown [Glycine max]
          Length = 367

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/278 (31%), Positives = 153/278 (55%), Gaps = 20/278 (7%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMI---RQVVPSDGEQSEDHLLNAEHHFSLFKSKFSK 62
           L + +L++L +VLA + A+  D ++I   R      G +S++ +++    + +   K  K
Sbjct: 7   LMATILIVLFTVLAVSSAL--DMSIISYDRSHADKSGWKSDEEVMSIYEEWLV---KHGK 61

Query: 63  TYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLN---RR 119
            Y   EE + RF++FK NL   +    ++ T   G+ +FSDL+  E+R ++LG      R
Sbjct: 62  VYNAVEEKEKRFQIFKDNLNFIEEHNAVNRTYKVGLNRFSDLSNEEYRSKYLGTKIDPSR 121

Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
           +      + +P +  N LP   DWR  GAV  VK+Q  C  CW+FSA  A+EG + + TG
Sbjct: 122 MMARPSRRYSPRVADN-LPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTG 180

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
            L +LSEQ+L+DCD         + ++GC+GGL++ AFE+I+  GG++ E+DYP+ G DG
Sbjct: 181 NLTALSEQELLDCDR--------TVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADG 232

Query: 240 GSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
              ++  +  A  +  +  + + ++      V + P++
Sbjct: 233 ICDQYKINARAVTIDGYERVPAYDELALKKAVANQPVS 270


>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
          Length = 459

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 80/192 (41%), Positives = 113/192 (58%), Gaps = 13/192 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           +  +  K  K Y    E   RF +FK NLR    R   + +   G+ +F+DLT  E+R  
Sbjct: 43  YETWLVKHGKNYNGLGEKQLRFNIFKDNLRFVDERNSENLSFKLGLNRFADLTNEEYRSV 102

Query: 113 FLGLNRRLRLPADAQKA-----PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
           +LG   R    A + ++          + LP   DWR  GAV G+KDQG+CGSCW+FSA 
Sbjct: 103 YLGTRPRSVAVARSGRSKSDRYAFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAI 162

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
            A+EG + + TG+L+SLSEQ+LV+CD         S + GC+GGLM+ AFE+I+K  G++
Sbjct: 163 AAVEGVNQIVTGDLISLSEQELVECDT--------SYNDGCDGGLMDYAFEFIIKNEGID 214

Query: 228 REKDYPYTGTDG 239
            ++DYPYTG DG
Sbjct: 215 SDEDYPYTGRDG 226


>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
          Length = 337

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/249 (35%), Positives = 141/249 (56%), Gaps = 19/249 (7%)

Query: 41  QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
           +S+  L  +E H  L+ S+  + Y  + E   RF +FK N++  +   +  + +   G+ 
Sbjct: 28  RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLP-ADAQKAPI--LPTNDLPTDFDWRDHGAVTGVKDQG 156
           +F+D+T  EF  +F GLN    +P +    +PI  L  +D+P++ DWR+ GAVT VK+QG
Sbjct: 87  EFADITSQEFLAKFTGLN----IPNSYLSPSPINDLSDDDMPSNLDWRESGAVTQVKNQG 142

Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
            CG CW+FSA G+LEGA+ ++TG L+  SEQ+L+DC          + + GCNGG M +A
Sbjct: 143 QCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT---------TNNYGCNGGFMTNA 193

Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPL 276
           F++I + GG+ RE DY Y G    +C+  +   A  +S++ V+   E  +   + K    
Sbjct: 194 FDFIKENGGISRESDYEYLGQQ-YTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQPVS 252

Query: 277 AGNVASIEL 285
            G  AS +L
Sbjct: 253 IGIAASQDL 261


>gi|9630063|ref|NP_046281.1| cathepsin [Orgyia pseudotsugata MNPV]
 gi|2499880|sp|O10364.1|CATV_NPVOP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|7435821|pir||T10394 cathepsin - Orgyia pseudotsugata nuclear polyhedrosis virus
 gi|1911371|gb|AAC59124.1| cathepsin [Orgyia pseudotsugata MNPV]
          Length = 324

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 86/237 (36%), Positives = 132/237 (55%), Gaps = 20/237 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           LL A ++F  F  KF+K Y+++ E  +RF++F+ NL     +   D TA + + KFSDL+
Sbjct: 21  LLKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKNQNDSTAQYEINKFSDLS 80

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL+    LP   Q   +  IL  P +  P +FDWR    VT VK+QG CG+
Sbjct: 81  KEEAISKYTGLS----LPHQTQNFCEVVILDRPPDRGPLEFDWRQFNKVTSVKNQGVCGA 136

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+  G+LE    +    L++LSEQQ +DCD           ++GC+GGL+++AFE  
Sbjct: 137 CWAFATLGSLESQFAIKYNRLINLSEQQFIDCDR---------VNAGCDGGLLHTAFESA 187

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPL 276
           ++ GGV+ E DYPY  T  G C+ + ++    V S    I   E+++   L   GP+
Sbjct: 188 MEMGGVQMESDYPYE-TANGQCRINPNRFVVGVRSCRRYIVMFEEKLKDLLRAVGPI 243


>gi|46309423|ref|YP_006313.1| ORF31 [Agrotis segetum granulovirus]
 gi|46200640|gb|AAS82707.1| ORF31 [Agrotis segetum granulovirus]
          Length = 327

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 85/254 (33%), Positives = 137/254 (53%), Gaps = 20/254 (7%)

Query: 45  HLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDL 104
           +L ++E  F  F  K++K+Y+++EE   +F  FK N+R    +  L  +AV+ +  +SD+
Sbjct: 17  NLNDSEKLFEDFVQKYNKSYSSEEERQIKFDNFKNNIRSINEKNSLSNSAVYDINFYSDM 76

Query: 105 TPSEFRRQFLGLNRRLR---------LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
             +E  R+  G    L+         +  + +     P   LP  FDWRD   +T VK+Q
Sbjct: 77  NKNELLRKQTGFKINLKKNNLDLSWNIKCNKKLINGNPAVLLPDSFDWRDRHVITSVKNQ 136

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
             CGSCW+FS    +E  + +   +L+ LSEQQLV+CD +         ++GCNGGLM+ 
Sbjct: 137 RDCGSCWAFSTIANIESLYAIKYNKLLDLSEQQLVNCDEQ---------NNGCNGGLMHW 187

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
           A E I++ GGV  E D+PYT +D G CK  +  +     N   I S+ED++   L+ +GP
Sbjct: 188 AMEEIIRQGGVSNETDFPYTASD-GFCKRKQGFVNINGCN-QFILSNEDRLRELLIFNGP 245

Query: 276 LAGNVASIELPHIS 289
           ++  +  I++   S
Sbjct: 246 ISIAIDVIDVIDYS 259


>gi|37651368|ref|NP_932731.1| cathepsin [Choristoneura fumiferana DEF MNPV]
 gi|82024252|sp|Q6VTL7.1|CATV_NPVCD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|37499277|gb|AAQ91676.1| cathepsin [Choristoneura fumiferana DEF MNPV]
          Length = 324

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 83/237 (35%), Positives = 132/237 (55%), Gaps = 20/237 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           LL A  +F  F   F+K Y+++ E  +RF++F+ NL     + L D +A + + KFSDL+
Sbjct: 21  LLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDTSAQYEINKFSDLS 80

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL+    LP   Q   +  +L  P +  P +FDWR    VT VK+QG CG+
Sbjct: 81  KDETISKYTGLS----LPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGA 136

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+  G+LE    +   +L++LSEQQL+DCD           D GC+GGL+++A+E +
Sbjct: 137 CWAFATLGSLESQFAIKHDQLINLSEQQLIDCDF---------VDMGCDGGLLHTAYEAV 187

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
           +  GG++ E DYPY   + G C+ + +K    V   +  +   E+++   L   GPL
Sbjct: 188 MNMGGIQAENDYPYEANN-GDCRLNAAKFVVKVKKCYRYVLMFEEKLKDLLRIVGPL 243


>gi|71400414|ref|XP_803044.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70865609|gb|EAN81598.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 467

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 90/237 (37%), Positives = 120/237 (50%), Gaps = 26/237 (10%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
            F+ FK K  + Y +  E  +R  VF+ANL  A+     +P A  GVT FSDLT  EFR 
Sbjct: 37  QFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHATFGVTPFSDLTREEFRS 96

Query: 112 Q-------FLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           +       F     R R+P D +          P   DWR+ GAVT VK+QG CGSCW+F
Sbjct: 97  RYHNGAAHFAAAQERARVPVDVEFV------GAPAAKDWREEGAVTAVKNQGMCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--K 222
           +A G +E   FL+   L  LSEQ LV CD+          +SGC GG    AF++I+   
Sbjct: 151 AAIGNIECQWFLAGNPLTRLSEQMLVSCDNT---------NSGCGGGWPLVAFKWIVDRN 201

Query: 223 AGGVEREKDYPYTGTDGGS--CKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
            G V  E+ YPY    G S  C      + A ++ +  I  DE+ +AA L  +GP+A
Sbjct: 202 NGTVYTEESYPYHSCIGISPPCTTSGHTVGATITGYVTIPRDENGIAAWLAVNGPVA 258


>gi|15593246|gb|AAL02220.1|AF410880_1 cysteine protease CP7 precursor [Frankliniella occidentalis]
          Length = 333

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 96/261 (36%), Positives = 133/261 (50%), Gaps = 24/261 (9%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPT 93
           +PSD        +  + H+  FK+  +KTYA   E  YR +VFK N +R AK        
Sbjct: 18  IPSD--------MEIQAHWESFKATHAKTYANAAEEAYRAKVFKENAIRIAKHNDRFASG 69

Query: 94  AVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
            V    G  +++D+   E   +  G    L+  +         +       DWR  GAVT
Sbjct: 70  EVTFKVGYNQYADMHTHEVTEKLNGYRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAVT 129

Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
            +KDQG CGSCWSFSATG+LEG  FL    LVSLSEQ LVDC  +   E       GCNG
Sbjct: 130 PIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNG 182

Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAAN 269
           GLM+SAFEY+   GG++ E+ YPYT  D G+C +  +  A   + +  V +  E  +   
Sbjct: 183 GLMDSAFEYVKSYGGIDTEESYPYTAED-GTCLYKAANNAGVNTGYKDVQAKSESALRDA 241

Query: 270 LVKHGPLAGNVASIELPHISF 290
           + K GP++    +I+  + SF
Sbjct: 242 VEKVGPVS---VAIDASNWSF 259


>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 335

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 97/257 (37%), Positives = 134/257 (52%), Gaps = 22/257 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVH---GVTKF 101
           L+ AE  +S FK+   K YA+  E  YR +++  N L+ A+  +    + V     + +F
Sbjct: 22  LVGAE--WSAFKALHGKDYASDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEF 79

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN----DLPTDFDWRDHGAVTGVKDQGA 157
            DL   EF     G  R  R         + P       LP   DWR  GAVT VK+QG 
Sbjct: 80  GDLLHHEFVSTRNGFKRNYRDSPREGSFFVEPEGFEDLQLPKTVDWRKKGAVTPVKNQGQ 139

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FS TG+LEG HF  T +LVSLSEQ LVDC            ++GC GGLM++AF
Sbjct: 140 CGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFG-------NNGCEGGLMDNAF 192

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPL 276
           +YI    G++ E  YPY  TD G C F++S + A  + F  I   DE+++   +   GP+
Sbjct: 193 KYIKSNKGIDTEWSYPYNATD-GVCHFNRSDVGATDTGFVDIPEGDENKLKKAVAAVGPV 251

Query: 277 AGNVASIELPHISFSFL 293
           +    +I+  H SF F 
Sbjct: 252 S---VAIDASHESFQFY 265


>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
          Length = 332

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 98/260 (37%), Positives = 136/260 (52%), Gaps = 28/260 (10%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
           DH L+A+  +  +K+   K Y   EE   R  +++ N++  +R         H  T    
Sbjct: 22  DHSLDAD--WYKWKATHRKLYGLNEE-GRRRAIWEKNMKMIERHNWEHRQGKHSFTMAMN 78

Query: 100 KFSDLTPSEFRRQFLGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQ 155
            F D+T  EFR+   G     +++ ++  DA  A        P   DWR+ G VT VK+Q
Sbjct: 79  AFGDMTNEEFRKTMNGFQNQKHKKGKVFLDAGSALT------PHSVDWREKGYVTAVKNQ 132

Query: 156 GACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNS 215
           G CGSCW+FSATGALEG  F  T +L+SLSEQ LVDC     PE     + GCNGGLM++
Sbjct: 133 GHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSW---PEG----NEGCNGGLMDN 185

Query: 216 AFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGP 275
           AF+YI   GG++ E+ YPY G D GSCK+     AA  + +  I   E  +   +   GP
Sbjct: 186 AFQYIKDNGGLDSEESYPYFGKD-GSCKYKPQSSAANDTGYVDIPKQEKALMKAVATVGP 244

Query: 276 LAGNVASIELPHISFSFLFT 295
           ++     I+  H SF F  T
Sbjct: 245 IS---VGIDASHESFQFYST 261


>gi|410974700|ref|XP_003993781.1| PREDICTED: cathepsin F [Felis catus]
          Length = 459

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 95/232 (40%), Positives = 133/232 (57%), Gaps = 24/232 (10%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
           F  F + +++TY TQEE  +R  VF  N+ RA++ Q LD  TA +G+TKFSDLT  EFR 
Sbjct: 162 FKEFVTTYNRTYGTQEEAQWRLSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRA 221

Query: 112 QFLG------LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
            +L        N+ + L            +  P ++DWR  GAVT VK+QG CGSCW+FS
Sbjct: 222 IYLNPLLKENRNKMMHLAKSI-------GDHAPPEWDWRTKGAVTNVKNQGMCGSCWAFS 274

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TG +EG  FL  G+L+SLSEQ+L+DCD           D  C GGL ++A+  I   GG
Sbjct: 275 VTGNVEGQWFLKQGDLLSLSEQELLDCD---------KVDKACLGGLPSNAYLAIKNLGG 325

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +E E DY Y+G    +C F   K    +++   +S +E ++AA L K GP++
Sbjct: 326 LETEDDYSYSG-HLQTCSFSAKKAKVYINDSVELSQNEQKLAAWLAKKGPIS 376


>gi|167833701|gb|ACA02577.1| cathepsin [Spodoptera frugiperda MNPV]
          Length = 340

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 81/238 (34%), Positives = 135/238 (56%), Gaps = 15/238 (6%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           +F  F ++++K Y +++E  YR+ +F+ N+    ++   + +AV+ + +F+D+T +E   
Sbjct: 42  YFEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNSRNDSAVYKINRFADMTKNEIVI 101

Query: 112 QFLGLNRRLRLPADAQKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +  GL     L A+  +  ++        P +FDWR    VT VKDQG CG+CW+F+  G
Sbjct: 102 RHTGLASG-ELGANFCETVVVDGPAQRQRPANFDWRTLNKVTSVKDQGMCGACWAFAGLG 160

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           ALE  + +    L+ L+EQQLVDCD           D GC+GGL+++A+E I++ GGVE+
Sbjct: 161 ALESQYAIKYDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMRMGGVEQ 211

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIEL 285
           E DYPY   +   C     K AA V N +  +  +E+++   L   GP+A  V +++L
Sbjct: 212 EFDYPYK-AERQPCALKPHKFAAGVRNCYRYVLMNEERLEDLLRYVGPIAIAVDAVDL 268


>gi|125860143|ref|YP_001036312.1| viral cathepsin [Spodoptera frugiperda MNPV]
 gi|120969288|gb|ABM45731.1| viral cathepsin [Spodoptera frugiperda MNPV]
 gi|319997353|gb|ADV91251.1| V-CATH [Spodoptera frugiperda MNPV]
 gi|384087478|gb|AFH58958.1| v-cath [Spodoptera frugiperda MNPV]
          Length = 339

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 81/238 (34%), Positives = 135/238 (56%), Gaps = 15/238 (6%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           +F  F ++++K Y +++E  YR+ +F+ N+    ++   + +AV+ + +F+D+T +E   
Sbjct: 41  YFEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNSRNDSAVYKINRFADMTKNEIVI 100

Query: 112 QFLGLNRRLRLPADAQKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +  GL     L A+  +  ++        P +FDWR    VT VKDQG CG+CW+F+  G
Sbjct: 101 RHTGLASG-ELGANFCETVVVDGPAQRQRPANFDWRTLNKVTSVKDQGMCGACWAFAGLG 159

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           ALE  + +    L+ L+EQQLVDCD           D GC+GGL+++A+E I++ GGVE+
Sbjct: 160 ALESQYAIKYDRLIDLAEQQLVDCDF---------VDMGCDGGLIHTAYEQIMRMGGVEQ 210

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPLAGNVASIEL 285
           E DYPY   +   C     K AA V N +  +  +E+++   L   GP+A  V +++L
Sbjct: 211 EFDYPYK-AERQPCALKPHKFAAGVRNCYRYVLMNEERLEDLLRYVGPIAIAVDAVDL 267


>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
          Length = 333

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 95/253 (37%), Positives = 131/253 (51%), Gaps = 20/253 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
           +   NA+ H   +KS   + Y T EE ++R  V++ N++  +          HG T    
Sbjct: 22  NQTFNAQWH--KWKSTHRRLYDTNEE-EWRRAVWEKNMKMIELHNGEYSEGKHGFTMEMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   +        + P++    LP   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTNEEFRQLVNGYKHQKHRKGKLFQEPLML--QLPKSVDWREKGCVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSA GALEG   L TG LVSLSEQ LVDC         G  + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSR-------GEGNQGCNGGLMDFAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           +L   G++ E+ YPY   D G+CK+     AA  + +  I   E  +   +   GP+A  
Sbjct: 190 VLNNKGLDSEESYPYEAKD-GTCKYKPEFAAANDTGYVDIPQLEKALMKAVATVGPIA-- 246

Query: 280 VASIELPHISFSF 292
             +I+  H SF F
Sbjct: 247 -VAIDASHPSFQF 258


>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
 gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 87/237 (36%), Positives = 130/237 (54%), Gaps = 18/237 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           F  +  ++ KTY+++EE   R +VF+ N     +   + + +    +  F+DLT  EF+ 
Sbjct: 29  FEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFADLTHHEFKA 88

Query: 112 QFLGLNRRLRLPADAQ--KAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSAT 167
             LG +     P  AQ  ++   P  +L  P   DWR  GAVTGVKDQG CG CWSFS T
Sbjct: 89  SRLGFS-----PGRAQSIRSVGTPVQELHVPPAVDWRKSGAVTGVKDQGNCGGCWSFSTT 143

Query: 168 GALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVE 227
           GA+EG + + TG LVSLSEQ+LVDCD         S +SGC GGLM+ A+++++K  G++
Sbjct: 144 GAIEGINKIVTGSLVSLSEQELVDCDR--------SYNSGCEGGLMDYAYQFVIKNQGID 195

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
            E DYPY G D    K    K    +  ++ I  ++++    +V   P++  +   E
Sbjct: 196 SEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGSE 252


>gi|215401412|ref|YP_002332715.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
 gi|209483953|gb|ACI47386.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
          Length = 337

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 83/238 (34%), Positives = 134/238 (56%), Gaps = 15/238 (6%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           +F  F ++++K Y T++E  YR+ +F+ N+     +   + +A++ + +F+D+T +E   
Sbjct: 39  YFEKFIAQYNKKYKTEDEKKYRYNIFRHNMESINHKNSRNDSAIYKINRFADMTKNEVVI 98

Query: 112 QFLGLNRRLRLPADAQKAPIL---PTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +  GL     L A+  +  ++        PT FDWR    VT VKDQG CG+CW+F+  G
Sbjct: 99  RHTGLASG-ELGANFCETIVVDGPAQRQRPTSFDWRTLNKVTSVKDQGMCGACWAFAGLG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           ALE  + +    L+ L+EQQLVDCD         S D GC+GGL+++A+E I+  GGVE+
Sbjct: 158 ALESQYAIKYDRLIDLAEQQLVDCD---------SVDMGCDGGLIHTAYEQIMHMGGVEQ 208

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAV-SNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
           E DYPY   +   C     K AA V S +  +  +E+++   L   GP+A  V +++L
Sbjct: 209 EFDYPYR-AERQPCALKPHKFAAGVRSCYRYVLLNEERLEDLLRYVGPIAIAVDAVDL 265


>gi|32394728|gb|AAM96000.1| cathepsin L precursor [Metapenaeus ensis]
          Length = 322

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 98/246 (39%), Positives = 132/246 (53%), Gaps = 26/246 (10%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           FK ++ + Y T  E  YR  VF+ N +       + +  + T    + +F D+T  EF  
Sbjct: 22  FKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEFAA 81

Query: 112 QFLG-LNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
              G LN   R P       IL  +D  LP   DWR  GAVT VKDQ  CGSCW+FS TG
Sbjct: 82  TMNGFLNVPTRHPV-----AILEADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTG 136

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVE 227
           +LEG HFL  G+LVSLSEQ LVDC        SG   + GC GGLM+ AF+YI +  G++
Sbjct: 137 SLEGQHFLKDGKLVSLSEQNLVDC--------SGKFGNMGCCGGLMDQAFKYIKENKGID 188

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIELP 286
            E+ YPY   D G C+FD S + A  + F  I+  +E+ +   +   GP++    +I+  
Sbjct: 189 TEESYPYEAQD-GKCRFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPIS---VAIDAS 244

Query: 287 HISFSF 292
           H SF F
Sbjct: 245 HPSFQF 250


>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
 gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
          Length = 475

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 82/225 (36%), Positives = 121/225 (53%), Gaps = 28/225 (12%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL--DPTAVH-GVTKFSDLTPSEF 109
           F  +K +  K Y   EE   R   FK NL+    R  +   P   H G+ +F+D++  EF
Sbjct: 51  FQQWKKEHQKFYIHPEEAALRLENFKRNLKYIVERNAMRNSPVGHHLGLNRFADMSNEEF 110

Query: 110 RRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
           + +F+              + +   +D P   DWR  G VTGVKDQG CGSCWSFS+TGA
Sbjct: 111 KNKFI--------------SKVESCDDAPYSLDWRKKGVVTGVKDQGNCGSCWSFSSTGA 156

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           +EG + + TG+L+SLSEQ+LVDCD         + + GC GG M+ AFE+++  GG++ E
Sbjct: 157 IEGVNAIVTGDLISLSEQELVDCD---------TTNDGCEGGYMDYAFEWVINNGGIDTE 207

Query: 230 KDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKH 273
            DYPY G  GG+C   K +     +  ++ ++  +  +    VK 
Sbjct: 208 ADYPYIGV-GGTCNVTKEETKVVTIDGYTDVTQSDSALFCATVKQ 251


>gi|23452059|gb|AAN32912.1| cathepsin [Danio rerio]
          Length = 310

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 94/245 (38%), Positives = 128/245 (52%), Gaps = 27/245 (11%)

Query: 62  KTYATQEEH----DYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQF 113
           K + +++ H     +R   +K NL+  +   L     +H    G+  F D+T  EFR+  
Sbjct: 6   KKWPSKKXHAPXXGWRRIFWKKNLKXIEMHNLXHSMGIHTYRLGMNHFGDMTHEEFRQVM 65

Query: 114 LGL----NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGA 169
            G     +RR R     +   I    ++P   DWR+ G VT VKDQG CGSCW+FS TGA
Sbjct: 66  NGFKHKKDRRFRGSLFMEPXFI----EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGA 121

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           LEG  F  TG+LVSLSEQ LVDC     PE     + GCNGGLM+ AF+Y+    G++ E
Sbjct: 122 LEGQMFRKTGKLVSLSEQNLVDCSR---PE----GNEGCNGGLMDQAFQYVKDQNGLDSE 174

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
           + YPY GTD   C FD    AA  + F  + S  E  +   +   GP++    +I+  H 
Sbjct: 175 ESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSGKERALMKAIAAVGPVS---VAIDAGHE 231

Query: 289 SFSFL 293
           SF F 
Sbjct: 232 SFQFY 236


>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
 gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
          Length = 479

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 90/211 (42%), Positives = 121/211 (57%), Gaps = 17/211 (8%)

Query: 73  RFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--LNR-RLRLPADAQKA 129
           R+ +FK NLR        +     G+  F+DLT  EFR Q  G   +R R R   +  + 
Sbjct: 85  RYGIFKDNLRFIHGENEKNQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSYEEFRY 144

Query: 130 PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
             +   DLP   DWR+ GAV GVKDQG+CGSCW+FSA  A+EG + L+TGELVSLSEQ+L
Sbjct: 145 GSVQLKDLPDSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQEL 204

Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
           VDCD           D GCNGGLM+ AF +++K GG++ E DYPY G  G  C  D+SK+
Sbjct: 205 VDCDK--------GEDEGCNGGLMDYAFGFVIKNGGLDTEADYPYKGY-GTRC--DRSKM 253

Query: 250 AAAV---SNFSVISSDEDQMAANLVKHGPLA 277
            A V     +  +  +++      V H P++
Sbjct: 254 NAKVVTIDGYEDVPVNDETALLKAVAHQPVS 284


>gi|348531523|ref|XP_003453258.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 341

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 92/249 (36%), Positives = 135/249 (54%), Gaps = 20/249 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
           F  +K KF K+Y ++ +   R +++  N +      +L    +     G+T+F+D+   E
Sbjct: 33  FHAWKLKFEKSYDSESDEAQRKQIWLNNRKHVLVHNILADQGLKSYRLGMTQFADMENEE 92

Query: 109 FRR---QFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAVTGVKDQGACGSCWSF 164
           ++R   Q    +    LP        LP    LP   DWRD G VT V++Q  CGSCW+F
Sbjct: 93  YKRLVSQGCLHSFNSSLPRRGSTFFRLPKGTVLPDTVDWRDKGYVTNVQNQMDCGSCWAF 152

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAG 224
           SATG+LEG HF  TG+LVSLS+QQLVDC  E   E       GCNGGLM+SAF+YI   G
Sbjct: 153 SATGSLEGQHFRKTGKLVSLSKQQLVDCSGEFGNE-------GCNGGLMDSAFQYIQANG 205

Query: 225 GVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASI 283
           G++ E+ YPY   D G C+++     A  + +  V  ++E+ +   +   GP++    +I
Sbjct: 206 GIDTEESYPYEAED-GKCRYNPKSTGATCTGYVDVQPANEETLKEAVATIGPIS---VAI 261

Query: 284 ELPHISFSF 292
           +  H SF F
Sbjct: 262 DAFHPSFQF 270


>gi|154332647|ref|XP_001562140.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134059588|emb|CAM37170.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 441

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 89/234 (38%), Positives = 122/234 (52%), Gaps = 18/234 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + YAT +E   R   F+ NL   +  Q  +P A  G+TKF DL+  EF  +
Sbjct: 38  FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 97

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L       +  +  +   +      +  P   DWR+ GAVT VKDQG CGSCW+FSA G
Sbjct: 98  YLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
            +E   +L+T  L+SLSEQ+LV CD           D GCNGGLM  AF+++L  + G V
Sbjct: 158 NIESQWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNRNGAV 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
                YPY   +G   +  +S    I A +     I S+ED MAA L  +GP+A
Sbjct: 209 YTGVSYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIA 262


>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 340

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 95/260 (36%), Positives = 130/260 (50%), Gaps = 25/260 (9%)

Query: 47  LNAEH--HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQ----LLDPTAVHGVTK 100
           LN +H   F  +K+ + K Y T EE + +   +  N  +         L   +    + +
Sbjct: 21  LNQQHVSLFQTWKNLWKKVYQTVEEEEQKMATWFNNWNKISEHNMQYSLKQKSYRLEMNE 80

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQKAPIL-------PTNDLPTDFDWRDHGAVTGVK 153
           + DLT  EF     G    +RL   +                 LPT  DWR HG VT VK
Sbjct: 81  YGDLTSEEFSSMMNGYRNDIRLKRKSTGGSTYLNLLSFGSQIQLPTLVDWRKHGLVTPVK 140

Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
           +QG CGSCWSFSATG+LEG H   TG+LVSLSEQ L+DC     PE     + GCNGGLM
Sbjct: 141 NQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLIDCS---TPEG----NDGCNGGLM 193

Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVK 272
           + AF+YI   GG++ E  YPY   D  +C+F+ +   A  + F  + S DE+ +      
Sbjct: 194 DQAFKYIKIQGGIDTEAYYPYEAKD-DTCRFNITDSGATDTGFVDIKSGDEEMLKEAAAT 252

Query: 273 HGPLAGNVASIELPHISFSF 292
            GP++    +I+  H SF F
Sbjct: 253 VGPIS---VAIDASHTSFQF 269


>gi|146078033|ref|XP_001463431.1| cathepsin L-like protease [Leishmania infantum JPCM5]
 gi|134067516|emb|CAM65796.1| cathepsin L-like protease [Leishmania infantum JPCM5]
          Length = 381

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 96/236 (40%), Positives = 128/236 (54%), Gaps = 22/236 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + Y T  E   R   F+ NL   +  Q  +P A  G+TKF DL+ +EF  +
Sbjct: 38  FEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97

Query: 113 FLGLNRRLRLPADAQKA------PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSA 166
           +L  N      A  Q A           + +P   DWR+ GAVT VKDQGACGSCW+FSA
Sbjct: 98  YL--NGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSA 155

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AG 224
            G +E     +   LVSLSEQQLV CD +         D+GCNGGLM  AFE++L+   G
Sbjct: 156 VGNIESQWARAGHGLVSLSEQQLVSCDDK---------DNGCNGGLMLQAFEWLLRHMYG 206

Query: 225 GVEREKDYPYTGTDGGSCK-FDKSKI--AAAVSNFSVISSDEDQMAANLVKHGPLA 277
            V  EK YPYT  +G   +  + SK+   A +  + +I S+E  MAA L ++GP+A
Sbjct: 207 IVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIA 262


>gi|154332649|ref|XP_001562141.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134059589|emb|CAM37171.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 441

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 89/234 (38%), Positives = 122/234 (52%), Gaps = 18/234 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + YAT +E   R   F+ NL   +  Q  +P A  G+TKF DL+  EF  +
Sbjct: 38  FEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHARFGITKFFDLSEEEFATR 97

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L       +  +  +   +      +  P   DWR+ GAVT VKDQG CGSCW+FSA G
Sbjct: 98  YLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWAFSAIG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL--KAGGV 226
            +E   +L+T  L+SLSEQ+LV CD           D GCNGGLM  AF+++L  + G V
Sbjct: 158 NIESQWYLATHSLISLSEQELVSCD---------DVDEGCNGGLMLQAFDWLLNNRNGAV 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
                YPY   +G   +  +S    I A +     I S+ED MAA L  +GP+A
Sbjct: 209 YTGVSYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIA 262


>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
          Length = 329

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 85/233 (36%), Positives = 131/233 (56%), Gaps = 17/233 (7%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+ ++K   SKTY ++ E   R  +++ NLR      L     +H    G+    D+T  
Sbjct: 25  HWLMWKKNHSKTYTSELEELGRREIWERNLRLITVHNLEASLGMHTYDLGMNHMGDMTRE 84

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           E  + F G   R+R     + +P + +  +  P   DWR+ G VT VK+QG+CGSCW+FS
Sbjct: 85  EILQMFAGT--RVRPNLTRRSSPFVASAGISVPDSVDWREKGYVTEVKNQGSCGSCWAFS 142

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           A GALEG    +TG++ SLS Q LVDC        S   + GCNGG M  AF+Y++  GG
Sbjct: 143 AAGALEGQLKRTTGQVKSLSPQNLVDC-------SSKYGNKGCNGGFMTQAFQYVIDDGG 195

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLA 277
           ++ ++ YPYT  D G C++D+S+ AA  S+++ +S  DE+ +   +   GP++
Sbjct: 196 IDSDEAYPYTAMD-GQCRYDQSQRAANCSSYNYVSEGDEEALKQAVATIGPIS 247


>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
          Length = 324

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 87/237 (36%), Positives = 129/237 (54%), Gaps = 31/237 (13%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL----DPTAVHGVTKFSDLTPSE 108
           F  FK +  KTY  Q E   RF +F  N+R  +    L      +   G+ KF+D++  E
Sbjct: 26  FQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQEE 85

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTN-------DLPTDFDWRDHGAVTGVKDQGACGSC 161
           F+           L   A + P L T        ++P+  DWR  G VTGVKDQG CGSC
Sbjct: 86  FKTM---------LTLSASRKPTLETTSYVKTGVEIPSSVDWRKEGRVTGVKDQGDCGSC 136

Query: 162 WSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYIL 221
           W+FS TG+ EGA+   +G+LVSLSEQQL+DC   C         +GC+GG ++  F+Y++
Sbjct: 137 WAFSITGSTEGAYARKSGKLVSLSEQQLIDC---CT-----DTSAGCDGGSLDDNFKYVM 188

Query: 222 KAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLA 277
           K  G++ E+ Y Y G D G+CK++ + +   VS ++ I + DED +   +   GP++
Sbjct: 189 K-DGLQSEESYTYKGED-GACKYNVASVVTKVSKYTSIPAEDEDALLEAVATVGPVS 243


>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
          Length = 433

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 93/255 (36%), Positives = 141/255 (55%), Gaps = 19/255 (7%)

Query: 42  SEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTK 100
           S+D ++ A H    + +++S+ Y    E   RF VFKAN++  +            GV +
Sbjct: 121 SDDSVMVARHE--QWMAQYSRVYKDASEKARRFEVFKANVQFIESFNAGGNNKFWLGVNQ 178

Query: 101 FSDLTPSEFR--RQFLGL-NRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGA 157
           F+DLT  EFR  +   GL +  +++P   +   +   + LPT  DWR  GAVT +KDQG 
Sbjct: 179 FADLTNDEFRSTKTNKGLKSSNMKIPTGFRYENV-SADALPTTIDWRTKGAVTPIKDQGQ 237

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CG CW+FSA  A EG   +STG+LVSL+EQ+LVDCD   +       D GC GGLM+ AF
Sbjct: 238 CGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGE-------DQGCEGGLMDDAF 290

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           ++I+K GG+  E  YPYT  D G CK   S  AA +  +  + ++++      V + P++
Sbjct: 291 KFIIKNGGLTTESSYPYTAAD-GKCK-SGSNSAATIKGYEDVPANDEAALMKAVANQPVS 348

Query: 278 GNVASIELPHISFSF 292
               +++   ++F F
Sbjct: 349 ---VAVDGGDMTFQF 360


>gi|30142040|gb|AAN34825.1| cysteine proteinase [Leishmania amazonensis]
 gi|30142042|gb|AAN34826.1| cysteine proteinase [Leishmania amazonensis]
 gi|30142572|gb|AAP21894.1| cysteine proteinase [Leishmania amazonensis]
          Length = 354

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 101/275 (36%), Positives = 144/275 (52%), Gaps = 27/275 (9%)

Query: 12  LLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHD 71
           LL + V+     V    A+I Q  P+      D+ + A  H+  FK + SK +    E  
Sbjct: 7   LLFAIVVTILFVVCYGSALIAQTPPA-----VDNFV-ASAHYGSFKKRHSKAFGGDAEEG 60

Query: 72  YRFRVFKANLRRAKRRQLLDPTAVHGVT-KFSDLTPSEFRRQFLGLNRRLRLPADAQKAP 130
           +RF  FK N++ A      +P A + V+ KF+DLTP EF + +L  +       D  K  
Sbjct: 61  HRFNAFKQNMQTAYFLNTQNPHAHYDVSGKFADLTPQEFAKLYLNPDYYTSHLKD-HKED 119

Query: 131 ILPTNDLPT---DFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQ 187
           +   +  P+     DWRD GAVT VK+QG CGSCW+FSA G +EG    S   LVSLSEQ
Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179

Query: 188 QLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKDYPYTGTDGGSCK-- 243
            LV CD         + D GCNGGLM+ A  +I+++  G V  E  YPY  T GG  +  
Sbjct: 180 MLVSCD---------NVDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPY--TSGGGTRPP 228

Query: 244 -FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             D+ ++ A ++ F  +  DE+++A  + K GP+A
Sbjct: 229 CHDEGEVGAKITGFLSLPHDEERIADWVEKRGPVA 263


>gi|118125|sp|P25784.1|CYSP3_HOMAM RecName: Full=Digestive cysteine proteinase 3; Flags: Precursor
          Length = 321

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 95/253 (37%), Positives = 128/253 (50%), Gaps = 19/253 (7%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
           L  A   +  FK+++ + Y   +E  YR RVF+ N +      K+ +  + T    + +F
Sbjct: 13  LATASPSWDHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQF 72

Query: 102 SDLTPSEFRRQFLGLNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
            D+T  EF     G  +  R  P     A   P   +  D DWR    VT VKDQ  CGS
Sbjct: 73  GDMTNEEFNAVMKGYKKGSRGEPKAVFTAEAGP---MAADVDWRTKALVTPVKDQEQCGS 129

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FSATGALEG HFL   ELVSLSEQQLVDC  +         + GC GG M SAF+YI
Sbjct: 130 CWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYG-------NDGCGGGWMTSAFDYI 182

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
              GG++ E  YPY   D  SC+FD + I A  +    +   E+ +   +   GP++   
Sbjct: 183 KDNGGIDTESSYPYEAED-RSCRFDANSIGAICTGSVEVQHTEEALQEAVSGVGPIS--- 238

Query: 281 ASIELPHISFSFL 293
            +I+  H SF F 
Sbjct: 239 VAIDASHFSFQFY 251


>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
          Length = 345

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 87/253 (34%), Positives = 142/253 (56%), Gaps = 19/253 (7%)

Query: 41  QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
           +S+  L  +E H  L+ S+  + Y  + E   RF +FK N++  +   +  + +   G+ 
Sbjct: 28  RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86

Query: 100 KFSDLTPSEFRRQFLGLN--RRLRLPA-----DAQKAPILPTNDLPTDFDWRDHGAVTGV 152
           +F+D+T  EF  +F GLN       P+     + +K   L  +D+P++ DWR+ GAVT V
Sbjct: 87  EFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDLSDDDMPSNLDWRESGAVTQV 146

Query: 153 KDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGL 212
           K QG CG CW+FSA G+LEGA+ ++TG+L+  SEQ+L+DC          + + GCNGG 
Sbjct: 147 KHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDCT---------TNNYGCNGGF 197

Query: 213 MNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVK 272
           M +AF++I++ GG+ RE DY Y G +  +C+  +   A  +S++ V+   E  +   + K
Sbjct: 198 MTNAFDFIIENGGISRESDYEYLG-EQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTK 256

Query: 273 HGPLAGNVASIEL 285
                G  AS +L
Sbjct: 257 QPVSIGIAASQDL 269


>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1 [Vitis vinifera]
          Length = 341

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 96/287 (33%), Positives = 150/287 (52%), Gaps = 36/287 (12%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           LL +L++  + A A N  +A +        E+ ED +           +++ + Y   +E
Sbjct: 14  LLFVLAAWASQATARNLHEASMY-------ERHEDWM-----------AQYGRVYKDADE 55

Query: 70  HDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEF---RRQFLGLNRRLRLPAD 125
              R+++FK N+ R +   + +D +    + +F+DLT  EF   R +F    +      +
Sbjct: 56  KSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFGTSRNRF----KAHICSTE 111

Query: 126 AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLS 185
           A          +P+  DWR  GAVT +KDQG CGSCW+FSA  A+EG   LSTG+L+SLS
Sbjct: 112 ATSFKYENVTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLS 171

Query: 186 EQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFD 245
           EQ+LVDCD       SG  D GCNGGLM+ AF++I +  G+  E +YPY GTDG   +  
Sbjct: 172 EQELVDCD------TSGE-DQGCNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCNRKK 224

Query: 246 KSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
            +  AA ++ +  + ++ ++     V H P+A    +I+     F F
Sbjct: 225 AAHPAAKINGYEDVPANNEKALQKAVVHQPIA---VAIDAGGFEFQF 268


>gi|261289781|ref|XP_002611752.1| hypothetical protein BRAFLDRAFT_284342 [Branchiostoma floridae]
 gi|229297124|gb|EEN67762.1| hypothetical protein BRAFLDRAFT_284342 [Branchiostoma floridae]
          Length = 327

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 93/263 (35%), Positives = 139/263 (52%), Gaps = 23/263 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKF 101
           L+N E  + +FK  +++ YA +EE+  R  +F+ NL+  +         +H    GV ++
Sbjct: 18  LMNPE--WEVFKKAYNRVYAAEEEYARRL-IFEDNLKTIQMHNEEADRGLHTFRLGVNQY 74

Query: 102 SDLTPSEFRRQFLG---LNRRLRLPADAQKAPILPT-NDLPTDFDWRDHGAVTGVKDQGA 157
           +D+T  EF    +G   L+               PT  D+P   DWRD G VT VK+Q  
Sbjct: 75  ADMTHKEFLENVIGGCLLDTNTSKSTADHVHEYDPTLTDVPDTVDWRDKGYVTPVKNQAQ 134

Query: 158 CGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAF 217
           CGSCW+FS TG+LEG HF +T +LVSLSEQ L+DC  +         + GC GGLM+ AF
Sbjct: 135 CGSCWAFSTTGSLEGQHFKATNKLVSLSEQNLMDCSRK-------EGNQGCQGGLMDQAF 187

Query: 218 EYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPL 276
           +YI   GG++ E+ YPY   +   C +  S   A +S+++ V S DED +   +   GP+
Sbjct: 188 KYIKTNGGIDTEECYPYKAKN-EQCNYQASCSGATLSSYTDVKSKDEDALQQAVATVGPI 246

Query: 277 AGNVASIELPHISFSFLFTVSSP 299
           +    +I+  H SF    +   P
Sbjct: 247 S---VAIDAGHSSFQLYHSGKPP 266


>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
 gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
          Length = 334

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 84/245 (34%), Positives = 132/245 (53%), Gaps = 24/245 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  +  K +K Y   E +D +++ FK N+         +   V G+ +F+DLT  E+++ 
Sbjct: 34  FLGWMKKHNKAYHHHEFND-KYQTFKDNMDFIHNWNSKESDTVLGLNRFADLTNEEYKKT 92

Query: 113 FLGLNRRLRLPADAQKAPILPTNDL-------PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           +LG++  + L A+      +P N L       P+  DWR +GAV  VKDQG CGSCW+F+
Sbjct: 93  YLGMSINVNLRANQ-----VPMNGLNFERFTGPSSIDWRQNGAVAYVKDQGHCGSCWAFA 147

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGA+EGAH + TG +V+ SEQ LVDC            ++GC+GGLM SAF+YI+   G
Sbjct: 148 TTGAVEGAHQIKTGNMVTFSEQHLVDCSGRYG-------NNGCDGGLMTSAFKYIIDNDG 200

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
           +  E+ YPYT T    C ++ + +  A+S +  +    +      +   P+A    +I+ 
Sbjct: 201 IATEEAYPYTATQ-NRCVYNTTMLGTAISGYKDVPRGSESALTAAISKQPVA---VAIDA 256

Query: 286 PHISF 290
             I+F
Sbjct: 257 SPITF 261


>gi|32394730|gb|AAM96001.1| cathepsin L precursor [Metapenaeus ensis]
          Length = 306

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 98/247 (39%), Positives = 132/247 (53%), Gaps = 26/247 (10%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLR----RAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           FK ++ + Y T  E  YR  VF+ N +       + +  + T    + +F D+T  EF  
Sbjct: 6   FKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEFAA 65

Query: 112 QFLG-LNRRLRLPADAQKAPILPTND--LPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
              G LN   R P       IL  +D  LP   DWR  GAVT VKDQ  CGSCW+FS TG
Sbjct: 66  TMNGFLNVPTRHPV-----AILEADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTG 120

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGS-CDSGCNGGLMNSAFEYILKAGGVE 227
           +LEG HFL  G+LVSLSEQ LVDC        SG   + GC GGLM+ AF+YI +  G++
Sbjct: 121 SLEGQHFLKDGKLVSLSEQNLVDC--------SGKFGNMGCCGGLMDQAFKYIKENKGID 172

Query: 228 REKDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLAGNVASIELP 286
            E+ YPY   D G C+FD S + A  + F  I+  +E+ +   +   GP++    +I+  
Sbjct: 173 TEESYPYEAQD-GKCRFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPIS---VAIDAS 228

Query: 287 HISFSFL 293
           H SF F 
Sbjct: 229 HPSFQFY 235


>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
 gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
          Length = 479

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 90/211 (42%), Positives = 121/211 (57%), Gaps = 17/211 (8%)

Query: 73  RFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLG--LNR-RLRLPADAQKA 129
           R+ +FK NLR        +     G+  F+DLT  EFR Q  G   +R R R   +  + 
Sbjct: 85  RYGIFKDNLRFIHGENEKNQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSHEEFRY 144

Query: 130 PILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
             +   DLP   DWR+ GAV GVKDQG+CGSCW+FSA  A+EG + L+TGELVSLSEQ+L
Sbjct: 145 GSVQLKDLPDSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQEL 204

Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
           VDCD           D GCNGGLM+ AF +++K GG++ E DYPY G  G  C  D+SK+
Sbjct: 205 VDCDK--------GEDEGCNGGLMDYAFGFVIKNGGLDTEADYPYKGY-GTRC--DRSKM 253

Query: 250 AAAV---SNFSVISSDEDQMAANLVKHGPLA 277
            A V     +  +  +++      V H P++
Sbjct: 254 NAKVVTIDGYEDVPVNDETALLKAVAHQPVS 284


>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
 gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 129/227 (56%), Gaps = 11/227 (4%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  + SK  K Y + EE  +RF +FK NL               G+ +F+DL+  EF+ +
Sbjct: 33  FESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKKVVNYWLGLNEFADLSHEEFKNK 92

Query: 113 FLGLNRRLRLPAD-AQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
           +LGLN  L    + +++      + +P   DWR  GAVT VK+QG+CGSCW+FS   A+E
Sbjct: 93  YLGLNVDLSNRRECSEEFTYKDVSSIPKSVDWRKKGAVTDVKNQGSCGSCWAFSTVAAVE 152

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           G + + TG L SLSEQ+LVDCD         + ++GCNGGLM+ AF YI+  GG+ +E+D
Sbjct: 153 GINQIVTGNLTSLSEQELVDCDT--------TYNNGCNGGLMDYAFAYIISNGGLHKEED 204

Query: 232 YPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY   + G+C+  K++     +S +  +  + ++     + + PL+
Sbjct: 205 YPYI-MEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLS 250


>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
 gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 94/295 (31%), Positives = 152/295 (51%), Gaps = 36/295 (12%)

Query: 6   LSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYA 65
           +S  L+  L ++ + A+A    DA I        E+ E+ +           ++F + Y+
Sbjct: 10  ISLALIFFLGALASQAIARTLQDASIH-------EKHEEWM-----------TRFKRVYS 51

Query: 66  TQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA 124
             +E + R+++FK N++R +   +  + +   G+ +F+DLT  EF+      NR      
Sbjct: 52  DAKEKEIRYKIFKENVQRIESFNKASEKSYKLGINQFADLTNEEFKTS---RNRFKGHMC 108

Query: 125 DAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELV 182
            +Q  P    N   +P+  DWR  GAVT +KDQG CGSCW+FSA  A+EG   L+T +L+
Sbjct: 109 SSQAGPFRYENITAVPSSMDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLI 168

Query: 183 SLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSC 242
           SLSEQ+LVDCD + +       D GC GGLM+ AF++I +  G+  E +YPY G+DG   
Sbjct: 169 SLSEQELVDCDTKGE-------DQGCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCN 221

Query: 243 KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFLFTVS 297
              ++  AA ++ F  + ++ +      V   P+     S+ +    F F F  S
Sbjct: 222 TKQEANHAAKINGFEDVPANNEGALMKAVAKQPV-----SVAIDAGGFEFQFYSS 271


>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
          Length = 341

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 94/274 (34%), Positives = 145/274 (52%), Gaps = 37/274 (13%)

Query: 10  LLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEE 69
           LL +L++  + A A N  +A +        E+ ED ++           ++ + Y   +E
Sbjct: 14  LLFVLAAWASQATARNLHEASMY-------ERHEDWMV-----------QYGREYKDADE 55

Query: 70  HDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPA---- 124
              R+++FK N+ R +   + +D +    + +F+DLT  EFR        R R  A    
Sbjct: 56  KSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRAS------RNRFKAHICS 109

Query: 125 -DAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVS 183
            +A          +P+  DWR  GAVT +KDQG CGSCW+FSA  A+EG   LSTG+L+S
Sbjct: 110 TEATSFKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLIS 169

Query: 184 LSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCK 243
           LSEQ+LVDCD       SG  D GC+GGLM+ AF++I +  G+  E +YPY GTDG   +
Sbjct: 170 LSEQELVDCD------TSGE-DQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNR 222

Query: 244 FDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
              +  AA ++ +  + ++ ++     V H P+A
Sbjct: 223 KKAAHPAAKINGYEDVPANNEKALQKAVAHQPIA 256


>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
          Length = 320

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 95/253 (37%), Positives = 128/253 (50%), Gaps = 19/253 (7%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRA----KRRQLLDPTAVHGVTKF 101
           L  A   +  FK+++ + Y   +E  YR RVF+ N +      K+ +  + T    + +F
Sbjct: 12  LATASPSWDHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQF 71

Query: 102 SDLTPSEFRRQFLGLNRRLR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGS 160
            D+T  EF     G  +  R  P     A   P   +  D DWR    VT VKDQ  CGS
Sbjct: 72  GDMTNEEFNAVMKGYKKGSRGEPKAVFTAEAGP---MAADVDWRTKALVTPVKDQEQCGS 128

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FSATGALEG HFL   ELVSLSEQQLVDC  +         + GC GG M SAF+YI
Sbjct: 129 CWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYG-------NDGCGGGWMTSAFDYI 181

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
              GG++ E  YPY   D  SC+FD + I A  +    +   E+ +   +   GP++   
Sbjct: 182 KDNGGIDTESSYPYEAED-RSCRFDANSIGAICTGSVEVQHTEEALQEAVSGVGPIS--- 237

Query: 281 ASIELPHISFSFL 293
            +I+  H SF F 
Sbjct: 238 VAIDASHFSFQFY 250


>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
          Length = 343

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 85/227 (37%), Positives = 130/227 (57%), Gaps = 15/227 (6%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEF---RR 111
           +  +++K Y   +E + RF++FK N+   +            GV +F DLT  EF   R 
Sbjct: 42  WMGQYAKIYNDHQEWEKRFQIFKENVNYIETSNKEGGRFYKLGVNQFVDLTNEEFIAPRN 101

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
           +F G      +  +  K   + T  +P++ DWR  GAVT VKDQG CG CW+FSA  A E
Sbjct: 102 RFKGHMCSSIIRTNTYKYENVTT--VPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATE 159

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           G H LSTG+L+SLSEQ+LVDCD       +   D GC GGLM+ AF++I++  G++ E  
Sbjct: 160 GIHQLSTGKLISLSEQELVDCD-------TKGVDQGCEGGLMDDAFKFIIQNHGLDTEAK 212

Query: 232 YPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
           YPY G D G+C  +++ I AA ++++  + ++ +Q     V + P++
Sbjct: 213 YPYQGVD-GTCNANEASINAATITSYEDVPTNNEQALQKAVANQPIS 258


>gi|229366026|gb|ACQ57993.1| Cathepsin H precursor [Anoplopoma fimbria]
          Length = 247

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 94/221 (42%), Positives = 128/221 (57%), Gaps = 17/221 (7%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEF 109
           E HF  + ++ ++ Y+ QE H+ RF++F  N RR  +    + T   G+ +FSD+T SEF
Sbjct: 23  EFHFKSWMAQHNRVYSMQEYHE-RFQIFSENKRRIDKHNEGNHTFTMGLNQFSDMTFSEF 81

Query: 110 RRQFLGLNRRLRLPADAQKAPILPTND--LPTDFDWRDHG-AVTGVKDQGACGSCWSFSA 166
           R+ FL    +      A K     +ND   P   DWR  G  VT VK+QGACGSCW+FS 
Sbjct: 82  RKSFLWSEPQ---NCSATKGNYF-SNDGPHPDTIDWRKKGNYVTDVKNQGACGSCWTFST 137

Query: 167 TGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGV 226
           TG LE    +STG+LV LSEQQLVDC  + +       + GCNGGL + AFEYI+ + G+
Sbjct: 138 TGCLESVTAISTGKLVPLSEQQLVDCAQDFN-------NHGCNGGLPSQAFEYIMYSKGL 190

Query: 227 EREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMA 267
             EKDYPYT  +  +C + K K+AAA     V  +  D+M 
Sbjct: 191 MTEKDYPYTAFE-DTCAY-KQKLAAAFVREVVNITAYDEMG 229


>gi|30387350|ref|NP_848429.1| cathepsin [Choristoneura fumiferana MNPV]
 gi|1168799|sp|P41715.1|CATV_NPVCF RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|332509|gb|AAA96732.1| cathepsin [Choristoneura fumiferana MNPV]
 gi|30270084|gb|AAP29900.1| cathepsin [Choristoneura fumiferana MNPV]
          Length = 324

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 82/237 (34%), Positives = 134/237 (56%), Gaps = 20/237 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           +L A ++F  F  KF+K+Y+++ E   RF++F+ NL     +   D TA + + KF+DL+
Sbjct: 21  VLKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEIINKNHNDSTAQYEINKFADLS 80

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL+    LP   Q   +  +L  P +  P +FDWR    VT VK+QG CG+
Sbjct: 81  KDETISKYTGLS----LPLQTQNFCEVVVLDRPPDKGPLEFDWRRLNKVTSVKNQGMCGA 136

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+  G+LE    +   + ++LSEQQL+DCD           D+GC+GGL+++AFE +
Sbjct: 137 CWAFATLGSLESQFAIKHNQFINLSEQQLIDCDF---------VDAGCDGGLLHTAFEAV 187

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
           +  GG++ E DYPY   + G C+ + +K    V   +  I+  E+++   L   GP+
Sbjct: 188 MNMGGIQAESDYPYEANN-GDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPI 243


>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
          Length = 458

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 83/209 (39%), Positives = 118/209 (56%), Gaps = 16/209 (7%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTA 94
           + S GE+SE+    A   ++ +K++  K+Y    E + R+  F+ NLR            
Sbjct: 25  IVSYGERSEEE---ARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAG 81

Query: 95  VH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTND-LPTDFDWRDHGAV 149
           VH    G+ +F+DLT  E+R  +LGL  + R         +   N+ LP   DWR  GAV
Sbjct: 82  VHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAV 141

Query: 150 TGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCN 209
             +KDQ   GSCW+FSA  A+EG + + TG+L+SLSEQ+LVDCD         S + GCN
Sbjct: 142 AEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT--------SYNEGCN 193

Query: 210 GGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           GGLM+ AF++I+  GG++ E DYPY G D
Sbjct: 194 GGLMDYAFDFIINNGGIDTEDDYPYKGKD 222


>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
          Length = 465

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 84/236 (35%), Positives = 131/236 (55%), Gaps = 23/236 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
           + L+ ++  + Y    E D RFRVF  NLR   A   +  +     G+ +F+DLT  EFR
Sbjct: 52  YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 111

Query: 111 RQFLGLNRRLRLPADAQKAPILP--------TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
             +LG     R+PA  ++   +           +LP   DWR+ GAV  VK+QG CGSCW
Sbjct: 112 AAYLGA----RIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCW 167

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA  ++E  + + TGE+V+LSEQ+LV+C  +         +SGCNGGLM++AF++I+K
Sbjct: 168 AFSAVSSVESVNQIVTGEMVTLSEQELVECSTD-------GGNSGCNGGLMDAAFDFIIK 220

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
            GG++ E DYPY   D G C  ++      ++  F  +  ++++     V H P++
Sbjct: 221 NGGIDTEGDYPYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVS 275


>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
          Length = 330

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/205 (41%), Positives = 119/205 (58%), Gaps = 13/205 (6%)

Query: 74  FRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILP 133
           FR   ANLR  +     + +   G+T+F+DLT +EF        +R  +     +  +  
Sbjct: 48  FRCHLANLRVIEAHNAGNSSFTMGITQFADLTAAEFS----AYVKRFPMNVTRPRNEVWI 103

Query: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193
           T     + DWR   AVT +K+QG CGSCWSFS TG++EGAH ++TG+LVSLSEQQL+DC 
Sbjct: 104 TEAPLQEVDWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCS 163

Query: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAV 253
                      + GCNGGLM+ AFEY++  GG++ E+DYPYT  DG      + K AA +
Sbjct: 164 TRYG-------NHGCNGGLMDYAFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEI 216

Query: 254 SNF-SVISSDEDQMAANLVKHGPLA 277
             F +V    EDQ+AA  V  GP++
Sbjct: 217 HGFRNVPKEHEDQLAA-AVSIGPVS 240


>gi|15593252|gb|AAL02222.1|AF410882_1 cysteine protease CP14 precursor [Frankliniella occidentalis]
          Length = 333

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 96/261 (36%), Positives = 133/261 (50%), Gaps = 24/261 (9%)

Query: 35  VPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKAN-LRRAKRRQLLDPT 93
           +PSD        +  + H+  FK+  +KTYA   E  YR +VFK N +R AK        
Sbjct: 18  IPSD--------MEIQAHWESFKATHAKTYANAVEEAYRAKVFKENAIRIAKHNDRFASG 69

Query: 94  AVH---GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVT 150
            V    G  +++D+   E   +  G    L+  +         +       DWR  GAVT
Sbjct: 70  EVTFKVGYNQYADMHTHEVTEKLNGYRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAVT 129

Query: 151 GVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNG 210
            +KDQG CGSCWSFSATG+LEG  FL    LVSLSEQ LVDC  +   E       GCNG
Sbjct: 130 PIKDQGQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNE-------GCNG 182

Query: 211 GLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAAN 269
           GLM+SAFEY+   GG++ E+ YPYT  D G+C +  +  A   + +  V +  E  +   
Sbjct: 183 GLMDSAFEYVKSNGGIDTEESYPYTAED-GTCLYKAANNAGVNTGYKDVQAKSESALRDA 241

Query: 270 LVKHGPLAGNVASIELPHISF 290
           + K GP++    +I+  + SF
Sbjct: 242 VEKVGPVS---VAIDASNWSF 259


>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 87/252 (34%), Positives = 141/252 (55%), Gaps = 18/252 (7%)

Query: 41  QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
           +S+  L  +E H  L+ S+  + Y  + E   RF +FK N++  +   +  + +   G+ 
Sbjct: 28  RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKKNMKFIESVNKAGNLSYKLGMN 86

Query: 100 KFSDLTPSEFRRQFLGLNRRLRL----PADAQKAPI--LPTNDLPTDFDWRDHGAVTGVK 153
           +F+D+T  EF  +F GLN         P  + +  I  L  +D+P++ DWR+ GAVT VK
Sbjct: 87  EFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVK 146

Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
            QG CG CW+FSA G+LEGA+ ++TG+L+  SEQ+L+DC          + + GCNGG M
Sbjct: 147 HQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDCT---------TNNYGCNGGFM 197

Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
            +AF++I++ GG+ RE DY Y G +  +C+  +   A  +S++ V+   E  +   + K 
Sbjct: 198 TNAFDFIIENGGISRESDYEYLG-EQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ 256

Query: 274 GPLAGNVASIEL 285
               G  AS +L
Sbjct: 257 PVSIGIAASQDL 268


>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
          Length = 464

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 93/255 (36%), Positives = 138/255 (54%), Gaps = 20/255 (7%)

Query: 5   ILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGE-QSEDHLLNAEHHFSLFKSKFSKT 63
           +LS L +L ++     ++A++       +  P     ++ D +L     + +   K  K 
Sbjct: 1   MLSKLTILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLV---KHGKN 57

Query: 64  YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRR 119
           Y    E + RF +FK NL         + +   G+ +F+DLT  E+R +FLG     NRR
Sbjct: 58  YNALGEKEKRFEIFKDNLGFIDEHNSKNLSFRLGLNRFADLTNEEYRTRFLGTRINPNRR 117

Query: 120 LR-LPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLST 178
            R + +   +      + LP   DWR  GAV GVKDQG+CGSCW+FSA  A+EG + L+T
Sbjct: 118 NRKVNSQTNRYATRVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLAT 177

Query: 179 GELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTD 238
           G+L+SLSEQ+LVDCD         S + GCNGGLM+ AFE+I+    +  E+DYPY   D
Sbjct: 178 GDLISLSEQELVDCDT--------SYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAID 229

Query: 239 GGSCKFDKSKIAAAV 253
           G   + D+++  A V
Sbjct: 230 G---RCDQNRKNAKV 241


>gi|90592736|ref|YP_529689.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
 gi|71559186|gb|AAZ38185.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
          Length = 343

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 88/283 (31%), Positives = 157/283 (55%), Gaps = 20/283 (7%)

Query: 8   SLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQ 67
           +L++LL+ + L +      D+ ++     +  + S  ++ +A  +F  F S+++K Y  +
Sbjct: 4   TLIILLVVNALLNW----RDNELVDAAGTAANKPSLYNINSAPQYFEQFISQYNKQYKNE 59

Query: 68  EEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQ 127
            E  +RF +F  N+    ++   + +AV+ + +F+D+T +E   +  GL     L ++  
Sbjct: 60  AEKRHRFNIFMHNIEEINQKNSRNDSAVYKINRFADMTKNEVVIRHTGLASIGELNSNFC 119

Query: 128 KAPILP---TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSL 184
           +  ++        P+ FDWR +  VT VKDQ  CG+CW+F++ GALE  + +    L+ L
Sbjct: 120 ETVVVDGPGQRQRPSSFDWRTYNKVTSVKDQSMCGACWAFASLGALESQYAIKYDRLIDL 179

Query: 185 SEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKF 244
           +EQQLVDCD           D GC+GGL+++A+E I++ GGVE+E DYPY   +   C  
Sbjct: 180 AEQQLVDCDF---------VDMGCDGGLIHTAYEQIMQMGGVEQEFDYPYRA-ERQPCAL 229

Query: 245 DKSKIAAAVSN-FSVISSDEDQMAANLVKH-GPLAGNVASIEL 285
              K AA V   F  +  +E+++  +L++H GP+A  V +++L
Sbjct: 230 KPHKFAAGVRKCFRYVLRNEERL-EDLLRHVGPIAIAVDAVDL 271


>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
          Length = 473

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 88/242 (36%), Positives = 129/242 (53%), Gaps = 21/242 (8%)

Query: 62  KTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRRQFLG----- 115
           K Y    E + RF +FK NL    +    D      G+ KF+DLT  EFR  +LG     
Sbjct: 62  KNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFKVGLNKFADLTNEEFRSVYLGRKKSS 121

Query: 116 ----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
               L    +    + +      ++LP   DWR +GAV  VKDQG CGSCW+FS   A+E
Sbjct: 122 SSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKNGAVAKVKDQGQCGSCWAFSTIAAVE 181

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           G + + TGEL+SLSEQ+LVDCD         S +SGC+GGLM+ A+E+I+  GG++ + D
Sbjct: 182 GINQIVTGELLSLSEQELVDCDT--------SYNSGCDGGLMDYAYEFIINNGGIDTDAD 233

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFS 291
           YPYT  DG   ++ K+     + +F  +  ++++     V H P++    +IE    +F 
Sbjct: 234 YPYTAKDGKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQPVS---VAIEAGGSTFQ 290

Query: 292 FL 293
           F 
Sbjct: 291 FY 292


>gi|338712411|ref|XP_001491536.3| PREDICTED: cathepsin F [Equus caballus]
          Length = 459

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 93/226 (41%), Positives = 134/226 (59%), Gaps = 12/226 (5%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP-TAVHGVTKFSDLTPSEFRR 111
           F  F + +++TY T+EE  +R  +F +N+ RA++ Q LD  TA +GVTKFSDLT  EFR 
Sbjct: 162 FKHFVTTYNRTYETKEEAQWRMSIFASNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 221

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALE 171
            +L    +       ++A  +  +  P ++DWR  GAVT VKDQG CGSCW+FS TG +E
Sbjct: 222 IYLNPLLKEEPGVKMRRAKSV-GDSAPPEWDWRSKGAVTEVKDQGMCGSCWAFSVTGNVE 280

Query: 172 GAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKD 231
           G  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I   GG+E E D
Sbjct: 281 GQWFLNRGALLSLSEQELLDCD---------KVDKACMGGLPSNAYSAIKTLGGLETEDD 331

Query: 232 YPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           Y Y G    +C F   K    +++   ++ +E ++AA L K GP++
Sbjct: 332 YSYHG-HLQACSFSAEKAKVYINDSVELTKNEQKLAAWLAKKGPIS 376


>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
          Length = 522

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 84/236 (35%), Positives = 131/236 (55%), Gaps = 23/236 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
           + L+ ++  + Y    E D RFRVF  NLR   A   +  +     G+ +F+DLT  EFR
Sbjct: 109 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 168

Query: 111 RQFLGLNRRLRLPADAQKAPILP--------TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
             +LG     R+PA  ++   +           +LP   DWR+ GAV  VK+QG CGSCW
Sbjct: 169 AAYLGA----RIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCW 224

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA  ++E  + + TGE+V+LSEQ+LV+C  +         +SGCNGGLM++AF++I+K
Sbjct: 225 AFSAVSSVESVNQIVTGEMVTLSEQELVECSTD-------GGNSGCNGGLMDAAFDFIIK 277

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
            GG++ E DYPY   D G C  ++      ++  F  +  ++++     V H P++
Sbjct: 278 NGGIDTEGDYPYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVS 332


>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
 gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
 gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
          Length = 346

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 91/259 (35%), Positives = 138/259 (53%), Gaps = 23/259 (8%)

Query: 43  EDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDP--TAVHGVTK 100
           ++ L+  + H   + +K  + YA  +E + R+ VFK N+ R +    +    T    V +
Sbjct: 29  DNELIMQKRHIE-WMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQ 87

Query: 101 FSDLTPSEFRRQFLGLNRRLRLPADAQK--API----LPTNDLPTDFDWRDHGAVTGVKD 154
           F+DLT  EFR  + G      L + +Q   +P     + +  LP   DWR  GAVT +K+
Sbjct: 88  FADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKN 147

Query: 155 QGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMN 214
           QG+CG CW+FSA  A+EGA  +  G+L+SLSEQQLVDCD         + D GC GGLM+
Sbjct: 148 QGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD---------TNDFGCEGGLMD 198

Query: 215 SAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSK-IAAAVSNFSVISSDEDQMAANLVKH 273
           +AFE+I   GG+  E +YPY G D  +C   K+   A +++ +  +  +++Q     V H
Sbjct: 199 TAFEHIKATGGLTTESNYPYKGED-ATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAH 257

Query: 274 GPLAGNVASIELPHISFSF 292
            P++     IE     F F
Sbjct: 258 QPVS---VGIEGGGFDFQF 273


>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
 gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
          Length = 371

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 90/244 (36%), Positives = 133/244 (54%), Gaps = 16/244 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  + +K+ K YA+ EE  +RF VFK NL           T   G+  F+DLT  EF+  
Sbjct: 66  FEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVTTYWLGLNAFADLTHDEFKAT 125

Query: 113 FLGLNR-RLRLPADAQ-KAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
           +LGL +   +   D++ +   +  +D+P   DWR  GAVT VK+QG CGSCW+FS   A+
Sbjct: 126 YLGLRQPETKKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQGQCGSCWAFSTVAAV 185

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG + + TG L SLSEQ+LVDC        S   ++GCNGG+M++AF YI  +GG+  E+
Sbjct: 186 EGINQIVTGNLTSLSEQELVDC--------STDGNNGCNGGVMDNAFSYIASSGGLRTEE 237

Query: 231 DYPYTGTDGGSC--KFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
            YPY   + G C  K    +    +S +  + ++++Q     + H PL+    +IE    
Sbjct: 238 AYPYL-MEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPLS---VAIEASGR 293

Query: 289 SFSF 292
            F F
Sbjct: 294 HFQF 297


>gi|27806673|ref|NP_776457.1| cathepsin L2 precursor [Bos taurus]
 gi|1542853|emb|CAA62870.1| cathepsin L [Bos taurus]
          Length = 334

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 91/249 (36%), Positives = 125/249 (50%), Gaps = 17/249 (6%)

Query: 48  NAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHG----VTKFSD 103
           N + H+  +K+   + Y   EE ++R  V++ N +             H     +  F D
Sbjct: 24  NLDAHWHQWKATHRRLYGMNEE-EWRRAVWEKNKKIIDLHNQEYSEGKHAFRMAMNAFGD 82

Query: 104 LTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWS 163
           +T  EFR+   G   +          P+L   D+P   DW   G VT VK+QG CGSCW+
Sbjct: 83  MTNEEFRQVMNGFQNQKHKKGKLFHEPLLV--DVPKSVDWTKKGYVTPVKNQGQCGSCWA 140

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FSATGALEG  F  TG+LVSLSEQ LVDC            + GCNGGLM++AF+YI   
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSR-------AQGNQGCNGGLMDNAFQYIKDN 193

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASI 283
           GG++ E+ YPY  TD  SC +     AA  + F  I   E  +   +   GP++    +I
Sbjct: 194 GGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQREKALMKAVATVGPIS---VAI 250

Query: 284 ELPHISFSF 292
           +  H SF F
Sbjct: 251 DAGHTSFQF 259


>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
          Length = 509

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 85/234 (36%), Positives = 131/234 (55%), Gaps = 23/234 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRA-KRRQLLDPTAVH--GVTKFSDLTPSEF 109
           F  +  K  K Y   +E + +F+ F+ NLR   ++      +  H  G+ KF+D++  EF
Sbjct: 51  FKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFADMSNEEF 110

Query: 110 RRQFLG-----LNRRLRLPADAQKAPILPTN----DLPTDFDWRDHGAVTGVKDQGACGS 160
           R  ++       ++R+ +    Q            D PT  DWR +G VTGVKDQG CGS
Sbjct: 111 REVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQGDCGS 170

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+FS+TGA+EG + L+ G+L+SLSEQ+LVDCD         S + GC GG M+ AFE++
Sbjct: 171 CWAFSSTGAIEGINALANGDLISLSEQELVDCD---------STNDGCEGGYMDYAFEWV 221

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKH 273
           +  GG++ E DYPYTG D G+C   K +  A ++  +  ++ +E  +   ++K 
Sbjct: 222 MSNGGIDTETDYPYTGED-GTCNTTKEETKAVSIDGYEDVAEEESALFCAVLKQ 274


>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 334

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 94/252 (37%), Positives = 136/252 (53%), Gaps = 19/252 (7%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANL----RRAKRRQLLDPTAVHGVTKF 101
            ++ +  ++ +K++  K Y + EE   R  +++ NL    +   +  L   T   G+ +F
Sbjct: 21  FIDFDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGMNQF 80

Query: 102 SDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACG 159
           +DL   EF     G  R     A      + P+N  D+PT  DWR  G VT VK+Q  CG
Sbjct: 81  ADLKNEEFVSLMNGF-RGNSSKATRGSTFLPPSNVFDMPTMVDWRTKGYVTPVKNQLQCG 139

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSATG+LEG HF  TG+LVSLSEQ LVDC  +         + GC GGLM+ AF+Y
Sbjct: 140 SCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGK-------EGNMGCEGGLMDQAFQY 192

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFS-VISSDEDQMAANLVKHGPLAG 278
           IL  GG++ E  YPYT  D G C F+K+ I A  + ++ V +  E  +   +   GP++ 
Sbjct: 193 ILDVGGIDTEMSYPYTAMD-GQCHFNKANIGATDTGYTDVTTGSESALQMAVASVGPIS- 250

Query: 279 NVASIELPHISF 290
              +I+  H SF
Sbjct: 251 --VAIDASHQSF 260


>gi|261328616|emb|CBH11594.1| cysteine peptidase precursor, (fragment) [Trypanosoma brucei
           gambiense DAL972]
          Length = 220

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 93/244 (38%), Positives = 124/244 (50%), Gaps = 41/244 (16%)

Query: 1   MERLILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKF 60
           M R +   ++LL +++ LAS              V       E+ L   E  F+ FK K+
Sbjct: 6   MVRFVRLPVVLLAMAACLAS--------------VALGSLHVEESL---EMRFAAFKKKY 48

Query: 61  SKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ-------F 113
            K Y   +E  +RFR F+ N+ +AK +   +P A  GVT FSD+T  EFR +       F
Sbjct: 49  GKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYATFGVTPFSDMTREEFRARYRNGASYF 108

Query: 114 LGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
               +RLR      K   + T   P   DWR+ GAVT VKDQG CGSCW+FS  G +EG 
Sbjct: 109 AAAQKRLR------KTVNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQ 162

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA--GGVEREKD 231
             ++   LVSLSEQ LV CD         + D GC GGLM++AF +I+ +  G V  E  
Sbjct: 163 WQVAGNPLVSLSEQMLVSCD---------TIDFGCGGGLMDNAFNWIVNSNGGNVFTEAS 213

Query: 232 YPYT 235
           YPY 
Sbjct: 214 YPYV 217


>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
          Length = 336

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 92/248 (37%), Positives = 131/248 (52%), Gaps = 20/248 (8%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPS 107
           H+ L+K+  SK Y  +EE  +R  +++ NL + +   L      H    G+  F D+T  
Sbjct: 27  HWELWKNWHSKKYHEKEE-GWRRMIWEKNLNKIELHNLEHSMGKHSYRLGMNHFGDMTHE 85

Query: 108 EFRRQFLGLNRRLRLPADAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           EFR+   G  R+    A    +  +  N +  P+  DWR+ G VT VKDQG CGSCW+FS
Sbjct: 86  EFRQIMNGYQRKTERKAIG--SLFMEPNFMVAPSAVDWREKGYVTPVKDQGQCGSCWAFS 143

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
            TGALZG +F   G+LVSLSEQ LVDC     PE     + GC GGLM+ AF+Y+    G
Sbjct: 144 TTGALZGQNFRKMGKLVSLSEQNLVDCSR---PE----GNEGCGGGLMDQAFQYVKDNQG 196

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E  YPY GTD   C +D    +   + F  + S  E  +   +   GP++    +I+
Sbjct: 197 LDSEDSYPYLGTDDQPCHYDPKYNSVNDTGFVDIPSGKEHALMKAVASVGPVS---VAID 253

Query: 285 LPHISFSF 292
             H SF F
Sbjct: 254 AGHESFQF 261


>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
 gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
          Length = 343

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 85/233 (36%), Positives = 128/233 (54%), Gaps = 26/233 (11%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH--GVTKFSDLTPSEF---R 110
           + S++ K Y   +E + RF++F  N+   +     D   ++  GV +F+DLT  EF   R
Sbjct: 41  WMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQFADLTNDEFTSSR 100

Query: 111 RQFLG-----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
            +F G     + R      +   A       +P+  DWR  GAVT VK+QG CG CW+FS
Sbjct: 101 NKFKGHMCSSITRTSTFKYENASA-------IPSSVDWRKKGAVTPVKNQGQCGCCWAFS 153

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           A  A EG H LSTG+L+SLSEQ+LVDCD       +   D GC GGLM+ AF++I++  G
Sbjct: 154 AVAATEGIHKLSTGKLISLSEQELVDCD-------TKGVDQGCEGGLMDDAFKFIIQNHG 206

Query: 226 VEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
           +  E +YPY G D G+C  +K  I A  ++ +  + ++ +Q     V + P++
Sbjct: 207 LNTEANYPYQGVD-GTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPIS 258


>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 443

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 18/234 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + Y T  E   R   F+ NL   +  Q  +P A  G+TKF DL+ +EF  +
Sbjct: 38  FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L          R  A   +      + +P   DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98  YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
            +EG  +L+  ELVSLSEQQLV CD   D         GC+GGLM  AF+++L+   G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E  YPY   +G   +   S    + A +    +I S E  MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262


>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
          Length = 338

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 92/250 (36%), Positives = 129/250 (51%), Gaps = 19/250 (7%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLT 105
           + H+ L+K+   K+Y   EE  +R  V++ NL+  +   L     +H    G+ +F DLT
Sbjct: 26  DRHWKLWKNWHQKSYHEAEE-GWRRTVWEENLKAIQLHNLEQSLGLHTYRLGMNQFGDLT 84

Query: 106 PSEFRRQFLGLNRRLRLPADAQKAPILPTN--DLPTDFDWRDHGAVTGVKDQGACGSCWS 163
             EF+    G  R          +  L  N   +PT  DWRDHG VT VK+QG CGSCW+
Sbjct: 85  NEEFQEILTG-ERHFSKGNRINGSAFLEANFVQVPTSVDWRDHGYVTPVKNQGHCGSCWA 143

Query: 164 FSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKA 223
           FS TGALEG  F  +G L+SLSEQ LVDC  +         + GC+GG+++ AF+YIL+ 
Sbjct: 144 FSTTGALEGQLFRKSGRLISLSEQNLVDCSWQ-------QGNQGCHGGIVDLAFQYILQN 196

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
            G++ E  YPYT  D   C F      A V+ F  I    E+ +   +   GP++     
Sbjct: 197 QGIDSEDCYPYTAKDTAQCTFKPECATAPVTGFVDIPPHSEEALMKAVATVGPVS---VG 253

Query: 283 IELPHISFSF 292
           I+    SF F
Sbjct: 254 IDASSTSFRF 263


>gi|443694581|gb|ELT95681.1| hypothetical protein CAPTEDRAFT_173171 [Capitella teleta]
          Length = 342

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 95/247 (38%), Positives = 131/247 (53%), Gaps = 22/247 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
           ++ +K  + K+Y  +E+   R  +++ NLR      +      H    G+ + SDLTPSE
Sbjct: 40  WTEYKETYGKSYDMKED-VVRRSLWEGNLRHISMHNVKHDLGKHSFSMGINELSDLTPSE 98

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +R Q LGL  R  L     K  +     +P   DWRD G VT VK+QGACGSCW+FS+TG
Sbjct: 99  YR-QRLGL--RPALGERTGKKFVYNGEKVPEHVDWRDKGYVTPVKNQGACGSCWAFSSTG 155

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           +LEG HF  TG+LVSLSEQ LVDC  +         ++GCNGG M++AF Y+    G++ 
Sbjct: 156 SLEGQHFRLTGQLVSLSEQNLVDCTKKYG-------NAGCNGGWMDNAFNYVKANNGIDT 208

Query: 229 EKDYPYTGTDGGSCKFDKS---KIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIEL 285
           E  YPY G D   C +D S   K A    +  V   DE  +   +   GP++     I+ 
Sbjct: 209 EAFYPYEGHD-DWCGYDGSPGHKGANCTGHVDVQQGDELALKQAVATVGPVS---VGIDA 264

Query: 286 PHISFSF 292
            H SF  
Sbjct: 265 THRSFQL 271


>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 533

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 18/234 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + Y T  E   R   F+ NL   +  Q  +P A  G+TKF DL+ +EF  +
Sbjct: 128 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 187

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L          R  A   +      + +P   DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 188 YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 247

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
            +EG  +L+  ELVSLSEQQLV CD   D         GC+GGLM  AF+++L+   G +
Sbjct: 248 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 298

Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E  YPY   +G   +   S    + A +    +I S E  MAA L K+GP+A
Sbjct: 299 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 352


>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
 gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
          Length = 443

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 92/234 (39%), Positives = 123/234 (52%), Gaps = 18/234 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + Y T  E   R   F+ NL   +  Q  +P A  G+TKF DL+ +EF  +
Sbjct: 38  FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L          R  A   +      + +P   DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98  YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
            +EG  +L+  ELVSLSEQQLV CD   D         GC+GGLM  AF+++L+   G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E  YPY   +G   +   S    + A +    +I S E  MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262


>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
 gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
 gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
          Length = 333

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 92/254 (36%), Positives = 134/254 (52%), Gaps = 20/254 (7%)

Query: 44  DHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT---- 99
           DH L A+  ++ +K+  ++ Y   EE  +R  V++ N++  ++         H  T    
Sbjct: 22  DHSLEAQ--WTKWKAMHNRLYGMNEE-GWRRAVWEKNMKMIEQHNQEYREGKHSFTMAMN 78

Query: 100 KFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACG 159
            F D+T  EFR+   G   R        + P+    + P   DWR+ G VT VK+QG CG
Sbjct: 79  AFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCG 136

Query: 160 SCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEY 219
           SCW+FSATGALEG  F  TG+LVSLSEQ LVDC     P+     + GCNGGLM+ AF+Y
Sbjct: 137 SCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS---GPQ----GNEGCNGGLMDYAFQY 189

Query: 220 ILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGN 279
           +   GG++ E+ YPY  T+  SCK++     A  + F  I   E  +   +   GP++  
Sbjct: 190 VQDNGGLDSEESYPYEATE-ESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPIS-- 246

Query: 280 VASIELPHISFSFL 293
             +++  H SF F 
Sbjct: 247 -VAVDAGHQSFQFY 259


>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
          Length = 332

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 92/245 (37%), Positives = 125/245 (51%), Gaps = 17/245 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSE 108
           +  +K   SK Y T+EE D R ++++ NL++  +        +H    G+ K++DL   E
Sbjct: 28  WEAWKQTHSKQY-TKEEEDNRRKIWEDNLQKVSKHNTEHSLGLHSYTLGMNKYADLRGEE 86

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           F +   GL           K         P   DWRD G VT VKDQG CGSCW+FS TG
Sbjct: 87  FVQMMNGLKFDASRERQGIKFLSYAKFQAPDSVDWRDEGYVTPVKDQGQCGSCWAFSTTG 146

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           +LEG HF STG L SLSEQ LVDC            ++GC GGLM+ AF+YI    G++ 
Sbjct: 147 SLEGQHFRSTGVLTSLSEQNLVDC-------SISYGNNGCEGGLMDYAFQYIKDNLGIDT 199

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPH 287
           E  YPY   D  +C+F    + A  S +  V S DED +      +GP++    +I+  H
Sbjct: 200 EDKYPYEAED-DTCRFSPDNVGATDSGYVDVDSGDEDALKEACAANGPIS---VAIDASH 255

Query: 288 ISFSF 292
            SF  
Sbjct: 256 ESFQL 260


>gi|381283083|gb|AFG19440.1| cathepsin L, partial [Larimichthys crocea]
          Length = 257

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 92/225 (40%), Positives = 122/225 (54%), Gaps = 19/225 (8%)

Query: 76  VFKANLRRAKRRQLLDPTAVH----GVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPI 131
           V++ NLR+ +   L      H    G+  F D+T  EFR+   G  R+       + +  
Sbjct: 2   VWEMNLRKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYKRKAE--GKFKGSLF 59

Query: 132 LPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQL 189
           +  N L  P   DWRD+G VT VKDQG CGSCW+FS TGALEG HF  TG+LVSLSEQ L
Sbjct: 60  MEPNFLEAPRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNL 119

Query: 190 VDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKI 249
           VDC     PE     + GCNGGLM+ AF+Y+    G++ E  YPY GTD   C +D +  
Sbjct: 120 VDCSR---PE----GNEGCNGGLMDQAFQYVKDNHGLDSEDSYPYLGTDDQPCHYDPNYN 172

Query: 250 AAAVSNF-SVISSDEDQMAANLVKHGPLAGNVASIELPHISFSFL 293
           +A  + F  V S  E  +   +   GP++    +I+  H SF F 
Sbjct: 173 SANDTGFVDVPSGKEHALMKAVAAVGPVS---VAIDAGHESFQFY 214


>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
 gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 85/228 (37%), Positives = 128/228 (56%), Gaps = 15/228 (6%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
           + ++  + Y   +E + R+ +FK N+ R +      D     GV KF+DLT  EFR  + 
Sbjct: 43  WMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYH 102

Query: 115 GLNRRL-RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
           G  R+  +L + + +   L  +D+PT  DWR+ GAVT VKDQG CG CW+FS   A+EG 
Sbjct: 103 GYKRQSSKLMSSSFRYENL--SDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGI 160

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
             L TG L+SLSEQQLVDC          + + GC GGLM++AF+YI++ GG+  E +YP
Sbjct: 161 IKLQTGNLISLSEQQLVDCT---------AGNKGCQGGLMDTAFQYIIRNGGLTSEDNYP 211

Query: 234 YTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNV 280
           Y G D G+C  +K +   A ++ +  +  + +      V   P++  V
Sbjct: 212 YQGVD-GTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVGV 258


>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
          Length = 367

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/231 (37%), Positives = 129/231 (55%), Gaps = 28/231 (12%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVH-GVTKFSDLTPSEFRR 111
           F  +  +  K Y + EE   R ++F+ NL+          ++   G+ KF+DLT  EF+ 
Sbjct: 43  FDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNSSFRLGLNKFADLTNEEFKT 102

Query: 112 QFLGLN-------RRLRLPADAQKAPILPTN--------DLPTDFDWRDHGAVTGVKDQG 156
           ++ G N       RR  L   A+  P+L            + +  DWR  GAVTGVKDQ 
Sbjct: 103 RYFGKNSKQWRDRRRTELEG-AELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQA 161

Query: 157 ACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSA 216
            CGSCW+FS TGA+EG +F+STG+LVSLSEQ+LV CD         + + GC GG M+ A
Sbjct: 162 QCGSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACD---------ATNYGCEGGDMDYA 212

Query: 217 FEYILKAGGVEREKDYPYTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQM 266
           F ++++ GG++ EKDY YTG D  +C  +K +K   ++  ++ +S D+  +
Sbjct: 213 FTWVIQNGGIDTEKDYSYTGVD-STCNTNKEAKKIVSIDGYTDVSPDDSAL 262


>gi|9627870|ref|NP_054157.1| viral cathepsin-like protein [Autographa californica
           nucleopolyhedrovirus]
 gi|114680178|ref|YP_758591.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
 gi|115751|sp|P25783.1|CATV_NPVAC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|332491|gb|AAA46752.1| viral cathepsin [Autographa californica nucleopolyhedrovirus]
 gi|559196|gb|AAA66757.1| viral cathepsin-like protein [Autographa californica
           nucleopolyhedrovirus]
 gi|113015253|gb|ABE68510.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
          Length = 323

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/237 (36%), Positives = 133/237 (56%), Gaps = 21/237 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           LL A ++F  F  +F+K Y ++ E   RF++F+ NL     +   D +A + + KFSDL+
Sbjct: 21  LLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIINKNQND-SAKYEINKFSDLS 79

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL+    LP   Q   K  +L  P    P +FDWR    VT VK+QG CG+
Sbjct: 80  KDETIAKYTGLS----LPIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGA 135

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+   +LE    +   +L++LSEQQ++DCD           D+GCNGGL+++AFE I
Sbjct: 136 CWAFATLASLESQFAIKHNQLINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAI 186

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
           +K GGV+ E DYPY   D  +C+ + +K    V + +  I+  E+++   L   GP+
Sbjct: 187 IKMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPI 242


>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
          Length = 330

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 91/248 (36%), Positives = 135/248 (54%), Gaps = 23/248 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAK----RRQLLDPTAVHGVTKFSDLTPSE 108
           + +FK+   KTY  Q E  +R ++F  N ++ +    + +  + +    +  F DL   E
Sbjct: 27  WHVFKAMHGKTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMVHE 86

Query: 109 FRRQFLGLNRRLRLPADAQKAPIL--PTN-DLPTDFDWRDHGAVTGVKDQGACGSCWSFS 165
           F+     L    ++  D ++   L  P+N +LP   DWR  GAVT VKDQG CGSCWSFS
Sbjct: 87  FK----ALMNGFKMSPDTKRNGELYFPSNSNLPKTVDWRQKGAVTPVKDQGQCGSCWSFS 142

Query: 166 ATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGG 225
           ATG+LEG  FL TG+LVSLSEQ LVDC        +   ++GC GGLM+ AF+Y+    G
Sbjct: 143 ATGSLEGQVFLKTGKLVSLSEQNLVDC-------STSYGNNGCEGGLMDQAFQYVSDNKG 195

Query: 226 VEREKDYPYTGTDGGSCKFDKSKIAAA-VSNFSVISSDEDQMAANLVKHGPLAGNVASIE 284
           ++ E  YPY   +  +C+F K+K+      +  + + DE  +   L   GP++    +I+
Sbjct: 196 IDTEASYPYEARE-NTCRFKKNKVGGTDKGHVDIPAGDEKALQNALATVGPIS---VAID 251

Query: 285 LPHISFSF 292
             H SF F
Sbjct: 252 ANHGSFQF 259


>gi|357619725|gb|EHJ72184.1| hypothetical protein KGM_03271 [Danaus plexippus]
          Length = 338

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 91/266 (34%), Positives = 135/266 (50%), Gaps = 20/266 (7%)

Query: 16  SVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFR 75
           S++   V  + D   +R++    G++    L  A   F  F   ++K Y   E+ + RF+
Sbjct: 8   SMVHVLVLFSIDQCKVREL----GQRRLYSLEEAPTLFEQFIKDYNKEYDESEKEE-RFK 62

Query: 76  VFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILPTN 135
           +F  NL+           AV+G+ KFSDL+  EF + + GL R      +  K   LP +
Sbjct: 63  IFVNNLKDINAMNERSSNAVYGINKFSDLSKEEFIKYYTGLKREESPSNEDHKKTDLPES 122

Query: 136 ---DLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDC 192
                P  FDWR  G V+ +K+Q  CGSCW+FSA   +E  H + TG+L+ +SEQQL+DC
Sbjct: 123 FNVTAPDQFDWRKKGVVSSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDC 182

Query: 193 DHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAA 252
           D           DSGC+GGL   A  Y + A G    K YPY   + G C++D SK+   
Sbjct: 183 D---------KYDSGCSGGLPWDALRYFV-ANGAMSLKSYPYVAKE-GKCRYDSSKVEIR 231

Query: 253 VSNFSVISS-DEDQMAANLVKHGPLA 277
           +  + + S   EDQ+  +L   GPL+
Sbjct: 232 LKGYKIFSKISEDQIKEHLYNIGPLS 257


>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
          Length = 344

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 89/256 (34%), Positives = 138/256 (53%), Gaps = 18/256 (7%)

Query: 41  QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
           +S+  L  +E H  L+ S+  + Y  + E   RF +FK N++  +   +  + +   G+ 
Sbjct: 28  RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86

Query: 100 KFSDLTPSEFRRQFLGLN-RRLRLPADAQKAPILPTNDL-----PTDFDWRDHGAVTGVK 153
           +F+D+T  EF  +F GLN     L      +    TNDL     P++ DWR+ GAVT VK
Sbjct: 87  EFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKTNDLSDDDMPSNLDWRESGAVTQVK 146

Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
            QG CG CW+FSA G+LEGA+ ++TG L+  SEQ+L+DC          + + GCNGG M
Sbjct: 147 HQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT---------TNNYGCNGGFM 197

Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
            +AF++I++ GG+ RE DY Y G    +C+  +   A  +S++ V+   E  +   + K 
Sbjct: 198 TNAFDFIIENGGISRESDYEYLGQQ-YTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ 256

Query: 274 GPLAGNVASIELPHIS 289
               G  AS +L   S
Sbjct: 257 PVSIGIAASQDLQFYS 272


>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 87/252 (34%), Positives = 140/252 (55%), Gaps = 18/252 (7%)

Query: 41  QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
           +S+  L  +E H  L+ S+  + Y  + E   RF +FK N++  +   +  + +   G+ 
Sbjct: 28  RSQPKLSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86

Query: 100 KFSDLTPSEFRRQFLGLNRRLRL----PADAQKAPI--LPTNDLPTDFDWRDHGAVTGVK 153
           +F+D+T  EF  +F GLN         P  + +  I  L  +D+P++ DWR+ GAVT VK
Sbjct: 87  EFADITSQEFLAKFTGLNIPNSYLSPSPVSSTEFKINDLSDDDMPSNLDWRESGAVTQVK 146

Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
            QG CG CW+FSA G+LEGA+ ++TG L+  SEQ+L+DC          + + GCNGG M
Sbjct: 147 HQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT---------TNNYGCNGGFM 197

Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
            +AF++I++ GG+ RE DY Y G +  +C+  +   A  +S++ V+   E  +   + K 
Sbjct: 198 TNAFDFIIENGGISRESDYEYLG-EQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ 256

Query: 274 GPLAGNVASIEL 285
               G  AS +L
Sbjct: 257 PVSIGIAASQDL 268


>gi|403368476|gb|EJY84073.1| Cathepsin L [Oxytricha trifallax]
          Length = 338

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 92/244 (37%), Positives = 133/244 (54%), Gaps = 18/244 (7%)

Query: 50  EHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLL-DPTAVHGVTKFSDLTPSE 108
           +H F+ F +K+ K+Y T+EE+D+R ++FK NL +     +  D T   G+ KF+D T +E
Sbjct: 40  DHAFTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNVRNDVTYRLGLNKFADYTEAE 99

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           ++R  LG   +        K    P ND     +W + GAVT VKDQG CGSCWSFSATG
Sbjct: 100 YKR-LLGFGGQKNKNPRNIKVLGAPKND---GVNWVEQGAVTPVKDQGQCGSCWSFSATG 155

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           A+EG   +  G L SLSEQQLVDC            + GC GG M+ AF+Y+ +   +E 
Sbjct: 156 AMEGHAKIQFGTLYSLSEQQLVDCSQ-------AEGNEGCGGGWMDQAFQYVEQT-ALET 207

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
           E  YPY   D  +C+   + +    S   V  ++ +++ A L K GP++    +IE   +
Sbjct: 208 EDQYPYEAVD-DTCRASSAGVVKVDSFVDVTPNNVNELKAALDK-GPVS---VAIEADQM 262

Query: 289 SFSF 292
            F F
Sbjct: 263 VFQF 266


>gi|354466410|ref|XP_003495667.1| PREDICTED: pro-cathepsin H-like [Cricetulus griseus]
          Length = 333

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 89/229 (38%), Positives = 130/229 (56%), Gaps = 15/229 (6%)

Query: 52  HFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRR 111
           HF  + ++  KTY++ E ++YR + F  N R+       + T   G+ +FSD+T +E +R
Sbjct: 32  HFKSWMTQHQKTYSSVE-YNYRLKTFANNWRKIHAHNQRNHTFKMGLNQFSDMTFAEIKR 90

Query: 112 QFLGLNRRLRLPADAQKAPIL-PTNDLPTDFDWRDHGA-VTGVKDQGACGSCWSFSATGA 169
           ++L    +      A K   L  T  LP   DWR  G  V+ VK+QG+CGSCW+FS TGA
Sbjct: 91  KYLWSEPQ---NCSATKGNYLRGTGPLPPSMDWRKKGNFVSAVKNQGSCGSCWTFSTTGA 147

Query: 170 LEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVERE 229
           LE A  +++G+++SL+EQQLVDC    +       + GC GGL + AFEYIL   G+  E
Sbjct: 148 LESAVAIASGKMLSLAEQQLVDCAQNFN-------NHGCEGGLPSQAFEYILYNKGIMGE 200

Query: 230 KDYPYTGTDGGSCKFDKSKIAAAVSNFSVIS-SDEDQMAANLVKHGPLA 277
             YPY G D G CKFD  K  A V + + I+ +DE  M   +  + P++
Sbjct: 201 DTYPYRGKD-GHCKFDPQKAIAFVKDVANITLNDEKAMVEAVALYNPVS 248


>gi|397133545|gb|AFO10079.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus S2]
          Length = 323

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 86/237 (36%), Positives = 133/237 (56%), Gaps = 21/237 (8%)

Query: 46  LLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLT 105
           LL A ++F  F  +F+K Y ++ E   RF++F+ NL     +   D +A + + KFSDL+
Sbjct: 21  LLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIINKDQND-SAKYEINKFSDLS 79

Query: 106 PSEFRRQFLGLNRRLRLPADAQ---KAPIL--PTNDLPTDFDWRDHGAVTGVKDQGACGS 160
             E   ++ GL+    LP   Q   K  +L  P    P +FDWR    VT VK+QG CG+
Sbjct: 80  KDETIAKYTGLS----LPIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGA 135

Query: 161 CWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYI 220
           CW+F+   +LE    +   +L++LSEQQ++DCD           D+GCNGGL+++AFE I
Sbjct: 136 CWAFATLASLESQFAIKHNQLINLSEQQMIDCDF---------VDAGCNGGLLHTAFEAI 186

Query: 221 LKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSN-FSVISSDEDQMAANLVKHGPL 276
           +K GGV+ E DYPY   D  +C+ + +K    V + +  I+  E+++   L   GP+
Sbjct: 187 IKMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPI 242


>gi|113819972|gb|AAH04054.2| Ctsf protein [Mus musculus]
          Length = 332

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 94/227 (41%), Positives = 136/227 (59%), Gaps = 14/227 (6%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLD-PTAVHGVTKFSDLTPSEFRR 111
           F  F + +++TY ++EE  +R  VF  N+ RA++ Q LD  TA +G+TKFSDLT  EF  
Sbjct: 35  FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 94

Query: 112 QFLGLNRRLRLPADAQKAPILPTNDL-PTDFDWRDHGAVTGVKDQGACGSCWSFSATGAL 170
            +L  N  L+  +  + +P    NDL P ++DWR  GAVT VK+QG CGSCW+FS TG +
Sbjct: 95  IYL--NPLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNV 152

Query: 171 EGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230
           EG  FL+ G L+SLSEQ+L+DCD           D  C GGL ++A+  I   GG+E E 
Sbjct: 153 EGQWFLNRGTLLSLSEQELLDCD---------KVDKACLGGLPSNAYAAIKNLGGLETED 203

Query: 231 DYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           DY Y G    +C F        +++   +S +E+++AA L + GP++
Sbjct: 204 DYGYQG-HVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPIS 249


>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
 gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 84/225 (37%), Positives = 127/225 (56%), Gaps = 15/225 (6%)

Query: 56  FKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFL 114
           + ++  + Y   +E + R+ +FK N+ R +      D     GV KF+DLT  EFR  + 
Sbjct: 8   WMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYH 67

Query: 115 GLNRRL-RLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGA 173
           G  R+  +L + + +   L  +D+PT  DWR+ GAVT VKDQG CG CW+FS   A+EG 
Sbjct: 68  GYKRQSSKLMSSSFRYENL--SDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGI 125

Query: 174 HFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYP 233
             L TG L+SLSEQQLVDC          + + GC GGLM++AF+YI++ GG+  E +YP
Sbjct: 126 IKLQTGNLISLSEQQLVDCT---------AGNKGCQGGLMDTAFQYIIRNGGLTSEDNYP 176

Query: 234 YTGTDGGSCKFDK-SKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           Y G D G+C  +K +   A ++ +  +  + +      V   P++
Sbjct: 177 YQGVD-GTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVS 220


>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 348

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 99/294 (33%), Positives = 149/294 (50%), Gaps = 25/294 (8%)

Query: 4   LILSSLLLLLLSSVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKT 63
           + LS LL L +        +   D +++    P D   S D L+     F  + S   K 
Sbjct: 1   MALSKLLPLAMCMSFFVVTSFGKDFSIV-GYWPED-LTSMDRLIEL---FEEWISNHGKI 55

Query: 64  YATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGL----NRR 119
           Y T EE  +RF VFK NL+          +   GV +F+DLT  EF+  +LGL    +R 
Sbjct: 56  YETIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVESSRT 115

Query: 120 LRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTG 179
            + P +     ++   DLP   DWR  GAVT VK+QG+CGSCW+FS   A+EG + +  G
Sbjct: 116 RQSPEEFTYKDVV---DLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGG 172

Query: 180 ELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTGTDG 239
            L SLSEQ+L+DCD           ++GC+GGLM+ AF +I+ +GG+ +E+DYPY   + 
Sbjct: 173 NLTSLSEQELIDCDR--------PYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYLEVE- 223

Query: 240 GSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHISFSF 292
            +C   K ++    +S +  +  + +      + H PL+    +IE     F F
Sbjct: 224 STCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLS---VAIEASGRDFQF 274


>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
 gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
          Length = 462

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 84/236 (35%), Positives = 131/236 (55%), Gaps = 23/236 (9%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLR--RAKRRQLLDPTAVHGVTKFSDLTPSEFR 110
           + L+ ++  + Y    E D RFRVF  NLR   A   +  +     G+ +F+DLT  EFR
Sbjct: 49  YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 108

Query: 111 RQFLGLNRRLRLPADAQKAPILP--------TNDLPTDFDWRDHGAVTGVKDQGACGSCW 162
             +LG     R+PA  ++   +           +LP   DWR+ GAV  VK+QG CGSCW
Sbjct: 109 AAYLGA----RIPAARRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCW 164

Query: 163 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK 222
           +FSA  ++E  + + TGE+V+LSEQ+LV+C  +         +SGCNGGLM++AF++I+K
Sbjct: 165 AFSAVSSVESVNQIVTGEMVTLSEQELVECSTD-------GGNSGCNGGLMDAAFDFIIK 217

Query: 223 AGGVEREKDYPYTGTDGGSCKFDKSKI-AAAVSNFSVISSDEDQMAANLVKHGPLA 277
            GG++ E DYPY   D G C  ++      ++  F  +  ++++     V H P++
Sbjct: 218 NGGIDTEGDYPYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVS 272


>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
           cysteine proteinase A-2; Flags: Precursor
 gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
          Length = 444

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 92/235 (39%), Positives = 123/235 (52%), Gaps = 19/235 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + Y T  E   R   F+ NL   +  Q  +P A  G+TKF DL+ +EF  +
Sbjct: 38  FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L          R  A   +      + +P   DWR+ GAVT VKDQGACGSCW+FSA G
Sbjct: 98  YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFSAVG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
            +EG  +L+  ELVSLSEQQLV CD   D         GC+GGLM  AF+++L+   G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCDDMND---------GCDGGLMLQAFDWLLQNTNGHL 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK----IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E  YPY   +G   +   S     + A +    +I S E  MAA L K+GP+A
Sbjct: 209 HTEDSYPYVSGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 263


>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 346

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 95/281 (33%), Positives = 151/281 (53%), Gaps = 30/281 (10%)

Query: 6   LSSLLLLLLS-SVLASAVAVNDDDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTY 64
           ++S+L + +S ++L+ ++ V+   + +    P          + AEHH   + ++FS+ Y
Sbjct: 1   MTSILFMFVSLTILSMSLKVSQATSRVTFHEP----------IVAEHH-QQWMTRFSRVY 49

Query: 65  ATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLP 123
           + + E   RF VFK NL+  ++  +  D T   GV +F+D T  EF     GL     +P
Sbjct: 50  SDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTKEEFIATHTGLKGFNGIP 109

Query: 124 ADAQKAPILPTNDL-------PTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFL 176
           +      ++P+ +        P   DWR  GAVT VK QG CG CW+FS+  A+EG   +
Sbjct: 110 SSEFVDEMIPSWNWNVSDVAGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKI 169

Query: 177 STGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREKDYPYTG 236
             G LVSLSEQQL+DCD E         D+GCNGG+M+ AF YI+K  G+  E  YPY  
Sbjct: 170 VGGNLVSLSEQQLLDCDRE--------RDNGCNGGIMSDAFSYIIKNRGIASEASYPYQE 221

Query: 237 TDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLA 277
           T+ G+C+++ +K +A +  F  + S+ ++     V   P++
Sbjct: 222 TE-GTCRYN-AKPSAWIRGFQTVPSNNERALLEAVSRQPVS 260


>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 87/252 (34%), Positives = 140/252 (55%), Gaps = 18/252 (7%)

Query: 41  QSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKR-RQLLDPTAVHGVT 99
           +S+  L  +E H  L+ S+  + Y  + E   RF +FK N++  +   +  + +   G+ 
Sbjct: 28  RSQPELSVSERH-ELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMN 86

Query: 100 KFSDLTPSEFRRQFLGLNRRLRL----PADAQKAPI--LPTNDLPTDFDWRDHGAVTGVK 153
           +F+D+T  EF  +F GLN         P  + +  I  L  +D+P++ DWR+ GAVT VK
Sbjct: 87  EFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVK 146

Query: 154 DQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLM 213
            QG CG CW+FSA G+LEGA+ ++TG L+  SEQ+L+DC          + + GCNGG M
Sbjct: 147 HQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT---------TNNYGCNGGFM 197

Query: 214 NSAFEYILKAGGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKH 273
            +AF++I++ GG+ RE DY Y G +  +C+  +   A  +S++ V+   E  +   + K 
Sbjct: 198 TNAFDFIIENGGISRESDYEYQG-EQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ 256

Query: 274 GPLAGNVASIEL 285
               G  AS +L
Sbjct: 257 PVSIGIAASQDL 268


>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
          Length = 331

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 93/244 (38%), Positives = 130/244 (53%), Gaps = 18/244 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVT----KFSDLTPSE 108
           +S +K+   K Y   EE  +R  V++ NL+  K+         H  T     F DLT  E
Sbjct: 29  WSQWKAAHGKLYDENEE-GWRRAVWEKNLKVIKQHNQEYSQGKHSFTMAMNAFGDLTNEE 87

Query: 109 FRRQFLGLNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           F++   GL  + R   +  +AP  P  + P+  DWR  G VT VK+QG CGSCW+FSATG
Sbjct: 88  FKQVMNGLKSQKRKEGNVFQAP--PFAETPSSVDWRKKGYVTPVKNQGPCGSCWAFSATG 145

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILKAGGVER 228
           ALEG  F  T  LVSLSEQ LVDC            + GC+GGLM+ AF+Y+   GG++ 
Sbjct: 146 ALEGQMFRKTKRLVSLSEQNLVDCSQ-------AEGNEGCSGGLMDYAFQYVKDNGGLDS 198

Query: 229 EKDYPYTGTDGGSCKFDKSKIAAAVSNFSVISSDEDQMAANLVKHGPLAGNVASIELPHI 288
           E+ YPY   D  SCK+   + AA  + F  I  +E+ +   +   GP++   A+I+    
Sbjct: 199 EESYPYRAQD-ESCKYKPEQSAANDTGFMDIHPEEESLKLAVATVGPIS---AAIDASLS 254

Query: 289 SFSF 292
           +F F
Sbjct: 255 TFQF 258


>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
          Length = 341

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 93/250 (37%), Positives = 133/250 (53%), Gaps = 22/250 (8%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRR-AKRRQLLDPTAVH---GVTKFSDLTPSE 108
           + LFK++FSK Y T+ E  +R +VF  N  + A+  +L     V     +  F DL   E
Sbjct: 31  WELFKTQFSKAYNTEIEEKFRMKVFMDNKHKIARHNKLFQNGEVSYELEMNHFGDLLHHE 90

Query: 109 FRRQFLGLNRRLRLPA--DAQKAPILPTNDL--PTDFDWRDHGAVTGVKDQGACGSCWSF 164
           F +   G    LR     +      +P  ++  P   DWR  GAVT VK+QG CGSCW+F
Sbjct: 91  FVKTVNGYRHSLRRVTGDEIDSVTFIPAYNVTVPDSVDWRTEGAVTEVKNQGQCGSCWAF 150

Query: 165 SATGALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSC-DSGCNGGLMNSAFEYILKA 223
           S TG+LEG HF +T +L SLSEQ L+DC        SG   ++GC+GGLM++AF YI   
Sbjct: 151 STTGSLEGQHFRNTKQLTSLSEQNLIDC--------SGKYGNNGCSGGLMDNAFAYIKSN 202

Query: 224 GGVEREKDYPYTGTDGGSCKFDKSKIAAAVSNFSVI-SSDEDQMAANLVKHGPLAGNVAS 282
            G++ E+ YPY G D   C++   +  A    F  I   DE+++   +   GP++    +
Sbjct: 203 KGIDTEQSYPYEGID-DKCRYKPQESGATDKGFVDIPQGDEEKLKLAVATVGPIS---VA 258

Query: 283 IELPHISFSF 292
           I+  H SF F
Sbjct: 259 IDASHQSFQF 268


>gi|2780176|emb|CAA71085.1| cystein proteinase [Leishmania mexicana]
          Length = 443

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 91/234 (38%), Positives = 124/234 (52%), Gaps = 18/234 (7%)

Query: 53  FSLFKSKFSKTYATQEEHDYRFRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQ 112
           F  FK  + + Y T  E   R   F+ NL   +  Q  +P A  G+TKF DL+ +EF  +
Sbjct: 38  FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEFAAR 97

Query: 113 FLG----LNRRLRLPADAQKAPILPTNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATG 168
           +L          R  A   +      + +P   DWR+ GAVT VK+QGACGSCW+FSA G
Sbjct: 98  YLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVG 157

Query: 169 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEESGSCDSGCNGGLMNSAFEYILK--AGGV 226
            +EG  +L+  ELVSLSEQQLV CD           D+GC+GGLM  AF+++L+   G +
Sbjct: 158 NIEGQWYLAGHELVSLSEQQLVSCD---------DMDNGCSGGLMLQAFDWLLQNTNGHL 208

Query: 227 EREKDYPYTGTDGGSCKFDKSK---IAAAVSNFSVISSDEDQMAANLVKHGPLA 277
             E  YPY   +G   +   S    + A +    +I S E  MAA L K+GP+A
Sbjct: 209 YTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIA 262


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.133    0.391 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,631,252,965
Number of Sequences: 23463169
Number of extensions: 193130953
Number of successful extensions: 473898
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4839
Number of HSP's successfully gapped in prelim test: 1605
Number of HSP's that attempted gapping in prelim test: 457566
Number of HSP's gapped (non-prelim): 6881
length of query: 300
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 159
effective length of database: 9,050,888,538
effective search space: 1439091277542
effective search space used: 1439091277542
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)