BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy12185
         (317 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|350415610|ref|XP_003490694.1| PREDICTED: cathepsin O-like [Bombus impatiens]
          Length = 355

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 163/316 (51%), Positives = 210/316 (66%), Gaps = 19/316 (6%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--SEHDIRFKNFEK 62
           + V  IV +++LCFLAIP++VS       L+LF ++  RY KSY    +E++ RFK F K
Sbjct: 4   RTVAVIVLVVSLCFLAIPIRVSPNTSNGDLKLFQNYVMRYNKSYRNDPTEYEERFKRFLK 63

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV----NKHVLMSHHKHHD- 117
           SL  IE++N  R S ESA YG+TEFSD+SE+EF +  L   +     KHV  S+H+ H  
Sbjct: 64  SLRHIEKMNGLRPSQESAYYGLTEFSDMSEDEFLSLTLLPDLPARGEKHVNESYHRRHHL 123

Query: 118 -HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
               N VKK            IP++ DWR+ G+I  VRNQ +CGACWAFSTVE  ESM+A
Sbjct: 124 LQSTNRVKK---------SVSIPLRFDWRDKGVITPVRNQGSCGACWAFSTVEVVESMYA 174

Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
           +KNGTL +LSVQE+IDCA N N GC GGD C+LL W+  +KV +  ES YPL+ K + CK
Sbjct: 175 IKNGTLHMLSVQEMIDCAKNSNFGCEGGDICSLLSWLLASKVQIFQESTYPLVGKTSMCK 234

Query: 237 --RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD 294
             +     +GVKI+ + CD  + +E  +L  +ATHGPV AAVNAL+WQ YLGGVIQY+CD
Sbjct: 235 LGKMIDKASGVKIRDFNCDNFVDAEDELLITVATHGPVAAAVNALSWQNYLGGVIQYHCD 294

Query: 295 GSLANINHAVQIVGYD 310
            S  N+NHAVQIVGYD
Sbjct: 295 SSFDNLNHAVQIVGYD 310


>gi|328789602|ref|XP_623690.2| PREDICTED: cathepsin O-like [Apis mellifera]
          Length = 368

 Score =  308 bits (790), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 157/314 (50%), Positives = 213/314 (67%), Gaps = 18/314 (5%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEK 62
           K ++F + +++LCFLAIP+KV  P+  + ++LF ++  RY KSY  + SE++ RFK F++
Sbjct: 20  KTIVFTILVVSLCFLAIPIKVD-PDNNEDIKLFQNYVIRYNKSYRNNPSEYEERFKRFQR 78

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV----NKHVLMSHHKHHDH 118
           SL  IE +N  R S ESA YG+TEFSD+SE EF    L   +     KH+  S+H+ H  
Sbjct: 79  SLQHIERMNGLRSSQESAYYGLTEFSDMSENEFLLHTLLPDLPIRGEKHMNASYHRKHQI 138

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             + +K RSI+        IP++ DWR+ G+I  VR+Q +CGACWAFST+E  ESM A+K
Sbjct: 139 SIDRMK-RSIS--------IPLRFDWRDKGVITPVRSQGSCGACWAFSTIEVIESMFAIK 189

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK-- 236
           NGTL  LSVQE+IDCA N N GC GGD C+LL W+ ++KV +  ES YPL+     CK  
Sbjct: 190 NGTLHSLSVQEMIDCAKNSNFGCEGGDICSLLSWLLISKVQILQESIYPLVGMTGTCKLG 249

Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
           +       +KI+ +TCD+ + +E  +L  +ATHGPV AAVNAL+WQ YLGGVIQY+CDGS
Sbjct: 250 KMTDKTFNIKIQDFTCDSFVDAEDELLIALATHGPVAAAVNALSWQNYLGGVIQYHCDGS 309

Query: 297 LANINHAVQIVGYD 310
             N+NHAVQI+GYD
Sbjct: 310 FNNLNHAVQIIGYD 323


>gi|380026170|ref|XP_003696831.1| PREDICTED: cathepsin O-like [Apis florea]
          Length = 368

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 156/310 (50%), Positives = 210/310 (67%), Gaps = 10/310 (3%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--SEHDIRFKNFEK 62
           K + F + +++LCFLAIP+KV  P+  + ++LF ++  RY KSY    SE++ RFK F++
Sbjct: 20  KTIAFTILVVSLCFLAIPIKVD-PDNNEDIKLFQNYVVRYNKSYKNDPSEYEERFKRFQR 78

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           SL  IE +N  R S ESA YG+TEFSD+SE+EF    L H++   + +   KH +  + H
Sbjct: 79  SLQHIERMNGLRSSQESAYYGLTEFSDMSEDEF----LLHTLLPDLPIRGEKHKNAPY-H 133

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
            K +  T  +     IP + DWR+ G+I  VR+Q +CGACWAFST+E  ESM A+KNGTL
Sbjct: 134 RKHQVSTDRMKRSISIPSRFDWRDKGVITPVRSQGSCGACWAFSTIEVIESMFAIKNGTL 193

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK--RKAT 240
             LSVQE+IDCA N N GC GGD C+LL W+ V+KV +  ES YPL+     CK  +   
Sbjct: 194 HSLSVQEMIDCAKNSNFGCEGGDICSLLSWLLVSKVQILQESIYPLVGMTGTCKLGKMTD 253

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
              G+KI+ +TCD+ + +E  +L  +ATHGPV AAVNAL+WQ YLGGVIQY+CDGS  N+
Sbjct: 254 KAFGIKIQDFTCDSFVDAEDELLIALATHGPVAAAVNALSWQNYLGGVIQYHCDGSFDNL 313

Query: 301 NHAVQIVGYD 310
           NHAVQI+GYD
Sbjct: 314 NHAVQIIGYD 323


>gi|383852175|ref|XP_003701604.1| PREDICTED: cathepsin O-like [Megachile rotundata]
          Length = 370

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 150/318 (47%), Positives = 216/318 (67%), Gaps = 16/318 (5%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEK 62
           + V   V +++LCFL IP++V      + ++LF ++  RY K+Y    +E++ RF+ F++
Sbjct: 20  RTVALTVLIVSLCFLVIPIRVDPDPSSEDIKLFKNYVTRYNKTYRNDPTEYEERFQRFQR 79

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV----NKHVLMSHHKHHDH 118
           SL  IE +N  R SPESA YG+TEFSD++E+EF+++ L   +     KH    +H+ H  
Sbjct: 80  SLRHIETMNSLRSSPESAFYGLTEFSDMTEDEFRSQALSPDLAARGEKHATAPYHRLHRL 139

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
            H++  +R+        T +P++ DWR+ G+I  VR+Q  CGACWAFSTVE AESM A++
Sbjct: 140 KHSNRVRRA--------TVVPLRFDWRDKGVITPVRSQGACGACWAFSTVEVAESMFAIQ 191

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
           NGTL  LSVQE+IDCA N N GC GGD C+LL W+ ++KV +  E  YPL  K   CK +
Sbjct: 192 NGTLYPLSVQEMIDCAKNSNFGCEGGDICSLLSWLLLSKVQIFQEHAYPLTRKTDTCKLE 251

Query: 239 ATSP--NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
            T+   +GV+IK +TCD+ + +E  +++ +ATHGPV AAVNAL+WQ YLGGVIQ++CDGS
Sbjct: 252 KTAGKISGVRIKDFTCDSFVDAEDELVSTLATHGPVAAAVNALSWQNYLGGVIQFHCDGS 311

Query: 297 LANINHAVQIVGYDNYSR 314
             ++NHAVQIVGYD  ++
Sbjct: 312 FDSLNHAVQIVGYDKSAK 329


>gi|340710428|ref|XP_003393792.1| PREDICTED: cathepsin O-like [Bombus terrestris]
          Length = 355

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 153/304 (50%), Positives = 199/304 (65%), Gaps = 19/304 (6%)

Query: 17  CFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEKSLDIIEELNKNR 74
           CFLAIP++VS       L+LF ++  RY KSY  + +E++ RFK F KSL  IE++N  R
Sbjct: 16  CFLAIPIRVSPDTSNGDLKLFQNYVMRYNKSYRNNPTEYEERFKRFRKSLRHIEKMNGLR 75

Query: 75  QSPESARYGITEFSDLSEEEFKTRHLRHSVN----KHVLMSHHKHHD--HHHNHVKKRSI 128
            S ESA YG+TEFSD+SE+EF +  L   ++    KH   S+H+ H      N VKK   
Sbjct: 76  PSQESAYYGLTEFSDMSEDEFLSLTLLPDLSARGEKHANESYHRRHHLLQSTNRVKK--- 132

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                    IP++ DWR+ G+I  VR+Q +CGACWAFST+E  ESM+A+KNGTL +LSVQ
Sbjct: 133 ------SVSIPLRFDWRDKGVITPVRSQGSCGACWAFSTIEVVESMYAIKNGTLYMLSVQ 186

Query: 189 EVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN--GVK 246
           E+IDCA N N GC GGD  +LL W+  +KV +  ES YPL+ K + CK      N  GVK
Sbjct: 187 EMIDCAKNKNFGCEGGDIYSLLSWLLASKVQIFQESTYPLVGKTSMCKLGKMIDNAFGVK 246

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQI 306
           I+ + CD  + +E  +L  +ATHGPV A VNAL+WQ YLGGVIQY+CD +  N NHAVQI
Sbjct: 247 IRDFNCDNFVDAEDELLIKVATHGPVAAVVNALSWQNYLGGVIQYHCDSTYDNRNHAVQI 306

Query: 307 VGYD 310
           +GYD
Sbjct: 307 IGYD 310


>gi|307206026|gb|EFN84119.1| Cathepsin O [Harpegnathos saltator]
          Length = 353

 Score =  290 bits (742), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 143/300 (47%), Positives = 197/300 (65%), Gaps = 9/300 (3%)

Query: 15  ALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEKSLDIIEELNK 72
           +LCF  +P++V      + ++LF  +  RY KSY     E++ RF  F++SL  IE +N 
Sbjct: 14  SLCFFMVPIRVGPDKNAEDIKLFVDYVARYNKSYRHDPPEYNERFDRFQRSLRHIERMNG 73

Query: 73  NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGI 132
            R S ESA YG+TEFSDLSE+EF  R L   ++    M  HK   ++H H K  +  +  
Sbjct: 74  FRSSQESAYYGLTEFSDLSEDEFVQRTLLPDLSSRGQM--HKAASYYHRHTKNTNNRS-- 129

Query: 133 TIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVID 192
              T +P K DWR+ G++G +++Q+ CGACWAFST+  AESM+A+KNGTL   SVQE+ID
Sbjct: 130 ERETNVPPKIDWRDKGVVGPIQSQEICGACWAFSTIGVAESMYAMKNGTLYPFSVQEMID 189

Query: 193 CAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK--RKATSPNGVKIKSY 250
           C   G+ GC GGD C+LL W+  +K  + PES YPL  +D  CK  + +   +GV I  +
Sbjct: 190 CM-PGDFGCQGGDICSLLSWLLTSKTKIIPESAYPLTRRDDQCKLLKLSAKTSGVGITDF 248

Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           TCD+   +E  +L  +A+HGPV AAVNA++WQ YLGGVIQY+CDGS +++NHAVQIVGYD
Sbjct: 249 TCDSFADAEDELLALLASHGPVAAAVNAISWQNYLGGVIQYHCDGSFSSLNHAVQIVGYD 308


>gi|332024588|gb|EGI64786.1| Cathepsin O [Acromyrmex echinatior]
          Length = 356

 Score =  286 bits (733), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 145/306 (47%), Positives = 200/306 (65%), Gaps = 14/306 (4%)

Query: 17  CFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEKSLDIIEELNKNR 74
           CF  +P+KV     E+  ELF+++  RY KSY    ++++ RF++F+KSL  IE+LN  R
Sbjct: 16  CFFIVPIKVDFDKTEKDAELFANYIARYNKSYRNDPAKYEERFEHFQKSLRHIEKLNSLR 75

Query: 75  QSPESARYGITEFSDLSEEEFKTRHLRHSV----NKHVLMSHHKHHDHHHNHVKKRSITT 130
            S ESA YG+TEFSDLS++EF  + L   +     KH   S++  H     +  KR I  
Sbjct: 76  SSQESAYYGLTEFSDLSDDEFIQQALIPDLPLRGQKHTTASYYHQHFMGSVNRMKRMIPI 135

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
                 GIP K DWR+ G++G V +Q+ CGACWAFSTV  AESM+A++NGTL   SVQE+
Sbjct: 136 -----IGIPSKFDWRDKGVVGPVMSQENCGACWAFSTVGVAESMYAIENGTLHSFSVQEM 190

Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK--RKATSPNGVKIK 248
           IDC   GN GC GGD C+LL W+  +K  +  E +YPL L+   C+  + +   +GV+I 
Sbjct: 191 IDCM-PGNFGCQGGDICSLLSWLLASKTRIISEIDYPLTLQTDTCRLHKISAKTSGVRIT 249

Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
            +TCD+ + +E+ +LT + THGPV  AVNA++WQ YLGG+IQYNCD S  ++NHAVQIVG
Sbjct: 250 DFTCDSFVDAETELLTLLVTHGPVAVAVNAISWQNYLGGIIQYNCDSSFNSLNHAVQIVG 309

Query: 309 YDNYSR 314
           YD  +R
Sbjct: 310 YDTEAR 315


>gi|307169691|gb|EFN62267.1| Cathepsin O [Camponotus floridanus]
          Length = 358

 Score =  281 bits (718), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 151/308 (49%), Positives = 206/308 (66%), Gaps = 18/308 (5%)

Query: 17  CFLAIPVKVSKPNLE-QKLELFSSFQQRYKKSY--SKSEHDIRFKNFEKSLDIIEELNKN 73
           CF  IP+KV KPN   +  +LF ++  +Y KSY    +E+  RF+ F+KSL  IE++N  
Sbjct: 16  CFFIIPIKV-KPNKNVEDAKLFENYIVQYNKSYRNDSTEYKKRFECFQKSLRHIEKMNSF 74

Query: 74  RQSPESARYGITEFSDLSEEEFKTRHLRHSVN----KHVLMSH-HKHHDHHHNHVKKRSI 128
           + S ESA YG+T+FSDLSE+EF  + L   ++    KH   S+ H++  +  NH  KR+I
Sbjct: 75  QSSQESAYYGLTKFSDLSEDEFLQQTLLPDLSLRNQKHTTASYYHQYFTNSSNH-GKRAI 133

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                IP  IP K DWR  G++G V+ Q  CGACWAFST+   ESM+A+KNGTL   SVQ
Sbjct: 134 -----IPPPIPSKVDWRNRGVVGPVQYQDNCGACWAFSTIGVVESMYAIKNGTLYPFSVQ 188

Query: 189 EVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP--NGVK 246
           E+IDC   G+ GC GGD CALL W+  +K  +  E+ YPL L++  CK   TS    GVK
Sbjct: 189 EMIDCMP-GSYGCQGGDTCALLSWLLESKTKIISENVYPLTLRNDPCKLSKTSAKTTGVK 247

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQI 306
           I  +TC++ + +ES++LT + THGPV+A VNA++WQ YLGG+IQY+CDGS +++NHAVQI
Sbjct: 248 ITDFTCNSFVNAESNLLTLLGTHGPVVAGVNAISWQNYLGGIIQYHCDGSFSHLNHAVQI 307

Query: 307 VGYDNYSR 314
           VGYD  +R
Sbjct: 308 VGYDMAAR 315


>gi|156553312|ref|XP_001599758.1| PREDICTED: cathepsin O-like [Nasonia vitripennis]
          Length = 345

 Score =  268 bits (684), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 139/286 (48%), Positives = 186/286 (65%), Gaps = 14/286 (4%)

Query: 37  FSSFQQRYKKSYSKS--EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           F ++ Q YKK Y     E++ RF  F++SL  IE LN+ R S +SARYG+T++SD++E+E
Sbjct: 26  FEAYVQDYKKPYKNDPDEYERRFGRFQQSLRKIESLNRLRSSADSARYGLTDYSDMTEQE 85

Query: 95  FKTRHLRHSVNKHV--LMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           F   +LR  ++         H HH+H  N   +R++T        +P K DWR  G +  
Sbjct: 86  FLALNLRPDLSNRSEKHHQCHYHHNHSDNKRYERAVTV-------LPDKFDWRTKGAVTA 138

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V++Q +CGACWAFS VETAESM A+ N TL   SVQE+IDCAGN N GC GGD C+LLDW
Sbjct: 139 VKSQGSCGACWAFSAVETAESMFAISNKTLRAFSVQEMIDCAGNSNFGCEGGDICSLLDW 198

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSP---NGVKIKSYTCDTLIPSESSILTDIATH 269
           + V+K  + PE  YPL     ACK + T+     G++I  +TCD  + +E  +L  +AT 
Sbjct: 199 LLVSKTEILPEINYPLTRTTDACKLQKTATKIQEGIRISDFTCDNYVGAEDKLLKVLATK 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
           GPV AAVNAL+WQ YLGGVIQ++CDGS  ++NHAVQIVGYD  + T
Sbjct: 259 GPVAAAVNALSWQNYLGGVIQFHCDGSFKSLNHAVQIVGYDKTATT 304


>gi|401758202|gb|AFQ01136.1| cathepsin O2-like protease [Chilo suppressalis]
          Length = 368

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 137/324 (42%), Positives = 191/324 (58%), Gaps = 15/324 (4%)

Query: 1   MFDVKNVLFIVALIALC--FLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKS--EHDI 55
           M+ +    +++ ++  C  F+ +P+  S    +++L+ +F  + ++Y KSY  +  E++ 
Sbjct: 1   MYKINWWTWVLGIVLFCLLFIVVPISYSASTSKEQLKPIFDQYIEKYNKSYKNNPEEYET 60

Query: 56  RFKNFEKSLDIIEELNKNRQSPES--ARYGITEFSDLSEEEFKTRHL------RHSVNKH 107
           RF++F  S+  I+ LN   + PE   ARYG T+ SD+S  E+K  HL      +      
Sbjct: 61  RFQHFLVSMSEIDRLNSESRGPEQYRARYGPTKLSDMSPTEYKDLHLSDEKLTKSPATYD 120

Query: 108 VLMSHHKHHDHHH-NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFS 166
                H   D++H   V +R           +P+  DWR  G +G VRNQ  CGACWAFS
Sbjct: 121 RSWRKHNQRDYYHVQDVNERKENLIRKKRASLPMLVDWRVKGAVGAVRNQGLCGACWAFS 180

Query: 167 TVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEY 226
           TV T ESM A+  G L  LSVQEVIDCA  GN GCSGGD C LLDW+ +    +E E +Y
Sbjct: 181 TVGTMESMAAINTGKLPALSVQEVIDCARLGNQGCSGGDICLLLDWLMITNTPVEVEKDY 240

Query: 227 PLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLG 286
           PL L +  CK K  +  GV++ S+TCD  + +E  I+  +A HGPV  AVNALTWQ YLG
Sbjct: 241 PLQLTNGVCKAKKNT-TGVRVTSFTCDDFVGTEQKIIEALALHGPVAVAVNALTWQNYLG 299

Query: 287 GVIQYNCDGSLANINHAVQIVGYD 310
           GVIQY+C G   ++NHAVQ+VGYD
Sbjct: 300 GVIQYHCSGDAMDLNHAVQLVGYD 323


>gi|357609157|gb|EHJ66323.1| putative Cathepsin O precursor [Danaus plexippus]
          Length = 382

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 142/342 (41%), Positives = 195/342 (57%), Gaps = 47/342 (13%)

Query: 6   NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEKS 63
           N + +VAL+ L F+AIP+       E    +F  + + + K+Y    +E++ R ++F  S
Sbjct: 6   NWILVVALVCLLFVAIPLSYPDRTKESLRPMFDEYIENFNKTYKDDPAEYEKRLEHFVAS 65

Query: 64  LDIIEELNKNRQSPES--ARYGITEFSDLSEEEFKTRHLR----HSVNKHVL-------- 109
           +  I+ LN   + PE   ARYG+T+ SD+S++EF+  HL     H   +H L        
Sbjct: 66  VKEIDRLNSAARGPEQHRARYGLTQMSDMSKDEFRDVHLSDEQPHRYRRHKLGKSWSKGR 125

Query: 110 -----------------MSHHKHHDHHHNHV----KKRSITTGITIPTGIPVKKDWREAG 148
                                K    HHN      KKR++         +P++ DWR  G
Sbjct: 126 VKDIEDVADNMDDYDDEDDDDKEGSPHHNIYIVIRKKRAM---------LPLQVDWRTKG 176

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
           +IG VR+Q  CGACWAFST+ T E+M A+  G L+ LSVQEVIDCAG GN GC+GGD C 
Sbjct: 177 VIGPVRDQGLCGACWAFSTIGTMEAMAAIDTGKLNTLSVQEVIDCAGLGNSGCAGGDICL 236

Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
           LLDW+ +    ++ E EYPL L +  C+ K  +  GVK+  +TC  L+ +E  I+  IAT
Sbjct: 237 LLDWLLMTDTAVQVEKEYPLKLTNGVCQAKKNA-TGVKVAKFTCTDLVGAEDKIIESIAT 295

Query: 269 HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           HGPV  AVNALTWQ YLGGVIQY+C GS   +NHAV++VGYD
Sbjct: 296 HGPVAVAVNALTWQNYLGGVIQYHCSGSPKELNHAVELVGYD 337


>gi|321475753|gb|EFX86715.1| hypothetical protein DAPPUDRAFT_187469 [Daphnia pulex]
          Length = 360

 Score =  247 bits (631), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 141/317 (44%), Positives = 192/317 (60%), Gaps = 20/317 (6%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPN-LEQKLELFSSFQQRYKKSYSKS--EHDIRFKNFE 61
           KNV+  + L +LCFL IP+++ +P+ ++Q+   F  F +++ KSY +   E+  R   F+
Sbjct: 6   KNVICALGLFSLCFLGIPIRIDQPDSMKQE---FKQFIEKHNKSYGRDPVEYGRRLSYFK 62

Query: 62  KSLDIIEELN--KNRQSPESARYGITEFSDLSEEEFKTRHLRH--SVNKHVLMSHHKHHD 117
            S    +E N  K+ Q    A +GIT+FSDL   EF+   LRH  S    V+ S+  H +
Sbjct: 63  ASHSRAKEYNMLKHNQDNGHASFGITKFSDLDANEFQEMLLRHKPSSLSCVIGSNLNHVN 122

Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
            +    +KR I         +P   DWRE  ++  V+NQ +CGACWAFSTV+T ESMHA+
Sbjct: 123 RNR---RKREIPNAQKNFKQLPSYVDWREKNVVTAVKNQHSCGACWAFSTVQTVESMHAI 179

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK- 236
             G L+ LS Q+VIDCA NGN GC GGD C  L WM  + V L  E +YPL LKD  CK 
Sbjct: 180 ATGELNELSTQQVIDCARNGNKGCIGGDTCTALTWMSASNVSLLEEKQYPLTLKDQRCKT 239

Query: 237 --RKATSPNGVKIKS-YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNC 293
               +TS  GV++ S +TC  L+ +E  +   +A HGPV AAV+A+TWQ YLGG+IQY+C
Sbjct: 240 VFEGSTSSGGVRLASNFTCYNLVDNEEQLKHILAFHGPVTAAVDAVTWQDYLGGIIQYHC 299

Query: 294 DGSLANINHAVQIVGYD 310
                + NHAVQIVGYD
Sbjct: 300 RD---HTNHAVQIVGYD 313


>gi|189236657|ref|XP_970512.2| PREDICTED: similar to cathepsin o [Tribolium castaneum]
          Length = 329

 Score =  224 bits (571), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 120/301 (39%), Positives = 177/301 (58%), Gaps = 28/301 (9%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEE 69
           +  IAL F  IP+++  P  +Q    F  + +R+ K+Y   S +  R   F++SL  IE 
Sbjct: 11  IFYIALLFFVIPIRIKGP--DQAESQFQEYLKRFNKTYDDPSVYQNRLHAFKQSLQTIET 68

Query: 70  LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
           LN  +++  SA YG+T+FSDL  EEF   +L+ ++++    +  K H H      KR+  
Sbjct: 69  LNSKKRNG-SALYGLTKFSDLLPEEFFQTYLQSNLSQKTHSNEPKRHHH------KRAT- 120

Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
                   +P K DWRE   + ++ NQ +CGACWA+S +ET ESM+A+K      LSVQE
Sbjct: 121 --------VPNKVDWREKNAVTRIYNQGSCGACWAYSVIETVESMNAIKTNKSEELSVQE 172

Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           +IDCAGN N GC+GGD C LL W+      ++  ++Y        C R +    GV ++ 
Sbjct: 173 IIDCAGN-NKGCNGGDICTLLSWIKATNFTIQRHADY-----GGKCGRGSA---GVHVRD 223

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           + C+ L+ SE  +L  +A +GP+  A+NA TWQ Y+GGVI+Y+CDG  + +NHAVQIVGY
Sbjct: 224 FMCEGLVGSEDVMLRLLADNGPLAVAINAQTWQNYIGGVIEYHCDGDPSKLNHAVQIVGY 283

Query: 310 D 310
           D
Sbjct: 284 D 284


>gi|332373716|gb|AEE61999.1| unknown [Dendroctonus ponderosae]
          Length = 346

 Score =  224 bits (570), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 124/312 (39%), Positives = 188/312 (60%), Gaps = 21/312 (6%)

Query: 2   FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFKN 59
           F  K  + +   IAL F  +P K+    L+ ++ E F  +   + KSY  ++E   RF  
Sbjct: 8   FTYKTYIELGFYIALLFFVVPCKIK---LDSEIREQFHEYLSDFNKSYPQEAEFQFRFAA 64

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF-KTRHLRHSVNKHVLMSHHKHHDH 118
           F+KSL  IE+LN N+ +  SA+YG+T+FSD + EEF   ++ R  V + +          
Sbjct: 65  FKKSLANIEQLNANK-TKSSAQYGLTKFSDFTAEEFLDLQNNRAGVRRDL-------RGA 116

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             + +KK ++ +       +P   DWR   ++ KV+NQ+ CGACWAF+  ET ESM A+K
Sbjct: 117 AQSRLKKVALRSAY--EKELPQIVDWRNKNVVSKVKNQKNCGACWAFAVSETIESMQAIK 174

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
              L+ LS+Q++IDC+   N GC GGD CALL W+ VN + +  E++YPL+L+D  C++ 
Sbjct: 175 TQQLTDLSIQQLIDCSSYNN-GCKGGDTCALLRWIKVNNIAIMNETDYPLVLEDQKCQKT 233

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
             S  GVK+ +Y C++ +  E  IL  +A +GPV  A++  TWQ Y+GGVIQ++C+G L+
Sbjct: 234 DMS-EGVKVGTYQCNSFVGREDIILKLLAINGPVAVAISGETWQNYVGGVIQFHCEGDLS 292

Query: 299 NINHAVQIVGYD 310
              HAVQIVGY+
Sbjct: 293 ---HAVQIVGYN 301


>gi|241111179|ref|XP_002399230.1| cysteine protease and A protease inhibitor, putative [Ixodes
           scapularis]
 gi|215492918|gb|EEC02559.1| cysteine protease and A protease inhibitor, putative [Ixodes
           scapularis]
          Length = 363

 Score =  221 bits (562), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 118/289 (40%), Positives = 169/289 (58%), Gaps = 15/289 (5%)

Query: 24  KVSKPNLEQKLELFSSFQQRYKKSYSK--SEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
           + + P++E     F  + +RY K+Y+   +E+  R   F  +L  IE+ N++      A 
Sbjct: 37  RTADPSVEAA---FEQYVKRYNKTYASGSAEYSKRLNAFRDALIRIEDRNRHGNHSNGAL 93

Query: 82  YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
           YG+T +SDL+ +EF     R  +       + +   +   H   +    G T P   P K
Sbjct: 94  YGLTPYSDLTPDEF-----RALLATFAPAENTRTEANEVEHDDLQLALPGATSPR-YPPK 147

Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGC 201
            DWR  G++  VRNQ+ CGACWAFSTVET E+MHAL  GTL+  SVQ++IDC+ N N GC
Sbjct: 148 FDWRTRGVVTAVRNQRDCGACWAFSTVETVETMHALAAGTLTGFSVQQMIDCSNNSNHGC 207

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESS 261
           +GGD CA L W+ VN++ L  +S YP      +C+  A+     ++  YTCD L+ +E  
Sbjct: 208 NGGDTCAALKWLKVNRIKLVRDSVYPFKAVTGSCQHPASDVT-AEVSDYTCDRLVGNEER 266

Query: 262 ILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           ++  +A  GP++ AV+A TWQ YLGGVIQ++CD   A  NHAVQIVGYD
Sbjct: 267 MIDMLANVGPLVVAVDATTWQDYLGGVIQFHCD---AGRNHAVQIVGYD 312


>gi|270006364|gb|EFA02812.1| cathepsin O precursor [Tribolium castaneum]
          Length = 326

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 119/301 (39%), Positives = 175/301 (58%), Gaps = 31/301 (10%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEE 69
           +  IAL F  IP+++  P  +Q    F  + +R+ K+Y   S +  R   F++SL  IE 
Sbjct: 11  IFYIALLFFVIPIRIKGP--DQAESQFQEYLKRFNKTYDDPSVYQNRLHAFKQSLQTIET 68

Query: 70  LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
           LN  +++  SA YG+T+FSDL  EEF   +L+ ++++    +  K H H      KR+  
Sbjct: 69  LNSKKRNG-SALYGLTKFSDLLPEEFFQTYLQSNLSQKTHSNEPKRHHH------KRAT- 120

Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
                   +P K DWRE   + ++ NQ +CGACWA+S +ET ESM+A+K      LSVQE
Sbjct: 121 --------VPNKVDWREKNAVTRIYNQGSCGACWAYSVIETVESMNAIKTNKSEELSVQE 172

Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           +IDCAGN N GC+GGD C LL W+      ++  ++Y        C R +    GV ++ 
Sbjct: 173 IIDCAGN-NKGCNGGDICTLLSWIKATNFTIQRHADY-----GGKCGRGSA---GVHVRD 223

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +    L+ SE  +L  +A +GP+  A+NA TWQ Y+GGVI+Y+CDG  + +NHAVQIVGY
Sbjct: 224 F---ILVGSEDVMLRLLADNGPLAVAINAQTWQNYIGGVIEYHCDGDPSKLNHAVQIVGY 280

Query: 310 D 310
           D
Sbjct: 281 D 281


>gi|157134825|ref|XP_001656461.1| cathepsin o [Aedes aegypti]
 gi|108884338|gb|EAT48563.1| AAEL000420-PA [Aedes aegypti]
          Length = 375

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 122/340 (35%), Positives = 191/340 (56%), Gaps = 40/340 (11%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS--EHDIRFK 58
           M +V  ++ I+ ++ LCFL IP  +   ++ +  + F +F + Y K Y  +  E+D RF+
Sbjct: 1   MSEVIEMIMILIIVTLCFLMIPFNLQPNSVIEARKKFDTFIKLYDKPYRYNVREYDHRFQ 60

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL---RHSV------NKHV- 108
            F  SL+ I  LN +R   ++A YGIT+++DL+++EF   HL   +H        N+ V 
Sbjct: 61  IFRVSLNKIASLNAHRVENDTAIYGITQYADLTDQEFLRLHLADLKHETTPGTANNRGVS 120

Query: 109 ----LMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWA 164
                +   K  +   + +  R+    + I   +P   DWR+ G++  VR+Q +CGACWA
Sbjct: 121 VLDKFIIESKSAEMKDDIIFSRA-KRDLKILDYLPKVVDWRDKGVVAPVRSQGSCGACWA 179

Query: 165 FSTVETAESMHALK-NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPE 223
            S V+T  S+ A+K     S L + +VI+CAGNGN GC GGD C LL+W+   KV L   
Sbjct: 180 ISVVDTITSISAIKRQQNFSELCLDQVINCAGNGNFGCEGGDTCRLLEWLKEEKVKLNTL 239

Query: 224 SEYPLLLKDAACKRKATSPNG-------------VKIKSYTCDTLIPSESSILTDIATHG 270
            +         C+   TS NG             + +  ++C +L+  E  +L  +ATHG
Sbjct: 240 KQ---------CEALDTSKNGPNCTFQQASNGEYLSLNQFSCVSLVDREHLMLRYLATHG 290

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P++AAVNA +W+YYLGGVIQY+C+ +  ++NHAV+IVGY+
Sbjct: 291 PIVAAVNAASWKYYLGGVIQYHCEEAYEDLNHAVEIVGYN 330


>gi|328711164|ref|XP_003244460.1| PREDICTED: cathepsin O-like [Acyrthosiphon pisum]
          Length = 339

 Score =  210 bits (535), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 122/307 (39%), Positives = 174/307 (56%), Gaps = 17/307 (5%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEK 62
           V NVL    +++   L +   +S   L +  + F+ F + Y KSY +++EH+ RF++F+K
Sbjct: 3   VSNVLKASLVVSSVVLILFFIMSITQLNRDQDKFNKFIKMYNKSYMNETEHNKRFEHFKK 62

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           SL  I+ L+  ++      YGITEFSDLS EEF   +L       V +   +        
Sbjct: 63  SLKTIQLLS--QKCNGCTNYGITEFSDLSTEEFTKIYLNS-----VTLRTPRTGTFSMAR 115

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
             KRSITT            DWR+ G++  VRNQ+ CGACWA S VE  ES++A+K G L
Sbjct: 116 -SKRSITTATLSSI------DWRDKGVVTSVRNQKNCGACWAISVVELIESVYAIKTGLL 168

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
              SVQE++DC+G  N GC+GG    LL W+  N + +  E  YP + KD  C    T  
Sbjct: 169 QTFSVQEMLDCSGGINQGCTGGSVVYLLLWLVENNITVYKEENYPTIYKDQMCTLDKTFD 228

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINH 302
            GVK+KS+    L+  E  +L+ I+   PV  A+NAL WQ+Y+GGV+   CD S+A++NH
Sbjct: 229 KGVKVKSFLTLNLVDREDLLLSYIS-KSPVSVALNALPWQFYVGGVLS-QCDNSMASLNH 286

Query: 303 AVQIVGY 309
           A +IVGY
Sbjct: 287 AAEIVGY 293


>gi|347968429|ref|XP_312205.5| AGAP002720-PA [Anopheles gambiae str. PEST]
 gi|333468007|gb|EAA08145.5| AGAP002720-PA [Anopheles gambiae str. PEST]
          Length = 383

 Score =  203 bits (516), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 117/340 (34%), Positives = 180/340 (52%), Gaps = 32/340 (9%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFK 58
           M +V  +L I+ ++ LCFL IP       + +    F  F + Y K Y     E+  RF+
Sbjct: 1   MTEVIEMLMIILIVTLCFLMIPFNTKPSAVIESRRKFDVFVRLYDKPYRGDAREYAYRFQ 60

Query: 59  NFEKSLDIIEELNK-NRQSPESARYGITEFSDLSEEEFKTRHLR--------------HS 103
            F  SL  I  LN+  R++ ++A YGIT+++DL++ EF  R L                +
Sbjct: 61  IFRTSLSKIRALNEWAREANDTAIYGITQYADLTDREFVARQLADLLPDEPGGGAGGPRA 120

Query: 104 VNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACW 163
             K+V+ S  +  +  ++ +  R+    +     +P + DWRE G+I  V+NQ  CGACW
Sbjct: 121 YQKYVIES--RSAEMKNDIIFSRARRDALPAVRNLPHRVDWREQGVISTVKNQGGCGACW 178

Query: 164 AFSTVETAESMHALKNGTLSLLSV--QEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVL 220
           A S V+T  ++ A+K     L+ +  + V+ CA NGN GC GGD C LL+W+ + +  + 
Sbjct: 179 AISVVDTIAALAAIKRNDRKLIDLCHERVVRCAANGNNGCDGGDTCRLLEWLAEESYRIG 238

Query: 221 EPESEYPLLLKD----------AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
             ES     L D          +  +R+  + N   +K ++C      E  +L  +AT G
Sbjct: 239 AAESCLERNLADQEGGLNCTGESGVRREDGALNATLVKRFSCQGYENEEHLMLRHLATKG 298

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P++AAVNA++W+YYLGGVIQY+CD     +NHAV IVGYD
Sbjct: 299 PIVAAVNAISWKYYLGGVIQYHCDSDYELLNHAVAIVGYD 338


>gi|410914437|ref|XP_003970694.1| PREDICTED: cathepsin O-like [Takifugu rubripes]
          Length = 328

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 104/276 (37%), Positives = 152/276 (55%), Gaps = 25/276 (9%)

Query: 37  FSSFQQRYKKSY--SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           F  F++R+ ++Y  +  + D R   F++S      LN    + +SA+YGI +FSDLS+ E
Sbjct: 32  FEWFRERFGRNYEVNSPQFDRRLFFFQESTTRHAYLNSFSAASQSAKYGINQFSDLSQRE 91

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F+  +LR S ++    S  K                      G+P K DWR+  I+  V+
Sbjct: 92  FQDLYLRASADRAPAFSGQK--------------------AEGLPAKFDWRDHAIVAPVQ 131

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           NQQ CG+CWAFS V   +S+HA+    L  LSVQ+V+DC+   N GC+GG   A L W+ 
Sbjct: 132 NQQACGSCWAFSVVGAVQSVHAIGGSQLVELSVQQVLDCSFQ-NKGCNGGTPVAALKWLT 190

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
             +V L P+SEYP   +   C   + S  GV +K++T       E +++  +  HGP+  
Sbjct: 191 QTRVKLVPQSEYPYKAQTRMCHFFSGSHGGVGVKNFTALDFSGQEEAMMGHLVKHGPLSV 250

Query: 275 AVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
            V+AL+WQ YLGG+IQY+C  S    NHAV +VGYD
Sbjct: 251 VVDALSWQDYLGGIIQYHC--SSKRSNHAVLVVGYD 284


>gi|342305190|dbj|BAK55649.1| cathepsin O [Oplegnathus fasciatus]
          Length = 338

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 100/276 (36%), Positives = 153/276 (55%), Gaps = 25/276 (9%)

Query: 37  FSSFQQRYKKSY--SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           F SF++ + + Y  +  E + R  NF+ +      LN    +P+SA+YGI  FSDLS++E
Sbjct: 42  FDSFREHFHRMYEVNGEEFNRRHLNFQNATKRHAYLNSLSTAPQSAKYGINRFSDLSQKE 101

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F+  +LR S ++  L S  K                      G+P K DWR+  ++  V+
Sbjct: 102 FRGLYLRASADRAPLFSGLK--------------------TEGLPAKFDWRDKAVVAPVQ 141

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           NQQ CG+CWAFS V   +S+HA+    L+ LSVQ+V+DC+   N GC+GG     L W+ 
Sbjct: 142 NQQACGSCWAFSVVGAMQSVHAIGGSPLAQLSVQQVLDCSFQ-NHGCNGGSPFRALTWLK 200

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
             +V L P+SEY    +   C   + S  GV +K++T       E +++  +  HGP+ A
Sbjct: 201 QTRVKLVPQSEYSYKAETGICHFFSQSHAGVAVKNFTAHDFSGQEEAMMGQLVEHGPLAA 260

Query: 275 AVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
            V+A++WQ YLGG+IQ++C    +  NHAV +VGY+
Sbjct: 261 IVDAVSWQDYLGGIIQHHCSSQWS--NHAVLVVGYN 294


>gi|401758200|gb|AFQ01135.1| cathepsin O1-like protease [Chilo suppressalis]
          Length = 371

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 112/292 (38%), Positives = 158/292 (54%), Gaps = 22/292 (7%)

Query: 34  LELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           L  F  + +++ KSY+  E   R +N+EKS+  I  LN      +S  +G+T+FSD +++
Sbjct: 42  LNSFIGYMKKFNKSYTDYEFMRRMRNYEKSVQEIRRLNTIH---DSKVFGLTKFSDWADD 98

Query: 94  EFKTRHLRHSVNK-------HVLMSHHKHHDH----HHNHVKKRSITTGITIPTG-IPVK 141
           EF    L     +         L    K+ +      +   K R+I   I+   G IPVK
Sbjct: 99  EFSAFMLSGRSERACKEQSMKCLPKRKKYQNFSPSIRYMMFKNRTIDVKISPTYGNIPVK 158

Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGC 201
            DWR+ G++  V NQ+ C ACWAFS V   ESM A+    L+ LS+QE+IDC+   N GC
Sbjct: 159 IDWRDFGVVSPVLNQKLCSACWAFSIVGVMESMVAIYKKGLTRLSIQELIDCSKYNN-GC 217

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK-RKATSPNGVKIKSYT--CDTLIPS 258
             GD    L ++  N   +  E EY L L+D +C+      PNG +I  Y   C+     
Sbjct: 218 HMGDIRLALQFLCQNDYPIVTEKEYSLTLRDESCRIPDDQKPNGERIAEYANLCNV---D 274

Query: 259 ESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           E  +L  IA HGPV+A+VNA  W+YY+GGVI+  C G+   +NHAVQIVGYD
Sbjct: 275 EKKLLKLIAMHGPVVASVNAAPWRYYIGGVIKSACPGTWHLVNHAVQIVGYD 326


>gi|348511930|ref|XP_003443496.1| PREDICTED: cathepsin O-like [Oreochromis niloticus]
          Length = 338

 Score =  183 bits (465), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 107/312 (34%), Positives = 165/312 (52%), Gaps = 32/312 (10%)

Query: 8   LFIVALIALCFLAIPVKVSKPN-------LEQKLELFSSFQQRYKKSY--SKSEHDIRFK 58
           +FI A++AL  L  PV     N       L      F +F++++ ++Y  S  E   R  
Sbjct: 6   VFIPAVVALGLLVSPVCCQNVNSSEIRTQLNGSAADFGAFRKQFHRTYEVSSEEFSRRHL 65

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
           +F+++      LN      +SA+YGI  FSDLS+EEF+  +L     +  L S       
Sbjct: 66  SFQRATIRHTYLNSFSTETQSAKYGINRFSDLSQEEFRDLYLGAVYERAPLFS------- 118

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
                       G+++   +P K DWR+   +  V++QQ CG+CWAFS V   +S+HA+ 
Sbjct: 119 ------------GLSVKE-LPDKFDWRDKAAVAAVQDQQACGSCWAFSVVGAIQSVHAIG 165

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
              L  LSVQ+V+DC+   N GC+GG     L+W+   +V L  +SEYP   K   C   
Sbjct: 166 GSQLEQLSVQQVVDCSYQ-NAGCNGGSTTRALNWLKQTRVKLVTQSEYPYKAKTEICHFF 224

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
           + S  GV IK++T       E +++  +  +GP++A V+A++WQ YLGG+IQ++C    +
Sbjct: 225 SQSHGGVAIKNFTTHDFSGQEKAMMGQLVQYGPLVAIVDAVSWQDYLGGIIQHHCSSQWS 284

Query: 299 NINHAVQIVGYD 310
             NHA+ IVGYD
Sbjct: 285 --NHAILIVGYD 294


>gi|195997891|ref|XP_002108814.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
 gi|190589590|gb|EDV29612.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
          Length = 333

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 117/313 (37%), Positives = 168/313 (53%), Gaps = 33/313 (10%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKN 59
           +F VK  LFI+   +L    +   V  P     L  F SF   Y ++Y+ K EH+ RF+ 
Sbjct: 5   VFIVKATLFILISTSL---VLSESVHSPT--DLLARFKSFITDYNRNYTTKEEHEFRFQT 59

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F+K+   I   N N      A YG+ +F+D ++EEFK       V    +++   HH   
Sbjct: 60  FKKNFRRIASTNAN-----GATYGVNKFADWTDEEFKELLGNRQVPTQEIVNSELHH--- 111

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWRE--AGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
                  S++T        P   DWRE    I+G VRNQ  CG CWAFSTVET  S  AL
Sbjct: 112 -------SLSTA-----KFPSSLDWREHKRNIVGPVRNQGRCGCCWAFSTVETIASAWAL 159

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
              + + LSVQ+++ C  N + GC GG F    +W+  N+V LE ES  P L K   C +
Sbjct: 160 AGNSFTELSVQQLLSC-DNMDGGCRGGSFYLACNWLTKNRVPLETESANPYLGKRDKCVK 218

Query: 238 KATSPNGVKIKSYTCDTLIPSESS-ILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
            AT+  G+ +K +T    I  ESS ++  +  +GP+  AV+A +W+ Y+GG+IQ++CDG 
Sbjct: 219 HATN-TGIILKKFTTSNFIYQESSSMIAALNQNGPLSIAVDATSWRDYVGGIIQHHCDGK 277

Query: 297 LANINHAVQIVGY 309
           +  +NHAVQ+VGY
Sbjct: 278 V--LNHAVQVVGY 288


>gi|312371319|gb|EFR19540.1| hypothetical protein AND_22253 [Anopheles darlingi]
          Length = 403

 Score =  180 bits (456), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 122/358 (34%), Positives = 179/358 (50%), Gaps = 48/358 (13%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS--EHDIRFK 58
           M +V  +L I+ ++ LCFL IP       + +  + F  F + Y K Y     E+D RF+
Sbjct: 1   MTEVIEMLMIILIVTLCFLMIPFNTKPSPVLEARKKFDIFVRLYDKPYRYDVREYDYRFQ 60

Query: 59  NFEKSLDIIEELNKNRQSP----ESARYGITEFSDLSEEEFKTRHLRHSVNKH---VLMS 111
            F  SL+ I +LN  R +     + A YG+T+++DL++ EF  +HL   +      V   
Sbjct: 61  IFRTSLNRIRQLNDRRSATGNETDGAIYGVTQYADLTDREFIAQHLADLLAAEEMAVPRL 120

Query: 112 HHKHHDHHHNHVKKRSITTGIT--------------------IPTGIPVKKDWREAGIIG 151
           H K+     +   K  I                         +PT +P   DWR  GII 
Sbjct: 121 HQKYAIESRSAEMKNDIIFSRARRDLPLKEQQQQQQQQQQQHLPTNLPPTVDWRAKGIIT 180

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV-------QEVIDCAGNGNMGCSGG 204
            V++Q +CGACWA S V+T  ++ A+K      L+        ++V+ CAGNGN GCSGG
Sbjct: 181 PVKSQGSCGACWAISVVDTIAALAAIKRNEQQPLTTPVTDLCHEQVVHCAGNGNNGCSGG 240

Query: 205 DFCALLDWMDVNKVVLEPESEYP---LLLKDAACKRKATSPNGV---------KIKSYTC 252
           D C LL+W+      +   +E P   L   D  C    +   G          ++K ++C
Sbjct: 241 DTCLLLEWLKQESFPIGAAAECPYRRLADTDQNCTLPGSVVAGAWQPGQHRETRVKRFSC 300

Query: 253 DTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           D     E  +L  +AT GP++AAVNA++W+YYLGGVIQY+CD     +NHAVQIVGY+
Sbjct: 301 DRFENREHLMLQHLATKGPLVAAVNAVSWKYYLGGVIQYHCDSGPQLLNHAVQIVGYE 358


>gi|443732032|gb|ELU16924.1| hypothetical protein CAPTEDRAFT_222012 [Capitella teleta]
          Length = 342

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 115/315 (36%), Positives = 169/315 (53%), Gaps = 31/315 (9%)

Query: 4   VKNVLF--IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKN 59
           V+ +LF  I  LI+ C     ++VS   ++   +LF  F ++Y K+Y     E+  R   
Sbjct: 5   VQQILFFLICVLISHC-----LRVSNEEID---DLFVKFTEKYHKTYLIGSLEYMHRRGI 56

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F  +      LN  R +  SA YG+T+FSDL++EEF  R L +      + +        
Sbjct: 57  FRDNFKKHVALNSLRTNNASAWYGVTQFSDLTQEEFTNRFLSNFTTSPTVPA-------- 108

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK- 178
                   +++G  I +  P K DWR+  +I  ++NQ +CG CWA++     ESMHALK 
Sbjct: 109 ----LPTLLSSGQLIDS-FPRKWDWRDKKVITSMKNQDSCGGCWAYAATAVLESMHALKV 163

Query: 179 NGTLSLLSVQEVIDCA---GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
            G L  LS Q++IDC+        GC GG+ CA L WM  N V L  E  YP + KD  C
Sbjct: 164 PGDLKSLSTQQMIDCSYGFAYALYGCKGGNPCAALHWMKQNNVGLISEKLYPTVNKDQKC 223

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
             K + P+ V + +Y+C   + SE S+L  I++ GPV  +V+A  W  Y GG+IQ++C G
Sbjct: 224 YIKKSKPDEVHVAAYSCQNFVGSEESLLRYISSVGPVAVSVDARMWINYQGGIIQHHC-G 282

Query: 296 SLANINHAVQIVGYD 310
            +++ NHAV IVGYD
Sbjct: 283 EVSS-NHAVTIVGYD 296


>gi|390344145|ref|XP_798313.2| PREDICTED: cathepsin O-like [Strongylocentrotus purpuratus]
          Length = 361

 Score =  174 bits (440), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 103/283 (36%), Positives = 156/283 (55%), Gaps = 26/283 (9%)

Query: 36  LFSSFQQRYKKSYSKS--EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
            F  F Q++ K+Y++   E+  R++ F++SL   E LN      + A YGIT+FSDL+ E
Sbjct: 53  FFQIFIQKFNKTYTRGSQEYFKRYRIFKESLLKHEMLNAIATHRDHATYGITKFSDLTSE 112

Query: 94  EFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWR--EAGI 149
           EF+ ++L   S+    +                RS+   +  P   +P+  D R  +  +
Sbjct: 113 EFQFQYLGTASIPDQSV----------------RSVPGPVRRPLKTMPLVYDLRSIKPPV 156

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCA 208
           +  V+NQ++CGACWAFS VET E+  ALK   L+ LS QE++DC    G+ GC GG  C 
Sbjct: 157 VTPVKNQKSCGACWAFSVVETMETQIALKTKRLTQLSAQELVDCGTAAGDGGCRGGIPCK 216

Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA- 267
            LDW++  K  L PES YP + K   C+    S     + +++C      E  ++  +  
Sbjct: 217 TLDWLNRTKTSLVPESTYPYIAKKGDCRINKNSTLNAVVTNFSCGNYAADEEHVMPAMLY 276

Query: 268 THGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
             GP+  +V+A +WQYYLGG+IQY+C  +   +NHAVQIVG+D
Sbjct: 277 NQGPLSISVDAESWQYYLGGIIQYHCTPTY--LNHAVQIVGFD 317


>gi|291232495|ref|XP_002736191.1| PREDICTED: cysteine protease and A protease inhibitor,
           putative-like [Saccoglossus kowalevskii]
          Length = 367

 Score =  173 bits (439), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 103/304 (33%), Positives = 170/304 (55%), Gaps = 22/304 (7%)

Query: 12  ALIALCFL--AIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEKSL-DI 66
            ++ LC+L  AI   V   N+++ ++ F  F  +++K Y    +E++ RF+ F++SL  I
Sbjct: 17  VVLVLCYLPCAIQYDVQPGNIDEDVQ-FKEFILKHRKPYIAGTTEYEHRFRVFQQSLHRI 75

Query: 67  IEELNKNRQSPESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVK 124
            + ++ +RQ  ++A YGIT+FSDL+ +EF+  +L  R S +  + +S           V+
Sbjct: 76  RKRISLSRQLNDTAVYGITQFSDLTPDEFQQMYLTLRPSKSSQIPVSL----------VQ 125

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
             S      +P  +P K D R+   +  V++Q +CG CW+FSTV+  E+   L  G ++ 
Sbjct: 126 FPSAFNSSNVPPDMPKKYDLRDKSAVSAVKDQGSCGGCWSFSTVQGMETKWVLNGGKMTE 185

Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           LSVQ++IDC  + + GC+GGD C  + W+    V L     YP       C+ K  +  G
Sbjct: 186 LSVQQLIDCDTSSS-GCAGGDTCIAMAWLKTKNVGLITSHNYPFTGHTGECRIKNYT-EG 243

Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAV 304
           V +K +TC   I  E  ++ ++  +G ++ A+NA +WQ YLGG+IQ++C       NHAV
Sbjct: 244 VHLKDFTCKEYIGKEDKMVENLYYNGSLVVALNARSWQDYLGGIIQHHCSAGFN--NHAV 301

Query: 305 QIVG 308
           QIVG
Sbjct: 302 QIVG 305


>gi|410956684|ref|XP_003984969.1| PREDICTED: cathepsin O [Felis catus]
          Length = 390

 Score =  170 bits (431), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 111/323 (34%), Positives = 168/323 (52%), Gaps = 41/323 (12%)

Query: 12  ALIALCFLAIPVKVSKP--NLEQKLEL------FSSFQQR---YKKSYSKSEHDIRFKNF 60
           A +A      P++ S+P   LE+   L       ++FQ R    KK+  K+E   +F +F
Sbjct: 51  AALASPAPGTPLRRSRPYSRLEEAPLLAVPWTALTAFQSRSFTVKKNQIKAEETRQF-SF 109

Query: 61  EKSLDIIEELNKNRQ-------SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHH 113
            +    +E L+++R           SA YGI +FS L  EEFK  +LR   ++       
Sbjct: 110 RRG--ALESLHRHRYLNSVFPGENSSAVYGINQFSHLFPEEFKAIYLRSKPSR------- 160

Query: 114 KHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
                    + +       +IP+  +P++ DWR+  ++ +VRNQQTCG CWAFS V   E
Sbjct: 161 ---------LPRYRAEVQTSIPSVSLPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVE 211

Query: 173 SMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD 232
           S +A+K   L  LSVQ+VIDC+ N N GC+GG     L+W++   V L  +SEYP   ++
Sbjct: 212 SAYAIKGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKTHVKLVRDSEYPFKAQN 270

Query: 233 AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
             C+  + S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++
Sbjct: 271 GLCRYFSDSHSGFPIKGYSAYDFSDQEDEMAKALVTFGPLVVVVDAVSWQDYLGGIIQHH 330

Query: 293 CDGSLANINHAVQIVGYDNYSRT 315
           C  S    NHAV I G+D    T
Sbjct: 331 C--SSGEANHAVLITGFDKIGNT 351


>gi|301777930|ref|XP_002924382.1| PREDICTED: cathepsin O-like [Ailuropoda melanoleuca]
          Length = 300

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 96/256 (37%), Positives = 137/256 (53%), Gaps = 27/256 (10%)

Query: 68  EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN++R           SA YGI +FS L  EEFK  +LR   ++              
Sbjct: 25  ESLNRHRYLNSVFPHENSSAVYGINQFSYLFPEEFKAIYLRSKSSR-------------- 70

Query: 121 NHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
             + +       +IP   +P++ DWR+  ++ +VRNQQTCG CWAFS V   ES +A+K 
Sbjct: 71  --LPRYRAEAQTSIPNVSLPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKG 128

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
             L  LSVQ+VIDC+ N N GCSGG   + L W++  +V L  +SEYP   ++  C   +
Sbjct: 129 EPLEALSVQQVIDCSYN-NYGCSGGSTVSALHWLNKTQVKLVRDSEYPFKAQNGLCHYFS 187

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S   
Sbjct: 188 DSQSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVVVDAVSWQDYLGGIIQHHC--SSGE 245

Query: 300 INHAVQIVGYDNYSRT 315
            NHAV I G+D    T
Sbjct: 246 ANHAVLITGFDKIGST 261


>gi|281354027|gb|EFB29611.1| hypothetical protein PANDA_013700 [Ailuropoda melanoleuca]
          Length = 266

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 96/256 (37%), Positives = 137/256 (53%), Gaps = 27/256 (10%)

Query: 68  EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN++R           SA YGI +FS L  EEFK  +LR   ++              
Sbjct: 1   ESLNRHRYLNSVFPHENSSAVYGINQFSYLFPEEFKAIYLRSKSSR-------------- 46

Query: 121 NHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
             + +       +IP   +P++ DWR+  ++ +VRNQQTCG CWAFS V   ES +A+K 
Sbjct: 47  --LPRYRAEAQTSIPNVSLPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKG 104

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
             L  LSVQ+VIDC+ N N GCSGG   + L W++  +V L  +SEYP   ++  C   +
Sbjct: 105 EPLEALSVQQVIDCSYN-NYGCSGGSTVSALHWLNKTQVKLVRDSEYPFKAQNGLCHYFS 163

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S   
Sbjct: 164 DSQSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVVVDAVSWQDYLGGIIQHHC--SSGE 221

Query: 300 INHAVQIVGYDNYSRT 315
            NHAV I G+D    T
Sbjct: 222 ANHAVLITGFDKIGST 237


>gi|426345827|ref|XP_004040600.1| PREDICTED: cathepsin O [Gorilla gorilla gorilla]
          Length = 321

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 99/258 (38%), Positives = 139/258 (53%), Gaps = 21/258 (8%)

Query: 60  FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
           F +SL+    LN    S  S A YGI +FS L  EEFK  +LR   +K    S   H   
Sbjct: 44  FRESLNRHRYLNSLFPSENSTAFYGINQFSHLFPEEFKAIYLRSKPSKFPRYSAEVH--- 100

Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
                        ++IP   +P++ DWR+  ++ +VRNQQ CG CWAFS V   ES +A+
Sbjct: 101 -------------MSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAI 147

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
           K   L  LSVQ+VIDC+ N N GC+GG     L+W++  +V L  +SEYP   ++  C  
Sbjct: 148 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 206

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
            + S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S 
Sbjct: 207 FSGSHSGFSIKGYSAHDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 264

Query: 298 ANINHAVQIVGYDNYSRT 315
              NHAV I G+D    T
Sbjct: 265 GEANHAVLITGFDKTGST 282


>gi|4557501|ref|NP_001325.1| cathepsin O preproprotein [Homo sapiens]
 gi|1168795|sp|P43234.1|CATO_HUMAN RecName: Full=Cathepsin O; Flags: Precursor
 gi|574804|emb|CAA54562.1| cathepsin O [Homo sapiens]
 gi|29351630|gb|AAH49206.1| Cathepsin O [Homo sapiens]
 gi|312153238|gb|ADQ33131.1| cathepsin O [synthetic construct]
          Length = 321

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 99/258 (38%), Positives = 139/258 (53%), Gaps = 21/258 (8%)

Query: 60  FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
           F +SL+    LN    S  S A YGI +FS L  EEFK  +LR   +K    S   H   
Sbjct: 44  FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH--- 100

Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
                        ++IP   +P++ DWR+  ++ +VRNQQ CG CWAFS V   ES +A+
Sbjct: 101 -------------MSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAI 147

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
           K   L  LSVQ+VIDC+ N N GC+GG     L+W++  +V L  +SEYP   ++  C  
Sbjct: 148 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 206

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
            + S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S 
Sbjct: 207 FSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 264

Query: 298 ANINHAVQIVGYDNYSRT 315
              NHAV I G+D    T
Sbjct: 265 GEANHAVLITGFDKTGST 282


>gi|119625288|gb|EAX04883.1| cathepsin O [Homo sapiens]
          Length = 336

 Score =  167 bits (422), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 99/258 (38%), Positives = 139/258 (53%), Gaps = 21/258 (8%)

Query: 60  FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
           F +SL+    LN    S  S A YGI +FS L  EEFK  +LR   +K    S   H   
Sbjct: 59  FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH--- 115

Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
                        ++IP   +P++ DWR+  ++ +VRNQQ CG CWAFS V   ES +A+
Sbjct: 116 -------------MSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAI 162

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
           K   L  LSVQ+VIDC+ N N GC+GG     L+W++  +V L  +SEYP   ++  C  
Sbjct: 163 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 221

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
            + S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S 
Sbjct: 222 FSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 279

Query: 298 ANINHAVQIVGYDNYSRT 315
              NHAV I G+D    T
Sbjct: 280 GEANHAVLITGFDKTGST 297


>gi|397504019|ref|XP_003822607.1| PREDICTED: cathepsin O [Pan paniscus]
          Length = 321

 Score =  167 bits (422), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 99/258 (38%), Positives = 139/258 (53%), Gaps = 21/258 (8%)

Query: 60  FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
           F +SL+    LN    S  S A YGI +FS L  EEFK  +LR   +K    S   H   
Sbjct: 44  FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH--- 100

Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
                        ++IP   +P++ DWR+  ++ +VRNQQ CG CWAFS V   ES +A+
Sbjct: 101 -------------MSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAI 147

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
           K   L  LSVQ+VIDC+ N N GC+GG     L+W++  +V L  +SEYP   ++  C  
Sbjct: 148 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 206

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
            + S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S 
Sbjct: 207 FSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 264

Query: 298 ANINHAVQIVGYDNYSRT 315
              NHAV I G+D    T
Sbjct: 265 GEANHAVLITGFDKTGST 282


>gi|351707349|gb|EHB10268.1| Cathepsin O, partial [Heterocephalus glaber]
          Length = 266

 Score =  166 bits (421), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 94/256 (36%), Positives = 136/256 (53%), Gaps = 27/256 (10%)

Query: 68  EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN++R           +A YGI +FS L  EEFK  +LR   ++              
Sbjct: 1   ESLNRHRYLNSLFPHENSTAFYGINQFSYLFPEEFKAIYLRSKPSR-------------- 46

Query: 121 NHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
               K +     ++P T +P++ DWR   ++ +VRNQQ CG CWAFS V   ES  A++ 
Sbjct: 47  --FPKYAAKVQASVPNTPLPLRFDWRNKHVVTQVRNQQMCGGCWAFSVVGAVESAWAIRG 104

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
           G L  LS Q+VIDC+ N N GC+GG   + L W++  +V L  +SEYP   +D  C   +
Sbjct: 105 GPLEDLSAQQVIDCSYN-NYGCNGGSPLSALSWLNKTRVKLVRDSEYPFKAQDGPCHYFS 163

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            S  G+ I+ Y+       E+ +   +  HGP++  V+A++WQ YLGGVIQ++C    A 
Sbjct: 164 QSQPGLSIQGYSAYDFSGQEAEMARALLAHGPLVVIVDAVSWQDYLGGVIQHHCSSGRA- 222

Query: 300 INHAVQIVGYDNYSRT 315
            NHAV I G+D    T
Sbjct: 223 -NHAVLITGFDRTDST 237


>gi|114596533|ref|XP_517502.2| PREDICTED: cathepsin O [Pan troglodytes]
 gi|410212082|gb|JAA03260.1| cathepsin O [Pan troglodytes]
 gi|410330245|gb|JAA34069.1| cathepsin O [Pan troglodytes]
          Length = 318

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 99/258 (38%), Positives = 139/258 (53%), Gaps = 21/258 (8%)

Query: 60  FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
           F +SL+    LN    S  S A YGI +FS L  EEFK  +LR   +K    S   H   
Sbjct: 41  FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH--- 97

Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
                        ++IP   +P++ DWR+  ++ +VRNQQ CG CWAFS V   ES +A+
Sbjct: 98  -------------MSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAI 144

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
           K   L  LSVQ+VIDC+ N N GC+GG     L+W++  +V L  +SEYP   ++  C  
Sbjct: 145 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 203

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
            + S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S 
Sbjct: 204 FSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 261

Query: 298 ANINHAVQIVGYDNYSRT 315
              NHAV I G+D    T
Sbjct: 262 GEANHAVLITGFDKTGST 279


>gi|332217574|ref|XP_003257933.1| PREDICTED: cathepsin O [Nomascus leucogenys]
          Length = 318

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 100/258 (38%), Positives = 138/258 (53%), Gaps = 21/258 (8%)

Query: 60  FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
           F +SL+    LN    S  S A YGI +FS L  EEFK  +LR   +K    S   H   
Sbjct: 41  FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH--- 97

Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
                        ++IP   +P+K DWR+  ++ +VRNQQ CG CWAFS V   ES +A+
Sbjct: 98  -------------MSIPNVSLPLKFDWRDKHVVTQVRNQQMCGGCWAFSVVGAVESAYAI 144

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
           K   L  LSVQ+VIDC+ N N GC+GG     L+W++  +V L  +SEYP   ++  C  
Sbjct: 145 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 203

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
              S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S 
Sbjct: 204 FLGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 261

Query: 298 ANINHAVQIVGYDNYSRT 315
              NHAV I G+D    T
Sbjct: 262 GEANHAVLITGFDKTGST 279


>gi|291401083|ref|XP_002716930.1| PREDICTED: cathepsin O [Oryctolagus cuniculus]
          Length = 309

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 97/250 (38%), Positives = 137/250 (54%), Gaps = 25/250 (10%)

Query: 68  EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN++R           +A YGI +FS L  EEFK  +LR         S       + 
Sbjct: 34  ESLNRHRYLNSFFSHENSTAFYGINQFSYLFPEEFKAIYLR---------SQPSSSPRYP 84

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             VK    T+ +T+P  +P++ DWR+  ++ +VRNQQ CG CWAFS V   ES  A+K  
Sbjct: 85  AEVK----TSLLTVP--LPLRFDWRDKHVVSQVRNQQMCGGCWAFSVVGAVESTWAIKGH 138

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L  LSVQ+VIDC+ N N GCSGG   + L W++  +V L  +SEYP   +   C    +
Sbjct: 139 PLEDLSVQQVIDCSYN-NYGCSGGSTLSALKWLNKTQVRLVNDSEYPFKARSGLCHYFPS 197

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
           S +G+ IK Y+       E  +   +  +GP++  V+A++WQ YLGGVIQ++C  S    
Sbjct: 198 SHSGLSIKGYSAYDFSDQEDEMAKSLLIYGPLVVIVDAVSWQDYLGGVIQHHC--SSGEA 255

Query: 301 NHAVQIVGYD 310
           NHAV I G+D
Sbjct: 256 NHAVLITGFD 265


>gi|355681662|gb|AER96817.1| Cathepsin O precursor [Mustela putorius furo]
          Length = 265

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 91/238 (38%), Positives = 130/238 (54%), Gaps = 20/238 (8%)

Query: 79  SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TG 137
           SA YGI +FS L  EEFK  +LR   ++                + +       +IP   
Sbjct: 19  SAIYGINQFSYLFPEEFKAIYLRSKSSR----------------LPRYRTEVQTSIPNVS 62

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
           +P + DWR+  ++ +VRNQQTCG CWAFS V   ES +A+K   L  LSVQ+VIDC+ N 
Sbjct: 63  LPSRFDWRDKHVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYN- 121

Query: 198 NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
           N GC GG   + L+W++  +V L  +SEYP   ++  C   + S +G  IK Y+      
Sbjct: 122 NYGCQGGSTLSALNWLNKTQVRLVRDSEYPFKAQNGLCHYFSDSQSGFSIKGYSAYDFSD 181

Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
            E  +   + T GP++  V+A++WQ YLGG+IQ++C  S    NHAV I G+D    T
Sbjct: 182 QEDEMAKALLTFGPLVVVVDAVSWQDYLGGIIQHHC--SSGEANHAVLITGFDKIGNT 237


>gi|345780796|ref|XP_539782.3| PREDICTED: cathepsin O [Canis lupus familiaris]
          Length = 456

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 94/255 (36%), Positives = 140/255 (54%), Gaps = 25/255 (9%)

Query: 68  EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN++R       +   SA YGI +FS LS EEFK  +LR   ++       ++     
Sbjct: 181 ESLNRHRYLNSVFPRENSSAVYGINQFSYLSPEEFKAIYLRSKPSRS-----PRYPAEVR 235

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             ++  S+          P++ DWR+  ++ +VRNQQTCG CWAFS V   ES +A+K  
Sbjct: 236 TSIRNVSL----------PLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGK 285

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L+ +SVQ+VIDC+ N N GCSGG     L+W++  +V L  +SEYP   ++  C   + 
Sbjct: 286 PLADISVQQVIDCSYN-NYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQNGLCHYFSD 344

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
           S +G  I+ Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S    
Sbjct: 345 SYSGFSIRGYSAYDFSDQEDEMAKVLLTFGPLVVVVDAVSWQDYLGGIIQHHC--SSGEA 402

Query: 301 NHAVQIVGYDNYSRT 315
           NHAV I G+D    T
Sbjct: 403 NHAVLITGFDKIGST 417


>gi|395735444|ref|XP_002815290.2| PREDICTED: cathepsin O [Pongo abelii]
          Length = 318

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 98/258 (37%), Positives = 139/258 (53%), Gaps = 21/258 (8%)

Query: 60  FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
           F +SL+    LN    S  S A YGI +FS L  EEFK  +LR   +K            
Sbjct: 41  FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSK------------ 88

Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
                 + S    ++IP   +P++ DWR+  ++ +VRNQQ CG CWAFS V   ES +A+
Sbjct: 89  ----FPRYSAEVRMSIPNVSLPLRFDWRDKHVVTQVRNQQMCGGCWAFSVVGAVESAYAI 144

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
           K   L  LSVQ+VIDC+ N N GC+GG     L+W++  +V L  +SEYP   ++  C  
Sbjct: 145 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 203

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
            + S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S 
Sbjct: 204 FSGSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 261

Query: 298 ANINHAVQIVGYDNYSRT 315
              NHAV I G+D    T
Sbjct: 262 GEANHAVLITGFDKTGST 279


>gi|402870704|ref|XP_003899346.1| PREDICTED: cathepsin O [Papio anubis]
          Length = 321

 Score =  164 bits (415), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 99/255 (38%), Positives = 137/255 (53%), Gaps = 25/255 (9%)

Query: 68  EELNKNRQ----SP---ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN++R     SP    +A YGI +FS L  EEFK  +LR   +K    S   H     
Sbjct: 46  ESLNRHRYLNSLSPGENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH----- 100

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                RSI          P++ DWR+  ++ +VRNQQTCG CWAFS V   ES +A+K  
Sbjct: 101 -----RSIPN-----VSWPLRFDWRDKHVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGK 150

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L  LSVQ+VIDC+   N GC+GG     L+W++  +V L  +SEYP   ++  C   + 
Sbjct: 151 PLEDLSVQQVIDCSYT-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSG 209

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
           S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S    
Sbjct: 210 SHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SSGEA 267

Query: 301 NHAVQIVGYDNYSRT 315
           NHAV I G+D    T
Sbjct: 268 NHAVLITGFDKTGST 282


>gi|355687683|gb|EHH26267.1| hypothetical protein EGK_16186 [Macaca mulatta]
 gi|384945482|gb|AFI36346.1| cathepsin O preproprotein [Macaca mulatta]
          Length = 321

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 99/255 (38%), Positives = 137/255 (53%), Gaps = 25/255 (9%)

Query: 68  EELNKNRQ----SP---ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN++R     SP    +A YGI +FS L  EEFK  +LR   +K    S   H     
Sbjct: 46  ESLNRHRYLNSLSPGENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH----- 100

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                RSI          P++ DWR+  ++ +VRNQQTCG CWAFS V   ES +A+K  
Sbjct: 101 -----RSIPN-----VSWPLRFDWRDKHVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGK 150

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L  LSVQ+VIDC+   N GC+GG     L+W++  +V L  +SEYP   ++  C   + 
Sbjct: 151 PLEDLSVQQVIDCSYT-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSG 209

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
           S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S    
Sbjct: 210 SHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SSGEA 267

Query: 301 NHAVQIVGYDNYSRT 315
           NHAV I G+D    T
Sbjct: 268 NHAVLITGFDKTGST 282


>gi|395542489|ref|XP_003773162.1| PREDICTED: cathepsin O-like [Sarcophilus harrisii]
          Length = 407

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 99/274 (36%), Positives = 145/274 (52%), Gaps = 25/274 (9%)

Query: 49  SKSEHDIRFKNFEKSLDIIEELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLR 101
           S+S  D   ++ ++S    E L ++R       ++  SA YGI +FS L  EEF+  +LR
Sbjct: 113 SRSRLDSPERSEKRSAAFRESLKRHRYLNSFSSRANTSAIYGINQFSHLFPEEFRAIYLR 172

Query: 102 HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGA 161
              ++  L  +HK       H+              +P++ DWR+  ++ KVRNQQ CG 
Sbjct: 173 SKPSQLPL--YHKELKMPATHMP-------------LPIRFDWRDKNVVTKVRNQQMCGG 217

Query: 162 CWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLE 221
           CWAFS V   ES +A+K  +L  LSVQ+VIDC+ N N GCSGG     L+W++  +V L 
Sbjct: 218 CWAFSVVGGIESAYAIKGESLEDLSVQQVIDCSYN-NFGCSGGSTVNALNWLNKTQVRLV 276

Query: 222 PESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTW 281
            +SEY    +   C   + S  GV IK Y+       E  +   +  +GP+   V+A++W
Sbjct: 277 RDSEYSFKAQTGLCHYFSGSHAGVSIKGYSSYDFSDKEDEMAKVLLAYGPLAVIVDAISW 336

Query: 282 QYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
           Q YLGG+IQ++C  S    NHAV I G+D    T
Sbjct: 337 QDYLGGIIQHHC--SSGEANHAVLITGFDKTGNT 368


>gi|189528132|ref|XP_695717.3| PREDICTED: cathepsin O [Danio rerio]
          Length = 334

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 104/310 (33%), Positives = 160/310 (51%), Gaps = 28/310 (9%)

Query: 6   NVLFIVALIALCFLA--IPVKVSKPNLEQ--KLELFSSFQQRYKKSYSKSEHDIRFKNFE 61
           ++ FIV +I    L   I V+V + +L +  +L+   +FQQ       +     R+ N++
Sbjct: 4   SLTFIVLIIYQELLTGIISVEVIRKSLTEGERLQHSDTFQQDVNNELYQ-----RWINYQ 58

Query: 62  KSLDIIEELNKN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
            SL     LN    +S +SA+YG+ +FS LS+++FK ++L             K      
Sbjct: 59  SSLQRQAFLNSALGKSNQSAQYGVNQFSYLSQKQFKEQYLTARAEAAPKFDQSKSE---- 114

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                      I +    P + DWR+ G++G V NQ +CG CWAFS VE  ES+ A    
Sbjct: 115 -----------IKVKANNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIESVSAKGGE 163

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L  LSVQ+VIDC+   N GC+GG     L W+  +K+ L  E+EYP    D  C+    
Sbjct: 164 KLQQLSVQQVIDCSYQ-NQGCNGGSPVEALYWLTQSKLKLVSEAEYPFKGADGVCQFFPQ 222

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
           +  GV +++Y+       E  +++ +   GP++  V+A++WQ YLGG+IQ++C    A  
Sbjct: 223 AHAGVAVRNYSAYDFSGQEEVMMSALVDFGPLVVIVDAISWQDYLGGIIQHHCSSHKA-- 280

Query: 301 NHAVQIVGYD 310
           NHAV I GYD
Sbjct: 281 NHAVLITGYD 290


>gi|403272508|ref|XP_003928101.1| PREDICTED: cathepsin O [Saimiri boliviensis boliviensis]
          Length = 465

 Score =  164 bits (414), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 94/256 (36%), Positives = 138/256 (53%), Gaps = 27/256 (10%)

Query: 68  EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN++R           +A YGI +FS L  EEFK  +LR   +K+             
Sbjct: 190 ESLNRHRYLNSLFPNENSTAFYGINQFSYLFPEEFKAIYLRSKPSKY------------- 236

Query: 121 NHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
               + S    ++IP   +P++ DWR+  ++ +VRNQQ CG CWAFS V   ES  A+K 
Sbjct: 237 ---PRYSAEVRMSIPNVSLPLRFDWRDKHVVTQVRNQQMCGGCWAFSVVGAVESACAIKG 293

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
             L  LSVQ+VIDC+ N N GC+GG   + L+W++  +V L  +SEYP   ++  C   +
Sbjct: 294 KPLEDLSVQQVIDCSYN-NYGCNGGSTLSALNWLNKMQVKLVKDSEYPFKAQNGLCHYFS 352

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S   
Sbjct: 353 GSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SSGE 410

Query: 300 INHAVQIVGYDNYSRT 315
            NHAV + G+D    T
Sbjct: 411 ANHAVLVTGFDKTGST 426


>gi|297293584|ref|XP_001093045.2| PREDICTED: cathepsin O [Macaca mulatta]
          Length = 421

 Score =  163 bits (413), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 99/255 (38%), Positives = 137/255 (53%), Gaps = 25/255 (9%)

Query: 68  EELNKNRQ----SP---ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN++R     SP    +A YGI +FS L  EEFK  +LR   +K    S   H     
Sbjct: 146 ESLNRHRYLNSLSPGENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH----- 200

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                RSI          P++ DWR+  ++ +VRNQQTCG CWAFS V   ES +A+K  
Sbjct: 201 -----RSIPN-----VSWPLRFDWRDKHVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGK 250

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L  LSVQ+VIDC+   N GC+GG     L+W++  +V L  +SEYP   ++  C   + 
Sbjct: 251 PLEDLSVQQVIDCSYT-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSG 309

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
           S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S    
Sbjct: 310 SHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SSGEA 367

Query: 301 NHAVQIVGYDNYSRT 315
           NHAV I G+D    T
Sbjct: 368 NHAVLITGFDKTGST 382


>gi|344239864|gb|EGV95967.1| Cathepsin O [Cricetulus griseus]
          Length = 291

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 92/259 (35%), Positives = 136/259 (52%), Gaps = 18/259 (6%)

Query: 57  FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
           F  F++SL+    LN       SA YG+ +FS LS EEFK  +L                
Sbjct: 12  FLFFQESLNRHRYLNSFSHDNSSASYGLNQFSYLSPEEFKALYL----------GSKPAW 61

Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
              +   +++ I         +P++ DWR+  ++ +VRNQ+ CG CWAFS V   ES  A
Sbjct: 62  SPRYPAAEQKPIPN-----VSLPLRFDWRDKHVVNQVRNQKMCGGCWAFSVVTAIESACA 116

Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
           ++   L  LSVQ+VIDC+ N N GCSGG   + L W++  +V L  +SEYP   ++  C+
Sbjct: 117 IQGKPLDYLSVQQVIDCSFN-NYGCSGGSPLSALSWLNKTQVKLMEDSEYPFKAENGLCR 175

Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
               S +GV IK ++       E  +   +   GP++  V+A++WQ YLGG+IQ++C  S
Sbjct: 176 YFPQSQSGVSIKDFSAYDFSGQEDEMAKALLNFGPLVVIVDAVSWQDYLGGIIQHHC--S 233

Query: 297 LANINHAVQIVGYDNYSRT 315
               NHAV I G+D    T
Sbjct: 234 SGEANHAVLITGFDKTGNT 252


>gi|296195327|ref|XP_002745330.1| PREDICTED: cathepsin O [Callithrix jacchus]
          Length = 453

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 95/256 (37%), Positives = 136/256 (53%), Gaps = 27/256 (10%)

Query: 68  EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN++R           +A YGI +FS L  EEFK  +LR    K+   S   H     
Sbjct: 178 ESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPFKYRRYSAEVH----- 232

Query: 121 NHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
                      ++IP   +P++ DWR+  ++ +VRNQQ CG CWAFS V   ES  A+K 
Sbjct: 233 -----------MSIPNVSLPLRFDWRDKHVVTQVRNQQMCGGCWAFSVVGAVESACAIKG 281

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
             L  LSVQ+VIDC+ N N GC+GG     L+W++  +V L  +SEYP   ++  C   +
Sbjct: 282 KPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFS 340

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C    A 
Sbjct: 341 GSHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHCSSGEA- 399

Query: 300 INHAVQIVGYDNYSRT 315
            NHAV + G+D    T
Sbjct: 400 -NHAVLVTGFDKTGST 414


>gi|355749637|gb|EHH54036.1| hypothetical protein EGM_14772, partial [Macaca fascicularis]
          Length = 311

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 98/255 (38%), Positives = 137/255 (53%), Gaps = 25/255 (9%)

Query: 68  EELNKNRQ----SP---ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN++R     SP    +A YGI +FS L  EEFK  +LR   +K    S   H     
Sbjct: 36  ESLNRHRYLNSLSPGENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH----- 90

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                RSI          P++ DW++  ++ +VRNQQTCG CWAFS V   ES +A+K  
Sbjct: 91  -----RSIPN-----VSWPLRFDWQDKHVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGK 140

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L  LSVQ+VIDC+   N GC+GG     L+W++  +V L  +SEYP   ++  C   + 
Sbjct: 141 PLEDLSVQQVIDCSYT-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSG 199

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
           S +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S    
Sbjct: 200 SHSGFSIKGYSAYDFSNQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SSGEA 257

Query: 301 NHAVQIVGYDNYSRT 315
           NHAV I G+D    T
Sbjct: 258 NHAVLITGFDKTGST 272


>gi|449272742|gb|EMC82496.1| Cathepsin O, partial [Columba livia]
          Length = 275

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 96/255 (37%), Positives = 131/255 (51%), Gaps = 28/255 (10%)

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
            ++S   I  LN + +   +A YGI +FS L  EEFK  +LR   +K             
Sbjct: 1   LQESTKRIRLLNSSSKDNMTAFYGINQFSHLFPEEFKAIYLRSIPHK------------- 47

Query: 120 HNHVKKRSITTGITIPTG----IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
                   +   + +P G    +P K DWR+  +I +VRNQQTCG CWAFS V   ES +
Sbjct: 48  --------LPRYLKVPKGEEKPLPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAY 99

Query: 176 ALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
           A+K   L  LSVQ+VIDC+ N N GCSGG   + L W++  KV L  +SEY    +   C
Sbjct: 100 AIKGHNLEELSVQQVIDCSYN-NYGCSGGSTVSALSWLNQTKVKLVRDSEYAFKAQTGLC 158

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
                S  GV I  +        E  ++  +   GP+   V+A++WQ YLGG+IQY+C  
Sbjct: 159 HYFGHSDFGVSITGFAAYDFSGQEEEMMRMLVNWGPLAVTVDAVSWQDYLGGIIQYHCSS 218

Query: 296 SLANINHAVQIVGYD 310
             A  NHAV I G+D
Sbjct: 219 GRA--NHAVLITGFD 231


>gi|224049669|ref|XP_002196637.1| PREDICTED: cathepsin O [Taeniopygia guttata]
          Length = 299

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 94/247 (38%), Positives = 129/247 (52%), Gaps = 26/247 (10%)

Query: 67  IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR---HSVNKHVLMSHHKHHDHHHNHV 123
           I  LN   +   +A YGI +FS L  EEFK  +LR   H + +++ +   K         
Sbjct: 32  IRLLNSLAKDNTTAVYGINQFSHLFPEEFKAIYLRSIPHKLPRYIKVPKGKEKP------ 85

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
                         +P K DWR+  +I +VRNQQTCG CWAFS V   ES +A+K  TL 
Sbjct: 86  --------------LPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIKRNTLE 131

Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
            LSVQ+VIDC+ N N GC+GG   + L W++  KV L  +SEY    +   C     S  
Sbjct: 132 ELSVQQVIDCSYN-NYGCNGGSTVSALSWLNQTKVKLVRDSEYTFKAQTGLCHYFERSDF 190

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHA 303
           GV I  +        E  ++  + + GP+   V+A++WQ YLGG+IQY+C    A  NHA
Sbjct: 191 GVSITGFAAYDFSGQEEEMMRMLVSWGPLAVTVDAVSWQDYLGGIIQYHCSSGRA--NHA 248

Query: 304 VQIVGYD 310
           V I G+D
Sbjct: 249 VLITGFD 255


>gi|395861575|ref|XP_003803057.1| PREDICTED: cathepsin O [Otolemur garnettii]
          Length = 320

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 98/258 (37%), Positives = 138/258 (53%), Gaps = 21/258 (8%)

Query: 60  FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
           F +SL+    LN    S  S A YGI +FS L  EEFK  +LR   +K            
Sbjct: 43  FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSK------------ 90

Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
                 +      + IP   +P++ DWR+  ++ +VRNQQTCG CWAFS V   ES  A+
Sbjct: 91  ----FPRYPAELQMPIPNVSLPLRFDWRDKHVVTQVRNQQTCGGCWAFSVVGAVESACAI 146

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
           K   L  LSVQ+VIDC+ N N GC+GG     L+W++  +V L  +SEYP   ++  C  
Sbjct: 147 KGEPLEDLSVQQVIDCSYN-NYGCNGGSTVNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 205

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
            + S +G+ IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S 
Sbjct: 206 FSGSHSGISIKDYSEYDFNEQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 263

Query: 298 ANINHAVQIVGYDNYSRT 315
              NHAV I G+D    T
Sbjct: 264 GEANHAVLITGFDKTGST 281


>gi|213512532|ref|NP_001134063.1| Cathepsin O precursor [Salmo salar]
 gi|209730446|gb|ACI66092.1| Cathepsin O precursor [Salmo salar]
          Length = 341

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 104/323 (32%), Positives = 154/323 (47%), Gaps = 51/323 (15%)

Query: 8   LFIVALIALCFLAIP---------VKVSKPNLEQKLEL-FSSFQQRYKKSY---SKSEHD 54
           LF++ L+ L  L  P           + K N     ++ F SF++++ ++Y   S   H 
Sbjct: 6   LFLMFLLNLGILTFPDVARCSGVWKTIRKSNCSAGTDVDFESFREQFHRNYKLHSDCYHR 65

Query: 55  IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
            R   F+ S+     LN      +SA+YGI +FSDLS  EF+  +L              
Sbjct: 66  RR-SYFKNSIKRHAYLNSLSTDKDSAKYGINQFSDLSIHEFRELYL-------------- 110

Query: 115 HHDHHHNHVKKRSITTGITIP-------TGIPVKKDWREAGIIGKVRNQQTCGACWAFST 167
                          T  T+P        G+P K DWR    +G V+NQQ CG CWAFS 
Sbjct: 111 -------------TATAETVPPYSGLKTEGLPAKFDWRVKAAVGSVQNQQACGGCWAFSV 157

Query: 168 VETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYP 227
           V   ES++A        LSVQ+VIDC+   N GC+GG     L W+   +V L  +SEYP
Sbjct: 158 VGAIESVYAKSGQPFKQLSVQQVIDCSYK-NQGCNGGSITRALSWLKQTRVKLVKQSEYP 216

Query: 228 LLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGG 287
              +   C   + S +GV +K +        E +++  +   GP+   V+A++WQ YLGG
Sbjct: 217 YKAETGICHLFSQSHDGVLVKDFAAHDYSGHEEAMMGRLVEWGPLAVTVDAISWQDYLGG 276

Query: 288 VIQYNCDGSLANINHAVQIVGYD 310
           ++Q++C  S  + NHAV + GYD
Sbjct: 277 IMQHHC--SCHHANHAVLVTGYD 297


>gi|432961003|ref|XP_004086527.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin O-like [Oryzias latipes]
          Length = 333

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 87/278 (31%), Positives = 142/278 (51%), Gaps = 27/278 (9%)

Query: 37  FSSFQQRYKKSYSKSEHDI--RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           F  F++ + + +  S      R  +F+++      LN      +SA YG  +FSDLS+EE
Sbjct: 37  FDKFRKNFNRLFDGSGDQFKRRLLHFQEAAVRHTHLNSFSTEAQSATYGFNQFSDLSQEE 96

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKV 153
           F+  +L+ +  +    S                      +P  G+P + DWR+  ++  V
Sbjct: 97  FRGIYLQATSGRAPPFS---------------------GLPAEGLPARFDWRDKAVVAAV 135

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           ++Q  CG+CWAFS V   +S  A+    L  LSVQ+++DC+   N GC GG   A L W+
Sbjct: 136 QDQLACGSCWAFSVVGAVQSARAVGGSRLQRLSVQQLLDCSFT-NKGCGGGSPTAALSWL 194

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
              +  L   +EYP   +   C+  + +  GV +K++T       E +++  +  HGP++
Sbjct: 195 LQTREKLVTAAEYPYQAEAQICRFFSQTHQGVAVKNFTVHNFRGQEPAMMAQLVEHGPLV 254

Query: 274 AAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
           A V+A++WQ YLGG+IQ++C       NHAV +VGYD 
Sbjct: 255 AVVDAVSWQDYLGGIIQHHCSSQWP--NHAVLVVGYDT 290


>gi|344293694|ref|XP_003418556.1| PREDICTED: cathepsin O-like [Loxodonta africana]
          Length = 327

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 95/255 (37%), Positives = 136/255 (53%), Gaps = 25/255 (9%)

Query: 68  EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN++R           +A YGI +FS L  EEFK  +LR   ++             +
Sbjct: 52  ESLNRHRYLNSLFPNENSTASYGINQFSYLFPEEFKAIYLRSKPSRF----------PRY 101

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
               + SIT        +PV+ DWRE  ++ +VRNQ+ CG CWAFS V   ES  A+K  
Sbjct: 102 PTDLQMSITN-----VSLPVRFDWREKHVVTQVRNQKMCGGCWAFSVVGAVESACAIKGE 156

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L  LSVQ+VIDC+   N GC+GG   + L+W++  +V L  +SEYP   ++  C+  + 
Sbjct: 157 PLEDLSVQQVIDCS-YSNYGCNGGSTLSALNWLNKMQVKLVKDSEYPFKAQNGLCQYFSV 215

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
           S +G  IK Y+       E  +   + T GP+I  V+A++WQ YLGGVIQ++C  S    
Sbjct: 216 SHSGFSIKGYSAYDFSDREDEMAKALLTFGPLIVVVDAVSWQDYLGGVIQHHC--SSGEA 273

Query: 301 NHAVQIVGYDNYSRT 315
           NHAV + G+D    T
Sbjct: 274 NHAVLVTGFDTTGST 288


>gi|426247636|ref|XP_004017585.1| PREDICTED: cathepsin O [Ovis aries]
          Length = 288

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 93/251 (37%), Positives = 138/251 (54%), Gaps = 18/251 (7%)

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           +++SL+    LN       +A YGI +FS L  EEFK  +LR S ++       ++    
Sbjct: 12  WQESLNRQRYLNSFPHENSTAVYGINQFSYLFPEEFKAIYLRSSPSRFPRFPAEEY---- 67

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
                  SI+        +P+K DWR+  +I +VRNQ+TCG CWAFS V   ES+ A+K 
Sbjct: 68  ------TSISN-----LSLPLKFDWRDKHVITQVRNQKTCGGCWAFSVVGAVESVCAIKG 116

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
             L +LSVQ+VIDC+   N GC+GG     L W++  +V L  +SEYP   ++  C+  +
Sbjct: 117 QPLEVLSVQQVIDCS-YSNYGCNGGSPLNALYWLNKLQVKLVRDSEYPFQAQNGLCRYFS 175

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            S +G  IK Y+       E  +   +   GP+I  V+A++WQ YLGG+IQ++C  S   
Sbjct: 176 DSHSGSSIKGYSAYDFSGQEDKMAKALLALGPLIVVVDAMSWQDYLGGIIQHHC--SSGE 233

Query: 300 INHAVQIVGYD 310
            NHAV + G+D
Sbjct: 234 SNHAVLVTGFD 244


>gi|431901237|gb|ELK08303.1| Cathepsin O [Pteropus alecto]
          Length = 322

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 96/257 (37%), Positives = 137/257 (53%), Gaps = 19/257 (7%)

Query: 60  FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
           F +SL+    LN    S  S A YGI +FS L  EEFK  +L+   ++    S       
Sbjct: 45  FRESLNRHRYLNSLFPSENSTAVYGINQFSHLFPEEFKAIYLKSKTSRFPKYS------- 97

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
                    +T    +P  +P++ DWR+  ++ +VRNQQ CG CWAFS V   ES +A+K
Sbjct: 98  ------ADLLTVISKLP--LPLRFDWRDKHVVTQVRNQQMCGGCWAFSVVGAVESAYAIK 149

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
              L  LSVQ+VIDC+ N N GC+GG     L W++  +V L  +SEYP   ++  C   
Sbjct: 150 GKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALYWLNKTQVKLVRDSEYPFKAQNGLCLYF 208

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
           A + +G  IK Y+       E  +   + T GP++  V+A++WQ YLGG+IQ++C  S  
Sbjct: 209 ADTHSGFSIKGYSAHDFSDQEDEMAKALLTFGPLVGIVDAVSWQDYLGGIIQHHC--SSG 266

Query: 299 NINHAVQIVGYDNYSRT 315
             NHAV I G+D    T
Sbjct: 267 EANHAVIITGFDKTGST 283


>gi|126331447|ref|XP_001375261.1| PREDICTED: cathepsin O-like [Monodelphis domestica]
          Length = 414

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 101/264 (38%), Positives = 137/264 (51%), Gaps = 25/264 (9%)

Query: 56  RFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
           R   F +SL     LN    S   SA YGI +FS L  EEFK  +LR   +   L S   
Sbjct: 133 RSTAFRESLKRHHYLNSFSSSDNTSAIYGINQFSYLFPEEFKDIYLRSKPSVLPLYSE-- 190

Query: 115 HHDHHHNHVKKRSITTGITIPT---GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
                            + +PT    +PV+ DWR+  ++ KVRNQQ CG CWAFS V + 
Sbjct: 191 ----------------ALKMPTTHMPLPVRFDWRDKHVVTKVRNQQMCGGCWAFSVVGSI 234

Query: 172 ESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK 231
           ES +A+K  +L  LSVQ+VIDC+ N N GCSGG     L+W++  +V L  +SEY    +
Sbjct: 235 ESAYAIKGESLEDLSVQQVIDCSYN-NFGCSGGSTVNALNWLNKTQVRLVKDSEYSFKAQ 293

Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQY 291
              C   + S  GV IK Y+       E+ +   +   GP+   V+A++WQ YLGG+IQ+
Sbjct: 294 TGLCHYFSGSHAGVSIKDYSSYDFSGKENEMANVLLAFGPLAVIVDAVSWQDYLGGIIQH 353

Query: 292 NCDGSLANINHAVQIVGYDNYSRT 315
           +C  S    NHAV I G+D    T
Sbjct: 354 HC--SSGEANHAVLITGFDRTGNT 375


>gi|149698347|ref|XP_001499302.1| PREDICTED: cathepsin O-like [Equus caballus]
          Length = 367

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 99/263 (37%), Positives = 139/263 (52%), Gaps = 19/263 (7%)

Query: 54  DIRFKNFEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
           D +   F +SL+    LN    S  S A YGI +FS L  EEFK  +LR         S 
Sbjct: 84  DRQAAAFRESLNRHRYLNSLFPSENSTAVYGINQFSYLFPEEFKAIYLR---------SK 134

Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
                 +   V+  S++        +P++ DWR+  ++ +VRNQQ CG CWAFS V   E
Sbjct: 135 PSRFPRYPAEVQT-SLSN-----VSLPLRFDWRDRHVVTQVRNQQACGGCWAFSVVGAVE 188

Query: 173 SMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD 232
           S+ A+K   L  LSVQ+VIDC+ N N GCSGG     L+W++  +V L  +SEYP   + 
Sbjct: 189 SVCAIKGEPLEDLSVQQVIDCSYN-NYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQS 247

Query: 233 AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
             C   + S +G  IK ++       E  +   + T GP++  V+A++WQ YLGGVIQ++
Sbjct: 248 GLCHYFSDSHSGFSIKGFSAYDFSDQEDQMAKALLTFGPLVVVVDAVSWQDYLGGVIQHH 307

Query: 293 CDGSLANINHAVQIVGYDNYSRT 315
           C  S    NHAV I G+D    T
Sbjct: 308 C--SSGEANHAVLITGFDRTGST 328


>gi|440911897|gb|ELR61520.1| Cathepsin O, partial [Bos grunniens mutus]
          Length = 276

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 91/250 (36%), Positives = 136/250 (54%), Gaps = 25/250 (10%)

Query: 68  EELNKNR-------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN+ R           +A YGI +FS L  EEFK  +LR S ++       ++     
Sbjct: 1   ESLNRQRYLNSLFPHENSTAVYGINQFSYLFPEEFKAIYLRSSPSRFPRFPAEEY----- 55

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                 SI+        +P++ DWR+  ++ +VRNQ+TCG CWAFS V   ES+ A+K  
Sbjct: 56  -----TSISN-----LSLPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQ 105

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L +LSVQ+VIDC+   N GC+GG   + L W++  +V L  +SEYP   ++  C+  + 
Sbjct: 106 PLGVLSVQQVIDCS-YSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCRYFSD 164

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
           S +G  IK Y+       E  +   +   GP+I  V+A++WQ YLGG+IQ++C  S    
Sbjct: 165 SHSGSSIKGYSAYDFSGQEDKMAEALLALGPLIVVVDAMSWQDYLGGIIQHHC--SSGEA 222

Query: 301 NHAVQIVGYD 310
           NHAV + G+D
Sbjct: 223 NHAVLVTGFD 232


>gi|326918260|ref|XP_003205408.1| PREDICTED: cathepsin O-like, partial [Meleagris gallopavo]
          Length = 283

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 98/272 (36%), Positives = 130/272 (47%), Gaps = 38/272 (13%)

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR---HSVNKHVLMSHHKHH 116
             +S   I  LN       SA YG  +FS L  EEFK  +LR   H + +++     K  
Sbjct: 9   LRESAKRIRLLNSPSNDNGSAFYGKNQFSHLFPEEFKAIYLRSIPHKLPRYIKAPKGKEK 68

Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
                                +P K DWR+  +I +VRNQQTCG CWAFS V   ES +A
Sbjct: 69  P--------------------LPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYA 108

Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
           +K   L  LSVQ+VIDC+   N GCSGG     L W++  KV L  +SEY    +   C 
Sbjct: 109 IKGNNLEELSVQQVIDCS-YSNYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGLCH 167

Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
             A S  GV I  +        E  ++  +   GP+   V+A++WQ YLGG+IQY+C  S
Sbjct: 168 YFARSDFGVSITGFAAYDFSGQEEEMMRVLVDWGPLAVTVDAVSWQDYLGGIIQYHC--S 225

Query: 297 LANINHAVQIVGYD------------NYSRTW 316
               NHAV I G+D            ++ RTW
Sbjct: 226 SGKANHAVLITGFDRTGSIPYWIVQNSWGRTW 257


>gi|358416284|ref|XP_874012.4| PREDICTED: cathepsin O [Bos taurus]
 gi|359074588|ref|XP_002694471.2| PREDICTED: cathepsin O [Bos taurus]
          Length = 313

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 91/250 (36%), Positives = 136/250 (54%), Gaps = 25/250 (10%)

Query: 68  EELNKNRQ-------SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN+ R           +A YGI +FS L  EEFK  +LR S ++       ++     
Sbjct: 38  ESLNRQRYLNSLFPYENSTAVYGINQFSYLFPEEFKAIYLRSSPSRFPRFPAEEY----- 92

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                 SI+        +P++ DWR+  ++ +VRNQ+TCG CWAFS V   ES+ A+K  
Sbjct: 93  -----TSISN-----LSLPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQ 142

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L +LSVQ+VIDC+   N GC+GG   + L W++  +V L  +SEYP   ++  C+  + 
Sbjct: 143 PLEVLSVQQVIDCS-YSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCRYFSD 201

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
           S +G  IK Y+       E  +   +   GP+I  V+A++WQ YLGG+IQ++C  S    
Sbjct: 202 SHSGSSIKGYSAYDFSGQEDKMAEALLALGPLIVVVDAMSWQDYLGGIIQHHC--SSGEA 259

Query: 301 NHAVQIVGYD 310
           NHAV + G+D
Sbjct: 260 NHAVLVTGFD 269


>gi|354474585|ref|XP_003499511.1| PREDICTED: cathepsin O-like [Cricetulus griseus]
          Length = 311

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 88/246 (35%), Positives = 129/246 (52%), Gaps = 18/246 (7%)

Query: 70  LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
           LN       SA YG+ +FS LS EEFK  +L                   +   +++ I 
Sbjct: 45  LNSFSHDNSSASYGLNQFSYLSPEEFKALYL----------GSKPAWSPRYPAAEQKPIP 94

Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
                   +P++ DWR+  ++ +VRNQ+ CG CWAFS V   ES  A++   L  LSVQ+
Sbjct: 95  N-----VSLPLRFDWRDKHVVNQVRNQKMCGGCWAFSVVTAIESACAIQGKPLDYLSVQQ 149

Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           VIDC+ N N GCSGG   + L W++  +V L  +SEYP   ++  C+    S +GV IK 
Sbjct: 150 VIDCSFN-NYGCSGGSPLSALSWLNKTQVKLMEDSEYPFKAENGLCRYFPQSQSGVSIKD 208

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           ++       E  +   +   GP++  V+A++WQ YLGG+IQ++C  S    NHAV I G+
Sbjct: 209 FSAYDFSGQEDEMAKALLNFGPLVVIVDAVSWQDYLGGIIQHHC--SSGEANHAVLITGF 266

Query: 310 DNYSRT 315
           D    T
Sbjct: 267 DKTGNT 272


>gi|429327035|gb|AFZ78846.1| cathepsin O-like protein [Coptotermes formosanus]
          Length = 227

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 81/190 (42%), Positives = 124/190 (65%), Gaps = 11/190 (5%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY--SKSEHDIRFKNFEK 62
           + V  +V L+ALCFL IP+++    L QK ELF  F QR+ K+Y  +++E+   + NF++
Sbjct: 6   RRVFIVVGLVALCFLGIPIRIDDNEL-QKRELFRGFLQRFNKTYEGNETEYMKHYNNFKE 64

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK--HHDHHH 120
           SL+II+ELN++R +  SA YG+T +SDLS++EF   +L+  +  H+ +   K  H+ H +
Sbjct: 65  SLNIIDELNRDRLTEHSAVYGLTAYSDLSKDEFLHLYLQPWLPDHLNLMKQKQSHYSHKY 124

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             V K ++   +      P++ DWR+  +I +VRNQ+TCGACWAFS   T E+M+A+K G
Sbjct: 125 FAVNKEAVVDDL------PLRVDWRDRNVITEVRNQKTCGACWAFSAAATIEAMYAIKTG 178

Query: 181 TLSLLSVQEV 190
            L  LSVQEV
Sbjct: 179 LLHKLSVQEV 188


>gi|296478683|tpg|DAA20798.1| TPA: cathepsin O preproprotein-like [Bos taurus]
          Length = 375

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 91/250 (36%), Positives = 136/250 (54%), Gaps = 25/250 (10%)

Query: 68  EELNKNRQ-------SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           E LN+ R           +A YGI +FS L  EEFK  +LR S ++       ++     
Sbjct: 100 ESLNRQRYLNSLFPYENSTAVYGINQFSYLFPEEFKAIYLRSSPSRFPRFPAEEYT---- 155

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                 SI+        +P++ DWR+  ++ +VRNQ+TCG CWAFS V   ES+ A+K  
Sbjct: 156 ------SISN-----LSLPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQ 204

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L +LSVQ+VIDC+   N GC+GG   + L W++  +V L  +SEYP   ++  C+  + 
Sbjct: 205 PLEVLSVQQVIDCS-YSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCRYFSD 263

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
           S +G  IK Y+       E  +   +   GP+I  V+A++WQ YLGG+IQ++C  S    
Sbjct: 264 SHSGSSIKGYSAYDFSGQEDKMAEALLALGPLIVVVDAMSWQDYLGGIIQHHC--SSGEA 321

Query: 301 NHAVQIVGYD 310
           NHAV + G+D
Sbjct: 322 NHAVLVTGFD 331


>gi|345307542|ref|XP_001510786.2| PREDICTED: cathepsin O-like [Ornithorhynchus anatinus]
          Length = 358

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 86/233 (36%), Positives = 129/233 (55%), Gaps = 20/233 (8%)

Query: 79  SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI-PTG 137
           +A YG  +FS L  EEFK  +LR   +K                + + S +  ++I P  
Sbjct: 101 TAYYGTNQFSYLFPEEFKAIYLRSKTSK----------------LPRYSESEEMSIKPMP 144

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
           +PV+ DWR+  ++ +VRNQ+ CG CWAFS V   ES +A++   L  LSVQ+VIDC+ N 
Sbjct: 145 LPVRFDWRDKHVVTQVRNQEACGGCWAFSIVGEIESAYAIRGKPLEELSVQQVIDCSYN- 203

Query: 198 NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
           N GCSGG     L+W++  +V L  ++EY    +   C   + S  G+ I+ Y+      
Sbjct: 204 NFGCSGGSTINALNWLNKTQVKLVRDAEYSFKAQTGICHYFSGSHYGISIRGYSAYDFSG 263

Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
            E  ++  + + GP+   V+A++WQ YLGG+IQ++C    A  NHAV I GYD
Sbjct: 264 QEDEMVKVLLSFGPLAVIVDAVSWQDYLGGIIQHHCSSGEA--NHAVLITGYD 314


>gi|71895793|ref|NP_001026300.1| cathepsin O precursor [Gallus gallus]
 gi|53127320|emb|CAG31043.1| hypothetical protein RCJMB04_1m17 [Gallus gallus]
          Length = 320

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 99/273 (36%), Positives = 129/273 (47%), Gaps = 40/273 (14%)

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
             +S   I  LN       SA YG  +FS L  EEFK  +LR    K             
Sbjct: 46  LRESAKRIRLLNSPSNDNGSAFYGKNQFSHLFPEEFKAIYLRSIPYK------------- 92

Query: 120 HNHVKKRSITTGITIPTG----IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
                   +   I +P G    +P K DWR+  +I +VRNQQTCG CWAFS V   ES +
Sbjct: 93  --------LPRYIKVPKGEEKPLPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAY 144

Query: 176 ALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
           A+K   L  LSVQ+VIDC+   N GCSGG     L W++  KV L  +SEY    +   C
Sbjct: 145 AIKGHNLEELSVQQVIDCS-YSNYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGLC 203

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
                S  GV I  +        E  ++  +   GP+   V+A++WQ YLGG+IQY+C  
Sbjct: 204 HYFPHSDFGVSITGFAAYDFSGQEEEMMRVLVDWGPLAVTVDAVSWQDYLGGIIQYHC-- 261

Query: 296 SLANINHAVQIVGYD------------NYSRTW 316
           S    NHAV I G+D            ++ RTW
Sbjct: 262 SSGKANHAVLITGFDTTGSIPYWIVQNSWGRTW 294


>gi|301607871|ref|XP_002933519.1| PREDICTED: cathepsin O-like [Xenopus (Silurana) tropicalis]
          Length = 370

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 99/290 (34%), Positives = 141/290 (48%), Gaps = 35/290 (12%)

Query: 37  FSSFQQRYKKSYSKSEHDI--RFKNFEKSLDIIEELNK---NRQSPESARYGITEFSDLS 91
           F  F Q+Y + Y         R++ F KS +    LN          +A YGI +FSDLS
Sbjct: 66  FLDFIQKYGRGYKDGSQVFQERYQIFLKSTERQNYLNAIALPTNLTSAAHYGINQFSDLS 125

Query: 92  EEEFKTRHLR------HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWR 145
            EEF   +LR      ++ NK    S  ++                      +P++ DWR
Sbjct: 126 AEEFFYTYLRSFPTGNYTSNKPFKNSAQQYF---------------------LPLRFDWR 164

Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
           +  ++  V+NQ +CGACWAFS V   ES +A+K  TL  LSVQ+VIDC+   + GC+GG 
Sbjct: 165 DKKLVTPVKNQLSCGACWAFSVVGAVESAYAIKWHTLEELSVQQVIDCS-YLDSGCNGGS 223

Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
               L W+   K  L   SEY    K   C     +  GV I  Y       +E +++  
Sbjct: 224 TNGALKWLYQTKTKLVRASEYNFKAKTGLCHYFPKTDFGVSINGYETQDFSGTEDAMMKM 283

Query: 266 IATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
           +   GP++  VNA++WQ YLGG+IQ++C  S    NHAV ++GYD    T
Sbjct: 284 LVDLGPMVVIVNAVSWQDYLGGIIQHHC--SSGAPNHAVLVIGYDKTGDT 331


>gi|327273973|ref|XP_003221753.1| PREDICTED: cathepsin O-like [Anolis carolinensis]
          Length = 376

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 89/233 (38%), Positives = 120/233 (51%), Gaps = 18/233 (7%)

Query: 79  SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
           +A YG+ +FS L  EEF+  +L+   +K    +     +                I   +
Sbjct: 119 TAFYGMNQFSHLFPEEFRAIYLQSKSSKVPKFTPEVRVEE---------------IDKPL 163

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
           P K DWR+ GI+ KVRNQ  CG CWAFS V   ES+HA+K   L  LSVQ+VIDC+   N
Sbjct: 164 PAKFDWRDKGIVTKVRNQGVCGGCWAFSVVGIIESVHAIKRNVLEELSVQQVIDCS-YIN 222

Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
            GC GG     L W++  +V L  +SEY    +   C+  + +  GV IK Y    L   
Sbjct: 223 SGCRGGSPVGALGWINQTRVKLVRDSEYHFQAETGLCRYFSRADFGVSIKGYAAYDLSDQ 282

Query: 259 ESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
           E  +   +   GP+   V+A +WQ YLGG+IQY+C  S    NHAV I GYD 
Sbjct: 283 EDKMKKLLLEWGPLAVVVDAASWQDYLGGIIQYHC--SSGEPNHAVLITGYDT 333


>gi|444519298|gb|ELV12725.1| Cathepsin O [Tupaia chinensis]
          Length = 428

 Score =  150 bits (380), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 106/178 (59%), Gaps = 3/178 (1%)

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
           +P++ DWR+  I+  VRNQQTCGACWAFS V   ES  A+    L  LSVQ+V+DCA + 
Sbjct: 33  LPLRFDWRDKHIVTPVRNQQTCGACWAFSVVSAVESACAMAGAPLRELSVQQVLDCAYD- 91

Query: 198 NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
           + GC GG   + L+W++  +V L  ESEYP   +D  C+    S  GV I+ Y       
Sbjct: 92  DRGCGGGSTLSALNWLNKTQVKLVGESEYPFTARDGICRFFPASCPGVSIRGYLAYDFSA 151

Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
            E  +   +   GP++A V+A++WQ YLGGVIQ++C  S    NHAV + G+D   +T
Sbjct: 152 QEDEMAKALVALGPLVAVVDAVSWQDYLGGVIQHHC--SSGEANHAVLVTGFDKAGQT 207


>gi|29244082|ref|NP_808330.1| cathepsin O precursor [Mus musculus]
 gi|67460397|sp|Q8BM88.1|CATO_MOUSE RecName: Full=Cathepsin O; Flags: Precursor
 gi|26329979|dbj|BAC28728.1| unnamed protein product [Mus musculus]
 gi|74139152|dbj|BAE38466.1| unnamed protein product [Mus musculus]
 gi|74141620|dbj|BAE38573.1| unnamed protein product [Mus musculus]
          Length = 312

 Score =  150 bits (379), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 86/256 (33%), Positives = 130/256 (50%), Gaps = 18/256 (7%)

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
             +SL     LN       +A YG+ +FS L  EEFK  +L    +K+     +      
Sbjct: 36  LRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEFKALYLG---SKYAWAPRYPAEG-- 90

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
                +R I         +P++ DWR+  ++  VRNQ+ CG CWAFS V   ES  A++ 
Sbjct: 91  -----QRPIPN-----VSLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQG 140

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            +L  LSVQ+VIDC+ N N GC GG     L W++  ++ L  +S+YP    +  C+   
Sbjct: 141 KSLDYLSVQQVIDCSFN-NSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQCRHFP 199

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            S  GV +K ++       E  +   + + GP++  V+A++WQ YLGG+IQ++C  S   
Sbjct: 200 QSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHHC--SSGE 257

Query: 300 INHAVQIVGYDNYSRT 315
            NHAV I G+D    T
Sbjct: 258 ANHAVLITGFDRTGNT 273


>gi|148683493|gb|EDL15440.1| cathepsin O [Mus musculus]
          Length = 312

 Score =  150 bits (378), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 86/256 (33%), Positives = 130/256 (50%), Gaps = 18/256 (7%)

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
             +SL     LN       +A YG+ +FS L  EEFK  +L    +K+     +      
Sbjct: 36  LRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEFKALYLG---SKYAWAPRYPAEG-- 90

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
                +R I         +P++ DWR+  ++  VRNQ+ CG CWAFS V   ES  A++ 
Sbjct: 91  -----QRPIPN-----VSLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQG 140

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            +L  LSVQ+VIDC+ N N GC GG     L W++  ++ L  +S+YP    +  C+   
Sbjct: 141 KSLDYLSVQQVIDCSFN-NSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQCRHFP 199

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            S  GV +K ++       E  +   + + GP++  V+A++WQ YLGG+IQ++C  S   
Sbjct: 200 QSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHHC--SSGE 257

Query: 300 INHAVQIVGYDNYSRT 315
            NHAV I G+D    T
Sbjct: 258 ANHAVLITGFDRTGNT 273


>gi|28278727|gb|AAH44664.1| Ctso protein [Mus musculus]
          Length = 292

 Score =  150 bits (378), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 86/256 (33%), Positives = 130/256 (50%), Gaps = 18/256 (7%)

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
             +SL     LN       +A YG+ +FS L  EEFK  +L    +K+     +      
Sbjct: 16  LRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEFKALYLG---SKYAWAPRYPAEG-- 70

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
                +R I         +P++ DWR+  ++  VRNQ+ CG CWAFS V   ES  A++ 
Sbjct: 71  -----QRPIPN-----VSLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQG 120

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            +L  LSVQ+VIDC+ N N GC GG     L W++  ++ L  +S+YP    +  C+   
Sbjct: 121 KSLDYLSVQQVIDCSFN-NSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQCRHFP 179

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            S  GV +K ++       E  +   + + GP++  V+A++WQ YLGG+IQ++C  S   
Sbjct: 180 QSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHHC--SSGE 237

Query: 300 INHAVQIVGYDNYSRT 315
            NHAV I G+D    T
Sbjct: 238 ANHAVLITGFDRTGNT 253


>gi|68086379|gb|AAH98219.1| Cathepsin O [Mus musculus]
          Length = 312

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 87/263 (33%), Positives = 131/263 (49%), Gaps = 18/263 (6%)

Query: 53  HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
           H        +SL     LN       +A YG+ +FS L  EEFK  +L    +K+     
Sbjct: 29  HQREAAALRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEFKALYLG---SKYAWAPR 85

Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
           +           +R I         +P++ DWR+  ++  VRNQ+ CG CWAFS V   E
Sbjct: 86  YPAEG-------QRPIPN-----VSLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIE 133

Query: 173 SMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD 232
           S  A++  +L  LSVQ+VIDC+ N N GC GG     L W++  ++ L  +S+YP    +
Sbjct: 134 SARAIQGKSLDYLSVQQVIDCSFN-NSGCLGGSPPCALRWLNETQLKLVADSQYPFKAVN 192

Query: 233 AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
             C+    S  GV +K ++       E  +   + + GP++  V+A++WQ YLGG+IQ++
Sbjct: 193 GQCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHH 252

Query: 293 CDGSLANINHAVQIVGYDNYSRT 315
           C  S    NHAV I G+D    T
Sbjct: 253 C--SSGEANHAVLITGFDRTGNT 273


>gi|348582234|ref|XP_003476881.1| PREDICTED: cathepsin O-like [Cavia porcellus]
          Length = 478

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 85/237 (35%), Positives = 123/237 (51%), Gaps = 18/237 (7%)

Query: 79  SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
           +A YGI +FS L  EEFK  +LR   ++               +  K   + G      +
Sbjct: 221 TAFYGINQFSYLFPEEFKAIYLRSKPSRS------------PRYPSKVQTSVG---SVSL 265

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
           P + DWR+  ++ +VRNQQ CG CWAFS V   ES  A++   L  LS Q+VIDC+ N N
Sbjct: 266 PPRFDWRDKHVVTQVRNQQACGGCWAFSVVGAVESAWAIRGEPLEDLSAQQVIDCSYN-N 324

Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
            GC+GG   + L W+   +V L  +SEYP   ++  C   ++S  G  I+ Y        
Sbjct: 325 FGCNGGSPLSALTWLKKTRVKLVKDSEYPFKAQNGLCHYFSSSHPGFSIQDYAAYDFSAQ 384

Query: 259 ESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
           E  +   +   GP++  V+A++WQ YLGGVIQ++C  S    NHAV + G+D    T
Sbjct: 385 EDEMARVLLLSGPLVVIVDAVSWQDYLGGVIQHHC--SSGEANHAVLVTGFDQTGST 439


>gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
           str. Neff]
          Length = 330

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 92/276 (33%), Positives = 140/276 (50%), Gaps = 26/276 (9%)

Query: 35  ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           + F  F  +Y KSY+  E   R + F  +LD I+ LN    +   ARYG+ +F+DL+ +E
Sbjct: 30  QQFRQFAAQYGKSYASEEFGERLRIFRDNLDRIDALNS---ANTGARYGVNKFADLTPKE 86

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           FK  +L+ +                    KK + T  + +   +P + DWR+ G +   +
Sbjct: 87  FKATYLKGA---------------RSAGQKKAAATAKLDMTGPLPSQFDWRDKGAVTPTK 131

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWM 213
           +Q  CG  WAFS  E  ES   L    L  L+ Q+++DC  GNG+ GC GGD     +++
Sbjct: 132 DQGQCG--WAFSVTEAIESQWFLSGRKLVSLAPQQIVDCDQGNGDYGCDGGDPPTAYEYV 189

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
            +    L+ E  YP   +D  C  K  S  G KI ++T  T   +E+ +   +A+ GP+ 
Sbjct: 190 -IKAGGLDTEESYPYTAEDGQCAFKP-SAVGAKISNWTYITTTKNETEMQYGLASRGPLS 247

Query: 274 AAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             V+A +WQYY+GGVI   C+ SL   +H V I GY
Sbjct: 248 ICVDASSWQYYIGGVITSLCEDSL---DHCVMITGY 280


>gi|26340204|dbj|BAC33765.1| unnamed protein product [Mus musculus]
          Length = 312

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 86/263 (32%), Positives = 130/263 (49%), Gaps = 18/263 (6%)

Query: 53  HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
           H        +SL     LN       +A YG+ + S L  EEFK  +L    +K+     
Sbjct: 29  HQREAAALRESLHRHRYLNSFPHENSTAFYGVNQLSYLFPEEFKALYLG---SKYAWAPR 85

Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
           +           +R I         +P++ DWR+  ++  VRNQ+ CG CWAFS V   E
Sbjct: 86  YPAEG-------QRPIPN-----VSLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIE 133

Query: 173 SMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD 232
           S  A++  +L  LSVQ+VIDC+ N N GC GG     L W++  ++ L  +S+YP    +
Sbjct: 134 SARAIQGKSLDYLSVQQVIDCSFN-NSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVN 192

Query: 233 AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
             C+    S  GV +K ++       E  +   + + GP++  V+A++WQ YLGG+IQ++
Sbjct: 193 GQCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHH 252

Query: 293 CDGSLANINHAVQIVGYDNYSRT 315
           C  S    NHAV I G+D    T
Sbjct: 253 C--SSGEANHAVLITGFDRTGNT 273


>gi|380254588|gb|AFD36229.1| cysteine proteinase [Acanthamoeba castellanii]
          Length = 359

 Score =  147 bits (371), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 92/305 (30%), Positives = 155/305 (50%), Gaps = 16/305 (5%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
            F V ++A+  LA  V + +  L+ +   F+ + +++ +SY   E   R+  + +++  +
Sbjct: 8   FFAVVVLAVASLAQGVSIEERELQGR---FNGWMRQHARSYDSDEFLERYNIWRENMAFV 64

Query: 68  EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
           EE N  R   +S    + ++ DL+ EEF   +  H + K       K  D   +  ++  
Sbjct: 65  EEFN--RAGDKSFTVAMNQYGDLAPEEFSRLYKGHMLPKDEEEQMRKRLDEQ-DPAEEEP 121

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
           +T G T+P       DWR  G +  + NQ +C +CWAF++    E    + N TL  LS 
Sbjct: 122 VTVGATVP----ASWDWRSVGAVTGIENQGSCASCWAFASAYALEGARKIANSTLVSLSK 177

Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
           Q+++DC+G+ GN+GC GG+      WM  N   L  E+ YP     AAC+  + SP  V 
Sbjct: 178 QQLVDCSGSGGNLGCYGGNVGLTYTWMRRNNAKLMTEANYPYTGVQAACRYTSASPAVVG 237

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAV 304
           +K+Y       SES +L + A  GPV  A+++   ++ YY GG   Y+   S + ++HAV
Sbjct: 238 VKNYA-SVKAGSESDLLANAAV-GPVTVAIDSSKRSFIYYSGGYY-YDQTCSSSYLDHAV 294

Query: 305 QIVGY 309
            +VG+
Sbjct: 295 TVVGW 299


>gi|440793487|gb|ELR14669.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
          Length = 342

 Score =  147 bits (371), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 92/305 (30%), Positives = 155/305 (50%), Gaps = 16/305 (5%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
            F V ++A+  LA  V + +  L+ +   F+ + +++ +SY   E   R+  + +++  +
Sbjct: 8   FFAVVVLAVASLAQGVSIEERELQGR---FNGWMRQHARSYDSDEFLERYNIWRENMAFV 64

Query: 68  EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
           EE N  R   +S    + ++ DL+ EEF   +  H + K       K  D   +  ++  
Sbjct: 65  EEFN--RAGDKSFTVAMNQYGDLAPEEFSRLYKGHMLPKDEEEQMRKRLDEQ-DPAEEEP 121

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
           +T G T+P       DWR  G +  + NQ +C +CWAF++    E    + N TL  LS 
Sbjct: 122 VTVGATVP----ASWDWRSVGAVTGIENQGSCASCWAFASAYALEGARKIANSTLVSLSK 177

Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
           Q+++DC+G+ GN+GC GG+      WM  N   L  E+ YP     AAC+  + SP  V 
Sbjct: 178 QQLVDCSGSGGNLGCYGGNVGLTYTWMRRNNAKLMTEANYPYTGVQAACRYTSASPAVVG 237

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAV 304
           +K+Y       SES +L + A  GPV  A+++   ++ YY GG   Y+   S + ++HAV
Sbjct: 238 VKNYA-SVKAGSESDLLANAAV-GPVTVAIDSSKRSFIYYSGGYY-YDQTCSSSYLDHAV 294

Query: 305 QIVGY 309
            +VG+
Sbjct: 295 TVVGW 299


>gi|260832906|ref|XP_002611398.1| hypothetical protein BRAFLDRAFT_210717 [Branchiostoma floridae]
 gi|229296769|gb|EEN67408.1| hypothetical protein BRAFLDRAFT_210717 [Branchiostoma floridae]
          Length = 283

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 87/244 (35%), Positives = 126/244 (51%), Gaps = 20/244 (8%)

Query: 68  EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
           E+LN  R +  SA YG+  FSDL+  EF+  ++    +K+   S +              
Sbjct: 14  EQLNHGR-TAGSALYGLNRFSDLTPAEFRGSNV---TSKNSAQSTYD------------- 56

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
                  P+ +P   DWR    +  +R+Q +CG CWAFS VET ES  ++    L   SV
Sbjct: 57  PGYSFEAPSDVPPIWDWRNNKTVTAIRDQGSCGGCWAFSIVETIESQWSIAGHLLEEYSV 116

Query: 188 QEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
           Q+V+DC    G+ GC GGD C  L WM+     L P+ +YP   KD  C+    + + V 
Sbjct: 117 QQVLDCDRTKGSHGCRGGDTCNALSWMNQTTANLVPKKDYPYTGKDGECRFFTNTTDSVH 176

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQI 306
           + +YTC      E  ++  +  HG +   V+A +WQ YLGG+IQ++C  S    NHAVQI
Sbjct: 177 LTNYTCRGYENHEDEMVRLLHGHGTLAIIVDATSWQDYLGGIIQHHC--SHDYNNHAVQI 234

Query: 307 VGYD 310
           VGY+
Sbjct: 235 VGYN 238


>gi|293345419|ref|XP_001070844.2| PREDICTED: cathepsin O-like [Rattus norvegicus]
          Length = 307

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 87/256 (33%), Positives = 133/256 (51%), Gaps = 22/256 (8%)

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
             +SL+    LN       +A YG+ +FS L  EEFK  +L    +K      +      
Sbjct: 35  LRESLNRHRYLNSFPHDNSTAFYGVNQFSYLFPEEFKALYLG---SKPAWAPRYP----- 86

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
               K ++    ++    +P++ DWR+  ++  VRNQ+TCG CWAFS V   ES  A++ 
Sbjct: 87  ---AKGQTPIPNVS----LPLRFDWRDKHVVNHVRNQKTCGGCWAFSVVSAVESAGAIQG 139

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
             L  LSVQ+VIDC+ N N GC GG     L W++  ++ L  +S+YP   ++  C+   
Sbjct: 140 KPLDYLSVQQVIDCSFN-NYGCRGGSPLGALSWLNETQLKLVADSQYPFKAENGLCRYFP 198

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            S N V I S+  +     E  +   + + GP++  V+A++WQ YLGG+IQ++C  S   
Sbjct: 199 QSFNYVYISSFGSN----QEDEMARALLSFGPLVVIVDAVSWQDYLGGIIQHHC--SSGE 252

Query: 300 INHAVQIVGYDNYSRT 315
            NHAV I G+D    T
Sbjct: 253 ANHAVLITGFDKTGNT 268


>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
 gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
 gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
          Length = 498

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 141/278 (50%), Gaps = 22/278 (7%)

Query: 35  ELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           E+F  ++++++K Y  +E  + R  NF+++L  I E N  R+S    + G+ +F+DLS E
Sbjct: 48  EVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKFADLSNE 107

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           EF+  +L   V K + +   + H H                    P   DWR  G++  V
Sbjct: 108 EFREMYLSK-VKKPITIEEKRKHRHLQT--------------CDAPSSLDWRNKGVVTAV 152

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           ++Q  CG+CW+FST    E+++A+  G L  LS QE++DC    N GC GGD  +   W+
Sbjct: 153 KDQGDCGSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWV 212

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
            +    ++ E++YP    D  C         V I+ Y    + PS+S++L       P+ 
Sbjct: 213 -IGNGGIDTEADYPYTGVDGTCNTAKEEKKVVSIEGYV--DVDPSDSALLC-ATVQQPIS 268

Query: 274 AAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             ++  AL +Q Y GG+   +C G   +I+HA+ IVGY
Sbjct: 269 VGMDGSALDFQLYTGGIYDGDCSGDPNDIDHAILIVGY 306


>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
 gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
 gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
 gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
 gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
          Length = 365

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 96/294 (32%), Positives = 151/294 (51%), Gaps = 31/294 (10%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           E+ +ELF  F  +Y+K+YS  E  +R F+ F+ +L+ I+E NK          G+ EF+D
Sbjct: 46  ERLMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKK---ITGYWLGLNEFAD 102

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ +EFK  +L  ++      +    +D    + +  + +        +P + DWR+ G 
Sbjct: 103 LTHDEFKAAYLGLTLTP----ARRNSNDQLFRYEEVEAAS--------LPKEVDWRKKGA 150

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           + +V+NQ  CG+CWAFSTV   E ++A+  G L+ LS QE+IDC  +GN GCSGG     
Sbjct: 151 VTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLMDYA 210

Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN-------GVKIKSYTCDTLIPSESSI 262
             ++  N   L  E  YP L+++  C+R +T  +        V I  Y  D    +E ++
Sbjct: 211 FSYIAANG-GLHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYE-DVPRNNEQAL 268

Query: 263 LTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
           L  +A H PV  A+ A    +Q+Y GGV    C      ++H V  VGY   S+
Sbjct: 269 LKALA-HQPVSVAIEASGRNFQFYSGGVFDGPCG---TRLDHGVTAVGYGTASK 318


>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
          Length = 471

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 101/289 (34%), Positives = 152/289 (52%), Gaps = 37/289 (12%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           NL +   LF+ FQ ++K++Y  + E  +RF+ F+++L +IEELN+N Q   SA+YGITEF
Sbjct: 158 NLNKVEHLFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQG--SAKYGITEF 215

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWRE 146
           +D++  E+K R      +     S+ K                   IP   +P + DWRE
Sbjct: 216 ADMTSPEYKQRTGLWQRDPQKAASNPK-----------------AEIPNIDLPKEFDWRE 258

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
            G I  V+NQ  CG+CWAFS     E +HA++ G L   S QE++DC    +  C+GG  
Sbjct: 259 KGAISAVKNQGNCGSCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDC-DTSDSACNGG-- 315

Query: 207 CALLD--WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
             L D  +  + K+  LE ES+YP   +   C   +T  + VK+K +    L  +E++I 
Sbjct: 316 --LPDNAYEAIEKIGGLELESDYPYHARKDQCHFNSTKIH-VKVKGHV--DLPKNETAIA 370

Query: 264 TDIATHGPVIAAVNALTWQYYLGGVIQYN---CDGSLANINHAVQIVGY 309
             +  +GP+   +NA   Q+Y GGV       C  S  N++H V IVGY
Sbjct: 371 QWLIANGPISIGINANAMQFYRGGVSHPPHILC--SRKNLDHGVLIVGY 417


>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
          Length = 471

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 101/289 (34%), Positives = 152/289 (52%), Gaps = 37/289 (12%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           NL +   LF+ FQ ++K++Y  + E  +RF+ F+++L +IEELN+N Q   SA+YGITEF
Sbjct: 158 NLNKVEHLFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQG--SAKYGITEF 215

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWRE 146
           +D++  E+K R      +     S+ K                   IP   +P + DWRE
Sbjct: 216 ADMTSPEYKQRTGLWQRDPQKAASNPK-----------------AEIPNIDLPKEFDWRE 258

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
            G I  V+NQ  CG+CWAFS     E +HA++ G L   S QE++DC    +  C+GG  
Sbjct: 259 KGAISAVKNQGNCGSCWAFSVTGNIEGLHAVRTGVLEQYSEQELLDC-DTSDSACNGG-- 315

Query: 207 CALLD--WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
             L D  +  + K+  LE ES+YP   +   C   +T  + VK+K +    L  +E++I 
Sbjct: 316 --LPDNAYEAIEKIGGLELESDYPYHARKDQCHFNSTKIH-VKVKGHV--DLPKNETAIA 370

Query: 264 TDIATHGPVIAAVNALTWQYYLGGVIQYN---CDGSLANINHAVQIVGY 309
             +  +GP+   +NA   Q+Y GGV       C  S  N++H V IVGY
Sbjct: 371 QWLIANGPISIGINANAMQFYRGGVSHPPHILC--SRKNLDHGVLIVGY 417


>gi|66812702|ref|XP_640530.1| counting factor associated protein [Dictyostelium discoideum AX4]
 gi|74897159|sp|Q54TR1.1|CFAD_DICDI RecName: Full=Counting factor associated protein D; Flags:
           Precursor
 gi|60468561|gb|EAL66564.1| counting factor associated protein [Dictyostelium discoideum AX4]
          Length = 531

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 96/290 (33%), Positives = 143/290 (49%), Gaps = 31/290 (10%)

Query: 31  EQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           EQ   LF  ++ +Y K YS + EHD RF NF+ +  II   N       S + G+  ++D
Sbjct: 219 EQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKE---SSYKLGMNHYAD 275

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           LS +EF T      V   V        D  H+    RSI          P   DWR    
Sbjct: 276 LSNKEFNTL-----VKPKVARPSVTGADSVHDDESLRSI----------PSTVDWRNQNC 320

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCA 208
           +  V++Q  CG+CW F +  + E  + + NG L  LS Q+++DCA   G+ GC GG   +
Sbjct: 321 VTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASS 380

Query: 209 LLDW-MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
              + M++    L  ES YP L+++  C+ +  +P+GV I  Y  +    SES++   IA
Sbjct: 381 AFQYVMEIGS--LATESNYPYLMQNGLCRDRTVTPSGVSITGYV-NVTSGSESALQNAIA 437

Query: 268 THGPVIAAVNALT--WQYYLGGVIQYN---CDGSLANINHAVQIVGYDNY 312
           T GPV  A++A    ++YY+ GV  YN   C   L +++H V  +GY  Y
Sbjct: 438 TTGPVAIAIDASVDDFRYYMSGV--YNNPACKNGLDDLDHEVLAIGYGTY 485


>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
          Length = 774

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 92/281 (32%), Positives = 153/281 (54%), Gaps = 34/281 (12%)

Query: 36  LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           LF++F   Y ++YS  E ++RFK F ++L+ IEEL +  Q   +  YG+  F+D+S++EF
Sbjct: 469 LFNNFMTTYNRTYSSLERNLRFKIFRENLNFIEELRETEQG--TGIYGVNMFADMSQKEF 526

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVK-KRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +TR+L             +      N +   ++    I +P+      DWR+ G++  V+
Sbjct: 527 RTRYL-----------GLRPDLQSENEIPLPKAEIPDIDLPSSF----DWRQKGVVTPVK 571

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD--W 212
           NQ  CG+CWAFS     E  +A+K+G L  LS QE++DC  + + GC+GG    L D  +
Sbjct: 572 NQGQCGSCWAFSVTGNVEGQYAIKHGQLLSLSEQELVDC-DHLDEGCNGG----LPDNAY 626

Query: 213 MDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
             + ++  LE ES+YP   ++  C  K    N VK++  +   +  +E+ I   +  +GP
Sbjct: 627 RAIEQLGGLELESDYPYEAENEKCHFKQ---NLVKVELASAVNITSNETQIAQWLVQNGP 683

Query: 272 VIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
           +   +NA   Q+Y+GGV   ++  C+ +  N+NH V IVGY
Sbjct: 684 IAIGINANAMQFYMGGVSHPLKILCNPN--NLNHGVLIVGY 722


>gi|227018328|gb|ACP18830.1| cysteine proteinase 1 [Chrysomela tremula]
          Length = 323

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 99/307 (32%), Positives = 156/307 (50%), Gaps = 32/307 (10%)

Query: 16  LCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNR 74
           + F A  V +   N     EL++ F++ + K+Y S  E  +RF  F+ +L  I   N   
Sbjct: 5   IAFAAFVVAI---NAASDQELWADFKKAHGKTYKSLREEKLRFNIFQDTLREIAAHNAKY 61

Query: 75  QSPESARY-GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGIT 133
           +S ES  Y  I +FSD+++EEF+   +++  ++  L             ++  ++T G  
Sbjct: 62  ESGESTYYLAINQFSDITDEEFRAMLMKNVESRPSL-----------EDMEIANLTVGAA 110

Query: 134 IPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC 193
                P   DWR  G +  +RNQ+ CG+CWAFS V   E   A+K+G+ + LSVQ+++DC
Sbjct: 111 -----PESIDWRTEGAVLPIRNQEDCGSCWAFSAVAAVEGQAAIKSGSKTPLSVQQLVDC 165

Query: 194 AG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
           +   GN GC+GG      D++  N   LE +++YP    D +CK   +S   VK+  Y  
Sbjct: 166 STEGGNSGCNGGLMNGAFDYIKANG--LESDAKYPYTGTDDSCKADKSSSL-VKLTGYK- 221

Query: 253 DTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGSLANINHAVQIVGY-- 309
             +  SE+S+   + T GP+  AV A  W+ Y GG+     C G    ++H V  VGY  
Sbjct: 222 -KVASSEASLKEAVGTVGPISVAVYADLWRSYGGGIFNNILCLG--FGLDHGVTAVGYGT 278

Query: 310 DNYSRTW 316
           DN  + W
Sbjct: 279 DNGKKYW 285


>gi|328876826|gb|EGG25189.1| hypothetical protein DFA_03437 [Dictyostelium fasciculatum]
          Length = 341

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 93/302 (30%), Positives = 143/302 (47%), Gaps = 27/302 (8%)

Query: 11  VALIALCFLAIPV-KVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIE 68
           +ALIA+    +    V     +     F ++   + K Y  + E  +R  NF +++  IE
Sbjct: 5   LALIAIMLAVVSAYNVRLSTADDYTTRFKTWMVEHNKMYHEEEEFYLRLSNFIRNIHSIE 64

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
           ++N  RQ   +A +G+ +FSDLS +EFK         KH LM ++K         K R  
Sbjct: 65  KMN--RQYGRTATFGLNKFSDLSLDEFK---------KHYLMPNYKP--------KARVT 105

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                 P+ IP   DWR  G +  V+NQ  CG+CWAFS  E  E+ + +  G +  LS Q
Sbjct: 106 KETFNYPSNIPATLDWRTKGYVTPVKNQLMCGSCWAFSATEQIETANIMAGGQVEYLSEQ 165

Query: 189 EVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
           +++DC    + GC GGD      ++  N   L     YP    + AC   +T+P  V++ 
Sbjct: 166 QIVDCDPY-DGGCGGGDPYTAYQYVQ-NNGGLTLNVTYPYTAANGACYANSTAP-AVQVT 222

Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
           ++   +   +E+ +   +A  GP+   VNA  W  Y  G+    C   L   +H VQIVG
Sbjct: 223 AFGYASSQGNETQLREAMAARGPLSICVNAEPWMSYQSGIFSSTCSDDL---DHCVQIVG 279

Query: 309 YD 310
           YD
Sbjct: 280 YD 281


>gi|281209544|gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum PN500]
          Length = 465

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 158/313 (50%), Gaps = 40/313 (12%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
           + I+ ++ L  +A   K+S   LE+    F  FQ +Y K Y+ SE+  RF  F+ +L +I
Sbjct: 4   VIILTVLLLVSMAAAKKLS---LEETQ--FRQFQIKYNKQYTSSEYAERFATFKSNLKVI 58

Query: 68  EELNKNRQSPESA-RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           +E N++  S +S+ R+G+ EF+DLS+ EF+  +L +SV                  V+  
Sbjct: 59  DEKNRDAASRKSSVRFGVNEFADLSQSEFRATYL-NSVQA----------------VRDP 101

Query: 127 SITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
           +      +P   +P   DWR  G +  V+NQ  CG+CW+FST    E    L   TL+ L
Sbjct: 102 NAAVAADLPVEDLPTAFDWRTKGAVTGVKNQGQCGSCWSFSTTGNVEGQWFLAGNTLTGL 161

Query: 186 SVQEVIDC-------AGNG--NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
           S Q ++DC        G+   + GC+GG       ++  N  + + E+ YP    D  C 
Sbjct: 162 SEQNLVDCDHECMEYLGDNVCDQGCNGGLQPNAYTYIIKNGGI-DTEASYPYQGVDGTCS 220

Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
            KA +  G KI ++T   +  +E+ +   +  +GP+  A +A+ WQ+YLGGV    C  +
Sbjct: 221 FKAANI-GAKISNWT--YVSSNETQMAAYLVANGPLAIAADAVEWQFYLGGVFDVPCGNT 277

Query: 297 LANINHAVQIVGY 309
           L   +H + IVGY
Sbjct: 278 L---DHGILIVGY 287


>gi|281207374|gb|EFA81557.1| hypothetical protein PPL_05546 [Polysphondylium pallidum PN500]
          Length = 341

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 148/308 (48%), Gaps = 27/308 (8%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFKNFEK 62
           + F+V L+A+  +      +     +  +    F+Q   +++KSY+  SE+ +R  ++ K
Sbjct: 5   IAFLVCLVAIASVDAIRIQNNSGFHRARDFEGEFRQWMTKHEKSYADDSEYYLRLSHYIK 64

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           +L  + + NK       A++   +FSDLS EEF+  +L +  NK +              
Sbjct: 65  NLRTVADYNKKHAG--MAKFAPNKFSDLSIEEFRAGYLNYVPNKLI-------------- 108

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
            K RS       P  IPV  DWR+ G +  V+NQ+ CG+CWAFS  E  E+ + +     
Sbjct: 109 -KDRSTKQNFDYPANIPVSLDWRQKGFVTPVKNQEQCGSCWAFSAGEQIETAYIMAGNAA 167

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
             +S Q+++DC    + GC GGD      ++  +   +   ++YP    D  C  + T P
Sbjct: 168 QNVSEQQIVDCDPY-DGGCGGGDPMTAYQYVQ-SAGGITTNTDYPYTATDGTCYAQNT-P 224

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINH 302
              +I SY   +   +E+ +   IA  GP+   V+A TW  Y  GV+  NC   L   +H
Sbjct: 225 KFTQIASYGYASNKGNETELKQAIAARGPLSICVDAETWMNYQSGVLNSNCPDEL---DH 281

Query: 303 AVQIVGYD 310
            VQIVGYD
Sbjct: 282 CVQIVGYD 289


>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
          Length = 322

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 103/312 (33%), Positives = 152/312 (48%), Gaps = 47/312 (15%)

Query: 8   LFIVA---LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
           LF V+   LI  C +A+P        +   EL+  F++ Y K Y+  +   RF  F+ +L
Sbjct: 3   LFTVSCFVLIVSCAVAVP--------DSARELYEQFKRDYGKVYANEDDQKRFAIFKDNL 54

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
              ++L    Q   +ARYG+T+FSDL+ EEF  ++L   VN               N   
Sbjct: 55  MRAQKLQLKDQG--TARYGVTQFSDLTPEEFAAKYLSAPVN---------------NDQV 97

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
           KR   TG+      P + DWR  G +  V NQ +CG+CWAFST    E    +K G L  
Sbjct: 98  KRVRPTGLK---AAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVS 154

Query: 185 LSVQEVIDCAGNGNMGCSGG-DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           LS Q+++DC      GC+GG    + L+ M +    LE ES+YP +  +  C     + N
Sbjct: 155 LSKQQLVDC-DRAAQGCNGGWPASSYLEIMYMGG--LESESDYPYVGVEQTC-----ALN 206

Query: 244 GVKIKSYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQ---YNCDGSLA 298
             K+ +   D+++  P E      +A HGP+   +NA+  QYY  GV++     C  +  
Sbjct: 207 KEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVALQYYQSGVLKPTFEECPDT-- 264

Query: 299 NINHAVQIVGYD 310
            +NHAV  VGYD
Sbjct: 265 ELNHAVLTVGYD 276


>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
          Length = 321

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 102/308 (33%), Positives = 148/308 (48%), Gaps = 39/308 (12%)

Query: 8   LFIVA---LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
           LF V+   LI  C +A+P        +   EL+  F++ Y K Y+  +   RF  F+ +L
Sbjct: 3   LFTVSCFVLIVSCAVAVP--------DSARELYEQFKRDYGKVYANEDDQKRFAIFKDNL 54

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
              ++L    Q   +ARYG+T+FSDL+ EEF  ++L   VN               N   
Sbjct: 55  MRAQKLQLKDQG--TARYGVTQFSDLTPEEFAAKYLSAPVN---------------NDQV 97

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
           KR   TG+      P + DWR  G +  V NQ +CG+CWAFST    E    +K G L  
Sbjct: 98  KRVRPTGLK---AAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVS 154

Query: 185 LSVQEVIDCAGNGNMGCSGG-DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           LS Q+++DC    + GC+GG    + L+ M +    LE + +YP       C  +     
Sbjct: 155 LSKQQLVDCDRAAD-GCNGGWPASSYLEIMHMGG--LESQDDYPYAGVKEQCFMEKER-- 209

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG-SLANINH 302
            +  K      L PSE      +A HGP+   +NA+T QYY  G+I  + +  S  ++NH
Sbjct: 210 -LLAKIDDSIALGPSEDDNAAYLAEHGPLSTLLNAITLQYYQSGIIHPSYEECSPVDLNH 268

Query: 303 AVQIVGYD 310
           AV  VGYD
Sbjct: 269 AVLTVGYD 276


>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
          Length = 1036

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 97/282 (34%), Positives = 152/282 (53%), Gaps = 35/282 (12%)

Query: 36  LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F  +YKK Y +K E ++RF+ F+ +L++IEEL +N     + RYG+T+F+DL++ E
Sbjct: 730 LFHEFMGKYKKMYHNKEEKEMRFQIFKDNLNLIEELQRNEMG--TGRYGVTQFTDLTKAE 787

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKV 153
           FK RHL     K  L S         N +         TIP   +P   DWR   ++  V
Sbjct: 788 FKARHLGL---KPTLKSE--------NDIP----MPMATIPDIELPSDYDWRHHNVVTPV 832

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD-- 211
           ++Q +CG+CWAFS     E  +A+K+G L  LS QE++DC    + GC+GG    L D  
Sbjct: 833 KDQGSCGSCWAFSVTGNIEGQYAIKHGELLSLSEQELVDC-DKLDSGCNGG----LPDTA 887

Query: 212 WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           +  + ++  LE ES+YP   +D  C     + N VK+   +   +  +E+ +   +  +G
Sbjct: 888 YRAIEELGGLELESDYPYDAEDEKCH---FNKNKVKVNIVSGLNITSNETQMAQWLVKNG 944

Query: 271 PVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
           P+   +NA   Q+Y+GGV    ++ C  S  +++H V IVGY
Sbjct: 945 PMSIGINANAMQFYMGGVSHPFKFLC--SPDSLDHGVLIVGY 984


>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
          Length = 389

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 88/286 (30%), Positives = 150/286 (52%), Gaps = 27/286 (9%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARY-GITEFS 88
           E+ LE+F  ++++++K Y  +E  + RF+NF+ +L  I E N  R++ +   + G+ +F+
Sbjct: 43  ERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNKFA 102

Query: 89  DLSEEEFKTRHL---RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWR 145
           D+S EEF+  +L   +  +NK + +S +           +R + +        P   DWR
Sbjct: 103 DMSNEEFRKAYLSKVKKPINKGITLSRNM----------RRKVQS-----CDAPSSLDWR 147

Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
             G++  V++Q +CG+CWAFS+    E ++AL  G L  LS QE+++C    N GC GG 
Sbjct: 148 NYGVVTAVKDQGSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECD-TSNYGCEGGY 206

Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
                +W+ +N   ++ ES+YP    D  C         V I  Y    +  S+S++L  
Sbjct: 207 MDYAFEWV-INNGGIDSESDYPYTGVDGTCNTTKEETKVVSIDGYQ--DVEQSDSALLCA 263

Query: 266 IATHGPVIAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A   PV   ++  A+ +Q Y GG+   +C     +I+HAV IVGY
Sbjct: 264 VAQQ-PVSVGIDGSAIDFQLYTGGIYDGSCSDDPDDIDHAVLIVGY 308


>gi|414887427|tpg|DAA63441.1| TPA: hypothetical protein ZEAMMB73_713985 [Zea mays]
          Length = 355

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 94/316 (29%), Positives = 152/316 (48%), Gaps = 28/316 (8%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
            + A + L  +A      + ++E  L +  F ++Q  Y +SY + +E   RF+ + ++++
Sbjct: 10  LLCACLMLVLMAGAASGGRVDVEDMLMMDRFRAWQATYNRSYLTAAERLRRFEVYRQNME 69

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV-- 123
           +IE  N  R++  S +   T F+DL+ EEF   H   S   H   +  +H +    H   
Sbjct: 70  LIEATN--RRAELSYQLSETPFTDLTSEEFLATHT-MSTRLHASEAARRHRELITTHAGP 126

Query: 124 --------KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
                    +R+ TT + +P  +    DWR  G +  V++Q  CG CW+F+TV   E +H
Sbjct: 127 VSDGGRQWNRRNYTTDLDVPESV----DWRTKGAVTTVKDQGACGGCWSFATVAAIEGLH 182

Query: 176 ALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
            ++ G L  LS QEV+DC+   N GC GG+  A +DW+  N   L  ES+YP   +   C
Sbjct: 183 KIRTGQLVSLSEQEVLDCSSPPNNGCHGGNPAAAIDWVSANG-GLTTESDYPYEGRQGKC 241

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIA-THGPVIAAVNA-LTWQYYLGGVIQYNC 293
           K      +  KI+      L+   +    ++A    PV   +N     Q+Y  GV    C
Sbjct: 242 KLDKARNHVAKIRG---RKLVDQNNEAALEVAVAQQPVAVGMNVHPIQQHYKSGVFHGPC 298

Query: 294 DGSLANINHAVQIVGY 309
           D    ++NHAV +VGY
Sbjct: 299 DPE--DLNHAVTMVGY 312


>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
          Length = 327

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 101/311 (32%), Positives = 154/311 (49%), Gaps = 40/311 (12%)

Query: 8   LFIVALIALCFLAIPVKVSKPNL-EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
           LF V+  AL  ++  + VS   + +   EL+  F++ Y K Y+  +   RF  F+ +L  
Sbjct: 3   LFTVSCFAL-IVSCAIAVSAGRVPDSARELYEQFKRGYGKVYANEDDQKRFAIFKDNLVR 61

Query: 67  IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
            ++L    Q   +ARYG+T+FSDL+ EEF  ++L   VN               + VK+ 
Sbjct: 62  AQKLQLKDQG--TARYGVTQFSDLTPEEFAAKYLSAPVN--------------DDQVKRM 105

Query: 127 SITTGITIPTGI---PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
                   PTG+   P + DWR  G +  V NQ +CG+CWAFST    E    +K G L 
Sbjct: 106 R-------PTGLKAAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLV 158

Query: 184 LLSVQEVIDCAGNGNMGCSGG-DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
            LS Q+++DC      GC+GG    + L+ M +    LE ES+YP +  +  C     + 
Sbjct: 159 SLSKQQLVDC-DRAAQGCNGGWPASSYLEIMYMGG--LESESDYPYVGVEQTC-----AL 210

Query: 243 NGVKIKSYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL-AN 299
           N  K+ +   D+++  P E      +A HGP+   +NA+  Q+Y  GV++   D      
Sbjct: 211 NKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVALQHYQSGVLKPTFDECPDTE 270

Query: 300 INHAVQIVGYD 310
           +NHAV  VGYD
Sbjct: 271 LNHAVLTVGYD 281


>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  137 bits (344), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 96/307 (31%), Positives = 152/307 (49%), Gaps = 30/307 (9%)

Query: 14  IALCFLAIPVKVSKPNLEQKLELFSSFQQRY-------KKSYSK-SEHDIRFKNFEKSLD 65
           +AL F+ + +  S+  L + +   ++ + R+       +K Y   +E ++RF+ F+++++
Sbjct: 12  LALFFICLGLWSSQVALSRPINYEATMRARHDQWIVHHEKVYKDLNEKEVRFQIFKENVE 71

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
            IE  N      +  + G  +FSDL+ EEF+  H  +  +   +M+  K   H       
Sbjct: 72  RIEAFNAGED--KGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKGKTHFR----- 124

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
                  T  T IP   DWR+ G +  +++Q+ CG CWAFS V   E +H LK G L  L
Sbjct: 125 ------YTNVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPL 178

Query: 186 SVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           S QE++DC   G + GCSGG      D++  NK  L  E  YP   +D  C +K ++ + 
Sbjct: 179 SEQELVDCDVEGEDEGCSGGLLDTAFDFILKNK-GLTTEVNYPYKGEDGVCNKKKSALSA 237

Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVN--ALTWQYYLGGVIQYNCDGSLANINH 302
            KI  Y  D    SE ++L  +A   PV  A++  +  +Q+Y  GV   +C   L   NH
Sbjct: 238 AKITGYE-DVPANSEKALLQAVANQ-PVSVAIDGSSFDFQFYSSGVFSGSCSTWL---NH 292

Query: 303 AVQIVGY 309
           AV  VGY
Sbjct: 293 AVTAVGY 299


>gi|332326581|gb|AEE42614.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  137 bits (344), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 89/280 (31%), Positives = 140/280 (50%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS V   ES  A+    L+ LS Q+++ C  + + GC GG      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGNIESQWAVAGHRLTALSEQQLVSC-DDKDSGCGGGLMTQAFEWLL 201

Query: 215 VN-KVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N    +  E  YP +       AC   +    G +I  Y   T+  SE+ +   +A  G
Sbjct: 202 RNMNGTMXTEDSYPYVSSTGDVPACTNSSQLVPGARIDGYV--TIESSETVMAAWLAKSG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 260 PISIAVDASSFMSYXSGVLT-SCAGK--XLNHGVLLVGYN 296


>gi|357162946|ref|XP_003579573.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
          Length = 376

 Score =  137 bits (344), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 104/340 (30%), Positives = 163/340 (47%), Gaps = 57/340 (16%)

Query: 1   MFDVKNVLFIVALIALC--FLAIPVKVSKPNLEQ--------KLEL-----FSSFQQRYK 45
           M  ++ +  +VA + L     A+   V  P +EQ        +LEL     F+SF QR+ 
Sbjct: 1   MARLRRLPIVVAAVLLLSGVAALSSPVEDPLIEQVVGGDEKNELELNAEAHFASFVQRFN 60

Query: 46  KSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL---- 100
           KSY  + EH  R   F  +L       ++++   SA +G+T+FSDL+ +EF+ R L    
Sbjct: 61  KSYRDADEHAHRLSVFTANL---RRARRHQRLDPSAVHGVTKFSDLTPDEFRDRFLGLRK 117

Query: 101 -RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQT 158
            R S  K +  S H                    +PT G+P + DWRE G +G V++Q +
Sbjct: 118 YRRSFLKGLSGSAHD----------------APALPTDGLPTEFDWREHGAVGPVKDQGS 161

Query: 159 CGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALL 210
           CG+CW+FST    E  H L  G L +LS Q+++DC            + GC+GG      
Sbjct: 162 CGSCWSFSTSGALEGAHYLATGKLEVLSEQQMVDCDHECDPSEPRACDAGCNGGLMTTAF 221

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            ++      LE E +YP   +  ACK    S    ++K+++  T+   E  I  ++  HG
Sbjct: 222 SYL-AKAGGLETEKDYPYTGRGGACKFD-KSKIAAQVKNFS--TVAVDEDQIAANLVKHG 277

Query: 271 PVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           P+   +NA+  Q Y+GGV   + C     +++H V +VGY
Sbjct: 278 PLAIGINAVFMQTYIGGVSCPFICG---RHLDHGVLLVGY 314


>gi|330792958|ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
 gi|325085467|gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
          Length = 346

 Score =  137 bits (344), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 92/285 (32%), Positives = 145/285 (50%), Gaps = 33/285 (11%)

Query: 37  FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEF 95
           F +FQQ+Y K YS +E+  +F+ F+ +L +I +LN+  +  +S  ++G+ EF+DLS  EF
Sbjct: 29  FVAFQQKYNKVYSSNEYSAKFETFKANLGVIAQLNQKAKLHKSDTKFGVNEFADLSAAEF 88

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +  +L   V K                +    + T   + T IP   DWR  G +  V+N
Sbjct: 89  RKYYLNAQVAKP------------DASLPMAPLLTEEVLET-IPTAFDWRTKGAVTGVKN 135

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC---------AGNGNMGCSGGDF 206
           Q  CG+CW+FST    E    L   TL  LS Q ++DC           + + GC GG  
Sbjct: 136 QGQCGSCWSFSTTGNIEGQWYLAGNTLVGLSEQNLVDCDHQCMEYDGQKSCDAGCDGGLQ 195

Query: 207 CALLDWMDVNKVVLEPESEYPLL-LKDAACKRKATSPNGVKIKSYTCDTLIP-SESSILT 264
                ++ +    L+ E+ YP L +   +CK K+ +    KI ++   T+IP +E+ +  
Sbjct: 196 PNAYRYV-IENGGLDSENSYPYLAVTGDSCKFKSGNV-AAKISNF---TMIPQNETQMAG 250

Query: 265 DIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +ATHGP+  A +A  WQ+Y+GGV    C  SL   +H + IVG+
Sbjct: 251 YLATHGPLAIAADAAEWQFYIGGVFDLPCGQSL---DHGILIVGF 292


>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 88/262 (33%), Positives = 133/262 (50%), Gaps = 22/262 (8%)

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
           +E ++RFK F+++++ IE  N      +  + G+ +FSDL+ E+F+  H  +  +   +M
Sbjct: 57  NEKEMRFKIFKENVERIEAFNAGED--KGYKLGVNKFSDLTNEKFRVLHTGYKRSHPKVM 114

Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
           S  K   H                 T IP   DWR+ G +  +++Q+ CG CWAFS V  
Sbjct: 115 SSSKPKTHFR-----------YANVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAA 163

Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
            E +H LK G L  LS QE++DC   G + GCSGG      D++  NK  L  E+ YP  
Sbjct: 164 TEGLHQLKTGKLIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNK-GLTTEANYPYK 222

Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVN--ALTWQYYLGG 287
            +D  C +K ++ +  KI  Y  D    SE ++L  +A   PV  A++  +  +Q+Y  G
Sbjct: 223 GEDGVCNKKKSALSAAKIAGYE-DVPANSEKALLQAVANQ-PVSVAIDGSSFDFQFYSSG 280

Query: 288 VIQYNCDGSLANINHAVQIVGY 309
           V   +C   L   NHAV  VGY
Sbjct: 281 VFSGSCSTWL---NHAVTAVGY 299


>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
           rotundata]
          Length = 884

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 93/279 (33%), Positives = 140/279 (50%), Gaps = 29/279 (10%)

Query: 36  LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F + Y K+Y S  E   R+K F K+L +IE+L K  Q   +A YG+T F+DL+ EE
Sbjct: 578 LFEDFVKTYNKTYLSAKEKADRYKVFRKNLKMIEKLRKFEQG--TAVYGVTMFADLTPEE 635

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           FKT++L    N               N      +   +     +P K DWRE   +  V+
Sbjct: 636 FKTKYLGLKTN--------------LNQENDIPLQEAVIPDIDLPPKFDWREYNAVTPVK 681

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS +   E  +A+K+  L  LS QE++DC  N + GC GG    +  +  
Sbjct: 682 DQGQCGSCWAFSAIGNIEGQYAIKHKKLLSLSEQELVDC-DNLDDGCGGG--YMINAYKT 738

Query: 215 VNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
           V K+  LE E++YP   ++  C       N  K++  +   +   E  +   +  +GP+ 
Sbjct: 739 VEKLGGLELETDYPYDARNEKCHFLK---NKAKVQVASALNITNDEKKMAQWLVKNGPIS 795

Query: 274 AAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
             +NA   Q+Y GGV    ++ CD   AN++H V IVGY
Sbjct: 796 VGINANAMQFYFGGVSHPFKFLCDP--ANLDHGVLIVGY 832


>gi|403352840|gb|EJY75943.1| Oryzain gamma chain [Oxytricha trifallax]
          Length = 338

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 101/310 (32%), Positives = 155/310 (50%), Gaps = 33/310 (10%)

Query: 7   VLFIVALIALC-FLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKS- 63
            L IV +++L    A    + +  L    E F ++  R+ KSY +K+E   R K F K+ 
Sbjct: 5   TLAIVGIVSLSSVFASDAFLKESGLVSSTEEFLNYIARFGKSYATKAEFQKRAKLFLKTK 64

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH--HKHHDHHHN 121
           ++I++  + N  S  + R G  +FSD +EEEF+           +L +    + HD +H 
Sbjct: 65  MEIMQAASSN--SVPTFRLGFNQFSDWTEEEFQA----------ILGNKPSEEEHDVYHE 112

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
           H+K       I     +P  KDWR+ G++  V++Q  CG+CWAFST    ES  A++ G 
Sbjct: 113 HLK-------ILEDAILPASKDWRDDGVVNPVKDQGRCGSCWAFSTAAGVESHFAIQFGK 165

Query: 182 LSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
           L  LS Q+++DC+    N GC+GG      D+  V    LE E++YP L  D  C R  +
Sbjct: 166 LYSLSEQQLVDCSTAYDNAGCNGGLATQGYDY--VKSYGLEQEADYPYLAADGTCHRDKS 223

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQYNCDGSLAN 299
                    +T  TL PS+  +   +AT GP   +V+A   ++ Y  G++   C  SL  
Sbjct: 224 KIVAYVEDFHTVQTLSPSQ--LKAALATQGPASVSVDASGVFKNYQSGILNAGCGTSL-- 279

Query: 300 INHAVQIVGY 309
            NHA+  VGY
Sbjct: 280 -NHAILAVGY 288


>gi|82659048|gb|ABB88697.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 89/281 (31%), Positives = 142/281 (50%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWR+ G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           +Q  CG+CWAFS V + ES  AL    L+ LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHGLTALSEQQLVSCDDKDN-GCGGGLMLQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  +   E  YP +        C   +    G +I  Y   T+  SE+ +   +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSSGYVPECSNSSQLVPGARIDGYM--TIESSETVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296


>gi|328866896|gb|EGG15279.1| cysteine protease [Dictyostelium fasciculatum]
          Length = 347

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 96/312 (30%), Positives = 152/312 (48%), Gaps = 36/312 (11%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           I+A++ L  LA   K+S   ++     F  FQ +Y K Y   E   +F  F+ +L+ I+ 
Sbjct: 5   IIAILFLVALAAARKLSPEEIQ-----FRDFQVKYNKVYGSHEFSQKFVTFKDNLNRIDT 59

Query: 70  LNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
           LN N  +  S  ++G+ EF+DLS +EF+  ++ ++V   V        D+    +     
Sbjct: 60  LNANAAASGSDTKFGVNEFADLSVQEFRKFYM-NAVPASVPSDAQVAGDYSDETLAS--- 115

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                    IP   DWR  G +  V+NQ  CG+CW+FST    E    L   TL+ LS Q
Sbjct: 116 ---------IPSSFDWRTKGAVTPVKNQGQCGSCWSFSTTGNVEGQWFLAGNTLTGLSEQ 166

Query: 189 EVIDC-----AGNGNM----GCSGGDFCALLDWMDVNKVVLEPESEYPLL-LKDAACKRK 238
            ++DC       +G      GC+GG       ++ +    ++ E+ YP L +    C+ K
Sbjct: 167 NLVDCDHHCMTYDGQQSCDDGCNGGLQPNAFQYI-IGNGGIDTETSYPYLAVAQDKCQFK 225

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
           A++  G KI ++    L  +E+ I   +A +GPV  A +A  WQ+Y+GGV    C  +L 
Sbjct: 226 ASNI-GAKISNW--QMLSTNETQIAAYLALNGPVSIAADAAEWQFYIGGVFDLPCGKAL- 281

Query: 299 NINHAVQIVGYD 310
             +H + IVGYD
Sbjct: 282 --DHGILIVGYD 291


>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 330

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 103/313 (32%), Positives = 152/313 (48%), Gaps = 41/313 (13%)

Query: 6   NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSL 64
           N L IVAL+A C  A   + S    +     F  F Q Y K YS  EH + R   F+++L
Sbjct: 2   NKLIIVALLAACVFA---RFSTMQDQDIAAAFKKFTQTYNKKYSSEEHYNARLSIFKENL 58

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
             IE  NKN    + A++GIT+F+DL+ EEF   +L              +     N   
Sbjct: 59  RRIELFNKN----DEAQHGITQFADLTHEEFADMYL-------------GYKPQLRNSQA 101

Query: 125 KRSIT-TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK-NGTL 182
           K S++ T  T PT I    DW   G +  V+NQ +CG+CWAFST  + E  + L+    L
Sbjct: 102 KVSLSSTPFTAPTAI----DWTTKGAVTPVKNQGSCGSCWAFSTTGSIEGQYVLQLKQNL 157

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
           +  S Q+++DC    + GC+GG       +++  K  LE ES YP    D +CK    S 
Sbjct: 158 TSFSEQQLVDCDTKEDQGCNGGLMDNAFTYLESAK--LETESAYPYTAVDGSCKYN-QSL 214

Query: 243 NGVKIKSYT----CDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDG 295
             V + S+       T+  +E+++   +   GP+  A+NA   Q+Y GG+   +  N +G
Sbjct: 215 GVVGVASFVDIEQGKTVADTENTMGVALDNIGPLSVAINANNLQFYAGGISNPLICNPNG 274

Query: 296 SLANINHAVQIVG 308
               +NH V IVG
Sbjct: 275 ----LNHGVLIVG 283


>gi|394331814|gb|AFN27126.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 88/281 (31%), Positives = 143/281 (50%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWR+ G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPYAVDWRKKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           +Q  CG+CWAFS V + ES  AL    L+ LS Q+++ C  + + GC GG      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSEQQLVSC-DDKDSGCGGGLMLQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  +   E  YP +        C   +    G +I  Y   T+  SE+ +   +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSSGYVPECSNSSQLVPGARIDGYM--TIESSETVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYESGVLT-SCAGD--TLNHGVLLVGYN 296


>gi|394331822|gb|AFN27130.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 88/281 (31%), Positives = 143/281 (50%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWR+ G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPYAVDWRKKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           +Q  CG+CWAFS V + ES  AL    L+ LS Q+++ C  + + GC GG      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSEQQLVSC-DDKDSGCGGGLMLQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  +   E  YP +        C   +    G +I  Y   T+  SE+ +   +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSSGYVPECSNSSQLVPGARIDGYM--TIESSETVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYESGVLT-SCAG--ITLNHGVLLVGYN 296


>gi|394331816|gb|AFN27127.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 89/281 (31%), Positives = 143/281 (50%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWR+ G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           +Q  CG+CWAFS V + ES  AL    L+ LS Q+++ C    N GC+GG      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSEQQLVSCDDKDN-GCAGGLMLQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  +   E  YP +        C   +    G +I  Y   T+  SE+ +   +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSTGYVPECSNSSQLVPGARIDGYL--TIESSETVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296


>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
          Length = 884

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 90/284 (31%), Positives = 145/284 (51%), Gaps = 39/284 (13%)

Query: 36  LFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF +F +++ K+Y+ ++  + RFK F+++L IIEEL    +   +A YG+T F+DL+ +E
Sbjct: 578 LFEAFIKKFGKTYNSADEKLDRFKIFKQNLKIIEELQTFERG--TAEYGVTMFADLTPKE 635

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP------TGIPVKKDWREAG 148
           FK R+L        L    KH +              I +P        +P+K DWR+  
Sbjct: 636 FKARYLG-------LRPELKHENE-------------IPLPEAEIPDVSLPLKFDWRDHS 675

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
           ++  V++Q  CG+CWAFS     E  +A+K+  L  LS QE++DC  + + GC+GGD   
Sbjct: 676 VVTPVKDQGQCGSCWAFSVTGNVEGQYAIKHNQLLSLSEQELVDC-DSLDEGCNGGDMEN 734

Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
               ++     LE ES+YP   KD  C       N  K++  +   +   E  +   +  
Sbjct: 735 AYKAIE-RLGGLELESDYPYDAKDEKCHFLQ---NKAKVQVVSAVNITSDEKRMAQWLVK 790

Query: 269 HGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
           +GP+   +NA   Q+Y GGV   + + C+    N++H V IVGY
Sbjct: 791 NGPISVGINANAMQFYFGGVSHPLNFLCNPK--NLDHGVLIVGY 832


>gi|394331826|gb|AFN27132.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 143/281 (50%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWR+ G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           +Q  CG+CWAFS V + ES  AL    L+ LS Q+++ C    N GCSGG      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHGLTALSEQQLVSCDDKDN-GCSGGLMLQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  +   E  YP +        C   +    G +I+ Y   T+  SE+     +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSSGYVPECSNSSQLVPGARIEGYM--TIESSETVKGAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296


>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
           kowalevskii]
          Length = 352

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 97/286 (33%), Positives = 146/286 (51%), Gaps = 30/286 (10%)

Query: 29  NLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++  +LF  F + Y K Y ++ EH +R++ F+ +L   E L +  Q+  + +YG+T+F
Sbjct: 46  SVDKTQDLFQDFMKTYDKKYDTEEEHQLRYQIFQDNLLKAERLQQTEQA--TGQYGVTKF 103

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
            DLSEEEF+         K+ L    +  D H   +KK  I  G       P   DWR+A
Sbjct: 104 MDLSEEEFR---------KYYLTPVWRGSDPH---MKKAEIPKGTP-----PAAFDWRDA 146

Query: 148 --GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGG- 204
               + KV+NQ TCG+CWAFST    E    +K GTL  LS QE++DC    + GC+GG 
Sbjct: 147 DKNAVTKVKNQGTCGSCWAFSTTGNIEGQWKIKKGTLVSLSEQELVDCD-KLDQGCNGGL 205

Query: 205 DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
              A  + M    ++   E +YP   +D  CK  AT  N V I       +   E  + +
Sbjct: 206 PSNAYQEIMRFGGIM--SEDDYPYTGRDQDCKLNATL-NKVYINGSM--NISKDEGDMAS 260

Query: 265 DIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
            +A +GP+   +NA   Q+Y GGV   +    +  N++H V IVGY
Sbjct: 261 WLAANGPISIGINANAMQFYFGGVSHPWKIFCNPENLDHGVLIVGY 306


>gi|6649577|gb|AAF21462.1|U69121_1 cysteine proteinase PWCP2 [Paragonimus westermani]
          Length = 260

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 94/278 (33%), Positives = 135/278 (48%), Gaps = 28/278 (10%)

Query: 35  ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           EL+  F++ Y K Y+  +   RF  F+ +L   ++L    Q   +ARYG+T+FSDL+ EE
Sbjct: 4   ELYEQFKRXYGKVYANEDDQKRFAIFKDNLMRAQKLQLKDQG--TARYGVTQFSDLTPEE 61

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  ++L   VN               N   KR   TG+      P + DWR  G +  V 
Sbjct: 62  FAAKYLSAPVN---------------NDQVKRVRPTGLK---AAPERIDWRAKGAVTAVE 103

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGG-DFCALLDWM 213
           NQ +CG+CWAFST    E    +K G L  LS Q+++DC    + GC+GG    + L+ M
Sbjct: 104 NQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD-GCNGGWPASSYLEIM 162

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
            +    LE + +YP       C  +      +  K      L PSE      +A HGP+ 
Sbjct: 163 HMGG--LESQDDYPYAGVKEQCFMEKER---LLAKIDDSIALXPSEDDNAAYLAEHGPLS 217

Query: 274 AAVNALTWQYYLGGVIQYNCDG-SLANINHAVQIVGYD 310
             +NA+T QYY  G+I  +    S  ++NHAV  VGYD
Sbjct: 218 TLLNAITLQYYQSGIIHPSYXXCSPVDLNHAVLTVGYD 255


>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
          Length = 324

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 90/292 (30%), Positives = 145/292 (49%), Gaps = 25/292 (8%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESA 80
           V VS   L++    F SF+ ++ K+Y +++E   RF  F ++L  IE  N   +Q   S 
Sbjct: 12  VAVSATLLKEDGAHFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSY 71

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
             GI +F+D++  EFK         K  +++            K   +  G+++P  I  
Sbjct: 72  TQGINKFADMTRAEFKAMLATQVKTKPSIVA-----------TKTFQLADGVSVPESI-- 118

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
             DWR   ++  +++Q  CG+CWAF+ V + E  +AL  G L+  S Q+++DC  + N G
Sbjct: 119 --DWRSRNVVTPIKDQAQCGSCWAFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYG 176

Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
           C GG       ++  N   LE ES+YP    D  C  + +S    K+ SY   ++  +E 
Sbjct: 177 CDGGYLDDTFPYIQTNG--LELESDYPYTGYDGYCSYE-SSKVVTKVSSYV--SVPANEQ 231

Query: 261 SILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLANINHAVQIVGYDN 311
           ++L  + T GPV  A+NA   Q+Y  G+I    CD     ++H V  VGYD+
Sbjct: 232 ALLEAVGTAGPVAIAINADDLQFYFSGIIDDKYCDPEY--LDHGVLAVGYDS 281


>gi|332326587|gb|AEE42617.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 89/281 (31%), Positives = 143/281 (50%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           +Q  CG+CWAFS V   ES  A+    L+ LS Q+++ C  + + GC+GG      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGNIESQWAVAGHRLTALSEQQLVSC-DDKDSGCNGGLMTQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  +L  E  YP +        C   +    G +I  Y   T+  SE+ +   +A  
Sbjct: 202 RNMNGTMLT-EDSYPYVSSTGDVPECTNSSQLVPGARIDGYV--TIESSETVMAAWLAKS 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYESGVLT-SCAGDA--LNHGVLLVGYN 296


>gi|293334761|ref|NP_001168296.1| uncharacterized protein LOC100382061 [Zea mays]
 gi|223947281|gb|ACN27724.1| unknown [Zea mays]
          Length = 322

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 89/289 (30%), Positives = 142/289 (49%), Gaps = 26/289 (8%)

Query: 34  LELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           ++ F ++Q  Y +SY + +E   RF+ + +++++IE  N  R++  S +   T F+DL+ 
Sbjct: 4   MDRFRAWQATYNRSYLTAAERLRRFEVYRQNMELIEATN--RRAELSYQLSETPFTDLTS 61

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV----------KKRSITTGITIPTGIPVKK 142
           EEF   H   S   H   +  +H +    H            +R+ TT + +P  +    
Sbjct: 62  EEFLATHT-MSTRLHASEAARRHRELITTHAGPVSDGGRQWNRRNYTTDLDVPESV---- 116

Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCS 202
           DWR  G +  V++Q  CG CW+F+TV   E +H ++ G L  LS QEV+DC+   N GC 
Sbjct: 117 DWRTKGAVTTVKDQGACGGCWSFATVAAIEGLHKIRTGQLVSLSEQEVLDCSSPPNNGCH 176

Query: 203 GGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI 262
           GG+  A +DW+  N   L  ES+YP   +   CK      +  KI+      L+   +  
Sbjct: 177 GGNPAAAIDWVSANG-GLTTESDYPYEGRQGKCKLDKARNHVAKIRG---RKLVDQNNEA 232

Query: 263 LTDIA-THGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             ++A    PV   +N     Q+Y  GV    CD    ++NHAV +VGY
Sbjct: 233 ALEVAVAQQPVAVGMNVHPIQQHYKSGVFHGPCDPE--DLNHAVTMVGY 279


>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 92/287 (32%), Positives = 147/287 (51%), Gaps = 31/287 (10%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++ +ELF S+  R+ K Y   E  + RF+ F+ +L  I+E NK      +   G+ EF
Sbjct: 39  SMDKLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNK---IVSNYWLGLNEF 95

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DLS +EFK ++L   VN    +S  +   +      +            +P   DWR+ 
Sbjct: 96  ADLSHQEFKNKYLGLKVN----LSQRRESSNEEEFTYR---------DVDLPKSVDWRKK 142

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG   
Sbjct: 143 GAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGG--- 199

Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
            L+D+     V    L  E +YP +++++ C+ K      V I  Y  D    +E S+L 
Sbjct: 200 -LMDYAFSFIVQNGGLHKEDDYPYIMEESTCEMKKEETQVVTINGYH-DVPQNNEQSLLK 257

Query: 265 DIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +A   P+  A+ A +  +Q+Y GGV   +C    ++++H V  VGY
Sbjct: 258 ALANQ-PLSVAIEASSRDFQFYSGGVFDGHCG---SDLDHGVSAVGY 300


>gi|154332647|ref|XP_001562140.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134059588|emb|CAM37170.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 441

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 91/280 (32%), Positives = 141/280 (50%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F+Q Y++ Y+   E   R  NF+++L+++ E   N      AR+GIT+F DLSEEE
Sbjct: 37  LFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANN---PHARFGITKFFDLSEEE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F TR+L  + +        K    H+  V       G  + T  P   DWRE G +  V+
Sbjct: 94  FATRYLSGATH---FAKAKKFASQHYRKV-------GADLSTA-PAAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS +   ES   L   +L  LS QE++ C  + + GC+GG      DW+ 
Sbjct: 143 DQGMCGSCWAFSAIGNIESQWYLATHSLISLSEQELVSC-DDVDEGCNGGLMLQAFDWLL 201

Query: 215 VNK-VVLEPESEYPLLLKDAA---CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N+   +     YP +  + +   C   +    G  I  +   T+  +E ++   +A +G
Sbjct: 202 NNRNGAVYTGVSYPYVSGNGSVPECSESSDLVIGAYIDGHV--TIESNEDTMAAWLAANG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  AV+A  +  Y GGV+  +CDG    +NH V +VGY+
Sbjct: 260 PIAIAVDASAFMSYTGGVLT-SCDGK--QLNHGVLLVGYN 296


>gi|154332649|ref|XP_001562141.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134059589|emb|CAM37171.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 441

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 91/280 (32%), Positives = 141/280 (50%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F+Q Y++ Y+   E   R  NF+++L+++ E   N      AR+GIT+F DLSEEE
Sbjct: 37  LFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANN---PHARFGITKFFDLSEEE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F TR+L  + +        K    H+  V       G  + T  P   DWRE G +  V+
Sbjct: 94  FATRYLSGATH---FAKAKKFASQHYRKV-------GADLSTA-PAAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS +   ES   L   +L  LS QE++ C  + + GC+GG      DW+ 
Sbjct: 143 DQGMCGSCWAFSAIGNIESQWYLATHSLISLSEQELVSC-DDVDEGCNGGLMLQAFDWLL 201

Query: 215 VNK-VVLEPESEYPLLLKDAA---CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N+   +     YP +  + +   C   +    G  I  +   T+  +E ++   +A +G
Sbjct: 202 NNRNGAVYTGVSYPYVSGNGSVPECSESSDLVIGAYIDGHV--TIESNEDTMAAWLAANG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  AV+A  +  Y GGV+  +CDG    +NH V +VGY+
Sbjct: 260 PIAIAVDASAFMSYTGGVLT-SCDGK--QLNHGVLLVGYN 296


>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
          Length = 322

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 99/310 (31%), Positives = 153/310 (49%), Gaps = 43/310 (13%)

Query: 8   LFIV---ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
           LF V   ALI  C +A+P        +   EL+  F++ Y K Y+  +   RF  F+ +L
Sbjct: 3   LFTVSCFALIVSCAVAVP--------DSARELYEQFKRDYGKVYANEDDQKRFAIFKDNL 54

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
              ++L    Q   +ARYG+T+FSDL+ EEF  ++L   +N   +               
Sbjct: 55  VRAQKLQLRDQG--TARYGVTQFSDLTPEEFAAKYLSPPLNSDQV--------------- 97

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
           +R   TG+      P + DWR  G +  V NQ  CG+CWAFST    E    +K G L  
Sbjct: 98  ERVQPTGLK---AAPERMDWRAKGAVTPVENQGECGSCWAFSTAGNVEGQWFIKTGQLVS 154

Query: 185 LSVQEVIDCAGNGNMGCSGG-DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           LS Q+++DC      GC+GG    + L+ MD+    LE E++YP +  +  C     + N
Sbjct: 155 LSKQQLVDCDMAAE-GCNGGWPSSSYLEIMDMGG--LESENDYPYVGVEQTC-----ALN 206

Query: 244 GVKIKSYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANI 300
             K+ +   D ++   SE+  +  +A HGP+   +NA+  Q+Y  G++   + D    ++
Sbjct: 207 KEKLVAKIDDAVVLGASENEHVDYLAEHGPLSTLLNAVALQHYQSGILHPSHKDCPDDDL 266

Query: 301 NHAVQIVGYD 310
           NHAV  VGYD
Sbjct: 267 NHAVLTVGYD 276


>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
          Length = 502

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 91/300 (30%), Positives = 144/300 (48%), Gaps = 20/300 (6%)

Query: 22  PVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELN-KNRQSPES 79
           P  V    +E   ELF  + ++++K Y+   E   R+ NF  +L  + + N + R++P S
Sbjct: 36  PEDVGAGGVEGGQELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSS 95

Query: 80  AR-YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
            +  G+  F+DLS EEF     R   +  VL               +  +  G   P  +
Sbjct: 96  GQGVGMNVFADLSNEEF-----REVYSSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASL 150

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
               DWR+ G +  V+NQ  CG+CWAFS+    E ++A+  G L  LS QE++DC    N
Sbjct: 151 ----DWRKRGAVTAVKNQGDCGSCWAFSSTGAMEGINAITTGELISLSEQELVDCD-TTN 205

Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK-DAACKRKATSPNGVKIKSYTCDTLIP 257
            GC GG      +W+ +N   ++ E+ YP   + D+ C         V I  Y  + +  
Sbjct: 206 EGCDGGYMDYAFEWV-INNGGIDSEANYPYTGQADSVCNTTKEEIKVVSIDGY--EDVAT 262

Query: 258 SESSILTDIATHGPVIAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
           SES++L   A   PV   ++  +L +Q Y GG+   +C G+  +I+HAV +VGY     T
Sbjct: 263 SESALLC-AAVQQPVSVGIDGSSLDFQLYAGGIYDGDCSGNPDDIDHAVLVVGYGQQGGT 321


>gi|398010921|ref|XP_003858657.1| cathepsin L-like protease, partial [Leishmania donovani]
 gi|322496866|emb|CBZ31937.1| cathepsin L-like protease, partial [Leishmania donovani]
          Length = 345

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 141/281 (50%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           NQ  CG+CWAFS V   ES  A     L  LS Q+++ C    N GC+GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201

Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
            +   ++  E  YP    +   A C   +    G +I  Y    +IPS  +++   +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWLAEN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPIAIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296


>gi|66803062|ref|XP_635374.1| cysteine protease [Dictyostelium discoideum AX4]
 gi|60463697|gb|EAL61879.1| cysteine protease [Dictyostelium discoideum AX4]
          Length = 352

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 97/326 (29%), Positives = 152/326 (46%), Gaps = 55/326 (16%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
           + F++ L AL         +   L  +   F +FQ +Y K YS  E+ ++F+ F+ +L  
Sbjct: 5   LFFVLMLTAL--------AAGRRLSVEESQFIAFQNKYNKIYSAEEYLVKFETFKSNLLN 56

Query: 67  IEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHS---VNKHVLMSHHKHHDHHHNH 122
           I+ LNK   +  S  ++G+ +F+DLS+EEFK  +L      +   + M  +   D     
Sbjct: 57  IDALNKQATTIGSDTKFGVNKFADLSKEEFKKYYLSSKEARLTDDLPMLPNLSDD----- 111

Query: 123 VKKRSITTGITIPTGIPVKKDWREAG---------IIGKVRNQQTCGACWAFSTVETAES 173
                      I +  P   DWR  G          +  V+NQ  CG+CW+FST    E 
Sbjct: 112 -----------IISATPAAFDWRNTGGSTKFPQGTPVTAVKNQGQCGSCWSFSTTGNVEG 160

Query: 174 MHALKNGTLSLLSVQEVIDC---------AGNGNMGCSGGDFCALLDWMDVNKVVLEPES 224
            H L  GTL  LS Q ++DC             N GC GG      +++  N  + + E+
Sbjct: 161 QHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDGGLQPNAYNYIIKNGGI-QTEA 219

Query: 225 EYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SESSILTDIATHGPVIAAVNALTWQY 283
            YP    D  CK  +    G KI S+   T++P +E+ I + +  +GP+  A +A  WQ+
Sbjct: 220 TYPYTAVDGECKFNSAQV-GAKISSF---TMVPQNETQIASYLFNNGPLAIAADAEEWQF 275

Query: 284 YLGGVIQYNCDGSLANINHAVQIVGY 309
           Y+GGV  + C  +L   +H + IVGY
Sbjct: 276 YMGGVFDFPCGQTL---DHGILIVGY 298


>gi|357619725|gb|EHJ72184.1| hypothetical protein KGM_03271 [Danaus plexippus]
          Length = 338

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 91/281 (32%), Positives = 143/281 (50%), Gaps = 21/281 (7%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
           +LE+   LF  F + Y K Y +SE + RFK F  +L  I  +N   +   +A YGI +FS
Sbjct: 33  SLEEAPTLFEQFIKDYNKEYDESEKEERFKIFVNNLKDINAMN---ERSSNAVYGINKFS 89

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
           DLS+EEF          K+      +    + +H KK  +     +    P + DWR+ G
Sbjct: 90  DLSKEEFI---------KYYTGLKREESPSNEDH-KKTDLPESFNVTA--PDQFDWRKKG 137

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
           ++  ++NQ+ CG+CWAFS     ES+HA+K G L  +S Q+++DC    + GCSGG    
Sbjct: 138 VVSSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDC-DKYDSGCSGGLPWD 196

Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
            L +   N  +      YP + K+  C R  +S   +++K Y   + I SE  I   +  
Sbjct: 197 ALRYFVANGAM--SLKSYPYVAKEGKC-RYDSSKVEIRLKGYKIFSKI-SEDQIKEHLYN 252

Query: 269 HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            GP+  A++    + Y+GG++   C   +  +NHAV +VGY
Sbjct: 253 IGPLSIAIDVSPIKPYVGGIVMEECH-EVCQVNHAVLLVGY 292


>gi|356557743|ref|XP_003547170.1| PREDICTED: LOW QUALITY PROTEIN: xylem cysteine proteinase 1-like
           [Glycine max]
          Length = 400

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 87/286 (30%), Positives = 140/286 (48%), Gaps = 24/286 (8%)

Query: 28  PNLEQKLELFSSFQQRYKKSYSKSEHD-IRFKNFEKSLDIIEELNKNRQSPESARYGITE 86
           P+ E  +ELF  +++  KK Y   E + +RF+NF+++L  I E N  R SP     G+ +
Sbjct: 41  PSEEGVVELFQRWKEENKKIYRNPEEEKLRFENFKRNLKYIVEKNSKRISPYGQSLGLNQ 100

Query: 87  FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
           F+D+S EEFK++           MS  K      N V  +  +         P   DWR+
Sbjct: 101 FADMSNEEFKSK----------FMSKVKKPFSKRNGVSSKDHSC-----EDEPYSLDWRK 145

Query: 147 AGIIG-KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
            G++   V++Q  CG+ WAFS+ +  E ++A+    L  LS QE++DC    N GC GG 
Sbjct: 146 KGVVTLAVKDQGYCGSYWAFSSTDAIEGINAIVTADLISLSEQELVDCDST-NDGCDGGX 204

Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
                +W+  N  + + E+ YP +  D  C         + I  Y    +  S+SS+L  
Sbjct: 205 MDYAFEWVMYNGGI-DTETNYPYIGADGTCNVTKEKTKVIGIDGYY--DVGQSDSSLLCA 261

Query: 266 IATHGPVIAAVNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
                P+ A ++  +W  Q Y+GG+   +C     +I+HA+ +VGY
Sbjct: 262 TVKQ-PISAGIDGTSWDFQLYIGGIYDGDCSSDPDDIDHAILVVGY 306


>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 100/309 (32%), Positives = 150/309 (48%), Gaps = 32/309 (10%)

Query: 7   VLFIVALIALCFLAIPVKVSK-PNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSL 64
            LFI   IA  F  +        ++++ +ELF S+  ++ K+Y   E  + RF+ F  +L
Sbjct: 16  TLFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYRSIEEKLHRFEIFLDNL 75

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
             I+E NK      S   G+ EF+DLS EEFK+++L   V                   +
Sbjct: 76  KHIDETNKK---VSSYWLGLNEFADLSHEEFKSKYLGLRVE----------------FPR 116

Query: 125 KRSITTGITIP--TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
           KRS + G +      +P   DWR  G +  V+NQ +CG+CWAFSTV   E ++ +  G L
Sbjct: 117 KRS-SRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 175

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
           + LS QE+IDC  + N GC GG       ++  N   L  E +YP L+++  C R+    
Sbjct: 176 TSLSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNS-GLRKEEDYPYLMEEGRCIREKEQF 234

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANI 300
             V I  Y  D     E S+L  + +H PV  A+ A +  +Q+Y GG+    C      +
Sbjct: 235 EVVTISGYE-DVPANDEQSLLKAL-SHQPVSVAIEASSRNFQFYKGGIFTGRCG---TQM 289

Query: 301 NHAVQIVGY 309
           +H V  VGY
Sbjct: 290 DHGVTAVGY 298


>gi|330842703|ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
 gi|325076376|gb|EGC30167.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
          Length = 352

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 95/311 (30%), Positives = 150/311 (48%), Gaps = 24/311 (7%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLE-----LFSSFQQRYKKSYSKSEH-DIRFKNFEKS 63
           ++ +    F+A  V ++  N  + ++     LF  + ++  K Y  SE  + RF NF+ +
Sbjct: 7   LIIIFCFVFVAQSVNININNAYRTIDGPSKDLFHHWTKQNGKIYETSEEFEKRFSNFKTN 66

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
           L  IE LN   +    A +G+ ++SDLSEEEF   +L     K+      +  D+     
Sbjct: 67  LKKIENLNNLHKG--KASFGMNKYSDLSEEEFSNFYLM----KNFKGKPEEERDYIKKPE 120

Query: 124 KKRSITTGITIPTGIPVKK----DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
              S   G  + T   +K     DWR  G++  V++Q  CG+C+ FS  E  ES +    
Sbjct: 121 NPSSNLIGGYLNTDDGLKAMYQVDWRNKGLVTPVKDQGQCGSCYIFSATEQIESEYIRAG 180

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
               LLS Q+ +DC    + GC GGD   + +++ ++   +  E +YP   +D  C    
Sbjct: 181 HKAILLSEQQSVDCD-TMDGGCGGGDPANVYNYI-ISAGGVSTEKDYPYTAQDGTC---F 235

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            +   V I  +   T    E +++T IA HGPV   V+A TWQ Y GG+I   C+    N
Sbjct: 236 NTTRAVSITGFQYVTQNSDEDTLITTIANHGPVSICVDASTWQSYTGGIITTGCE---QN 292

Query: 300 INHAVQIVGYD 310
           I+H VQ+VG D
Sbjct: 293 IDHCVQVVGLD 303


>gi|330800456|ref|XP_003288252.1| hypothetical protein DICPUDRAFT_55299 [Dictyostelium purpureum]
 gi|325081708|gb|EGC35214.1| hypothetical protein DICPUDRAFT_55299 [Dictyostelium purpureum]
          Length = 531

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 92/297 (30%), Positives = 153/297 (51%), Gaps = 30/297 (10%)

Query: 19  LAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSP 77
           L   +KV + +L++K   F +F+  Y+KSY +K EHD+RFKN++ + + I   N    S 
Sbjct: 210 LGDSLKVKESDLQEK---FVAFKSEYEKSYENKEEHDMRFKNYKVAHNKIVSHNAKNLS- 265

Query: 78  ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG 137
              + G   ++DLS+ EF T  ++  V +      H  HD    +               
Sbjct: 266 --YKLGFNHYADLSDHEFNTL-IKPKVARPSNNGAHSVHDDEDIYT-------------- 308

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG-N 196
           IP   DWR    +  V++Q  CG+CW F +  + E  + + NG L  LS Q+++DCA   
Sbjct: 309 IPQSVDWRNQKCVTPVKDQGVCGSCWTFGSTGSLEGTNCVTNGYLVSLSEQQLVDCAYLM 368

Query: 197 GNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
           G+ GC+GG   +   + MD   +    ES+Y  L+++A CK K+T+ +GV + SY  +  
Sbjct: 369 GSQGCNGGFAASAFQYIMDAGGIAT--ESDYQYLMQNALCKDKSTTFSGVGVSSYV-NVT 425

Query: 256 IPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQY-NCDGSLANINHAVQIVGY 309
             S +++L  +AT GPV  A++A    ++YY  G+    +C     +++H V  +GY
Sbjct: 426 AGSINALLNAVATQGPVAIAIDASVDDFRYYQSGIYSNPSCKNGPDDLDHEVLAIGY 482


>gi|118399607|ref|XP_001032128.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89286466|gb|EAR84465.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 336

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 95/309 (30%), Positives = 150/309 (48%), Gaps = 23/309 (7%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRY--KKSY-SKSEHDIRFKNFEKSLDI 66
           I+ALI L  +  P+     N      LF+  + RY  K++Y S  E   R + F ++ + 
Sbjct: 3   IIALITLLLVVSPIIADSTNNFSVSALFAYNKWRYANKRTYFSLEEQQFRQQIFFETHER 62

Query: 67  IEELNKNRQSPESA-RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
           I+  N N   PE+  +    +FSD+ +EEF +R L  S     L+  +     ++N   +
Sbjct: 63  IQNHNSN---PEATYKLAHNQFSDMPQEEFASRVLMKSSQ---LIPRNAVQAQNNNSTTQ 116

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
           +     + +P       DWR+ GI+  V++Q  CG+CWAFST    E+++ ++N      
Sbjct: 117 QHTAQDVQLPASF----DWRDYGILSDVKDQGQCGSCWAFSTTGILEALYFMENRQKISF 172

Query: 186 SVQEVIDCAGNGN----MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
           S Q+++DCA N N     GCSGG     L +  V K  +  E +YP L  D+ CK  + +
Sbjct: 173 SEQQLVDCATNSNGFNSYGCSGGWPEEALKY--VAKFGILKEEQYPYLAVDSKCKVSSPT 230

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANIN 301
            +G K++S+     I   +  L +     PV   V+A TW  Y  GV     +    N+N
Sbjct: 231 SDGFKVQSF---YFIDKTADALKNTVARIPVSVLVDASTWGSYSSGVYNGCGNTQTYNLN 287

Query: 302 HAVQIVGYD 310
           HAV  +GYD
Sbjct: 288 HAVVAIGYD 296


>gi|84028184|sp|Q9R014.2|CATJ_MOUSE RecName: Full=Cathepsin J; AltName: Full=Cathepsin L-related
           protein; AltName: Full=Cathepsin P; AltName:
           Full=Catlrp-p; Flags: Precursor
 gi|5306071|gb|AAD41898.1|AF158182_1 preprocathepsin P [Mus musculus]
 gi|12838143|dbj|BAB24099.1| unnamed protein product [Mus musculus]
 gi|74199838|dbj|BAE20748.1| unnamed protein product [Mus musculus]
 gi|74355544|gb|AAI03770.1| Cathepsin J [Mus musculus]
 gi|148709363|gb|EDL41309.1| cathepsin J, isoform CRA_a [Mus musculus]
          Length = 334

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 99/304 (32%), Positives = 154/304 (50%), Gaps = 30/304 (9%)

Query: 11  VALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           V L+ LCF +A   +   P L+ +   +  ++ +Y KSYS  E  +R   +E+++ +I+ 
Sbjct: 5   VLLLILCFGVASGAQAHDPKLDAE---WKDWKTKYAKSYSPKEEALRRAVWEENMRMIKL 61

Query: 70  LNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            NK N     +    + +F D + EEF     R S++ ++ +       H  NHV     
Sbjct: 62  HNKENSLGKNNFTMKMNKFGDQTSEEF-----RKSID-NIPIPAAMTDPHAQNHVS---- 111

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                   G+P  KDWRE G +  VRNQ  CG+CWAF+     E     K G L+ LSVQ
Sbjct: 112 -------IGLPDYKDWREEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQ 164

Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
            ++DC+   GN GC  G      +++  NK  LE E+ YP   KD  C+ ++ + +   I
Sbjct: 165 NLLDCSKTVGNKGCQSGTAHQAFEYVLKNK-GLEAEATYPYEGKDGPCRYRSENAS-ANI 222

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQ 305
             Y    L P+E  +   +A+ GPV AA++A   ++++Y GG I Y  + S   +NHAV 
Sbjct: 223 TDYV--NLPPNELYLWVAVASIGPVSAAIDASHDSFRFYNGG-IYYEPNCSSYFVNHAVL 279

Query: 306 IVGY 309
           +VGY
Sbjct: 280 VVGY 283


>gi|339896953|ref|XP_003392238.1| cathepsin L-like protease [Leishmania infantum JPCM5]
 gi|14349351|gb|AAC38832.2| cysteine protease [Leishmania chagasi]
 gi|17384031|emb|CAD12393.1| cysteine proteinase [Leishmania infantum]
 gi|321398984|emb|CBZ08377.1| cathepsin L-like protease [Leishmania infantum JPCM5]
          Length = 443

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 141/281 (50%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           NQ  CG+CWAFS V   ES  A     L  LS Q+++ C    N GC+GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201

Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
            +   ++  E  YP    +   A C   +    G +I  Y    +IPS  +++   +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWLAEN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPIAIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296


>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
 gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
          Length = 326

 Score =  134 bits (336), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 95/302 (31%), Positives = 145/302 (48%), Gaps = 24/302 (7%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELN 71
            + LC L    +       Q  E + +++  + KSYS   E   R   ++++L+ I+  N
Sbjct: 4   FLVLCVLVASSRGWSVRFGQDSE-WVAWKSYHGKSYSDVHEERTRMAIWQQNLEKIKRHN 62

Query: 72  KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
               S + A   +    DL+E+EF+  +L                  HHN  K+   T  
Sbjct: 63  AEDHSYKMA---MNHLGDLTEDEFRYFYLGVRA--------------HHNSTKRGWATYM 105

Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
                 IP   DW + G +  V+NQ  CG+CWAFST  + E  H  K G+L  LS Q +I
Sbjct: 106 PPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLI 165

Query: 192 DCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
           DC+G+ GN GC GG       +++ N  + + ES YP L +  +C   ++S  G ++  Y
Sbjct: 166 DCSGSYGNNGCQGGLMDNAFRYIESNGGI-DTESSYPYLGQQGSC-HFSSSHVGARVTGY 223

Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
             D    SE ++ + +AT GPV  AV+A  WQ+Y  GV   N   S   ++H V ++GY 
Sbjct: 224 Q-DIPQGSEQALQSAVATVGPVSVAVDASQWQFYSSGVYD-NPYCSSTQLDHGVLVIGYG 281

Query: 311 NY 312
           NY
Sbjct: 282 NY 283


>gi|154332645|ref|XP_001562139.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134059587|emb|CAM37169.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 441

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 90/280 (32%), Positives = 142/280 (50%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F+Q Y++ Y+   E   R  NF+++L+++ E   N      AR+GIT+F DLSEEE
Sbjct: 37  LFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANN---PHARFGITKFFDLSEEE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F TR+L  + +        K    ++  V       G  + T  P   DWRE G +  V+
Sbjct: 94  FATRYLSGATH---FAKAKKFASQYYRKV-------GADLSTA-PAAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS +   ES   L   +L  LS QE++ C  + + GC+GG      DW+ 
Sbjct: 143 DQGMCGSCWAFSAIGNIESKWYLATHSLISLSEQELVSC-DDVDEGCNGGLMLQAFDWLL 201

Query: 215 VNK-VVLEPESEYPLLLKDAA---CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N+   +   + YP +  + +   C   +    G  I  +   T+  +E ++   +A +G
Sbjct: 202 NNRNGAVYTGASYPYVSGNGSVPECSESSDLVIGAYIDGHV--TIESNEDTMAAWLAANG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  AV+A  +  Y GGV+  +CDG    +NH V +VGY+
Sbjct: 260 PIAIAVDASAFMSYTGGVLT-SCDGK--QLNHGVLLVGYN 296


>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
          Length = 475

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 101/289 (34%), Positives = 145/289 (50%), Gaps = 31/289 (10%)

Query: 28  PNLEQKLELFSSFQQ---RYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
           P LE+ +EL   F++   +Y K YS + E D R   F ++L   E+L    Q   SA YG
Sbjct: 165 PPLEESVELLGQFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQG--SAEYG 222

Query: 84  ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKD 143
           +T+FSDL+EEEF++ +L   +++  L          H  +K  S   G       P   D
Sbjct: 223 VTKFSDLTEEEFRSTYLNPLLSQWTL----------HRPMKPASPAKGPA-----PASWD 267

Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSG 203
           WR+ G +  V+NQ  CG+CWAFS     E    LKNGTL  LS QE++DC G  +  C+G
Sbjct: 268 WRDHGAVSSVKNQGMCGSCWAFSVTGNIEGQWFLKNGTLVSLSEQELVDCDGL-DQACNG 326

Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
           G      + ++     LE E++Y  + K  +C   AT      I S     L   E  I 
Sbjct: 327 GLPSNAYEAIE-KLGGLETETDYSYIGKKQSCDF-ATKKVAAYINSSV--ELSKDEKEIA 382

Query: 264 TDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
             +A +GPV  A+NA   Q+Y  GV   ++  C+  +  I+HAV +VGY
Sbjct: 383 AWLAENGPVSVALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLMVGY 429


>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
          Length = 475

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 101/289 (34%), Positives = 145/289 (50%), Gaps = 31/289 (10%)

Query: 28  PNLEQKLELFSSFQQ---RYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
           P LE+ +EL   F++   +Y K YS + E D R   F ++L   E+L    Q   SA YG
Sbjct: 165 PPLEESVELLGQFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQG--SAEYG 222

Query: 84  ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKD 143
           +T+FSDL+EEEF++ +L   +++  L          H  +K  S   G       P   D
Sbjct: 223 VTKFSDLTEEEFRSTYLNPLLSQWTL----------HRPMKPASPAKGPA-----PASWD 267

Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSG 203
           WR+ G +  V+NQ  CG+CWAFS     E    LKNGTL  LS QE++DC G  +  C+G
Sbjct: 268 WRDHGAVSSVKNQGMCGSCWAFSVTGNIEGQWFLKNGTLVSLSEQELVDCDGL-DQACNG 326

Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
           G      + ++     LE E++Y  + K  +C   AT      I S     L   E  I 
Sbjct: 327 GLPSNAYEAIE-KLGGLETETDYSYIGKKQSCDF-ATKKVAAYINSSV--ELSKDEKEIA 382

Query: 264 TDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
             +A +GPV  A+NA   Q+Y  GV   ++  C+  +  I+HAV +VGY
Sbjct: 383 AWLAENGPVSVALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLMVGY 429


>gi|15824693|gb|AAL09444.1| cysteine protease [Leishmania donovani]
          Length = 394

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 89/281 (31%), Positives = 140/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           NQ  CG+CWAFS V   ES  A     L  LS Q+++ C    N GC+GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201

Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
            +   ++  E  YP    +   A C   +    G +I  Y    +IPS  +++   +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGARIDGY---VMIPSNETVMAAWLAEN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+   V+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPIAIGVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296


>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
           Precursor
 gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
 gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
 gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 355

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 90/287 (31%), Positives = 139/287 (48%), Gaps = 31/287 (10%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           N ++ LELF S+   + K+Y   E  + RF+ F ++L  I++ N    S      G+ EF
Sbjct: 43  NTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS---YWLGLNEF 99

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DL+ EEFK R+L       +             + + R IT        +P   DWR+ 
Sbjct: 100 ADLTHEEFKGRYL------GLAKPQFSRKRQPSANFRYRDITD-------LPKSVDWRKK 146

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V++Q  CG+CWAFSTV   E ++ +  G LS LS QE+IDC    N GC+GG   
Sbjct: 147 GAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGG--- 203

Query: 208 ALLDWMD---VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
            L+D+     ++   L  E +YP L+++  C+ +      V I  Y  + +  ++   L 
Sbjct: 204 -LMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGY--EDVPENDDESLV 260

Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               H PV  A+ A    +Q+Y GGV    C     +++H V  VGY
Sbjct: 261 KALAHQPVSVAIEASGRDFQFYKGGVFNGKCG---TDLDHGVAAVGY 304


>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 351

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 95/314 (30%), Positives = 155/314 (49%), Gaps = 37/314 (11%)

Query: 11  VALIALCFLAIPVKVSKPNL-----------EQKLELFSSFQQRYKKSYSKSEHDI-RFK 58
           VA++ LC  A   + S  ++           ++ +ELF  +  +++K+Y+  E  + RF+
Sbjct: 7   VAVLLLCVGACVARNSDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEEKLHRFE 66

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
            F+ +L +I+E+N+   S      G+ EF+DL+ +EFKT +L   ++             
Sbjct: 67  VFKDNLKLIDEINREVTS---YWLGLNEFADLTHDEFKTTYL--GLSPPPARRSSSRSFR 121

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
           + N                +P   DWR+ G +  V+NQ  CG+CWAFSTV   E ++A+ 
Sbjct: 122 YEN-----------VAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIV 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC-KR 237
            G L+ LS QE+IDC+ +GN GC+GG       ++  +   L  E  YP L+++ +C   
Sbjct: 171 TGNLTALSEQELIDCSVDGNSGCNGGMMDYAFSYI-ASSGGLHTEEAYPYLMEEGSCGDG 229

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDG 295
           K +    V I  Y  D     E +++  +A H PV  A+ A    +Q+Y GGV    C  
Sbjct: 230 KKSESEAVSISGYE-DVPTKDEQALIKALA-HQPVSVAIEASGRHFQFYSGGVFDGPCG- 286

Query: 296 SLANINHAVQIVGY 309
             A ++H V  VGY
Sbjct: 287 --AQLDHGVAAVGY 298


>gi|394331818|gb|AFN27128.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 88/281 (31%), Positives = 143/281 (50%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWR+ G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           +Q  CG+CWAFS V + ES  AL    L+ LS   ++ C  + N GC+GG      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSEHHLVSCH-DKNSGCTGGLMLQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  +   E  YP +        C   +    G +I  Y   T+  SE+ +   +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSSGYVPECSNSSQLVPGARIDGYM--TIESSETVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G   ++NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYQSGVLT-SCAG--ISLNHGVLLVGYN 296


>gi|378943046|gb|AFC76264.1| cathepsin L-like protease [Leishmania major]
 gi|378943056|gb|AFC76269.1| cathepsin L-like protease [Leishmania major]
 gi|394331745|gb|AFN27095.1| cysteine protease [Leishmania major]
          Length = 348

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP +  +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 517

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 92/322 (28%), Positives = 148/322 (45%), Gaps = 37/322 (11%)

Query: 4   VKNVLFIVALI-----ALCFLAIPVKVS--------KPNLEQKLELFSSFQQRYKKSY-S 49
           +K  LF++ L+      LC+  +P + S         P+ E  +ELF  +++  KK Y S
Sbjct: 5   LKTQLFLLFLVWGSWTFLCY-GLPSEYSILALEIDKFPSEEGVIELFQRWKEENKKIYRS 63

Query: 50  KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVL 109
             +  +RF+NF+++L  I E N  R SP     G+  F+D+S EEFK++           
Sbjct: 64  PDQEKLRFENFKRNLKYIAEKNSKRISPYGQSLGLNRFADMSNEEFKSKFTSKVKKPFSK 123

Query: 110 MSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVE 169
            +     DH                    P   DWR+ G++  V++Q  CG CWAFS+  
Sbjct: 124 RNGLSGKDHSCEDA---------------PYSLDWRKKGVVTAVKDQGYCGCCWAFSSTG 168

Query: 170 TAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
             E ++A+ +G L  LS  E++DC    N GC GG      +W+  N  + + E+ YP  
Sbjct: 169 AIEGINAIVSGDLISLSEPELVDCD-RTNDGCDGGHMDYAFEWVMHNGGI-DTETNYPYS 226

Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTW--QYYLGG 287
             D  C         + I  Y    +  S+ S+L       P+ A ++  +W  Q Y+GG
Sbjct: 227 GADGTCNVAKEETKVIGIDGYY--NVEQSDRSLLCATVKQ-PISAGIDGSSWDFQLYIGG 283

Query: 288 VIQYNCDGSLANINHAVQIVGY 309
           +   +C     +I+HA+ +VGY
Sbjct: 284 IYDGDCSSDPDDIDHAILVVGY 305


>gi|332374900|gb|AEE62591.1| unknown [Dendroctonus ponderosae]
          Length = 359

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 96/278 (34%), Positives = 137/278 (49%), Gaps = 24/278 (8%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSE 92
           E F +FQQ+Y K Y + SE  +R + F+++L  IEE NK  +Q+  S   G+ +FSDL+E
Sbjct: 22  ETFVTFQQKYGKVYQNDSELSVREEIFKENLAKIEEHNKQFQQNLVSYELGLNQFSDLTE 81

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
            EF+            L++     D     ++K +    I      PV  +W E G++  
Sbjct: 82  AEFQ-----------ALLTMSPLTDQLTKQMEKYNSEFDIKTA---PVSVNWAEKGVVTP 127

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+NQ  CG+CW F+T  T ES  ALK G+L  LS Q+++DC    N GC GG     L +
Sbjct: 128 VKNQGNCGSCWTFTTTGTIESRLALKTGSLVSLSEQQLLDC-NRVNAGCDGGVLSYALQY 186

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
             V    L  E EYP    +  C      P     K YT      SES ++  +A  GPV
Sbjct: 187 --VESAGLTTEDEYPYKAWNGTC-NSTHKPVAAYTKGYTL-IYTRSESDLMKAVA-EGPV 241

Query: 273 IAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
             A+NA   QYY  G+  +N     + +NH   +VGY+
Sbjct: 242 AVALNADLLQYYSKGI--FNPSACSSTVNHGGLVVGYE 277


>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 368

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 91/295 (30%), Positives = 149/295 (50%), Gaps = 36/295 (12%)

Query: 26  SKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGI 84
           ++PN+    + FS F++++ K Y S+ EHD RF  F+ +L       ++++   SAR+G+
Sbjct: 40  AEPNVLSSEDHFSLFKKKFGKVYASREEHDYRFSVFKSNL---RRARRHQKLDPSARHGV 96

Query: 85  TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKD 143
           T+FSDL+  EFK +HL       V        D +   +          +PT  +P + D
Sbjct: 97  TQFSDLTRSEFKRKHL------GVKGGFKLPKDANKAPI----------LPTENLPEEFD 140

Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AG 195
           WRE G +  V+NQ +CG+CW+FS     E  + L  G L  LS Q+++DC        AG
Sbjct: 141 WRERGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAG 200

Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
           + + GC+GG   +  ++  +    L  E +YP   KD A  +   S     + +++  ++
Sbjct: 201 SCDSGCNGGLMNSAFEYT-LKTGGLMREEDYPYTGKDGATCKLDKSKIVASVSNFSVISI 259

Query: 256 IPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
              E  I  ++  +GP+  A+NA   Q Y+GGV   Y C   +  +NH V +VGY
Sbjct: 260 --DEEQIAANLVKNGPLAVAINAAYMQTYIGGVSCPYIC---MRRLNHGVLLVGY 309


>gi|157864849|ref|XP_001681133.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124427|emb|CAJ02283.1| cathepsin L-like protease [Leishmania major strain Friedlin]
          Length = 348

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP +  +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|71084302|gb|AAZ23596.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 88/282 (31%), Positives = 141/282 (50%), Gaps = 29/282 (10%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L+ LS Q+++ C  + + GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSVVGNIESQWAVAGHRLTALSEQQLVSC-DDMDSGCGGGLMTQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IAT 268
            ++N  +   E  YP +        C   +    G +I  Y    +I S  +++   +A 
Sbjct: 202 RNMNGTMFT-EDSYPYVSTFGYVPECTNSSQLVPGARIDGY---VMIESNETVMAAWLAK 257

Query: 269 HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
            GP+   V+A ++  Y GGV+  +C G    +NH V +VGY+
Sbjct: 258 SGPISIGVDASSFMSYHGGVLT-SCAGK--QLNHGVLLVGYN 296


>gi|157864851|ref|XP_001681134.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124428|emb|CAJ02284.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|378943050|gb|AFC76266.1| cathepsin L-like protease [Leishmania major]
 gi|378943052|gb|AFC76267.1| cathepsin L-like protease [Leishmania major]
 gi|378943054|gb|AFC76268.1| cathepsin L-like protease [Leishmania major]
 gi|378943058|gb|AFC76270.1| cathepsin L-like protease [Leishmania major]
 gi|394331737|gb|AFN27091.1| cysteine protease [Leishmania major]
 gi|394331741|gb|AFN27093.1| cysteine protease [Leishmania major]
 gi|394331747|gb|AFN27096.1| cysteine protease [Leishmania major]
 gi|394331749|gb|AFN27097.1| cysteine protease [Leishmania major]
 gi|394331751|gb|AFN27098.1| cysteine protease [Leishmania major]
 gi|394331753|gb|AFN27099.1| cysteine protease [Leishmania major]
 gi|394331755|gb|AFN27100.1| cysteine protease [Leishmania major]
 gi|394331757|gb|AFN27101.1| cysteine protease [Leishmania major]
 gi|394331759|gb|AFN27102.1| cysteine protease [Leishmania major]
 gi|394331761|gb|AFN27103.1| cysteine protease [Leishmania major]
 gi|394331763|gb|AFN27104.1| cysteine protease [Leishmania major]
 gi|394331765|gb|AFN27105.1| cysteine protease [Leishmania major]
 gi|394331767|gb|AFN27106.1| cysteine protease [Leishmania major]
 gi|394331769|gb|AFN27107.1| cysteine protease [Leishmania major]
 gi|394331771|gb|AFN27108.1| cysteine protease [Leishmania major]
 gi|394331773|gb|AFN27109.1| cysteine protease [Leishmania major]
 gi|394331775|gb|AFN27110.1| cysteine protease [Leishmania major]
 gi|394331777|gb|AFN27111.1| cysteine protease [Leishmania major]
 gi|394331779|gb|AFN27112.1| cysteine protease [Leishmania major]
 gi|394331781|gb|AFN27113.1| cysteine protease [Leishmania major]
 gi|394331783|gb|AFN27114.1| cysteine protease [Leishmania major]
 gi|394331785|gb|AFN27115.1| cysteine protease [Leishmania major]
 gi|394331787|gb|AFN27116.1| cysteine protease [Leishmania major]
 gi|394331789|gb|AFN27117.1| cysteine protease [Leishmania major]
 gi|394331791|gb|AFN27118.1| cysteine protease [Leishmania major]
 gi|394331793|gb|AFN27119.1| cysteine protease [Leishmania major]
 gi|394331795|gb|AFN27120.1| cysteine protease [Leishmania major]
 gi|394331797|gb|AFN27121.1| cysteine protease [Leishmania major]
 gi|394331799|gb|AFN27122.1| cysteine protease [Leishmania major]
 gi|394331801|gb|AFN27123.1| cysteine protease [Leishmania major]
 gi|394331803|gb|AFN27124.1| cysteine protease [Leishmania major]
          Length = 348

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP +  +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|157864855|ref|XP_001681136.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124430|emb|CAJ02286.1| cathepsin L-like protease [Leishmania major strain Friedlin]
          Length = 348

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP +  +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMTAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|394331743|gb|AFN27094.1| cysteine protease [Leishmania major]
          Length = 348

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP +  +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|15824691|gb|AAL09443.1| cysteine protease [Leishmania donovani]
          Length = 443

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 141/281 (50%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           NQ  CG+CWAFS V   ES  A     L  LS Q+++ C    N GC+GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWARVGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201

Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
            +   ++  E  YP    +   A C   +    G +I  Y    +IPS  +++   +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWLAEN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPIAIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296


>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
 gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 94/286 (32%), Positives = 143/286 (50%), Gaps = 31/286 (10%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++ +ELF S+  ++ K+Y   E  + RF+ F  +L  I+E NK      S   G+ EF
Sbjct: 39  SMDKTIELFESWMSKHSKAYRSIEEKLHRFEIFLDNLKHIDETNKK---VSSYWLGLNEF 95

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP--TGIPVKKDWR 145
           +DLS EEFK+++L   V                   +KRS + G +      +P   DWR
Sbjct: 96  ADLSHEEFKSKYLGLRVE----------------FPRKRS-SRGFSYGDVEDLPESVDWR 138

Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
             G +  V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC  + N GC GG 
Sbjct: 139 TKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRSFNNGCYGGL 198

Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
                 ++  N   L  E +YP L+++  C R+      V I  Y  D     E S+L  
Sbjct: 199 MDYAFQYIMSNS-GLRKEEDYPYLMEEGRCIREKEQFEVVTISGYE-DVPANDEQSLLKA 256

Query: 266 IATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           + +H PV  A+ A +  +Q+Y GG+    C      ++H V  VGY
Sbjct: 257 L-SHQPVSVAIEASSRNFQFYKGGIFTGRCG---TQMDHGVTAVGY 298


>gi|56553473|gb|AAV97878.1| recombinant cysteine protease [Cloning vector pQ-CPB]
          Length = 335

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 91/280 (32%), Positives = 144/280 (51%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F+Q Y++ Y+   E   R  NF+++L+++ E   N  +P  AR+GIT+F DLSEEE
Sbjct: 29  LFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQAN--NPH-ARFGITKFFDLSEEE 85

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F TR+L  + +        K    ++  V       G  + T  P   DWRE G +  V+
Sbjct: 86  FATRYLSGATH---FAKAKKFASQYYRKV-------GADLSTA-PAAVDWREKGAVTPVK 134

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS +   ES   L   +L  LS QE++ C  + + GC+GG      DW+ 
Sbjct: 135 DQGMCGSCWAFSAIGNIESKWYLATHSLISLSEQELVSCD-DVDEGCNGGLMGQAFDWLL 193

Query: 215 VNK-VVLEPESEYPLLLKDAA---CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N+   +   + YP +  + +   C   +    G  I  +   T+  +E ++   +A +G
Sbjct: 194 NNRNGAVYTGASYPYVSGNGSVPECSESSDLVIGAYIDGHV--TIESNEDTMAAWLAANG 251

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  AV+A  +  Y GGV+  +CDG    +NH V +VGY+
Sbjct: 252 PIAIAVDASAFMSYTGGVLT-SCDGK--QLNHGVLLVGYN 288


>gi|332326589|gb|AEE42618.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 92/281 (32%), Positives = 143/281 (50%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y + Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYWRVYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H H K R+  + +      P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQH-HRKARADLSAV------PDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           +Q  CG+CWAFS V   ES  A+    L+ LS Q+++ C  + + GC+GG      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGNIESQWAVAGHRLTALSEQQLVSC-DDKDSGCNGGLMTQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  +L  E  YP +        C   +    G +I  Y   T+  SE+ +   +A  
Sbjct: 202 RNMNGTMLT-EDSYPYVSSTGDVPECTNSSQLVPGARIDGYV--TIESSETVMAAWLAKS 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYESGVLT-SCAGDA--LNHGVLLVGYN 296


>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
          Length = 315

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 87/279 (31%), Positives = 142/279 (50%), Gaps = 24/279 (8%)

Query: 34  LELFSSFQQRYKKSYSKSEHD-IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           +ELF ++   ++K+Y   E   +RF+ F+ +L  I+E NK  +S      G+ EF+DLS 
Sbjct: 48  IELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKS---YWLGLNEFADLSH 104

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEFK  +L   +   ++    +  +  +     R +         +P   DWR+ G + +
Sbjct: 105 EEFKKMYL--GLKTDIV---RRDEERSYAEFAYRDVEA-------VPKSVDWRKKGAVAE 152

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG      ++
Sbjct: 153 VKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEY 212

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           + V    L  E +YP  +++  C+ +      V I  +  D     E S+L  +A H P+
Sbjct: 213 I-VKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQ-DVPTNDEKSLLKALA-HQPL 269

Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A++A    +Q+Y GGV    C     +++H V  VGY
Sbjct: 270 SVAIDASGREFQFYSGGVFDGRCG---VDLDHGVAAVGY 305


>gi|161598418|gb|ABX74953.1| cysteine protease [Leishmania panamensis]
          Length = 441

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 89/280 (31%), Positives = 141/280 (50%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F+Q YK+ Y+  +E   R  NF+++L+++ E   N      AR+GIT+F DLSE E
Sbjct: 37  LFEEFKQTYKRVYATLAEEQQRVANFQRNLELMREHQANN---PHARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F TR+L  + +        K    H+  V       G  + T  P   DWR+ G +  V 
Sbjct: 94  FATRYLSGATH---FAKAKKFASQHYRKV-------GADLSTA-PAAVDWRQMGAVTPVN 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS +   ES   +   +L  LS QE++ C  + + GC+GG      DW+ 
Sbjct: 143 DQGACGSCWAFSAIGNIESQWYVTTHSLITLSEQELVSC-DDVDEGCNGGLMLQAFDWLL 201

Query: 215 VNK-VVLEPESEYPLLLKDAA---CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            NK   +   + YP +  + +   C   +    G  I  +   T+  +E ++   +A +G
Sbjct: 202 NNKNGAVYTGASYPYVSGNGSVPECSESSELVVGAYIDGHV--TIESNEDTMAAWLAVNG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  AV+A  +  Y GG++  +CDG    +NH V +VGY+
Sbjct: 260 PIAIAVDASAFMSYTGGILT-SCDGR--QLNHGVLLVGYN 296


>gi|394331739|gb|AFN27092.1| cysteine protease [Leishmania major]
          Length = 348

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHCRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP +  +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|332326583|gb|AEE42615.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 92/281 (32%), Positives = 143/281 (50%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y + Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYWRVYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H H K R+  + +      P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQH-HRKARADLSAV------PDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           +Q  CG+CWAFS V   ES  A+ +  L  LS Q+++ C  + + GC+GG      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGNIESQWAVADHRLXXLSEQQLVSC-DDKDSGCNGGLMTQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  +L  E  YP +        C   +    G +I  Y   T+  SE+ +   +A  
Sbjct: 202 RNMNGTMLT-EDSYPYVSSTGDVPECTNSSQLVPGARIDGYV--TIESSETVMAAWLAKS 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYESGVLT-SCAGDA--LNHGVLLVGYN 296


>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
 gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
           Precursor
 gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
 gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
 gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
          Length = 356

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 87/279 (31%), Positives = 142/279 (50%), Gaps = 24/279 (8%)

Query: 34  LELFSSFQQRYKKSYSKSEHD-IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           +ELF ++   ++K+Y   E   +RF+ F+ +L  I+E NK  +S      G+ EF+DLS 
Sbjct: 48  IELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKS---YWLGLNEFADLSH 104

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEFK  +L   +   ++    +  +  +     R +         +P   DWR+ G + +
Sbjct: 105 EEFKKMYL--GLKTDIV---RRDEERSYAEFAYRDVEA-------VPKSVDWRKKGAVAE 152

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG      ++
Sbjct: 153 VKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEY 212

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           + V    L  E +YP  +++  C+ +      V I  +  D     E S+L  +A H P+
Sbjct: 213 I-VKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQ-DVPTNDEKSLLKALA-HQPL 269

Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A++A    +Q+Y GGV    C     +++H V  VGY
Sbjct: 270 SVAIDASGREFQFYSGGVFDGRCG---VDLDHGVAAVGY 305


>gi|378943060|gb|AFC76271.1| cathepsin L-like protease [Leishmania major]
          Length = 348

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 139/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP    +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTVFT-EKSYPYTSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|157864853|ref|XP_001681135.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|157864857|ref|XP_001681137.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124429|emb|CAJ02285.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124431|emb|CAJ02287.1| cathepsin L-like protease [Leishmania major strain Friedlin]
          Length = 443

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP +  +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|157864845|ref|XP_001681131.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124425|emb|CAJ02281.1| cathepsin L-like protease [Leishmania major strain Friedlin]
          Length = 348

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP +  +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTV-STEKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMTAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
          Length = 322

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 98/312 (31%), Positives = 151/312 (48%), Gaps = 47/312 (15%)

Query: 8   LFIVA---LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
           LF V+   LI  C + +P        +   EL+  F++ Y K Y+  +   RF  F+ +L
Sbjct: 3   LFTVSCFVLIVSCAVVVP--------DSARELYEQFKRDYGKVYANEDDQKRFAIFKDNL 54

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
              ++L    Q   +ARYG+T+FSDL+ EEF  ++LR +VN               N   
Sbjct: 55  VRAQKLQLKDQG--TARYGVTQFSDLTPEEFAAKYLRAAVN---------------NDQV 97

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
           +R   TG+      P + DWRE G +  V NQ +CG+CWAFS     E    +K G L  
Sbjct: 98  ERVRPTGLK---AAPERMDWREKGAVTAVENQGSCGSCWAFSAAGNVEGQWFIKTGQLVS 154

Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPN 243
           LS Q+++DC      GC+GG    +  ++++  +  LE ES+YP +  +  C     + N
Sbjct: 155 LSKQQLVDCDRVAE-GCNGG--WPVSSYLEIKHMGGLESESDYPYVGAEQTC-----ALN 206

Query: 244 GVKIKSYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQ---YNCDGSLA 298
             K+ +   D ++    E      +A HGP+   +NA+  Q+Y  GV+      C  +  
Sbjct: 207 KEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSTLLNAVALQHYQSGVLNPTYEECPDT-- 264

Query: 299 NINHAVQIVGYD 310
            +NHAV  VGYD
Sbjct: 265 ELNHAVLTVGYD 276


>gi|1848231|gb|AAB48120.1| cathepsin L-like protease [Leishmania major]
          Length = 443

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP +  +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|4733887|gb|AAD02173.3| cysteine proteinase [Acanthamoeba culbertsoni]
          Length = 482

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 92/309 (29%), Positives = 144/309 (46%), Gaps = 34/309 (11%)

Query: 14  IALCFLAIPVKVSKPNLEQKLEL---FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
           +AL  LA    VS  +L ++ EL   F+S+ +R+ +SYS  E   R+  + +++D IEE 
Sbjct: 39  LALLVLACLTLVSCVSLRER-ELQGQFNSWMRRHARSYSNDEFLERYNTWRENMDFIEEF 97

Query: 71  NKNRQSPESARYGITEFSDLSEEEFKTRHL-------RHSVNKHVLMSHHKHHDHHHNHV 123
           N+   +   A   + E  DL+ EEF   ++          + + +        +HHH   
Sbjct: 98  NRGNHTFTVA---MNEHGDLTPEEFARLYMGQVSPASEQELQERIAAESAMEDEHHHTRA 154

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
                         IP   DWR  G +  V+NQ +C +CWAF      E +  +  G+L 
Sbjct: 155 S-------------IPANWDWRTKGAVTPVKNQGSCASCWAFVATGAVEGVRKIAGGSLV 201

Query: 184 LLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
            LS Q ++DCA G GN GCSGG+      WM  N   L  ++ YP + + + C+   +  
Sbjct: 202 SLSDQMLLDCAVGTGNQGCSGGNVEITYRWMISNNARLMTQASYPYIARQSTCRYVPS-- 259

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
            GV+           SES +L   A   PV  A++    ++ +Y GG   Y+   S  N+
Sbjct: 260 QGVQGIRNIMRVRAGSESDLLAKAAI-APVTVAIDGSKRSFMFYSGGYY-YDPTCSSTNL 317

Query: 301 NHAVQIVGY 309
           NHAV +VG+
Sbjct: 318 NHAVLVVGW 326


>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1454

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 94/281 (33%), Positives = 143/281 (50%), Gaps = 30/281 (10%)

Query: 36   LFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
            LF  F+ R+ ++Y  S EH++RF+ F+ +L  IE+LNK  Q   +A+YGIT F+D++  E
Sbjct: 1145 LFDKFKTRHNRTYQSSLEHEMRFRIFKNNLFKIEQLNKYEQG--TAKYGITHFADMTSAE 1202

Query: 95   FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
            ++ R         V+       +H  N + +  I   + +P       DWRE G + +V+
Sbjct: 1203 YRAR------TGLVVPREGDEVNHIRNPMAE--IDEHMELPDAF----DWRELGAVSEVK 1250

Query: 155  NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
            NQ  CG+CWAFS V   E +H +K   L   S QE++DC    +  C+GG     +D  D
Sbjct: 1251 NQGNCGSCWAFSVVGNIEGLHQVKTKKLEEYSEQELLDC-DTVDSACNGG----FMD--D 1303

Query: 215  VNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
              K +     LE ESEYP L K         +   V++K      L  +E++I   +  +
Sbjct: 1304 AYKAIEKIGGLELESEYPYLAKKQKTCHFNKTMAHVRVKGAV--DLPKNETAIAQFLVAN 1361

Query: 270  GPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
            GPV   +NA   Q+Y GG+   +    S  N++H V IVGY
Sbjct: 1362 GPVSIGLNANAMQFYRGGISHPWKPLCSKKNLDHGVLIVGY 1402


>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
 gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 92/281 (32%), Positives = 141/281 (50%), Gaps = 30/281 (10%)

Query: 34  LELFSSFQQRYKKSYSKSEHD-IRFKNFEKSLDIIEELNKNRQSPESARY--GITEFSDL 90
           ++LF S+  ++ K Y   E   +RF+ F+ +L  I+E NK     +   Y  G+ EFSDL
Sbjct: 30  IDLFESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNK-----KVVNYWLGLNEFSDL 84

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           S EEFK ++L   V+    MS  +      N+    SI          P   DWR+ G +
Sbjct: 85  SHEEFKNKYLGLKVD----MSERRECSQEFNYKDVMSI----------PKSVDWRKKGAV 130

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
             V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE++DC    N GC+GG      
Sbjct: 131 TDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLMDYAF 190

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            ++ ++   L  E +YP ++++  C+ +      V I  Y  D    SE S+L  +A   
Sbjct: 191 SYI-ISNGGLHKEVDYPYIMEEGTCEMRKEESEVVTISGYH-DVPQNSEESLLKALANQ- 247

Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           P+  A+ A    +Q+Y GGV   +C      ++H V  VGY
Sbjct: 248 PLSVAIEASGRDFQFYSGGVFDGHCG---TQLDHGVAAVGY 285


>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 356

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 92/286 (32%), Positives = 144/286 (50%), Gaps = 32/286 (11%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           E+ +ELF  +  +++K+Y+  E  + RF+ F+ +L  I+++N+   S      G+ EF+D
Sbjct: 43  ERLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINREVTS---YWLGLNEFAD 99

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ +EFK  +L          S              RS        + +P   DWR+ G 
Sbjct: 100 LTHDEFKAAYLGLDAAPARRGS-------------SRSFRYEDVSASDLPKSVDWRKKGA 146

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           + +V+NQ  CG+CWAFSTV   E ++A+  G L+ LS QE+IDC+ +GN GC+GG    L
Sbjct: 147 VTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGG----L 202

Query: 210 LDWM---DVNKVVLEPESEYPLLLKDAAC-KRKATSPNGVKIKSYTCDTLIPSESSILTD 265
           +D+      +   L  E  YP L+++ +C   K      V I  Y  D     E +++  
Sbjct: 203 MDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKAESEAVTISGYE-DVPANDEQALIKA 261

Query: 266 IATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A H PV  A+ A    +Q+Y GGV    C    A ++H V  VGY
Sbjct: 262 LA-HQPVSVAIEASGRHFQFYSGGVFDGPCG---AQLDHGVAAVGY 303


>gi|157864847|ref|XP_001681132.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124426|emb|CAJ02282.1| cathepsin L-like protease [Leishmania major strain Friedlin]
          Length = 443

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAVKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP +  +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|74834619|sp|O97397.1|CATLL_PHACE RecName: Full=Cathepsin L-like proteinase; Flags: Precursor
 gi|4210800|emb|CAA76927.1| thiol protease [Phaedon cochleariae]
          Length = 324

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 95/310 (30%), Positives = 158/310 (50%), Gaps = 33/310 (10%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN 71
           +IAL  L + +     N     EL++ F++ + ++Y S  E  +RF  F+ +L  I E N
Sbjct: 4   IIALAALIVVI-----NAASDQELWADFKKTHARTYKSLREEKLRFNIFQDTLRQIAEHN 58

Query: 72  KNRQSPESARY-GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
              ++ ES  Y  I +FSD+++EEF+   +++  ++  L             ++   +T 
Sbjct: 59  VKYENGESTYYLAINKFSDITDEEFRDMLMKNEASRPNL-----------EGLEVADLTV 107

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
           G       P   DWR  G++  VRNQ  CG+CWA ST    ES  A+K+G+   LS Q++
Sbjct: 108 GAA-----PESIDWRSKGVVLPVRNQGECGSCWALSTAAAIESQSAIKSGSKVPLSPQQL 162

Query: 191 IDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           +DC+ + GN GC+GG      +++  N   LE +++YP   K+  CK    S + V++  
Sbjct: 163 VDCSTSYGNHGCNGGFAVNGFEYVKDNG--LESDADYPYSGKEDKCKANDKSRSVVELTG 220

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGSLANINHAVQIVG 308
           Y    +  SE+S+   + T GP+ A V     + Y GG+    +C G   N++H V +VG
Sbjct: 221 YK--KVTASETSLKEAVGTIGPISAVVFGKPMKSYGGGIFDDSSCLGD--NLHHGVNVVG 276

Query: 309 Y--DNYSRTW 316
           Y  +N  + W
Sbjct: 277 YGIENGQKYW 286


>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
 gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 86/282 (30%), Positives = 142/282 (50%), Gaps = 19/282 (6%)

Query: 31  EQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           E  +E+F  ++ R++K Y   +E + R++NF+++L  I E    + +      G+ +F+D
Sbjct: 44  ESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKFAD 103

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           LS EEFK  +L   V K + +      D      ++R++ T        P   DWR+ G+
Sbjct: 104 LSNEEFKELYL-SKVKKPINIKRSTARDW-----RQRNLQT-----CDAPSSLDWRKKGV 152

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  V++Q  CG+CW+FST    E ++A+  G L  LS QE++DC    N GC GG     
Sbjct: 153 VTAVKDQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDC-DTTNYGCEGGYMDYA 211

Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            +W+ +N   ++ E+ YP    D  C         V I  YT   +  ++S++L      
Sbjct: 212 FEWV-INNGGIDTEANYPYTGVDGTCNTTKEEIKVVSIDGYT--DVDETDSALLC-ATVQ 267

Query: 270 GPVIAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            P+   ++  AL +Q Y GG+   +C     +I+HAV IVGY
Sbjct: 268 QPISVGMDGSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGY 309


>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
           Full=Turgor-responsive protein 15A; Flags: Precursor
 gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
          Length = 363

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 91/283 (32%), Positives = 147/283 (51%), Gaps = 34/283 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+SF+ ++ KSY +K EHD RF  F+ +L I  +L++NR    +A +GIT+FSDL+  EF
Sbjct: 48  FTSFKSKFSKSYATKEEHDYRFGVFKSNL-IKAKLHQNRDP--TAEHGITKFSDLTASEF 104

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           + + L   + K + +  H                  I   T +P   DWRE G +  V++
Sbjct: 105 RRQFL--GLKKRLRLPAHAQK-------------APILPTTNLPEDFDWREKGAVTPVKD 149

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
           Q +CG+CWAFST    E  H L  G L  LS Q+++DC        AG+ + GC+GG   
Sbjct: 150 QGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMN 209

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
              +++  +  V++ E +Y    +D +CK    S     + +++  TL   E  I  ++ 
Sbjct: 210 NAFEYLLESGGVVQ-EKDYAYTGRDGSCKFD-KSKVVASVSNFSVVTL--DEDQIAANLV 265

Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            +GP+  A+NA   Q Y+ GV   Y C  + + ++H V +VG+
Sbjct: 266 KNGPLAVAINAAWMQTYMSGVSCPYVC--AKSRLDHGVLLVGF 306


>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
 gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
          Length = 350

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 89/304 (29%), Positives = 145/304 (47%), Gaps = 27/304 (8%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDII 67
           F + L+  C  A P           +E    +  +Y+++Y+ S E + R K F+++L+ I
Sbjct: 7   FCIILLWAC--AYPTMSRTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYI 64

Query: 68  EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
           E  N N    +S + G+  +SDL+ EEF   H    V+  +  S            K RS
Sbjct: 65  E--NFNNVGNKSYKLGLNRYSDLTSEEFIASHTGFKVSDQLSDS------------KMRS 110

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
           +     +   +P   DWRE G++  V+NQ+ CG CWAF+ V   E +  +KNG L  LS 
Sbjct: 111 VAIPFNLNDDVPTNFDWREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSE 170

Query: 188 QEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
           Q+++DC    + GC GGDF    D +  ++ +++ E +YP    D    +    P   +I
Sbjct: 171 QQLVDCDRQSS-GCGGGDFVLAFDSIIKSRGIVK-EDDYPYKANDVQTCQLGQIPGAAQI 228

Query: 248 KSYTCDTLIPS--ESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQ 305
             Y     +P+  E  +L  +      +A   +  + +Y+GGV + +C   L   NHAV 
Sbjct: 229 NGY---FKVPANDEQQLLRAVLQQPVSVAISTSYDFHHYMGGVYEGSCGPKL---NHAVT 282

Query: 306 IVGY 309
           I+GY
Sbjct: 283 IIGY 286


>gi|449668436|ref|XP_002162416.2| PREDICTED: cathepsin O-like [Hydra magnipapillata]
          Length = 365

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 75/236 (31%), Positives = 128/236 (54%), Gaps = 20/236 (8%)

Query: 79  SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV---KKRSITTGITIP 135
           +A+YGI ++SD S EEFK   L  ++      S  K + ++ N +   +K++        
Sbjct: 101 TAKYGINQYSDWSLEEFKNYRLTSNLGMFSDFSTPKIYLNNGNEICSIQKKAY------- 153

Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
              P  KDW   G+  K+++Q+ CG+CWAF   E  E+  A+    +  LS QE+I C+ 
Sbjct: 154 ---PSSKDW--IGMSTKIKDQKNCGSCWAFVASEQVETYLAIAGKPIVELSPQELISCS- 207

Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT-CDT 254
             +MGC GG+ C  L W+      L+ E EYP   + + C     + +  +I +   C +
Sbjct: 208 -PSMGCHGGNTCTALSWLKQTHSCLKTEKEYPYEAQVSKCLYSNCTTSDARIYAVCGCQS 266

Query: 255 LIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
            + +E  ++  ++  GP+   V+A++WQ Y+GG+IQ++C     +INHAVQ++GY+
Sbjct: 267 FVGNEEYMIRVLSQKGPLSVNVDAVSWQDYIGGIIQHHCTNK--DINHAVQLIGYN 320


>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 355

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 88/282 (31%), Positives = 135/282 (47%), Gaps = 25/282 (8%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           E+ LELF S+   + K Y   E  + RF+ F ++L  I++ N    S      G+ EF+D
Sbjct: 45  EKLLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEINS---YWLGLNEFAD 101

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ EEFK R+L       +             + + R IT        +P   DWR+ G 
Sbjct: 102 LTHEEFKGRYL------GLAKPQFSRKRQPSANFRYRDITD-------LPKSVDWRKKGA 148

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  V++Q  CG+CWAFSTV   E ++ +  G LS LS QE+IDC    N GC+GG     
Sbjct: 149 VAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYA 208

Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
             ++ ++   L  E +YP L+++  C+ +      V I  Y  + +  ++   L     H
Sbjct: 209 FQYI-ISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGY--EDVPENDDESLVKALAH 265

Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            PV  A+ A    +Q+Y GGV    C     +++H V  VGY
Sbjct: 266 QPVSVAIEASGRDFQFYKGGVFNGQCG---TDLDHGVAAVGY 304


>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
          Length = 1165

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 91/283 (32%), Positives = 141/283 (49%), Gaps = 35/283 (12%)

Query: 36   LFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
            LF  F+ ++ + Y  + EH++RF+ F+ +L  IE+LNK  Q   +A+YGIT F+D++  E
Sbjct: 857  LFEKFKLKHSREYQSTLEHEMRFRIFKNNLFKIEQLNKYEQG--TAKYGITHFADMTSAE 914

Query: 95   FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
            ++ R           +   +  D +H    K  I   + +P       DWRE G +  V+
Sbjct: 915  YRQR---------TGLVIPRDEDRNHVGNPKAEIDENMELPESF----DWRELGAVSPVK 961

Query: 155  NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
            NQ  CG+CWAFS V   E +H +K   L   S QE++DC    +  C GG       +MD
Sbjct: 962  NQGNCGSCWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCDAV-DSACQGG-------YMD 1013

Query: 215  -----VNKV-VLEPESEYPLLL-KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
                 + K+  LE ESEYP L  K   C   +T    V ++      L  +E+++   + 
Sbjct: 1014 DAYKAIEKIGGLELESEYPYLAKKQKTCHFNSTE---VHVRVKGAVDLPKNETAMAQYLV 1070

Query: 268  THGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
             +GP+   +NA   Q+Y GG+   +    S  N++H V IVGY
Sbjct: 1071 ANGPISIGLNANAMQFYRGGISHPWKPLCSKKNLDHGVLIVGY 1113


>gi|394331735|gb|AFN27090.1| cysteine protease [Leishmania major]
          Length = 348

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 89/281 (31%), Positives = 140/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWR+ G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP +  +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|45822201|emb|CAE47497.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
          Length = 315

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 95/298 (31%), Positives = 143/298 (47%), Gaps = 26/298 (8%)

Query: 14  IALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKN 73
           + L  LA  + V+        E ++SF+  + KSY+  E  +RF  F+ +L  IEE N  
Sbjct: 1   MKLFILAAALIVATSANLGAFEKWTSFKATHNKSYNVIEDKLRFAVFQDNLKKIEEHNAK 60

Query: 74  RQSPESARY-GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGI 132
            +S E   Y  + +F+D S  EF+    R   NK                 K+  I   +
Sbjct: 61  YESGEETYYLAVNKFADWSSAEFQAMLARQMANKP----------------KQSFIAKHV 104

Query: 133 TIPTGIPVKK-DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
             P    V++ DWR++ ++G V++Q  CG+CWAFST  + E   A+       LS QE++
Sbjct: 105 ADPNVQAVEEVDWRDSAVLG-VKDQGQCGSCWAFSTTGSLEGQLAIHKNQRVPLSEQELV 163

Query: 192 DCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
           DC  + N GC+GG      ++  V +  L  ES+Y    +D  CK     P    I  Y 
Sbjct: 164 DCDTSRNAGCNGGLMTDAFNY--VKRHGLSSESQYAYTGRDDRCKNVENKPLS-SISGYV 220

Query: 252 CDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
              L  +E ++ + +A+ GPV  AV+A TWQ Y GG+  +N      N+NH V  VGY
Sbjct: 221 --ELETTEDALASAVASVGPVSIAVDADTWQLYGGGL--FNNKNCRTNLNHGVLAVGY 274


>gi|241062152|gb|ACS66748.1| cysteine protease [Leishmania guyanensis]
          Length = 441

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 88/280 (31%), Positives = 141/280 (50%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F+Q YK+ Y+  +E   R  NF+++L+++ E   N      AR+GIT+F DLSE E
Sbjct: 37  LFEEFKQTYKRVYATLAEEQQRVANFQRNLELMREHQANN---PHARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F TR+L  + +        K    H+  V       G  + T  P   DWR+ G +  V+
Sbjct: 94  FATRYLSGATH---FAKAKKFASQHYRKV-------GADLSTA-PAAVDWRQMGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWA S +   ES   +   +L  LS QE++ C  + + GC+GG      DW+ 
Sbjct: 143 DQGACGSCWALSAIGNIESQWYVTTHSLITLSEQELVSC-DDVDEGCNGGLMLQAFDWLL 201

Query: 215 VNK-VVLEPESEYPLLLKDAA---CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            NK   +   + YP +  + +   C   +    G  I  +   T+  +E ++   +A +G
Sbjct: 202 NNKNGAVYTGASYPYVSGNGSVPECSESSELVVGAYIDGHV--TIESNEDTMAAWLAVNG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  AV+A  +  Y GG++  +CDG    +NH V +VGY+
Sbjct: 260 PIAIAVDASAFMSYTGGILT-SCDGR--QLNHGVLLVGYN 296


>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 88/284 (30%), Positives = 145/284 (51%), Gaps = 25/284 (8%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++ +ELF S+  R+ K Y   E  + RF+ F+ +L  I++ NK      +   G+ EF
Sbjct: 39  SMDKLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNK---IVSNYWLGLNEF 95

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DLS +EFK ++L   V+    +S  +   +      +            +P   DWR+ 
Sbjct: 96  ADLSHQEFKNKYLGLKVD----LSQRRESSNEEEFTYR---------DVDLPKSVDWRKK 142

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG   
Sbjct: 143 GAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMD 202

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
               ++  N   L  E +YP +++++ C+ K      V I  Y  D    +E S+L  +A
Sbjct: 203 YAFSFIGQNG-GLHKEEDYPYIMEESTCEMKKEETQVVTINGYH-DVPQNNEQSLLKALA 260

Query: 268 THGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
              P+  A+ A +  +Q+Y GGV   +C    ++++H V  VGY
Sbjct: 261 NQ-PLSVAIEASSRDFQFYSGGVFDGHCG---SDLDHGVSAVGY 300


>gi|394331820|gb|AFN27129.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 88/281 (31%), Positives = 141/281 (50%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWR+ G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           +Q  CG+CWAFS V + ES  AL    L+ LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSEQQLVSCDDKDN-GCRGGLMLQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  +   E  YP +        C   +    G +I  Y   T+  SE+ +   +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSTGYVPECSNSSQLVPGARIDGYM--TIESSETVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +V Y+
Sbjct: 259 GPISIAVDASSFMSYQSGVLT-SCAG--MPLNHGVLLVWYN 296


>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
 gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
 gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
 gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
          Length = 358

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 90/279 (32%), Positives = 142/279 (50%), Gaps = 22/279 (7%)

Query: 34  LELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           +ELF  +  +Y+K+Y+  E  +R F+ F+ +L+ I+++NK   S      G+ EF+DL+ 
Sbjct: 48  IELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTS---YWLGLNEFADLTH 104

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           +EFK  +L   +      S+ KH+        K S          +P + DWR+   + +
Sbjct: 105 DEFKATYL--GLTPPPTRSNSKHYSSEEFRYGKMSNGE-------VPKEMDWRKKNAVTE 155

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+NQ  CG+CWAFSTV   E ++A+  G L+ LS QE+IDC+ +GN GC+GG       +
Sbjct: 156 VKNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSY 215

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           +  +   L  E  YP  +++  C     +   V I  Y  D     E +++  +A H PV
Sbjct: 216 I-ASTGGLRTEEAYPYAMEEGDCDEGKGAAV-VTISGYE-DVPANDEQALVKALA-HQPV 271

Query: 273 IAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A+ A    +Q+Y GGV    C   L   +H V  VGY
Sbjct: 272 SVAIEASGRHFQFYSGGVFDGPCGEQL---DHGVTAVGY 307


>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
          Length = 324

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 88/292 (30%), Positives = 145/292 (49%), Gaps = 25/292 (8%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESA 80
           V VS   L++    F SF+ ++ K+Y +++E   RF  F ++L  IE  N   +Q   S 
Sbjct: 12  VAVSATLLKEDGVHFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSY 71

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
             GI +F+D++  EFK         K  +++            K   +  G+++P  I  
Sbjct: 72  TQGINKFADMTRAEFKAMLATQVKTKPSIVA-----------TKTFQLADGVSVPESI-- 118

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
             DWR   ++  +++Q  CG+CW+F+ V + E  +AL  G L+  S Q+++DC  + N G
Sbjct: 119 --DWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYG 176

Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
           C GG       ++  N   LE ES+YP    D +C    +S    K+ SY   ++  +E 
Sbjct: 177 CDGGYLDDTFPYIQTNG--LELESDYPYTGYDGSCSYD-SSKVVTKVSSYV--SVPANEQ 231

Query: 261 SILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLANINHAVQIVGYDN 311
           ++L  + T GPV  A+NA   Q+Y  G+I    CD     ++H V  VGY++
Sbjct: 232 ALLEAVGTAGPVAIAINADDLQFYFSGIIDDKYCDPEW--LDHGVLAVGYNS 281


>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
          Length = 344

 Score =  130 bits (328), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 91/318 (28%), Positives = 153/318 (48%), Gaps = 30/318 (9%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHD---IRFKNFEKSL 64
           LF+  +++ CF      +S+P L++       +  ++ + Y+  + D    RF  F++++
Sbjct: 8   LFVALVLSFCFSIQLAGLSRPLLDEDSMRHEEWMSQHGRVYADEQEDHKNKRFNVFKENV 67

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
           + IEE N  +    + +  I +F+DL+ EEF+  +  +     +++S         + + 
Sbjct: 68  ERIEEFNDGK----TFKLAINQFADLTNEEFRASY--NGFKGPMVLS---------SQIT 112

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
           K +      + + +PV  DWR+ G +  V+NQ  CG CWAFS V   E +  +  G L  
Sbjct: 113 KPTPFRYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLIS 172

Query: 185 LSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           LS QE++DC   G + GC GG      +++ +N   L  ES YP   +D  C    T+P 
Sbjct: 173 LSEQELVDCDTKGIDHGCEGGLMDTAFEFI-INNGGLTTESNYPYKGEDGTCNFNKTNPI 231

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
            V I  Y  D     E +++  +A H PV  A+ A    +Q+Y  GV    C   L   +
Sbjct: 232 AVSITGYE-DVPANDEQALMKAVA-HQPVSVAIEAGGSDFQFYSSGVFTGECGTEL---D 286

Query: 302 HAVQIVGY---DNYSRTW 316
           HAV  VGY   ++ S+ W
Sbjct: 287 HAVTAVGYGESEDGSKYW 304


>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
          Length = 707

 Score =  130 bits (328), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 90/283 (31%), Positives = 139/283 (49%), Gaps = 26/283 (9%)

Query: 30  LEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
           +++ +  F S+  ++ K Y   E  + RF+ F ++L+ I+E NK      S   G+ EF+
Sbjct: 397 IDKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKE---VSSYWLGLNEFA 453

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
           DLS EEFK+++L        L +       +    + R +         +P   DWR+ G
Sbjct: 454 DLSHEEFKSKYLG-------LRAEFPRSRDYSGEFRYRDVAD-------LPESVDWRKKG 499

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
            +  V+NQ  CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG    
Sbjct: 500 AVTHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDY 559

Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
              ++  N   L  E +YP L+++  C+ +    + V I  Y  D     E S+L  +A 
Sbjct: 560 AFAFIASNG-GLHKEDDYPYLMEEGTCEEQKEDVDIVTISGYE-DVPEKDEESLLKALA- 616

Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           H P+  A+ A    +Q+Y GGV    C   L   +H V  VGY
Sbjct: 617 HQPLSVAIEASGRDFQFYSGGVFNGPCGTEL---DHGVAAVGY 656


>gi|157864843|ref|XP_001681130.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124424|emb|CAJ02280.1| cathepsin L-like protease [Leishmania major strain Friedlin]
          Length = 348

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 138/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP          C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTVFT-EKSYPYTSTFGYVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
          Length = 325

 Score =  130 bits (327), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 97/305 (31%), Positives = 151/305 (49%), Gaps = 38/305 (12%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
           +A +  C  A+    + P  +   EL+  F++ Y K Y+  +   RF  F+ +L   ++L
Sbjct: 9   LAFLVGCAFAVS---TVPVPDNARELYEQFKRDYGKVYANDDDQKRFAIFKDNLVRAQKL 65

Query: 71  N-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
             K+R +   ARYG+T+FSDL+ EEF  ++L   +N  V                +R   
Sbjct: 66  QLKDRGT---ARYGVTQFSDLTPEEFAAKYLSRPMNDQV----------------ERVRP 106

Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
           TG+      P + DWRE G +G V NQ +CG+CWAFS     E    LK G L  LS Q+
Sbjct: 107 TGLK---AAPERMDWREWGAVGPVENQGSCGSCWAFSVAGNVEGQWFLKTGQLVSLSKQQ 163

Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIK 248
           ++DC    + GC GG       +M++ ++  LE +S+YP +     C       N  K+ 
Sbjct: 164 LVDCDVM-DYGCGGG--WPTNAYMEIMRMGGLELQSDYPYVGVQQQCYL-----NKEKLL 215

Query: 249 SYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG-SLANINHAVQ 305
           +   D ++    E      +A HGP+ +A+NA   Q+Y  G+   + +  S A++NHAV 
Sbjct: 216 AKIDDLIVLGAYEEEHAAYLAEHGPLSSALNAGYLQFYQSGISHPSYEECSPASLNHAVL 275

Query: 306 IVGYD 310
            VGYD
Sbjct: 276 TVGYD 280


>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
          Length = 324

 Score =  130 bits (327), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 98/300 (32%), Positives = 150/300 (50%), Gaps = 28/300 (9%)

Query: 14  IALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNK 72
           +A+ F  + V +S    E+    F +F+  + K+Y +++E   RF  F  ++  IE  N 
Sbjct: 3   VAIFFSLLVVAISASISEELGAKFQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNA 62

Query: 73  -NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
              Q   S + GI +F+D+S+EEFKT     +  K  L           ++VK     TG
Sbjct: 63  LYEQGKVSYKKGINKFTDMSQEEFKTMLTLSASRKPTL--------ETTSYVK-----TG 109

Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
           + IP+ +    DWR+ G +  V++Q  CG+CWAFS   + E  +A K+G L  LS Q++I
Sbjct: 110 VEIPSSV----DWRKEGRVTGVKDQGDCGSCWAFSITGSTEGAYARKSGKLVSLSEQQLI 165

Query: 192 DCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
           DC  + + GC GG       +  V K  L+ E  Y    +D ACK    S    K+  YT
Sbjct: 166 DCCTDTSAGCDGGSLDDNFKY--VMKDGLQSEESYTYKGEDGACKYNVASVV-TKVSKYT 222

Query: 252 CDTLIPS--ESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               IP+  E ++L  +AT GPV   ++A     Y  G+ + + D S A +NHA+  VGY
Sbjct: 223 S---IPAEDEDALLEAVATVGPVSVGMDASYLSSYDSGIYE-DQDCSPAGLNHAILAVGY 278


>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 381

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 97/307 (31%), Positives = 154/307 (50%), Gaps = 48/307 (15%)

Query: 24  KVSKPNLEQKLEL-----FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSP 77
           +V   + E +LEL     F+SF +R+ KSY  + EH+ R   F  +L       ++++  
Sbjct: 40  QVVGGDAENELELNAEAHFASFVRRFGKSYRDADEHEHRLSVFRANL---RRARRHQRLD 96

Query: 78  ESARYGITEFSDLSEEEFKTRHL-----RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGI 132
            SA +GIT+FSDL+ +EF+ R L     R S  K +  S H                   
Sbjct: 97  PSAVHGITKFSDLTPDEFRERFLGLRKSRRSFLKGISGSAHD----------------AP 140

Query: 133 TIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
            +PT G+P + DWRE G +G V++Q +CG+CW+FST    E  + L  G L +LS Q+++
Sbjct: 141 ALPTDGLPTEFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGANYLATGKLEVLSEQQLV 200

Query: 192 DC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           DC            + GC+GG       ++      LE E +YP   +++ACK    S  
Sbjct: 201 DCDHECDPSEPRACDAGCNGGLMTTAFSYL-AKAGGLETEKDYPYTGRNSACKFD-KSKI 258

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINH 302
             ++K+++  T+   E  I  ++  HGP+   +NA+  Q Y+GGV   Y C     +++H
Sbjct: 259 AAQVKNFS--TVAIDEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCPYICG---RHLDH 313

Query: 303 AVQIVGY 309
            V +VGY
Sbjct: 314 -VFLVGY 319


>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
 gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
          Length = 349

 Score =  130 bits (326), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 156/313 (49%), Gaps = 34/313 (10%)

Query: 8   LFIVALIALCFLAIPVKVSK-PNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLD 65
           LF+ +++A  F  +        ++++ +ELF S+   + K+Y+  E  + RF+ F+++L 
Sbjct: 17  LFVCSVLAHDFSIVGYSPEHLTSVDKLVELFESWISGHGKAYNSLEEKLHRFEVFKENLK 76

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
            I++ NK   S      G+ EF+DLS EEFK++ L                 +     KK
Sbjct: 77  HIDQRNKEVTS---YWLGLNEFADLSHEEFKSKFLGL---------------YPEFPRKK 118

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
            S          +P   DWR+ G +  V+NQ +CG+CWAFSTV   E ++ +  G L+ L
Sbjct: 119 SSEDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVAGNLTSL 178

Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSP 242
           S Q++IDC  + N GC+GG    L+D+     VN   L  E +YP L+++  C  K    
Sbjct: 179 SEQQLIDCDTSFNNGCNGG----LMDYAFEFIVNNGGLHKEEDYPYLMEEGTCDEKREEM 234

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
             V I  Y  D     E S+L  +A H P+  A++A    +Q+Y GGV    C     ++
Sbjct: 235 EVVTISGYH-DVPRNDEQSLLKALA-HQPLSVAIDASGRDFQFYSGGVFSGPCG---TDL 289

Query: 301 NHAVQIVGYDNYS 313
           +H V  VGY + S
Sbjct: 290 DHGVAAVGYGSSS 302


>gi|378943048|gb|AFC76265.1| cathepsin L-like protease [Leishmania major]
          Length = 348

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 89/281 (31%), Positives = 139/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE  
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQRRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAV 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP +  +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTVFT-EKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|440792185|gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
          Length = 331

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 92/278 (33%), Positives = 144/278 (51%), Gaps = 24/278 (8%)

Query: 35  ELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           E F++F QRY KSY+ +E  + RF  F ++L     LN   +     ++GIT+F+D+S+E
Sbjct: 32  EQFNAFVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEG--KTQFGITKFADMSQE 89

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWR-EAGIIGK 152
           EF++R         VLMS+         +   +    G T P+      DWR + G++  
Sbjct: 90  EFQSR---------VLMSNPPPPPTEKPYRGPK--FEGFTAPSTF----DWRNKPGVVTP 134

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V +Q  CG+CWAFS  E  ES  AL    L+ LS+Q+++DC+   + GC GG      D+
Sbjct: 135 VYDQGQCGSCWAFSATENIESQWALAGHKLTGLSMQQIVDCSWWDD-GCGGGFPSYAYDY 193

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           + ++   L+  + YP      +C  K  S    KI S+T  T   +E  +   +A HGP+
Sbjct: 194 V-IDAPGLDALANYPYTAVGGSCAFK-ESQVVAKISSWTYTTTDSNEHQMANYLAQHGPI 251

Query: 273 IAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
              V+A +W  Y GGV + +  G+  +I+H V  VGY+
Sbjct: 252 SVCVDAESWPSYTGGVYRASACGT--SIDHCVLAVGYN 287


>gi|394331805|gb|AFN27125.1| cysteine protease [Leishmania major]
          Length = 348

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 139/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H       ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKACADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAFS V   ES  A+    L  LS Q+++ C    N GC GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDN-GCGGGLMLQAFEWVL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  V   E  YP +  +     C   +    G +I  Y   ++  SE  +   +A +
Sbjct: 202 RNMNGTV-STEKSYPYVSGNGDVPECSNSSELAPGARIDGYV--SMESSERVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYHSGVLT-SCIGE--QLNHGVLLVGYN 296


>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 474

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 88/280 (31%), Positives = 135/280 (48%), Gaps = 26/280 (9%)

Query: 35  ELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           E+F S+  ++ KSY+   E D RFK F  +L  I+E  KN     S + G+  F+D++ E
Sbjct: 48  EMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDE--KNSLENRSYKLGLNRFADITNE 105

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           E++T +L                D   N VK +S          +P   DWRE G +  V
Sbjct: 106 EYRTGYL------------GAKRDASRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGV 153

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           ++Q +CG+CWAFST+   E ++ L  G L  LS QE++DC    N GC+GGD      ++
Sbjct: 154 KDQGSCGSCWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFI 213

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHGP 271
            +    ++ E +YP   KD  C   +   N  K+ S      +P  +E S+   +A   P
Sbjct: 214 -IKNGGIDSEEDYPYTGKDGKC--DSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQ-P 269

Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           V  A+ A    +Q Y  G+   +C     +++H V  VGY
Sbjct: 270 VSVAIEAGGYDFQLYSSGIFTGSCG---TDLDHGVAAVGY 306


>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
 gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
          Length = 415

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 95/295 (32%), Positives = 143/295 (48%), Gaps = 32/295 (10%)

Query: 22  PVKVSKPNLEQKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESA 80
           PV + +   E     F SF+  Y KSY+  E    R+  F+ +L  I   N   Q   S 
Sbjct: 104 PVNIWEWKEEHFQNAFGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHN---QQGYSY 160

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
              +  F DLS EEF+ ++L           ++K  +   N++   +    ++ P+ +P 
Sbjct: 161 SLKMNHFGDLSREEFRRKYL----------GYNKSRNLKSNNLGVATELLKVS-PSDVPS 209

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNM 199
             DWRE G +  V++Q+ CG+CWAFS     E  H  K G L  LS QE++DC+   GN 
Sbjct: 210 AVDWREKGCVTPVKDQRDCGSCWAFSATGALEGAHCAKTGELLSLSEQELVDCSLAEGNQ 269

Query: 200 GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR---KATSPNGVKIKSYTCDTLI 256
           GCSGG+      ++ V+   L  E  YP L +D  CKR   K  + +G K      D   
Sbjct: 270 GCSGGEMNDAFQYV-VDSGGLCSEEGYPYLARDGECKRACKKVVTISGFK------DVPR 322

Query: 257 PSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            SE+++   +A H PV  A+ A  L +Q+Y  GV   +C     +++H V +VGY
Sbjct: 323 KSETAMKAALA-HSPVSIAIEADQLPFQFYHEGVFDASCG---TDLDHGVLLVGY 373


>gi|7770062|ref|NP_036137.1| cathepsin J precursor [Mus musculus]
 gi|6467374|gb|AAF13142.1|AF136272_1 cathepsin J precursor [Mus musculus]
 gi|15418834|gb|AAK58455.1| cathepsin J [Mus musculus]
 gi|148709364|gb|EDL41310.1| cathepsin J, isoform CRA_b [Mus musculus]
          Length = 333

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 99/304 (32%), Positives = 154/304 (50%), Gaps = 31/304 (10%)

Query: 11  VALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           V L+ LCF +A   +   P L+ +   +  ++ +Y KSYS  E  +R   +E+++ +I+ 
Sbjct: 5   VLLLILCFGVASGAQAHDPKLDAE---WKDWKTKYAKSYS-PEEALRRAVWEENMRMIKL 60

Query: 70  LNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            NK N     +    + +F D + EEF     R S++ ++ +       H  NHV     
Sbjct: 61  HNKENSLGKNNFTMKMNKFGDQTSEEF-----RKSID-NIPIPAAMTDPHAQNHVS---- 110

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                   G+P  KDWRE G +  VRNQ  CG+CWAF+     E     K G L+ LSVQ
Sbjct: 111 -------IGLPDYKDWREEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQ 163

Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
            ++DC+   GN GC  G      +++  NK  LE E+ YP   KD  C+ ++ + +   I
Sbjct: 164 NLLDCSKTVGNKGCQSGTAHQAFEYVLKNK-GLEAEATYPYEGKDGPCRYRSENAS-ANI 221

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQ 305
             Y    L P+E  +   +A+ GPV AA++A   ++++Y GG I Y  + S   +NHAV 
Sbjct: 222 TDYV--NLPPNELYLWVAVASIGPVSAAIDASHDSFRFYNGG-IYYEPNCSSYFVNHAVL 278

Query: 306 IVGY 309
           +VGY
Sbjct: 279 VVGY 282


>gi|156938919|gb|ABU97481.1| cathepsin L-like cysteine protease [Tyrophagus putrescentiae]
          Length = 333

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 100/314 (31%), Positives = 154/314 (49%), Gaps = 42/314 (13%)

Query: 9   FIVALIALCFLAIPVK-VSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDI 66
           F+ A++AL F     + +S+  LE    LF   + R+ +SY+  E +I R + F  +L+ 
Sbjct: 3   FVFAVLALVFAPTASELISEGELEAHFNLF---KTRFGRSYANFEEEIFRKRVFASNLEF 59

Query: 67  IEELNKNRQ---SPESARYGITEFSDLSEEEFKTRH--LRHSVNKHVLMSHHKHHDHHHN 121
           I   N NR+     ++    +  F+D+S  EF+ R   LRHS  +     H    +    
Sbjct: 60  I--FNHNREFFAGNKNFNVAVNNFTDMSNTEFRARFNGLRHSGVQSAPAIHSASAE---- 113

Query: 122 HVKKRSITTGITIPTGIPVKKDWREA-GIIGKVRNQQTCGACWAF-STVETAESMHALKN 179
                          G+P   DW +   ++  ++NQ+ CG+CWAF S V + E  H LK 
Sbjct: 114 ---------------GLPATVDWTKVKNVVTPIKNQEQCGSCWAFFSAVASMEGQHGLKT 158

Query: 180 GTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
           G L  LS Q ++DC A  GNMGC GG       ++  NK + + E  YP    D + + K
Sbjct: 159 GKLVSLSEQNLVDCSAAEGNMGCEGGLMDQAFQYVIANKGI-DTEMSYPYKAIDESWEFK 217

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQY-NCDG 295
             S  G  IKSY  D    SESS+ + +AT GP+   ++A  L++Q+Y  GV +   C  
Sbjct: 218 KNSV-GATIKSYV-DVKTGSESSLQSAVATVGPISVGIDASQLSFQFYSSGVYEEPACST 275

Query: 296 SLANINHAVQIVGY 309
           ++  ++H V  VGY
Sbjct: 276 TI--LDHGVTAVGY 287


>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
          Length = 344

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 101/297 (34%), Positives = 141/297 (47%), Gaps = 33/297 (11%)

Query: 31  EQKLELFSSFQQRYKKS-----YSKSEHDIRFKNFEKSLDII----EELNKNRQSPESAR 81
           ++ L  +SS+ + Y K      YS  E    F+ F+K+LD+I    EE N+  QS E   
Sbjct: 21  QKYLSAWSSWVKEYNKEHWVDPYSSPESTRAFEVFQKNLDMIMKHNEEYNQGLQSYE--- 77

Query: 82  YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
            G+  F+ L+ EEF  ++L +     V     +    H    K RS          IP  
Sbjct: 78  MGLNGFAHLTFEEFSAQYLGYG-GAEVEQPKTRRAGKHER--KSRSE---------IPAS 125

Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMG 200
            DWRE G + +V+NQ  CG+CWAFS V   E  H L +G L  LS Q+++DC+   GN G
Sbjct: 126 VDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGELISLSEQQLVDCSKKFGNHG 185

Query: 201 CSGGDF-CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK--IKSYTCDTLIP 257
           C+GG    A   WM+      + E +YP    D  CK    S +GV+  I  Y  D    
Sbjct: 186 CAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKCK---FSADGVRATISGYN-DVKQG 241

Query: 258 SESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYS 313
           +E+ +L  +A  GPV  A++A    Q+YL GV           +NH V  VGY   S
Sbjct: 242 NETDLLDAVANVGPVSVAIHAGAALQFYLRGVFNGVAGTCFGPLNHGVTAVGYGTAS 298


>gi|118350036|ref|XP_001008299.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89290066|gb|EAR88054.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 332

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 93/321 (28%), Positives = 150/321 (46%), Gaps = 39/321 (12%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPNLEQKL---ELFSSFQQRYKKSYSKSEHDIRFKNFE 61
           K ++F+ A   +   A+ +  S+ ++E+ +   ++ ++  Q++++   K  H I +K  E
Sbjct: 3   KLIVFVAAAFIIASTAVLIIESQSSVEEVIINSDIIAA--QKWQEFLKK--HSITYKTIE 58

Query: 62  KSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           + L        N +  E  + YGIT+F DL+ EEF+ R+LR   N               
Sbjct: 59  EKLHRFAVFRDNLKKIEGHSNYGITKFMDLTSEEFQQRYLRLKTNTI-----------KR 107

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
            + K       + +  G  +  DW + G +  V++Q+ CG+CWAFS     ES   +  G
Sbjct: 108 QNFKSNPKNAQLNMKLGDDIIIDWTKKGAVTPVKDQEQCGSCWAFSATGALESATFISTG 167

Query: 181 TLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
           TL  LS QE++DC+ + GN GC GGD  A   ++  N +  E E  Y     D  CK   
Sbjct: 168 TLPSLSEQELVDCSTSYGNEGCDGGDMDAAFKFIHDNNIATEKEYTYRGF--DQKCK-GT 224

Query: 240 TSPNGVKIKSY----TCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
             P    + S+    +CD L+ +            PV  AV+A  WQYY  G    +C  
Sbjct: 225 QYPTTYGLSSFVDVQSCDELVAA--------IQQQPVSVAVDATNWQYYEFGTFN-DC-- 273

Query: 296 SLANINHAVQIVGYDNYSRTW 316
              N+NH V +VGY++ +  W
Sbjct: 274 -FDNLNHGVLLVGYNSKTHQW 293


>gi|4678299|emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
          Length = 363

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 99/299 (33%), Positives = 139/299 (46%), Gaps = 46/299 (15%)

Query: 27  KPNL-----EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESA 80
           +PNL     E K  LF S    Y K+YS  E  I R   F K  ++++        P SA
Sbjct: 39  RPNLLGTHTESKFRLFMS---DYGKNYSTREEYIHRLGIFAK--NVLKAAEHQMMDP-SA 92

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT---- 136
            +G+T+FSDL+EEEFK                 + +    +    R  T G   P     
Sbjct: 93  VHGVTQFSDLTEEEFK-----------------RMYTGVADVGGSRGGTVGAEAPMVEVD 135

Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN 196
           G+P   DWRE G + +V+NQ  CG+CWAFST   AE  H +  G L  LS Q+++DC   
Sbjct: 136 GLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQA 195

Query: 197 G----NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
                + GC GG      +++ +    LE E  YP   K   CK     P  V ++    
Sbjct: 196 DKKACDNGCGGGLMTNAYEYL-MEAGGLEEERSYPYTGKRGHCK---FDPEKVAVRVLNF 251

Query: 253 DTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD--GSLANINHAVQIVGY 309
            T+   E+ I  ++  HGP+   +NA+  Q Y+GGV   +C    S  N+NH V +VGY
Sbjct: 252 TTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGV---SCPLICSKRNVNHGVLLVGY 307


>gi|343477619|emb|CCD11596.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 156/315 (49%), Gaps = 26/315 (8%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   ARTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRMFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +S+   E   +   +   A +G+T+FSD+S EEF+  +L  +          K++     
Sbjct: 67  QSM---ERAKEEAAANPYATFGVTQFSDMSPEEFRATYLNGA----------KYYAAALK 113

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +K      +T+ TG  P   DWR+ G +  V++Q+ CG+CWAFS +   E    +   
Sbjct: 114 RPRKV-----VTVSTGKAPPAIDWRKKGAVTPVKDQRKCGSCWAFSAIGNIEGQWKVAGH 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L+ LS Q ++ C  N + GC GG     L W+   NK  +  E  YP    D       
Sbjct: 169 ELTSLSEQMLVSC-DNMDDGCQGGLMDRALKWIVSSNKGNVFTEESYPYDSTDGDVPPCN 227

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            S   V  K      L   E++I   +A +GP+  AV+A ++  Y GGV+  +C  S   
Sbjct: 228 KSGKVVGAKISGLINLPKDENAIAEWLAKNGPIAIAVDASSFLDYTGGVLT-SC--SSDA 284

Query: 300 INHAVQIVGYDNYSR 314
           +NH V +VGYD+ S+
Sbjct: 285 LNHDVLLVGYDDSSK 299


>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 95/319 (29%), Positives = 146/319 (45%), Gaps = 29/319 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEK 62
           V N+  +  L+   FL+              E    +  +Y K Y  S E ++R K F++
Sbjct: 6   VLNITSLTLLLVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFKE 65

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           ++  IE  N      +S + GI +F+DL+ EEFK R+       H+  +  +     + H
Sbjct: 66  NVQRIEAFN--NAGNKSYKLGINQFADLTNEEFKARN---RFKGHMCSNSTRTPTFKYEH 120

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
           V            T +P   DWR+ G +  +++Q  CG CWAFS V   E +  L  G L
Sbjct: 121 V------------TSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKL 168

Query: 183 SLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
             LS QE++DC   G + GC GG       ++  NK  L  E++YP    DA C   A +
Sbjct: 169 ISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNK-GLNTEAKYPYQGVDATCNANAEA 227

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
            +   IK +  D    SES++L  +A   P+  A++A    +Q+Y  GV   +C   L  
Sbjct: 228 KDAASIKGFE-DVPANSESALLKAVANQ-PISVAIDASGSEFQFYSSGVFTGSCGTEL-- 283

Query: 300 INHAVQIVGY--DNYSRTW 316
            +H V  VGY  D  ++ W
Sbjct: 284 -DHGVTAVGYGSDGGTKYW 301


>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
 gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
 gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
 gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 96/317 (30%), Positives = 149/317 (47%), Gaps = 33/317 (10%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
           +F++    +    +  +V +P L  K E   + F + YK +   +E + RF+ F+ +++ 
Sbjct: 11  MFLIFTTWMLPYVMSSRVLEPYLSNKHEKWMTQFGKSYKDA---AEKEKRFQIFKNNVEF 67

Query: 67  IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           IE  N     P      I  F+DL+ EEFK            L  + K HD     +   
Sbjct: 68  IELFNAVGNKP--FNLSINHFADLTNEEFKAS----------LNGNKKLHDKFD--ILNE 113

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
           + +      T +P   DWR+ G +  ++NQ +CG+CWAFSTV + E +H +  G L  LS
Sbjct: 114 TTSFRYHNVTSVPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIEGIHQITTGELVSLS 173

Query: 187 VQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
            QE+IDC    + GCSGG       ++   K  +  E+ YP    D  CK K  S +  +
Sbjct: 174 EQELIDCVRGNSSGCSGGYLEDAFKFI-AKKGGMASETNYPYKETDEKCKFKKESKHVAE 232

Query: 247 IKSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINH 302
           IK Y     +P  SE+ +L  +A   PV   V+A    +Q+Y GG+    C     + +H
Sbjct: 233 IKGY---EKVPSNSENDLLKAVANQ-PVSVYVDAGDYVFQFYSGGIFTGKCG---TDTDH 285

Query: 303 AVQIVGYD---NYSRTW 316
            V IVGY    +Y+  W
Sbjct: 286 VVTIVGYGVSLDYTEYW 302


>gi|343477445|emb|CCD11724.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
          Length = 380

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 95/319 (29%), Positives = 161/319 (50%), Gaps = 34/319 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 67  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   + + TG  P   DWR+ G +  V++Q  CG+CWAFS +   E    +  
Sbjct: 111 ALKRPRKV---VNVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVAG 167

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDA---AC 235
             L+ LS Q ++ C  N + GC GG       W+   NK  +  E  YP         AC
Sbjct: 168 HELTSLSEQMLVSCDTN-DFGCEGGLMDDAFKWIVSSNKGNVFTEQSYPYASGGGNVPAC 226

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
             K+    G KI+ +    L   E++I   +A +GPV  AV+A ++Q Y GGV+  +C  
Sbjct: 227 D-KSGKVVGAKIRDHV--DLPEDENAIAEWLAKNGPVAIAVDATSFQSYTGGVLT-SCIS 282

Query: 296 SLANINHAVQIVGYDNYSR 314
              +++H V +VGYD+ S+
Sbjct: 283 E--HLDHGVLLVGYDDTSK 299


>gi|332326585|gb|AEE42616.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 86/282 (30%), Positives = 139/282 (49%), Gaps = 29/282 (10%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           BQ  CG+CWAFS V   ES  A+    L  LS Q+++ C  + + GC GG      +W+ 
Sbjct: 143 BQGACGSCWAFSAVGNIESQWAVAGHRLXXLSEQQLVSC-DDKDSGCXGGLMTQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IAT 268
             +N  +   E  YP +        C   +    G +I  Y    +I S  +++   +A 
Sbjct: 202 RXMNGTMFT-EDSYPYVSSTGDVPECTNSSELVPGARIDGY---VMIESNETVMAAWLAK 257

Query: 269 HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
            GP+   V+A ++  Y  GV+  +C G   ++NH V +VGY+
Sbjct: 258 SGPISIGVDASSFMSYESGVLT-SCAGK--HLNHGVLLVGYN 296


>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
          Length = 350

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 91/288 (31%), Positives = 145/288 (50%), Gaps = 35/288 (12%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++ +ELF S+  R+ K Y   E  + RF+ F+ +L  I+E NK      +   G++EF
Sbjct: 40  SMDKLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNK---VVSNYWLGLSEF 96

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWRE 146
           +DLS  EF  ++L   V+                + ++R      T     +P   DWR+
Sbjct: 97  ADLSHREFNNKYLGLKVD----------------YSRRRESPEEFTYKDVELPKSVDWRK 140

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
            G +  V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG  
Sbjct: 141 KGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGG-- 198

Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
             L+D+     V    L  E +YP ++++ AC+        V I  Y  D    +E S+L
Sbjct: 199 --LMDYAFSFIVENGGLHKEEDYPYIMEEGACEMTKEETQVVTISGYH-DVPQNNEQSLL 255

Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             +A   P+  A+ A    +Q+Y GGV   +C    ++++H V  VGY
Sbjct: 256 KALANQ-PLSVAIEASGRDFQFYSGGVFDGHCG---SDLDHGVAAVGY 299


>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
          Length = 442

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 95/316 (30%), Positives = 153/316 (48%), Gaps = 35/316 (11%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
           ++ +  L+ +C +     +  P  E    LF  F+  + ++Y S  E   RF+ F  ++ 
Sbjct: 3   IVIVTVLLMVCTV-----MGAPTTEV---LFRDFKTTHARNYASADEERKRFEIFAANMK 54

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
              ELN  R++P  A +G  EF+D+S EEF+TRH           +  +H+        K
Sbjct: 55  KAAELN--RKNPM-ATFGPNEFADMSSEEFQTRH-----------NAARHYAAVMARPPK 100

Query: 126 RSIT-TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
            + T T   I   +  K DWR  G +  V+NQ +CG+CW+FST    E  HA+  G L  
Sbjct: 101 NTKTFTEEEINAAVGQKVDWRLKGAVTPVKNQGSCGSCWSFSTTGNIEGQHAIATGQLVS 160

Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDA---ACKRKAT 240
           LS QE++ C    + GCSGG       W +  +   +  E+ YP +  +    AC   + 
Sbjct: 161 LSEQELVSC-DTVDDGCSGGLMDNAFGWLLSAHNGQITTEASYPYVSGNGIVPACTFNSN 219

Query: 241 S-PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
           S P G  I S+    +  +E  +   +  +GP+   V+A +WQ Y+GG++ +  D     
Sbjct: 220 SNPVGATITSF--HDIPKTERDMAAFVFKYGPLSIGVDASSWQSYIGGILSHCSD---VQ 274

Query: 300 INHAVQIVGYDNYSRT 315
           I+H V IVG+D+ + T
Sbjct: 275 IDHGVLIVGFDDTAST 290


>gi|332326593|gb|AEE42620.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 138/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y + Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYWRVYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           +Q  CG+CWAFS V   ES  A+    L+ LS Q+++ C  + + GC GG      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGNIESQWAVAGHRLTALSEQQLVSC-DDKDSGCGGGLMTQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  +   E  YP +        C   +    G +I  Y   T+  SE+ +   +A  
Sbjct: 202 RNMNGTMFT-EDSYPYVSSXGDVPECTNSSQLVPGARIDGYV--TIESSETVMAAWLAKS 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+   V+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIGVDASSFMSYESGVLT-SCAGB--XLNHGVLLVGYN 296


>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
          Length = 360

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 89/284 (31%), Positives = 140/284 (49%), Gaps = 38/284 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F++F+ ++ KSY ++ EHD RF  F  +L        + +   SA +G+T+FSDL+ EEF
Sbjct: 44  FTTFKTKFGKSYATQEEHDYRFGVFRANL---RRAKLHAKLDPSAEHGVTKFSDLTPEEF 100

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           K ++L   +    L S               +      +PT  +P   DWR+ G +  V+
Sbjct: 101 KRQYL--GLKPLRLPS---------------TANKAPILPTSDLPENFDWRDKGAVTPVK 143

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
           NQ +CG+CWAFST    E  H L  G L  LS Q+++DC         G  + GC+GG  
Sbjct: 144 NQGSCGSCWAFSTTGALEGAHYLSTGELVSLSEQQLVDCDHVCDPEEYGACDAGCNGGLM 203

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               D++ +    ++ E +YP   +D  CK    S     + +++  +L   E  I  ++
Sbjct: 204 NNAFDYI-LQAGGVQTEKDYPYSGRDETCKFD-KSKVAATVANFSVVSL--DEDQIAANL 259

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             HGP+   +NA+  Q Y+GGV   Y C     N++H V +VGY
Sbjct: 260 VKHGPLAVGINAIFMQTYIGGVSCPYICG---KNLDHGVLLVGY 300


>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
 gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
          Length = 371

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 136/281 (48%), Gaps = 28/281 (9%)

Query: 34  LELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           ++LF  +  +Y+K+Y+  E  + RF+ F+ +L  I+E NK   +      G+  F+DL+ 
Sbjct: 63  IKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVTT---YWLGLNAFADLTH 119

Query: 93  EEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           +EFK  +L  R    K    S  ++               G      +P   DWR+ G +
Sbjct: 120 DEFKATYLGLRQPETKKTTDSRFRY---------------GGVADDDVPASVDWRKKGAV 164

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
             V+NQ  CG+CWAFSTV   E ++ +  G L+ LS QE++DC+ +GN GC+GG      
Sbjct: 165 TDVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAF 224

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            ++  +   L  E  YP L+++  C  KA     V   S   D     E +++  +A H 
Sbjct: 225 SYI-ASSGGLRTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALA-HQ 282

Query: 271 PVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           P+  A+ A    +Q+Y GGV    C   L   +H V  VGY
Sbjct: 283 PLSVAIEASGRHFQFYSGGVFNGPCGSEL---DHGVAAVGY 320


>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
          Length = 351

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/287 (32%), Positives = 141/287 (49%), Gaps = 32/287 (11%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++  +LF S+  ++ KSY   E  + RF+ F+ +L  I+E NK   S      G+ EF
Sbjct: 40  SMDKLTDLFESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSS---YWLGLNEF 96

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DLS EEFK ++L       + +   K  D       K            +P   DWR+ 
Sbjct: 97  ADLSHEEFKRKYL------GLKIELPKRRDSPEEFSYKDVAD--------LPKSVDWRKK 142

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG   
Sbjct: 143 GAVAHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDKPFNNGCNGG--- 199

Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
            L+D+     ++   L  E +YP ++++  C  K      V I  Y  D    +E S L 
Sbjct: 200 -LMDYAFAFIISNGGLRKEEDYPYVMEEGTCGEKKEELEVVTISGYH-DVPEDNEQSFLK 257

Query: 265 DIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +A   P+  A+ A +  +Q+Y GG+   +C   L   +H V  VGY
Sbjct: 258 ALANQ-PLSVAIEASSRGFQFYSGGIFNGHCGTEL---DHGVAAVGY 300


>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
          Length = 358

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 90/283 (31%), Positives = 145/283 (51%), Gaps = 34/283 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+SF+ ++ KSY +K EHD RF  F+ +L  I+     +  P +A +GIT+FSDL+  EF
Sbjct: 43  FTSFKSKFSKSYATKEEHDYRFGVFKANL--IKAKLHQKLDP-TAEHGITKFSDLTASEF 99

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           + + L   +NK + +  H                  I   T +P   DWRE G +  V++
Sbjct: 100 RRQFL--GLNKRLRLPAHAQ-------------KAPILPTTNLPEDFDWREKGAVTPVKD 144

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
           Q +CG+CWAFST    E  H L  G L  LS Q+++DC        AG+ + GC+GG   
Sbjct: 145 QGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEEAGSCDSGCNGGLMN 204

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
              +++  +  V++ E +Y    +D +CK    S     + +++  +L   E  I  ++ 
Sbjct: 205 NAFEYLLQSGGVVQ-EKDYAYTGRDGSCKFD-KSKVVASVSNFSVVSL--DEEQIAANLV 260

Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            +GP+  A+NA   Q Y+ GV   Y C  + A ++H V +VG+
Sbjct: 261 KNGPLAVAINAAWMQAYMSGVSCPYVC--AKARLDHGVLLVGF 301


>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
 gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
 gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
          Length = 350

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 91/287 (31%), Positives = 143/287 (49%), Gaps = 32/287 (11%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++ +ELF S+  R+ K Y   E  + RF+ F+ +L  I++ NK      +   G+ EF
Sbjct: 39  SMDKLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNK---VVSNYWLGLNEF 95

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DLS +EFK ++L   V+    +S  +         +             +P   DWR+ 
Sbjct: 96  ADLSHQEFKNKYLGLKVD----LSQRRESSEEEFTYR----------DVDLPKSVDWRKK 141

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG   
Sbjct: 142 GAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGG--- 198

Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
            L+D+     V    L  E +YP +++++ C+ K      V I  Y  D    +E S+L 
Sbjct: 199 -LMDYAFSFIVKNGGLHKEEDYPYIMEESTCEMKKEVSEVVTINGYH-DVPQNNEQSLLK 256

Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +A   P+  A+ A    +Q+Y GGV   +C   L   +H V  VGY
Sbjct: 257 ALANQ-PLSVAIEASGRDFQFYSGGVFDGHCGSEL---DHGVSAVGY 299


>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 91/288 (31%), Positives = 146/288 (50%), Gaps = 35/288 (12%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++ +ELF S+  R+ K Y   E  + RF+ F+ +L  I+E NK      +   G+ EF
Sbjct: 40  SMDKLIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNK---VVSNYWLGLNEF 96

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWRE 146
           +DLS +EFK ++L   V+                + ++R      T     +P   DWR+
Sbjct: 97  ADLSHQEFKNKYLGLKVD----------------YSRRRESPEEFTYKDVELPKSVDWRK 140

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
            G + +V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG  
Sbjct: 141 KGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGG-- 198

Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
             L+D+     V    L  E +YP ++++  C+        V I  Y  D    +E S+L
Sbjct: 199 --LMDYAFSFIVENDGLHKEEDYPYIMEEGTCEMAKEETEVVTISGYH-DVPQNNEQSLL 255

Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             +A   P+  A+ A    +Q+Y GGV   +C    ++++H V  VGY
Sbjct: 256 KALANQ-PLSVAIEASGRDFQFYSGGVFDGHCG---SDLDHGVAAVGY 299


>gi|118395092|ref|XP_001029901.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89284178|gb|EAR82238.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 344

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 95/312 (30%), Positives = 154/312 (49%), Gaps = 22/312 (7%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFE 61
           ++K +++I  L+A+   A   +   P   Q+L  F  F+ ++ K Y ++ EH   F N++
Sbjct: 2   NMKFIVYIFVLVAVASCAYMNETIDP---QRLAEFEEFKSKFNKYYHNEHEHHSSFHNYK 58

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
            S    E + K++    +A++G T+FSD+S EEF+ + L    +  +             
Sbjct: 59  TSR---EHIVKHQMENPNAKFGHTKFSDMSPEEFENKMLNFDFS--LFKKAKSQGIKLKA 113

Query: 122 HVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
              K  +  G  +  + +P   DWR+ GII   + Q TCG+CW F+T    ES +ALK G
Sbjct: 114 EPMKGYLRQGENVDNSDLPESFDWRDKGIITPAKFQNTCGSCWTFATTGVIESQYALKYG 173

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L   S Q ++DC  N N GC GG       ++  +  +   ++       D   K+   
Sbjct: 174 ELLHFSEQMLLDC-DNINQGCRGGLMTDAYQFLQQSGGIQTADT-----YGDYKNKKDIC 227

Query: 241 SPNGVKIKSYTCDTL-IP-SESSILTDIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSL 297
           + +  K+K+   D   IP +E +I  ++  +GPV   +NA T Q+Y GG++   NCD   
Sbjct: 228 NFDKAKVKAKVVDWYQIPENEETIRRELVKNGPVAVGINARTLQFYEGGIVDPKNCDDK- 286

Query: 298 ANINHAVQIVGY 309
             INHAV IVGY
Sbjct: 287 --INHAVLIVGY 296


>gi|302763927|ref|XP_002965385.1| hypothetical protein SELMODRAFT_439207 [Selaginella moellendorffii]
 gi|300167618|gb|EFJ34223.1| hypothetical protein SELMODRAFT_439207 [Selaginella moellendorffii]
          Length = 353

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 95/283 (33%), Positives = 136/283 (48%), Gaps = 30/283 (10%)

Query: 33  KLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLS 91
           K+  F  F  R+K+ Y S  E   RF  F ++L++IEE N+ ++ P +    + +F+D+S
Sbjct: 47  KVARFHEFATRHKRVYGSLVELRERFVTFSRNLELIEETNR-KELPYT--LAVNQFADMS 103

Query: 92  EEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
            EEFK         KH L S         N V+          P   P KKDWR+  I+ 
Sbjct: 104 WEEFK---------KHNLFSSQNCSATATNSVR------AFLTP---PSKKDWRDDKIVS 145

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALL 210
            V+NQQ CG+CW FST    ES HA   G + +LS Q+++DCAG   N GCSGG      
Sbjct: 146 PVKNQQHCGSCWTFSTTGALESAHAQATGKMVVLSEQQLVDCAGGYNNFGCSGGLPSQAF 205

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SESSILTDIATH 269
           +++  N   L+ E  YP    D  C       N +  K Y    +   +E  ++  +A +
Sbjct: 206 EYIRYNG-GLDTEDSYPYTAHDGKCMYNQ---NSIGAKVYDVVNITEGAEDELIHAVAFN 261

Query: 270 GPVIAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGYD 310
            PV  A   L  +++Y  GV   N C      +NHAV  VGY+
Sbjct: 262 RPVSIAYEVLKDFRFYKSGVYTSNVCGTGPDTVNHAVLAVGYN 304


>gi|168047065|ref|XP_001775992.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672650|gb|EDQ59184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 336

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 90/278 (32%), Positives = 138/278 (49%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  +YKK Y   E    RF  F +S+ ++E  NK + S   A   + EF+D++ EEF
Sbjct: 29  FAGFAAKYKKEYKTVEELKHRFVTFLESVKLVETHNKGQHSYSLA---VNEFADMTFEEF 85

Query: 96  K-TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           + +R ++   N    + +H              + TG ++P      KDWRE GI+ +V+
Sbjct: 86  RDSRLMKGEQNCSATVGNH--------------VLTGESLPK----TKDWREEGIVSQVK 127

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           NQ +CG+CW FST    E+ HA   G + LLS Q+++DCAG   N GC GG      +++
Sbjct: 128 NQASCGSCWTFSTTGALEAAHAQATGKMVLLSEQQLVDCAGEFNNFGCGGGLPSQAFEYI 187

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N  + + E  YP   KD+ C+    +  G ++     +    +E+ +   IAT  PV 
Sbjct: 188 RYNGGI-DTEDSYPYNAKDSQCRFHKNTI-GAQVWD-VVNITEGAETQLKHAIATMRPVS 244

Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            A   +  ++ Y GGV    NC      +NHAV  VGY
Sbjct: 245 VAFEVVHDFRLYNGGVYTSLNCHTGPQTVNHAVLAVGY 282


>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
 gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
          Length = 475

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 84/285 (29%), Positives = 136/285 (47%), Gaps = 37/285 (12%)

Query: 28  PNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITE 86
           P+ EQ +ELF  +++ ++K Y    E  +R +NF+++L  I E N  R SP     G+  
Sbjct: 42  PSEEQVVELFQQWKKEHQKFYIHPEEAALRLENFKRNLKYIVERNAMRNSPVGHHLGLNR 101

Query: 87  FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
           F+D+S EEFK +           +S  +  D                     P   DWR+
Sbjct: 102 FADMSNEEFKNK----------FISKVESCDD-------------------APYSLDWRK 132

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
            G++  V++Q  CG+CW+FS+    E ++A+  G L  LS QE++DC    N GC GG  
Sbjct: 133 KGVVTGVKDQGNCGSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCD-TTNDGCEGGYM 191

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               +W+ +N   ++ E++YP +     C         V I  YT   +  S+S++    
Sbjct: 192 DYAFEWV-INNGGIDTEADYPYIGVGGTCNVTKEETKVVTIDGYT--DVTQSDSALFCAT 248

Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               P+   ++   L +Q Y GG+   +C  +  +I+HAV IVGY
Sbjct: 249 VKQ-PISVGIDGSTLDFQLYTGGIYDGDCSSNPDDIDHAVLIVGY 292


>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 447

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/318 (28%), Positives = 161/318 (50%), Gaps = 32/318 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 67  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   +T+ TG  P   DWR+ G +  V++Q  CG+CWAFS +   E    +  
Sbjct: 111 ALKRPRKV---VTVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVTG 167

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK-- 236
             L+ LS Q ++ C    ++GC+GG       W+   N+  +  E  YP   K       
Sbjct: 168 HNLTSLSEQMLVSC-DTEDLGCAGGLMDNAFKWIVSSNRHNVFTEESYPYASKGGNVPPC 226

Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
           R +    G KI+ +    L   E++I   +A +GPV  AV++ ++Q Y GGV+  +C   
Sbjct: 227 RMSGKVVGAKIRDHV--DLPKDENAIAEWLAKNGPVAIAVDSTSFQSYTGGVLT-SCISK 283

Query: 297 LANINHAVQIVGYDNYSR 314
              ++H V +VGYD+ S+
Sbjct: 284 --QLDHGVLLVGYDDTSK 299


>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
 gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 157/320 (49%), Gaps = 44/320 (13%)

Query: 6   NVLFIVALIALCFLAIP-------VKVSKPNL---EQKLELFSSFQQRYKKSYSKSEHDI 55
           + LF +A ++L FLA         V  +  +L   ++ ++LF S+  R+ + Y  +E  +
Sbjct: 7   SFLFFLA-VSLSFLAYSGFARDSIVGYAPEDLTSNDKLIDLFESWISRFGRVYESAEEKL 65

Query: 56  -RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
            RF+ F+ +L  I++ NK  ++      G+ EF+DLS EEFK ++L   +   +      
Sbjct: 66  ERFEIFKDNLFHIDDTNKKVRN---YWLGLNEFADLSHEEFKNKYL--GLKPDLSKRAQC 120

Query: 115 HHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
             +  +  V              IP   DWR+ G +  V+NQ +CG+CWAFSTV   E +
Sbjct: 121 PEEFTYKDV-------------AIPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGI 167

Query: 175 HALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVV---LEPESEYPLLLK 231
           + +  G L+ LS QE+IDC    N GC+GG    L+D+     V    L  E +YP +++
Sbjct: 168 NQIVTGNLTSLSEQELIDCDTTYNNGCNGG----LMDYAFAYIVANGGLHKEEDYPYIME 223

Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
           +  C  +    + V I  Y  D    SE S+L  +A   P+  A+ A    +Q+Y GGV 
Sbjct: 224 EGTCDMRKEESDAVTISGYH-DVPQNSEESLLKALANQ-PLSIAIEASGRDFQFYSGGVF 281

Query: 290 QYNCDGSLANINHAVQIVGY 309
             +C   L   +H V  VGY
Sbjct: 282 DGHCGTEL---DHGVAAVGY 298


>gi|146335580|gb|ABQ23399.1| cathepsin L isotype 2 [Trypanoplasma borreli]
          Length = 443

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 96/310 (30%), Positives = 149/310 (48%), Gaps = 28/310 (9%)

Query: 12  ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEEL 70
           A+I    L +   +  P  +   +LFS F+  + ++Y S  E   RF+ F  ++    EL
Sbjct: 3   AVIVTALLMVCTVMGAPTTD---DLFSDFKATHARNYVSPGEERKRFEIFAANMKKAAEL 59

Query: 71  NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           N  R++P  A +G  EF+D+S EEF+TRH   +  +H   +  +   H  +  K+     
Sbjct: 60  N--RKNPM-ATFGPNEFADMSSEEFQTRH---NAARHYAAAKARRAKHTKSFTKEE---- 109

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
              I      K DWR  G +  V+NQ +CG+CW+FST    E  +A+  G L  LS QE+
Sbjct: 110 ---IKAADGQKIDWRLKGAVTSVKNQGSCGSCWSFSTTGNIEGQNAIATGNLVSLSEQEL 166

Query: 191 IDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDA---ACKRKATS-PNGV 245
           + C    N GC+GG       W+       +  E+ YP +  +    AC     + P G 
Sbjct: 167 VSCDTTDN-GCNGGLMDNAFGWLISTRGGQIATEASYPYVSGNGIVPACSYNLDNKPVGA 225

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQ 305
            I ++    +  +E  +   +  +GP+   V+A TWQ Y GG+I Y  D     I+H V 
Sbjct: 226 TISNF--QDITGTEEDMAAFVFNYGPLSIGVDASTWQSYAGGIITYCPD---VQIDHGVL 280

Query: 306 IVGYDNYSRT 315
           IVGYD+ + T
Sbjct: 281 IVGYDDTAPT 290


>gi|343470378|emb|CCD16903.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 95/315 (30%), Positives = 154/315 (48%), Gaps = 26/315 (8%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   ARTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRMFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +S+   E   +   +   A +G+T+FSD+S EEF+  +L  +          K++     
Sbjct: 67  QSM---ERAKEEAAANPYATFGVTQFSDMSPEEFRATYLNGA----------KYYAAALK 113

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +K      + + TG  P   DWR+ G +  V++Q  CG+CWAFS +   E    +   
Sbjct: 114 RPRKV-----VNVSTGKAPPAIDWRKKGAVTPVKDQGKCGSCWAFSAIGNIEGQWKVAGH 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L+ LS Q ++ C  N + GC GG     L W+   NK  +  E  YP    D       
Sbjct: 169 ELTSLSEQMLVSC-DNMDYGCRGGFLDRALKWIVSSNKGNVFTEESYPYDSTDGDVPPCN 227

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
            S   V  K      L   E++I   +A +GP+  AV+A ++  Y GGV+  +C  S   
Sbjct: 228 KSGKVVGAKISGLINLPKDENAIAEWLAKNGPIAIAVDASSFLDYTGGVLT-SC--SSDA 284

Query: 300 INHAVQIVGYDNYSR 314
           +NH V +VGYD+ S+
Sbjct: 285 LNHGVLLVGYDDSSK 299


>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
 gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 148/308 (48%), Gaps = 31/308 (10%)

Query: 6   NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSL 64
           ++  I  L AL   AI   +   ++ +K E    +  R+K+ YS + E +IR+K F++++
Sbjct: 11  SLALIFFLGALASQAIARTLQDASIHEKHE---EWMTRFKRVYSDAKEKEIRYKIFKENV 67

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
             IE  NK   S +S + GI +F+DL+ EEFKT   R+    H+  S      + +    
Sbjct: 68  QRIESFNK--ASEKSYKLGINQFADLTNEEFKTS--RNRFKGHMCSSQAGPFRYEN---- 119

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
                      T +P   DWR+ G +  +++Q  CG+CWAFS V   E +  L    L  
Sbjct: 120 ----------ITAVPSSMDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLIS 169

Query: 185 LSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           LS QE++DC   G + GC GG       +++ N+  L  E+ YP    D  C  K  + +
Sbjct: 170 LSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQ-GLTTEANYPYEGSDGTCNTKQEANH 228

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
             KI  +  D    +E +++  +A   PV  A++A    +Q+Y  G+   +C   L   +
Sbjct: 229 AAKINGFE-DVPANNEGALMKAVAKQ-PVSVAIDAGGFEFQFYSSGIFTGDCGTEL---D 283

Query: 302 HAVQIVGY 309
           H V  VGY
Sbjct: 284 HGVAAVGY 291


>gi|343412462|emb|CCD21670.1| cysteine peptidase (CP), putative [Trypanosoma vivax Y486]
          Length = 367

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 88/284 (30%), Positives = 136/284 (47%), Gaps = 29/284 (10%)

Query: 36  LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF++F+Q+Y +SY + +E   R + FE   D +        +   A +G+T FSDL+ EE
Sbjct: 33  LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 89

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
           F+TR+               H+   H    +  + T + +P G  P   DWR  G +  V
Sbjct: 90  FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPV 134

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           ++Q TCG+CW+FS +   E   A     L+ LS Q ++ C    N GC GG      +W+
Sbjct: 135 KDQGTCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDTKDN-GCGGGLMDNAFEWI 193

Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
              N   +  E  YP +      +     P G K+  + T    IP  E +I   +A +G
Sbjct: 194 VKENSGKVYTEKSYPYV--SGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNG 251

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
           PV  AV+A T+  Y GGV+  +C      +NH V +VGY++ S+
Sbjct: 252 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 292


>gi|146335578|gb|ABQ23398.1| cathepsin L isotype 1 [Trypanoplasma borreli]
          Length = 443

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 96/310 (30%), Positives = 149/310 (48%), Gaps = 28/310 (9%)

Query: 12  ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEEL 70
           A+I    L +   +  P  +   +LFS F+  + ++Y S  E   RF+ F  ++    EL
Sbjct: 3   AVIVTALLMVCTVMGAPTTD---DLFSDFKATHARNYVSPGEERKRFEIFAANMKKAAEL 59

Query: 71  NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           N  R++P  A +G  EF+D+S EEF+TRH   +  +H   +  +   H  +  K+     
Sbjct: 60  N--RKNPM-ATFGPNEFADMSSEEFQTRH---NAARHYAAAKARRAKHTKSFTKEE---- 109

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
              I      K DWR  G +  V+NQ +CG+CW+FST    E  +A+  G L  LS QE+
Sbjct: 110 ---IKAADGQKIDWRLKGAVTSVKNQGSCGSCWSFSTTGNIEGQNAIATGNLVSLSEQEL 166

Query: 191 IDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDA---ACKRKATS-PNGV 245
           + C    N GC+GG       W+       +  E+ YP +  +    AC     + P G 
Sbjct: 167 VSCDTTDN-GCNGGLMDNAFGWLISTRGGQIATEASYPYVSGNGIVPACSYNLDNKPVGA 225

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQ 305
            I ++    +  +E  +   +  +GP+   V+A TWQ Y GG+I Y  D     I+H V 
Sbjct: 226 TISNF--QDITGTEEDMAAFVFNYGPLSIGVDASTWQSYAGGIITYCPD---VQIDHGVL 280

Query: 306 IVGYDNYSRT 315
           IVGYD+ + T
Sbjct: 281 IVGYDDTAPT 290


>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 91/288 (31%), Positives = 146/288 (50%), Gaps = 35/288 (12%)

Query: 29  NLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++ +ELF S+  ++ K Y S  E  +RF+ F+ +L  I+E NK      +   G+ EF
Sbjct: 39  SMDKLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNK---VVSNYWLGLNEF 95

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWRE 146
           +DLS +EFK ++L   V+                + ++R      T     +P   DWR+
Sbjct: 96  ADLSHQEFKNKYLGLKVD----------------YSRRRESPEEFTYKDVELPKSVDWRK 139

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
            G +  V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG  
Sbjct: 140 KGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGG-- 197

Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
             L+D+     V    L  E +YP ++++  C+        V I  Y  D    +E S+L
Sbjct: 198 --LMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVVTISGYH-DVPQNNEQSLL 254

Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             +A   P+  A+ A    +Q+Y GGV   +C    ++++H V  VGY
Sbjct: 255 KALANQ-PLSVAIEASGRDFQFYSGGVFDGHCG---SDLDHGVAAVGY 298


>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
          Length = 325

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 89/284 (31%), Positives = 141/284 (49%), Gaps = 34/284 (11%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RYGITEFSDLSE 92
           E +  F+ +  KSY S  E   RF+ F+++L  IE  N+   + ES  ++G+T+F+DL+E
Sbjct: 21  EEWVQFKVKNNKSYKSYVEEQTRFRIFQENLRKIENHNEKYNNGESTFKFGVTKFTDLTE 80

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           +EF        ++  VL  + + +  H  H+        +     +P   DWR+ G + +
Sbjct: 81  KEF--------LDLLVLSKNARPNRTHATHL--------LAPLRDLPSAFDWRDKGAVTE 124

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V++Q  CG+CW FST  + E+ H LK G L  LS Q ++DCA +   GC GG       W
Sbjct: 125 VKDQGMCGSCWTFSTTGSVEAAHFLKTGNLVSLSEQNLVDCAKDTCYGCGGG-------W 177

Query: 213 MD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
           MD     + K  +  E +YP    D  C R   S    KI ++T       E  +   +A
Sbjct: 178 MDKALEYIEKGGIMSEKDYPYEGVDDNC-RFDISKVAAKISNFTY-IKKNDEEDLKNAVA 235

Query: 268 THGPVIAAVNA-LTWQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
             GP+  A++A  T+Q Y+ G++    C     ++NH V +VGY
Sbjct: 236 AKGPISVAIDASATFQLYVSGILDDTECSNEFDSLNHGVLVVGY 279


>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 357

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 90/306 (29%), Positives = 141/306 (46%), Gaps = 22/306 (7%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLD 65
           VL ++AL     L+IP+K      E  L  L+  ++  +  S    +   RF  F++++ 
Sbjct: 7   VLLVLALAFGSTLSIPIKEKDLESEDSLWSLYERWRSHHAVSRDLDQKQKRFNVFKENVK 66

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
            I E NKN+    + +  + +F D++ +EF+ ++    V+ H  M   +H          
Sbjct: 67  FIHEFNKNKDV--TFKLALNKFGDMTNQEFRAKYAGSKVHHHRTMKGSRHGSGSGAKFMY 124

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
            +           P   DWRE G +  V+NQ  CG+CWAFS +   E ++ +    L  L
Sbjct: 125 ENAVA--------PPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVEGINQIVTKELVPL 176

Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
           S QE+IDC  + N GCSGG      +++  N  +   E  YP   +DA CK+   SP  V
Sbjct: 177 SEQELIDCDTDQNQGCSGGLMDYAFEFIKNNGGIT-TEDVYPYQAEDATCKK--NSP-AV 232

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
            I  Y  D     E +++  +A   PV  A+ A    +Q+Y  GV    C   L   +H 
Sbjct: 233 VIDGYE-DVPTNDEDALMKAVANQ-PVAVAIEASGYVFQFYSEGVFTGRCGTEL---DHG 287

Query: 304 VQIVGY 309
           V +VGY
Sbjct: 288 VAVVGY 293


>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 457

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 91/283 (32%), Positives = 143/283 (50%), Gaps = 32/283 (11%)

Query: 34  LELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           +ELF  +  +++K+Y+  E  + RF+ F+ +L  I+++N+   S      G+ EF+DL+ 
Sbjct: 147 IELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVTS---YWLGLNEFADLTH 203

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEFK  +L  +       S            K   ++        +P   DWR  G + +
Sbjct: 204 EEFKATYLGLAPPAPARESR--------GSFKYEDVSA-----DDLPKSVDWRTKGAVTE 250

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+NQ  CG+CWAFSTV   E ++A+  G L+ LS QE+IDC+ +GN GC+GG    L+D+
Sbjct: 251 VKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGG----LMDY 306

Query: 213 M---DVNKVVLEPESEYPLLLKDAAC-KRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
                 +   L  E  YP L+++ +C   K +    V I  Y  D    +E +++  +A 
Sbjct: 307 AFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEAVTISGYE-DVPAHNEQALIKALA- 364

Query: 269 HGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           H PV  A+ A    +Q+Y GGV    C   L   +H V  VGY
Sbjct: 365 HQPVSVAIEASGRHFQFYSGGVFDGPCGTQL---DHGVAAVGY 404


>gi|66803148|ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
 gi|166201987|sp|P04988.2|CYSP1_DICDI RecName: Full=Cysteine proteinase 1; Flags: Precursor
 gi|60463731|gb|EAL61909.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
          Length = 343

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 90/313 (28%), Positives = 152/313 (48%), Gaps = 35/313 (11%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           ++ L  L    + V      LE++ + F  FQ ++ K YS  E+  RF+ F+ +L  IEE
Sbjct: 3   VILLFVLAVFTVFVSSRGIPLEEQSQ-FLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61

Query: 70  LNK---NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           LN    N ++    ++G+ +F+DLS +EFK  +L    NK  + +         +++   
Sbjct: 62  LNLIAINHKA--DTKFGVNKFADLSSDEFKNYYLN---NKEAIFTDDLPV---ADYLDDE 113

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
            I +       IP   DWR  G +  V+NQ  CG+CW+FST    E  H +    L  LS
Sbjct: 114 FINS-------IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLS 166

Query: 187 VQEVIDC---------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
            Q ++DC             + GC+GG      +++  N  + + ES YP   +      
Sbjct: 167 EQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKNGGI-QTESSYPYTAETGTQCN 225

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTD-IATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
             ++  G KI ++   T+IP   +++   I + GP+  A +A+ WQ+Y+GGV    C+ +
Sbjct: 226 FNSANIGAKISNF---TMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 282

Query: 297 LANINHAVQIVGY 309
             +++H + IVGY
Sbjct: 283 --SLDHGILIVGY 293


>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 325

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 99/308 (32%), Positives = 147/308 (47%), Gaps = 45/308 (14%)

Query: 11  VALIAL--CFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDII 67
           VAL+AL  C  A+P              F+ ++  + + Y S  E  +R + +  +L++I
Sbjct: 7   VALLALVACATAMP--------------FAEWKALHNRQYASAQEEALRQEIYLSNLELI 52

Query: 68  EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
            E   N     S   G+ EF DL+  EF  ++L    N               N  K  +
Sbjct: 53  NE--HNAAGRHSYTLGMNEFGDLAHHEFAAKYLGVRFNGV-------------NATKSFA 97

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
            +T +     +P   DWR AGI+  V+NQ  CG+CW+FST  + E  HA K GTL  LS 
Sbjct: 98  SSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGSVEGQHARKTGTLVSLSE 157

Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
           Q ++DC+   GN GC+GG      +++  N  + + E+ YP       CK  A +  G  
Sbjct: 158 QNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGI-DTEASYPYTATTGTCKFNAANI-GAT 215

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN---CDGSLANIN 301
           + SY  D +  SES +   +AT GPV  A++A  + +Q+Y  GV  YN   C  S   ++
Sbjct: 216 VASYQ-DIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGV--YNEKKC--STTQLD 270

Query: 302 HAVQIVGY 309
           H V  VGY
Sbjct: 271 HGVLAVGY 278


>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
          Length = 325

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 91/307 (29%), Positives = 149/307 (48%), Gaps = 41/307 (13%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           +V ++   F    V+V     +   EL+  F++ Y K+Y+  +   RF  F+ +L   ++
Sbjct: 9   LVVVVGCAFAVNTVRVP----DNARELYEQFKRDYGKAYANEDDQKRFAIFKDNLVRAQQ 64

Query: 70  LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
                Q   +A+YG+T+FSDL+ EEF   +L   +++ V            + V+   + 
Sbjct: 65  YQTQEQG--TAKYGVTQFSDLTNEEFAAMYLGSRIDERV------------DRVQLNDLQ 110

Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
           T        P   DWRE G +G V +Q +CG+CWAFS     E    LK G L  LS Q+
Sbjct: 111 TA-------PASVDWREKGAVGPVEHQGSCGSCWAFSVTANVEGQWFLKTGRLVSLSKQQ 163

Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIK 248
           ++DC    + GCSGG       + ++ ++  LE +S YP    + AC+   +     K+ 
Sbjct: 164 LVDC-DRLDHGCSGG--YPPYTYKEIKRMGGLELQSAYPYTGWEQACRLDRS-----KLF 215

Query: 249 SYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHA 303
           +   D+++   +E      +A HGP+   +NA   Q+Y  G++   +Y C  S   +NHA
Sbjct: 216 AKIDDSIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQFYRYGILHPSEYAC--SPEGLNHA 273

Query: 304 VQIVGYD 310
           V  VGYD
Sbjct: 274 VLTVGYD 280


>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
          Length = 344

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 99/316 (31%), Positives = 145/316 (45%), Gaps = 38/316 (12%)

Query: 6   NVLFIVALIAL-CFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSYSK-SEHDIRFKNFE 61
           N L+ ++L  L C     ++V+   L+     E    +  +Y K Y    E + RFK F+
Sbjct: 5   NQLYHISLALLFCLGLFAIQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFK 64

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           ++++ IE  N N    +S + GI +F+DL+ EEF     R+    H+  S  +     + 
Sbjct: 65  ENVNYIETFN-NADDTKSYKLGINQFADLTNEEFIAS--RNKFKGHMCSSIMRTTSFKYE 121

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
           +V            +GIP   DWR+ G +  V+NQ  CG CWAFS V   E +H L  G 
Sbjct: 122 NV------------SGIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGK 169

Query: 182 LSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAAC 235
           L  LS QE++DC   G + GC GG    L+D  D  K +     L  E++YP    D  C
Sbjct: 170 LISLSEQELVDCDTKGVDQGCEGG----LMD--DAFKFIIQNHGLSTEAQYPYEGVDGTC 223

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
                S   V I  Y  D    SE ++   +A   P+  A++A    +Q+Y  GV    C
Sbjct: 224 NANKASVQAVTITGYE-DVPANSEQALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGAC 281

Query: 294 DGSLANINHAVQIVGY 309
              L   +H V  VGY
Sbjct: 282 GTEL---DHGVTAVGY 294


>gi|34761156|gb|AAQ81938.1| cysteine proteinase precursor [Ipomoea batatas]
          Length = 371

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 89/286 (31%), Positives = 144/286 (50%), Gaps = 32/286 (11%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F+ +Y KSY ++ EHD R   F+ +L       +++    SA +G+T+FSDL+ +EF
Sbjct: 47  FTLFKSKYGKSYATQEEHDYRLSVFKANL---RRAKRHQMLDPSAVHGVTKFSDLTPKEF 103

Query: 96  KTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGK 152
           +  +L  R S +    +      D H   +          +PT  +P   +WR+ G +  
Sbjct: 104 RRTYLGIRKSSSSKQKLKLKLPADAHAAEI----------LPTSDLPFDFEWRDYGAVTG 153

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGG 204
           V++Q  CG+CW+FST  T E  + L  G L  L+ QE++DC        AG  + GC+GG
Sbjct: 154 VKDQGLCGSCWSFSTTGTLEGTNFLATGELLSLNEQELVDCDHLCDPKKAGACDAGCNGG 213

Query: 205 DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
                 +++ +    LE E +YP   +D  CK    S     + +++  +L   E  I  
Sbjct: 214 LMTTAYEYV-LQSGGLEKEKDYPYTGRDGTCKFD-KSKIAAAVANFSVVSL--DEDQIAA 269

Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           ++  HGP+   +N++  Q Y+GGV   Y C  S  N++H V IVGY
Sbjct: 270 NLVKHGPLSVGINSIFMQTYIGGVSCPYIC--SKKNLDHGVLIVGY 313


>gi|57282617|emb|CAE54306.1| putative papain-like cysteine proteinase [Gossypium hirsutum]
          Length = 373

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 91/295 (30%), Positives = 147/295 (49%), Gaps = 37/295 (12%)

Query: 26  SKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGI 84
           ++P L      +S F++R+KKSY S+ EHD RFK F+ +L       +++    SA +G+
Sbjct: 47  AEPQLLTAEHHYSLFKKRFKKSYGSQKEHDYRFKIFQVNL---RRAARHQNLDPSATHGV 103

Query: 85  TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKD 143
           T+FSDL+  EF+  +L   + +  L                +  T    +PT  +P   D
Sbjct: 104 TQFSDLTPGEFRKAYL--GLRRLRL---------------PKDATEAPILPTDNLPQDFD 146

Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AG 195
           WRE G +  V+NQ +CG+CW+FST    E  + L  G L  LS Q+++DC        AG
Sbjct: 147 WREKGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAG 206

Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
           + + GC+GG   +  ++  +    L  E +YP    D    +   +    K+ +++  +L
Sbjct: 207 SCDSGCNGGLMNSAFEYT-LKAGGLMREEDYPYTGTDRGTCKFDNTKVAAKVANFSVVSL 265

Query: 256 IPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
              E  I  ++  +GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 266 --DEDQIAANLFKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 315


>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
          Length = 475

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 100/300 (33%), Positives = 144/300 (48%), Gaps = 41/300 (13%)

Query: 24  KVSKPNLEQKLE-------LFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQ 75
           KV  P+  Q LE        F  F  +Y K YS  E  D R + F ++L   E+L    Q
Sbjct: 157 KVEDPSTSQPLEESVELLGQFKEFMTKYNKVYSSQEEVDRRLRIFHENLKTAEKLQALDQ 216

Query: 76  SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP 135
              SA YG+T+FSDL+EEEF++ +L   +++  L          H  +K  +   G +  
Sbjct: 217 G--SAEYGVTKFSDLTEEEFRSTYLNPLLSQWTL----------HQPMKPATPAKGPS-- 262

Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
              P   DWR+ G +  V+NQ  CG+CWAFS +   E    LKNGTL  LS QE++DC G
Sbjct: 263 ---PDSWDWRDHGAVSPVKNQGMCGSCWAFSVIGNIEGQWFLKNGTLLSLSEQELVDCDG 319

Query: 196 NGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT 254
             +  C GG      +   + K+  LE ES+Y        C          K+ +Y   +
Sbjct: 320 L-DQACRGGLPSNAYE--AIEKLGGLETESDYSYTGHKQRCDFTTG-----KVAAYINSS 371

Query: 255 --LIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
             L   E  I   +A +GPV  A+NA   Q+Y  G+   ++  C+  +  I+HAV +VGY
Sbjct: 372 VELPKDEKEIAAWLAENGPVSVALNAFAMQFYRKGISHPLKIFCNPWM--IDHAVLLVGY 429


>gi|408009|gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense]
          Length = 444

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 93/318 (29%), Positives = 159/318 (50%), Gaps = 32/318 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 67  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   + + TG  P   DWR+ G +  V++Q  CG+CWAFS +   E    +  
Sbjct: 111 ALKRPRKV---VNVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVAG 167

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR- 237
             L+ LS Q ++ C  N + GC GG       W+   NK  +  E  YP           
Sbjct: 168 HELTSLSEQMLVSCDTN-DFGCEGGLMDDAFKWIVSSNKGNVFTEQSYPYASGGGNVPTC 226

Query: 238 -KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
            K+    G KI+ +    L   E++I   +A +GPV  AV+A ++Q Y GGV+  +C   
Sbjct: 227 DKSGKVVGAKIRDHV--DLPEDENAIAEWLAKNGPVAIAVDATSFQSYTGGVLT-SCISE 283

Query: 297 LANINHAVQIVGYDNYSR 314
             +++H V +VGYD+ S+
Sbjct: 284 --HLDHGVLLVGYDDTSK 299


>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
 gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
 gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
 gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
 gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
          Length = 366

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 95/286 (33%), Positives = 142/286 (49%), Gaps = 42/286 (14%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F +RY K YS   EH+ RF  F+ +L  +  L   +  P  A +G+T+FSDL++EEF
Sbjct: 57  FRHFIRRYGKKYSGPEEHEHRFGVFKSNL--LRALEHQKLDPR-ASHGVTKFSDLTQEEF 113

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + ++L         +      D H   +          +PT  +P   DWRE G + +V+
Sbjct: 114 RHQYLG--------LRAPPLRDAHDAPI----------LPTNDLPEDFDWREKGAVTEVK 155

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CWAFST    E  + LK G L  LS Q+++DC        A + + GC+GG  
Sbjct: 156 NQGSCGSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLM 215

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILT 264
            +   +  +    LE E +YP   KD  C     S N  KI ++  +  + S  E  I  
Sbjct: 216 TSAYQYA-LKSGGLEKEEDYPYTGKDGTC-----SFNKNKIVAHVSNFSVVSIDEGQIAA 269

Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           ++  +GP+   +NA   Q Y+GGV   Y C  S  N++H V +VGY
Sbjct: 270 NLVKNGPLSVGINAAFMQTYVGGVSCPYVC--SKRNLDHGVLLVGY 313


>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
 gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
 gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
          Length = 366

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 95/286 (33%), Positives = 142/286 (49%), Gaps = 42/286 (14%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F +RY K YS   EH+ RF  F+ +L  +  L   +  P  A +G+T+FSDL++EEF
Sbjct: 57  FRHFIRRYGKKYSGPEEHEHRFGVFKSNL--LRALEHQKLDPR-ASHGVTKFSDLTQEEF 113

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + ++L         +      D H   +          +PT  +P   DWRE G + +V+
Sbjct: 114 RHQYLG--------LRAPPLRDAHDAPI----------LPTNDLPEDFDWREKGAVTEVK 155

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CWAFST    E  + LK G L  LS Q+++DC        A + + GC+GG  
Sbjct: 156 NQGSCGSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLM 215

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILT 264
            +   +  +    LE E +YP   KD  C     S N  KI ++  +  + S  E  I  
Sbjct: 216 TSAYQYA-LKSGGLEKEEDYPYTGKDGTC-----SFNKNKIVAHVSNFSVVSIDEGQIAA 269

Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           ++  +GP+   +NA   Q Y+GGV   Y C  S  N++H V +VGY
Sbjct: 270 NLVKNGPLSVGINAAFMQTYVGGVSCPYVC--SKRNLDHGVLLVGY 313


>gi|401416326|ref|XP_003872658.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|14348750|emb|CAC41275.1| CPB2 protein [Leishmania mexicana]
 gi|322488882|emb|CBZ24132.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 359

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 89/280 (31%), Positives = 138/280 (49%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 37  LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS+V   E    L    L  LS Q+++ C  + N GC GG      DW+ 
Sbjct: 143 DQGECGSCWAFSSVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201

Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N    L  E  YP +  +     C   +    G +I S+    +  SE ++   +A +G
Sbjct: 202 QNTNGHLYTEDSYPYVSGNGYLPECSNSSELVVGAQIDSHVL--IGSSEKAMAAWLAKNG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  A++A ++  Y  GV+   C G    +NHAV +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--EVNHAVLLVGYD 296


>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
 gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
          Length = 368

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 92/285 (32%), Positives = 140/285 (49%), Gaps = 39/285 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F++++KKSY S+ EHD RF  F+ +L       ++++   +A +G+T+FSDL+  EF
Sbjct: 53  FSLFKRKFKKSYLSQEEHDYRFSVFKSNL---RRAARHQKLDPTASHGVTQFSDLTSAEF 109

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L   + K  L                +   T   +PT  +P   DWRE G +G V+
Sbjct: 110 RKQVL--GLRKLRL---------------PKDANTAPILPTNDLPEDFDWREKGAVGPVK 152

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FST    E  H L  G L  LS Q+++DC         G+ + GC+GG  
Sbjct: 153 NQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 212

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
            +  ++  +    L  E +YP    D  ACK      N V         +   E  I  +
Sbjct: 213 NSAFEYT-LKAGGLMREEDYPYTGMDRGACK---FDKNKVAAGVANFSAVSLDEDQIAAN 268

Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           +  +GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 269 LVKNGPLAVAINAVFMQTYIGGVSCPYICSRRL---DHGVLLVGY 310


>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 90/288 (31%), Positives = 144/288 (50%), Gaps = 35/288 (12%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++ +ELF S+  R+ K Y   E  + RF  F+ +L  I+E NK      +   G+ EF
Sbjct: 39  SMDKLIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNK---VVSNYWLGLNEF 95

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWRE 146
           +DLS +EFK ++L   V+                + ++R      T     +P   DWR+
Sbjct: 96  ADLSHQEFKNKYLGLKVD----------------YSRRRESPEEFTYKDFELPKSVDWRK 139

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
            G + +V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG  
Sbjct: 140 KGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGG-- 197

Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
             L+D+     V    L  E +YP ++++  C+        V I  Y  D    +E S+L
Sbjct: 198 --LMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVVTISGYH-DVPQNNEQSLL 254

Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             +    P+  A+ A    +Q+Y GGV   +C    ++++H V  VGY
Sbjct: 255 KALVNQ-PLSVAIEASGRDFQFYSGGVFDGHCG---SDLDHGVAAVGY 298


>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
          Length = 368

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 92/285 (32%), Positives = 140/285 (49%), Gaps = 39/285 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F++++KKSY S+ EHD RF  F+ +L       ++++   +A +G+T+FSDL+  EF
Sbjct: 53  FSLFKRKFKKSYLSQEEHDYRFSVFKSNL---RRAARHQKLDPTASHGVTQFSDLTSAEF 109

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L   + K  L                +   T   +PT  +P   DWRE G +G V+
Sbjct: 110 RKQVL--GLRKLRL---------------PKDANTAPILPTNDLPEDFDWREKGAVGPVK 152

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FST    E  H L  G L  LS Q+++DC         G+ + GC+GG  
Sbjct: 153 NQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 212

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
            +  ++  +    L  E +YP    D  ACK      N V         +   E  I  +
Sbjct: 213 NSAFEYT-LKAGGLMREEDYPYTGMDRGACK---FDKNKVAAGVANFSVVSLDEDQIAAN 268

Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           +  +GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 269 LVKNGPLAVAINAVFMQTYIGGVSCPYICSRRL---DHGVLLVGY 310


>gi|240255643|ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
 gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 367

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 99/303 (32%), Positives = 139/303 (45%), Gaps = 50/303 (16%)

Query: 27  KPNL-----EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESA 80
           +PNL     E K  LF S    Y K+YS  E  I R   F K  ++++        P SA
Sbjct: 39  RPNLLGTHTESKFRLFMS---DYGKNYSTREEYIHRLGIFAK--NVLKAAEHQMMDP-SA 92

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT---- 136
            +G+T+FSDL+EEEFK                 + +    +    R  T G   P     
Sbjct: 93  VHGVTQFSDLTEEEFK-----------------RMYTGVADVGGSRGGTVGAEAPMVEVD 135

Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--- 193
           G+P   DWRE G + +V+NQ  CG+CWAFST   AE  H +  G L  LS Q+++DC   
Sbjct: 136 GLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQA 195

Query: 194 -----AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
                    + GC GG      +++ +    LE E  YP   K   CK     P  V ++
Sbjct: 196 CDPKDKKACDNGCGGGLMTNAYEYL-MEAGGLEEERSYPYTGKRGHCK---FDPEKVAVR 251

Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD--GSLANINHAVQI 306
                T+   E+ I  ++  HGP+   +NA+  Q Y+GGV   +C    S  N+NH V +
Sbjct: 252 VLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGV---SCPLICSKRNVNHGVLL 308

Query: 307 VGY 309
           VGY
Sbjct: 309 VGY 311


>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
 gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  127 bits (318), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 92/284 (32%), Positives = 145/284 (51%), Gaps = 36/284 (12%)

Query: 34  LELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARY--GITEFSDL 90
           ++LF S+  +++K Y   E    RF+ F+ +L  I+E NK     +   Y  G+ EF+DL
Sbjct: 30  IDLFESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNK-----KVVNYWLGLNEFADL 84

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           S EEFK ++L  +V+    +S+ +       +    SI          P   DWR+ G +
Sbjct: 85  SHEEFKNKYLGLNVD----LSNRRECSEEFTYKDVSSI----------PKSVDWRKKGAV 130

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
             V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE++DC    N GC+GG    L+
Sbjct: 131 TDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTYNNGCNGG----LM 186

Query: 211 DWMD---VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
           D+     ++   L  E +YP ++++  C+ +      V I  Y  D    SE S+L  +A
Sbjct: 187 DYAFAYIISNGGLHKEEDYPYIMEEGTCEMRKAESEVVTISGYH-DVPQNSEESLLKALA 245

Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
              P+  A++A    +Q+Y GGV   +C   L   +H V  VGY
Sbjct: 246 NQ-PLSVAIDASGRDFQFYSGGVFDGHCGTEL---DHGVAAVGY 285


>gi|39930363|ref|NP_058817.1| cathepsin J precursor [Rattus norvegicus]
 gi|84028185|sp|Q63088.2|CATJ_RAT RecName: Full=Cathepsin J; AltName: Full=Cathepsin L-related
           protein; AltName: Full=Cathepsin P; AltName:
           Full=Catlrp-p; Flags: Precursor
 gi|28196048|gb|AAL26793.2| cathepsin P [Rattus norvegicus]
 gi|66910531|gb|AAH97263.1| Cathepsin J [Rattus norvegicus]
 gi|149039736|gb|EDL93852.1| cathepsin J [Rattus norvegicus]
          Length = 334

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 98/305 (32%), Positives = 156/305 (51%), Gaps = 32/305 (10%)

Query: 11  VALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           V L+ LCF +A       PNL+ + +    ++ +Y KSYS  E +++   +E++L +I+ 
Sbjct: 5   VFLVILCFGVASGAPARDPNLDAEWQ---DWKTKYAKSYSPVEEELKRAVWEENLKMIQL 61

Query: 70  LNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            NK N          +  F+D + EEF     R S++  ++ +           V   S 
Sbjct: 62  HNKENGLGKNGFTMEMNAFADTTGEEF-----RKSLSDILIPAA----------VTNPSA 106

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
              ++I  G+P  KDWR+ G +  VRNQ  CG+CWAF+ V   E     K G L+ LSVQ
Sbjct: 107 QKQVSI--GLPNFKDWRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQ 164

Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
            ++DC+   GN GC  G      +++  NK  LE E+ YP   KD  C+  + + +   I
Sbjct: 165 NLLDCSKSEGNNGCRWGTAHQAFNYVLKNK-GLEAEATYPYEGKDGPCRYHSENAS-ANI 222

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHAV 304
             +    L P+E  +   +A+ GPV AA++A   ++++Y GGV  + NC   +  +NHAV
Sbjct: 223 TGFV--NLPPNELYLWVAVASIGPVSAAIDASHDSFRFYSGGVYHEPNCSSYV--VNHAV 278

Query: 305 QIVGY 309
            +VGY
Sbjct: 279 LVVGY 283


>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 90/288 (31%), Positives = 143/288 (49%), Gaps = 35/288 (12%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++ +ELF S+  R+ K Y   E  + RF+ F+ +L  I+E NK      +   G+ EF
Sbjct: 40  SMDKLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNK---VVSNYWLGLNEF 96

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWRE 146
           +DLS  EF  ++L   V+                + ++R      T     +P   DWR+
Sbjct: 97  ADLSHREFNNKYLGLKVD----------------YSRRRESPEEFTYKDVELPKSVDWRK 140

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
            G +  V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG  
Sbjct: 141 KGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGG-- 198

Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
             L+D+     V    L  E +YP ++++  C+        V I  Y  D    +E S+L
Sbjct: 199 --LMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETQVVTISGYH-DVPQNNEQSLL 255

Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             +A   P+  A+ A    +Q+Y GGV   +C    ++++H V  VGY
Sbjct: 256 KALANQ-PLSVAIEASGRDFQFYSGGVFDGHCG---SDLDHGVAAVGY 299


>gi|340053965|emb|CCC48258.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 441

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 136/284 (47%), Gaps = 29/284 (10%)

Query: 36  LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF++F+Q+Y +SY + +E   R + FE   D +        +   A +G+T FSDL+ EE
Sbjct: 33  LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 89

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
           F+TR+               H+   H    +  + T + +P G  P   DWR  G +  V
Sbjct: 90  FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPV 134

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           ++Q +CG+CW+FS +   E   A     L+ LS Q ++ C    N GC GG      +W+
Sbjct: 135 KDQGSCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDTKDN-GCGGGLMDNAFEWI 193

Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
              N   +  E  YP +      +     P G K+  + T    IP  E +I   +A +G
Sbjct: 194 VKENSGKVYTEKSYPYV--SGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNG 251

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
           PV  AV+A T+  Y GGV+  +C      +NH V +VGY++ S+
Sbjct: 252 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 292


>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
 gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
          Length = 356

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 86/287 (29%), Positives = 144/287 (50%), Gaps = 32/287 (11%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFS 88
           ++ + ++  + Q++ K+Y++  E   RF+ F+ +L  I+E N +NR    + + G+T+F+
Sbjct: 22  DEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQNR----TYKVGLTKFA 77

Query: 89  DLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           DL+ +E++   L   S  K  LM          N  ++ +   G  +P  +    DWR  
Sbjct: 78  DLTNQEYRAMFLGTRSDPKRRLMKSK-------NPSERYAYKAGDKLPESV----DWRGK 126

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  +++Q +CG+CWAFSTV   E ++ +  G L  LS QE++DC    N GC+GG   
Sbjct: 127 GAVNPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGCNGG--- 183

Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
            L+D+     +N   L+ E +YP L  D  C R       V I  +  + ++P +   L 
Sbjct: 184 -LMDYAFQFIINNGGLDTEKDYPYLGNDDTCDRDKMKTKAVSIDGF--EDVLPFDEKALQ 240

Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               H PV  A+ A  +  Q+Y  GV    C  +L   +H V +VGY
Sbjct: 241 KAVAHQPVSVAIEASGMALQFYQSGVFTGECGTAL---DHGVVVVGY 284


>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
          Length = 463

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 87/288 (30%), Positives = 143/288 (49%), Gaps = 29/288 (10%)

Query: 34  LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           +EL+  +  ++KK+Y+   E   RF  F+ +   I +   N Q   S + G+ +F+DLS 
Sbjct: 41  MELYELWLAQHKKAYNGLGEKQNRFSVFKDNFLYIHQ--HNNQGNPSYKLGLNQFADLSH 98

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEFK  +L   ++    +S+     + +        + G  +P  I    DWRE G +  
Sbjct: 99  EEFKATYLGAKLDTKKRLSNSPSPRYQY--------SDGEDLPESI----DWREKGAVTA 146

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V++Q +CG+CWAFSTV   E ++ +  G L+ LS QE++DC  + N GC+GG    L+D+
Sbjct: 147 VKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGG----LMDY 202

Query: 213 ---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
                +N   L+ E +YP    D +C     + + V I  Y  + +  ++   L   A +
Sbjct: 203 AFQFIINNGGLDSEDDYPYKANDGSCDAYRKNAHVVTIDDY--EDVPENDEKSLKKAAAN 260

Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
            P+  A+ A    +Q+Y  GV    C   L   +H V +VGY + S T
Sbjct: 261 QPISVAIEASGRAFQFYESGVFTSTCGTQL---DHGVTLVGYGSESGT 305


>gi|332326591|gb|AEE42619.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 87/282 (30%), Positives = 138/282 (48%), Gaps = 29/282 (10%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYWRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           BQ  CG+CWAFS V   ES  A+    L  LS Q+++ C  + + GC GG      +W+ 
Sbjct: 143 BQGACGSCWAFSAVGNIESQWAVAXHGLVRLSEQQLVSC-DDKDSGCGGGLMTQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IAT 268
            ++N  +   E  YP +        C   +    G +I  Y    +I S  +++   +A 
Sbjct: 202 RNMNGTMFT-EDSYPYVSSTGDVPECTNSSELVPGARIDGY---VMIESXETVMAAWLAK 257

Query: 269 HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
            GP+  AV+A  +  Y  GV+  +C G    +NH V +VGY+
Sbjct: 258 SGPISIAVDASPFMSYESGVLT-SCVGK--XLNHGVLLVGYN 296


>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
          Length = 887

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 89/283 (31%), Positives = 148/283 (52%), Gaps = 35/283 (12%)

Query: 35  ELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           +LF++F   Y ++YS   E ++R + F ++L II+ L K  +   +A Y +  F+D+S E
Sbjct: 580 QLFNNFVVTYNRTYSTPEERNLRLRIFRENLGIIQLLRKTERG--TAHYDVNMFADMSPE 637

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGK 152
           EF++R+L             +      N +  R       IP   +P K DWRE  ++  
Sbjct: 638 EFRSRYL-----------GLRPDLRSENDIPLREAE----IPDVELPPKFDWREKSVVTP 682

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD- 211
           V++Q  CG+CWAFS     E  +A+K+G L  LS QE++DC  + + GC+GG    L D 
Sbjct: 683 VKDQGMCGSCWAFSVTGNIEGQYAIKHGRLLSLSEQELVDC-DDLDEGCNGG----LPDN 737

Query: 212 -WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            +  + K+  LE ES+YP   ++  C  K    N  K++  +   +  +E+ +   +  +
Sbjct: 738 AYRAIEKLGGLELESDYPYEAENEKCHFKK---NLAKVQLASAVNITSNETQMAQWLVQN 794

Query: 270 GPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
           GP+   +NA   Q+Y+GGV    ++ C+    N++H V IVGY
Sbjct: 795 GPISIGINANAMQFYVGGVSHPFKFLCNPK--NLDHGVLIVGY 835


>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus
           pulchellus]
          Length = 475

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 95/291 (32%), Positives = 145/291 (49%), Gaps = 26/291 (8%)

Query: 30  LEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
           L Q+  LFS F + Y K+Y  K EH+ RF  F+ +L  I   N+  +   +A YG+TEFS
Sbjct: 159 LSQERSLFSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEG--TAHYGLTEFS 216

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
           DLS  EF+    RH +     ++ HK         + + I  G  +   +P   DWR  G
Sbjct: 217 DLSPSEFE----RHYLGLKKDLAEHK--------AEVKPIKVG-PVNEPLPDLFDWRTKG 263

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
            + +V+NQ  CG+CWAFS     E    L    L  LS QE++DC  +G+ GC GG    
Sbjct: 264 AVTEVKNQGMCGSCWAFSVTGNVEGQWFLSRSKLLSLSEQELVDC-DHGDHGCKGGYMGQ 322

Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
            +  + +    LE ESEYP    D  C+   T     +++S+    L  +E+ +   +  
Sbjct: 323 AMKAV-IEMGGLETESEYPYKGVDGTCEFNKTESK-ARVQSFV--GLPQNETELAYWLMK 378

Query: 269 HGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGYDNYSRTW 316
           HGPV   +NA   Q+Y GG+    ++ C  S  +++H V +VG+    R++
Sbjct: 379 HGPVSIGINANAMQFYFGGISHPWKFLC--SPTDLDHGVLLVGFGVDKRSF 427


>gi|394331824|gb|AFN27131.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 86/281 (30%), Positives = 140/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWR+ G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           +Q  CG+CWAFS V + ES  AL    L+ LS Q+++ C  + + GC         +W+ 
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSEQQLVSC-DDKDSGCRARLMLQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  +   E  YP +        C        G +I  Y   T+  SE+ +   +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSTGYVPECSNSIQLVPGARIDGYM--TIESSETVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIAVDASSFMSYQRGVVT-SCAG--MPLNHGVLLVGYN 296


>gi|2780176|emb|CAA71085.1| cystein proteinase [Leishmania mexicana]
          Length = 443

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 89/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 37  LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           NQ  CG+CWAFS V   E    L    L  LS Q+++ C    N GCSGG      DW+ 
Sbjct: 143 NQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMDN-GCSGGLMLQAFDWLL 201

Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N    L  E  YP +  +     C   +    G +I  +    +  SE ++   +A +G
Sbjct: 202 QNTNGHLYTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  A++A ++  Y  GV+   C G    +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296


>gi|414887429|tpg|DAA63443.1| TPA: hypothetical protein ZEAMMB73_816727 [Zea mays]
          Length = 334

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 88/300 (29%), Positives = 141/300 (47%), Gaps = 25/300 (8%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
            + A + L  +A      + ++E  L +  F  +Q  Y +SY + +E   RF+ + ++++
Sbjct: 31  LLCACLMLVLMAGAASGGRVDVEDMLMMDRFRGWQATYNRSYLTAAERLRRFEVYRQNME 90

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK- 124
           +IE    NR++  S + G T F+DL+ EEF   H   S   H   +  +H +    H   
Sbjct: 91  LIEA--TNRRAGLSYQLGETPFTDLTSEEFLATHT-MSTRLHASEAARRHRELITTHAGP 147

Query: 125 --------KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
                    R+ TT + +P  +    DWR  G +  V++Q  CG+CW+F TV   E +H 
Sbjct: 148 VSDGGRQWNRNYTTDLDVPESV----DWRTKGAVTPVKDQGACGSCWSFVTVAAIEGLHK 203

Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
           ++ G L  LS Q V+DC+   N GC+ GD  A +DW+  N   L  ES+YP + +   CK
Sbjct: 204 IRTGQLVSLSEQAVLDCSSPPNHGCNRGDPAAAIDWVSANG-GLTTESDYPYVGRQGKCK 262

Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIA-THGPVIAAVNA-LTWQYYLGGVIQYNCD 294
                 +  KIK      L+   +    ++A    PV   +N     Q+Y  GV    CD
Sbjct: 263 LDKARNHVAKIKGR---KLVDQNNEAALEVAVAQQPVAVDMNVDPILQHYKSGVFHGPCD 319


>gi|45822205|emb|CAE47499.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
          Length = 317

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 93/286 (32%), Positives = 142/286 (49%), Gaps = 38/286 (13%)

Query: 35  ELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARY-GITEFSDLSE 92
           + ++ F+  + K Y    E  +RF+ F ++L  IE+ N   Q+ E + Y G+ +F+D++ 
Sbjct: 14  QQWAQFKVNHSKKYGHLKEEQVRFQVFSQNLQKIEQHNARYQNGEVSFYLGVNQFADMTS 73

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT------GITIPTGIPVKKDWRE 146
           EEFK                    D    H  KR IT+       +T+P  I    DWRE
Sbjct: 74  EEFKAML-----------------DSQLIHKPKRDITSRFVADPQLTVPESI----DWRE 112

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGD 205
            G +  VR+Q+ CG+CWAFS     E    LK G L +LS Q+++DC+ +  N GC+GG 
Sbjct: 113 KGAVNPVRDQEQCGSCWAFSAAGALEGQRFLKEGKLEVLSTQQLVDCSRDYKNEGCNGGW 172

Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
                D++  N + LE + +Y        CK     P   KI  Y+  ++  +E ++   
Sbjct: 173 PHWAYDYIKDNGLCLESKYKYQ-GYDGYYCKE--CIPAIKKINGYS--SINQTEEALKEA 227

Query: 266 IATHGPVIAAVNA-LTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
           + T GP+   VNA   WQ Y GG+++  +C G   +INHAV  VGY
Sbjct: 228 VGTAGPIAVCVNANDDWQLYSGGILESQSCPGG-ESINHAVLAVGY 272


>gi|302790930|ref|XP_002977232.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
 gi|300155208|gb|EFJ21841.1| hypothetical protein SELMODRAFT_228454 [Selaginella moellendorffii]
          Length = 353

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 94/283 (33%), Positives = 137/283 (48%), Gaps = 30/283 (10%)

Query: 33  KLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLS 91
           K+  F  F  R+K+ Y S  E   RF  F ++L++IEE N+ ++ P +    + +F+D+S
Sbjct: 47  KVARFHEFATRHKRVYGSLVELRERFVTFSRNLELIEETNR-KELPYTL--AVNQFADMS 103

Query: 92  EEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
            EEFK         KH L S         N V+          P   P KKDWR+  I+ 
Sbjct: 104 WEEFK---------KHNLFSSQNCSATTTNSVR------AFLTP---PSKKDWRDDKIVS 145

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALL 210
            V+NQQ CG+CW FST    ES HA   G + +LS Q+++DCAG   N GC+GG      
Sbjct: 146 PVKNQQHCGSCWTFSTTGALESAHAQATGKMVVLSEQQLVDCAGGYNNFGCNGGLPSQAF 205

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SESSILTDIATH 269
           +++  N   L+ E  YP    D  C     + N +  K Y    +   +E  ++  +A +
Sbjct: 206 EYIRYNG-GLDTEDSYPYTGHDGKC---TYNQNSIGAKVYDVVNITEGAEDELIHAVAFN 261

Query: 270 GPVIAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGYD 310
            PV  A   L  +++Y  GV   N C      +NHAV  VGY+
Sbjct: 262 RPVSIAYEVLKDFRFYKSGVYTSNVCGTGPDTVNHAVLAVGYN 304


>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
          Length = 1032

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 144/281 (51%), Gaps = 33/281 (11%)

Query: 36  LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF +F   Y ++Y ++ E ++R   F ++L II  L KN Q   + +YG+ +F+D+S EE
Sbjct: 726 LFENFVNTYNRTYATEEERNLRLSIFRENLGIIRLLRKNEQG--TGQYGVNQFADVSTEE 783

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F   +L    +              +N   +++    I +P       DWR+ G +  V+
Sbjct: 784 FHAFYLGLRPDLRT----------ENNIPLRQAEIPDIELPNSF----DWRQKGAVTPVK 829

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD--W 212
           NQ  CG+CWAFS     E  +A+K+  L  LS QE++DC  + + GC+GG    L D  +
Sbjct: 830 NQGMCGSCWAFSVTGNVEGQYAIKHNKLLSLSEQELVDC-DDLDEGCNGG----LPDNAY 884

Query: 213 MDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
             + K+  LE ES+YP   ++  C  K    N  K++  +   +  +E+ I   +  +GP
Sbjct: 885 RAIEKLGGLELESDYPYEAENERCHFKK---NMAKVQVGSAVNITSNETQIAQWLVANGP 941

Query: 272 VIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
           +   +NA   Q+Y+GGV    ++ C+    N++H V IVGY
Sbjct: 942 ISIGINANAMQFYMGGVSHPFKFLCNPK--NLDHGVLIVGY 980


>gi|401430387|ref|XP_003886572.1| unnamed protein product, partial [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|356491640|emb|CBZ40951.1| unnamed protein product, partial [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 332

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 88/280 (31%), Positives = 136/280 (48%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 37  LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS V   E    L    L  LS Q+++ C  + N GCSGG      DW+ 
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCSGGLMLQAFDWLL 201

Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N    L  E  YP +  +     C   +    G +I  +    +  SE ++   +A +G
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  A++A ++  Y  GV+   C G    +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296


>gi|340053966|emb|CCC48259.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
           Y486]
          Length = 447

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 139/284 (48%), Gaps = 29/284 (10%)

Query: 36  LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF++F+Q+Y +SY + +E   R + FE   D +        +   A +G+T FSDL+ EE
Sbjct: 25  LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 81

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
           F+TR+               H+   H    +  + T + +P G  P   DWR  G +  V
Sbjct: 82  FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPV 126

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           ++Q +CG+CW+FS +   E   A     L+ LS Q ++ C    N GC GG      +W+
Sbjct: 127 KDQGSCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDFKDN-GCGGGFMDNAFEWI 185

Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
              N   +  E  YP + +D +  +    P G ++  + T    IP  E +I   +A +G
Sbjct: 186 VKENSGKVYTEKSYPYVSEDGS--KPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNG 243

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
           PV  AV+A T+  Y GGV+  +C      +NH V +VGY++ S+
Sbjct: 244 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 284


>gi|148908373|gb|ABR17300.1| unknown [Picea sitchensis]
          Length = 357

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 88/277 (31%), Positives = 128/277 (46%), Gaps = 28/277 (10%)

Query: 37  FSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y      + RF  F K++++IE  N        A   I EF+D++ EEF
Sbjct: 58  FAEFALRYGKRYDSVRQLVHRFNAFVKNVELIESRNSMNLPYTLA---INEFADITWEEF 114

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
             ++L  S N     S+HK  D                     P KKDWRE GI+  V+N
Sbjct: 115 HGQYLGASQNCSATKSNHKFTDAQP------------------PTKKDWREEGIVSPVKN 156

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
           Q  CG+CW FST    E+ +    G   +LS Q+++DCAG   N GCSGG      +++ 
Sbjct: 157 QAHCGSCWTFSTTGALEAAYTQATGKTVILSEQQLVDCAGAFNNFGCSGGLPSQAFEYIK 216

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
            N   L+ E  YP   KD  C     +  GVK+   + +  + +E  + + +    PV  
Sbjct: 217 YNG-GLDTEEAYPYTAKDGVCNYDVNNV-GVKVAD-SVNISLGAEDKLKSAVGLVRPVSV 273

Query: 275 AVNALT-WQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
           A   +  +++Y  GV     C     ++NHAV  VGY
Sbjct: 274 AFQVIQDFRFYKEGVFTSTTCGQGPMDVNHAVLAVGY 310


>gi|116779845|gb|ABK21448.1| unknown [Picea sitchensis]
 gi|116791731|gb|ABK26088.1| unknown [Picea sitchensis]
 gi|224286276|gb|ACN40847.1| unknown [Picea sitchensis]
          Length = 357

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 88/277 (31%), Positives = 128/277 (46%), Gaps = 28/277 (10%)

Query: 37  FSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y      + RF  F K++++IE  N        A   I EF+D++ EEF
Sbjct: 58  FAEFALRYGKRYDSVRQLVHRFNAFVKNVELIESRNSMNLPYTLA---INEFADITWEEF 114

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
             ++L  S N     S+HK  D                     P KKDWRE GI+  V+N
Sbjct: 115 HGQYLGASQNCSATKSNHKFTDAQP------------------PTKKDWREEGIVSPVKN 156

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
           Q  CG+CW FST    E+ +    G   +LS Q+++DCAG   N GCSGG      +++ 
Sbjct: 157 QAHCGSCWTFSTTGALEAAYTQATGKTVILSEQQLVDCAGAFNNFGCSGGLPSQAFEYIK 216

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
            N   L+ E  YP   KD  C     +  GVK+   + +  + +E  + + +    PV  
Sbjct: 217 YNG-GLDTEEAYPYTAKDGVCNYDVNNV-GVKVAD-SVNISLGAEDELKSAVGLVRPVSV 273

Query: 275 AVNALT-WQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
           A   +  +++Y  GV     C     ++NHAV  VGY
Sbjct: 274 AFQVIQDFRFYKEGVFTSTTCGQGPMDVNHAVLAVGY 310


>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 341

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 99/329 (30%), Positives = 151/329 (45%), Gaps = 46/329 (13%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKV--SKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFK 58
           +F     LF++   A C      +     P  E+  +  ++  + YK SY K +   +++
Sbjct: 6   LFHCTLALFLI--FAFCAFEANARTLEDAPMRERHEQWMATHGKVYKHSYEKEQ---KYQ 60

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
            F +++  IE  N     P   + GI  F+DL+ EEFK      ++N+            
Sbjct: 61  IFMENVQRIEAFNNAGXKP--YKLGINHFADLTNEEFK------AINRF---------KG 103

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
           H    + R+ T      T +P   DWR+ G +  +++Q  CG CWAFS V   E +  L+
Sbjct: 104 HVCSKRTRTTTFRYENVTAVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLR 163

Query: 179 NGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLE-----PESEYPLLLKD 232
            G L  LS QE++DC   G + GC GG    L+D  D  K +L+      E+ YP    D
Sbjct: 164 TGKLISLSEQELVDCDTKGVDQGCEGG----LMD--DAFKFILQNKGLATEAIYPYEGFD 217

Query: 233 AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQ 290
             C  KA   +   IK Y  D    SES++L  +A   PV  A+ A    +Q+Y GGV  
Sbjct: 218 GTCNAKADGNHAGSIKGYE-DVPANSESALLKAVANQ-PVSVAIEASGFKFQFYSGGVFT 275

Query: 291 YNCDGSLANINHAVQIVGY---DNYSRTW 316
            +C     N++H V  VGY   D+ ++ W
Sbjct: 276 GSCG---TNLDHGVTSVGYGVGDDGTKYW 301


>gi|401416322|ref|XP_003872656.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322488880|emb|CBZ24130.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 366

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 88/280 (31%), Positives = 136/280 (48%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 37  LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS V   E    L    L  LS Q+++ C  + N GCSGG      DW+ 
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCSGGLMLQAFDWLL 201

Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N    L  E  YP +  +     C   +    G +I  +    +  SE ++   +A +G
Sbjct: 202 QNTNGHLYTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVL--IGSSEKAMAAWLAKNG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  A++A ++  Y  GV+   C G    +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296


>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
          Length = 448

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 89/312 (28%), Positives = 149/312 (47%), Gaps = 42/312 (13%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDI 66
           L +V  +    +A P+ ++      K  LF +F+ ++ K Y  +E + R F  F +++D 
Sbjct: 5   LVLVCALVGAAMAEPLSLTV----NKGRLFDAFKTKFNKVYESAEEEARRFSVFSQNIDF 60

Query: 67  IEELNKNRQSPESAR------YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           I     NR + E+AR        + +F+DL+ EE++  +LR    +  L+   +      
Sbjct: 61  I-----NRHNAEAARGVHTHTVDVNQFADLTNEEYRQLYLRPYPTE--LLGRERQE---- 109

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                      +  P    V  DWR+ G +  ++NQ  CG+CW+FST  + E  HA+  G
Sbjct: 110 ---------VWLDGPNAGSV--DWRQKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATG 158

Query: 181 TLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q+++DC+G+ GN GC+GG       ++ ++   L+ E +YP   +D  C +  
Sbjct: 159 NLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYI-ISNGGLDTEQDYPYTARDGVCDKSK 217

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSL 297
            S + V I  Y  D    +E  +   +   GPV  A+ A   ++Q Y  GV    C    
Sbjct: 218 ESKHAVSISGYK-DVPQNNEDQLAAAV-EKGPVSVAIEADQQSFQMYSSGVFSGPCG--- 272

Query: 298 ANINHAVQIVGY 309
            N++H V +VGY
Sbjct: 273 TNLDHGVLVVGY 284


>gi|124484383|dbj|BAF46302.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 369

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 91/287 (31%), Positives = 143/287 (49%), Gaps = 36/287 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F+ +Y KSY ++ EHD R   F+ +L       +++    SA +G+T+FSDL+ +EF
Sbjct: 47  FTLFKSKYGKSYATQEEHDYRLSVFKANL---RRAKRHQLLDPSAVHGVTKFSDLTPKEF 103

Query: 96  KTRHL---RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIG 151
           +   L   + S  K  L       D H   +          +PT  +P   DWR+ G + 
Sbjct: 104 RRTFLGIRKSSSGKRKL---KLPADAHAAEI----------LPTSDLPSDFDWRDYGAVT 150

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSG 203
            V++Q +CG+CW+FST    E  + L  G L  LS Q+++DC        AG  + GC+G
Sbjct: 151 GVKDQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHLCDPEEAGACDSGCNG 210

Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
           G      +++ +    LE E +YP   KD  CK    S     + +++  +L   E  I 
Sbjct: 211 GLMTTAYEYV-LQSGGLEKEKDYPYTGKDGTCKFD-KSKIAAAVANFSVVSL--DEDQIA 266

Query: 264 TDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            ++  HGP+   +NA+  Q Y+GGV   Y C  S  N++H V +VGY
Sbjct: 267 ANLVKHGPLSVGINAVFMQTYIGGVSCPYIC--SKRNLDHGVLLVGY 311


>gi|118365750|ref|XP_001016095.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297862|gb|EAR95850.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 335

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 95/311 (30%), Positives = 157/311 (50%), Gaps = 28/311 (9%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN--FEKSL 64
           +L I+ L+ LC LA  + V      +KL  ++ +  ++++ Y  +EH+  F+   F ++L
Sbjct: 6   LLSIIMLMPLC-LAQDISV------EKLLAYNKWSSQHQRVY-LNEHEKLFRQMVFFENL 57

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHS-VNKHVLMSHHKHHDHHHNHV 123
             I+E N +  +  S    + +FSD+++EEF  + L  S +  H++    +   H+  + 
Sbjct: 58  QKIQEHNSDSNNTYSVH--LNQFSDMTKEEFAEKILMKSDLVDHLMKGISQEATHNDTNK 115

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
           + +  + G+++   I    DWR  G +  V+NQ  CG+CW+FS     ES + +KN  L 
Sbjct: 116 ETQLNSKGLSLADSI----DWRTKGAVTSVKNQGNCGSCWSFSAAAVMESFNFIKNKALV 171

Query: 184 LLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
             S Q+++DC     G  + GCSGG   + LD+   +KV +    +YP +     C    
Sbjct: 172 DFSEQQLVDCVIPANGYNSYGCSGGWPASCLDY--ASKVGITTLDKYPYVAVQKNCNVTG 229

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
           T+ NG K  S+     IP+ S+ L       PV   V+A TW  Y  G+    CD S  +
Sbjct: 230 TN-NGFKPISW---IQIPNTSNDLKSALNFSPVSVVVDASTWGSYYSGIFN-GCDQSHIS 284

Query: 300 INHAVQIVGYD 310
           +NHAV  VGYD
Sbjct: 285 LNHAVLAVGYD 295


>gi|294885989|ref|XP_002771502.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
 gi|239875206|gb|EER03318.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
          Length = 337

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 94/303 (31%), Positives = 146/303 (48%), Gaps = 27/303 (8%)

Query: 14  IALCFL-AIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELN 71
           I++ FL A PV     +LE     F  FQ+++ KSY   E ++ R   F  +L+ IEE+N
Sbjct: 4   ISVVFLLAFPV-CKAVDLEAAGLAFIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVN 62

Query: 72  KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
                  S + G+ E++DL+ EEF    L          S           V     TT 
Sbjct: 63  AQNL---SYKLGVNEYTDLTLEEFAALKLS---------STDMSEGMGDGFVAGAGPTT- 109

Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
            T+PT +    DWR+ G++  V++Q  CG+CWAFS +   E  +A+  G L  LS Q+++
Sbjct: 110 TTLPTSV----DWRKKGVLNPVKDQGYCGSCWAFSAIGALEPRYAIATGKLLSLSEQQLV 165

Query: 192 DCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS-PNGVKIKS 249
           DCAG  GN GC+GG      +++    V  + ES YP +  D  C+    +  +G+ +  
Sbjct: 166 DCAGAYGNEGCNGGLMDKAFEYIKATGV--DKESTYPYVGSDETCQATVENKTDGLPVGE 223

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVI-QYNCDGSLANINHAVQI 306
            T + ++      L +     PV  A+  N  ++Q+Y  GV    NC+    +I+H V  
Sbjct: 224 VTGNQMLHQTEKALMEGVAAAPVSIAMYANLQSFQHYKSGVYSDPNCNAKGGSIDHGVVA 283

Query: 307 VGY 309
           VGY
Sbjct: 284 VGY 286


>gi|401430108|ref|XP_003879535.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|356491914|emb|CBZ40911.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 359

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 88/280 (31%), Positives = 137/280 (48%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 37  LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS+V   E    L    L  LS Q+++ C  + N GC GG      DW+ 
Sbjct: 143 DQGECGSCWAFSSVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201

Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N    L  E  YP +  +     C   +    G +I  +    +  SE ++   +A +G
Sbjct: 202 QNTNGHLYTEDSYPYVSGNGYLPECSNSSKLVVGAQIDGHVL--IGSSEKAMAAWLAKNG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  A++A ++  Y  GV+   C G    +NHAV +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QVNHAVLLVGYD 296


>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
          Length = 350

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 86/273 (31%), Positives = 131/273 (47%), Gaps = 34/273 (12%)

Query: 44  YKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY--GITEFSDLSEEEFKTRHLR 101
           YK +  K+    R + F+ ++  IE  N   ++    RY  G+ +F+DL+ EEFK     
Sbjct: 55  YKDAAEKAR---RLEVFKANVAFIESFNAGGKN----RYWLGVNQFADLTSEEFK----- 102

Query: 102 HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT----GIPVKKDWREAGIIGKVRNQQ 157
                   M++ K     +N V+   ++TG          +P   DWR  G + ++++Q 
Sbjct: 103 ------ATMTNSKGFSTPNNGVR---VSTGFKYENVSADALPASVDWRTKGAVTRIKDQG 153

Query: 158 TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN-MGCSGGDFCALLDWMDVN 216
            CG CWAFS V   E +  L  G L  LS QE++DC  +GN  GC GG+      ++  N
Sbjct: 154 QCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSN 213

Query: 217 KVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
              L  E+ YP   +D  CK  A +     I+ Y  D     E S++  +A   PV  AV
Sbjct: 214 G-GLTAEANYPYTAEDGRCKTTAAADVAASIRGYE-DVPANDEPSLMKAVAGQ-PVSVAV 270

Query: 277 NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A  +Q+Y GGV+   C  SL   +H V ++GY
Sbjct: 271 DASKFQFYGGGVMAGECGTSL---DHGVTVIGY 300


>gi|343476708|emb|CCD12273.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 363

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 94/318 (29%), Positives = 156/318 (49%), Gaps = 32/318 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 67  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   +T+ TG  P   DWR+ G +  VR+++ C + WAFS +   E    +  
Sbjct: 111 ALKRPRKV---VTVSTGKAPDAVDWRKKGAVTPVRDERLCDSSWAFSAIGNIEGQWKVAG 167

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR- 237
             L+ LS Q ++ C    + GC GG       W+   NK  +  E  YP    D    R 
Sbjct: 168 HELTSLSEQMLLSCDTRED-GCGGGLMDRAFQWIVSSNKGNVFTEQSYPYASTDGDVPRC 226

Query: 238 -KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
            K+    G KI  Y    L   E++I   +A +GPV  AV A + Q Y GGV+  +C   
Sbjct: 227 NKSGKVVGAKISDYV--DLPQDENAIAEWLAKNGPVAIAVEATSLQRYTGGVLT-SCISE 283

Query: 297 LANINHAVQIVGYDNYSR 314
              ++H V +VGYD+ S+
Sbjct: 284 --QLDHGVLLVGYDDTSK 299


>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
          Length = 322

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 102/314 (32%), Positives = 152/314 (48%), Gaps = 41/314 (13%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN 71
            IA C L   V VS   LE+    F +F+ ++ K+Y ++ E   RF  F+ +L  IE+ N
Sbjct: 4   FIAACLL---VAVSATVLEETGVKFQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHN 60

Query: 72  K-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
               Q   S + GI  F+D+++EEF+            L S  K H +   HV      T
Sbjct: 61  VLYEQGLVSYKKGINRFTDMTQEEFRAFL--------TLSSSKKPHFNTTEHV-----LT 107

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
           G+ +P  I    DWR  G +  V++Q  CG+CWAFS   + E+ +  K G L  LS Q++
Sbjct: 108 GLAVPDSI----DWRTKGQVTGVKDQGNCGSCWAFSVTGSTEAAYYRKAGKLVSLSEQQL 163

Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA----TSPNGVK 246
           +DC+ + N GC+GG       +  V    LE ES YP    D +CK  A    T  +G K
Sbjct: 164 VDCSTDINAGCNGGYLDETFTY--VKSKGLEAESTYPYKGTDGSCKYSASKVVTKVSGHK 221

Query: 247 -IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLANINHAV 304
            +KS         E+++L  +   GPV  A++A     Y  G+ + + C  S + +NH V
Sbjct: 222 SLKS-------EDENALLDAVGNVGPVSVAIDATYLSSYESGIYEDDWC--SPSELNHGV 272

Query: 305 QIVGY--DNYSRTW 316
            +VGY   N  + W
Sbjct: 273 LVVGYGTSNGKKYW 286


>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
          Length = 347

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 97/323 (30%), Positives = 158/323 (48%), Gaps = 36/323 (11%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--------SEHDIRFK 58
           +  IV+LI+   L+I +  S+P  + +L +    Q+R+ +  +K         E + R+ 
Sbjct: 8   IFLIVSLISSFCLSITL--SRPLDDNELIM----QKRHDEWMAKHGRVYADMKEKNNRYV 61

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
            F+++++ IE LN N  +  + +  + +F+DL+ +EF++ +  +     VL S       
Sbjct: 62  VFKRNVERIERLN-NVPAGRTFKLAVNQFADLTNDEFRSMYTGYK-GGSVLSSQSGTKTS 119

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
              +   +++++G      +PV  DWR+ G +  ++NQ TCG CWAFS V   E    +K
Sbjct: 120 SFRY---QNVSSG-----ALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIK 171

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L  LS Q+++DC  N + GCSGG      + + +    L  ES YP   KDA CK K
Sbjct: 172 KGKLISLSEQQLVDCDTN-DFGCSGGLMDTAFEHI-MATGGLTTESNYPYKGKDATCKIK 229

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGS 296
            T P    I  Y  D  +  E +++  +A H PV   +      +Q+Y  GV    C   
Sbjct: 230 NTKPTATSITGYE-DVPVNDEKALMKAVA-HQPVSIGIEGGGFDFQFYGSGVFTGECTTY 287

Query: 297 LANINHAVQIVGY---DNYSRTW 316
           L   +HAV  VGY    N S+ W
Sbjct: 288 L---DHAVTAVGYGQSSNGSKYW 307


>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
          Length = 343

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 98/314 (31%), Positives = 150/314 (47%), Gaps = 34/314 (10%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ ++L  +  +   F +     S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMSILITLFFVISMFNSQTTARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F++++  IE +NK      S + GI EF+D++ EEF T+    ++  ++  S     +  
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGINEFADITSEEFLTKFTGINIPSYLSPSPMSSTEFK 120

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            N +    +          P   DWRE+G + +V+NQ  CG CWAFS V + E  + +  
Sbjct: 121 INDLSDDDM----------PSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIAT 170

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
           G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y    +   C+ + 
Sbjct: 171 GNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SSESDYEYQGQQYTCRSQE 228

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDGS 296
            +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DGS
Sbjct: 229 KTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DGS 278

Query: 297 LAN-INHAVQIVGY 309
            A+ INHAV  +GY
Sbjct: 279 CADRINHAVTAIGY 292


>gi|55979119|gb|AAV69023.1| cysteine protease [Opisthorchis viverrini]
 gi|224923980|gb|ACN68966.1| cathepsin F-like cysteine protease [Opisthorchis viverrini]
          Length = 326

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 89/308 (28%), Positives = 150/308 (48%), Gaps = 38/308 (12%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
           F+  ++     ++  + +    +    L+  F+ +YKK+YS  + ++RF+ F+ +L+  +
Sbjct: 4   FVCCVLVTTIWSVFARTTPFEPDDARALYEEFKLKYKKTYSNDDDELRFRIFKDNLERAK 63

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            L    Q   +A YG+T+FSDL+ EEFKTR+LR   ++ ++         + +   +  +
Sbjct: 64  RLQAMEQG--TAEYGVTQFSDLTSEEFKTRYLRMRFDEPIV---------NEDPTPQEDV 112

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
           T   +         DWR+ G +G V +Q  CG+CWAFS +   E     K G L  LS Q
Sbjct: 113 TMDNS-------NFDWRDHGAVGPVLDQGDCGSCWAFSVIGNVEGQWFRKTGDLLGLSEQ 165

Query: 189 EVIDCAGNGNMGCSGG----DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           ++IDC  + + GC GG     + A+ +        LE  S+YP   KD  C    +    
Sbjct: 166 QLIDCD-HSDQGCDGGYPPQTYSAIEEMGG-----LELRSDYPYTGKDGICYMDQS---- 215

Query: 245 VKIKSYT-CDTLIP-SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLANIN 301
            K  +Y    T +P  E +    +   GP+ + +NA+  Q Y  G+++   C+   A +N
Sbjct: 216 -KFVAYVNGSTRLPWCEKTQAKSLKEIGPLSSGLNAVLLQLYKRGIMRPRWCN--PAELN 272

Query: 302 HAVQIVGY 309
           HAV  VGY
Sbjct: 273 HAVLTVGY 280


>gi|340053963|emb|CCC48256.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 452

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 86/284 (30%), Positives = 136/284 (47%), Gaps = 29/284 (10%)

Query: 36  LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF++F+Q+Y +SY + +E   R + FE   D +        +   A +G+T FSDL+ EE
Sbjct: 33  LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 89

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
           F+TR+               H+   H    +  + T + +P G  P   DWR  G +  V
Sbjct: 90  FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPV 134

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           ++Q +CG+CW+FS +   E   A     L+ LS Q ++ C    N GC GG      +W+
Sbjct: 135 KDQGSCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDSKDN-GCGGGFMDNAFEWI 193

Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
              N   +  E  YP +      +     P G ++  + T    IP  E +I   +A +G
Sbjct: 194 VKENSGKVYTEKSYPYV--SGGGEEPPCKPRGHEVGATITGHVDIPHDEDAIAKYLADNG 251

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
           PV  AV+A T+  Y GGV+  +C      +NH V +VGY++ S+
Sbjct: 252 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 292


>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
          Length = 294

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 87/279 (31%), Positives = 138/279 (49%), Gaps = 31/279 (11%)

Query: 40  FQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSEEEFKT 97
           F+  Y KSY S++    R   FE +L+ I + N ++ Q   S   G+ EF+DL+ +EF  
Sbjct: 1   FKSDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMA 60

Query: 98  RHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQ 157
            ++    N+   M ++                  + +P       DWR  G +  ++NQ 
Sbjct: 61  LYVPSKFNR--TMPYNT-----------------VYLPATSEDSVDWRTKGAVTPIKNQG 101

Query: 158 TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVN 216
            CG+CW+FST  + E  HA+  G L  LS Q+++DC+G+ GN GC+GG       ++  N
Sbjct: 102 QCGSCWSFSTTGSTEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISN 161

Query: 217 KVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
           K  L+ E +YP   +D  C ++  + +   I SY+ D    +E  +   +A  GPV  A+
Sbjct: 162 K-GLDTEEDYPYTAQDGTCNKEKEAKHAATISSYS-DVPKNNEDQLAAAVA-KGPVSVAI 218

Query: 277 NA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY-DNY 312
            A    +Q Y  GV   NC     N++H V +VGY D+Y
Sbjct: 219 EADQSGFQLYKSGVFDGNCG---TNLDHGVLVVGYTDDY 254


>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 141/313 (45%), Gaps = 29/313 (9%)

Query: 4   VKNVLFIVALIAL-C--FLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKN 59
            KN  + ++L  L C  FLA  V           E    +  RY K Y    E + RFK 
Sbjct: 3   AKNQFYQISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F+++++ IE  N     P +   GI +F+DL+ EEF     R+    H+  S  +     
Sbjct: 63  FKENVNYIEAFNNAANKPYT--LGINQFADLTNEEFIAP--RNRFKGHMCSSITRTTTFK 118

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
           + +V            T IP   DWR+ G +  +++Q  CG CWAFS V   E +HAL  
Sbjct: 119 YENV------------TAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSA 166

Query: 180 GTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
           G L  LS QEV+DC   G + GC+GG       ++  N   L  E  YP    D  C  K
Sbjct: 167 GKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNH-GLNNEPNYPYKAVDGKCNAK 225

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
           A + +   I  Y  D  + +E ++   +A   PV  A++A    +Q+Y  GV   +C   
Sbjct: 226 AAANHVATITGYE-DVPVNNEKALQKAVANQ-PVSVAIDASGSDFQFYQSGVFTGSCGTE 283

Query: 297 LANINHAVQIVGY 309
           L   +H V  VGY
Sbjct: 284 L---DHGVTAVGY 293


>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
 gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
          Length = 350

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 86/273 (31%), Positives = 130/273 (47%), Gaps = 34/273 (12%)

Query: 44  YKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY--GITEFSDLSEEEFKTRHLR 101
           YK +  K+    R + F+ ++  IE  N   ++    RY  G+ +F+DL+ EEFK     
Sbjct: 55  YKDAAEKAR---RLEVFKANVAFIESFNAGGKN----RYWLGVNQFADLTSEEFK----- 102

Query: 102 HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT----GIPVKKDWREAGIIGKVRNQQ 157
                   M++ K     +N V+   ++TG          +P   DWR  G + ++++Q 
Sbjct: 103 ------ATMTNSKGFSTPNNGVR---VSTGFKYENVSADALPASVDWRTKGAVTRIKDQG 153

Query: 158 TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN-MGCSGGDFCALLDWMDVN 216
            CG CWAFS V   E    L  G L  LS QE++DC  +GN  GC GG+      ++  N
Sbjct: 154 QCGCCWAFSAVAAMEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSN 213

Query: 217 KVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
              L  E+ YP   +D  CK  A +     I+ Y  D     E S++  +A   PV  AV
Sbjct: 214 G-GLTAEANYPYTAEDGRCKTTAAADVAASIRGYE-DVPANDEPSLMKAVAGQ-PVSVAV 270

Query: 277 NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A  +Q+Y GGV+   C  SL   +H V ++GY
Sbjct: 271 DASKFQFYGGGVMAGECGTSL---DHGVTVIGY 300


>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like
           [Strongylocentrotus purpuratus]
          Length = 453

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 84/282 (29%), Positives = 139/282 (49%), Gaps = 39/282 (13%)

Query: 35  ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           +   +F++ Y+++   +E++ R+  F +++  +E  N+  Q   +A+YG T+F+D++E E
Sbjct: 158 KFLMTFKREYRQNDGTNEYEYRYSVFVQNMLTVEMFNQFEQG--TAKYGPTKFADMTEAE 215

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
           F  R L+                     +KK  I     IP G +P + DWR  G +  V
Sbjct: 216 F--RKLQS------------------GPLKKTGIKKQAAIPQGPVPEEYDWRTHGAVTPV 255

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           +NQ  CG+CWAFS +   E    +K G L  LS QE++DC    + GC GG+        
Sbjct: 256 KNQGMCGSCWAFSAIGNMEGQWQIKKGELISLSEQELVDCD-KVDGGCEGGEMS------ 308

Query: 214 DVNKVVLE-----PESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
           D  + +++      E +YP   ++  CK   T    VKI  Y    +  +E+ +   +A 
Sbjct: 309 DAYEAIIKLGGAMSEEKYPYRGENEKCKFNMTDVR-VKINGYV--NISKNETEMAGWLAA 365

Query: 269 HGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
           HGP+   +NAL  Q+Y GG+   +    S  +++H V IVGY
Sbjct: 366 HGPISIGINALMMQFYFGGIAHPWKIFCSPDSLDHGVLIVGY 407


>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
 gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 343

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 140/313 (44%), Gaps = 29/313 (9%)

Query: 4   VKNVLFIVALIAL---CFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKN 59
            KN  + ++L  L    FLA  V           E    +  RY K Y    E + RFK 
Sbjct: 3   AKNQFYQISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F+++++ IE  N     P +   GI +F+DL+ EEF     R+    H+  S  +     
Sbjct: 63  FKENVNYIEAFNNAANKPYT--LGINQFADLTNEEFIAP--RNRFKGHMCSSITRTTTFK 118

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
           + +V            T IP   DWR+ G +  +++Q  CG CWAFS V   E +HAL  
Sbjct: 119 YENV------------TAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSA 166

Query: 180 GTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
           G L  LS QEV+DC   G + GC+GG       ++  N   L  E  YP    D  C  K
Sbjct: 167 GKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNH-GLNNEPNYPYKAVDGKCNAK 225

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
           A + +   I  Y  D  + +E ++   +A   PV  A++A    +Q+Y  GV   +C   
Sbjct: 226 AAANHVATITGYE-DVPVNNEKALQKAVANQ-PVSVAIDASGSDFQFYQSGVFTGSCGTE 283

Query: 297 LANINHAVQIVGY 309
           L   +H V  VGY
Sbjct: 284 L---DHGVTAVGY 293


>gi|12024965|gb|AAG45727.1| cathepsin L-like cysteine protease [Leishmania chagasi]
          Length = 381

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 85/270 (31%), Positives = 133/270 (49%), Gaps = 26/270 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           NQ  CG+CWAFS V   ES  A     L  LS Q+++ C    N GC+GG      +W+ 
Sbjct: 143 NQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201

Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
            +   ++  E  YP    +   A C   +    G +I  Y    +IPS  +++   +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWLAEN 258

Query: 270 GPVIAAVNALTWQYYLGGV--IQYNCDGSL 297
           GP+  AV+A ++  Y  GV  + YN  G +
Sbjct: 259 GPIAIAVDASSFMSYQSGVLLVGYNKTGGV 288


>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 86/279 (30%), Positives = 136/279 (48%), Gaps = 29/279 (10%)

Query: 35  ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           ++F++F ++Y K+YS +E   RF  F+ +++ I     N  +  S   G+ EF+DLS EE
Sbjct: 40  DMFTAFMKQYSKAYSHAEFSSRFNQFKANVETIRL--HNTLANASYTMGLNEFADLSFEE 97

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           FK ++  +   KHV     + ++ H                   P   DWR +  +  ++
Sbjct: 98  FKGKYFGY---KHVEREFARSNNLHQE-------------VEAAPTSIDWRTSNAVTPIK 141

Query: 155 NQQTCGACWAFSTVETAESMHALKNG-TLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDW 212
           +Q  CG+CWAFS   + E    L+   TL+ LS Q+++DC+ + GN GC+GG      ++
Sbjct: 142 DQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEY 201

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           +  NK +   ES YP       C++  T    V I  Y  D     E+S+L  + T GPV
Sbjct: 202 IIANKGIC-AESAYPYKGVGGLCQKSCTKV--VTISGYK-DVASGDEASLLNAVGTVGPV 257

Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A+ A    +Q+Y  GV    C     N++H V  VGY
Sbjct: 258 SVAIEADQAGFQFYSSGVFSGTCG---HNLDHGVLAVGY 293


>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
          Length = 345

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 88/287 (30%), Positives = 144/287 (50%), Gaps = 33/287 (11%)

Query: 29  NLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++ +ELF S+  ++ K Y S  E  +RF+ F+ +L  I+E NK      +   G+ EF
Sbjct: 39  SMDKLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNK---VVSNYWLGLNEF 95

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP-TGIPVKKDWRE 146
           +DLS +EFK ++L   V+                + ++R      T     +P   DWR+
Sbjct: 96  ADLSHQEFKNKYLGLKVD----------------YSRRRESPEEFTYKDVELPKSVDWRK 139

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
            G +  V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    + GC+GG  
Sbjct: 140 KGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYSNGCNGG-- 197

Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
             L+D+     V    L  E +YP ++++  C+        V I  Y  D    +E S+L
Sbjct: 198 --LMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVVTISGYH-DVPQNNEQSLL 254

Query: 264 TDIATHGPVIA-AVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             +A     +A   +   +Q+Y GGV   +C    ++++H V  VGY
Sbjct: 255 KALANQSLSVAIEASGRDFQFYSGGVFDGHCG---SDLDHGVAAVGY 298


>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
          Length = 325

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 90/307 (29%), Positives = 148/307 (48%), Gaps = 41/307 (13%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           +V ++   F    V+V     +   EL+  F++ Y K+Y+  +   RF  F+ +L   ++
Sbjct: 9   LVVVVGCSFAVNTVRVP----DNARELYEQFKRDYGKAYANEDDQKRFAIFKDNLVRAQQ 64

Query: 70  LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
                Q   +A+YG+T+FSDL+ EEF+ ++L   +++ V            + V+   + 
Sbjct: 65  YQMQEQG--TAKYGVTQFSDLTPEEFEAKYLGLRIDEQV------------DRVQLNDLQ 110

Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
           T        P   DWRE G +G + NQ +CG+CWAFS V   E    LK G L  LS Q+
Sbjct: 111 TA-------PASVDWREKGAVGPIENQGSCGSCWAFSVVGNIEGQWFLKTGYLVSLSKQQ 163

Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIK 248
           ++DC    N GC GG       + ++ ++  LE +S+YP       C+   +     K+ 
Sbjct: 164 LVDCDTVDN-GCYGG--YPPYTYKEIKRMGGLELQSDYPYTGWGHGCRLDRS-----KLF 215

Query: 249 SYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN---CDGSLANINHA 303
           +   D+++    E      +A HGP+   +NA   Q+Y  G++  +   C  S   +NHA
Sbjct: 216 AKIDDSIVLEADEEKQAAWLAEHGPMSTCLNAKYLQFYQSGILHPSKAMC--SPEGLNHA 273

Query: 304 VQIVGYD 310
           V  VGYD
Sbjct: 274 VLTVGYD 280


>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
          Length = 344

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 101/325 (31%), Positives = 155/325 (47%), Gaps = 38/325 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ ++L  +  +   F +     S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ EEF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V+NQ  CG CWAFS V + E  + + 
Sbjct: 121 KINDISDDDM----------PSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIRENGGI-SRESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY---DNYSRTW 316
           S AN INHAV  +GY   +N  + W
Sbjct: 279 SCANRINHAVTAIGYGTDENGQKYW 303


>gi|340053968|emb|CCC48262.1| cysteine peptidase, Clan CA, family C1,Cathepsin L-like, fragment,
           partial [Trypanosoma vivax Y486]
          Length = 323

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 138/284 (48%), Gaps = 29/284 (10%)

Query: 36  LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF++F+Q+Y +SY + +E   R + FE   D +        +   A +G+T FSDL+ EE
Sbjct: 33  LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 89

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
           F+TR+               H+   H    +  + T + +P G  P   DWR  G +  V
Sbjct: 90  FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPV 134

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           ++Q  CG+CW+FS +   E   A     L+ LS Q ++ C    N GC GG      +W+
Sbjct: 135 KDQGRCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDFKDN-GCGGGFMDNAFEWI 193

Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
              N   +  E  YP + +D +  +    P G ++  + T    IP  E +I   +A +G
Sbjct: 194 VKENSGKVYTEKSYPYVSEDGS--KPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNG 251

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
           PV  AV+A T+  Y GGV+  +C      +NH V +VGY++ S+
Sbjct: 252 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 292


>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 369

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 92/284 (32%), Positives = 137/284 (48%), Gaps = 37/284 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F SF + + K Y +  E++ RFK F+ +L  +  L      P +A +G+T FSDL+EEEF
Sbjct: 56  FESFIKEFGKVYHTVEEYEHRFKVFKSNL--LRALKHQALDP-TASHGVTMFSDLTEEEF 112

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
            T++L   + +   +S               +  T   +PTG +P   DWRE G +G V+
Sbjct: 113 ATQYL--GLKRPSALS---------------TAPTAEPLPTGDLPPSFDWREKGAVGPVK 155

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CWAFST    E  H L  G L  LS Q+++DC        A   + GC GG  
Sbjct: 156 NQGSCGSCWAFSTTGAVEGAHFLATGKLLSLSEQQLVDCDHQCDPEEAQACDAGCGGGLM 215

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
                +++     LE ES+YP   +D  C+    +PN V  K      +   E  +   +
Sbjct: 216 TNAYKYVE-EAGGLELESDYPYKGRDGKCQ---FNPNKVAAKVSNFTNIPIDEDQVAAYL 271

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
              GP+   +NA   Q Y+ GV     C+    N++H V +VGY
Sbjct: 272 IKSGPLAIGINAEFMQTYVAGVSCPIFCNKR--NLDHGVLLVGY 313


>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 101/333 (30%), Positives = 148/333 (44%), Gaps = 54/333 (16%)

Query: 5   KNVLFIVALIALC----FLAIPVKV----SKPNLEQKLELFSSFQQRYKKSYSKSEHDIR 56
           K VLF    +ALC    F A           P  E+  +  +   + Y  SY K +   +
Sbjct: 4   KKVLFQYFTLALCLVFAFCAFEGNARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQ---K 60

Query: 57  FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKT--RHLRHSVNKHVLMSHHK 114
           ++ F++++  IE  N     P   + GI  F+DL+ EEFK   R   H  +K       +
Sbjct: 61  YQTFKENVQRIEAFNHAGNKP--YKLGINHFADLTNEEFKAINRFKGHVCSKITRTPTFR 118

Query: 115 HHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
           + +                  T +P   DWR+ G +  +++Q  CG CWAFS V   E +
Sbjct: 119 YENM-----------------TAVPATLDWRQEGAVTPIKDQGQCGCCWAFSAVAATEGI 161

Query: 175 HALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLE-----PESEYPL 228
             L  G L  LS QE++DC   G + GC GG    L+D  D  K +L+      E+ YP 
Sbjct: 162 TKLSTGKLISLSEQELVDCDTKGVDQGCEGG----LMD--DAFKFILQNKGLAAEAIYPY 215

Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLG 286
              D  C  KA   +   IK Y  D    SES++L  +A   PV  A+ A    +Q+Y G
Sbjct: 216 EGVDGTCNAKAEGNHATSIKGYE-DVPANSESALLKAVANQ-PVSVAIEASGFEFQFYSG 273

Query: 287 GVIQYNCDGSLANINHAVQIVGY---DNYSRTW 316
           GV   +C     N++H V  VGY   D+ ++ W
Sbjct: 274 GVFTGSCG---TNLDHGVTAVGYGVSDDGTKYW 303


>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
          Length = 443

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 88/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 37  LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS V   E    L    L  LS Q+++ C    N GCSGG      DW+ 
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMDN-GCSGGLMLQAFDWLL 201

Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N    L  E  YP +  +     C   +    G +I  +    +  SE ++   +A +G
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  A++A ++  Y  GV+   C G    +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296


>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
          Length = 363

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 89/283 (31%), Positives = 142/283 (50%), Gaps = 34/283 (12%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+SF+ ++ KSYS K EHD RF  F+ +L  I+     +  P +A +GIT+FSDL+  EF
Sbjct: 48  FTSFKSKFSKSYSTKEEHDYRFGVFKSNL--IKAKLHQKLDP-TAEHGITKFSDLTASEF 104

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           + + L   + K + +  H                  I   T +P   DWRE G +  V++
Sbjct: 105 RRQFL--GLKKRLRLPAHAQK-------------APILPTTNLPEDFDWREKGAVTPVKD 149

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
           Q +CG+CWAFST    E  H L  G L  LS Q+++DC        AG+ + GC+GG   
Sbjct: 150 QGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMN 209

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
              +++  +  V++ E +Y    +D +CK    S     + +++  +L   E  I  ++ 
Sbjct: 210 NAFEYLLQSGGVVQ-EKDYAYTGRDGSCKFD-KSKVVASVSNFSVVSL--DEEQIAANLV 265

Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            +GP+   +NA   Q Y+ GV   Y C  S   ++H V +VG+
Sbjct: 266 KNGPLAVGINAAWMQTYMSGVSCPYVCAKS--RLDHGVLLVGF 306


>gi|343417244|emb|CCD20093.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 454

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 135/284 (47%), Gaps = 29/284 (10%)

Query: 36  LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF++F+Q+Y +SY + +E   R + FE   D +        +   A +G+T FSDL+ EE
Sbjct: 33  LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 89

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
           F+TR+               H+   H    +  + T + +P G  P   DW   G +  V
Sbjct: 90  FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWGRKGAVTPV 134

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           ++Q TCG+CW+FS +   E   A     L+ LS Q ++ C    N GC GG      +W+
Sbjct: 135 KDQGTCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDTKDN-GCGGGLMDNAFEWI 193

Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
              N   +  E  YP +      +     P G K+  + T    IP  E +I   +A +G
Sbjct: 194 VKENSGKVYTEKSYPYV--SGGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNG 251

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
           PV  AV+A T+  Y GGV+  +C      +NH V +VGY++ S+
Sbjct: 252 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 292


>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
          Length = 363

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 89/283 (31%), Positives = 142/283 (50%), Gaps = 34/283 (12%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+SF+ ++ KSYS K EHD RF  F+ +L  I+     +  P +A +GIT+FSDL+  EF
Sbjct: 48  FTSFKSKFSKSYSTKEEHDYRFGVFKSNL--IKAKLHQKLDP-TAEHGITKFSDLTASEF 104

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           + + L   + K + +  H                  I   T +P   DWRE G +  V++
Sbjct: 105 RRQFL--GLKKRLRLPAHAQK-------------APILPTTNLPEDFDWREKGAVTPVKD 149

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
           Q +CG+CWAFST    E  H L  G L  LS Q+++DC        AG+ + GC+GG   
Sbjct: 150 QGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMN 209

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
              +++  +  V++ E +Y    +D +CK    S     + +++  +L   E  I  ++ 
Sbjct: 210 NAFEYLLQSGGVVQ-EKDYAYTGRDGSCKFD-KSKVVASVSNFSVVSL--DEEQIAANLV 265

Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            +GP+   +NA   Q Y+ GV   Y C  S   ++H V +VG+
Sbjct: 266 KNGPLAVGINAAWMQTYMSGVSCPYVCAKS--RLDHGVLLVGF 306


>gi|1617037|emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium discoideum]
          Length = 343

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 89/313 (28%), Positives = 151/313 (48%), Gaps = 35/313 (11%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           ++ L  L    + V       E++ + F  FQ ++ K YS  E+  RF+ F+ +L  IEE
Sbjct: 3   VILLFVLAVFTVFVSSRGIPPEEQSQ-FLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61

Query: 70  LNK---NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           LN    N ++    ++G+ +F+DLS +EFK  +L    NK  + +         +++   
Sbjct: 62  LNLIAINHKA--DTKFGVNKFADLSSDEFKNYYLN---NKEAIFTDDLPV---ADYLDDE 113

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
            I +       IP   DWR  G +  V+NQ  CG+CW+FST    E  H +    L  LS
Sbjct: 114 FINS-------IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLS 166

Query: 187 VQEVIDC---------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
            Q ++DC             + GC+GG      +++  N  + + ES YP   +      
Sbjct: 167 EQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGI-QTESSYPYTAETGTQCN 225

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTD-IATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
             ++  G KI ++   T+IP   +++   I + GP+  A +A+ WQ+Y+GGV    C+ +
Sbjct: 226 FNSANIGAKISNF---TMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 282

Query: 297 LANINHAVQIVGY 309
             +++H + IVGY
Sbjct: 283 --SLDHGILIVGY 293


>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 86/280 (30%), Positives = 141/280 (50%), Gaps = 25/280 (8%)

Query: 34  LELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           +ELF ++   ++K+Y   E  + RF+ F+ +L  I+E NK  +S      G+ EF+DLS 
Sbjct: 48  IELFENWISNFEKAYETVEEKLLRFEVFKDNLKHIDETNKKVKS---YWLGLNEFADLSH 104

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEFK  +L   +   ++    +  +  +     R +         +P   DWR+ G + +
Sbjct: 105 EEFKKMYL--GLKTDIV---RRDEERSYAEFAYRDVEA-------VPKSVDWRKKGAVAE 152

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG      ++
Sbjct: 153 VKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEY 212

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           + V    L  E +YP  +++  C+ +      V I  +  D     E S+L  +A H P+
Sbjct: 213 I-VKNGGLRKEEDYPYSMEEGTCEMQKDESETVTIDGHQ-DVPTNDEKSLLKALA-HQPL 269

Query: 273 IAAVNA--LTWQYYLG-GVIQYNCDGSLANINHAVQIVGY 309
             A++A    +Q+Y G  V    C     +++H V  VGY
Sbjct: 270 SVAIDASGREFQFYSGVSVFDGRCG---VDLDHGVAAVGY 306


>gi|294885991|ref|XP_002771503.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
 gi|239875207|gb|EER03319.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
          Length = 337

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 95/303 (31%), Positives = 146/303 (48%), Gaps = 27/303 (8%)

Query: 14  IALCFL-AIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN 71
           I++ FL A PV     +LE     F  FQ+++ KSY +K E   R   F  +L+ IEE+N
Sbjct: 4   ISVVFLLAFPV-YKAVDLETSSLAFIGFQKKHGKSYDNKDEEMKRAAIFHDNLNYIEEVN 62

Query: 72  KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
                  S + G+ E++DL+ EEF    L          S           V     TT 
Sbjct: 63  AQNL---SYKLGVNEYTDLTLEEFAALKLS---------STDMSEGMGDGFVAGAGPTT- 109

Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
            T+PT +    DWR+ G++  V++Q  CG+CWAFS +   E  +A+  G L  LS Q+++
Sbjct: 110 TTLPTSV----DWRKKGVLNPVKDQGYCGSCWAFSAIGALEPRYAIATGKLLSLSEQQLV 165

Query: 192 DCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS-PNGVKIKS 249
           DCAG  GN GC+GG      +++    V  + ES YP +  D  C+    +  +G+ +  
Sbjct: 166 DCAGAYGNEGCNGGLMDKAFEYIKATGV--DKESTYPYVGSDETCQATVENKTDGLPVGE 223

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVI-QYNCDGSLANINHAVQI 306
            T + ++      L +     PV  A+  N  ++Q+Y  GV    NC+    +I+H V  
Sbjct: 224 VTGNQMLHQTEKALMEGVAAAPVSIAMYANLQSFQHYKSGVYSDPNCNAKGGSIDHGVVA 283

Query: 307 VGY 309
           VGY
Sbjct: 284 VGY 286


>gi|340053971|emb|CCC48265.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
           Y486]
          Length = 389

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 138/284 (48%), Gaps = 29/284 (10%)

Query: 36  LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF++F+Q+Y +SY + +E   R + FE   D +        +   A +G+T FSDL+ EE
Sbjct: 33  LFAAFKQKYGRSYGTAAEEAFRLRVFE---DNMRRSRMYAAANPHATFGVTPFSDLTPEE 89

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKV 153
           F+TR+               H+   H    +  + T + +P G  P   DWR  G +  V
Sbjct: 90  FRTRY---------------HNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPV 134

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           ++Q TCG+CW+FS +   E   A     L+ LS Q ++ C    N GC GG      +W+
Sbjct: 135 KDQGTCGSCWSFSAIGNIEGQWAAAGNPLTSLSEQMLVSCDFKDN-GCGGGFMDNAFEWI 193

Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI-KSYTCDTLIP-SESSILTDIATHG 270
              N   +     YP + +D +  +    P G ++  + T    IP  E +I   +A +G
Sbjct: 194 VKENSGKVYTGKSYPYVSEDGS--KPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNG 251

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
           PV  AV+A T+  Y GGV+  +C      +NH V +VGY++ S+
Sbjct: 252 PVAVAVDATTFMSYSGGVVT-SCTSEA--LNHGVLLVGYNDSSK 292


>gi|394331830|gb|AFN27134.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 88/283 (31%), Positives = 138/283 (48%), Gaps = 31/283 (10%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWR+ G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGALTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSG----GDFCALL 210
           NQ  CG+CWAFS V + +S  AL    L+ LS Q+++ C    N GC G      F  +L
Sbjct: 143 NQGACGSCWAFSAVGSIQSQWALAGHRLTALSEQQLVSCHDKDN-GCPGRLMLQAFVGVL 201

Query: 211 DWMDVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
             M+        E  YP +        C   +    G +I  Y   T+  S + +   +A
Sbjct: 202 QNMNGTMFT---EDSYPYVSSTGYVPECSNSSQLVPGARIDGYM--TMESSGTVMAACLA 256

Query: 268 THGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
            +GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 257 KNGPISIAVDASSFMSYQSGVLT-SCAG--MPLNHGVLLVGYN 296


>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
          Length = 366

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 94/286 (32%), Positives = 141/286 (49%), Gaps = 42/286 (14%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F +RY K YS   EH+ RF  F+ +L  +  L   +  P  A +G+T+FSDL++E F
Sbjct: 57  FRHFIRRYGKKYSGPEEHEHRFGVFKSNL--LRALEHQKLDPR-ASHGVTKFSDLTQEGF 113

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + ++L         +      D H   +          +PT  +P   DWRE G + +V+
Sbjct: 114 RHQYLG--------LRAPPLRDAHDAPI----------LPTNDLPEDFDWREKGAVTEVK 155

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CWAFST    E  + LK G L  LS Q+++DC        A + + GC+GG  
Sbjct: 156 NQGSCGSCWAFSTTGALEGANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLM 215

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILT 264
            +   +  +    LE E +YP   KD  C     S N  KI ++  +  + S  E  I  
Sbjct: 216 TSAYQYA-LKSGGLEKEEDYPYTGKDGTC-----SFNKNKIVAHVSNFSVVSIDEGQIAA 269

Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           ++  +GP+   +NA   Q Y+GGV   Y C  S  N++H V +VGY
Sbjct: 270 NLVKNGPLSVGINAAFMQTYVGGVSCPYVC--SKRNLDHGVLLVGY 313


>gi|44844204|emb|CAF32698.1| cysteine proteinase [Leishmania infantum]
          Length = 443

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 91/284 (32%), Positives = 139/284 (48%), Gaps = 33/284 (11%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGG----DFCALL 210
               CG+CWAFS V   ES  A     L  LS Q+++ C    N GC+GG     F  LL
Sbjct: 143 XXGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEXLL 201

Query: 211 DWMDVNKVVLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-I 266
             M     ++  E  YP    +   A C   +    G +I  Y    +IPS  +++   +
Sbjct: 202 RHM---YGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWL 255

Query: 267 ATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           A +GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 256 AENGPIAIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 296


>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
          Length = 336

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 89/272 (32%), Positives = 138/272 (50%), Gaps = 28/272 (10%)

Query: 44  YKKSYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRH 102
           Y+K+Y+  E  +R F+ F+ +L+ I+++NK   S      G+ EF+DL+ +EFK  +L  
Sbjct: 36  YRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTS---YWLGLNEFADLTHDEFKATYL-- 90

Query: 103 SVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGAC 162
            +      S+ KH+        K S          +P + DWR+   + +V+NQ  CG+C
Sbjct: 91  GLTPPPTRSNSKHYSSEEFRYGKMSNGE-------VPKEMDWRKKNAVTEVKNQGQCGSC 143

Query: 163 WAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD---VNKVV 219
           WAFSTV   E ++A+  G L+ LS QE+IDC+ +GN GC+GG    L+D+      +   
Sbjct: 144 WAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGG----LMDYAFSYIASTGG 199

Query: 220 LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL 279
           L  E  YP  +++  C     +   V I  Y  D     E +++  +A H PV  A+ A 
Sbjct: 200 LRTEEAYPYAMEEGDCDEGKGAAV-VTISGYE-DVPANDEQALVKALA-HQPVSVAIEAS 256

Query: 280 T--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
              +Q+Y GGV    C   L   +H V  VGY
Sbjct: 257 GRHFQFYSGGVFDGPCGEQL---DHGVTAVGY 285


>gi|297824991|ref|XP_002880378.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326217|gb|EFH56637.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 360

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 92/325 (28%), Positives = 159/325 (48%), Gaps = 42/325 (12%)

Query: 1   MFDVKNVLFIVALIALC-----FLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHD 54
           +F V ++LF+   +++C      +   V  ++P +    + F+ F++++ K Y S  EH 
Sbjct: 7   LFSV-SLLFVFVSVSICGDEDLLIRQVVDEAEPKVLSSEDHFTLFKKKFGKDYGSIEEHY 65

Query: 55  IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
            RF  F+ +L       ++++   SAR+G+T+FSDL+  EF+ +HL       V      
Sbjct: 66  YRFSVFKANL---RRAMRHQKMDPSARHGVTQFSDLTGSEFRRKHL------GVTGGFKL 116

Query: 115 HHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
             D +   +          +PT  +P + DWR+ G +  V+NQ +CG+CW+FST    E 
Sbjct: 117 PKDANQAPI----------LPTHNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEG 166

Query: 174 MHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESE 225
            H L  G L  LS Q+++DC        AG+ + GC+GG   +  ++  +    L  E +
Sbjct: 167 AHFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYT-LKTGGLMREED 225

Query: 226 YPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYL 285
           YP    D    +   S     + +++  ++  +E  I  ++  +GP+  A+NA   Q Y+
Sbjct: 226 YPYTGTDGGSCKLDRSKIVASVSNFSVVSI--NEDQIAANLVKNGPLAVAINAAYMQTYI 283

Query: 286 GGV-IQYNCDGSLANINHAVQIVGY 309
           GGV   Y C   L   NH V ++GY
Sbjct: 284 GGVSCPYICSRRL---NHGVLLMGY 305


>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
 gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
          Length = 344

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 92/324 (28%), Positives = 155/324 (47%), Gaps = 39/324 (12%)

Query: 6   NVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFEKS 63
           N L ++A++ L  L      S+   E  +EL   ++  +Y + Y  + E + RFK F+++
Sbjct: 7   NKLVLMAML-LVTLWASQSWSRSLHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKEN 65

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
           ++ IE  N N   P   + GI  F+DL+ EEF+  H  ++++     S ++     + +V
Sbjct: 66  VEFIESFNNNGNKP--YKLGINAFTDLTNEEFRASHNGYTMSMSSHQSSYRTKSFRYENV 123

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
                       T +P   DWR  G +  +++Q  CG CWAFS V   E +  L  GTL 
Sbjct: 124 ------------TAVPPSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGTLI 171

Query: 184 LLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLE-----PESEYPLLLKDAACKR 237
            LS QE++DC  +G + GC GG    L+D  D  + ++E      E+ YP    D +C  
Sbjct: 172 SLSEQELVDCDTSGMDQGCEGG----LMD--DAFEFIIENNGLTTEANYPYEGVDGSCNT 225

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDG 295
           +  + +  KI  Y  + +   +   L     + PV  A++A    +Q+Y  G+   +C  
Sbjct: 226 RKAANHAAKITGY--ENVPAYDEEALRKAVANQPVSVAIDAGESAFQHYSSGIFTGDCGT 283

Query: 296 SLANINHAVQIVGY---DNYSRTW 316
            L   +H V +VGY   D+ ++ W
Sbjct: 284 EL---DHGVTVVGYGTSDDGTKYW 304


>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
 gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
          Length = 953

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 144/281 (51%), Gaps = 28/281 (9%)

Query: 36  LFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           +F  F+  +++ Y+ S EH++RF  F  +L  IE+LNK  +   +A+YG+T+F+D++  E
Sbjct: 642 MFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERG--TAKYGVTKFADMTVAE 699

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           ++        +  +++  H   +H  N V       G+     +P   DWR+ G + +V+
Sbjct: 700 YR-------AHTGLVVPKHDRANHVGNRVASEEDVAGVG---DLPRSFDWRDHGAVTEVK 749

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           NQ +CG+CWAFS V   E +H +K   L   S QE+IDC    N GC GG     +D  D
Sbjct: 750 NQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVDN-GCGGG----YMD--D 802

Query: 215 VNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
             K +     LE E++YP   K         S + V++K      +  +E+ I   +  +
Sbjct: 803 AFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAV--DMPKNETYIAKYLIKN 860

Query: 270 GPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
           GP+   +NA   Q+Y GG+   ++   +  +I+H V IVGY
Sbjct: 861 GPIAIGLNANAMQFYRGGISHPWHPLCNHKSIDHGVLIVGY 901


>gi|162459555|ref|NP_001105685.1| cysteine proteinase 1 precursor [Zea mays]
 gi|1706260|sp|Q10716.1|CYSP1_MAIZE RecName: Full=Cysteine proteinase 1; Flags: Precursor
 gi|643597|dbj|BAA08244.1| cysteine proteinase [Zea mays]
          Length = 371

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 86/284 (30%), Positives = 136/284 (47%), Gaps = 32/284 (11%)

Query: 37  FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F SF QR+ KSY  + EH  R   F+   D +    +++    SA +G+T+FSDL+  EF
Sbjct: 48  FLSFVQRFGKSYKDADEHAYRLSVFK---DNLRRARRHQLLDPSAEHGVTKFSDLTPAEF 104

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
           +  +L    ++  L+       H               +PT G+P   DWR+ G +G V+
Sbjct: 105 RRTYLGLRKSRRALLRELGESAHE-----------APVLPTDGLPDDFDWRDHGAVGPVK 153

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FS     E  H L  G L +LS Q+ +DC          + + GC+GG  
Sbjct: 154 NQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLM 213

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
                ++      LE E +YP    D  CK    S     +++++  ++   E+ I  ++
Sbjct: 214 TTAFSYLQ-KAGGLESEKDYPYTGSDGKCKFD-KSKIVASVQNFSVVSV--DEAQISANL 269

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             HGP+   +NA   Q Y+GGV   Y C     +++H V +VGY
Sbjct: 270 IKHGPLAIGINAAYMQTYIGGVSCPYICG---RHLDHGVLLVGY 310


>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
          Length = 367

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 86/279 (30%), Positives = 137/279 (49%), Gaps = 11/279 (3%)

Query: 34  LELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           + LF  +  R+ K Y S  E   R + F  +L  I   NKN  S  S R G+ +F+DL+ 
Sbjct: 40  VRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNS--SFRLGLNKFADLTN 97

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEFKTR+   +  +       +        V K+++ +  +  + I    DWR+ G +  
Sbjct: 98  EEFKTRYFGKNSKQWRDRRRTELEGAELRPVLKQTVGSQSSSCS-IASSLDWRKKGAVTG 156

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V++Q  CG+CWAFST    E ++ +  G L  LS QE++ C    N GC GGD      W
Sbjct: 157 VKDQAQCGSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACDAT-NYGCEGGDMDYAFTW 215

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           + +    ++ E +Y     D+ C     +   V I  YT   + P +S++L    +  PV
Sbjct: 216 V-IQNGGIDTEKDYSYTGVDSTCNTNKEAKKIVSIDGYT--DVSPDDSALLCAAGSQ-PV 271

Query: 273 IAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
              ++  A+ +Q Y GG+   +C G+  +I+HAV +VGY
Sbjct: 272 SVGIDGSAIDFQLYTGGIYDGDCSGNPDDIDHAVLVVGY 310


>gi|209882566|ref|XP_002142719.1| papain family cysteine protease [Cryptosporidium muris RN66]
 gi|209558325|gb|EEA08370.1| papain family cysteine protease, putative [Cryptosporidium muris
           RN66]
          Length = 400

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 90/290 (31%), Positives = 140/290 (48%), Gaps = 26/290 (8%)

Query: 28  PNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITE 86
           P+ ++    F  F+Q+YKK YS  +E   R+  F K+++ I+  N       S    + E
Sbjct: 77  PSEQEFKNQFEDFKQKYKKEYSNLTEEKYRYSIFRKNMNFIKMSN---NQGFSYVLEMNE 133

Query: 87  FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
           + DL+ EEF           H  M +H  H +         +++     T  P   +W +
Sbjct: 134 YGDLTHEEFM----------HNFMGYHPQHKNKRFSDSHNILSSNKVENTSPPRFVNWVD 183

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAES-MHALKNGTLSLLSVQEVIDCA-GNGNMGCSGG 204
           AG +  VR+Q+ CG+CWAFS V + ES + A KN  L  LS Q+ +DC   NGN GC GG
Sbjct: 184 AGCVNPVRDQRYCGSCWAFSVVTSLESAVCAQKNEKLVKLSEQQFVDCTRNNGNFGCDGG 243

Query: 205 DFCALLDWMDVNKVVLEPESEYPLLLKDAACK-RKATSPNGVKIKSYTCDTLIPSESSIL 263
                  ++ +    L  E EYP +  + +CK     +P    + SY    ++P+  + L
Sbjct: 244 SLDLAFQYV-MEHQYLCTEEEYPYIANEKSCKFSNCKNPIRYILDSYR--NVVPNNINAL 300

Query: 264 -TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
              +A +GP+  A+ A    +Q+Y  GV    C     ++NHAV +VGYD
Sbjct: 301 KVAVAKYGPISVAIQADQAPFQFYKKGVFDAPCG---TDVNHAVVLVGYD 347


>gi|343472324|emb|CCD15484.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 94/322 (29%), Positives = 158/322 (49%), Gaps = 40/322 (12%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 67  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   +T+ TG  P   DWR+ G +  V++Q  CG+CWAFS +   E    +  
Sbjct: 111 ALKRPRKV---VTVSTGKAPEAVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVAG 167

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAACKRK 238
             L+ LS Q ++ C       C GG       W +  NK  +  E  YP     ++  R 
Sbjct: 168 HELTSLSEQTLVSCDPT-EYACEGGFMDNAFRWIISSNKGKVFTEQSYPY----SSGGRN 222

Query: 239 ATSPN------GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
             + N      G  I  Y    L   E++I   +A +GPV   V+A ++Q Y GGV+  +
Sbjct: 223 VPACNMSGKVVGANISDYV--DLPQDENAIAEWLAKNGPVSVIVDATSFQSYTGGVLT-S 279

Query: 293 CDGSLANINHAVQIVGYDNYSR 314
           C   +  +NHAV +VGYD+ S+
Sbjct: 280 CLSKI--LNHAVLLVGYDDTSK 299


>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
          Length = 365

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 83/284 (29%), Positives = 137/284 (48%), Gaps = 23/284 (8%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEE--LNKNRQSPESARYGITEF 87
           E+    +  +  R+ K+Y+   E + RF+ F  +L  I+E  L+ NR    S + G+ +F
Sbjct: 30  EEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNR----SYKVGLNQF 85

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DL+ EE+++ +L   V+ +  ++  +  +    +  + +           P K DWRE 
Sbjct: 86  ADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEM--------FPAKVDWRER 137

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG+CWAFSTV + E ++ +  G L  LS QE++DC    N GC+GG   
Sbjct: 138 GAVSPVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMD 197

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
               ++ V+   ++ ES+YP     A C         V I  Y  + + P     L    
Sbjct: 198 YAFQFI-VSNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGY--EDVPPMNEKALMKAV 254

Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            H PV   + A    +Q Y  GV+  +C     N++H V +VGY
Sbjct: 255 AHQPVSVGIEASGRAFQLYTSGVLTGSCG---TNLDHGVVVVGY 295


>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
          Length = 350

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 101/320 (31%), Positives = 152/320 (47%), Gaps = 41/320 (12%)

Query: 7   VLFIVALIALCFL---AIPVKVSKPNLEQKLEL---------FSSFQQRYKKSY-SKSEH 53
           VLF VA  A  F    + P+++     EQ L++         F+ F  RY K Y S  E 
Sbjct: 9   VLFCVASAAAGFSFHDSNPIRMVSDVEEQLLQVIGESRHAVSFARFANRYGKRYDSVDEM 68

Query: 54  DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVLMSH 112
            +RFK F ++L++I   NK R S    + G+  F+D + EEF++  L  + N    L  +
Sbjct: 69  KLRFKIFSENLELIRSSNKRRLS---YKLGVNHFADWTWEEFRSHRLGAAQNCSATLKGN 125

Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
           HK  D +                  +P +KDWR+ GI+  V++Q +CG+CW FST    E
Sbjct: 126 HKITDAN------------------LPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALE 167

Query: 173 SMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK 231
           S +A   G    LS Q+++DCAG   N GCSGG      +++  N   LE E  YP    
Sbjct: 168 SAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNG-GLETEEAYPYTGS 226

Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQ 290
           +  CK ++     VK+   + +  + +E  +   IA   PV  A   +  ++ Y  GV  
Sbjct: 227 NGLCKFRSEHV-AVKVLG-SVNITLGAEDELKHAIAFARPVSVAFEVVHDFRLYKSGVYT 284

Query: 291 YN-CDGSLANINHAVQIVGY 309
              C  +  ++NHAV  VGY
Sbjct: 285 STACGSTPMDVNHAVLAVGY 304


>gi|115446097|ref|NP_001046828.1| Os02g0469600 [Oryza sativa Japonica Group]
 gi|47497527|dbj|BAD19579.1| putative cysteine proteinase 1 precursor [Oryza sativa Japonica
           Group]
 gi|113536359|dbj|BAF08742.1| Os02g0469600 [Oryza sativa Japonica Group]
 gi|215701326|dbj|BAG92750.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215704370|dbj|BAG93804.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215708762|dbj|BAG94031.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218200777|gb|EEC83204.1| hypothetical protein OsI_28465 [Oryza sativa Indica Group]
 gi|222622835|gb|EEE56967.1| hypothetical protein OsJ_06681 [Oryza sativa Japonica Group]
          Length = 373

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 86/295 (29%), Positives = 143/295 (48%), Gaps = 37/295 (12%)

Query: 31  EQKLEL-----FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGI 84
           + +LEL     F+SF QR+ KSY  + EH  R   F+ +L       +++    SA +G+
Sbjct: 39  DNELELNAERHFASFVQRFGKSYRDADEHAYRLSVFKANL---RRARRHQLLDPSAEHGV 95

Query: 85  TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKD 143
           T+FSDL+  EF+  +L    ++   +       H               +PT G+P   D
Sbjct: 96  TKFSDLTPAEFRRAYLGLRTSRRAFLRGLGGSAHE-----------APVLPTDGLPDDFD 144

Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AG 195
           WR+ G +G V+NQ +CG+CW+FS     E  + L  G + +LS Q+++DC          
Sbjct: 145 WRDHGAVGPVKNQGSCGSCWSFSASGALEGANYLATGKMDVLSEQQMVDCDHECDSSEPD 204

Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
           + + GC+GG       ++ +    LE E +YP   +D  CK    S     +++++  ++
Sbjct: 205 SCDAGCNGGLMTNAFSYL-LKSGGLESEKDYPYTGRDGTCKFD-KSKIVTSVQNFSVVSV 262

Query: 256 IPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
              E  I  ++  HGP+   +NA   Q Y+GGV   Y C     +++H V +VGY
Sbjct: 263 --DEDQIAANLVKHGPLAIGINAAYMQTYIGGVSCPYICG---RHLDHGVLLVGY 312


>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
 gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 92/319 (28%), Positives = 142/319 (44%), Gaps = 29/319 (9%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKS 63
           +N L  VAL+ +   A        +     E    +  +Y + Y   SE + RF+ F  +
Sbjct: 6   ENKLMFVALLVVGLWASQAWSRSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNN 65

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
           ++ IE  NK    P   +  I EF+DL+ EEFK     +  +  V ++      + +   
Sbjct: 66  VEFIESFNKLGNRP--YKLDINEFADLTNEEFKVSKNGYKRSSGVGLTEKSSFRYAN--- 120

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
                       T +P   DWR+ G +  +++Q  CG CWAFS V   E +  L  G L 
Sbjct: 121 -----------VTAVPTSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLI 169

Query: 184 LLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
            LS QE++DC  +G + GC GG      +++  N   L  E+ YP    D  C       
Sbjct: 170 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNG-GLTTEANYPYQGTDGTCNTNKAGN 228

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
           +  KI  Y  D    SE ++L  +A+  PV  A++A    +Q+Y GGV   +C   L   
Sbjct: 229 DAAKITGYE-DVPANSEDALLKAVASQ-PVSVAIDASGSAFQFYSGGVFTGDCGTEL--- 283

Query: 301 NHAVQIVGY---DNYSRTW 316
           +H V  VGY   D+ ++ W
Sbjct: 284 DHGVTAVGYGTSDDGTKYW 302


>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
 gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
          Length = 341

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 84/271 (30%), Positives = 126/271 (46%), Gaps = 26/271 (9%)

Query: 43  RYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR 101
           +Y + Y   SE + RF+ F  +++ IE  NK    P   +  I EF+DL+ EEFK     
Sbjct: 44  KYGRVYKDNSEKERRFEIFRNNVEFIESFNKPGNRP--YKLDINEFADLTNEEFKASRNG 101

Query: 102 HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGA 161
           +  + +V +S      + +               T +P   DWR+ G +  +++Q  CG 
Sbjct: 102 YKRSSNVGLSEKSSFRYGN--------------VTAVPTSMDWRQKGAVTPIKDQGQCGC 147

Query: 162 CWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVL 220
           CWAFS V   E +  L  G L  LS QE++DC  +G + GC GG      +++  N   L
Sbjct: 148 CWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNG-GL 206

Query: 221 EPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA-- 278
             E+ YP    D  C       +  KI  Y  D    SE ++L  +A+  PV  A++A  
Sbjct: 207 TTEANYPYQGTDGTCNTNKAGNDAAKITGYE-DVPANSEDALLKAVASQ-PVSVAIDASG 264

Query: 279 LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             +Q+Y GGV   +C   L   +H V  VGY
Sbjct: 265 SAFQFYSGGVFTGDCGTEL---DHGVTAVGY 292


>gi|340380717|ref|XP_003388868.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
          Length = 337

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 100/312 (32%), Positives = 150/312 (48%), Gaps = 42/312 (13%)

Query: 14  IALCFLAIPVKVSKP----NLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
            AL FL     V+ P    + E   E F+ + ++Y+K+YS   E++ R + +  +   IE
Sbjct: 8   FALFFLLASFTVALPFSPSDDEVMAESFNMWMKKYEKTYSTMEEYNERLRVYTSNYYYIE 67

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
           +LNK    P +  Y + +FSDL+  EFK           + ++  +H    + + +K   
Sbjct: 68  QLNK-EHGPHT-EYELNQFSDLTFAEFK----------KIYLTEPQHCSATNGNFQK--- 112

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                +    PV  DWRE  +I  V++Q  CG+CW FST    E+ HA+K G L  LS Q
Sbjct: 113 ----PVNARDPVAVDWREKNVITPVKDQGKCGSCWTFSTTGCLEAHHAIKTGQLISLSEQ 168

Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK-----ATSP 242
           +++DCAG   N GC+GG      +++  N  + E ES Y    KD  C+       AT  
Sbjct: 169 QLVDCAGAFNNHGCNGGLPSQAFEYIKYNGGI-ESESNYNYTAKDGVCRFNSSLVAATVS 227

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPV-IAAVNALTWQYYLGGVIQYN---CDGSLA 298
           + V I          +E  I T +A  GPV IA     ++Q+Y  GV Q     C  S  
Sbjct: 228 DVVNITK-------DAEGDIGTAVANVGPVSIAFEVTKSFQHYKKGVYQGEIEVCSQSPD 280

Query: 299 NINHAVQIVGYD 310
            +NHAV +VGY+
Sbjct: 281 KVNHAVLVVGYN 292


>gi|194705198|gb|ACF86683.1| unknown [Zea mays]
 gi|413936851|gb|AFW71402.1| cysteine protease1 [Zea mays]
          Length = 371

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 86/284 (30%), Positives = 136/284 (47%), Gaps = 32/284 (11%)

Query: 37  FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F SF QR+ KSY  + EH  R   F+ +L       +++    SA +G+T+FSDL+  EF
Sbjct: 48  FLSFVQRFGKSYKDADEHAYRLSVFKANL---RRARRHQLLDPSAEHGVTKFSDLTPAEF 104

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
           +  +L    ++  L+       H               +PT G+P   DWR+ G +G V+
Sbjct: 105 RRTYLGLRKSRRALLRELGESAHE-----------APVLPTDGLPDDFDWRDHGAVGPVK 153

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FS     E  H L  G L +LS Q+ +DC          + + GC+GG  
Sbjct: 154 NQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLM 213

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
                ++      LE E +YP    D  CK    S     +++++  ++   E+ I  ++
Sbjct: 214 TTAFSYLQ-KAGGLESEKDYPYTGSDGKCKFD-KSKIVASVQNFSVVSV--DEAQISANL 269

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             HGP+   +NA   Q Y+GGV   Y C     +++H V +VGY
Sbjct: 270 IKHGPLAIGINAAYMQTYIGGVSCPYICG---RHLDHGVLLVGY 310


>gi|41019551|tpe|CAD66657.1| TPA: putative cysteine proteinase precursor [Hordeum vulgare subsp.
           vulgare]
 gi|326489967|dbj|BAJ94057.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525847|dbj|BAJ93100.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 377

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 86/294 (29%), Positives = 141/294 (47%), Gaps = 35/294 (11%)

Query: 30  LEQKLEL---FSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGIT 85
           L+  LEL   F  F QR+ K+Y  +E H  R   F+ +L       +++    SA +G+T
Sbjct: 43  LDNDLELDSQFVGFVQRFGKTYRDAEEHAHRLSVFKANL---RRARRHQLLDPSAEHGVT 99

Query: 86  EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDW 144
           +FSDL+  EF+  +L     +   +       H               +PT G+P   DW
Sbjct: 100 KFSDLTPAEFRRTYLGLKTTRRSFLREMAGSAH-----------DAPVLPTDGLPEDFDW 148

Query: 145 REAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGN 196
           R+ G +G V+NQ +CG+CW+FS     E  + L +G + +LS Q+++DC          +
Sbjct: 149 RDHGAVGPVKNQGSCGSCWSFSASGALEGANYLASGKMEVLSEQQLVDCDHECDPSEPDS 208

Query: 197 GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLI 256
            + GC+GG   +   ++ +    LE E +YP   KD  CK    S     +++Y+   + 
Sbjct: 209 CDAGCNGGLMTSAFSYL-LKSGGLEREKDYPYTGKDGTCKFD-KSKIAASVQNYS--VVA 264

Query: 257 PSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             E  I  ++  +GP+   +NA   Q Y+GGV   Y C     +++H V +VGY
Sbjct: 265 VDEEQIAANLVKYGPLAIGINAAYMQTYIGGVSCPYICG---RHLDHGVLLVGY 315


>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 479

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 89/294 (30%), Positives = 145/294 (49%), Gaps = 32/294 (10%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           E+   L+ S+   + K+Y+   E + RF+ F+ +L  I+E N+  ++    + G+T F+D
Sbjct: 56  EEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRESRT---YKVGLTRFAD 112

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ EE++ R L    ++   +S  K          + +   G  +P  +    DWR+ G 
Sbjct: 113 LTNEEYRARFLGGRFSRKPRLSAAKS--------GRYAAALGDDLPDDV----DWRKKGA 160

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  V++Q  CG+CWAFS+V   E ++ +  G L  LS QE++DC  + NMGC+GG    L
Sbjct: 161 VATVKDQGQCGSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGG----L 216

Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
           +D+     +    ++ E +YP   +DAAC     +   V I  Y  D     ESS+   +
Sbjct: 217 MDYAFQFIIGNGGIDTEEDYPYKGRDAACDPNRKNAKVVTIDGYE-DVPENDESSLKKAV 275

Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
           A   PV  A+ A    +Q Y  GV    C     +++H V  VGY  DN +  W
Sbjct: 276 ANQ-PVSVAIEAGGRAFQLYQSGVFTGRCG---TDLDHGVVAVGYGTDNGTDYW 325


>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
 gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 81/280 (28%), Positives = 142/280 (50%), Gaps = 23/280 (8%)

Query: 34  LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           + +++ +  ++ K+Y+K  E + RF+ F+ +L  I+E N ++    + + G+T F+DL+ 
Sbjct: 45  ISMYNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKN--RTYKVGLTRFADLTN 102

Query: 93  EEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
           EE++ + L   S  K  LM          N  ++ +   G  +P  I    DWR++G + 
Sbjct: 103 EEYRAKFLGTKSDPKRRLMKSK-------NPSQRYAFKAGDVLPESI----DWRQSGAVS 151

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD 211
            +++Q +CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG       
Sbjct: 152 AIKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLSEQELVDCDRSYNAGCNGGLMDNAFQ 211

Query: 212 WMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
           ++ +N   ++ + +YP    D  C         V I  +  D +   E ++   +A H P
Sbjct: 212 FI-INNGGIDTDKDYPYQAVDGKCDTTKVKNKAVTIDGFE-DVMAFDEMALQKAVA-HQP 268

Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           V  A+ A  +  Q+Y  GV    C  +L   +H V IVGY
Sbjct: 269 VSVAIEASGMALQFYQSGVFTGECGSAL---DHGVVIVGY 305


>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
           occidentalis]
          Length = 469

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 89/305 (29%), Positives = 148/305 (48%), Gaps = 31/305 (10%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
           L+I +++AL    + V     N E        F++ + K+Y   EH +R   F+++L  I
Sbjct: 147 LYIASVLALV---VAVGADLTNFEH-------FKEHFGKTYEGDEHALRQGIFQRNLAHI 196

Query: 68  EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
           E+ N  + +      GIT+F+D+S  EF+  +L   +N   +    K        +++  
Sbjct: 197 EKFNAEKAASRGYTLGITQFADMSTAEFRQTYLGLRMNASTIAKLRK--------LQREV 248

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
           +     +P  +    DWR+ G +  V++Q  CG+CWAFST    E  H LKNG L  LS 
Sbjct: 249 VADDRDLPEAV----DWRDKGAVSPVKDQGQCGSCWAFSTSGAIEGQHFLKNGELLSLSE 304

Query: 188 QEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
           Q+++DC+   + GC+GG     ++++  N   LE E+ YP      +C     S    KI
Sbjct: 305 QQMVDCSWL-DFGCNGGQPMLAMEYVRFNG-GLELETAYPYKGVGGSCHSDKKSA-AAKI 361

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDG-SLANINHAV 304
             +       SES++   +A  GP+   ++A    +Q+Y  G+  YN +  S   ++HAV
Sbjct: 362 TGFWMAGFY-SESALQKAVAKVGPISVGMDASGEDFQHYKSGI--YNPESCSSIGLDHAV 418

Query: 305 QIVGY 309
             VGY
Sbjct: 419 LAVGY 423


>gi|258406688|gb|ACV72067.1| putative cysteine protease [Lathyrus sativus]
          Length = 350

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 102/320 (31%), Positives = 149/320 (46%), Gaps = 41/320 (12%)

Query: 7   VLFIVALIALCFL---AIPVKVSKPNLEQKLEL---------FSSFQQRYKKSY-SKSEH 53
           VLF V   A  F    + P+++     EQ L++         F+ F  RY K Y S  E 
Sbjct: 9   VLFCVTTAAAGFSFHDSNPIRMVSDAEEQLLQVIGESRHAVSFARFANRYGKLYDSVDEM 68

Query: 54  DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVLMSH 112
            +RFK F ++L++I   NK R S    + G+  F+D + EEFK+  L  + N    L  +
Sbjct: 69  KLRFKIFSENLELIRSTNKRRLS---YKLGVNHFADWTWEEFKSHRLGAAQNCSATLKGN 125

Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
           HK  D +                  +P +KDWR+ GI+ +V++Q  CG+CW FST    E
Sbjct: 126 HKITDAN------------------LPDEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALE 167

Query: 173 SMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK 231
           S +A   G    LS Q+++DCAG   N GCSGG      +++  N   LE E  YP    
Sbjct: 168 SAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNG-GLETEETYPYTGS 226

Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQ 290
           +  C  K TS N       + +  + SE  +   +A   PV  A   +  ++ Y  GV  
Sbjct: 227 NGLC--KFTSENVALKVLGSVNITLGSEDELKHAVAFARPVSVAFEVVHDFRLYKSGVYT 284

Query: 291 YN-CDGSLANINHAVQIVGY 309
              C  +  ++NHAV  VGY
Sbjct: 285 STACGNTPMDVNHAVLAVGY 304


>gi|350587549|ref|XP_003482436.1| PREDICTED: cathepsin O-like [Sus scrofa]
          Length = 209

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 68/172 (39%), Positives = 96/172 (55%), Gaps = 11/172 (6%)

Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSG 203
           WR+ G          CG CWAFS V   ES +A+K   L +LSVQ+VIDC+ N N GC+G
Sbjct: 10  WRKGG--------SKCGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCSYN-NYGCNG 60

Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
           G     L W++  +V +  +SEYP   ++  C   + S +GV IK Y+       E  + 
Sbjct: 61  GSTLNALYWLNKTQVKVVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSAYDFSGQEDEMA 120

Query: 264 TDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
             + T GP+I  V+A++WQ YLGG+IQ++C  S    NHAV + G+D    T
Sbjct: 121 KTLLTLGPLIVIVDAVSWQDYLGGIIQHHC--SSGEANHAVLVTGFDKTGST 170


>gi|1749812|emb|CAA90237.1| cysteine proteinase LmCPB1 [Leishmania mexicana]
          Length = 359

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 88/280 (31%), Positives = 136/280 (48%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 37  LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FCARYL----NGAAYFAAAKRHTPQHYPKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS V   E    L    L  LS Q+++ C  + N GC GG      DW+ 
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201

Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N    L  E  YP +  +     C   +    G +I  +    +  SE ++   +A +G
Sbjct: 202 QNTNGHLYTEDSYPYVSGNGYLPECSNSSKLVVGAQIDGHVL--IGSSEKAMAAWLAKNG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  A++A ++  Y  GV+   C G    +NHAV +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QVNHAVLLVGYD 296


>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 427

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 88/288 (30%), Positives = 138/288 (47%), Gaps = 27/288 (9%)

Query: 37  FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITE--FSDLSEE 93
           F  +  ++ ++Y+   E   RF+ ++++L +IEE N          Y +T+  F+DL+ E
Sbjct: 119 FEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNSGGHG-----YTLTDNKFADLTNE 173

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           EF+ + L         +           H        G    T +P   DWR+ G + +V
Sbjct: 174 EFRAKMLGG-------LGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEV 226

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           +NQ +CG+CWAFS V   E ++ +KNG L  LS QE++DC     +GC+GG      +++
Sbjct: 227 KNQGSCGSCWAFSAVAAMEGLNQIKNGKLVSLSEQELVDCDAEA-VGCAGGFMSWAFEFV 285

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L  E+ YP    + AC+    + + V I  Y  +  + SE+ +L  +A   PV 
Sbjct: 286 MANH-GLTTEASYPYKGINGACQTAKLNESSVSITGYV-NVTVNSEAELL-KVAAVQPVS 342

Query: 274 AAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY---DNYSRTW 316
            AV+A    +Q Y GGV    C    A INH V +VGY   D   + W
Sbjct: 343 VAVDAGGFLFQLYAGGVFSGPCT---AQINHGVTVVGYGETDKAEKYW 387


>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
          Length = 462

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 82/295 (27%), Positives = 147/295 (49%), Gaps = 29/295 (9%)

Query: 22  PVKVSKPNLEQKLELFSSFQQRYKKSYSK--SEHDIRFKNFEKSLDIIEELNKNRQSPES 79
           P K    + E+ + L+ S+   + KSY+    E D RF+ F+ +L  I+E  +N +   S
Sbjct: 34  PAKGLSRSDEEVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDE--QNSRGDRS 91

Query: 80  ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
            + G+  F+DL+ EE+++ +L    +    ++  K         ++ +   G ++P  I 
Sbjct: 92  YKLGLNRFADLTNEEYRSTYLGAKTDARRRIAKTKSD-------RRYAPKAGGSLPDSI- 143

Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNM 199
              DWRE G + +V++Q +CG+CWAFST+   E ++ +  G L  LS QE++DC  + N 
Sbjct: 144 ---DWREKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTSYNE 200

Query: 200 GCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLI 256
           GC+GG    L+D+     +    ++ E++YP   +   C +   +   V I  Y  + + 
Sbjct: 201 GCNGG----LMDYAFEFIIKNGGIDTEADYPYTGRYGRCDQTRKNAKVVSIDGY--EDVT 254

Query: 257 PSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           P + + L +     PV  A+ A    +Q Y  G+   +C     +++H V  VGY
Sbjct: 255 PYDEAALKEAVAGQPVSVAIEAGGRDFQLYSSGIFTGSCG---TDLDHGVTAVGY 306


>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 140/313 (44%), Gaps = 29/313 (9%)

Query: 4   VKNVLFIVALIAL-C--FLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKN 59
            KN  + ++L  L C  FL   V           E    +  RY K Y    E + RFK 
Sbjct: 3   AKNQFYQISLALLFCSGFLTFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F+++++ IE  N     P +   GI +F+DL+ EEF     R+    H+  S  +     
Sbjct: 63  FKENVNYIEAFNNAANKPYT--LGINQFADLTNEEFIAP--RNRFKGHMCSSITRTTTFK 118

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
           + +V            T IP   DWR+ G +  +++Q  CG CWAFS V   E +HAL  
Sbjct: 119 YENV------------TAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSA 166

Query: 180 GTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
           G L  LS QEV+DC   G + GC+GG       ++  N   L  E  YP    D  C  K
Sbjct: 167 GKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNH-GLNNEPNYPYKAVDGKCNAK 225

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
           A + +   I  Y  D  + +E ++   +A   PV  A++A    +Q+Y  GV   +C   
Sbjct: 226 AAANHVATITGYE-DVPVNNEKALQKAVANQ-PVSVAIDASGSDFQFYQSGVFTGSCGTE 283

Query: 297 LANINHAVQIVGY 309
           L   +H V  VGY
Sbjct: 284 L---DHGVTAVGY 293


>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
 gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
          Length = 443

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 87/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 37  LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS V   E    L    L  LS Q+++ C  + N GC GG      DW+ 
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201

Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N    L  E  YP +  +     C   +    G +I  +    +  SE ++   +A +G
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  A++A ++  Y  GV+   C G    +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296


>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
          Length = 454

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 84/282 (29%), Positives = 140/282 (49%), Gaps = 29/282 (10%)

Query: 34  LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           +EL+  +  ++KK+Y+   E   +F  F+ +   I +   N Q   S + G+ +F+DLS 
Sbjct: 41  MELYELWLAQHKKAYNGLDEKQKKFSVFKDNFLYIHQ--HNNQGNPSYKLGLNQFADLSH 98

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEFK  +L   ++    +S      + +        + G  +P  I    DWRE G +  
Sbjct: 99  EEFKAAYLGTKLDAKKRLSRSPSPRYQY--------SVGEDLPESI----DWREKGAVTA 146

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE++DC  + N GC+GG    L+D+
Sbjct: 147 VKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGG----LMDY 202

Query: 213 ---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
                ++   L+ E +YP    + +C     + + V I  Y  + +  ++   L   A +
Sbjct: 203 AFQFIISNGGLDSEDDYPYKANNGSCDAYRKNAHVVTIDDY--EDVPENDEKSLKKAAAN 260

Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            P+  A+ A    +Q+Y  GV   NC   L   +H V +VGY
Sbjct: 261 QPISVAIEASGRAFQFYESGVFTSNCGTQL---DHGVTLVGY 299


>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 443

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 87/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 37  LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS V   E    L    L  LS Q+++ C  + N GC GG      DW+ 
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201

Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N    L  E  YP +  +     C   +    G +I  +    +  SE ++   +A +G
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 259

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  A++A ++  Y  GV+   C G    +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296


>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
          Length = 464

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 91/324 (28%), Positives = 149/324 (45%), Gaps = 36/324 (11%)

Query: 1   MFDVKNVLFIVALIAL------CFLAI----PVKVSKPNLEQKLELFSSFQQRYKKSYSK 50
           M     +LFI     L      C ++     P K +    +Q L ++  +  ++ K+Y+ 
Sbjct: 1   MLSKLTILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKNYNA 60

Query: 51  -SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVL 109
             E + RF+ F+ +L  I+E N    S    R G+  F+DL+ EE++TR L   +N +  
Sbjct: 61  LGEKEKRFEIFKDNLGFIDEHNSKNLS---FRLGLNRFADLTNEEYRTRFLGTRINPN-- 115

Query: 110 MSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVE 169
               + +   ++   + +   G  +P  +    DWR+ G +  V++Q +CG+CWAFS + 
Sbjct: 116 ----RRNRKVNSQTNRYATRVGDKLPESV----DWRKEGAVVGVKDQGSCGSCWAFSAIA 167

Query: 170 TAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEY 226
             E ++ L  G L  LS QE++DC  + N GC+GG    L+D+     +N V L PE +Y
Sbjct: 168 AVEGVNKLATGDLISLSEQELVDCDTSYNEGCNGG----LMDYAFEFIINMVALTPEEDY 223

Query: 227 PLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV-NALTWQYYL 285
           P    D  C +   +   V I  Y  D     E ++   +A     +A       +Q Y 
Sbjct: 224 PYRAIDGRCDQNRKNAKVVSIDQYE-DVPAYDEGALKKAVANQVIAVAVEGGGREFQLYD 282

Query: 286 GGVIQYNCDGSLANINHAVQIVGY 309
            GV    C  +L   +H V  VGY
Sbjct: 283 SGVFTGRCGTAL---DHGVAAVGY 303


>gi|357619727|gb|EHJ72186.1| cathepsin [Danaus plexippus]
          Length = 336

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 90/296 (30%), Positives = 148/296 (50%), Gaps = 28/296 (9%)

Query: 24  KVSKP---NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA 80
           KV KP   ++++   LF +F + Y K Y   E + RFK F  +L  I +LN       +A
Sbjct: 25  KVRKPVFYSMDEAPILFENFIREYNKKYDSKEKEERFKIFVNNLKRINDLN---HKSTNA 81

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS-ITTGITIPTGIP 139
            +GI +F+DLS+EEFK  +     +K  L           +++KK S ++  IT P    
Sbjct: 82  VHGINKFTDLSKEEFKKFYTGFKPDKSFL----------DDNIKKPSQLSFNITAPPAF- 130

Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNM 199
              DWR+ G++ +V+NQ TCG+CWAFST+   ES++A+K+G L  LS Q+++DC    + 
Sbjct: 131 ---DWRDKGVVTRVKNQGTCGSCWAFSTIGNVESVNAIKHGNLVELSEQQLVDCDSK-DE 186

Query: 200 GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSE 259
            C  G       ++  +  +   E  YP     A C   ++    V ++    + ++ SE
Sbjct: 187 ACDSGLPDNAQQYLVSHGAI--SEQSYPYKGYAANCTYDSSQ---VVVRLSNFEKVVLSE 241

Query: 260 SSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
             +   + +  P+   + A     Y  G++   C+ S  ++NHAV +VGY N   T
Sbjct: 242 CQMAEKLYSTAPLSIVIAAEVLGTYTKGILVNECEQS-QDLNHAVLLVGYGNEGGT 296


>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 533

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 87/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 127 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 183

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 184 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 232

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS V   E    L    L  LS Q+++ C  + N GC GG      DW+ 
Sbjct: 233 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 291

Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N    L  E  YP +  +     C   +    G +I  +    +  SE ++   +A +G
Sbjct: 292 QNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 349

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  A++A ++  Y  GV+   C G    +NH V +VGYD
Sbjct: 350 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 386


>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
 gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
          Length = 452

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 86/288 (29%), Positives = 140/288 (48%), Gaps = 30/288 (10%)

Query: 34  LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           +EL+  +   +K++Y+   E   RF  F+ +   I E N   Q   S + G+ +F+DLS 
Sbjct: 39  MELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHN---QGNRSYKLGLNQFADLSH 95

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEFK  +L   ++    +S      + +        + G  +P  I    DWRE G +  
Sbjct: 96  EEFKATYLGAKLDTKKRLSRPPSRRYQY--------SDGEDLPESI----DWREKGAVTS 143

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V++Q +CG+CWAFSTV   E ++ +  G L  LS QE++DC  + N GC+GG    L+D+
Sbjct: 144 VKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGG----LMDY 199

Query: 213 ---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
                +N   L+ E +YP    D +C     + + V I  Y  + +  ++   L   A +
Sbjct: 200 AFEFIINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDY--EDVPENDEKSLKKAAAN 257

Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
            P+  A+ A    +Q+Y  GV    C      ++H V +VGY + S T
Sbjct: 258 QPISVAIEASGREFQFYDSGVFTSTCG---TQLDHGVTLVGYGSESGT 302


>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
          Length = 342

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 137/286 (47%), Gaps = 30/286 (10%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++ + + LF S   ++ K Y   +  + RF+ F  +L  I+E NK      +   G+ EF
Sbjct: 41  SIHKVIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKK---VSNYWLGLNEF 97

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DL+ EEFK + L         ++  K  D      + R           +P   DWR+ 
Sbjct: 98  ADLTHEEFKNKFLGFKGE----LAERK--DESIEQFRYRDFVD-------LPKSVDWRKK 144

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG+CWAFSTV   E ++ +  G L++LS QE+IDC    N GC+GG   
Sbjct: 145 GAVSPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGG--- 201

Query: 208 ALLDW--MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
            L+D+    V +  L  E EYP ++ +  C  K  +   V I  Y  D    +E S L  
Sbjct: 202 -LMDYAFAYVTRNGLHKEEEYPYIMSEGTCDEKRDASEKVTISGYH-DVPRNNEDSFLKA 259

Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A   P+  A+ A    +Q+Y GGV   +C   L   +H V  VGY
Sbjct: 260 LANQ-PISVAIEASGRDFQFYSGGVFDGHCGTEL---DHGVAAVGY 301


>gi|401430350|ref|XP_003886559.1| unnamed protein product, partial [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|356491516|emb|CBZ40966.1| unnamed protein product, partial [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 503

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 87/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 97  LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 153

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 154 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 202

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS V   E    L    L  LS Q+++ C  + N GC GG      DW+ 
Sbjct: 203 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 261

Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            N    L  E  YP +  +     C   +    G +I  +    +  SE ++   +A +G
Sbjct: 262 QNTNGHLYTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 319

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           P+  A++A ++  Y  GV+   C G    +NH V +VGYD
Sbjct: 320 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 356


>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 93/320 (29%), Positives = 143/320 (44%), Gaps = 30/320 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEK 62
           V N+  +  L+   FLA              E    +  +Y K Y+ S E ++R   F++
Sbjct: 6   VLNISSLALLLVFGFLAFEANARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKE 65

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           ++  IE  N     P   + GI +F+DL+ EEFK R+                   H   
Sbjct: 66  NVQRIEAFNNAGNKP--YKLGINQFADLTNEEFKARN---------------RFKGHMCS 108

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
              R+ T      + +P   DWR+ G +  +++Q  CG CWAFS V   E +  L  G L
Sbjct: 109 NSTRTPTFKYEDVSSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKL 168

Query: 183 SLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
             LS QE++DC   G + GC GG       ++  NK  L  E++YP    DA C   A +
Sbjct: 169 ISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNK-GLNTEAKYPYQGVDATCNANAEA 227

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
            +   IK +  D    SES++L  +A   P+  A++A    +Q+Y  G+   +C   L  
Sbjct: 228 KDAASIKGFE-DVPANSESALLKAVANQ-PISVAIDASGSEFQFYSSGLFTGSCGTEL-- 283

Query: 300 INHAVQIVGY---DNYSRTW 316
            +H V  VGY   D+ ++ W
Sbjct: 284 -DHGVTAVGYGVSDDGTKYW 302


>gi|146078033|ref|XP_001463431.1| cathepsin L-like protease [Leishmania infantum JPCM5]
 gi|134067516|emb|CAM65796.1| cathepsin L-like protease [Leishmania infantum JPCM5]
          Length = 381

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 84/270 (31%), Positives = 133/270 (49%), Gaps = 26/270 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS V   ES  A     L  LS Q+++ C    N GC+GG      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201

Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
            +   ++  E  YP    +   A C   +    G +I  Y    +IPS  +++   +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWLAEN 258

Query: 270 GPVIAAVNALTWQYYLGGV--IQYNCDGSL 297
           GP+  AV+A ++  Y  GV  + YN  G +
Sbjct: 259 GPIAIAVDASSFMSYQSGVLLVGYNKTGGV 288


>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
          Length = 344

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 99/315 (31%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ ++L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMSILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N +V  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V+NQ  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S AN INHAV  +GY
Sbjct: 279 SCANRINHAVTAIGY 293


>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
 gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
          Length = 378

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 96/292 (32%), Positives = 142/292 (48%), Gaps = 24/292 (8%)

Query: 31  EQKLELFSSFQQRYKK-SYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
           E   ELF  +  R++K +Y+  E  +R F+ F+ +L  I+E N+      S   G+ EF+
Sbjct: 42  ESLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDETNRK---VSSYWLGLNEFA 98

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK---------RSITTGITIPTGIP 139
           DL+ +EFK  +L  S +       H HHD      ++         R    G+     +P
Sbjct: 99  DLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSSSSFRFRYEGVDAAR-LP 157

Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNM 199
              DWR  G +  V+NQ  CG+CWAFSTV   E ++ +  G L+ LS QE++DC  +GN 
Sbjct: 158 KSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDTDGNN 217

Query: 200 GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSE 259
           GC+GG       ++  N   L  E  YP L+++  C R  +S   V I  Y  D    +E
Sbjct: 218 GCNGGLMDYAFSYIAHNG-GLHTEEAYPYLMEEGTCSR-GSSAAVVTISGYE-DVPRNNE 274

Query: 260 SSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            ++L  +A H PV  A+ A     Q+Y GGV    C      ++H V  VGY
Sbjct: 275 QALLKALA-HQPVSVAIEASGRNLQFYSGGVFDGPCG---TQLDHGVAAVGY 322


>gi|297801998|ref|XP_002868883.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314719|gb|EFH45142.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 368

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 87/298 (29%), Positives = 143/298 (47%), Gaps = 36/298 (12%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
           V  ++P +    + FS F+ ++ K Y S  EHD RF  F+ +L       ++++   SAR
Sbjct: 37  VGGAEPQVLTSEDHFSLFKSKFGKVYASNEEHDYRFSVFKANL---RRARRHQKLDPSAR 93

Query: 82  YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPV 140
           +G+T+FSDL+  EF+ +HL       +    +K                   +PT  +P 
Sbjct: 94  HGVTQFSDLTRSEFRKKHLGVRAGFKLPKDANK----------------APILPTENLPE 137

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC------- 193
             DWR+ G +  V+NQ +CG+CW+FS     E  + L  G L  LS Q+++DC       
Sbjct: 138 DFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPE 197

Query: 194 -AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
            AG+ + GC+GG   +  ++  +    L  E +YP   KD    +   S     + +++ 
Sbjct: 198 EAGSCDSGCNGGLMNSAFEYT-LKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSV 256

Query: 253 DTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            ++   E  I  ++  +GP+  A+NA   Q Y+GGV   Y C      +NH V +VGY
Sbjct: 257 ISI--DEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYIC---TRRLNHGVLLVGY 309


>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
 gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|223946183|gb|ACN27175.1| unknown [Zea mays]
 gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 385

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 92/298 (30%), Positives = 139/298 (46%), Gaps = 35/298 (11%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           E   ELF  +  R++++Y+  E  +R F+ F+ +L  I+E N+      S   G+ EF+D
Sbjct: 53  ESLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRK---VSSYWLGLNEFAD 109

Query: 90  LSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           L+ +EFK  +L  R SV                   +     +       +P   DWR  
Sbjct: 110 LTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGAS-------LPKSVDWRSK 162

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG+CWAFSTV   E ++ +  G L+ LS QE+IDC  +GN GC+GG   
Sbjct: 163 GAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNNGCNGGLMD 222

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS----PNG----------VKIKSYTCD 253
               ++  N   L  E  YP L+++  C+R ++S    P            V I  Y  D
Sbjct: 223 YAFSYIAHNG-GLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDDAAVVTISGYE-D 280

Query: 254 TLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               +E ++L  +A   PV  A+ A    +Q+Y GGV    C      ++H V  VGY
Sbjct: 281 VPRNNEQALLKALAQQ-PVSVAIEASGRNFQFYSGGVFDGPCG---TQLDHGVAAVGY 334


>gi|118365752|ref|XP_001016096.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297863|gb|EAR95851.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 336

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 93/311 (29%), Positives = 153/311 (49%), Gaps = 28/311 (9%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN--FEKSL 64
           +L I+ L+ LC LA  + +      +KL  ++ +  ++++ Y  +EH+  F+   F ++L
Sbjct: 7   LLSIIMLMPLC-LAQNISI------EKLLTYNKWSSQHQRVY-LNEHEKLFRQMVFFENL 58

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHV 123
             I+E N +     S    + +FSD+++EEF  + L +  +  H +    +   H+ ++ 
Sbjct: 59  QKIQEHNSDPNKTYSIH--LNQFSDMTKEEFAEKILMKQDLVNHFIKEMDQQVTHNDSNS 116

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
           + +  +  +TI   I    DWR  G +  V+NQ +CG+CW FS     ES + +KN  L 
Sbjct: 117 ETQLNSKSLTIAASI----DWRTKGAVTSVKNQGSCGSCWTFSAAALMESFNFIKNKVLV 172

Query: 184 LLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
             S Q+++DC     G  + GCSGG   + LD+   +KV +    +YP +     C    
Sbjct: 173 DFSEQQLVDCVTPANGYQSYGCSGGWPVSCLDY--ASKVGITTLDKYPYVAVQKNCNVTG 230

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
           T+ NG K K +     IP+ S+         PV   V+A  W  Y  G+    CD S  N
Sbjct: 231 TN-NGFKPKGW---IYIPNTSNEFKTALNFSPVSVIVDATNWGNYQSGIFN-GCDQSHIN 285

Query: 300 INHAVQIVGYD 310
            NHAV +VGYD
Sbjct: 286 YNHAVLVVGYD 296


>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 137/286 (47%), Gaps = 30/286 (10%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++ + + LF S+  ++ K Y   +  + RF+ F  +L  I+E NK   +      G+ EF
Sbjct: 41  SIHKVIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVSN---YWLGLNEF 97

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DL+ EEFK + L         ++  K          + S   G      +P   DWR+ 
Sbjct: 98  ADLTHEEFKHKFLGFKGE----LAERK---------DESSKEFGYRDFVDLPKSVDWRKK 144

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG+CWAFSTV   E ++ +  G L++LS QE+IDC    N GC+GG   
Sbjct: 145 GAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGG--- 201

Query: 208 ALLDW--MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
            L+D+    V +  L  E EYP ++ +  C  K      V I  Y  D     E+S L  
Sbjct: 202 -LMDYAFAYVMRSGLHKEEEYPYIMSEGTCDEKKDVSEKVTISGYH-DVPRNDEASFLKA 259

Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A   P+  A+ A    +Q+Y GGV   +C   L   +H V  VGY
Sbjct: 260 LANQ-PISVAIEASGRDFQFYSGGVFDGHCGTEL---DHGVAAVGY 301


>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
          Length = 461

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 81/285 (28%), Positives = 144/285 (50%), Gaps = 29/285 (10%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           ++ + ++ S+  ++ KSY+   E + RF+ F+ +L  I+E N   ++    + G+  F+D
Sbjct: 40  DEVMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAESRT---YKVGLNRFAD 96

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ +E+++ +L         +S  K  D +           G ++P  +    DWRE G 
Sbjct: 97  LTNDEYRSMYLGARTGSRRRLSTQKRSDRY-------VPVAGESLPDSV----DWREKGA 145

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  V++Q +CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG    L
Sbjct: 146 VVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGG----L 201

Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
           +D+     +    ++ E +YP   +D  C +   +   V I  Y  D  + +E ++   +
Sbjct: 202 MDYAFEFIIKNGGIDTEEDYPYNARDGRCDQYRKNAKVVTIDDYE-DVPVNNEQALQKAV 260

Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           A   PV  A+ A  + +Q+Y  GV   NC  +L   +H V  VGY
Sbjct: 261 ANQ-PVSVAIEASGMAFQFYESGVFTGNCGTAL---DHGVTAVGY 301


>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
 gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
          Length = 1810

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 144/281 (51%), Gaps = 28/281 (9%)

Query: 36   LFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
            +F  F+  +++ Y+ S EH++RF  F  +L  IE+LNK  +   +A+YG+T+F+D++  E
Sbjct: 1499 MFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERG--TAKYGVTKFADMTVAE 1556

Query: 95   FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
            ++        +  +++  H   +H  N V       G+     +P   DWR+ G + +V+
Sbjct: 1557 YR-------AHTGLVVPKHDRANHVGNRVASEEDVAGVG---DLPRSFDWRDHGAVTEVK 1606

Query: 155  NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
            NQ +CG+CWAFS V   E +H +K   L   S QE+IDC    N GC GG     +D  D
Sbjct: 1607 NQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVDN-GCGGG----YMD--D 1659

Query: 215  VNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
              K +     LE E++YP   K         S + V++K      +  +E+ I   +  +
Sbjct: 1660 AFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAV--DMPKNETYIAKYLIKN 1717

Query: 270  GPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
            GP+   +NA   Q+Y GG+   ++   +  +I+H V IVGY
Sbjct: 1718 GPIAIGLNANAMQFYRGGISHPWHPLCNHKSIDHGVLIVGY 1758


>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 85/279 (30%), Positives = 136/279 (48%), Gaps = 29/279 (10%)

Query: 35  ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           ++F++F ++Y K+YS +E   RF  F+ +++ I     N  +  S   G+ EF+DLS EE
Sbjct: 40  DMFTAFMKQYSKAYSHAEFSSRFNQFKANVETIRL--HNTLANASYTMGLNEFADLSFEE 97

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           FK ++  +   KHV     + ++ H                   P   DWR +  +  ++
Sbjct: 98  FKGKYFGY---KHVEREFARSNNLHQE-------------VEAAPTSIDWRTSNAVTPIK 141

Query: 155 NQQTCGACWAFSTVETAESMHALKNG-TLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDW 212
           +Q  CG+CWAFS   + E    L+   TL+ LS Q+++DC+ + G+ GC+GG      ++
Sbjct: 142 DQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEY 201

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           +  NK +   ES YP       C++  T    V I  Y  D     E+S+L  + T GPV
Sbjct: 202 IIANKGIC-AESAYPYKGVGGLCQKSCTKV--VTISGYK-DVASGDEASLLNAVGTVGPV 257

Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A+ A    +Q+Y  GV    C     N++H V  VGY
Sbjct: 258 SVAIEADQAGFQFYSSGVFSGTCG---HNLDHGVLAVGY 293


>gi|28194643|gb|AAO33583.1|AF479265_1 cathepsin P [Meriones unguiculatus]
          Length = 334

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 96/307 (31%), Positives = 151/307 (49%), Gaps = 36/307 (11%)

Query: 11  VALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           V ++ LCF LA+   V  PNL+ + E    ++++YKK+YS     +R   +E+++ I++ 
Sbjct: 5   VFVVILCFGLALGASVHDPNLDAQWE---EWKEKYKKNYSPEVEAVRRAIWEENMRIVKL 61

Query: 70  LN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            N +N          +  F DL+  EF+       V   + +                  
Sbjct: 62  HNGENGLGKNGFTMELNSFGDLTGGEFRNPMADIPVPAALTVERKDKK------------ 109

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                I  G+P  K+W   G +  VRNQ TCG+CWAF+     E     K G L+ LSVQ
Sbjct: 110 -----IVDGLPKFKNWINEGYVTPVRNQGTCGSCWAFAATGAIEGQMFWKTGKLTPLSVQ 164

Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVV-LEPESEYPLLLKDAACKRKATSPNGVK 246
            ++DC+   GN GC+ G   A   +M VN+   L+ E  YP   K   C+  +++     
Sbjct: 165 NLVDCSEKQGNKGCAQGS--AFRAFMYVNETKGLQDEISYPYEGKQGTCRYNSSNS---- 218

Query: 247 IKSYTCD-TLIP-SESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINH 302
            ++Y  D  L+P +E  +L  +A+ GPV AAV+A   ++++Y GG I Y    S  ++NH
Sbjct: 219 -RAYVTDFRLLPQNEIYLLVAVASIGPVAAAVDASQDSFRFYRGG-IYYEPKCSQYSVNH 276

Query: 303 AVQIVGY 309
           AV +VGY
Sbjct: 277 AVLVVGY 283


>gi|343471272|emb|CCD16264.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 447

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 96/317 (30%), Positives = 155/317 (48%), Gaps = 30/317 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACLVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRMFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +S+   E   +   +   A +G+T+FSD+S EE +  +L  +          K++     
Sbjct: 67  QSM---ERAKEEAAANPYATFGVTQFSDMSPEELRATYLNGA----------KYYAAALK 113

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +K      + + TG  P   DWR+ G +  V++Q+ CG+CWAFS     E    +   
Sbjct: 114 RPRKV-----VNVSTGKAPPAVDWRKKGAVTPVKDQRKCGSCWAFSATGNIEGQWKVAGH 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L+ LS Q ++ C  N + GC GG     L W+   NK  +  E  YP    D       
Sbjct: 169 ELTSLSEQMLVSC-DNMDDGCQGGLMDRALKWIVSSNKGNVFTEESYPYDSTDGDVPPCN 227

Query: 240 TSPN--GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
            S    G KI  +    L   E++I   +A +GPV  AV+A ++  Y GGV+  +C  S 
Sbjct: 228 MSGKVVGAKISGHI--NLPKDENAIAEWLAKNGPVAIAVDASSFLDYKGGVLT-SC--SS 282

Query: 298 ANINHAVQIVGYDNYSR 314
             +NH V +VGYD+ S+
Sbjct: 283 DALNHDVLLVGYDDTSK 299


>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
 gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 93/316 (29%), Positives = 150/316 (47%), Gaps = 35/316 (11%)

Query: 2   FDVKNVLFIVALI----ALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIR 56
           F  +N    +ALI    AL   A+   +   ++ +K E + S   R+ + Y+  +E +IR
Sbjct: 3   FTTRNGCISLALIFLLGALVSQAMARTLQDASMHEKHEEWMS---RFGRVYNDGNEKEIR 59

Query: 57  FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
           +K F++++  IE  NK   S +S + GI +F+DL+ EEFKT   R+    H+  S     
Sbjct: 60  YKIFKENVQRIESFNK--ASGKSYKLGINQFADLTNEEFKTS--RNRFKGHMCSSQAGPF 115

Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
            + +               T  P   DWR+ G +  +++Q  CG+CWAFS V   E +  
Sbjct: 116 RYEN--------------LTAAPSSMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQ 161

Query: 177 LKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
           L    L  LS QE++DC   G + GC GG       +++ N+  L  E+ YP    D  C
Sbjct: 162 LATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQ-GLTTEANYPYEGSDGTC 220

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
             K  + +  KI  +  D    +E +++  +A   PV  A++A    +Q+Y  G+   +C
Sbjct: 221 NTKQEANHAAKINGFE-DVPANNEGALMKAVAKQ-PVSVAIDAGGFGFQFYSSGIFTGDC 278

Query: 294 DGSLANINHAVQIVGY 309
              L   +H V  VGY
Sbjct: 279 GTEL---DHGVAAVGY 291


>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
          Length = 339

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 97/311 (31%), Positives = 144/311 (46%), Gaps = 43/311 (13%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLD 65
           +LFI+A  A    A    + + ++ ++ E    +  RY + Y  + E + RFK F+ ++ 
Sbjct: 14  LLFILA--AWASQATSRSLHEASMYERHE---DWMARYGRMYKDANEKEKRFKIFKDNVA 68

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
            IE  NK     ++ +  I EF+DL+ EEF  R LR+    H+                 
Sbjct: 69  RIESFNKAMD--KTYKLSINEFADLTNEEF--RSLRNRFKAHIC---------------S 109

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
            + T      T +P   DWR+ G +  +++QQ CG CWAFS V   E +  +  G L  L
Sbjct: 110 EATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISL 169

Query: 186 SVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVV----LEPESEYPLLLKDAACKRKAT 240
           S QE++DC  G  N GCSGG    L+D  D  + +    L  E+ YP    D  C  K  
Sbjct: 170 SEQELVDCDTGGENQGCSGG----LMD--DAFRFIKIHGLASEATYPYEGDDGTCNSKKE 223

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLA 298
           +    KIK Y  D    +E ++   +A H PV  A++A    +Q+Y  GV    C   L 
Sbjct: 224 AHPAAKIKGYE-DVPANNEKALQKAVA-HQPVAVAIDAGGFEFQFYTSGVFTGQCGTEL- 280

Query: 299 NINHAVQIVGY 309
             +H V  VGY
Sbjct: 281 --DHGVAAVGY 289


>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
          Length = 314

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/320 (29%), Positives = 147/320 (45%), Gaps = 35/320 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI--RFKNFE 61
           +K VL    L+A+  LA+P+  S  N  +    + S++ +Y K+Y  +E++   R   F 
Sbjct: 1   MKTVLAFACLVAVG-LALPL--SDDNQAE----WESYKAKYGKTYESNENEAARRTIYFM 53

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
               ++E   +  Q   S + G+  F+D+   EF+                 K  + +  
Sbjct: 54  AKEKVMEHNARFEQGLVSYKLGLNSFADMHNGEFR-----------------KMMNGYRR 96

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
              + S+   +     +P   DWR  G +  ++NQ  CG+CWAFST  + E  HALK G 
Sbjct: 97  GTPRNSVVVHVESNITLPASVDWRTKGAVTPIKNQGQCGSCWAFSTTGSLEGQHALKKGK 156

Query: 182 LSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
           L  LS QE++DC A  GN GC GG       ++  N  + + E  YP   +D  C  K  
Sbjct: 157 LVSLSEQELVDCSAAEGNDGCDGGLMDDAFTYIKKNNGI-DTEQSYPYTGEDGTCSFK-K 214

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTW--QYYLGGVIQYNCDGSLA 298
           S     +  +  D    SES +    AT GP+  A++A +W  Q Y  GV   + D S  
Sbjct: 215 SDVAATVTGFV-DVTSGSESGLQDASATIGPISVAIDASSWDFQLYESGVYDVS-DCSTT 272

Query: 299 NINHAVQIVGY--DNYSRTW 316
            ++H V +VGY  D+ +  W
Sbjct: 273 ELDHGVLVVGYGTDDGTAYW 292


>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
 gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
          Length = 1834

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 144/281 (51%), Gaps = 28/281 (9%)

Query: 36   LFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
            +F  F+  +++ Y+ S EH++RF  F  +L  IE+LNK  +   +A+YG+T+F+D++  E
Sbjct: 1523 MFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERG--TAKYGVTKFADMTVAE 1580

Query: 95   FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
            ++        +  +++  H   +H  N V       G+     +P   DWR+ G + +V+
Sbjct: 1581 YR-------AHTGLVVPKHDRANHVGNRVASEEDVAGVG---DLPRSFDWRDHGAVTEVK 1630

Query: 155  NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
            NQ +CG+CWAFS V   E +H +K   L   S QE+IDC    N GC GG     +D  D
Sbjct: 1631 NQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVDN-GCGGG----YMD--D 1683

Query: 215  VNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
              K +     LE E++YP   K         S + V++K      +  +E+ I   +  +
Sbjct: 1684 AFKAIEQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAV--DMPKNETYIAKYLIKN 1741

Query: 270  GPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
            GP+   +NA   Q+Y GG+   ++   +  +I+H V IVGY
Sbjct: 1742 GPIAIGLNANAMQFYRGGISHPWHPLCNHKSIDHGVLIVGY 1782


>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
          Length = 323

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 138/286 (48%), Gaps = 30/286 (10%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++E+ + LF S+     K Y   +  I RF+ F+ +L  I+E NK   S      G+ EF
Sbjct: 14  SIERLVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKKNSS---YWLGLNEF 70

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DL+ +EFK +++        ++      +  + HV        +  P  I    DWR+ 
Sbjct: 71  ADLTHDEFKAKYVGSLGEDSTIIEQSDDEEFPYKHV--------VDYPESI----DWRQK 118

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG+CWAFSTV T E ++ +  G L  LS QE++DC    + GC GG   
Sbjct: 119 GAVTPVKNQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRRSH-GCKGGYQT 177

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
             L ++  N V    E EYP   K   C+ K    + VKI  Y     +P+  E S++  
Sbjct: 178 TSLQYVADNGV--HTEKEYPYEKKQGKCRAKDKKGSKVKITGY---KRVPANNEVSLIQA 232

Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           IA   PV   V +    +Q+Y GG+ +  C      ++HAV  VGY
Sbjct: 233 IANQ-PVSVVVESKGRAFQFYKGGIFEGPCG---TKVDHAVTAVGY 274


>gi|118365722|ref|XP_001016081.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297848|gb|EAR95836.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 337

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 92/310 (29%), Positives = 155/310 (50%), Gaps = 25/310 (8%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
           +L I+ L+ LCF A  + +      +KL  ++ +  ++++ Y ++ E   R   F ++L 
Sbjct: 7   LLSIIVLMPLCF-AQDISI------EKLLAYNKWSSQHQRVYLNEDEKLFRQMVFFENLQ 59

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVK 124
            I+E N N  +  S    + +FSD++++EF  + L + ++  H++    +   H+  + K
Sbjct: 60  KIKEHNSNPNNTYSIH--LNQFSDMTKQEFAEKILMKQNIVDHLMKGISQEATHNDTNNK 117

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
           +  + +   I   +    DWRE G I  V+NQ  CG+CW+FS     ES + ++N TL  
Sbjct: 118 ETQLNSKSLI---LADSIDWREQGAITTVKNQGNCGSCWSFSAAALMESFNFIQNNTLVD 174

Query: 185 LSVQEVIDCA--GNG--NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            S Q+++DC    NG  + GCSGG     LD+   +KV +    +YP +     C    T
Sbjct: 175 FSEQQLVDCVIPANGYYSYGCSGGAAVYCLDY--ASKVGITTLDKYPYVRIQKNCNVTGT 232

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
           + NG K K +     +P+ S+ L       PV   V+A  W  Y  G+    CD S  ++
Sbjct: 233 N-NGYKPKQW---IKVPNTSNDLKSALNFSPVSVVVDATNWDNYESGIFN-GCDQSNISL 287

Query: 301 NHAVQIVGYD 310
           NHAV  +GYD
Sbjct: 288 NHAVLAIGYD 297


>gi|7211745|gb|AAF40416.1|AF216785_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
 gi|7381223|gb|AAF61442.1|AF138266_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
          Length = 366

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 85/284 (29%), Positives = 140/284 (49%), Gaps = 36/284 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F++R+ K+Y S  EHD R   F+ ++       +++Q   +A +G+T+FSDL+  EF
Sbjct: 49  FAVFKRRFGKAYASDEEHDYRLSVFKANM---RRAKRHQQLDPAAVHGVTQFSDLTPTEF 105

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L   +N+ +                     T   +PT  +P   DWR+ G +  V+
Sbjct: 106 RRKFL--GLNRRLKFPADAK--------------TAPILPTDELPSDFDWRDRGAVTPVK 149

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ TCG+CW+FST    E  + L  G L  LS Q+++DC        AG+ + GC+GG  
Sbjct: 150 NQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM 209

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
            +  ++  +    L  E +YP    D    R   +    K+ +++  +L   E  I  ++
Sbjct: 210 NSAFEYT-LKAGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSL--DEDQIAANL 266

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             +GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 267 VKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 307


>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
          Length = 343

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 99/330 (30%), Positives = 155/330 (46%), Gaps = 49/330 (14%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFKNF 60
           K + +I   + +C     V+V+   L Q   ++   QQ   +Y K Y+   E + RF+ F
Sbjct: 5   KQLYYISLALLMCLGLWAVQVTSRTL-QDASMYERHQQWMGQYAKIYNDHQEWEKRFQIF 63

Query: 61  EKSLDIIEELNKNRQSPESARY---GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
           +++++ IE  NK     E  R+   G+ +F DL+ EEF     R+    H+  S  + + 
Sbjct: 64  KENVNYIETSNK-----EGGRFYKLGVNQFVDLTNEEFIAP--RNRFKGHMCSSIIRTNT 116

Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
           + + +V            T +P   DWR+ G +  V++Q  CG CWAFS V   E +H L
Sbjct: 117 YKYENV------------TTVPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQL 164

Query: 178 KNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLK 231
             G L  LS QE++DC   G + GC GG    L+D  D  K +     L+ E++YP    
Sbjct: 165 STGKLISLSEQELVDCDTKGVDQGCEGG----LMD--DAFKFIIQNHGLDTEAKYPYQGV 218

Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
           D  C     S N   I SY  D    +E ++   +A   P+  A++A    +Q+Y  GV 
Sbjct: 219 DGTCNANEASINAATITSYE-DVPTNNEQALQKAVANQ-PISVAIDASGSDFQFYTSGVF 276

Query: 290 QYNCDGSLANINHAVQIVGY---DNYSRTW 316
             +C   L   +H V  VGY   D+ ++ W
Sbjct: 277 TGSCGTEL---DHGVTAVGYGVSDDGTKYW 303


>gi|357148994|ref|XP_003574963.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
          Length = 377

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 88/293 (30%), Positives = 141/293 (48%), Gaps = 35/293 (11%)

Query: 31  EQKLEL---FSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGITE 86
           +  LEL   F+SF QR+ K+Y  +E H  R   F+ +L       +++    SA +GIT+
Sbjct: 44  DNDLELSSHFTSFVQRFGKTYKDAEEHAHRLSVFKANL---RRARRHQLLDPSAEHGITK 100

Query: 87  FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWR 145
           FSDL+  EF+   L    ++   +       H               +PT G+P   DWR
Sbjct: 101 FSDLTPAEFRRTFLGLKTSRRSFLREIGGSAH-----------DAPVLPTDGLPDDFDWR 149

Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNG 197
           + G +G V+NQ +CG+CW+FS     E  + L  G + +LS Q+ +DC          + 
Sbjct: 150 DHGAVGPVKNQGSCGSCWSFSASGALEGANYLATGKMEVLSEQQFVDCDHECDPEEPDSC 209

Query: 198 NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
           + GC+GG   +   ++ +    LE E +YP   +D  CK    S     +++++  ++  
Sbjct: 210 DAGCNGGLMTSAFSYL-LKSGGLEREKDYPYTGRDGTCKFD-KSKIVASVQNFSVVSV-- 265

Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            E  I  ++  HGP+   +NA   Q Y+GGV   Y C  SL   +H V +VGY
Sbjct: 266 DEEQIAANLVKHGPLAIGINAAYMQTYIGGVSCPYICGRSL---DHGVLLVGY 315


>gi|394331828|gb|AFN27133.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 85/281 (30%), Positives = 138/281 (49%), Gaps = 27/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTVAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWR+ G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWRKKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           +Q  CG+CWAFS V + ES  AL    L+ LS   ++ C    N G   G      +W+ 
Sbjct: 143 DQGACGSCWAFSAVGSIESQWALAGHRLTALSDHHLVSCHDKDN-GRPAGLMLQAFEWLL 201

Query: 214 -DVNKVVLEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++N  +   E  YP +        C   +    G +I  Y   T+  SE+ +   +A +
Sbjct: 202 RNMNGTMFT-EDSYPYVSSSGYVPECSNSSQLVPGARIDGYV--TIESSETVMAAWLAKN 258

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  A++A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 259 GPISIALDASSFMSYQSGVVT-SCAG--MPLNHGVLLVGYN 296


>gi|14422331|emb|CAC41636.1| early leaf senescence abundant cysteine protease [Pisum sativum]
          Length = 350

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 100/320 (31%), Positives = 152/320 (47%), Gaps = 41/320 (12%)

Query: 7   VLFIVALIALCFL---AIPVKVSKPNLEQKLEL---------FSSFQQRYKKSY-SKSEH 53
           VLF VA  A  F    + P+++     EQ L++         F+ F  RY K Y S  E 
Sbjct: 9   VLFCVASAAAGFSFHDSNPIRMVSDVEEQLLQVIGESRHAVSFARFANRYGKRYDSVDEM 68

Query: 54  DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVLMSH 112
            +RFK F +++++I   NK R S    + G+  F+D + EEF++  L  + N    L  +
Sbjct: 69  KLRFKIFSENIELIRSSNKRRLS---YKLGVNHFADWTWEEFRSHRLGAAQNCSATLKGN 125

Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
           HK  D +                  +P +KDWR+ GI+  V++Q +CG+CW FST    E
Sbjct: 126 HKITDAN------------------LPDEKDWRKEGIVSGVKDQGSCGSCWTFSTTGALE 167

Query: 173 SMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK 231
           S +A   G    LS Q+++DCAG   N GCSGG      +++  N   LE E  YP    
Sbjct: 168 SAYAQAFGKNISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYIKYNG-GLETEEAYPYTGS 226

Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQ 290
           +  CK ++     VK+   + +  + +E  +   IA   PV  A   +  ++ Y  GV  
Sbjct: 227 NGLCKFRSEHV-AVKVLG-SVNITLGAEDELKHAIAFARPVSVAFEVVHDFRLYKSGVYT 284

Query: 291 YN-CDGSLANINHAVQIVGY 309
              C  +  ++NHAV  VGY
Sbjct: 285 STACGSTPMDVNHAVLAVGY 304


>gi|326431661|gb|EGD77231.1| cysteine protease [Salpingoeca sp. ATCC 50818]
          Length = 347

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 91/284 (32%), Positives = 134/284 (47%), Gaps = 25/284 (8%)

Query: 37  FSSFQQRYKKSYSKSEHDIRFKN-FEKSLDIIEELNKNRQSP-ESARYGITEFSDLSEEE 94
           F  F+ +Y K Y  +E + R    F++SLD IE+ N    +   +   G+ EF+DL+ EE
Sbjct: 31  FEEFKDKYNKVYESAEEEARRAAIFQESLDFIEKHNAEAAAGMHTYLVGVNEFADLTREE 90

Query: 95  FKTRH---LRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
           F+  H   L    +K   ++   H D H  H    +  +     +GI    DWR+ G + 
Sbjct: 91  FRQHHVTRLPFDDDKRDPVTATLHLDEHAVHAADSNGDS-----SGI----DWRKRGAVT 141

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD 211
            VRNQ  CG    F+ VE  E MHA+ +G L  LS Q+VIDC  +G  GCSGG   +   
Sbjct: 142 PVRNQGQCGNPAIFAAVEAVEGMHAISSGNLVELSTQQVIDC--SGTPGCSGGSLVSFFK 199

Query: 212 WMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
           ++  N   L+  ++YP       C +   + +  K+  Y+   + P   + L       P
Sbjct: 200 YIARNG-GLDSAADYPTSGAGGQCNKAKEARHVAKVGGYS--VVPPRNETKLAAAVFKMP 256

Query: 272 VIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY-DNY 312
           V  A+ A T  +Q Y  GV    C   L   +HAV +VGY D Y
Sbjct: 257 VAVAIEADTPSFQMYTSGVYSGPCGTQL---DHAVLVVGYTDEY 297


>gi|318816588|ref|NP_001187996.1| cathepsin L precursor [Ictalurus punctatus]
 gi|308324547|gb|ADO29408.1| cathepsin L [Ictalurus punctatus]
          Length = 334

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 93/316 (29%), Positives = 157/316 (49%), Gaps = 31/316 (9%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNF---EKSLDI 66
           ++ + AL  LA    +S  +LE     F S++ ++ K Y   E + + KN     + L +
Sbjct: 4   LIVITALVALASATSISLEDLE-----FHSWKLKFGKIYKSVEEESQRKNTWLENRKLVL 58

Query: 67  IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           +  +  + Q  +S R G+T F+D+  +E+     R SV K  L S ++   H  +    +
Sbjct: 59  VHNMLAD-QGIKSYRLGMTYFADMDNQEY-----RQSVFKGCLGSFNRTKGHRASTFLLQ 112

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
           +   G  +P  +    DWR+ G + +V++Q+ CG+CWAFS   + E     K G L  LS
Sbjct: 113 A--GGAVLPDTV----DWRDKGYVAEVKDQKNCGSCWAFSATGSLEGQTFRKTGKLVSLS 166

Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
            Q+++DC+G  GNMGC GG      ++++ NK + + E  YP    D  C+ K  +  G 
Sbjct: 167 EQQLVDCSGKYGNMGCGGGLMDLAFEYIEDNKGI-DTEESYPYEATDGDCRFKPATV-GA 224

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINH 302
               Y  D     E+++   +A  GP+  A++A  +++Q Y  G+  + NC  S  +++H
Sbjct: 225 TCTGYV-DINSEDENALQKAVANIGPISVAIDAGHISFQLYGSGIYNEPNC--SSEDLDH 281

Query: 303 AVQIVGY--DNYSRTW 316
            V  VGY  DN    W
Sbjct: 282 GVLAVGYGTDNQQDYW 297


>gi|242061538|ref|XP_002452058.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
 gi|241931889|gb|EES05034.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
          Length = 371

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 90/295 (30%), Positives = 142/295 (48%), Gaps = 37/295 (12%)

Query: 31  EQKLEL-----FSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGI 84
           + +LEL     F SF QR+ KSY  +E H  R   F+ +L       +++    SA +G+
Sbjct: 37  DNELELNAESHFLSFVQRFGKSYKDAEEHAYRLSIFKANL---RRARRHQLLDPSAEHGV 93

Query: 85  TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKD 143
           T+FSDL+  EF+  +L    ++  L+               +S      +PT G+P   D
Sbjct: 94  TKFSDLTPAEFRRTYLGLRKSRRALLRE-----------LGKSANEAPVLPTDGLPDDFD 142

Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AG 195
           WR+ G +  V+NQ +CG+CW+FST    E  H L  G L +LS Q+++DC          
Sbjct: 143 WRDHGAVTPVKNQGSCGSCWSFSTSGALEGAHYLATGKLEVLSEQQMVDCDHVCDTSEPD 202

Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
           + + GC+GG       ++      LE E +YP    D  CK    S     +++++  ++
Sbjct: 203 SCDSGCNGGLMTNAFSYLQ-KAGGLESEKDYPYTGSDDKCKFD-KSKIVASVQNFSVVSV 260

Query: 256 IPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
              E  I  ++  HGP+   +NA   Q Y+GGV   Y C  +L   +H V +VGY
Sbjct: 261 --DEGQIAANLIKHGPLAIGINAAYMQTYIGGVSCPYICGRTL---DHGVLLVGY 310


>gi|146084829|ref|XP_001465113.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
 gi|134069209|emb|CAM67356.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
          Length = 354

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 102/322 (31%), Positives = 157/322 (48%), Gaps = 43/322 (13%)

Query: 4   VKNVLFIV----ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFK 58
           V  +LF+V    ALIA   L +   ++  +       +  F++R+ K + + +E   RF 
Sbjct: 12  VVTILFVVCYGSALIAQTPLGVDDFIASAH-------YGRFKKRHGKPFGEDAEEGRRFN 64

Query: 59  NFEKSLDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
            F++++     LN +      A Y ++ +F+DL+ +EF   +L    N +    H K + 
Sbjct: 65  AFKQNMQTAYFLNAHN---PHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYK 117

Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
            H  HV   S+ +G+       +  DWRE G++  V+NQ  CG+CWAF+T    E   AL
Sbjct: 118 EH-VHVDD-SVRSGV-------MSVDWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWAL 168

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM--DVNKVVLEPESEYPLLLKDAAC 235
           KN +L  LS Q ++ C  N + GC+GG     + W+  D N  V   E  YP     A  
Sbjct: 169 KNHSLVSLSEQVLVSCD-NIDDGCNGGLMQQAMQWIINDHNGTV-PTEDSYP--YTSAGG 224

Query: 236 KRKATSPN---GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
            R     N   G KIK Y   +L   E  I   +  +GPV  AV+A TWQ Y GGV+   
Sbjct: 225 TRPPCHDNGTVGAKIKGYM--SLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVVTL- 281

Query: 293 CDGSLANINHAVQIVGYDNYSR 314
           C G   ++NH V +VG++  ++
Sbjct: 282 CFG--LSLNHGVLVVGFNRQAK 301


>gi|312281839|dbj|BAJ33785.1| unnamed protein product [Thellungiella halophila]
          Length = 373

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 87/298 (29%), Positives = 145/298 (48%), Gaps = 36/298 (12%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESAR 81
           V  ++P +    + FS F++++ K Y+ SE HD R   F+ +L       ++++   SAR
Sbjct: 42  VDGAEPKVLSSEDHFSLFKRKFGKVYASSEEHDYRLSVFKANL---RRARRHQKLDPSAR 98

Query: 82  YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPV 140
           +G+T+FSDL+  EF+ +HL       V        D +   +          +PT  +P 
Sbjct: 99  HGVTQFSDLTRSEFRKKHL------GVRGGFKLPKDANKAPI----------LPTENLPE 142

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC------- 193
             DWR+ G +  V+NQ +CG+CW+FS     E  + L  G L  LS Q+++DC       
Sbjct: 143 DFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPE 202

Query: 194 -AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
            AG+ + GC+GG   +  ++  +    L  E +YP   KD    +   S     + +++ 
Sbjct: 203 EAGSCDSGCNGGLMNSAFEYT-LKTGGLMREEDYPYTGKDGPTCKLDKSKIVASVSNFSV 261

Query: 253 DTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            ++   E  I  ++  +GP+  A+NA   Q Y+GGV   Y C      +NH V +VGY
Sbjct: 262 ISI--DEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCPYIC---ARRLNHGVLLVGY 314


>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
          Length = 443

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 92/315 (29%), Positives = 145/315 (46%), Gaps = 33/315 (10%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
            + + AL+ +C       +  P  E    LF +F+  + ++Y S  E   RF+ F  ++ 
Sbjct: 3   TVIVAALLMVC-----NAMGAPTTEV---LFGNFKAAHARNYASPDEERKRFEIFAGNMK 54

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
               LN  R++P  A +G  EF+D++ EEF+TRH              K+         K
Sbjct: 55  KAAVLN--RKNP-MATFGPNEFADMTSEEFQTRHNAARHYAAAKARPPKNTKTFTAEEIK 111

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
            ++   I          DWR  G +  V+NQ  CG+CW+FST    E  HA+  G L  +
Sbjct: 112 AAVGQQI----------DWRLKGAVTPVKNQGACGSCWSFSTTGNIEGQHAIATGQLVAV 161

Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDA---ACKRKATS 241
           S QE++ C    + GC+GG       W+   +K  +  E+ YP +  +    AC     S
Sbjct: 162 SEQELVSCDPIDD-GCNGGLMDNAFGWLISAHKGQIATEANYPYVSGNGIVPACSSSPES 220

Query: 242 -PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
            P G  I ++    +  +E  +   +  HGP+   V+A TWQ Y GG++ Y C      I
Sbjct: 221 KPVGATISAF--QDIARTEEDMAAFVFKHGPLSIGVDASTWQSYAGGIMSY-CPQD--QI 275

Query: 301 NHAVQIVGYDNYSRT 315
           +H V IVG+D+ + T
Sbjct: 276 DHGVLIVGFDDTAST 290


>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
          Length = 352

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 85/280 (30%), Positives = 134/280 (47%), Gaps = 28/280 (10%)

Query: 34  LELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           + LF S+  ++ K Y     D +   FE  +D ++ ++   +   +   G+ EF+DL+ E
Sbjct: 46  IHLFESWLAKHSKIYESL--DEKLHRFEIFMDNLKHIDDTNKKVSNYWLGLNEFADLTHE 103

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           EFK + L             +  +     +++ S    + +P  +    DWR+ G +  V
Sbjct: 104 EFKNKFLGLK---------GELPERKDESIEEFSYRDFVDLPKSV----DWRKKGAVAPV 150

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW- 212
           +NQ  CG+CWAFSTV   E ++ +  G L++LS QE+IDC    N GC+GG    L+D+ 
Sbjct: 151 KNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGG----LMDYA 206

Query: 213 -MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
              V +  L  E EYP ++ +  C  K      V I  Y  D    +E S L  +A   P
Sbjct: 207 FAYVMRSGLHKEEEYPYIMSEGTCDEKKDVSETVTISGYH-DVPRNNEDSFLKALANQ-P 264

Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +  A+ A    +Q+Y GGV   +C   L   +H V  VGY
Sbjct: 265 ISVAIEASGRDFQFYSGGVFDGHCGTEL---DHGVAAVGY 301


>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/292 (32%), Positives = 141/292 (48%), Gaps = 39/292 (13%)

Query: 31  EQKLEL-FSSFQQRYKKSYSKSEHDIRFKN-FEKSLDIIEELNKNRQSPESA-RYGITEF 87
           E +LE  F  F+  + + Y   E ++  K+ F  +L  I   N +  + +S     +  F
Sbjct: 26  EGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNF 85

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DLS EEF+     +     V ++   H D   N V+             +P   DW   
Sbjct: 86  TDLSNEEFRATFNGYRRLAAVSLADSVHAD---NDVE------------ALPATVDWTTK 130

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDF 206
           G++  ++NQQ CG+CWAFS V + E  HALK G L  LS Q ++DC A  G+MGCSGG  
Sbjct: 131 GVVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGG-- 188

Query: 207 CALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
                WMD      +    ++ E+ YP    D +C+ K  S  G  I S+  D     ES
Sbjct: 189 -----WMDYAFKYVIQNRGIDTEASYPYKAIDESCEFKRNSV-GATIHSFV-DVKTGDES 241

Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
           ++   +A+ GP+  A++A   ++Q+Y  GV  YN  D S   ++H V  VGY
Sbjct: 242 ALQNAVASIGPISVAIDAAQPSFQFYSSGV--YNEPDCSTEILDHGVTAVGY 291


>gi|5853329|gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
          Length = 501

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 85/283 (30%), Positives = 143/283 (50%), Gaps = 24/283 (8%)

Query: 35  ELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           +LF  +++ + K+Y  + E ++R +NF+KS+  + E N  R+S      G+ +F+DLS E
Sbjct: 48  DLFGKWKELHGKTYQHEEEENLRLENFKKSVKFVMEKNSERKSELDHTVGLNKFADLSNE 107

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT---GIPVKKDWREAGII 150
           EFK  ++             K      N +K   +   +++ +     P   DWR+ G++
Sbjct: 108 EFKEMYMS------------KVKGSRSNELKMGGVKRNMSVSSRTCDAPTSLDWRDKGVV 155

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
             +++Q  CG+CWAFS   + ES +A+  G L  LS QE++DC    + GC GG+     
Sbjct: 156 TPMKDQGQCGSCWAFSVSGSIESANAIATGDLIRLSEQELVDC-DTYDYGCDGGNMDTAY 214

Query: 211 DWMDVNKVVLEPESEYPLLL---KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
            W+ +    L+ E +YP      +D  C +  ++ + V + SY    +  +E ++L  +A
Sbjct: 215 RWI-IKNGGLDSEDDYPYTSSNGRDGKCDKTKSAKSVVSLDSYV--EVESNEDAVLCAVA 271

Query: 268 THGPVIAAV-NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           T    I  V +A  +Q Y GGV    C     +I+HAV IVGY
Sbjct: 272 TTPVTIGIVGSAYDFQLYTGGVYNGQCSSKPYDIDHAVLIVGY 314


>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
           cysteine proteinase A-2; Flags: Precursor
 gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
          Length = 444

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 135/281 (48%), Gaps = 26/281 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y ++Y   +E   R  NFE++L+++ E     ++P  A++GIT+F DLSE E
Sbjct: 37  LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CWAFS V   E    L    L  LS Q+++ C  + N GC GG      DW+ 
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201

Query: 215 VNKVV-LEPESEYPLLLKDAACKRKATSPN----GVKIKSYTCDTLIPSESSILTDIATH 269
            N    L  E  YP +  +      + S      G +I  +    +  SE ++   +A +
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSEELVVGAQIDGHV--LIGSSEKAMAAWLAKN 259

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           GP+  A++A ++  Y  GV+   C G    +NH V +VGYD
Sbjct: 260 GPIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 297


>gi|118366325|ref|XP_001016381.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89298148|gb|EAR96136.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 337

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/317 (29%), Positives = 149/317 (47%), Gaps = 30/317 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY----SKSEHDIRFKN 59
           + N    +AL+AL   +   + + PN   +LE  +++ Q    +     + +E   R   
Sbjct: 1   MNNKFISLALVALLICSSLAQQTNPN--HQLEALTAYNQWKNNNLRIYINDAEKQYRQSV 58

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F ++   I+E N N+    + + G+ +FSD+++EEF  +         +LMS+ +     
Sbjct: 59  FLENFQKIKEHNANQ--ANTYQQGLNQFSDMTQEEFVQK---------ILMSNSQADSSQ 107

Query: 120 HNHVKKRSITTGITIPTGIPVKK--DWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
                + S        T  P+    DWR  G +  V+NQ  CG+CWAFS+    ES + +
Sbjct: 108 SLSAPQSSSNNQNLTATASPIAASVDWRTKGAVTPVKNQGNCGSCWAFSSTGAMESFNFI 167

Query: 178 KNGTLSLLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA 233
           KN  LS  S Q+++DCA    G  + GC+GG +   L  +  +KV ++ ES+YP      
Sbjct: 168 KNKVLSSFSEQQLVDCAIQQNGYYSHGCNGGSYYQAL--LYASKVGMKTESQYPYTAIWG 225

Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNC 293
            C+   T+ NG K  ++     +   +  L       PV  A++A     Y  GV   NC
Sbjct: 226 TCQVSGTN-NGYKPVAFGS---VGQNTLALQTALNAAPVSIAMDATNLYLYTSGVYN-NC 280

Query: 294 DGSLANINHAVQIVGYD 310
           + S  N+NHAV  VGYD
Sbjct: 281 NPSSINLNHAVLAVGYD 297


>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
          Length = 353

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 90/309 (29%), Positives = 144/309 (46%), Gaps = 25/309 (8%)

Query: 6   NVLFIVALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
           N+   + +++LC  LA       P+L+   +L+ S+   + K Y + E   R   +EK+L
Sbjct: 15  NMNVCLTILSLCLGLAFAAPRVDPDLDSHWQLWKSW---HSKDYHEREESWRRVVWEKNL 71

Query: 65  DIIEELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
            +IE  N +      S + G+ +F D++ EEF+            LM+ +KH      + 
Sbjct: 72  KMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQ-----------LMNGYKHKKSERKYR 120

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
             + +          P   DWRE G +  V++Q  CG+CWAFST    E  H  K G L 
Sbjct: 121 GSQFLEPSFLEA---PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLV 177

Query: 184 LLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
            LS Q ++DC+   GN GC+GG       ++  N  + + E  YP   KD    R     
Sbjct: 178 SLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGI-DSEESYPYTAKDDEDCRYKAEY 236

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
           N      +  D     E +++  +A+ GPV  A++A   ++Q+Y  G I Y  D S  ++
Sbjct: 237 NAANDTGFV-DIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSG-IYYEPDCSSEDL 294

Query: 301 NHAVQIVGY 309
           +H V +VGY
Sbjct: 295 DHGVLVVGY 303


>gi|328870281|gb|EGG18656.1| hypothetical protein DFA_04151 [Dictyostelium fasciculatum]
          Length = 347

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 87/303 (28%), Positives = 140/303 (46%), Gaps = 23/303 (7%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
            IVA++ L  LA        NL  +   F  FQ +Y K Y   E   +   F+ SL  I+
Sbjct: 5   LIVAILLLVALA---SARTSNLSFEETQFREFQLKYNKHYESHEFAQKLATFKNSLKRIQ 61

Query: 69  ELNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
           ELN   +++     +G+ +F+DLS+EEF   +L         M       +  ++  K  
Sbjct: 62  ELNDMAKRAKVDTEFGVNKFADLSKEEFANYYLNKGG-----MESTDSETYAPDYSDKE- 115

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
                   + +P   DWR  G +  V++Q  CG+CW+FST    E    L    L+ LS 
Sbjct: 116 -------ISNLPTSFDWRTQGAVTPVKDQGQCGSCWSFSTTGNVEGQWFLAGNDLTGLSE 168

Query: 188 QEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL-LKDAACKRKATSPNGVK 246
           Q ++DC+   N GC+GG      D++  N  + + E+ YP L ++   C+    +  G K
Sbjct: 169 QNLVDCS-TKNDGCNGGLMPLAYDYIVENNGI-DTEASYPYLAIQQKNCQFNPANI-GAK 225

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQI 306
           I  Y    +  +E+ +  ++  +GP+  A +A  WQYY  G+          N++H + I
Sbjct: 226 IDGYY--NVSSNETQMQINLVNNGPLSIAADAAEWQYYKKGIFSGIFGICGKNLDHGILI 283

Query: 307 VGY 309
           VGY
Sbjct: 284 VGY 286


>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 90/275 (32%), Positives = 128/275 (46%), Gaps = 38/275 (13%)

Query: 43  RYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR 101
           RY + Y  + E + RFK F+ ++  IE  NK     ++ +  I EF+DL+ EEF  R LR
Sbjct: 3   RYGRMYKDANEKEKRFKIFKDNVARIESFNKAMD--KTYKLSINEFADLTNEEF--RSLR 58

Query: 102 HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGA 161
           +    H+                  + T      T +P   DWR+ G +  +++QQ CG 
Sbjct: 59  NRFKAHIC---------------SEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGC 103

Query: 162 CWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVV- 219
           CWAFS V   E +  +  G L  LS QE++DC  G  N GCSGG    L+D  D  + + 
Sbjct: 104 CWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGCSGG----LMD--DAFRFIK 157

Query: 220 ---LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
              L  E+ YP    D  C  K  +    KIK Y  D    +E ++   +A H PV  A+
Sbjct: 158 IHGLASEATYPYEGDDGTCNSKKEAHPAAKIKGYE-DVPANNEKALQKAVA-HQPVAVAI 215

Query: 277 NA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A    +Q+Y  GV    C   L   +H V  VGY
Sbjct: 216 DAGGFEFQFYTSGVFTGQCGTEL---DHGVAAVGY 247


>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
          Length = 286

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 87/276 (31%), Positives = 136/276 (49%), Gaps = 30/276 (10%)

Query: 30  LEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
           +++ +ELF S+  R+ K Y S  E  +RF+ F+ +L  I+E NK      +   G+ EF+
Sbjct: 1   MDKLIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNK---VVSNYWLGLNEFA 57

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
           DLS  EFK ++L   V+                  ++ S          +P   DWR+ G
Sbjct: 58  DLSHHEFKKQYLGLKVD---------------FSTRRESSEEFTYRDVDLPKSVDWRKKG 102

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
            +  ++NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG    
Sbjct: 103 AVTNIKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGG---- 158

Query: 209 LLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
           L+D+     V    L  E +YP ++++  C+        V I  Y  D    +E S+L  
Sbjct: 159 LMDYAFSFIVENGGLHKEDDYPYIMEEGTCEMSKEESQVVTISGYH-DVPQNNEQSLLKA 217

Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
           +A   P+  A+ A    +Q+Y GGV   +C   LA+
Sbjct: 218 LANQ-PLSVAIEASGRDFQFYSGGVFDGHCGTQLAS 252


>gi|18399697|ref|NP_565512.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
 gi|12643282|sp|P43295.2|A494_ARATH RecName: Full=Probable cysteine proteinase A494; Flags: Precursor
 gi|4567274|gb|AAD23687.1| cysteine proteinase [Arabidopsis thaliana]
 gi|116325924|gb|ABJ98563.1| At2g21430 [Arabidopsis thaliana]
 gi|330252083|gb|AEC07177.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
          Length = 361

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/325 (28%), Positives = 158/325 (48%), Gaps = 42/325 (12%)

Query: 1   MFDVKNVLFIVALIALC-----FLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHD 54
           +F V +++F+   +++C      +   V  ++P +    + F+ F++++ K Y S  EH 
Sbjct: 8   LFSV-SLIFVFVSVSVCGDEDVLIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHY 66

Query: 55  IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
            RF  F+ +L  +  +   +  P SAR+G+T+FSDL+  EF+ +HL       V      
Sbjct: 67  YRFSVFKANL--LRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHL------GVKGGFKL 117

Query: 115 HHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
             D +   +          +PT  +P + DWR+ G +  V+NQ +CG+CW+FST    E 
Sbjct: 118 PKDANQAPI----------LPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEG 167

Query: 174 MHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESE 225
            H L  G L  LS Q+++DC         G+ + GC+GG   +  ++  +    L  E +
Sbjct: 168 AHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYT-LKTGGLMREKD 226

Query: 226 YPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYL 285
           YP    D    +   S     + +++  ++  +E  I  ++  +GP+  A+NA   Q Y+
Sbjct: 227 YPYTGTDGGSCKLDRSKIVASVSNFSVVSI--NEDQIAANLIKNGPLAVAINAAYMQTYI 284

Query: 286 GGV-IQYNCDGSLANINHAVQIVGY 309
           GGV   Y C   L   NH V +VGY
Sbjct: 285 GGVSCPYICSRRL---NHGVLLVGY 306


>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
          Length = 325

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 89/308 (28%), Positives = 146/308 (47%), Gaps = 41/308 (13%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           +V ++   F    V+V     +   EL+  F++ Y K+Y+  +   RF  F+ +L   ++
Sbjct: 9   LVVVVGCAFAVNTVRVP----DNARELYEQFKRDYGKAYANEDDQKRFAIFKDNLVRAQQ 64

Query: 70  LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
                Q   +A+YG+T+FSDL+ EEF   +L   +++ V            + V+   + 
Sbjct: 65  YQMQEQG--TAKYGVTQFSDLTPEEFAAMYLGSRIDERV------------DRVQLNDLQ 110

Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
           T        P   DWR+ G +G V +Q +CG+CWAFS     E    LK G L  LS Q+
Sbjct: 111 TA-------PASVDWRKKGAVGPVEDQGSCGSCWAFSVTANVEGQWFLKTGRLVSLSKQQ 163

Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIK 248
           ++DC    + GCSGG       + ++ ++  LE +S YP      AC+   +     K+ 
Sbjct: 164 LVDC-DRLDHGCSGG--YPPYTYKEIKRMGGLELQSAYPYTSWKQACRIDRS-----KLV 215

Query: 249 SYTCDTLI--PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN---CDGSLANINHA 303
           +   D+++    E      +A HGP+   +NA   Q+Y  G++  +   C  S   +NHA
Sbjct: 216 AKIDDSIVLETDEEKQAAWLAEHGPMSTCLNAGPLQFYQSGILHPSKAMC--SPEGLNHA 273

Query: 304 VQIVGYDN 311
           V  VGYD 
Sbjct: 274 VLTVGYDT 281


>gi|302754322|ref|XP_002960585.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
 gi|300171524|gb|EFJ38124.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
          Length = 330

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 93/286 (32%), Positives = 139/286 (48%), Gaps = 40/286 (13%)

Query: 37  FSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F SF  R+ K+Y+ +E +  R K FE +L  +  ++     P SA +GIT+FSDL+EEEF
Sbjct: 21  FKSFIARFGKAYATAEAYAHRLKVFEANL--VRAVSHQALDP-SAVHGITQFSDLTEEEF 77

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           K + L   V   +                 R       +PT  +P   DWRE G + +V+
Sbjct: 78  KQQFLGLRVPSRL-----------------REANKAPVLPTNDLPEDFDWREHGAVTEVK 120

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ  CG+CWAFST    E  H L+ G L  LS Q+++DC          + + GC+GG  
Sbjct: 121 NQGACGSCWAFSTTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLM 180

Query: 207 CALLDWMDVNKVVLEPESEYPLLL-KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
               D++ +    LE E++YP     +  C+  A   N +        T+   E  I  +
Sbjct: 181 TNAYDYV-MKSGGLETETDYPYTGNSNGKCQFNA---NKIVASVANFSTVSLDEDQIAAN 236

Query: 266 IATHGPVIAAVNALTWQYYLGGVIQYNCD--GSLANINHAVQIVGY 309
           +  HGP+   +NA+  Q Y+GGV   +C    S  +I+H V +VGY
Sbjct: 237 LVKHGPLAIGINAVFMQTYIGGV---SCPIICSKHHIDHGVLLVGY 279


>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
          Length = 428

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 87/286 (30%), Positives = 135/286 (47%), Gaps = 25/286 (8%)

Query: 36  LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF +F+  + ++Y S  E   RF+ F  ++     LN  R++P  A +G  EF+D++ EE
Sbjct: 9   LFGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLN--RKNP-MATFGPNEFADMTSEE 65

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F+TRH              K+         K ++   I          DWR  G +  V+
Sbjct: 66  FQTRHNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQI----------DWRLKGAVTPVK 115

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CW+FST    E  HA+  G L  +S QE++ C    + GC+GG       W+ 
Sbjct: 116 NQGACGSCWSFSTTGNIEGQHAIATGQLVAVSEQELVSCDPIDD-GCNGGLMDNAFGWLI 174

Query: 214 DVNKVVLEPESEYPLLLKDA---ACKRKATS-PNGVKIKSYTCDTLIPSESSILTDIATH 269
             +K  +  E+ YP +  +    AC     S P G  I ++    +  +E  +   +  H
Sbjct: 175 SAHKGQIATEANYPYVSGNGIVPACSSSPESKPVGATISAF--QDIARTEEDMAAFVFKH 232

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
           GP+   V+A TWQ Y GG++ Y C      I+H V IVG+D+ + T
Sbjct: 233 GPLSIGVDASTWQSYAGGIMSY-CPQD--QIDHGVLIVGFDDTAST 275


>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 136/286 (47%), Gaps = 30/286 (10%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++ + + LF S+  ++ K Y   +  + RF+ F  +L  I+E NK   +      G+ EF
Sbjct: 41  SIHKVIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVSN---YWLGLNEF 97

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DL+ EEFK + L         ++  K          + S   G      +P   DWR+ 
Sbjct: 98  ADLTHEEFKHKFLGFKGE----LAERK---------DESSKEFGYRDFVDLPKSVDWRKK 144

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG CWAFSTV   E ++ +  G L++LS QE+IDC    N GC+GG   
Sbjct: 145 GAVAPVKNQGQCGNCWAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGG--- 201

Query: 208 ALLDW--MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
            L+D+    V +  L  E EYP ++ +  C  K      V I  Y  D     E+S L  
Sbjct: 202 -LMDYAFAYVMRSGLHKEEEYPYIMSEGTCDEKKDVSEKVTISGYH-DVPRNDEASFLKA 259

Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A   P+  A+ A    +Q+Y GGV   +C   L   +H V  VGY
Sbjct: 260 LANQ-PISVAIEASGRDFQFYSGGVFDGHCGTEL---DHGVAAVGY 301


>gi|302771610|ref|XP_002969223.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
 gi|300162699|gb|EFJ29311.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
          Length = 367

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 93/286 (32%), Positives = 139/286 (48%), Gaps = 40/286 (13%)

Query: 37  FSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F SF  R+ K+Y+ +E +  R K FE +L  +  ++     P SA +GIT+FSDL+EEEF
Sbjct: 58  FKSFIARFGKAYATAEAYAHRLKVFEANL--VRAVSHQALDP-SAVHGITQFSDLTEEEF 114

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           K + L   V   +                 R       +PT  +P   DWRE G + +V+
Sbjct: 115 KQQFLGLRVPSRL-----------------REANKAPVLPTNDLPEDFDWREHGAVTEVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ  CG+CWAFST    E  H L+ G L  LS Q+++DC          + + GC+GG  
Sbjct: 158 NQGACGSCWAFSTTGAIEGAHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLM 217

Query: 207 CALLDWMDVNKVVLEPESEYPLLL-KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
               D++ +    LE E++YP     +  C+  A   N +        T+   E  I  +
Sbjct: 218 TNAYDYV-MKSGGLETETDYPYTGNSNGKCQFNA---NKIVASVANFSTVSLDEDQIAAN 273

Query: 266 IATHGPVIAAVNALTWQYYLGGVIQYNCD--GSLANINHAVQIVGY 309
           +  HGP+   +NA+  Q Y+GGV   +C    S  +I+H V +VGY
Sbjct: 274 LVKHGPLAIGINAVFMQTYIGGV---SCPIICSKHHIDHGVLLVGY 316


>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
 gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 377

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 88/290 (30%), Positives = 144/290 (49%), Gaps = 37/290 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F+Q++ KSY SK EHD RF+ F+ +L   +   +++    SA +G+T+FSDL+  EF
Sbjct: 60  FSVFKQKFGKSYASKEEHDHRFRVFKANL---KRAQRHQALDPSATHGVTQFSDLTPSEF 116

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
           +   L     +  L +             K  I     +PT G+P   DWR+ G + +V+
Sbjct: 117 RRSFLGLRSRRLGLPA----------DANKAPI-----LPTDGLPTDFDWRDKGAVSEVK 161

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FS     E  + L  G L  LS Q+++DC         G+ + GC+GG  
Sbjct: 162 NQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLM 221

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
            +  ++  +    L  E +YP    D    +   S     + +++  +L   E  I  ++
Sbjct: 222 NSAFEYT-LKSGGLMKEQDYPYTGTDRGTCKFDKSKIAASVANFSVVSL--DEEQIAANL 278

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY--DNYS 313
             +GP+  A+NA+  Q Y+ GV   Y C     +++H V +VGY  D Y+
Sbjct: 279 VKNGPLAVAINAVFMQTYIKGVSCPYICS---KHLDHGVLLVGYGSDGYA 325


>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
 gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
          Length = 368

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 89/285 (31%), Positives = 136/285 (47%), Gaps = 39/285 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F+ ++KKSY S+ EHD RF  F+ +L       ++++   +A +G+T+FSDL+  EF
Sbjct: 53  FSLFKSKFKKSYGSQEEHDYRFSVFKANL---RRAARHQELDPTASHGVTQFSDLTPAEF 109

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           +         K VL           N            +PT  +P   DWR+ G +G ++
Sbjct: 110 R---------KQVLGLRRLRLPKDANEAP--------ILPTSDLPEDFDWRDKGAVGPIK 152

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FS     E  H L  G L  LS Q+++DC         G+ + GC+GG  
Sbjct: 153 NQGSCGSCWSFSATGALEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 212

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDA-ACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
            +  ++  +    L  E +YP    D  ACK      N V  +      +   E  I  +
Sbjct: 213 NSAFEYT-LKAGGLMREEDYPYTGTDRDACK---FDKNKVAARVANFSVVSLDEDQIAAN 268

Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           +  +GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 269 LVKNGPLAVAINAVFMQTYIGGVSCPYICSRRL---DHGVLLVGY 310


>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
 gi|194703250|gb|ACF85709.1| unknown [Zea mays]
 gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
          Length = 356

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/282 (30%), Positives = 131/282 (46%), Gaps = 21/282 (7%)

Query: 34  LELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           LE F  +  R+ + Y+ + E   R + + ++++++E  N         R    +F+DL+ 
Sbjct: 30  LERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNG---YRLADNKFADLTN 86

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG---IPVKKDWREAGI 149
           EEF+ + L     +         H    + V    I +G+    G   +P   DWRE G 
Sbjct: 87  EEFRAKMLGFGRPRS---GGGAGHSTAPSTVA--CIGSGLMGRQGYSDLPKSVDWREKGA 141

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  V++Q  CG+CWAFS V   E ++ +KNG L  LS QE++DC     +GC+GG     
Sbjct: 142 VAPVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCD-TKAIGCAGGYMSWA 200

Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            +++  N+  L  E  YP    + AC+      + V I  Y    + PS    L   A  
Sbjct: 201 FEFVMKNR-GLTTERNYPYQGLNGACQTPKLKESAVSISGYM--NVTPSSEPDLLRAAAA 257

Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            PV  AV+A    WQ Y GGV    C    A +NH V +VGY
Sbjct: 258 QPVSVAVDAGSFVWQLYGGGVFTGPC---TAELNHGVTVVGY 296


>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
          Length = 376

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 84/303 (27%), Positives = 145/303 (47%), Gaps = 34/303 (11%)

Query: 24  KVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARY 82
           K S    E+ + +++ +  ++ K+Y+   E + RF+ F+ +L  ++E N   +S    + 
Sbjct: 34  KSSSRTDEEVMGIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSENRS---YKV 90

Query: 83  GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG--IPV 140
           G+  F+DL+ EE+++  L                D     +K +S +    +     +P 
Sbjct: 91  GLNRFADLTNEEYRSMFLGTKT------------DSKRRFMKSKSASRRYAVQDSDMLPE 138

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
             DWRE+G +  +++Q +CG+CWAFSTV   E ++ +  G +  LS QE++DC    + G
Sbjct: 139 SVDWRESGAVAPIKDQGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAG 198

Query: 201 CSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
           C+GG    L+D+     +N   ++ E +YP    D  C  +  +   V I  Y  + + P
Sbjct: 199 CNGG----LMDYAFEFIINNGGIDTEEDYPYRGVDGTCDPERKNTKVVSINDY--EDVPP 252

Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY--DNYS 313
            +   L     H PV  A+ A    +Q YL GV    C  +L   +H V +VGY  DN +
Sbjct: 253 YDEMALKKAVAHQPVSVAIEASGRAFQLYLSGVFTGECGRAL---DHGVVVVGYGTDNGA 309

Query: 314 RTW 316
             W
Sbjct: 310 DHW 312


>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
          Length = 451

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 93/298 (31%), Positives = 144/298 (48%), Gaps = 33/298 (11%)

Query: 22  PVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDI---IEELNKNRQSP 77
           P   ++ +  Q + LF  F   Y KSY+ + E   R   F ++L++   ++EL++     
Sbjct: 139 PAPAAQEDSVQLISLFKDFLTTYNKSYANATETQRRLGIFARNLELARKVQELDRG---- 194

Query: 78  ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG 137
            SA YG+T+FSDL+EEEF+T +L      + L+S           +  R++  G      
Sbjct: 195 -SAEYGVTKFSDLTEEEFRTSYL------NPLLSS----------LPGRALRPGPATRGP 237

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
            P   DWR+ G +  V+NQ  CG+CWAFS     E    L+ G L  LS QE++DC    
Sbjct: 238 APASWDWRDHGAVTGVKNQGACGSCWAFSVTGNVEGQWFLRRGALLALSEQELVDC-DTL 296

Query: 198 NMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLI 256
           +  C GG       +  + K+  LE E +Y    +   C   + SP+  ++   +   L 
Sbjct: 297 DQACGGG--LPSNAYTAIEKLGGLETEKDYSYEGRKERC---SFSPDKARVYINSSVDLS 351

Query: 257 PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGYDNYS 313
             E  + T +A +GPV  A+NA   Q+Y  GV   +    S   I+HAV +VGY + S
Sbjct: 352 RDEEELATWLAENGPVSIALNAFAMQFYRRGVSHPFRPLCSPWFIDHAVLLVGYGHRS 409


>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 466

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 138/284 (48%), Gaps = 34/284 (11%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           ++ ++  ++ K+Y+   E + RFK F+ +L  IEE   N    +S + G+ +F+DL+ EE
Sbjct: 47  VYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEE--HNGAGDKSYKLGLNKFADLTNEE 104

Query: 95  FKTRHL----RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           ++   L    R   NK  +++  K  D +     +            +P   DWRE G +
Sbjct: 105 YRAMFLGTRTRGPKNKAAVVA--KKTDRYAYRAGEE-----------LPAMVDWREKGAV 151

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
             +++Q  CG+CWAFSTV   E ++ +  G L+ LS QE++DC    NMGC+GG    L+
Sbjct: 152 TPIKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGG----LM 207

Query: 211 DW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
           D+     V    ++ E +YP   KD  C     +   V I  Y  D     E S++  +A
Sbjct: 208 DYAFEFIVQNGGIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYE-DVPTNDEKSLMKAVA 266

Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
              PV  A+ A  + +Q Y  GV    C     N++H V  VGY
Sbjct: 267 NQ-PVSVAIEAGGMEFQLYQSGVFTGRCG---TNLDHGVVAVGY 306


>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
           [Vitis vinifera]
          Length = 374

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 82/283 (28%), Positives = 138/283 (48%), Gaps = 24/283 (8%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFS 88
           E+ + ++  +  ++ K+Y+   E + RF+ F+ +L  I+E N +NR    + + G+  F+
Sbjct: 40  EEVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQNR----TYKVGLNRFA 95

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
           DL+ EE++  +L    +     +  K      N   + ++  G  +P  +    DWRE G
Sbjct: 96  DLTNEEYRAIYLGTRSDPKRRFAKLK------NASPRYAVMPGEVLPESV----DWRETG 145

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
            +  V++Q++CG+CWAFSTV   E ++ +  G L  LS QE++DC    +MGC+GG    
Sbjct: 146 AVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDY 205

Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
             D++ +    L+ E +YP    D  C     S   V I  Y  + + P +   L     
Sbjct: 206 AFDFI-IKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGY--EDVPPFDEKALQKAVA 262

Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           H PV  AV A     Q Y+ G+    C  +L   +H +  VGY
Sbjct: 263 HQPVSVAVEAGGRALQLYVSGIFTGECGTAL---DHGIVAVGY 302


>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 365

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 143/284 (50%), Gaps = 38/284 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS+F+ ++ K+Y +K EHD RF  F+ ++        + Q   SA +G+T+FSDL+  EF
Sbjct: 51  FSTFKSKFGKTYATKEEHDHRFGVFKSNM---RRARLHAQLDPSAVHGVTKFSDLTPAEF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
             + L   +    L +H           +K  I     +PT  +P   DWR+ G +  V+
Sbjct: 108 HRKFL--GLKPLRLPAH----------AQKAPI-----LPTNNLPKDFDWRDKGAVTNVK 150

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
           +Q +CG+CW+FST    E  H L  G L  LS Q+++DC         G+ + GC+GG  
Sbjct: 151 DQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLM 210

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               +++ +    ++ E +YP   +D  CK    S     + +Y+  +L   E  I  ++
Sbjct: 211 NNAFEYL-IGSGGVQREKDYPYTGRDGTCKFD-KSKIAASVSNYSVISL--DEEQIAANL 266

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             +GP+  A+NA+  Q Y+GGV   Y C     +++H V +VGY
Sbjct: 267 VKNGPLAVAINAVYMQTYVGGVSCPYICG---KHLDHGVLLVGY 307


>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
          Length = 377

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/282 (30%), Positives = 131/282 (46%), Gaps = 21/282 (7%)

Query: 34  LELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           LE F  +  R+ + Y+ + E   R + + ++++++E  N         R    +F+DL+ 
Sbjct: 51  LERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNG---YRLADNKFADLTN 107

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG---IPVKKDWREAGI 149
           EEF+ + L     +         H    + V    I +G+    G   +P   DWRE G 
Sbjct: 108 EEFRAKMLGFGRPRS---GGGAGHSTAPSTVA--CIGSGLMGRQGYSDLPKSVDWREKGA 162

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  V++Q  CG+CWAFS V   E ++ +KNG L  LS QE++DC     +GC+GG     
Sbjct: 163 VAPVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCD-TKAIGCAGGYMSWA 221

Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            +++  N+  L  E  YP    + AC+      + V I  Y    + PS    L   A  
Sbjct: 222 FEFVMKNR-GLTTERNYPYQGLNGACQTPKLKESAVSISGYM--NVTPSSEPDLLRAAAA 278

Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            PV  AV+A    WQ Y GGV    C    A +NH V +VGY
Sbjct: 279 QPVSVAVDAGSFVWQLYGGGVFTGPC---TAELNHGVTVVGY 317


>gi|71400414|ref|XP_803044.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70865609|gb|EAN81598.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 467

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 91/316 (28%), Positives = 142/316 (44%), Gaps = 28/316 (8%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
           M      L + A++ +    +P   +  + E+ L   F+ F+Q++ + Y S +E   R  
Sbjct: 1   MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYGSAAEEAFRLS 60

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
            F  +L  +  L+    +   A +G+T FSDL+ EEF++R+             H    H
Sbjct: 61  VFRANL-FLARLHA--AANPHATFGVTPFSDLTREEFRSRY-------------HNGAAH 104

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
                ++  +   +    G P  KDWRE G +  V+NQ  CG+CWAF+ +   E    L 
Sbjct: 105 FAAAQERARVPVDVEF-VGAPAAKDWREEGAVTAVKNQGMCGSCWAFAAIGNIECQWFLA 163

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR 237
              L+ LS Q ++ C  N N GC GG       W+ D N   +  E  YP          
Sbjct: 164 GNPLTRLSEQMLVSC-DNTNSGCGGGWPLVAFKWIVDRNNGTVYTEESYPYHSCIGISPP 222

Query: 238 KATSPN--GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
             TS +  G  I  Y   T+   E+ I   +A +GPV   V+A +W +Y GGV+      
Sbjct: 223 CTTSGHTVGATITGYV--TIPRDENGIAAWLAVNGPVAVVVDASSWIFYTGGVMTSCVSK 280

Query: 296 SLANINHAVQIVGYDN 311
            L   +HAV +VGY++
Sbjct: 281 QL---SHAVLLVGYND 293


>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 94/292 (32%), Positives = 141/292 (48%), Gaps = 39/292 (13%)

Query: 31  EQKLEL-FSSFQQRYKKSYSKSEHDIRFKN-FEKSLDIIEELNKNRQSPESA-RYGITEF 87
           E +LE  F  F+  + + Y   E ++  K+ F  +L  I   N +  + +S     +  F
Sbjct: 26  EGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNF 85

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DLS EEF+     +     V ++   H D   N V+             +P   DW   
Sbjct: 86  TDLSNEEFRATFNGYRRLAAVSLADSVHAD---NDVE------------ALPATVDWTTK 130

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDF 206
           G++  ++NQQ CG+CWAFS V + E  HALK G L  LS Q ++DC A  G+MGCSGG  
Sbjct: 131 GVVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGG-- 188

Query: 207 CALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
                WMD      +    ++ E+ YP    D +C+ K  S  G  I S+  D     ES
Sbjct: 189 -----WMDYAFKYVIQNRGIDTEASYPYKAIDESCEFKRNSI-GATIHSFV-DVKTGDES 241

Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
           ++   +A+ GP+  A++A   ++Q+Y  GV  YN  D S   ++H V  VGY
Sbjct: 242 ALQNAVASIGPISVAIDASQPSFQFYSSGV--YNEPDCSTEILDHGVTAVGY 291


>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
          Length = 322

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 94/300 (31%), Positives = 150/300 (50%), Gaps = 32/300 (10%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELN 71
           LI L  +A+   +    +E+   LF +F+    KSY     ++ RF  F  ++  IE+ N
Sbjct: 6   LIGLLIVAVNASL----IEKHQALFETFKVENGKSYRNQVEEVQRFNIFRANVLEIEQHN 61

Query: 72  K-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
               Q   S +  I +F+DL++EEFK     H   K VL +  ++               
Sbjct: 62  ALYEQGLVSYKKAINQFTDLTQEEFKAYLGLHV--KPVLNNTIQYE------------LK 107

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
           G+ +PT +    DWR AG +  V+NQ +CG+CW+F+   + E  +  K+  L  LS Q++
Sbjct: 108 GLEVPTSV----DWRSAGQVTGVKNQGSCGSCWSFALTGSTEGAYYRKHKQLVSLSEQQL 163

Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
           +DC+ + N GC+GG   A   +++  +  L+ ES YP    D +CK   +S    KI +Y
Sbjct: 164 VDCSTSINYGCNGGFLDATFPYIE--QYGLQTESSYPYTGVDGSCKYD-SSKVVTKISNY 220

Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
              +L  SES +L  + + GPV   ++A     Y  G+   N C  +  N+NHAV +VGY
Sbjct: 221 V--SLHGSESKVLEPVGSIGPVAITMDASYLSSYSSGIYAANKC--TTTNLNHAVLVVGY 276


>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
          Length = 340

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 82/283 (28%), Positives = 135/283 (47%), Gaps = 42/283 (14%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F  R+ K+Y SK E ++R + ++ ++  I   N ++    S   G    +D + +E+
Sbjct: 42  FVHFMSRFSKAYKSKEEFEMRLQQYKSNIAFINNHN-SQNDGTSFTLGPNHLADYTHDEY 100

Query: 96  KTR---HLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           K       R+   K V  + +                        IP   DWRE G +  
Sbjct: 101 KKMLGYKPRNKTGKEVYSTPNLKD---------------------IPESIDWREKGAVNA 139

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V++Q  CG+CWAFST+ + ES + ++ G L  LS Q+++DC+ NGN GC+GGD    +D+
Sbjct: 140 VKDQGQCGSCWAFSTIASLESRYFIETGKLQSLSEQQLVDCSKNGNEGCNGGDMGLAMDY 199

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD----TLIPSESSILTDIAT 268
           +  +   +E E +YP + KD  C  +A+       K    D     ++P + + L     
Sbjct: 200 I-ASAGGVETEKDYPYVGKDQTCAFEAS-------KEVATDKGHINIVPGKFATLQAAIA 251

Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            GPV  A+ A  L +Q+Y  G+   +  G+  N++H V  VGY
Sbjct: 252 EGPVSVAIEADSLFFQFYRSGIFDSSWCGT--NLDHGVAAVGY 292


>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
          Length = 343

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 95/314 (30%), Positives = 141/314 (44%), Gaps = 38/314 (12%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSYSK-SEHDIRFKNFEKS 63
           V  I   +  C     ++V+   L+     E    +  +Y K Y    E + RFK F ++
Sbjct: 7   VYHISLALVFCLGLFAIQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFTEN 66

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
           ++ +E  N +    +S + GI +F+DL+ EEF             + S +K   H  + +
Sbjct: 67  VNYVEASNAD--DTKSYKLGINQFADLTNEEF-------------VASRNKFKGHMCSSI 111

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
             R+ T      + IP   DWR+ G +  V+NQ  CG CWAFS V   E +H L  G L 
Sbjct: 112 T-RTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLI 170

Query: 184 LLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKR 237
            LS QE++DC   G + GC GG    L+D  D  K +     L  E++YP    D  C  
Sbjct: 171 SLSEQELVDCDTKGVDQGCEGG----LMD--DAFKFIIQNHGLSTEAQYPYEGVDGTCNA 224

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDG 295
              S   V I  Y  D    SE ++   +A   P+  A++A    +Q+Y  GV   +C  
Sbjct: 225 NKASVQAVTITGYE-DVPANSEQALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSCGT 282

Query: 296 SLANINHAVQIVGY 309
            L   +H V  VGY
Sbjct: 283 EL---DHGVTAVGY 293


>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
          Length = 586

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 96/299 (32%), Positives = 142/299 (47%), Gaps = 47/299 (15%)

Query: 27  KPNLEQKLEL---FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY 82
           K N++ +L+L   F +F   + K Y S  E   RF+ F  ++  ++ L  + Q   SA Y
Sbjct: 267 KNNIDDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQG--SAIY 324

Query: 83  GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG----- 137
           G T+F+DL++ EFK ++L                          S+T+  T+P       
Sbjct: 325 GATQFADLTKNEFKKKYLGLD----------------------SSMTSKKTLPMAVIPQS 362

Query: 138 --IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
             IP + DWR   ++  V+NQ  CG+CWAFS +   E  +ALK+  L  LS QE+IDC  
Sbjct: 363 ASIPNEFDWRNHNVVTPVKNQGACGSCWAFSAIANIEGQYALKSKELLSLSEQELIDC-D 421

Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS--PNGVKIKSYTCD 253
           N + GC GG      + ++ N   LE ES+YP    +    RK      + VK+      
Sbjct: 422 NLDNGCGGGLMTQAFEAVE-NLGGLETESDYPY---EGHADRKGCQLKKSDVKVSISKAV 477

Query: 254 TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
            +   E  I   +  HGP+   VNA   Q+Y+GGV   I   C  S  +++H V IVGY
Sbjct: 478 NVSTDEEDIAKFLVKHGPLSVGVNANAMQFYMGGVSHPIHALC--SPKSLDHGVAIVGY 534


>gi|225427714|ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
          Length = 377

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 92/285 (32%), Positives = 143/285 (50%), Gaps = 39/285 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F++R+ KSY S+ EHD RFK F+ +L       +++Q   SA +G+T+FSDL+  EF
Sbjct: 62  FSIFKRRFGKSYASQEEHDYRFKVFKANL---RRARRHQQLDPSATHGVTQFSDLTPAEF 118

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           +  +L        L      HD      +K  I     +PT  +P   DWR+ G +  V+
Sbjct: 119 RGTYLG-------LRPLKLPHD-----AQKAPI-----LPTNDLPEDFDWRDHGAVTAVK 161

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FST    E  + L  G L  LS Q++++C         G+ + GC+GG  
Sbjct: 162 NQGSCGSCWSFSTTGALEGANFLATGNLVSLSEQQLVECDHECDPEEMGSCDSGCNGGLM 221

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
               ++  +    L  E +YP    D  +CK   T      + +++  +L   E  I  +
Sbjct: 222 NTAFEYT-LKAGGLMKEEDYPYTGTDRGSCKFDKTKI-AASVSNFSVISL--DEDQIAAN 277

Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           +  +GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 278 LVKNGPLAVAINAVFMQTYVGGVSCPYICSKRL---DHGVLLVGY 319


>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
          Length = 586

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 96/299 (32%), Positives = 142/299 (47%), Gaps = 47/299 (15%)

Query: 27  KPNLEQKLEL---FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY 82
           K N++ +L+L   F +F   + K Y S  E   RF+ F  ++  ++ L  + Q   SA Y
Sbjct: 267 KNNIDDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQG--SAIY 324

Query: 83  GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG----- 137
           G T+F+DL++ EFK ++L                          S+T+  T+P       
Sbjct: 325 GATQFADLTKNEFKKKYLGLD----------------------SSMTSKKTLPMAVIPQS 362

Query: 138 --IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
             IP + DWR   ++  V+NQ  CG+CWAFS +   E  +ALK+  L  LS QE+IDC  
Sbjct: 363 ASIPNEFDWRNHNVVTPVKNQGACGSCWAFSAIANIEGQYALKSKELLSLSEQELIDC-D 421

Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS--PNGVKIKSYTCD 253
           N + GC GG      + ++ N   LE ES+YP    +    RK      + VK+      
Sbjct: 422 NLDNGCGGGLMTQAFEAVE-NLGGLETESDYPY---EGHADRKGCQLKKSDVKVSISKAV 477

Query: 254 TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
            +   E  I   +  HGP+   VNA   Q+Y+GGV   I   C  S  +++H V IVGY
Sbjct: 478 NVSTDEEDIAKFLVKHGPLSVGVNANAMQFYMGGVSHPIHALC--SPKSLDHGVAIVGY 534


>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
          Length = 345

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 152/315 (48%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ ++L  +  +   F +     S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ EEF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMPSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V+NQ  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I +Y    ++P   + L    T  PV   IAA + L  Q+Y GG      DG
Sbjct: 229 GKTA-AVQISNY---QVVPEGETSLLQAVTKQPVSIGIAASHDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S AN INHAV  +GY
Sbjct: 279 SCANRINHAVTAIGY 293


>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
 gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
          Length = 364

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 89/284 (31%), Positives = 149/284 (52%), Gaps = 36/284 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+SF+ ++ K+Y +K EHD RF  F+ +L  I+     +  P SA++GIT+FSDL+  EF
Sbjct: 51  FTSFKSKFSKNYATKEEHDYRFGVFKSNL--IKAKLHQKLDP-SAQHGITKFSDLTASEF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L   +NK + +  H          +K  I     +PT  +P   DWRE G +  V+
Sbjct: 108 RRQFL--GLNKRLRLPAH---------AQKAPI-----LPTNNLPEDFDWREKGAVTPVK 151

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           +Q +CG+CWAFST    E  + L  G L+ LS Q+++DC         G+ + GC+GG  
Sbjct: 152 DQGSCGSCWAFSTTGALEGANYLATGKLTSLSEQQLVDCDHVCDPEERGSCDSGCNGGLM 211

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               +++  +  V+  E +Y    +D +CK    S     + +++  +L   E  I  ++
Sbjct: 212 NNAFEYILQSGGVVS-EKDYAYTGRDGSCKFD-KSKVVASVSNFSVVSL--DEDQIAANL 267

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             +GP+  A+NA   Q Y+ GV   Y C  + A ++H V ++G+
Sbjct: 268 VKNGPLAVAINAAWMQTYMSGVSCPYIC--AKARLDHGVLLLGF 309


>gi|297804580|ref|XP_002870174.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316010|gb|EFH46433.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 373

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 87/285 (30%), Positives = 136/285 (47%), Gaps = 37/285 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F+ +Y+K+Y ++ EHD RF+ F+ +L       +N+    SA +G+T+FSDL+ +EF
Sbjct: 55  FSLFKSKYEKTYATQEEHDHRFRVFKANL---RRARRNQLLDPSAVHGVTQFSDLTPKEF 111

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L        L +                  T   +PT  +P + DWRE G +  V+
Sbjct: 112 RRKFLGLKRRGFRLPT---------------DTQTAPILPTSDLPTEFDWREQGAVTPVK 156

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ  CG+CW+FS +   E  H L    L  LS Q+++DC        A + + GCSGG  
Sbjct: 157 NQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLM 216

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
               ++  +    L  E +YP   +D  ACK   +    +         +   E  I  +
Sbjct: 217 NNAFEYA-LKAGGLMKEEDYPYTGRDNTACKFDKSK---IAASVSNFSVVSSDEDQIAAN 272

Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           +  HGP+  A+NA+  Q Y+GGV   Y C  S    +H V +VG+
Sbjct: 273 LVKHGPLAIAINAMWMQTYIGGVSCPYVCSKSQ---DHGVLLVGF 314


>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
          Length = 374

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 91/296 (30%), Positives = 141/296 (47%), Gaps = 38/296 (12%)

Query: 26  SKPNL-EQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
           S PNL   +    S F++++KKSY S+ EHD RF  F+ +L       ++++   +A +G
Sbjct: 47  SSPNLLTAEQHHLSLFKRKFKKSYLSQEEHDYRFSVFKSNL---RRAARHQKLDPTASHG 103

Query: 84  ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKK 142
           +T+FSDL+  EF+         K VL           N            +PT  +P   
Sbjct: 104 VTQFSDLTSAEFR---------KQVLGLRKLRLPKDANKAP--------ILPTNDLPEDF 146

Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------A 194
           DWRE G +G V+NQ +CG+CW+FST    E  H L  G L  LS Q+++DC         
Sbjct: 147 DWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPEEP 206

Query: 195 GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT 254
           G+ + GC+GG   +  ++  +    L  E +YP    D    +         + +++  +
Sbjct: 207 GSCDSGCNGGLMNSAFEYT-LKAGGLMREEDYPYTGMDRGACKFDKDKVAAGVANFSVVS 265

Query: 255 LIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           L   E  I  ++  +GP+  A NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 266 L--DEDQIAANLVKNGPLAVATNAVFMQTYIGGVSCPYICSRRL---DHGVLLVGY 316


>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
 gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|219884977|gb|ACL52863.1| unknown [Zea mays]
          Length = 377

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 89/298 (29%), Positives = 142/298 (47%), Gaps = 28/298 (9%)

Query: 18  FLAIPVKVSKPNLEQK---LELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKN 73
           F +I V  S  +L Q    + LF  +  +Y+K+Y   E  +R F+ F+ +L  I+E   N
Sbjct: 51  FFSI-VGYSPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDE--AN 107

Query: 74  RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGIT 133
           R+   S   G+  F+DL+ +EFK  +L   + K       ++        +         
Sbjct: 108 RKEVTSYWLGLNAFADLTHDEFKATYL-GLLPKRTSGGRFRYGGVGDGGDEV-------- 158

Query: 134 IPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC 193
                P   DWR+ G + +V+NQ  CG+CWAFSTV   E ++ +  G L+ LS Q+++DC
Sbjct: 159 -----PASVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDC 213

Query: 194 AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
           + +GN GCSGG       ++      L  E  YP L+++  C  +A     +   S   D
Sbjct: 214 STDGNNGCSGGVMDNAFSFI-ATGAGLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYED 272

Query: 254 TLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
                E +++  +A H PV  A+ A    +Q+Y GGV    C    + ++H V  VGY
Sbjct: 273 VPANDEQALVKALA-HQPVSVAIEASGRHFQFYSGGVFDGPCG---SELDHGVAAVGY 326


>gi|7211741|gb|AAF40414.1|AF216783_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
          Length = 368

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 84/284 (29%), Positives = 140/284 (49%), Gaps = 36/284 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F++R+ K+Y S  EHD R   F+ ++       ++++   +A +G+T+FSDL+  EF
Sbjct: 51  FTVFKRRFGKAYASDEEHDYRLSVFKANM---RRAKRHQELDPAAVHGVTQFSDLTPTEF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L   +N+ +                     T   +PT  +P   DWR+ G +  V+
Sbjct: 108 RRKFL--GLNRRLKFPADAK--------------TAPILPTDELPSDFDWRDHGAVTPVK 151

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ TCG+CW+FST    E  + L  G L  LS Q+++DC        AG+ + GC+GG  
Sbjct: 152 NQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM 211

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
            +  ++  +    L  E +YP    D    R   +    K+ +++  +L   E  I  ++
Sbjct: 212 NSAFEYT-LKAGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSL--DEDQIAANL 268

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             +GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 269 VKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 309


>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
          Length = 325

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 92/312 (29%), Positives = 142/312 (45%), Gaps = 38/312 (12%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNF 60
           ++ V  + F+V     C  A+   V  P+  +  EL+  F++ Y KSY+  + + RF  F
Sbjct: 3   LYTVSCLTFLVG----CVFAVST-VQVPDSAR--ELYEQFKRDYGKSYANDDDEKRFAIF 55

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           + +L  +   N   Q   +ARYG+T+FSDL+ EEF  + L    +  V            
Sbjct: 56  KDNL--VRAQNYQLQEQGTARYGVTQFSDLTPEEFAAKFLSSRFDDQV-------ERVQL 106

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
           N +K              P   DWRE G +  V +Q +CG+CWAFS     E    LK G
Sbjct: 107 NDLK------------AAPESVDWRELGAVAPVEDQGSCGSCWAFSVAGNVEGQWFLKTG 154

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L  LS Q+++DC    + GC GG +        +    LE + +YP + ++  CK   +
Sbjct: 155 QLVSLSKQQLVDCDVQ-DSGCDGG-YPPTTYGEIIRMGGLEAQRDYPYVGREQPCKLDES 212

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSL 297
               +  K  +   L  +E      IA HGP+ + +NA+T Q+Y  G+    +  C    
Sbjct: 213 K---LLAKINSSIVLEANEKKQAAYIAEHGPMSSGINAVTLQFYQSGISHPSKSQCQPDW 269

Query: 298 ANINHAVQIVGY 309
             +NH V  VGY
Sbjct: 270 --LNHGVLSVGY 279


>gi|330801846|ref|XP_003288934.1| hypothetical protein DICPUDRAFT_153222 [Dictyostelium purpureum]
 gi|325081026|gb|EGC34558.1| hypothetical protein DICPUDRAFT_153222 [Dictyostelium purpureum]
          Length = 334

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 90/316 (28%), Positives = 159/316 (50%), Gaps = 34/316 (10%)

Query: 8   LFIVALIALCFLAIPV----KVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKS 63
           L  + +++L FL+I +    +V  PN  Q    F  + + + K+YS  E   +++ F+ +
Sbjct: 3   LSFILVLSLLFLSINIIASSRVFTPN--QYQSSFVQWMKSHGKAYSHDEFARKYRTFQDN 60

Query: 64  LDIIEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           +D + + N KN ++      G+  F+D++  E++   L  S+      +           
Sbjct: 61  MDYVHQWNSKNSETV----LGLNNFADMNNVEYRNTLLGASIEVEPFRT----------- 105

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
              R+ +  I +PT +    DWRE G +  +++Q  CG+C++FS +  AES + + NG +
Sbjct: 106 --PRTFSR-IQLPTSV----DWREKGAVHDIKDQGHCGSCYSFSAIGAAESAYYIANGEM 158

Query: 183 SLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
             LS Q ++DC+ + GN GC+GG       ++ +++     E+ YP   KDA+C+  +  
Sbjct: 159 LTLSEQNILDCSRSYGNEGCNGGYMLESFQFL-LDQGGAVSEASYPYEAKDASCRFDSVK 217

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
              V   + T +     E  +   IATHGPV  A++A  +++Q Y  GV  Y    S  +
Sbjct: 218 TPIVATFNGTVEIRRGDEGDLQQAIATHGPVAVAIDAGHISFQLYKTGVY-YEPYCSSYS 276

Query: 300 INHAVQIVGYDNYSRT 315
           ++HAV  VGYD  S T
Sbjct: 277 LSHAVLAVGYDTDSVT 292


>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 391

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 90/298 (30%), Positives = 141/298 (47%), Gaps = 28/298 (9%)

Query: 18  FLAIPVKVSKPNLEQK---LELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKN 73
           F +I V  S  +L Q    + LF  +  +Y+K+Y   E  +R F+ F+ +L  I+E   N
Sbjct: 65  FFSI-VGYSPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDE--AN 121

Query: 74  RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGIT 133
           R+   S   G+  F+DL+ +EFK  +L   + K       ++                  
Sbjct: 122 RKEVTSYWLGLNAFADLTHDEFKATYL-GLLPKRTSGGRFRY-------------GGVGD 167

Query: 134 IPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC 193
               +P   DWR+ G + +V+NQ  CG+CWAFSTV   E ++ +  G L+ LS Q+++DC
Sbjct: 168 GGDEVPASVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDC 227

Query: 194 AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
           + +GN GCSGG       ++      L  E  YP L+++  C  +A     +   S   D
Sbjct: 228 STDGNNGCSGGVMDNAFSFI-ATGAGLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYED 286

Query: 254 TLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
                E +++  +A H PV  A+ A    +Q+Y GGV    C   L   +H V  VGY
Sbjct: 287 VPANDEQALVKALA-HQPVSVAIEASGRHFQFYSGGVFDGPCGSEL---DHGVAAVGY 340


>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
          Length = 361

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 90/312 (28%), Positives = 142/312 (45%), Gaps = 30/312 (9%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDI 66
             I ALI L   A              E    +  +Y + Y  ++E  +RF+ F  ++  
Sbjct: 28  FMIAALILLGAWACQATSRTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKF 87

Query: 67  IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           IEE NK+ +  +S +  + EF+D + EEF+    R+     V     +     + +V   
Sbjct: 88  IEEFNKDGR--QSYKLAVNEFADQTNEEFQAS--RNGYKMAVSSRPSQTTLFRYENV--- 140

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
                    T +P   DWR+ G +  V++Q  CG+CWAFST+   E +  LK G L  LS
Sbjct: 141 ---------TAVPSSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLS 191

Query: 187 VQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
            QE++DC   G + GC GG      +++  NK +   E+ YP    D  C  K  +    
Sbjct: 192 EQELVDCDKTGEDQGCEGGYMEDGFEFIVKNKGIA-LEASYPYTAADGTCNSKEEASRAA 250

Query: 246 KIKSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
           KI  Y     +P  SE+++L  +A   PV  +++A  + +Q+Y  GV    C     +++
Sbjct: 251 KISGY---EKVPANSETALLKAVANQ-PVSVSIDASGVAFQFYSSGVFTGECG---TDLD 303

Query: 302 HAVQIVGYDNYS 313
           H V  VGY   S
Sbjct: 304 HGVTAVGYGKTS 315


>gi|7381219|gb|AAF61440.1|AF138264_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
          Length = 368

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 84/284 (29%), Positives = 140/284 (49%), Gaps = 36/284 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F++R+ K+Y S  EHD R   F+ ++       ++++   +A +G+T+FSDL+  EF
Sbjct: 51  FTVFKRRFGKAYASDEEHDYRLSVFKANM---RRAKRHQELDPAAVHGVTQFSDLTPTEF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L   +N+ +                     T   +PT  +P   DWR+ G +  V+
Sbjct: 108 RRKFL--GLNRRLKFPADAK--------------TAPILPTDELPSDFDWRDHGAVTPVK 151

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ TCG+CW+FST    E  + L  G L  LS Q+++DC        AG+ + GC+GG  
Sbjct: 152 NQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM 211

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
            +  ++  +    L  E +YP    D    R   +    K+ +++  +L   E  I  ++
Sbjct: 212 NSAFEYT-LKAGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSL--DEDQIAANL 268

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             +GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 269 VKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 309


>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
          Length = 359

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 83/311 (26%), Positives = 150/311 (48%), Gaps = 31/311 (9%)

Query: 8   LFIVALIALCFLA----IPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEK 62
           + +  ++AL F+     IP        E+ L  L+  ++  +  S   SE + RF  F++
Sbjct: 6   MLLALVVALAFVGVARTIPFNEKDLASEESLWGLYERWRSHHTVSRDLSEKNKRFNVFKE 65

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           +   I E NK + +P   + G+ +F+D++ +EF++ +    +             HHH  
Sbjct: 66  NAKFIHEFNK-KDAP--YKLGLNKFADMTNQEFRSTYAGSKI-------------HHHRT 109

Query: 123 VKKRSITTGITIPT---GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +     TG  +      IP   DWR  G +  V++Q  CG+CWAFST+ + E ++ +K 
Sbjct: 110 QRGTPRATGSFMYENVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKIKT 169

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
             L  LS Q+++DC  + N GC+GG      +++  N  +   ES YP   +  +C  ++
Sbjct: 170 NQLVPLSGQQLVDCDTDQNEGCNGGLMDYAFEFIKSNGGITS-ESAYPYTAEQGSCASES 228

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA-AVNALTWQYYLGGVIQYNCDGSLA 298
           ++P  V I  Y  D    +E++++  +A     +A   + + +Q+Y  GV   +C   L 
Sbjct: 229 SAPV-VTIDGYE-DVPANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCGNEL- 285

Query: 299 NINHAVQIVGY 309
             +H V +VGY
Sbjct: 286 --DHGVAVVGY 294


>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
          Length = 365

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 143/284 (50%), Gaps = 38/284 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS+F+ ++ K+Y +K EHD RF  F+ ++        + Q   SA +G+T+FSDL+  EF
Sbjct: 51  FSTFKAKFGKTYATKEEHDHRFGVFKSNM---RRARLHAQLDPSAVHGVTKFSDLTPAEF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
             + L   +    L +H           +K  I     +PT  +P   DWR+ G +  V+
Sbjct: 108 HRKFL--GLKPLRLPAH----------AQKAPI-----LPTNNLPKDFDWRDKGAVTNVK 150

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           +Q +CG+CW+FST    E  H L  G L  LS Q+++DC         G+ + GC+GG  
Sbjct: 151 DQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLM 210

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               +++ +    ++ E +YP   +D  CK    S     + +Y+  +L   E  I  ++
Sbjct: 211 NNAFEYL-IGSGGVQREKDYPYTGRDGTCKFD-KSKIAASVSNYSVISL--DEEQIAANL 266

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             +GP+  A+NA+  Q Y+GGV   Y C     +++H V +VGY
Sbjct: 267 VKNGPLAVAINAVYMQTYVGGVSCPYICG---KHLDHGVLLVGY 307


>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
 gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 90/313 (28%), Positives = 147/313 (46%), Gaps = 32/313 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSY-SKSEHDIRFKN 59
            K   FI   +     A P K +   L Q + ++   +Q   +Y + Y   +E + R+  
Sbjct: 4   TKQSQFICLALLFVLGAWPSKSAARTL-QDVSMYERHEQWMAQYGRVYKDDAEKETRYNI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F++++  I+  N   Q+ +S + G+ +F+DLS EEFK    R+    H  M   +     
Sbjct: 63  FKENVARIDAFNS--QTGKSYKLGVNQFADLSNEEFKAS--RNRFKGH--MCSPQAGPFR 116

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
           + +V            + +P   DWR+ G +  V++Q  CG CWAFS V   E ++ L  
Sbjct: 117 YENV------------SAVPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTT 164

Query: 180 GTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
           G L  LS QEV+DC   G + GC+GG       +++ NK  L  E+ YP    D  C  +
Sbjct: 165 GKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNK-GLTTEANYPYTGTDGTCNTQ 223

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
             + +  KI  +  D    SE++++  +A   PV  A++A    +Q+Y  G+   +C   
Sbjct: 224 KEATHAAKITGFE-DVPANSEAALMKAVAKQ-PVSVAIDAGGFEFQFYSSGIFTGSCGTQ 281

Query: 297 LANINHAVQIVGY 309
           L   +H V  VGY
Sbjct: 282 L---DHGVTAVGY 291


>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
 gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
 gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
 gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
          Length = 307

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 85/265 (32%), Positives = 127/265 (47%), Gaps = 27/265 (10%)

Query: 52  EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS 111
           E + R+  F+++++ IE  N    S    + G+ +F+DL+ EEF+  H  +      LMS
Sbjct: 21  EKEKRYLIFKENIERIEAFNNG--SDRGYKLGVNKFADLTNEEFRAMHHGYKRQSSKLMS 78

Query: 112 HHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
               H++                 + IP   DWR+AG +  V++Q TCG CWAFS V   
Sbjct: 79  SSFRHEN----------------LSAIPTSMDWRKAGAVTPVKDQGTCGCCWAFSAVAAI 122

Query: 172 ESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLL 230
           E +  LK G L  LS Q+++DC   G + GC GG       ++  N   L  E+ YP   
Sbjct: 123 EGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNG-GLTSEATYPYQG 181

Query: 231 KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGV 288
            D  CK K T+    KI  Y  D  + +E+++L  +A   PV  AV      +Q+Y  GV
Sbjct: 182 VDGTCKSKKTASIEAKITGYE-DVPVNNENALLQAVAKQ-PVSVAVEGGGYDFQFYKSGV 239

Query: 289 IQYNCDGSLANINHAVQIVGYDNYS 313
            + +C   L   +HAV  +GY   S
Sbjct: 240 FKGDCGTYL---DHAVTAIGYGTNS 261


>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
          Length = 360

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 87/291 (29%), Positives = 140/291 (48%), Gaps = 50/291 (17%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FSSF  RY KSY+ ++EH  RF  F+ +L       ++++   +A +G+T F+DL+  EF
Sbjct: 45  FSSFLSRYGKSYADEAEHAYRFSVFKSNL---RRARRHQRLDPTAVHGVTRFADLTPSEF 101

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGIT-----IPTG-IPVKKDWREAGI 149
           +  +L                      +++R  T G T     +PT  +P   DWR+ G 
Sbjct: 102 RRTYL---------------------GLRRRPRTAGSTHDAPILPTNELPADFDWRDHGA 140

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGC 201
           +  V+NQ +CG+CW+FS     E  + L  G L  LS Q+++DC          + + GC
Sbjct: 141 VTPVKNQGSCGSCWSFSAAGALEGANYLSTGNLVSLSEQQLVDCDHECDSSEPDSCDQGC 200

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--E 259
           +GG      +++ +    LE E++YP    D    R     N  KI +   +  + S  E
Sbjct: 201 NGGLMTTAFEYI-LKSGGLEREADYPYTGTD----RGTCKFNKAKISAVASNFSVVSIDE 255

Query: 260 SSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             I  ++  HGP+   +NA+  Q Y+GGV   Y C     +++H V +VGY
Sbjct: 256 DQIAANLVKHGPLAVGINAVFMQTYVGGVSCPYICG---KHLDHGVLLVGY 303


>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
          Length = 380

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 156/313 (49%), Gaps = 29/313 (9%)

Query: 2   FDVKNVLFIVALIALCFLAIPVKVSKPNL-EQKLELFSSFQQRYKKSY-SKSEHDIRFKN 59
           F   ++LF    +   F AI  K+S     ++ + L+ S+  +Y KSY S  E ++R + 
Sbjct: 7   FISMSLLFFSTFLIFSF-AIDAKISPLRTNDEVMALYESWLVKYGKSYNSLGEREMRIEI 65

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F+++L  I+E N +     S   G+ +F+DL++EE+++ +L     K  L S  K  + +
Sbjct: 66  FKENLRFIDEHNADPN--RSYTVGLNQFADLTDEEYRSTYLGF---KSSLKS--KVSNRY 118

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
              V +            +P   DWR  G +  V+NQ  C +CWAF+T+ T ES++ +  
Sbjct: 119 MPQVGEV-----------LPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIIT 167

Query: 180 GTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
           G L  LS QE++DC     N GC GG      +++ +N   +  E  YP + +D  C   
Sbjct: 168 GDLISLSEQELVDCNRTPINEGCKGGFMDDAYEFI-INNGGINTEENYPYIGQDDQCDEP 226

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
             + N V I SY  + + P++   +     + PV  A++A  L +++Y  G+      G+
Sbjct: 227 KKNQNYVTIDSY--EQVPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGT 284

Query: 297 LANINHAVQIVGY 309
              +NHAV I+GY
Sbjct: 285 --TLNHAVTIIGY 295


>gi|10946820|ref|NP_067420.1| cathepsin 6 precursor [Mus musculus]
 gi|9931384|gb|AAG02172.1|AF223401_1 cathepsin-6 [Mus musculus]
 gi|12838129|dbj|BAB24093.1| unnamed protein product [Mus musculus]
 gi|16445021|gb|AAK00510.1| cathepsin 6 precursor [Mus musculus]
 gi|68534635|gb|AAH99455.1| Cathepsin 6 [Mus musculus]
 gi|148709368|gb|EDL41314.1| cathepsin 6 [Mus musculus]
          Length = 334

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 92/287 (32%), Positives = 144/287 (50%), Gaps = 30/287 (10%)

Query: 28  PNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITE 86
           PNL  +   +  ++++Y+KSY+  E  +R   +E+++ +I+  N +N     +    + E
Sbjct: 23  PNLNAE---WHDWKKQYEKSYTMEEEGLRRAIWEENMRMIKLHNWENSLGKNNFTLKMNE 79

Query: 87  FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
           F DL+ EE     LR  +N   + SH           KKR I     +   +P   DWR+
Sbjct: 80  FGDLTPEE-----LRKMMNNFPIWSH-----------KKRKIIRKRAVGDVLPKFVDWRK 123

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG-NGNMGCSGGD 205
            G + +VR Q+ C +CWAF+     E     K G L+ LSVQ ++DC    GN GC  GD
Sbjct: 124 KGYVTRVRRQKFCNSCWAFAVNGAIEGQMFKKTGKLTPLSVQNLVDCTKTQGNDGCQWGD 183

Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
                +++ +N   LE E+ YP   K+  C+    +P   K +     +L  SE  ++  
Sbjct: 184 PYIAYEYV-LNNGGLEAEATYPYEGKEGPCR---YNPKNSKAEITGFVSLPESEDILMEA 239

Query: 266 IATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
           +AT GP+ AAV+A    + +Y GG+  Q NC  S   +NHAV +VGY
Sbjct: 240 VATIGPISAAVDASFNRFSFYDGGIYHQPNC--SNNTVNHAVLVVGY 284


>gi|305434754|gb|ADM53739.1| cathepsin L2 precursor [Lepeophtheirus salmonis]
          Length = 382

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 158/318 (49%), Gaps = 34/318 (10%)

Query: 9   FIVALIALCFL----AIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKS 63
           F +  + +CFL    A+    S P  +++++ F SF + Y KSY +++   ++ K F  +
Sbjct: 5   FKMKFLGVCFLFGLAALAAGTSSPT-QREIQEFESFVKEYSKSYHNRALRSLKLKVFVDN 63

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
           L  IEE N N +   +   GI EFSDL++EEF+++++ +S      MS            
Sbjct: 64  LREIEEHNANPK--RTWDMGINEFSDLTDEEFESKYMGYSP-----MSSSAGLVTRTVAP 116

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
           K+ +I         +P   DWRE G+I  V+NQ +CG+CW FS VE  ES  A++N   S
Sbjct: 117 KQGNIKD-------LPESVDWREKGVITDVKNQGSCGSCWVFSAVEQIESYVAIENNMTS 169

Query: 184 --LLSVQEVIDCAGNGNMGCSGGDFCALLD---WMDVNKVVLEPESEYP----LLLKDAA 234
             LLS Q++  C+ N       G     ++   +M      +E E EYP       +   
Sbjct: 170 PPLLSTQQITSCSSNPYSCGGSGGCKGAINEIAYMYTQLYGIETEKEYPYTSGFTEESGE 229

Query: 235 CKRKATSPNGVKIKSYTCDTLIPSES-SILTDIATHGPVIAAVNALTWQYYLGGVIQYNC 293
           C   A+S  G        + L P++  S++  +A  GP+  +V A  ++ Y  G++   C
Sbjct: 230 CLYNASSVTGKMAHVRGYEVLPPNDMYSVMEHLANKGPLGVSVYAGRFKSYKSGILN-GC 288

Query: 294 DGSLAN--INHAVQIVGY 309
           D + AN  INHA+Q++GY
Sbjct: 289 DFN-ANIVINHAIQMIGY 305


>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
 gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 85/285 (29%), Positives = 140/285 (49%), Gaps = 20/285 (7%)

Query: 28  PNLEQKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITE 86
           P  E  +E+F  ++ R++K+Y  +E  + RF NF+++L  I E    +++    R G+ +
Sbjct: 34  PPDESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIE-KTGKETTLRHRVGLNK 92

Query: 87  FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
           F+DLS EEFK  +L   V K +  +     D    +++              P   DWR+
Sbjct: 93  FADLSNEEFKQLYLSK-VKKPINKTRIDAEDRSRRNLQS----------CDAPSSLDWRK 141

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
            G++  V++Q  CG+CW+FST    E ++A+    L  LS QE++DC    N GC GG  
Sbjct: 142 KGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTSDLISLSEQELVDC-DTTNYGCEGGYM 200

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               +W+ +N   ++ E+ YP    D  C    T+   +K+ S      +    S L   
Sbjct: 201 DYAFEWV-INNGGIDTEANYPYTGVDGTCN---TAKEEIKVVSIDGYKDVDETDSALLCA 256

Query: 267 ATHGPVIAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           A   P+   ++  A+ +Q Y GG+   +C     +I+HAV IVGY
Sbjct: 257 AAQQPISVGIDGSAIDFQLYTGGIYDGDCSDDPDDIDHAVLIVGY 301


>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
 gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
 gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
 gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
 gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
          Length = 422

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 139/281 (49%), Gaps = 29/281 (10%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           + FSSFQ  Y KSY ++ E   R+  F+ +L  I   N   Q   S    +  F DLS +
Sbjct: 115 DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHN---QQGYSYSLKMNHFGDLSRD 171

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGI--TIPTGIPVKKDWREAGIIG 151
           EF+ ++L    +++ L SHH              + T +   +P+ +P   DWR  G + 
Sbjct: 172 EFRRKYLGFKKSRN-LKSHH------------LGVATELLNVLPSELPAGVDWRSRGCVT 218

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALL 210
            V++Q+ CG+CWAFST    E  H  K G L  LS QE++DC+   GN  CSGG+     
Sbjct: 219 PVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAF 278

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            ++ ++   +  E  YP L +D  C R  +    VKI  +  D    SE+++   +A   
Sbjct: 279 QYV-LDSGGICSEDAYPYLARDEEC-RAQSCEKVVKILGFK-DVPRRSEAAMKAALAK-S 334

Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           PV  A+ A  + +Q+Y  GV   +C     +++H V +VGY
Sbjct: 335 PVSIAIEADQMPFQFYHEGVFDASCG---TDLDHGVLLVGY 372


>gi|323457344|gb|EGB13210.1| hypothetical protein AURANDRAFT_18666 [Aureococcus anophagefferens]
          Length = 346

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 89/297 (29%), Positives = 140/297 (47%), Gaps = 41/297 (13%)

Query: 36  LFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F+  Y KSY+ +E +  RF  F  +L   E LN  R   + A +G+T+F DL+E E
Sbjct: 19  LFELFKSDYVKSYNSTEAEAERFTIFSANLRKTEALNAQRVDEDDAEFGVTQFMDLTEAE 78

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWR--EAGIIGK 152
           FK ++L +  ++ VL       D +       +   G   P  +    DWR  ++G++  
Sbjct: 79  FKAQYLNYVPSEQVLA-----EDVY-------AAPEGFAAPGSL----DWRTKQSGVVSD 122

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V++Q  CG+CWAFS  E  ES   L      + + Q+++ C    + GC+GG+      +
Sbjct: 123 VKDQGQCGSCWAFSATEQIESEWVLAGNDPLVFAPQQIVSC-DKVDQGCNGGNTETAYAY 181

Query: 213 MDVNKVVLEPESEYPLLLKDAA----CKRKATSPNGVKIKSYTCDTLIP----------S 258
           ++     +  ES YP     +     CK+  T+   V+  SY    ++P           
Sbjct: 182 VE-KAGGMALESAYPYKSGTSGNTGRCKKFETAGGDVESFSY----VVPECKKGKCNDQD 236

Query: 259 ESSILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGSLAN-INHAVQIVGYDNYS 313
           E  +   +A+HGP    VNA  WQ Y  GV+    C    AN ++H VQ+VGY  Y+
Sbjct: 237 EDKMAAALASHGPASICVNAGAWQTYTKGVMTNLQCGSHAANALDHCVQVVGYTGYT 293


>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 334

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 89/281 (31%), Positives = 140/281 (49%), Gaps = 30/281 (10%)

Query: 37  FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFK 96
           F ++++ + KSYS +  +I  +   ++  ++ + + N     S   G+  F+DL+ EEFK
Sbjct: 30  FEAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAH-NGAGIHSYTLGMNIFADLTHEEFK 88

Query: 97  TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG----IPVKKDWREAGIIGK 152
             +L   V+ +                + RS  +   IPT     +P   DWR AGI+  
Sbjct: 89  RFYLGTKVDLN----------------RPRSNFSSTFIPTANVGALPDSVDWRTAGIVTP 132

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLD 211
           V++Q  CG+CW+FST  + E  HA K G L  LS Q ++DC+   GN GC+GG       
Sbjct: 133 VKDQGQCGSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQ 192

Query: 212 WMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
           ++  NK + + E+ YP   KD  CK  A +  G  + S+  D    SES +   +AT GP
Sbjct: 193 YIITNKGI-DTEASYPYTAKDGTCKFNAANV-GATLSSFQ-DITRGSESDLQNAVATVGP 249

Query: 272 VIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
           V  A++A   ++Q Y  GV  +  C  S  +++H V   GY
Sbjct: 250 VSVAIDASKNSFQLYTSGVYNEKKC--SSTSLDHGVLAAGY 288


>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
          Length = 366

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 88/299 (29%), Positives = 144/299 (48%), Gaps = 35/299 (11%)

Query: 19  LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSP 77
           LA+P K+        + LF+S+  ++ K Y+  +  + R++ F+++L  I E N+   S 
Sbjct: 45  LALPNKL--------VGLFTSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRRNGS- 95

Query: 78  ESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP 135
                G+  F+D++ EEFK  +L  +  + +     H      + N V            
Sbjct: 96  --YWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGSTTFRYANAVN----------- 142

Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
             +P   DWR+ G +  V+NQ  CG+CWAFSTV   E ++ +  G L  LS QE++DC  
Sbjct: 143 --LPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSEQELMDCDN 200

Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
             N GC GG       ++  N+ +   E +YP L+++  C+ K      + I  Y  D  
Sbjct: 201 TFNHGCRGGLMDFAFAYIMGNQGIYT-EEDYPYLMEEGYCREKQPHSKVITITGYE-DVP 258

Query: 256 IPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGYDNY 312
             SE+S+L  +A H PV   + A +  +Q+Y GG+    C       +HA+  VGY +Y
Sbjct: 259 ANSETSLLKALA-HQPVSVGIAAGSRDFQFYKGGIFDGECG---IQPDHALTAVGYGSY 313


>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
          Length = 365

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 91/304 (29%), Positives = 135/304 (44%), Gaps = 28/304 (9%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIE 68
           +  L  +  LA        N     E    +  RY + Y + +E + R   F+++L  I+
Sbjct: 12  LALLFTIGVLASLAAARSLNEASMTETHDQWMARYGRVYKTANEKNRRSTIFQENLKYIQ 71

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
             NK    P   + G+ EF+DL+ EEF T   R+    HV  +         N  +  ++
Sbjct: 72  TFNKANNKPY--KLGVNEFADLTNEEFTTS--RNKFKSHVCATVT-------NVFRYENV 120

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
           T        +P   DWR+ G +  ++NQ  CG CWAFS V   E +  LK G L  LS Q
Sbjct: 121 TA-------VPATMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQ 173

Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
           E++DC  NG + GC GG      D++  N   L  E+ YP    D  C     + +   I
Sbjct: 174 ELVDCDTNGEDQGCEGGLMDYAFDFIQQNH-GLSTETNYPYSGTDGTCNANKEANHAATI 232

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQ 305
             +  D    SES++L  +A   P+  A++A    +Q+Y  GV    C   L   +H V 
Sbjct: 233 TGHE-DVPANSESALLKAVANQ-PISVAIDASGSDFQFYSSGVFTGECGTEL---DHGVT 287

Query: 306 IVGY 309
            VGY
Sbjct: 288 AVGY 291


>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
          Length = 340

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 86/308 (27%), Positives = 142/308 (46%), Gaps = 38/308 (12%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--------SEHDIRFKNFEKSL 64
           L AL  L + +  S+    + L    S  +R+++  ++        +E   RF+ F  ++
Sbjct: 10  LPALALLIVAIWASQGEAGRSLGENKSMLERHEQWMAQHGRVYKNAAEKAHRFEIFRANV 69

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
           + IE  N         + G+ +F+DL+ EEFKTR+      K   M+  K   + +    
Sbjct: 70  ERIESFNAENHK---FKLGVNQFADLTNEEFKTRNTL----KPSKMASTKSFKYEN---- 118

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
                      T +P   DWR  G +  +++Q  CG+CWAFS V   E +  L  G L  
Sbjct: 119 ----------VTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLIS 168

Query: 185 LSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           LS QEV+DC   + + GC+GG+     +++  NK +   E+ YP    D  C  K  + +
Sbjct: 169 LSEQEVVDCDVTSDDQGCNGGEMDDAFEYIIKNKGIT-TEANYPYKAADGTCNTKKAASH 227

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
              I  Y  D  + SE+++L   A   P+  A++A    +Q Y  GV   +C     +++
Sbjct: 228 AASITGYE-DVTVNSEAALLKAAANQ-PIAVAIDAGDFAFQMYSSGVFTGDCG---TDLD 282

Query: 302 HAVQIVGY 309
           H V +VGY
Sbjct: 283 HGVTLVGY 290


>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
          Length = 421

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 139/281 (49%), Gaps = 29/281 (10%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           + FSSFQ  Y KSY ++ E   R+  F+ +L  I   N   Q   S    +  F DLS +
Sbjct: 114 DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHN---QQGYSYSLKMNHFGDLSRD 170

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGI--TIPTGIPVKKDWREAGIIG 151
           EF+ ++L    +++ L SHH              + T +   +P+ +P   DWR  G + 
Sbjct: 171 EFRRKYLGFKKSRN-LKSHH------------LGVATELLNVLPSELPAGVDWRSRGCVT 217

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALL 210
            V++Q+ CG+CWAFST    E  H  K G L  LS QE++DC+   GN  CSGG+     
Sbjct: 218 PVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAF 277

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            ++ ++   +  E  YP L +D  C R  +    VKI  +  D    SE+++   +A   
Sbjct: 278 QYV-LDSGGICSEDAYPYLARDEEC-RAQSCEKVVKILGFK-DVPRRSEAAMKAALAK-S 333

Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           PV  A+ A  + +Q+Y  GV   +C     +++H V +VGY
Sbjct: 334 PVSIAIEADQMPFQFYHEGVFDASCG---TDLDHGVLLVGY 371


>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
          Length = 352

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 92/285 (32%), Positives = 134/285 (47%), Gaps = 28/285 (9%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++E+ ++LF S+  ++ K Y   +  I RF+ F  +L  I+E NK   S      G+  F
Sbjct: 40  SIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNS---YWLGLNGF 96

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DLS +EFK +++         + H  + D  + HV            T  P   DWR  
Sbjct: 97  ADLSNDEFKKKYVGSVAEDFTGLEHFDNEDFTYKHV------------TNYPQSIDWRAK 144

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ +CG+CWAFST+ T E ++ +  G L  LS QE++DC  N + GC GG   
Sbjct: 145 GAVTPVKNQGSCGSCWAFSTIATVEGVNKIVTGNLLELSEQELVDCDKNSH-GCKGGYQT 203

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
             L ++  N V       YP   K   C  +AT   G K+K  T    +PS  E+S L  
Sbjct: 204 TSLQYVADNGV--HTSKVYPYQAKAMQC--RATDKPGPKVK-ITGYKRVPSNCETSFLGA 258

Query: 266 IATHG-PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A     V+       +Q Y  GV    C   L   +HAV  VGY
Sbjct: 259 LANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKL---DHAVTAVGY 300


>gi|118365744|ref|XP_001016092.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297859|gb|EAR95847.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 336

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 156/315 (49%), Gaps = 35/315 (11%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN--FEKSL 64
           +L I+ L+ LC LA  + V      +KL  ++ +  ++++ Y  +EH+  F+   F ++L
Sbjct: 6   LLSIIMLMPLC-LAQDISV------EKLLAYNKWSSQHQRVY-LNEHEKLFRQMVFFENL 57

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM----SHHKHHDHHH 120
             I+E N N  +  S    + +FSD+++EEF  + L  S     LM        H+D ++
Sbjct: 58  QKIQEHNNNPNNTYSVH--LNQFSDMTKEEFAEKILMKSDFVDHLMKGISQEATHNDTNN 115

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
           N  +  S    +T+   I    DWR  G +  V+NQ  CG+CW+FS     ES + ++N 
Sbjct: 116 NETQLSS--NSLTLADSI----DWRTKGAVTSVKNQGGCGSCWSFSAAAVMESFNFIQNK 169

Query: 181 TLSLLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
            L   S Q+++DC     G  + GC+GG     LD+   +KV +    +YP +     C 
Sbjct: 170 ALVDFSEQQLVDCVIPANGYNSYGCNGGWPVQCLDY--ASKVGITTLDKYPYVAVQNNCN 227

Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDG 295
              T+ NG K KS+     IP+ S+ L       PV   V+A TW  Y  G+  YN CD 
Sbjct: 228 VTGTN-NGFKPKSW---IQIPNTSNDLKSALNFSPVSVLVDASTWGNYYSGI--YNGCDQ 281

Query: 296 SLANINHAVQIVGYD 310
              ++NHAV  VGYD
Sbjct: 282 LHISLNHAVLAVGYD 296


>gi|388513209|gb|AFK44666.1| unknown [Lotus japonicus]
 gi|388514955|gb|AFK45539.1| unknown [Lotus japonicus]
          Length = 352

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 92/302 (30%), Positives = 149/302 (49%), Gaps = 38/302 (12%)

Query: 22  PVKVSKPNLEQKLEL---------FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN 71
           P+++     EQ L++         F+ F  +Y K Y S  E   RF+ F ++L++I+  N
Sbjct: 29  PIRLVSDLEEQVLQVIGQTRHAVSFARFASKYGKRYDSVEEIQHRFRIFSENLELIKSTN 88

Query: 72  KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITT 130
           K R S    + G+  F+DLS +EF+T+ L  + N    L+ +HK  D             
Sbjct: 89  KKRLS---YKLGLNHFADLSWDEFRTQKLGAAQNCSATLIGNHKLTD------------- 132

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
                  +P +KDWR+  I+ +V++Q  CG+CW FST    E+ +A  +G    LS Q++
Sbjct: 133 -----AVLPAEKDWRKESIVSEVKDQAHCGSCWTFSTTGALEAAYAQAHGKNISLSEQQL 187

Query: 191 IDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           +DCAG   N GC+GG      +++  N  +   E EYP   KD ACK  A +   V++  
Sbjct: 188 VDCAGAFNNFGCNGGLPSQAFEYIKYNGGI-ALEKEYPYTAKDEACKFTAENV-AVRVLD 245

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIV 307
            + +  + +E  +   +A   PV  A   +  ++ Y  GV   + C  +  ++NHAV  V
Sbjct: 246 -SVNITLGAEDELKHAVAFARPVSVAFQVVDGFRLYKEGVYTSDTCGNTPMDVNHAVLAV 304

Query: 308 GY 309
           GY
Sbjct: 305 GY 306


>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 419

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 145/309 (46%), Gaps = 31/309 (10%)

Query: 7   VLFIVALIALC---FLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKS 63
           +L I+  I LC    L+         +E+  +  + F + YK S  K++   RFK F+ +
Sbjct: 8   LLAIIGSICLCSSTVLSARELGDAAMVEKHEQWMAKFNRVYKDSTEKAQ---RFKAFKAN 64

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
           +  IE  N           G+ +F+DL+ +EF+       + ++   +  +     +N+V
Sbjct: 65  VAFIESFNTGNHK---FWLGVNQFTDLTNDEFRATKTNKGLKRNGARAPTRFK---YNNV 118

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
              ++          P   DWR  G++  +++Q  CG CWAFS V   E +  L  G L 
Sbjct: 119 STDAL----------PAAVDWRTKGVVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLV 168

Query: 184 LLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
            LS QE++DC  +G + GC GG+      ++ +    L  E+ YP   +D  CK   TS 
Sbjct: 169 SLSEQELVDCDVHGVDQGCEGGEMDNAFKFI-IKNGGLTTEANYPYTAQDGQCKTSTTSN 227

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
           +   IK Y  D     ESS++  +A   PV  AV+   + +Q+Y GGV+  +C     ++
Sbjct: 228 SVATIKGYE-DVPANDESSLMKAVANQ-PVSVAVDGGDVIFQHYSGGVMTGSCG---TDL 282

Query: 301 NHAVQIVGY 309
           +H +  +GY
Sbjct: 283 DHGIVAIGY 291


>gi|255538808|ref|XP_002510469.1| cysteine protease, putative [Ricinus communis]
 gi|223551170|gb|EEF52656.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 91/289 (31%), Positives = 141/289 (48%), Gaps = 47/289 (16%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F++F+ ++ K+Y ++ EHD RFK F+ +L       K++    SA +G+T+FSDL+  EF
Sbjct: 51  FTAFKAKFGKNYATQEEHDYRFKVFKANL---RRAQKHQLMDPSAVHGVTKFSDLTPREF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
           + ++L   + K  L +     D H   +          +PT GIP   DWR+ G +  V+
Sbjct: 108 RRQYL--GLKKLRLPA-----DAHEAPI----------LPTDGIPEDFDWRDHGAVTNVK 150

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
           NQ +CG+CW+FS     E  H L  G L  LS Q+++DC         G  + GC+GG  
Sbjct: 151 NQGSCGSCWSFSAAGALEGAHFLATGELVSLSEQQLVDCDHECDPTEYGACDSGCNGGLM 210

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACK----RKATSPNGVKIKSYTCDTLIPSESS 261
               +++ +    LE E +YP    D   CK    + A S N   + S         E  
Sbjct: 211 TNAFEYI-LKAGGLEREEDYPYTGSDRGPCKFERAKIAASVNNFSVVSV-------DEDQ 262

Query: 262 ILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           I  ++  +GP+   +NA+  Q Y+GGV   Y C       +H V +VGY
Sbjct: 263 IAANLVQNGPLAVGINAVFMQTYIGGVSCPYICS---KRQDHGVVLVGY 308


>gi|18414611|ref|NP_567489.1| Papain family cysteine protease [Arabidopsis thaliana]
 gi|2244977|emb|CAB10398.1| cysteine proteinase like protein [Arabidopsis thaliana]
 gi|7268368|emb|CAB78661.1| cysteine proteinase like protein [Arabidopsis thaliana]
 gi|14517442|gb|AAK62611.1| AT4g16190/dl4135w [Arabidopsis thaliana]
 gi|22136546|gb|AAM91059.1| AT4g16190/dl4135w [Arabidopsis thaliana]
 gi|22530956|gb|AAM96982.1| cysteine proteinase [Arabidopsis thaliana]
 gi|23397184|gb|AAN31875.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|110740834|dbj|BAE98514.1| cysteine proteinase like protein [Arabidopsis thaliana]
 gi|332658313|gb|AEE83713.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 373

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 92/305 (30%), Positives = 145/305 (47%), Gaps = 42/305 (13%)

Query: 22  PVK--VSKPNLEQKLEL---FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQ 75
           P++  V + N EQ L     F+ F+ +Y+K+Y ++ EHD RF+ F+ +L       +N+ 
Sbjct: 35  PIRQVVPEENDEQLLNAEHHFTLFKSKYEKTYATQVEHDHRFRVFKANL---RRARRNQL 91

Query: 76  SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP 135
              SA +G+T+FSDL+ +EF+ + L        L +                  T   +P
Sbjct: 92  LDPSAVHGVTQFSDLTPKEFRRKFLGLKRRGFRLPT---------------DTQTAPILP 136

Query: 136 TG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC- 193
           T  +P + DWRE G +  V+NQ  CG+CW+FS +   E  H L    L  LS Q+++DC 
Sbjct: 137 TSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCD 196

Query: 194 -------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGV 245
                  A + + GCSGG      ++  +    L  E +YP   +D  ACK   +    +
Sbjct: 197 HECDPAQANSCDSGCSGGLMNNAFEYA-LKAGGLMKEEDYPYTGRDHTACKFDKSK---I 252

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAV 304
                    +   E  I  ++  HGP+  A+NA+  Q Y+GGV   Y C  S    +H V
Sbjct: 253 VASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQTYIGGVSCPYVCSKSQ---DHGV 309

Query: 305 QIVGY 309
            +VG+
Sbjct: 310 LLVGF 314


>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
 gi|255640677|gb|ACU20623.1| unknown [Glycine max]
          Length = 366

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 87/316 (27%), Positives = 150/316 (47%), Gaps = 31/316 (9%)

Query: 4   VKNVLFIVALIALCF-LAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFK 58
           + ++  I  L+ L F L+  +K S        E+ + +++   R++K Y++  + D RF+
Sbjct: 1   MASMTMIYTLLFLSFTLSYAIKTSTIINYTDNEVMAMYEEWLVRHQKGYNELGKKDKRFQ 60

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
            F+ +L  I+E N N  +  + + G+ +F+D++ EE++  +L    N    +   K   H
Sbjct: 61  VFKDNLGFIQEHNNNLNN--TYKLGLNKFADMTNEEYRAMYLGTKSNAKRRLMKTKSTGH 118

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
            +    +  +          PV  DWR  G +  +++Q +CG+CWAFSTV T E+++ + 
Sbjct: 119 RYAFSARDRL----------PVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIV 168

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAAC 235
            G    LS QE++DC    N GC+GG    L+D+     +    ++ + +YP    D  C
Sbjct: 169 TGKFVSLSEQELVDCDRAYNEGCNGG----LMDYAFEFIIQNGGIDTDKDYPYRGFDGIC 224

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
                +   V I  Y  + + P + + L     H PV  A+ A     Q Y  GV    C
Sbjct: 225 DPTKKNAKVVNIDGY--EDVPPYDENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKC 282

Query: 294 DGSLANINHAVQIVGY 309
             SL   +H V +VGY
Sbjct: 283 GTSL---DHGVVVVGY 295


>gi|326434958|gb|EGD80528.1| hypothetical protein PTSG_01119 [Salpingoeca sp. ATCC 50818]
          Length = 389

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 96/302 (31%), Positives = 148/302 (49%), Gaps = 33/302 (10%)

Query: 31  EQKLE-LFSSFQQRYKKSYSK--SEHDIRFKNFEKSLDIIEELNKNR---QSPESARYGI 84
           E +L+ LF+SF + + + Y+   +EH  R + F +++ + ++ + +     +  +A +  
Sbjct: 48  EAQLDALFTSFVKDFGRLYASNATEHAFRRRVFARNVQLYQQRSASAATVSAGHTAVFKP 107

Query: 85  TEFSDLSEEEFK----TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
            +FSD + EEF+    TR +  ++      S   + +   N      + T   +   IP 
Sbjct: 108 DKFSDWTVEEFRALLGTRPVSTAIGNPRCASSPVNCELSTN------MNTNAALGLAIPD 161

Query: 141 KKDWR--EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS--LLSVQEVIDCAGN 196
             DWR    G+I  VR+Q  CG CWAFS VET E+   L   TL    LSVQ+++ C   
Sbjct: 162 AFDWRNDSRGVITAVRDQGQCGGCWAFSAVETVEASWVLSGHTLPEPKLSVQQILSCDTQ 221

Query: 197 GNMGCSGGD----FCALLDWMDVNKVVLEPESEYPLLLKDAACKRK-----ATSPNGVKI 247
            N GC GG     F  +LD  +  K  LEP++ +P    D  CK       A S   V I
Sbjct: 222 AN-GCHGGSISGAFTYVLDKSEQGKG-LEPDTAFPFKC-DKGCKNSLPQCPALSRPFVTI 278

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIV 307
            + TC      E  +L  +A +GP+   V+A  W  Y  G+++Y+C    A+ NHAVQIV
Sbjct: 279 NA-TCRCPKMKEKDMLAFVANYGPLAIQVDAEPWHGYSSGIMRYHCSSQPASANHAVQIV 337

Query: 308 GY 309
           GY
Sbjct: 338 GY 339


>gi|14349349|gb|AAC38833.2| cysteine protease [Leishmania chagasi]
          Length = 353

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 101/322 (31%), Positives = 156/322 (48%), Gaps = 43/322 (13%)

Query: 4   VKNVLFIV----ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFK 58
           V  +LF+V    ALIA   L +   ++  +       +  F++R+ K + + +E   RF 
Sbjct: 11  VVTILFVVCYGSALIAQTPLGVDDFIASAH-------YGRFKKRHGKPFGEDAEEGRRFN 63

Query: 59  NFEKSLDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
            F++++     LN +      A Y ++ +F+DL+ +EF   +L    N +    H K + 
Sbjct: 64  AFKQNMQTAYFLNAHN---PHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYK 116

Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
            H  HV   S+ +G+       +  DWRE G++  V+NQ  CG+CWAF+T    E   AL
Sbjct: 117 EH-VHVDD-SVRSGV-------MSVDWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWAL 167

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM--DVNKVVLEPESEYPLLLKDAAC 235
           KN +L  LS Q ++ C  N + GC+GG     + W+  D N  V   E  YP     A  
Sbjct: 168 KNHSLVSLSEQVLVSCD-NIDDGCNGGLMQQAMQWIINDHNGTV-PTEDSYP--YTSAGG 223

Query: 236 KRKATSPN---GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
            R     N   G KI  Y   +L   E  I   +  +GPV  AV+A TWQ Y GGV+   
Sbjct: 224 TRPPCHDNGTVGAKIAGYM--SLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVVTL- 280

Query: 293 CDGSLANINHAVQIVGYDNYSR 314
           C G   ++NH V +VG++  ++
Sbjct: 281 CFG--LSLNHGVLVVGFNRQAK 300


>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
 gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
          Length = 2676

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 87/287 (30%), Positives = 141/287 (49%), Gaps = 36/287 (12%)

Query: 32   QKLELFSSFQQRYKKSYSKSEHDIR--FKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
            Q   LF  F   YK  Y    H +R  F+ F++++  + ELN + +   +A YG+T F+D
Sbjct: 2366 QAEHLFYEFLSTYKPEYIDDRHQMRQRFEIFKENVRKMHELNTHERG--TATYGVTRFAD 2423

Query: 90   LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK-KRSITTGITIPTGIPVKKDWREAG 148
            L+ EEF T+H+             K      N V+ ++++   +T P       DWR+ G
Sbjct: 2424 LTYEEFSTKHM-----------GMKASLRDPNQVQFRKAVIPNVTAPDSF----DWRDHG 2468

Query: 149  IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
             +  V++Q +CG+CWAFS     E    +K G L  LS QE++DC    + GC+GG    
Sbjct: 2469 AVTGVKDQGSCGSCWAFSVTGNIEGQWKMKTGDLVSLSEQELVDC-DKLDQGCNGG---- 2523

Query: 209  LLD--WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
            L D  +  + ++  LE E +YP    D  C    T     +++      +  +E+ +   
Sbjct: 2524 LPDNAYRAIEQLGGLESEDDYPYEGSDDKCSFNKTL---ARVQISGAVNITSNETDMAKW 2580

Query: 266  IATHGPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGY 309
            +  HGP+   +NA   Q+Y+GG+    +  C+ S  N++H V IVGY
Sbjct: 2581 LVKHGPISIGINANAMQFYMGGISHPWRMLCNPS--NLDHGVLIVGY 2625


>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 91/304 (29%), Positives = 135/304 (44%), Gaps = 26/304 (8%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
           +  L  L F A  V           E    +  RY K Y    E + RFK F+++++ IE
Sbjct: 12  LALLFCLGFWAFQVTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIE 71

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
             N     P   + GI +F+DL+ EEF             +   +K   H  + +  R+ 
Sbjct: 72  AFNNAADKP--YKLGINQFADLTNEEF-------------IAPRNKFKGHMCSSIT-RTT 115

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
           T      T +P   DWR+ G +  +++Q  CG CWAFS V   E +HAL +G L  LS Q
Sbjct: 116 TFKYENVTALPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQ 175

Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
           EV+DC   G + GC+GG       ++  N   L  E+ YP    D  C     + +   I
Sbjct: 176 EVVDCDTKGEDQGCAGGFMDGAFKFIIQNH-GLNTEANYPYKAVDGKCNANEAANHAATI 234

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
             Y  D  + +E ++   +A   PV  A++A    +Q+Y  GV   +C   L   +H V 
Sbjct: 235 TGYE-DVPVNNEKALQKAVANQ-PVSVAIDASGSDFQFYKTGVFTGSCGTQL---DHGVT 289

Query: 306 IVGY 309
            VGY
Sbjct: 290 AVGY 293


>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 92/310 (29%), Positives = 146/310 (47%), Gaps = 36/310 (11%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSY-SKSEHDIRFKNFEKSL 64
           +I+AL  L  + I   +S+   E +  L    +Q   +Y K Y   +E + RF  F+ ++
Sbjct: 10  YILALFLLLAVGISRVISRELHETETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNV 69

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRH--LRHSVNKHVLMSHHKHHDHHHNH 122
           + IE  N     P   + G+   +DL+ EEFK     L+ S +  V  +  K+ +     
Sbjct: 70  EFIESFNAAGNKP--YKLGVNHLADLTIEEFKASRNGLKRSYDYEVGTTSFKYEN----- 122

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
                        T IP   DWR+ G +  +++Q  CG+CWAFSTV   E +H +  G L
Sbjct: 123 ------------VTAIPASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKL 170

Query: 183 SLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
             LS QE++DC   G + GC GG      +++  N  +   E+ YP    D +CK  AT+
Sbjct: 171 VSLSEQELVDCDRKGTDQGCEGGYMEDGFEFIIKNGGIT-TEANYPYKAVDGSCKN-ATA 228

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLAN 299
           P   +IK Y     + SE ++L  +A   PV  +++A   ++ +Y  G+    C   L  
Sbjct: 229 P-AAQIKGYE-KVPVNSEKALLKAVANQ-PVSVSIDAADGSFMFYSSGIFTGECGTEL-- 283

Query: 300 INHAVQIVGY 309
            +H V  VGY
Sbjct: 284 -DHGVTAVGY 292


>gi|281206749|gb|EFA80934.1| counting factor associated protein [Polysphondylium pallidum PN500]
          Length = 530

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 83/278 (29%), Positives = 142/278 (51%), Gaps = 25/278 (8%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F+  Y K Y+   EH  RF  ++++ ++I  +  N Q   S +  +  F D++ EEF
Sbjct: 227 FEQFKTTYDKVYAHDEEHSERFATYKQNREMI--IAHNTQE-SSYKLAMNHFGDMTAEEF 283

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           + + ++  V +      H  HD+       R+I         +P   DWR+ G + +V++
Sbjct: 284 ELK-IKPRVPRPDTNGAHDVHDN------DRTIN--------LPATVDWRQQGCVTRVKD 328

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMD 214
           Q  CG+CW F +  + E +  L  G L  LS Q+++DCA  G + GC+GG       ++ 
Sbjct: 329 QGVCGSCWTFGSTGSLEGVSCLATGKLVSLSEQQLVDCAYLGQSQGCNGGFASDAFQYI- 387

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
           +N   +  ES YP L+++  CK  ++  + +K+KSY   T   SE ++   +AT GPV  
Sbjct: 388 MNFGGIAYESTYPYLMQNGYCKDSSSQLSNIKVKSYVNVTSF-SEPALQNAVATVGPVAI 446

Query: 275 AVNALT--WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
           A++A    +++Y  GV   + C   L +++H V  VGY
Sbjct: 447 AIDASAPDFRFYSSGVYYSSVCKNGLDDLDHEVLAVGY 484


>gi|33333712|gb|AAQ11974.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 91/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSE 92
           E +  F+  + K+Y S  E   RF  F+K+L  I+E NK   +  ES    +T+F+D++ 
Sbjct: 21  EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTH 80

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF        V    L S+  H D+  +          I +     V  DWRE G +  
Sbjct: 81  EEFLDLLKLQGV--PALPSNAVHFDNFED----------IDMEEKDAV--DWREEGAVTP 126

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
           V++Q  CG+CWAFS V   E     KNGTL  LS QE++DCA    GN GC GG      
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAF 186

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           D+  V    ++ E  YP   + ++CK+       VK   +  D     E  +   +A  G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGEYVTKVKTYVFPLD-----EQEMARTVAAKG 239

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
           PV  A+ A    +Y  G++   C  S    ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGY 280


>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 91/304 (29%), Positives = 135/304 (44%), Gaps = 26/304 (8%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
           +  L  L F A  V           E    +  RY K Y    E + RFK F+++++ IE
Sbjct: 12  LALLFCLGFWAFQVTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIE 71

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
             N     P   + GI +F+DL+ EEF     R+    H+  S  +     + +V     
Sbjct: 72  AFNNAANKP--YKLGINQFADLTNEEFIAP--RNRFKGHMCSSITRTTTFKYENV----- 122

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                  T +P   DWR+ G +  +++Q  CG CWAFS V   E +HAL +G L  LS Q
Sbjct: 123 -------TALPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQ 175

Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
           EV+DC   G + GC+GG       ++  N   L  E+ YP    D  C     + +   I
Sbjct: 176 EVVDCDTKGEDQGCAGGFMDGAFKFIIQNH-GLNTEANYPYKAVDGKCNANEAANHAATI 234

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
             Y  D  + +E ++   +A   PV  A++A    +Q+Y  GV   +C   L   +H V 
Sbjct: 235 TGYE-DVPVNNEKALQKAVANQ-PVSVAIDASGSDFQFYKTGVFTGSCGTQL---DHGVT 289

Query: 306 IVGY 309
            VGY
Sbjct: 290 AVGY 293


>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
          Length = 350

 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 86/287 (29%), Positives = 138/287 (48%), Gaps = 21/287 (7%)

Query: 28  PNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNK-NRQSPESARYGITE 86
           PNL    +L+  F+  ++++Y ++E   R + F  +L  I+  N  + Q     R GI +
Sbjct: 34  PNLVPFEKLWQDFKTVHERTYGETEESQRKEVFRNNLKKIQAHNHLHEQGKSPYRMGINQ 93

Query: 87  FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
           F+D+   EF +      +N    +  H H ++               IP  +P + DWR+
Sbjct: 94  FADMEANEFASIMNGFRMNNRTEVRDHLHANY-----------ISPAIPVSVPAEVDWRK 142

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGD 205
            G +  V+NQ  CG+CWAFST  + E  H  K G L  LS Q ++DC+ + GN GC+GG 
Sbjct: 143 EGYVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGI 202

Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
                 ++  N    + E+ YP    D  C+ K+    G     YT D     E+ +   
Sbjct: 203 VDYAFQYIKDNDGD-DTEACYPYEAVDGTCRFKSVCV-GATCTGYT-DLPKGDEAKMKEA 259

Query: 266 IATHGPVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           +A  GPV  A++A   ++Q Y  G+ ++  C  S   ++HAV +VGY
Sbjct: 260 VALVGPVSVAIDASHSSFQMYQSGIYVEQEC--SPKQLDHAVLVVGY 304


>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
          Length = 300

 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 89/273 (32%), Positives = 132/273 (48%), Gaps = 32/273 (11%)

Query: 43  RYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR 101
           ++ KSY   E  + RF+ F+ +L  I+E NK   S      G+ EF+DLS EEFK ++L 
Sbjct: 3   KHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSS---YWLGLNEFADLSHEEFKRKYL- 58

Query: 102 HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGA 161
                 + +   K  D       K            +P   DWR+ G +  V+NQ  CG+
Sbjct: 59  -----GLKIELPKRRDSPEEFSYKDVAD--------LPKSVDWRKKGAVAHVKNQGACGS 105

Query: 162 CWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKV 218
           CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG    L+D+     ++  
Sbjct: 106 CWAFSTVAAVEGINQIVTGNLTALSEQELIDCDKPFNNGCNGG----LMDYAFAFIISNG 161

Query: 219 VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA 278
            L  E +YP ++++  C  K      V I  Y  D    +E S L  +A   P+  A+ A
Sbjct: 162 GLRKEEDYPYVMEEGTCGEKKEELEVVTISGYH-DVPEDNEQSFLKALANQ-PLSVAIEA 219

Query: 279 LT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +  +Q+Y GG+   +C   L   +H V  VGY
Sbjct: 220 SSRGFQFYSGGIFNGHCGTEL---DHGVAAVGY 249


>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
          Length = 333

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 88/302 (29%), Positives = 139/302 (46%), Gaps = 30/302 (9%)

Query: 13  LIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
           L ALC   + +  + P L Q L EL+S ++  + K Y   E   R + ++K++ +I + N
Sbjct: 7   LAALC---LGIASAAPQLNQSLDELWSQWKATHGKLYGMDEEGWRREVWKKNMKMIRQHN 63

Query: 72  -KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
            ++ Q   S    +  F D++ EEFK       + KH                 K+    
Sbjct: 64  WEHSQGKHSFTVAMNGFGDMTNEEFKQVMNGLQMQKH-----------------KKGKMF 106

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
              +   IP   DWRE G +  V++Q  CG+CWAFS     E     K G L  LS Q +
Sbjct: 107 QAPLFAKIPSSVDWREKGYVTPVKDQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166

Query: 191 IDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           +DC+   GN GC+GG       ++  N   L+ E  YP   +D +CK K   P       
Sbjct: 167 VDCSQAEGNEGCNGGLMNNAFQYVKDNG-GLDSEESYPYHAQDESCKYK---PQDSAAND 222

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
                +   E +++  +AT GP+   ++A   T+Q+Y  G I Y+ D S  +++H V ++
Sbjct: 223 TGFFDIPQQEKALMVAVATKGPISVGIDASHFTFQFYHEG-IYYDPDCSSEDLDHGVLVI 281

Query: 308 GY 309
           GY
Sbjct: 282 GY 283


>gi|33333706|gb|AAQ11971.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 91/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSE 92
           E +  F+  + K+Y S  E   RF  F+K+L  I+E NK   +  ES    +T+F+D++ 
Sbjct: 21  EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTH 80

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF        V    L S+  H D+  +          I +     V  DWRE G +  
Sbjct: 81  EEFLDLLKLQGV--PALPSNAVHFDNFED----------IDMEEKDAV--DWREEGAVTP 126

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
           V++Q  CG+CWAFS V   E     KNGTL  LS QE++DCA    GN GC GG      
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAF 186

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           D+  V    ++ E  YP   + ++CK+       VK   +  D     E  +   +A  G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGEYVTKVKTYVFPLD-----EQEMARTVAAKG 239

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
           PV  A+ A    +Y  G++   C  S    ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGY 280


>gi|577617|gb|AAC37213.1| cysteine proteinase [Trypanosoma cruzi]
          Length = 467

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 91/322 (28%), Positives = 145/322 (45%), Gaps = 40/322 (12%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
           M      L + A++ +    +P   +  + E+ L   F+ F+Q++ + Y S +E   R  
Sbjct: 1   MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYGSAAEEAFRLS 60

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
            F  +L  +  L+    +   A +G+T FSDL+ EEF++R+             H    H
Sbjct: 61  VFRANL-FLARLHA--AANPHATFGVTPFSDLTREEFRSRY-------------HNGAAH 104

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
                ++  +   + +  G P  KDWRE G +  V+NQ  CG+CWAF+ +   E    L 
Sbjct: 105 FAAAEERARVPVDVEV-VGAPAAKDWREEGAVTAVKNQGICGSCWAFAAIGNIEGQWFLA 163

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYP--------LL 229
              L+ LS Q ++ C  N N GC GG      +W+   N   +  E  YP        L 
Sbjct: 164 GNPLTRLSEQMLVSC-DNTNSGCGGGLSSKAFEWIVQENNGAVYTEDSYPYHSCIGIKLP 222

Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI 289
            KD+     AT    V++           E+ I    A  GP+  AV+A +W +Y GGV+
Sbjct: 223 CKDSDRTVGATITGHVELPQ--------DEAQIAASGAVKGPLSVAVDASSWFFYTGGVL 274

Query: 290 QYNCDGSLANINHAVQIVGYDN 311
             NC      ++HAV +VGY++
Sbjct: 275 T-NCVSK--RLSHAVLLVGYND 293


>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
          Length = 388

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 94/309 (30%), Positives = 155/309 (50%), Gaps = 31/309 (10%)

Query: 9   FIVALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
            +V L++LC+ LA+   +    L++  EL+ ++ Q   KSY K+E   R   +E++L +I
Sbjct: 53  LLVCLLSLCWGLAVSAPLGDSELDKHWELWKNWHQ---KSYHKAEEGWRRMVWEENLKVI 109

Query: 68  EELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           E  N  +     + + G+ +F DL+ EEF+           +L+S  + H    N +   
Sbjct: 110 ELHNLEQSLGLHTYQLGMNQFGDLTNEEFQ----------QMLIS--ERHFSEGNRINGS 157

Query: 127 SI--TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
           +      + +PT +    DWR+ G +  V+NQ  CG+CWAFST    E     K+G L  
Sbjct: 158 AFLEVNYVQVPTSV----DWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLVS 213

Query: 185 LSVQEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAA-CKRKATSP 242
           LS Q ++DC+   GN GC+GG       ++  N+ + + E  YP   KD A C  K    
Sbjct: 214 LSEQNLVDCSWQQGNQGCNGGIVDFAFQYILENRGI-DSEDCYPYTAKDTAQCAFKPECA 272

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
              ++  +  D    SE +++  +AT GPV  A++A   ++++Y  G+  Y    S   +
Sbjct: 273 T-ARVTGFV-DIPPHSEEALMKAVATVGPVSVAIDAHPTSFRFYQSGIF-YEPKCSSERL 329

Query: 301 NHAVQIVGY 309
           NHAV +VGY
Sbjct: 330 NHAVLVVGY 338


>gi|94556727|gb|ABF46642.1| papain-like cysteine proteinase [Pachysandra terminalis]
          Length = 374

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 86/285 (30%), Positives = 138/285 (48%), Gaps = 38/285 (13%)

Query: 37  FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FSSF++R+ K+Y+   EHD RF  F+ +L       +N+    SA +G+T+F DL+  EF
Sbjct: 58  FSSFKKRFGKAYTSCDEHDRRFGVFKANL---RRAKRNQILDPSAVHGVTQFFDLTPAEF 114

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           +  +L        L       D H   +          +PT  +P   DWR+ G +  V+
Sbjct: 115 RRTYLG-------LKRLRLPADTHEAPI----------LPTNDLPADFDWRDHGAVTPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FS     E  + L  G L  LS Q+++DC          + + GC+GG  
Sbjct: 158 NQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHVCDSEDPSSCDSGCNGGLM 217

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
            +  ++  +    LE E +YP    D + CK   T    + + +     +   E+ I  +
Sbjct: 218 TSAFEYT-LKAGGLEREEDYPYTGTDHSKCKFDKTK---IAVSASNFSVVSLDENQIAAN 273

Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           + T+GP+   +NA+  Q Y+GGV   Y C   L  ++H V +VGY
Sbjct: 274 LVTNGPLAIGINAMFMQTYIGGVSCPYICSKRL--LDHGVLLVGY 316


>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
          Length = 331

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 88/286 (30%), Positives = 134/286 (46%), Gaps = 53/286 (18%)

Query: 30  LEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
           +++ +  F S+  ++ K Y   E  + RF+ F ++L+ I+E NK   S      G+ EF+
Sbjct: 42  IDKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSS---YWLGLNEFA 98

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
           DLS EEFK++ +                                     +P   DWR+ G
Sbjct: 99  DLSHEEFKSKDV-----------------------------------ADLPESVDWRKKG 123

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
            +  V+NQ  CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC+GG    
Sbjct: 124 AVTHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGG---- 179

Query: 209 LLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
           L+D+      +   L  E +YP L+++  C+ +    + V I  Y  D     E S+L  
Sbjct: 180 LMDYAFAFIASNGGLHKEDDYPYLMEEGTCEEQKEDVDIVTISGYE-DVPEKDEESLLKA 238

Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A H P+  A+ A    +Q+Y GGV    C   L   +H V  VGY
Sbjct: 239 LA-HQPLSVAIEASGRDFQFYSGGVFNGPCGTEL---DHGVAAVGY 280


>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
          Length = 379

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 96/325 (29%), Positives = 144/325 (44%), Gaps = 55/325 (16%)

Query: 14  IALCFLAIPVKVSKPNLEQ----KLEL-----------FSSFQQRYKKSYSKSE-HDIRF 57
           I LC L +   +    L Q    KLEL           F  F + Y K YS +E + +R 
Sbjct: 17  IFLCALTLSSSLHHETLIQDVARKLELKDNDLLTTEKKFKLFMKDYSKKYSTTEEYLLRL 76

Query: 58  KNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
             F K  ++++        P +A +G+T+FSDLSEEEF+                 + + 
Sbjct: 77  GIFAK--NMVKAAEHQALDP-TAIHGVTQFSDLSEEEFE-----------------RFYT 116

Query: 118 HHHNHVKKRSITTGITIP---TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
                    +   G+  P    G P   DWRE G +  ++ Q  CG+CWAF+T  + E  
Sbjct: 117 GFKGGFPSSNAAGGVAPPLDVKGFPENFDWREKGAVTGIKTQGKCGSCWAFTTTGSIEGA 176

Query: 175 HALKNGTLSLLSVQEVIDCAGNGNM-------GCSGGDFCALLDWMDVNKVVLEPESEYP 227
           + L  G L  LS Q+++DC    ++       GC+GG      D++ +    LE E+ YP
Sbjct: 177 NFLATGKLVSLSEQQLVDCDNKCDITKTSCDNGCNGGLMTTAYDYL-MEAGGLEEETSYP 235

Query: 228 LLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGG 287
                  CK     PN V ++      +   E+ I   +  HGP+  AVNA+  Q Y+GG
Sbjct: 236 YTGAQGECK---FDPNKVAVRVSNFTNIPADENQIAAYLVNHGPLAIAVNAVFMQTYVGG 292

Query: 288 VIQYNCD--GSLANINHAVQIVGYD 310
           V   +C    S   +NH V +VGY+
Sbjct: 293 V---SCPLICSKRRLNHGVLLVGYN 314


>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
          Length = 1785

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 88/281 (31%), Positives = 142/281 (50%), Gaps = 33/281 (11%)

Query: 37   FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
            F  F+  +++ Y+ S EH++R+  F  +L  I++LN++ +   + +YG+T+F+D++  E+
Sbjct: 1478 FEKFKLHHQRQYASSFEHEMRYNIFRNNLYKIDQLNRHERG--TGKYGVTKFADMTTAEY 1535

Query: 96   KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
            +  H    V K            H NH++   I T  T  T +P   DWR+ G +  V+N
Sbjct: 1536 RA-HTGLIVPKQ-----------HSNHIRN-PIATVSTERTSLPTSFDWRDHGAVTGVKN 1582

Query: 156  QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD- 214
            Q  CG+CWAFS +   E +H +K   L   S QE+IDC    N GC+GG       +MD 
Sbjct: 1583 QGNCGSCWAFSAIGNIEGLHQIKTKKLEAYSEQELIDCDTVDN-GCNGG-------YMDD 1634

Query: 215  ----VNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
                + K+  LE E EYP   K         + + V++K      +  +E+ I   +  +
Sbjct: 1635 AFKAIEKLGGLELEDEYPYQAKAQKTCHFNKTLSHVRVKGAV--DMPKNETFIAQYLIEN 1692

Query: 270  GPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
            GP+   +NA   Q+Y GG+   ++   S   I+H V IVGY
Sbjct: 1693 GPIAIGLNANAMQFYRGGISHPWHLLCSHKQIDHGVLIVGY 1733


>gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera]
          Length = 377

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 92/285 (32%), Positives = 142/285 (49%), Gaps = 39/285 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F++R+ KSY S+ EHD RFK F+ +L       +++Q   SA +G+T+FSDL+  EF
Sbjct: 62  FSIFKRRFGKSYASQEEHDYRFKVFKANL---RRARRHQQLDPSATHGVTQFSDLTPAEF 118

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           +  +L        L      HD      +K  I     +PT  +P   DWR+ G +  V+
Sbjct: 119 RGTYLG-------LRPLKLPHD-----AQKAPI-----LPTNDLPEDFDWRDHGAVTAVK 161

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FST    E  + L  G L  LS Q++++C         G+ + GC+GG  
Sbjct: 162 NQGSCGSCWSFSTTGALEGANFLATGNLVSLSEQQLVECDHECDPEEMGSCDSGCNGGLM 221

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
               ++  +    L  E +YP    D  +CK   T      + +++  +L   E  I  +
Sbjct: 222 NTAFEYT-LKAGGLMKEEDYPYTGTDRGSCKFDKTKI-AASVSNFSVISL--DEDQIAAN 277

Query: 266 IATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           +   GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 278 LVKIGPLAVAINAVFMQTYVGGVSCPYICSKRL---DHGVLLVGY 319


>gi|223648298|gb|ACN10907.1| Cathepsin F precursor [Salmo salar]
          Length = 474

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 97/296 (32%), Positives = 143/296 (48%), Gaps = 37/296 (12%)

Query: 22  PVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESA 80
           PV+ S  ++E  L  F  F  RY ++YS + E D R + F ++L   E+L    Q   +A
Sbjct: 162 PVEESVDSVEL-LGQFKEFMVRYNRTYSSQEEADRRLRVFHENLKTAEKLQSLDQG--TA 218

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IP 139
            YG+T+FSDL+EEEF+T +L   +++  L    K                   +P G  P
Sbjct: 219 EYGVTKFSDLTEEEFRTLYLNPLLSQQNLQQSMKP----------------AAMPRGPAP 262

Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNM 199
              DWRE G +  V+NQ  CG+CWAFS     E     K G L  LS QE++DC    + 
Sbjct: 263 PSWDWREHGAVSPVKNQGMCGSCWAFSVTGNIEGQWFAKTGKLVSLSEQELVDC-DTVDQ 321

Query: 200 GCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT--LI 256
            C GG       +  + K+  LE E++Y    K  +C          K+ +Y   +  L 
Sbjct: 322 ACGGG--LPSNAYEAIEKLGGLETETDYSYTGKKQSCDFTTD-----KVIAYINSSVELS 374

Query: 257 PSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
             E+ I   +A +GPV  A+NA   Q+Y  GV   ++  C+  +  I+HAV +VGY
Sbjct: 375 TDENEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLLVGY 428


>gi|398014254|ref|XP_003860318.1| cysteine peptidase A (CBA) [Leishmania donovani]
 gi|13518086|gb|AAK27384.1| cysteine proteinase-like protein [Leishmania donovani]
 gi|322498538|emb|CBZ33611.1| cysteine peptidase A (CBA) [Leishmania donovani]
          Length = 354

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 101/322 (31%), Positives = 156/322 (48%), Gaps = 43/322 (13%)

Query: 4   VKNVLFIV----ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFK 58
           V  +LF+V    ALIA   L +   ++  +       +  F++R+ K + + +E   RF 
Sbjct: 12  VVTILFVVCYGSALIAQTPLGVDDFIASAH-------YGRFKKRHGKPFGEDAEEGRRFN 64

Query: 59  NFEKSLDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
            F++++     LN +      A Y ++ +F+DL+ +EF   +L    N +    H K + 
Sbjct: 65  AFKQNMQTAYFLNAHN---PHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYK 117

Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
            H  HV   S+ +G+       +  DWRE G++  V+NQ  CG+CWAF+T    E   AL
Sbjct: 118 EH-VHVDD-SVRSGV-------MSVDWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWAL 168

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM--DVNKVVLEPESEYPLLLKDAAC 235
           KN +L  LS Q ++ C  N + GC+GG     + W+  D N  V   E  YP     A  
Sbjct: 169 KNHSLVSLSEQVLVSCD-NIDDGCNGGLMEQAMQWIINDHNGTV-PTEDSYP--YTSAGG 224

Query: 236 KRKATSPN---GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
            R     N   G KI  Y   +L   E  I   +  +GPV  AV+A TWQ Y GGV+   
Sbjct: 225 TRPPCHDNGTVGAKIAGYM--SLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVVTL- 281

Query: 293 CDGSLANINHAVQIVGYDNYSR 314
           C G   ++NH V +VG++  ++
Sbjct: 282 CFG--LSLNHGVLVVGFNRQAK 301


>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
 gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 97/316 (30%), Positives = 143/316 (45%), Gaps = 38/316 (12%)

Query: 6   NVLFIVAL-IALCFLAIPVKVSKPNLEQKL--ELFSSFQQRYKKSYSK-SEHDIRFKNFE 61
           N L+ ++L +  C     ++V+   L+     E    +   Y K Y    E + RFK F 
Sbjct: 5   NQLYHISLALVFCLGLWAIQVTSRTLQDGSMHERHERWMNHYGKVYKDHQEREKRFKIFT 64

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++  IE  N N  + ES + GI +F+DL+ EEF             + S +K   H  +
Sbjct: 65  ENMKYIEAFN-NGDNNESYKLGINQFADLTNEEF-------------VASRNKFKGHMCS 110

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
            +  R+ T      + IP   DWR+ G +  V+NQ  CG CWAFS V   E +H L  G 
Sbjct: 111 SII-RTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGK 169

Query: 182 LSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAAC 235
           L  LS QE++DC   G + GC GG    L+D  D  K +     L  E++YP    D  C
Sbjct: 170 LVSLSEQELVDCDTKGVDQGCEGG----LMD--DAFKFIIQNHGLNTEAQYPYQGVDGTC 223

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
                S     I  Y  D    +E ++   +A   P+  A++A    +Q+Y  GV   +C
Sbjct: 224 NANKASIQATTITGYE-DVPANNEQALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSC 281

Query: 294 DGSLANINHAVQIVGY 309
              L   +H V  VGY
Sbjct: 282 GTEL---DHGVTAVGY 294


>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
          Length = 357

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 88/299 (29%), Positives = 144/299 (48%), Gaps = 35/299 (11%)

Query: 19  LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSP 77
           LA+P K+        + LF+S+  ++ K Y+  +  + R++ F+++L  I E N+   S 
Sbjct: 36  LALPNKL--------VGLFTSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRRNGS- 86

Query: 78  ESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP 135
                G+  F+D++ EEFK  +L  +  + +     H      + N V            
Sbjct: 87  --YWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGSTTFRYANAVN----------- 133

Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
             +P   DWR+ G +  V+NQ  CG+CWAFSTV   E ++ +  G L  LS QE++DC  
Sbjct: 134 --LPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSEQELMDCDN 191

Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
             N GC GG       ++  N+ +   E +YP L+++  C+ K      + I  Y  D  
Sbjct: 192 TFNHGCRGGLMDFAFAYIMGNQGIYT-EEDYPYLMEEGYCREKQPHSKVITITGYE-DVP 249

Query: 256 IPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGYDNY 312
             SE+S+L  +A H PV   + A +  +Q+Y GG+    C       +HA+  VGY +Y
Sbjct: 250 ENSETSLLKALA-HQPVSVGIAAGSRDFQFYKGGIFDGECG---IQPDHALTAVGYGSY 304


>gi|15824704|gb|AAL09448.1| cysteine protease [Leishmania donovani]
          Length = 353

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 101/322 (31%), Positives = 156/322 (48%), Gaps = 43/322 (13%)

Query: 4   VKNVLFIV----ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFK 58
           V  +LF+V    ALIA   L +   ++  +       +  F++R+ K + + +E   RF 
Sbjct: 11  VVTILFVVCYGSALIAQTPLGVDDFIASAH-------YGRFKKRHGKPFGEDAEEGRRFN 63

Query: 59  NFEKSLDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
            F++++     LN +      A Y ++ +F+DL+ +EF   +L    N +    H K + 
Sbjct: 64  AFKQNMQTAYFLNAHN---PHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYK 116

Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
            H  HV   S+ +G+       +  DWRE G++  V+NQ  CG+CWAF+T    E   AL
Sbjct: 117 EH-VHVDD-SVRSGV-------MSVDWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWAL 167

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM--DVNKVVLEPESEYPLLLKDAAC 235
           KN +L  LS Q ++ C  N + GC+GG     + W+  D N  V   E  YP     A  
Sbjct: 168 KNHSLVSLSEQVLVSCD-NIDDGCNGGLMEQAMQWIINDHNGTV-PTEDSYP--YTSAGG 223

Query: 236 KRKATSPN---GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN 292
            R     N   G KI  Y   +L   E  I   +  +GPV  AV+A TWQ Y GGV+   
Sbjct: 224 TRPPCHDNGTVGAKIAGYM--SLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVVTL- 280

Query: 293 CDGSLANINHAVQIVGYDNYSR 314
           C G   ++NH V +VG++  ++
Sbjct: 281 CFG--LSLNHGVLVVGFNRQAK 300


>gi|155966155|gb|ABU41032.1| cysteine proteinase [Lepeophtheirus salmonis]
          Length = 372

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 156/313 (49%), Gaps = 34/313 (10%)

Query: 14  IALCFL----AIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIE 68
           + +CFL    A+    S P  +++++ F SF + Y KSY +++   ++ K F  +L  IE
Sbjct: 1   LGVCFLFGLAALAAGTSSPT-QREIQEFESFVKEYSKSYHNRALRSLKLKVFVDNLREIE 59

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
           E N N +   +   GI EFSDL++EEF+++++ +S      MS            K+ +I
Sbjct: 60  EHNANPK--RTWDMGINEFSDLTDEEFESKYMGYSP-----MSSSAGLVTRTAAPKQGNI 112

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS--LLS 186
                    +P   DWRE G+I  V+NQ +CG+CW FS VE  ES  A++N   S  LLS
Sbjct: 113 KD-------LPESVDWREKGVITDVKNQGSCGSCWVFSAVEQIESYVAIENNMTSPPLLS 165

Query: 187 VQEVIDCAGNGNMGCSGGDFCALLD---WMDVNKVVLEPESEYP----LLLKDAACKRKA 239
            Q++  C+ N       G     ++   +M      +E E EYP       +   C   A
Sbjct: 166 TQQITSCSSNPYSCGGSGGCKGAINEIAYMYTQLYGIETEKEYPYTSGFTEESGECLYNA 225

Query: 240 TSPNGVKIKSYTCDTLIPSES-SILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
           +S  G        + L P++  S++  +A  GP+  +V A  ++ Y  G++   CD + A
Sbjct: 226 SSVTGKMAHVRGYEVLPPNDMYSVMEHLANKGPLGVSVYAGRFKSYKSGILN-GCDFN-A 283

Query: 299 N--INHAVQIVGY 309
           N  INHA+Q++GY
Sbjct: 284 NIVINHAIQMIGY 296


>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 359

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 90/278 (32%), Positives = 134/278 (48%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY KSY  +E    RF  F  SL +I   NK   S      G+ EF+DL+ EEF
Sbjct: 60  FARFAHRYGKSYETAEEMKRRFSIFVDSLKMIRSHNKKGLS---YTLGVNEFADLTWEEF 116

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L  +HK             +T G+     +P+KKDWRE GI+  V+
Sbjct: 117 RKHRLGAAQNCSATLKGNHK-------------LTNGL-----LPLKKDWREVGIVTPVK 158

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
           NQ  CG+CW FST    E+ +    G    LS Q+++DCA    N GC+GG      +++
Sbjct: 159 NQGHCGSCWTFSTTGALEAAYVQAFGKAIFLSEQQLVDCARAYNNFGCNGGLPSQAFEYI 218

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP    D  CK  + +  GV++   + +  + +E  +   +A   PV 
Sbjct: 219 KANG-GLDTEEAYPYTGVDGVCKFSSENI-GVQVLD-SVNITLGAEDELKDAVAFVRPVS 275

Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
            A   ++ ++ Y  GV   + C  +  ++NHAV  VGY
Sbjct: 276 VAFEVVSGFRLYKSGVYTSDTCGNTPMDVNHAVVAVGY 313


>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
          Length = 370

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 88/292 (30%), Positives = 142/292 (48%), Gaps = 38/292 (13%)

Query: 29  NLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           NL      F+SF+ ++ K+Y +K EHD RF  F+ +L        + +   SA +G+T+F
Sbjct: 48  NLLNAEHHFASFKAKFAKTYATKEEHDHRFGVFKSNL---RRARLHAKLDPSAVHGVTKF 104

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWRE 146
           SDL+  EF+ + L     +               H +K  I     +PT  +P   DWR+
Sbjct: 105 SDLTPAEFRRQFLGLKPLRFPA------------HAQKAPI-----LPTKDLPKDFDWRD 147

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGN 198
            G +  V++Q  CG+CW+FST    E  H L  G L  LS Q+++DC         G  +
Sbjct: 148 KGAVTNVKDQGACGSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACD 207

Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
            GC+GG      +++ +    ++ E +YP   +D  CK   T      + +Y+  +L   
Sbjct: 208 SGCNGGLMNNAFEYI-LQSGGVQKEKDYPYTGRDGTCKFDKTKV-AATVSNYSVVSL--D 263

Query: 259 ESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           E  I  ++  +GP+  A+NA+  Q Y+GGV   Y C     +++H V +VGY
Sbjct: 264 EEQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICG---KHLDHGVLLVGY 312


>gi|33333704|gb|AAQ11970.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSE 92
           E +  F+  + K+Y S  E   RF  F+K+L  I+E NK   +  ES    +T+F+D++ 
Sbjct: 21  EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTH 80

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF        V    L S+  H D+  +          I +     +  DWRE G +  
Sbjct: 81  EEFLDLLKLQGV--PALPSNAVHFDNFED----------IDMEEKDAI--DWREEGAVTP 126

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
           V++Q  CG+CWAFS V   E     KNGTL  LS QE++DCA    GN GC GG      
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAF 186

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           D+  V    ++ E  YP   + ++CK+       VK   +  D     E  +   +A  G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGEYVTKVKTYVFPLD-----EQEMARTVAAKG 239

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
           PV  A+ A    +Y  G++   C  S    ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGY 280


>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
 gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
 gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
          Length = 466

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 87/280 (31%), Positives = 143/280 (51%), Gaps = 28/280 (10%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           L+ S+   + KSY+   E D RF+ F+ +L  I+E  +N    +S + G+T+F+DL+ EE
Sbjct: 48  LYESWLIEHGKSYNALGEKDKRFQIFKDNLRYIDE--QNSVPNQSYKLGLTKFADLTNEE 105

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +++ +L          S         N   +     G ++P  I    DWRE G++  V+
Sbjct: 106 YRSIYLG-------TKSSGDRKKLSKNKSDRYLPKVGDSLPESI----DWREKGVLVGVK 154

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-- 212
           +Q +CG+CWAFS V   ES++A+  G L  LS QE++DC  + N GC GG    L+D+  
Sbjct: 155 DQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSYNEGCDGG----LMDYAF 210

Query: 213 -MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
              +    ++ E +YP   ++  C +   +   VKI SY  D  + +E ++   +A H P
Sbjct: 211 EFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYE-DVPVNNEKALQKAVA-HQP 268

Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           V  A+ A    +Q+Y  G+    C  +   ++H V I GY
Sbjct: 269 VSIALEAGGRDFQHYKSGIFTGKCGTA---VDHGVVIAGY 305


>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
          Length = 319

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 86/279 (30%), Positives = 134/279 (48%), Gaps = 29/279 (10%)

Query: 35  ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           ++F++F ++Y K+YS +E   RF  F+ S++ I     N  +  S   G+ EF+DLS EE
Sbjct: 40  DMFTAFMKQYSKAYSHAEFSSRFNQFKASVETIRL--HNTLANASYTMGLNEFADLSFEE 97

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           FK ++      KHV     + ++ H                   P   DWR +  +  ++
Sbjct: 98  FKGKYFG---CKHVEREFARSNNLHQE-------------VEAAPTSIDWRTSNAVTPIK 141

Query: 155 NQQTCGACWAFSTVETAESMHALKNG-TLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDW 212
           +Q  CG+CWAFS   + E    L+   TL+ LS Q+++DC+ + GN GC+GG      ++
Sbjct: 142 DQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEY 201

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           +  NK +   ES YP       C++  T    V I  +  D     E+S L  + T GPV
Sbjct: 202 IIANKGIC-AESAYPYKGVGGLCQKSCTKV--VTISGHK-DVASGDEASSLNAVGTVGPV 257

Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A+ A    +Q+Y  GV    C     N++H V  VGY
Sbjct: 258 SVAIEADQAGFQFYSSGVFSGTCG---HNLDHGVLAVGY 293


>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGI-SSESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
          Length = 345

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGI-SSESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|343474734|emb|CCD13687.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 524

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 93/316 (29%), Positives = 156/316 (49%), Gaps = 30/316 (9%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYS-KSEHDIRFKNFEK 62
           + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F++
Sbjct: 87  RTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRMFKQ 146

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           S+   E   +   +   A +G+T+FSD+S EEF+  +L  +          K++      
Sbjct: 147 SM---ERAKEEAAANPYATFGVTQFSDMSPEEFRATYLNGA----------KYYAAALKR 193

Query: 123 VKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
            +K      + + TG  P   DWR+ G +  V++Q +CG+CWAF+ +   E    +    
Sbjct: 194 PRKV-----VNVSTGKAPPAVDWRKKGAVTPVKDQGSCGSCWAFAAIGNIEGQWKIAGHE 248

Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK--RK 238
           L+ LS Q ++ C    +  C GG       W+   NK  +  E  YP    D       K
Sbjct: 249 LTSLSEQMLVSCDTTED-NCGGGFADRAFKWIVSSNKGNVFTERSYPYASIDGYVPPCNK 307

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
           +    G KI  +    L   E++I   +A +GPV  AV+A T+  Y GGV+  +C  S  
Sbjct: 308 SGKVVGAKISGHI--NLPKDENAIAEWLARNGPVAIAVDASTFLDYKGGVLT-SC--SSK 362

Query: 299 NINHAVQIVGYDNYSR 314
           ++NH V +VGY++ S+
Sbjct: 363 HVNHEVLLVGYNDTSK 378


>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
          Length = 349

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 91/332 (27%), Positives = 146/332 (43%), Gaps = 47/332 (14%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKS 63
           K  L  V L  +C  +  +   +      +E    +  ++ + Y   +E   RF+ F  +
Sbjct: 5   KVFLLAVVLGCICLCSTVLSARELGDAAMVERHEQWMAQHGRVYKDGAEKARRFEAFRNN 64

Query: 64  LDIIEELNK--NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +  IE  N   NR+       G+ +F+DL+ +EF+                      +  
Sbjct: 65  VVFIESFNAAGNRRK---FWLGVNQFTDLTNDEFRATKT------------------NKG 103

Query: 122 HVKKRSITTGITIPTG-----------IPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
            +K+ +       PTG           +P   DWR  G +  ++NQ  CG CWAFS V  
Sbjct: 104 FIKRNAAAVNKASPTGTFRYSNVSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAA 163

Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
            E +  L  G L  LS QE++DC  NG + GC GG+     +++ +    L  E+ YP  
Sbjct: 164 TEGIVQLSTGKLVPLSEQELVDCDANGADHGCEGGEMDDAFEFI-IKNGGLTSETNYPYT 222

Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
            +D  CK K T  +   IK Y  D     E+S++  +A   PV  AV+   + +Q+Y GG
Sbjct: 223 AQDGQCKAKNTINSVATIKGYE-DVPANDEASLMKAVAAQ-PVSVAVDGGDMVFQHYAGG 280

Query: 288 VIQYNCDGSLANINHAVQIVGY---DNYSRTW 316
           V+  +C  SL   +H +  VGY   D+ ++ W
Sbjct: 281 VLSGSCGTSL---DHGIVAVGYGAADDGTKFW 309


>gi|15485586|emb|CAC67416.1| cysteine protease [Trypanosoma brucei rhodesiense]
          Length = 450

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
           V+ V   V L+A+      V +   ++E+ LE+ F++F+++Y K Y  + E   RF+ FE
Sbjct: 7   VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E+      +   A +G+T FSD++ EEF+ R+              ++   +  
Sbjct: 67  ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +KR   T + + TG  P   DWRE G +  V++Q  CG+CWAFST+   E    +   
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q ++ C    + GC GG      +W+ + N   +  E+ YP +  +   ++  
Sbjct: 169 PLVSLSEQMLVSC-DTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225

Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
              NG +I +   D   L   E +I   +A +GP+  AV+A ++  Y GG++  +C  + 
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282

Query: 298 ANINHAVQIVGYDNYS 313
             ++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDSS 298


>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
 gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
          Length = 475

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 95/303 (31%), Positives = 143/303 (47%), Gaps = 39/303 (12%)

Query: 18  FLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSKSEH-DIRFKNFEKSLDIIEELNKN 73
           FL++         E  +EL   F++   RY ++YS  E  D R + F ++L   E+L   
Sbjct: 155 FLSLSTSKPVEETEDFVELLGQFKEFMVRYNRTYSSQEDTDRRLRIFHENLKTAEKL--- 211

Query: 74  RQSPE--SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
            QS +  +A YG+T+FSDL+EEEF+T +L   +++  L    K     H           
Sbjct: 212 -QSLDLGTAEYGVTKFSDLTEEEFRTLYLNPLLSQQKLQRSMKPAAMPHGPA-------- 262

Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
                  P   DWRE G +  V+NQ  CG+CWAFS     E    +K G L  LS QE++
Sbjct: 263 -------PPSWDWREHGAVSPVKNQGMCGSCWAFSVTGNIEGQWFVKTGKLVSLSEQELV 315

Query: 192 DCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
           DC    +  C GG      + ++    V E E++Y    K  +C          K+ +Y 
Sbjct: 316 DC-DTADQACGGGLPSNAYEAIEKLGGV-ETETDYSYTGKKQSCDFTTD-----KVTAYI 368

Query: 252 CDT--LIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQI 306
             +  L   E+ I   +A +GPV  A+NA   Q+Y  GV   ++  C+  +  I+HAV +
Sbjct: 369 NSSVELSKDENEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLL 426

Query: 307 VGY 309
           VGY
Sbjct: 427 VGY 429


>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
          Length = 341

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 98/322 (30%), Positives = 151/322 (46%), Gaps = 38/322 (11%)

Query: 1   MFDVKNVLFIVALIALCFLAIP------VKVSKPNL---EQKLELFSSFQQRYKKSYSKS 51
           +F +  ++F+V  ++L  L +       V  S+ +L   E  + LF S+  ++ K Y   
Sbjct: 4   IFSISKLIFVVTCLSL-HLGLSSADFSIVGYSQDDLTSIESSIRLFESWMLKHDKVYKTI 62

Query: 52  EHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
           +  I RF+ F+ +L  I+E NK   S      G+ EF+DL+ +EFK +++       +++
Sbjct: 63  DEKIYRFETFKDNLMYIDETNKKNNS---YWLGLNEFADLTHDEFKEKYVGSIPEDSMII 119

Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
                 +  + HV        +  P  I    DWR+ G +  V+NQ  CG+CWAFSTV T
Sbjct: 120 EQSDDVEFPNKHV--------VDYPESI----DWRQKGAVTPVKNQNPCGSCWAFSTVAT 167

Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLL 230
            E ++ +  G L  LS QE++DC    + GC GG     L ++  N V    E EYP   
Sbjct: 168 VEGINKIVTGNLISLSEQELLDCDRRSH-GCKGGYQTTSLKYVVDNGV--HTEKEYPYEK 224

Query: 231 KDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTDIATHG-PVIAAVNALTWQYYLGG 287
           K   C+ K      V I  Y     +PS  E S++  I+     V+       +Q+Y GG
Sbjct: 225 KQGNCRAKNKKGLKVYINGY---KRVPSNDEISLIKTISIQPVSVLVESKGRPFQFYKGG 281

Query: 288 VIQYNCDGSLANINHAVQIVGY 309
           V    C   L   +HAV  VGY
Sbjct: 282 VFGGPCGTKL---DHAVTAVGY 300


>gi|1353726|gb|AAB01769.1| cysteine proteinase homolog, partial [Naegleria fowleri]
          Length = 347

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 91/323 (28%), Positives = 150/323 (46%), Gaps = 45/323 (13%)

Query: 6   NVLFIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
           ++L    L+  C   +  +  KP  E +++ LF  F ++Y K Y   EH+ R++ F+ + 
Sbjct: 1   SLLIAAVLLIACVGVVLAQEYKPLAESEMKKLFIKFSRKYAKVYGTEEHNNRYQIFKAN- 59

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN----KHVLMSHHKHHDHHH 120
             +E+        +   +GIT+FSDL+ EEFK   L  +      K +L +        H
Sbjct: 60  --VEKSRYYNHVGKRENFGITKFSDLTPEEFKRMFLMKTYTPEEAKKILAAPQ------H 111

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             + ++ + T        P   DWR+ G + +V+NQ  CG+CW FST    E   A+K G
Sbjct: 112 AVLSEKEVQTA-------PTSFDWRQHGAVTRVKNQGACGSCWTFSTTGNVEGQWAIKKG 164

Query: 181 TLSLLSVQEVIDCAGN---------GNMGCSGGDFCALLDWMDVNKVV----LEPESEYP 227
            L  LS Q+++DC  N          + GC+GG     L W     V+    L+ E  YP
Sbjct: 165 KLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGG-----LMWSAFQYVIKNGGLDTEDSYP 219

Query: 228 LLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGG 287
               D  C+   ++   V     +  ++   E+ +   +A +GP+  A+NA   QYY  G
Sbjct: 220 YEGVDDTCRFNKSN---VAATISSWTSISSDENQMAAWLAANGPISIAINAEWLQYYTSG 276

Query: 288 VIQ-YNCDGSLANINHAVQIVGY 309
           +   + C+    +++H V IVGY
Sbjct: 277 ISDPWFCNPQ--DLDHGVLIVGY 297


>gi|44844206|emb|CAF32699.1| cathepsin L-like cysteine proteinase [Leishmania infantum]
          Length = 381

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 83/270 (30%), Positives = 131/270 (48%), Gaps = 26/270 (9%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F++ Y+++Y   +E   R  NFE++L+++ E     ++P  AR+GIT+F DLSE E
Sbjct: 37  LFEEFKRTYRRAYGTLAEEQQRLANFERNLELMRE--HQARNPH-ARFGITKFFDLSEAE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F  R+L    N     +  K H   H    +  ++        +P   DWRE G +  V+
Sbjct: 94  FAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
               CG+CWAFS V   ES  A     L  LS Q+++ C    N GC+GG      +W+ 
Sbjct: 143 XXGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN-GCNGGLMLQAFEWLL 201

Query: 215 VNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IATH 269
            +   ++  E  YP    +   A C   +    G +I  Y    +IPS  +++   +A +
Sbjct: 202 RHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VMIPSNETVMAAWLAEN 258

Query: 270 GPVIAAVNALTWQYYLGGV--IQYNCDGSL 297
           GP+  AV+A ++  Y  GV  + YN  G +
Sbjct: 259 GPIAIAVDASSFMSYQSGVLLVGYNKTGGV 288


>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVITMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
          Length = 467

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 79/285 (27%), Positives = 139/285 (48%), Gaps = 30/285 (10%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           ++ + ++  +  +  K Y+   E + RF+ F+ +L  I+E N   ++    + G+  F+D
Sbjct: 46  DEVMAIYEEWLVKQGKVYNALGEREKRFQVFKDNLRFIDEHNSENRT---YKLGLNGFAD 102

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ EE+++ +L                    N ++K S      +   +P   DWR+ G 
Sbjct: 103 LTNEEYRSTYL------------GARGGMKRNRLRKTSDRYAPRVGESLPDSVDWRKEGA 150

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           + +V++Q +CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG    L
Sbjct: 151 VAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGG----L 206

Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
           +D+     +N   ++ E +YP L +D  C     +   V I  Y  D  + SE+++   +
Sbjct: 207 MDYAFEFIINNGGIDTEEDYPYLARDGRCDTYRKNAKVVTIDDYE-DVPVNSETALQKAV 265

Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           A   PV  A+ A    +Q+Y  G+    C   L   +H V  VGY
Sbjct: 266 ANQ-PVSVAIEAGGRDFQFYASGIFSGRCGTQL---DHGVAAVGY 306


>gi|341888721|gb|EGT44656.1| hypothetical protein CAEBREN_22029 [Caenorhabditis brenneri]
          Length = 396

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 101/312 (32%), Positives = 159/312 (50%), Gaps = 37/312 (11%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           +  L+A  F     K+    L+Q+ + F++  QR  K+    E+ +RF+ F+K+L  IEE
Sbjct: 64  MTILMASIFRIRAEKLKFFGLQQQFKDFNAKFQREHKTLE--EYKMRFEIFQKNLRDIEE 121

Query: 70  LNKNRQSPESARYGITEFSDLSEEEFKT-----RHLRHSVNKHVLMSHHKHHDHHHNHVK 124
           LN   ++P S +YGI +FSD +E E K      + L  S++   L +   + +       
Sbjct: 122 LN--LKNP-SVQYGINKFSDKTESELKNLLMDKKFLDSSLSNSTLKTLSSYRN------- 171

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
            R+I   +  P  I    DWR  G +  V++Q  CG+CWAF+TV   ES +A++ GTL  
Sbjct: 172 PRNIIKNVQRPDYI----DWRNDGKVMSVKDQGQCGSCWAFATVAAVESQYAIRKGTLWS 227

Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           LS QE++DC G  + GC GG   + L ++  N   LE E +YP     +A +      NG
Sbjct: 228 LSEQELVDCDG-ASYGCGGGFLTSALGFILGNG--LETEDDYPY----SATRHDQCWING 280

Query: 245 VKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVI---QYNC-DGSL 297
            K + +  +   L  SE  +   +A  GPV  A++   ++ YY  G+    ++ C D SL
Sbjct: 281 DKTRVWIDEGYQLTMSEDDVAEWVANVGPVSFAMSVPKSFPYYHDGIYSPSEHECKDESL 340

Query: 298 ANINHAVQIVGY 309
               HA+ I+GY
Sbjct: 341 G--YHAMAIIGY 350


>gi|10391|emb|CAA38238.1| unnamed protein product [Trypanosoma brucei]
          Length = 450

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
           V+ V   V L+A+      V +   ++E+ LE+ F++F+++Y K Y  + E   RF+ FE
Sbjct: 7   VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E+      +   A +G+T FSD++ EEF+ R+              ++   +  
Sbjct: 67  ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +KR   T + + TG  P   DWRE G +  V++Q  CG+CWAFST+   E    +   
Sbjct: 110 AAQKRVRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q ++ C    + GC GG      +W+ + N   +  E+ YP +  +   ++  
Sbjct: 169 PLVSLSEQMLVSC-DTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225

Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
              NG +I +   D   L   E +I   +A +GP+  AV+A ++  Y GG++  +C  + 
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282

Query: 298 ANINHAVQIVGYDNYS 313
             ++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298


>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
          Length = 505

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 90/307 (29%), Positives = 146/307 (47%), Gaps = 30/307 (9%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
           +L I  LIA+    +  +      EQ    F ++  R++K Y  SE   RF  F+ ++D 
Sbjct: 157 LLLIFGLIAISNALLFSE------EQYKNEFENWIDRFEKKYDVSEFKKRFSIFKSNMDF 210

Query: 67  IEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
           +   N KN Q+      G+   +DL+  E++  +L  +  K VL +   H   +   V  
Sbjct: 211 VHSWNSKNSQTV----LGLNHLADLTNLEYRQFYL-GTHKKAVLGTPGNHEVSNLQSVFG 265

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
            S T             DWR+ G +  +++Q  CG+CW+FST  + E  H +K+G +  L
Sbjct: 266 DSATV------------DWRQKGAVSPIKDQGQCGSCWSFSTTGSVEGAHQIKSGNMVEL 313

Query: 186 SVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           S Q ++DC+   GNMGC+GG      +++  N  + + ES YP         +   + +G
Sbjct: 314 SEQNLVDCSTSEGNMGCNGGLMDYAFEYIITNNGI-DTESSYPYTASSGTTCKYNKANSG 372

Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINH 302
             I SY  +    SES +   +   GPV  A++A   ++Q Y  G I Y+   S  N++H
Sbjct: 373 ATISSYK-NITAGSESDLADAVKNAGPVSVAIDASHNSFQLYSHG-IYYDASCSSVNLDH 430

Query: 303 AVQIVGY 309
            V +VGY
Sbjct: 431 GVLVVGY 437


>gi|322801532|gb|EFZ22193.1| hypothetical protein SINV_14496 [Solenopsis invicta]
          Length = 781

 Score =  120 bits (300), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 89/283 (31%), Positives = 142/283 (50%), Gaps = 29/283 (10%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  F   Y ++YS   E ++R + F ++L IIE L K  Q+  + RYG+  F+D+S EE
Sbjct: 523 LFDDFVATYNRTYSSPDERNLRLQIFRENLGIIELLQKTEQA--TGRYGVNMFADMSREE 580

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F+TR+L       +       ++      K  +I         +P   DWR+ G++  V+
Sbjct: 581 FRTRYL------GLRPDLQSENEIPLQEAKFPNIE--------LPPTFDWRKKGVVTPVK 626

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           NQ  CG+CWAFS     E  +A+K+G L  LS QE++DC    +    G    A   +  
Sbjct: 627 NQGGCGSCWAFSVTGNVEGQYAIKHGQLLSLSEQELVDCDDLDDGCGGGLPDNA---YRA 683

Query: 215 VNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
           + K+  LE ES+YP   ++  C  K    N VK++  +   +   E+ +   +  +GP+ 
Sbjct: 684 IEKLGGLELESDYPYEAENEKCHFKK---NLVKVELTSAVNVTSDETQMAQWLVQNGPIS 740

Query: 274 AAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGYDNYS 313
             +NA   Q+Y+GGV    ++ C+    N++H V IVGY   S
Sbjct: 741 IGINANAMQFYMGGVSHPFKFLCNPK--NLDHGVLIVGYGTSS 781


>gi|118401108|ref|XP_001032875.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89287220|gb|EAR85212.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 360

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 93/318 (29%), Positives = 148/318 (46%), Gaps = 34/318 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVS-KPNLEQKLEL-------FSSFQQRYKKSY-SKSEHD 54
            + +L  VA++ +  L +   +  K + E K  L       F +F+ +Y K+Y   +E  
Sbjct: 4   TQKILVSVAVLGVFLLTLNYVIDHKTDDEIKFMLRKSIERAFKNFKVKYAKTYKDDTEEQ 63

Query: 55  IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
            RF  F  +     E+ ++ +    ++ G+ +F+DL+ EEFK  +  H   KH       
Sbjct: 64  YRFSVFTNNY---VEIYRHNKFLVFSKVGVNQFADLTHEEFKALYTGH---KHSKDDDDD 117

Query: 115 HHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
            + +   H           +PT  +P   DWR+ G I  V+ Q  CG CWAFSTV++ E 
Sbjct: 118 DNKNKQPH-----------LPTDNLPASFDWRDKGAITPVKVQNGCGGCWAFSTVQSIEG 166

Query: 174 MHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA 233
           ++ LK G L  LS Q+VIDC      GC GGD       +  N  ++  E+EYP + K  
Sbjct: 167 LYFLKTGKLESLSTQQVIDCCRIDESGCLGGDPEPAFRCIQNNGGIMT-ETEYPYIAKQQ 225

Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQY 291
           +CK     P   +I  Y     +PS+ S +       P+   +N+   +++YY  GVI  
Sbjct: 226 SCKFDEDKPT-FQIGGY---IDVPSDQSQVKAALLIQPLSICLNSSDTSFKYYKSGVITE 281

Query: 292 NCDGSLANINHAVQIVGY 309
             DG     +H + +VGY
Sbjct: 282 CEDGPYDGPDHCLLLVGY 299


>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|261328617|emb|CBH11595.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
 gi|261328620|emb|CBH11598.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
          Length = 450

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
           V+ V   V L+A+      V +   ++E+ LE+ F++F+++Y K Y  + E   RF+ FE
Sbjct: 7   VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E+      +   A +G+T FSD++ EEF+ R+              ++   +  
Sbjct: 67  ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +KR   T + + TG  P   DWRE G +  V++Q  CG+CWAFST+   E    +   
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q ++ C    + GC GG      +W+ + N   +  E+ YP +  +   ++  
Sbjct: 169 PLVSLSEQMLVSC-DTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225

Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
              NG +I +   D   L   E +I   +A +GP+  AV+A ++  Y GG++  +C  + 
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282

Query: 298 ANINHAVQIVGYDNYS 313
             ++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298


>gi|261328615|emb|CBH11593.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
          Length = 451

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
           V+ V   V L+A+      V +   ++E+ LE+ F++F+++Y K Y  + E   RF+ FE
Sbjct: 7   VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E+      +   A +G+T FSD++ EEF+ R+              ++   +  
Sbjct: 67  ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +KR   T + + TG  P   DWRE G +  V++Q  CG+CWAFST+   E    +   
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q ++ C    + GC GG      +W+ + N   +  E+ YP +  +   ++  
Sbjct: 169 PLVSLSEQMLVSC-DTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225

Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
              NG +I +   D   L   E +I   +A +GP+  AV+A ++  Y GG++  +C  + 
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282

Query: 298 ANINHAVQIVGYDNYS 313
             ++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298


>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 288

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 82/264 (31%), Positives = 128/264 (48%), Gaps = 28/264 (10%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           N ++ LELF S+   + K+Y   E  + RF+ F ++L  I++ N    S      G+ EF
Sbjct: 43  NTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS---YWLGLNEF 99

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DL+ EEFK R+L       +             + + R IT        +P   DWR+ 
Sbjct: 100 ADLTHEEFKGRYL------GLAKPQFSRKRQPSANFRYRDITD-------LPKSVDWRKK 146

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V++Q  CG+CWAFSTV   E ++ +  G LS LS QE+IDC    N GC+GG   
Sbjct: 147 GAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGG--- 203

Query: 208 ALLDWMD---VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
            L+D+     ++   L  E +YP L+++  C+ +      V I  Y  + +  ++   L 
Sbjct: 204 -LMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGY--EDVPENDDESLV 260

Query: 265 DIATHGPVIAAVNA--LTWQYYLG 286
               H PV  A+ A    +Q+Y G
Sbjct: 261 KALAHQPVSVAIEASGRDFQFYKG 284


>gi|72389859|ref|XP_845224.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359932|gb|AAX80357.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801759|gb|AAZ11665.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 450

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
           V+ V   V L+A+      V +   ++E+ LE+ F++F+++Y K Y  + E   RF+ FE
Sbjct: 7   VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E+      +   A +G+T FSD++ EEF+ R+              ++   +  
Sbjct: 67  ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +KR   T + + TG  P   DWRE G +  V++Q  CG+CWAFST+   E    +   
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q ++ C    + GC GG      +W+ + N   +  E+ YP +  +   ++  
Sbjct: 169 PLVSLSEQMLVSC-DTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225

Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
              NG +I +   D   L   E +I   +A +GP+  AV+A ++  Y GG++  +C  + 
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282

Query: 298 ANINHAVQIVGYDNYS 313
             ++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298


>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
          Length = 359

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 92/307 (29%), Positives = 145/307 (47%), Gaps = 20/307 (6%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLE-QKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLD 65
           L +V L A   L      S   +    LE F ++Q  Y ++Y+  E    RF  + ++L 
Sbjct: 10  LALVMLFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFMVYSENLR 69

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
            I+ +N+   +  S   G  +F+DL+EEEFK  +L       + +            +  
Sbjct: 70  FIKTMNQ-LSTGSSYELGENQFTDLTEEEFKDTYL-------MKLDEQPPAAEAMPPIVG 121

Query: 126 RSITTGITIP--TG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
              T G++    TG  P   DWR  G +  V+NQQ CG+CWAF+TV + E +H +K G L
Sbjct: 122 TMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVASIEGVHQIKTGRL 181

Query: 183 SLLSVQEVIDCAGNGN-MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
             LS QE++DC   GN  GC GG   + ++W+  N   L  ES+YP +     C      
Sbjct: 182 VSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNG-GLTTESDYPYVGSQRQCMSGKLG 240

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANI 300
            +  +I+ Y       +E+ +   +A   PV   ++A   +Q+Y  GV    C+ +   +
Sbjct: 241 HHAARIRGYQA-VQRKNEAELERAVAGR-PVAVVIDASRAFQFYKRGVFSGPCNTT--TV 296

Query: 301 NHAVQIV 307
           NHAV +V
Sbjct: 297 NHAVTVV 303


>gi|33333702|gb|AAQ11969.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSE 92
           E +  F+  + K+Y S  E   RF  F+K+L  I+E NK  +  E S    +T+F+D++ 
Sbjct: 21  EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTH 80

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF        V    L S+  H D+  +        T +     +    DWRE G +  
Sbjct: 81  EEFLDLLKLQGV--PALPSNAVHFDNFED--------TDMEEKDAV----DWREEGAVTP 126

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
           V++Q  CG+CWAFS V   E     KNGTL  LS QE++DCA    GN GC GG      
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAF 186

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           D+  V    ++ E  YP   + ++CK+       VK   +  D     E  +   +A  G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGEYVTKVKTYVFPLD-----EQEMARTVAAKG 239

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
           PV  A+ A    +Y  G++   C  S    ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGY 280


>gi|341888719|gb|EGT44654.1| hypothetical protein CAEBREN_19265 [Caenorhabditis brenneri]
          Length = 396

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 92/276 (33%), Positives = 142/276 (51%), Gaps = 24/276 (8%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
           + F+  L+A  F     K+    L+Q+   F  F +++ + + S  E+ +RF+ F+K+L 
Sbjct: 61  LFFMTILMASTFKIRAEKLKFFGLQQQ---FKDFNKKFGREHKSLEEYKMRFEVFQKNLR 117

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVK 124
            IEELN   ++P S +YGI  FSD +E E K   + +  ++  +  S  K    + N   
Sbjct: 118 DIEELN--LKNP-SVQYGINRFSDKTESELKNLLMDKKFMDSSLSNSSLKTLSSYRN--- 171

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
            R+I   +  P  I    DWR  G +  V++Q  CG+CWAF+TV   ES +A++ GTL  
Sbjct: 172 PRNIIKNVQRPDYI----DWRNVGKVMSVKDQGQCGSCWAFATVAAVESQYAIRKGTLWS 227

Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           LS QE++DC G  + GCSGG   + L+++  N   LE E +YP      A K      NG
Sbjct: 228 LSEQELVDCDG-ASYGCSGGFLTSALEFILGNG--LETEDDYPY----TATKHDQCWING 280

Query: 245 VKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNA 278
            K + +  +   L  +E  I   +A  GPV  A+ A
Sbjct: 281 DKTRVWIDEGYQLTMNEDDIAEWVANVGPVSFAMRA 316


>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
 gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 596

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 140/278 (50%), Gaps = 27/278 (9%)

Query: 36  LFSSFQQRYKKSYSKS--EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           LF  F ++Y ++YS S  E++ RF+ F+ +  +++ LN+  +   +A YGIT+F D+SEE
Sbjct: 168 LFDMFLEKYPRTYSSSSDEYNERFEIFKTNYQVVQHLNEIERG--TAVYGITKFMDMSEE 225

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           E+  R L     + +              V  +++ +     T IP   DWR+ G + +V
Sbjct: 226 EYH-RTLAPGFTRPL--------------VPIQTLNSAELDTTNIPDSMDWRKHGAVTEV 270

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           +NQ +CG+CWAFST    E    LK+  L  LS QE++DC    + GC GG       + 
Sbjct: 271 KNQGSCGSCWAFSTTGNVEGQWFLKHKKLISLSEQELVDCD-TLDSGCGGG--LPSNAYK 327

Query: 214 DVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
            + K+  LEPE +YP + +   C  K +     K+       L   E  +   +A +GP+
Sbjct: 328 SIEKLGGLEPEKDYPYVGEGEKCAIKQSD---FKVFVNNSVALPKDEVKLAAWLAQNGPI 384

Query: 273 IAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
              +NA   Q+Y GG+   +    +  +++H V IVGY
Sbjct: 385 SIGINANLMQFYWGGISHPWKIFCNPKSLDHGVLIVGY 422



 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 32/69 (46%), Positives = 42/69 (60%), Gaps = 1/69 (1%)

Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
           T IP   DWR+ G + +V+NQ +CG+CWAFST    E    LK+  L  LS QE++DC  
Sbjct: 473 TNIPDSMDWRKHGAVTEVKNQGSCGSCWAFSTTGNVEGQWFLKHKKLISLSEQELVDCD- 531

Query: 196 NGNMGCSGG 204
             + GC GG
Sbjct: 532 TLDSGCGGG 540


>gi|33333694|gb|AAQ11965.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSE 92
           E +  F+  + K+Y S  E   RF  F+K+L  I+E NK  +  E S    +T+F+D++ 
Sbjct: 21  EEWQQFKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTH 80

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF        V    L S+  H D+  +        T +     +    DWRE G +  
Sbjct: 81  EEFLDLLKLQGV--PALPSNAVHFDNFED--------TDMEEKDAV----DWREEGAVTP 126

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
           V++Q  CG+CWAFS V   E     KNGTL  LS QE++DCA    GN GC GG      
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAF 186

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           D+  V    ++ E  YP   + ++CK+       VK   +  D     E  +   +A  G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGDYVTKVKTYVFPLD-----EQEMARTVAAKG 239

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
           PV  A+ A    +Y  G++   C  S    ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDEKCRCSNKREDLNHGVLVVGY 280


>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
          Length = 361

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 92/307 (29%), Positives = 145/307 (47%), Gaps = 20/307 (6%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLE-QKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLD 65
           L +V L A   L      S   +    LE F ++Q  Y ++Y+  E    RF  + ++L 
Sbjct: 10  LALVMLFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFMVYSENLR 69

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
            I+ +N+   +  S   G  +F+DL+EEEFK  +L       + +            +  
Sbjct: 70  FIKTMNQ-LSTGSSYELGENQFTDLTEEEFKDTYL-------MKLDEQPPAAEAMPPIVG 121

Query: 126 RSITTGITIP--TG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
              T G++    TG  P   DWR  G +  V+NQQ CG+CWAF+TV + E +H +K G L
Sbjct: 122 TMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVASIEGVHQIKTGRL 181

Query: 183 SLLSVQEVIDCAGNGN-MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
             LS QE++DC   GN  GC GG   + ++W+  N   L  ES+YP +     C      
Sbjct: 182 VSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNG-GLTTESDYPYVGSQRQCMSGKLG 240

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANI 300
            +  +I+ Y       +E+ +   +A   PV   ++A   +Q+Y  GV    C+ +   +
Sbjct: 241 HHAARIRGYQA-VQRKNEAELERAVAGR-PVAVVIDASRAFQFYKRGVFSGPCNTT--TV 296

Query: 301 NHAVQIV 307
           NHAV +V
Sbjct: 297 NHAVTVV 303


>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
          Length = 346

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 88/311 (28%), Positives = 149/311 (47%), Gaps = 22/311 (7%)

Query: 7   VLFIVALIA-LCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSL 64
           +  IV+L++  CF     ++    L  + +    +   + ++Y+  +E + R+  F++++
Sbjct: 8   IFLIVSLVSSFCFSTTLSRLLDDELIMQKK-HDEWMAEHGRTYADMNEKNNRYVVFKRNV 66

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
           + IE LN N  +  + +  + +F+DL+ +EF+  +  +     VL S         +  K
Sbjct: 67  ERIERLN-NVPAGRTFKLAVNQFADLTNDEFRFMYTGYK-GDFVLFSQ--------SQTK 116

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
             S          +P+  DWR+ G +  ++NQ +CG CWAFS V   E    +K G L  
Sbjct: 117 STSFRYQNVFFGALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLIS 176

Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           LS Q+++DC  N + GCSGG      + + +    L  ES YP   +DA CK K+T P+ 
Sbjct: 177 LSEQQLVDCDTN-DFGCSGGLMDTAFEHI-MATGGLTTESNYPYKGEDANCKIKSTKPSA 234

Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLANINH 302
             I  Y  D  +  E++++  +A H PV   +      +Q+Y  GV    C   L   +H
Sbjct: 235 ASITGYE-DVPVNDENALMKAVA-HQPVSVGIEGGGFDFQFYSSGVFTGECTTYL---DH 289

Query: 303 AVQIVGYDNYS 313
           AV  VGY   S
Sbjct: 290 AVTAVGYSQSS 300


>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
          Length = 358

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 88/276 (31%), Positives = 132/276 (47%), Gaps = 25/276 (9%)

Query: 37  FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFK 96
           F+ F +RY K Y   E +I+ + F+  LD +E +N +     S + G+ EFSDL+ +EF+
Sbjct: 59  FARFARRYGKRYDSVE-EIK-QRFDIFLDNLEMINSHNDKGLSYKLGVNEFSDLTWDEFR 116

Query: 97  TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQ 156
              L  + N                ++K R           +P  KDWREAGI+  V+NQ
Sbjct: 117 RDRLGAAQNCSATT---------KGNLKLRDAV--------LPETKDWREAGIVSPVKNQ 159

Query: 157 QTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDV 215
             CG+CW FST    E+ +  K G    LS Q+++DCAG   N GC+GG      +++  
Sbjct: 160 GKCGSCWTFSTTGALEAAYTQKFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKS 219

Query: 216 NKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAA 275
           N   LE E  YP   K+  CK  + +  GVK+   + +  + +E  +   +A   PV  A
Sbjct: 220 NG-GLETEEAYPYTGKNGLCKFSSQNV-GVKVTD-SVNITLGAEDELKYAVALVRPVSVA 276

Query: 276 VNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
              +    QY  G      C  +  ++NHAV  VGY
Sbjct: 277 FEVVKGFKQYKSGVYTSTECGTTPMDVNHAVLAVGY 312


>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
 gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
 gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
          Length = 339

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/317 (28%), Positives = 150/317 (47%), Gaps = 31/317 (9%)

Query: 14  IALCFLAIPVKV--SKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
           + LC LA+ ++   + P+L+  L+  + +++  + K Y + E   R   +EK+L +I+  
Sbjct: 3   VYLCALALFLEACFAAPSLDSALDDHWQAWKTWHSKKYHQQEEGWRRMIWEKNLKMIQLH 62

Query: 71  NKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
           N +      S R G+  F D++ EEF+            +M+ +KH      +     + 
Sbjct: 63  NLDHSLGKHSYRLGMNHFGDMTNEEFRQ-----------VMNGYKHSKTEKKYRGSEFLE 111

Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
               +   +P   DWRE G +  V++Q  CG+CWAFST  + E  H  K G L  LS Q 
Sbjct: 112 PNFLV---VPKSVDWREKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQN 168

Query: 190 VIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
           ++DC+   GN GC+GG      +++  N  + + E  YP + KD       +  N     
Sbjct: 169 LVDCSRPEGNQGCNGGLMDQAFEYIADNGGI-DSEESYPYIAKDDEDCLYKSEFNAANDT 227

Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQI 306
            +  D     E +++  +A  GPV  A++A   T+Q+Y  G I Y+ D S   ++H V +
Sbjct: 228 GFV-DVPEGHERALMKAVAAVGPVSVAIDASHSTFQFYESG-IYYDPDCSSEELDHGVLV 285

Query: 307 VGY-------DNYSRTW 316
           VGY       DN  + W
Sbjct: 286 VGYGFEGTDDDNKKKYW 302


>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
          Length = 367

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 89/311 (28%), Positives = 156/311 (50%), Gaps = 34/311 (10%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQK-LELFSSFQQRYKK-SYSKSEHDIRFKNFEKSL 64
           +L+ + L  L  L++ + +S     ++ + ++  +  +++K  Y   E + RF+ F+ +L
Sbjct: 4   ILYSLILFGLITLSLSLDMSSGRSNKEVMTMYEKWLVKHQKVYYGLGEKNQRFQIFKDNL 63

Query: 65  DIIEELNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHV---LMSHHKHHDHHH 120
             I+E N    +P  S R G+ EFSD++ +E++  +L    N ++   + S    +   H
Sbjct: 64  IFIDEHN----APNHSYRVGLNEFSDITNKEYRDTYLSRWSNNNIKNKITSVRYAYKAGH 119

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
           N+               +PV  DWR  G +  ++NQ +CGACWAFS V   E+++ +  G
Sbjct: 120 NN--------------KLPVSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTG 163

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
           +L  LS QE++DC    N GC+GG+      ++ V    L+ + +YP L + + C +   
Sbjct: 164 SLVSLSEQELVDCDRTKNKGCNGGNQVNAYRFI-VENGGLDSQIDYPYLGRQSTCNQAKK 222

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLA 298
           +   V I  Y  +    SES+++  +A   PV   + A    +Q Y  GV   +C  SL 
Sbjct: 223 NTKVVSINGYK-NVQRNSESALMEAVANQ-PVSVGIEAYGKDFQLYQSGVFTGSCGTSL- 279

Query: 299 NINHAVQIVGY 309
             +HAV +VGY
Sbjct: 280 --DHAVVVVGY 288


>gi|357619726|gb|EHJ72185.1| cathepsin [Danaus plexippus]
          Length = 1118

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 138/281 (49%), Gaps = 19/281 (6%)

Query: 29   NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
            +LE+   LF  F + Y K Y +SE + RFK F  +L  I  +N   +   +A YGI +FS
Sbjct: 811  SLEEAPTLFEQFIKDYNKEYDESEKEERFKIFVNNLKDINAMN---ERSSNAVYGINKFS 867

Query: 89   DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
            DLS++EF   +    + +    S+  H        KK  +     +    P + DWR+ G
Sbjct: 868  DLSKDEFVKFYT--GLKREESPSNEDH--------KKTDLPKSFNVTA--PDQFDWRKKG 915

Query: 149  IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
            ++  V+ Q  C +CWAFS     ES++A+K G L  +S Q+++DC    N GCSGG  C+
Sbjct: 916  VVSSVKFQGHCVSCWAFSVAGNVESINAIKTGKLIDVSEQQLVDC-DEWNFGCSGGIACS 974

Query: 209  LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
               +   +K        YP + K+  C R  +S   +++K Y    +  SE  I   +  
Sbjct: 975  KSHFSYFHKKGAMSLESYPYVGKEGQC-RYNSSKVVIRLKDYQY-FIALSEDEIKEYLYN 1032

Query: 269  HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             GP+   +++    +Y GG++   C   +   NHAV +VGY
Sbjct: 1033 IGPLSIDIDSSQIHHYKGGIVIKECQ-EVKKTNHAVLLVGY 1072



 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 86/281 (30%), Positives = 139/281 (49%), Gaps = 21/281 (7%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
           +LE+   LF  F + Y K Y +SE + RFK F  +L  I  +N   +   +A YGI +FS
Sbjct: 511 SLEEAPTLFEQFIKDYNKEYDESEKEERFKIFVNNLKDINAMN---ERSSNAVYGINKFS 567

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
           DLS+EEF   +    + +    S+  H        KK  +     +    P + DWR+ G
Sbjct: 568 DLSKEEFIKYYT--GLKREESPSNEDH--------KKTDLPESFNVTA--PDQFDWRKKG 615

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
           ++  ++NQ+ CG+CWAFS     ES+HA+K G L  +S Q+++DC    + GCSGG    
Sbjct: 616 VVSSIKNQKHCGSCWAFSAAGNVESIHAIKTGKLVHVSEQQLVDCDSQ-DSGCSGGLTWN 674

Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
            + +   N  V      YP + ++  C R  ++   +++K Y   T + SE  I   +  
Sbjct: 675 AMRYFRTNGAV--SLKSYPYVAQNENC-RYDSNKVVIRLKDYKHITQL-SEDQIKEHLYN 730

Query: 269 HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            G +   + +    +Y GG++   C  S   ++HAV +V Y
Sbjct: 731 IGLLSIDITSTQLTWYEGGILIEECRRSDL-VDHAVLLVEY 770



 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 69/190 (36%), Positives = 103/190 (54%), Gaps = 21/190 (11%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
           +LE+   LF  F + Y K Y +SE + RFK F  +L  I  +N   +   +A YGI +FS
Sbjct: 294 SLEEAPTLFEQFIKDYNKEYDESEKEERFKIFVNNLKDINAMN---ERSSNAVYGINKFS 350

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
           DLS+EEF   +     ++     HHK  D      K  +IT         P + DWR+ G
Sbjct: 351 DLSKEEFIKYYTGLKRDRCTTTEHHKSTDL----PKSFNITA--------PDQFDWRKKG 398

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
           ++  V+NQ+ CG+CWAFS     ES+HA+K G L  +S Q+++DC    + GCSGG    
Sbjct: 399 VVSSVKNQRHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDC-DKYDSGCSGG---- 453

Query: 209 LLDWMDVNKV 218
            L+W+ + ++
Sbjct: 454 -LEWIAMREL 462



 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 75/231 (32%), Positives = 115/231 (49%), Gaps = 18/231 (7%)

Query: 79  SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
           +A YGI +FSDLS+EEF   +    + +    S+  H        KK  +     +    
Sbjct: 7   NAVYGINKFSDLSKEEFVKYYT--GLKREESPSNEDH--------KKTDLPESFNVTA-- 54

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
           P + DWR+ G++  ++NQ+ CG+CWAFS     ES+HA+K G L  +S Q+++DC    +
Sbjct: 55  PDQFDWRKKGVVSSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDC-DKYD 113

Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
            GCSGG     L +   N  +      YP + K+  C R  +S   +++K Y     + S
Sbjct: 114 SGCSGGLPWDALRYFVANGAM--SLKSYPYVAKEGKC-RYDSSKVEIRLKEYKHKEKL-S 169

Query: 259 ESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           E  I   +   GP+  A+ +     Y GG++   C  S   INHAV +VGY
Sbjct: 170 EDQIKEHLYNIGPLSIAITSSPLASYNGGILIEECHRSYL-INHAVLLVGY 219


>gi|33333696|gb|AAQ11966.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSE 92
           E +  F+  + K+Y S  E   RF  F+K+L  I+E NK  +  E S    +T+F+D++ 
Sbjct: 21  EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTH 80

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF        V    L S+  H D+  +        T +     +    DWRE G +  
Sbjct: 81  EEFLDLLKLQGV--PALPSNAVHFDNFED--------TDMEEKDAV----DWREEGAVTP 126

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
           V++Q  CG+CWAFS V   E     KNGTL  LS QE++DCA    GN GC GG      
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAF 186

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           D+  V    ++ E  YP   + ++CK+       VK   +  D     E  +   +A  G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGDYVTKVKTYVFPLD-----EQEMARTVAAKG 239

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
           PV  A+ A    +Y  G++   C  S    ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDETCRCSNKREDLNHGVLVVGY 280


>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
          Length = 337

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 98/314 (31%), Positives = 148/314 (47%), Gaps = 40/314 (12%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ ++L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++    L S    +D  
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYL-SPSPINDLS 119

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +                +P   DWRE+G + +V+NQ  CG CWAFS V + E  + +  
Sbjct: 120 DDD---------------MPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIAT 164

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
           G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ + 
Sbjct: 165 GNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGQQYTCRSQE 222

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDGS 296
            +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DGS
Sbjct: 223 KTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DGS 272

Query: 297 LAN-INHAVQIVGY 309
            AN INHAV  +GY
Sbjct: 273 CANRINHAVTAIGY 286


>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 99/325 (30%), Positives = 154/325 (47%), Gaps = 38/325 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY---DNYSRTW 316
           S A+ INHAV  +GY   +N  + W
Sbjct: 279 SCADRINHAVTAIGYGTDENGQKYW 303


>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
          Length = 337

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 98/314 (31%), Positives = 148/314 (47%), Gaps = 40/314 (12%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ ++L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++    L S    +D  
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYL-SPSPINDLS 119

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +                +P   DWRE+G + +V+NQ  CG CWAFS V + E  + +  
Sbjct: 120 DDD---------------MPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIAT 164

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
           G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ + 
Sbjct: 165 GNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGQQYTCRSQE 222

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDGS 296
            +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DGS
Sbjct: 223 KTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DGS 272

Query: 297 LAN-INHAVQIVGY 309
            AN INHAV  +GY
Sbjct: 273 CANRINHAVTAIGY 286


>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
          Length = 344

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISIFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KTNDLSDDDM----------PSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYSGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|33333698|gb|AAQ11967.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSE 92
           E +  F+  + K+Y S  E   RF  F+K+L  I+E NK  +  E S    +T+F+D++ 
Sbjct: 21  EEWQQFKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTH 80

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF        V    L S+  H D+  +        T +     +    DWRE G +  
Sbjct: 81  EEFLDLLKLQGV--PALPSNAVHFDNFED--------TDMEEKDAV----DWREEGAVTP 126

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
           V++Q  CG+CWAFS V   E     KNGTL  LS QE++DCA    GN GC GG      
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAF 186

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           D+  V    ++ E  YP   + ++CK+       VK   +  D     E  +   +A  G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGDYVTKVKTYVFPLD-----EQEMARTVAAKG 239

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
           PV  A+ A    +Y  G++   C  S    ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDEKCRCSNKREDLNHGVLVVGY 280


>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 348

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/287 (31%), Positives = 138/287 (48%), Gaps = 32/287 (11%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++ +ELF  +   + K Y   E    RF+ F+ +L  I+E NK   S      G+ EF
Sbjct: 37  SMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVTS---YWLGVNEF 93

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DL+ +EFK  +L   V      +     +  +  V              +P   DWR+ 
Sbjct: 94  ADLTHQEFKNMYLGLKVESS--RTRQSPEEFTYKDV------------VDLPKSVDWRKK 139

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G + +V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC GG   
Sbjct: 140 GAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGG--- 196

Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
            L+D+     V+   L  E +YP L  ++ C  K      V I  Y  D    +E+S++ 
Sbjct: 197 -LMDYAFSFIVSSGGLHKEEDYPYLEVESTCDNKKGELEVVTISGYK-DVPENNEASLIK 254

Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +A H P+  A+ A    +Q+Y GGV    C      ++H V  VGY
Sbjct: 255 ALA-HQPLSVAIEASGRDFQFYSGGVFDGPCG---TQLDHGVTAVGY 297


>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
          Length = 360

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 77/281 (27%), Positives = 133/281 (47%), Gaps = 21/281 (7%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
           E    L+  ++  +  S S  E   RF  F+++++ + E NK     E  +  + +F+D+
Sbjct: 32  ESLWNLYERWRSHHTVSRSLDEKHKRFNVFKENVNFVHEFNKK---DEPYKLKLNKFADM 88

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           +  EF++ +    VN H +    +H      + K +S+          P   DWR+ G +
Sbjct: 89  TNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSV----------PPSVDWRKKGAV 138

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
             +++Q  CG+CWAFSTV   E ++ +K   L  LS QE++DC  + N GC+GG      
Sbjct: 139 TPIKDQGQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAF 198

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           +++   K  +  E  YP   +D  C     +   V I  +  +T+ P+    L   A + 
Sbjct: 199 EFIK-EKGGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGH--ETVPPNNEDALLKAAANQ 255

Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           P+  A++A    +Q+Y  GV    C     +++H V IVGY
Sbjct: 256 PISVAIDAGGSAFQFYSEGVFAGRCG---TDLDHGVAIVGY 293


>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
          Length = 509

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 83/286 (29%), Positives = 137/286 (47%), Gaps = 22/286 (7%)

Query: 31  EQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           E+ +ELF  + +++ K Y    E + +F+NF  +L  + E N  R +      G+ +F+D
Sbjct: 45  ERVVELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFAD 104

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR----SITTGITIPTGIPVKKDWR 145
           +S EEF+           V +S  K        +++R    +           P   DWR
Sbjct: 105 MSNEEFR----------EVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWR 154

Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
           + GI+  V++Q  CG+CWAFS+    E ++AL NG L  LS QE++DC    N GC GG 
Sbjct: 155 KYGIVTGVKDQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCDST-NDGCEGGY 213

Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
                +W+  N  + + E++YP   +D  C         V I  Y  + +   ES++   
Sbjct: 214 MDYAFEWVMSNGGI-DTETDYPYTGEDGTCNTTKEETKAVSIDGY--EDVAEEESALFCA 270

Query: 266 IATHGPVIAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +    P+   ++  A+ +Q Y GG+   +C     +I+HAV +VGY
Sbjct: 271 VLKQ-PISVGIDGGAIDFQLYTGGIYDGDCSDDPDDIDHAVLVVGY 315


>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
 gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 84/290 (28%), Positives = 139/290 (47%), Gaps = 32/290 (11%)

Query: 34  LELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           +E   ++  +Y ++Y    E + R   F+ +++ IE  NK  + P   +  + EF+DL+ 
Sbjct: 1   MERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKP--YKLSVNEFADLTN 58

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF+     + ++ H+  S  K   + +               + +P   DWR+ G +  
Sbjct: 59  EEFQASRNGYKMSAHLSSSSTKPFRYEN--------------VSAVPSTMDWRKKGAVTP 104

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLD 211
           +++Q  CG CWAFS V   E +  L  G L  LS QE++DC  +G + GC+GG      D
Sbjct: 105 IKDQGQCGCCWAFSAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFD 164

Query: 212 WMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
           ++  NK  L  E+ YP    D AC     +    KI  Y  D    SE+++L  +A   P
Sbjct: 165 FIIQNK-GLTTEANYPYQGADGACNSGKAA---AKITGYE-DVPANSEAALLKAVANQ-P 218

Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY---DNYSRTW 316
           V  A++A    +Q+Y  GV   +C     +++H V  VGY   D+ ++ W
Sbjct: 219 VSVAIDAGGSAFQFYSSGVFTGDCG---TDLDHGVTAVGYGMSDDGTKYW 265


>gi|33333708|gb|AAQ11972.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 131/281 (46%), Gaps = 27/281 (9%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSE 92
           E +  F+  + K+Y S  E   RF  F+K+L  I+E NK  +  E S    +T+F+D++ 
Sbjct: 21  EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTH 80

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF        V    L S+  H D+  +        T +     +    DWRE G +  
Sbjct: 81  EEFLDLLKLQGV--PALPSNAVHFDNFED--------TDMEEKDAV----DWREEGAVTP 126

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
           V++Q  CG+CWAFS V   E     KNGTL  LS QE++DCA    GN GC GG      
Sbjct: 127 VKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAF 186

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           D+  V    ++ E  YP   + ++CK+       VK   +  D     E  +   +A  G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGDYVTKVKTYVFPLD-----EQEMARTVAAKG 239

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
           PV  A+ A    +Y  G++   C  S    ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDETCRCSNKREDLNHGVLVVGY 280


>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 351

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 90/287 (31%), Positives = 138/287 (48%), Gaps = 32/287 (11%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++++ +ELF  +   + K Y   E    RF+ F+ +L  I+E NK   S      G+ EF
Sbjct: 40  SMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVTS---YWLGVNEF 96

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DL+ +EFK  +L   V      +     +  +  V              +P   DWR+ 
Sbjct: 97  ADLTHQEFKNMYLGLKVESS--RTRQSPEEFTYKDV------------VDLPKSVDWRKK 142

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G + +V+NQ +CG+CWAFSTV   E ++ +  G L+ LS QE+IDC    N GC GG   
Sbjct: 143 GAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGG--- 199

Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
            L+D+     V+   L  E +YP L  ++ C  K      V I  Y  D    +E+S++ 
Sbjct: 200 -LMDYAFSFIVSSGGLHKEEDYPYLEVESTCDNKKGELEVVTISGYK-DVPENNEASLIK 257

Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +A H P+  A+ A    +Q+Y GGV    C      ++H V  VGY
Sbjct: 258 ALA-HQPLSVAIEASGRDFQFYSGGVFDGPCG---TQLDHGVTAVGY 300


>gi|51969854|dbj|BAD43619.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 91/325 (28%), Positives = 157/325 (48%), Gaps = 42/325 (12%)

Query: 1   MFDVKNVLFIVALIALC-----FLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHD 54
           +F V +++F+   +++C      +   V  ++P +    + F+ F++++ K Y S  EH 
Sbjct: 8   LFSV-SLIFVFVSVSVCGDEDVLIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHY 66

Query: 55  IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
            RF  F+ +L  +  +   +  P SAR+G+T+FSDL+  EF+ +HL       V      
Sbjct: 67  YRFSVFKANL--LRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHL------GVKGGFKL 117

Query: 115 HHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
             D +   +          +PT  +P + DWR+ G +  V+NQ +CG+CW+FST    E 
Sbjct: 118 PKDANQAPI----------LPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEG 167

Query: 174 MHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESE 225
            H L  G L  LS Q+++DC         G+ + GC+G    +  ++  +    L  E +
Sbjct: 168 AHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGRLMNSAFEYT-LKTGGLMREKD 226

Query: 226 YPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYL 285
           YP    D    +   S     + +++  ++  +E  I  ++  +GP+  A+NA   Q Y+
Sbjct: 227 YPYTGTDGGSCKLDRSKIVASVSNFSVVSI--NEDQIAANLIKNGPLAVAINAAYMQTYI 284

Query: 286 GGV-IQYNCDGSLANINHAVQIVGY 309
           GGV   Y C   L   NH V +VGY
Sbjct: 285 GGVSCPYICSRRL---NHGVLLVGY 306


>gi|356560855|ref|XP_003548702.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
          Length = 357

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 94/319 (29%), Positives = 143/319 (44%), Gaps = 35/319 (10%)

Query: 7   VLFIVALIALCFLA---IPVKVS--KPNLE------QKLELFSSFQQRYKKSYSK-SEHD 54
           + F + +  +CF +    PV+ S   PNL+      + ++LF  +++ +   Y    E  
Sbjct: 11  IFFFICITLICFSSSSNFPVQYSILGPNLDKLPSQDETIQLFQLWRKEHGLVYKDLKEMA 70

Query: 55  IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
            RF+ F  +L+ I E N  R SP     G+  F+D S  EF+  +L HS+          
Sbjct: 71  KRFEIFLSNLNYIIEFNAKRSSPSGYLLGLNNFADWSPSEFQEIYL-HSL---------- 119

Query: 115 HHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
             D   +   K     G  +    P   DWR    +  ++NQ +CG+CWAFS     E +
Sbjct: 120 --DMPTDSAPK---LNGPLLSCIAPASLDWRNKVAVTAIKNQGSCGSCWAFSAAGAIEGI 174

Query: 175 HALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAA 234
           HA+  G L  LS QE+++C    + GC+GG      DW+  N  +   E+EYP   KD  
Sbjct: 175 HAITTGELISLSEQELVNCD-RVSKGCNGGWVNKAFDWVISNGGI-TLEAEYPYTGKDGG 232

Query: 235 -CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQ-YN 292
            C      P    I  Y  + +  S++ +L  I    P+   +NA  +Q Y  G+     
Sbjct: 233 NCNSDKQVPIKATIDGY--EQVEQSDNGLLCSIVKQ-PISICLNATDFQLYESGIFDGQQ 289

Query: 293 CDGSLANINHAVQIVGYDN 311
           C  S    NH V IVGYD+
Sbjct: 290 CSSSSKYTNHCVLIVGYDS 308


>gi|7211743|gb|AAF40415.1|AF216784_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
          Length = 368

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 83/284 (29%), Positives = 138/284 (48%), Gaps = 36/284 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F++R+ K+Y S  EHD R   F+ ++       ++++   +A +G+T+FSD +  EF
Sbjct: 51  FTVFKRRFGKAYASDEEHDYRLSVFKANM---RRAKRHQELDPAAVHGVTQFSDSTPTEF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L   +N+ +                     T   +PT  +P   DWR+ G +  V+
Sbjct: 108 RRKFL--GLNRRLKFPADAK--------------TAPILPTDELPSDFDWRDRGAVTPVK 151

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ TCG CW+FST    E  + L  G L  LS Q+++DC        AG+ + GC+GG  
Sbjct: 152 NQGTCGLCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDFGCNGGLM 211

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
            +  ++  +    L  E +YP    D    R   +    K+ +++  +L   E  I  ++
Sbjct: 212 NSAFEYT-LKAGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSL--DEDQIAANL 268

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             +GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 269 VKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 309


>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
          Length = 473

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 80/287 (27%), Positives = 146/287 (50%), Gaps = 27/287 (9%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           ++ + ++ S+  +++K+Y+   E + RF  F+ +L+ I++ N +    ++ + G+ +F+D
Sbjct: 47  DEVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSD--DSQTFKVGLNKFAD 104

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK--KRSITTGITIPTGIPVKKDWREA 147
           L+ EEF++ +L     +    S         + VK  +     G  +P  +    DWR+ 
Sbjct: 105 LTNEEFRSVYL----GRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAV----DWRKN 156

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G + KV++Q  CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC GG   
Sbjct: 157 GAVAKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNSGCDGG--- 213

Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
            L+D+     +N   ++ +++YP   KD  C +   +   V I  +  + +  ++   L 
Sbjct: 214 -LMDYAYEFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDF--EDVPENDEKALQ 270

Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               H PV  A+ A   T+Q+Y  GV    C    A+++H V  VGY
Sbjct: 271 KAVAHQPVSVAIEAGGSTFQFYQSGVFTGKCG---ADLDHGVVAVGY 314


>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 99/325 (30%), Positives = 153/325 (47%), Gaps = 38/325 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY---DNYSRTW 316
           S A+ INHAV  +GY   +N  + W
Sbjct: 279 SCADRINHAVTAIGYGTDENGQKYW 303


>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
 gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 89/304 (29%), Positives = 135/304 (44%), Gaps = 26/304 (8%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIE 68
           +     L FLA  V           E    +  RY K Y   E  + RF+ F+++++ IE
Sbjct: 12  LALFFCLGFLAFQVASRTLQDASMYERHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIE 71

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
             N     P   + GI +F+DL+ EEF     R+  N H   S+ +     + +V     
Sbjct: 72  AFNNAANKP--YKLGINQFADLTSEEFIVP--RNRFNGHTRSSNTRTTTFKYENV----- 122

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                  T +P   DWR+ G +  ++NQ +CG CWAFS +   E +H +  G L  LS Q
Sbjct: 123 -------TVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQ 175

Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
           EV+DC   G + GC GG       ++  N  +   E+ YP    D  C  K  + +   I
Sbjct: 176 EVVDCDTKGTDHGCEGGYMDGAFKFIIQNHGI-NTEASYPYKGVDGKCNIKEEAVHAATI 234

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
             Y  D  I +E ++   +A   PV  A++A    +Q+Y  G+   +C   L   +H V 
Sbjct: 235 TGYE-DVPINNEKALQKAVANQ-PVSVAIDASGADFQFYKSGIFTGSCGTEL---DHGVT 289

Query: 306 IVGY 309
            VGY
Sbjct: 290 AVGY 293


>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
 gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
          Length = 475

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/287 (33%), Positives = 131/287 (45%), Gaps = 33/287 (11%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F  R++K YS K E   RF+ F+K+   I EL KN Q   +A YG T+FSD++  EF
Sbjct: 172 FLDFIDRHEKRYSNKREVLKRFRTFKKNAKAIRELQKNEQG--TAVYGFTKFSDMTTMEF 229

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI-PTGIPVKKDWREAGIIGKVR 154
           K   L +   + V        +             GITI    +P   DWR+ G + +V+
Sbjct: 230 KQTMLPYQWEQPVYPMDQADFEKE-----------GITISEEDLPESFDWRDKGAVTQVK 278

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           NQ  CG+CWAFST    E    L    L  LS QE++DC G  + GC+GG        + 
Sbjct: 279 NQGNCGSCWAFSTTGNVEGAWFLAKNKLVSLSEQELVDCDGV-DQGCNGGLPSNAYKEI- 336

Query: 215 VNKVVLEPESEYPLLLKDAAC----KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           +    LEPE  YP   K   C    K  A   NG          L   E  +   + T G
Sbjct: 337 IRMGGLEPEDAYPYDGKGETCHLVRKDIAVYING-------SIELPHDEVEMQKWLVTKG 389

Query: 271 PVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGYDNYSR 314
           P+   +NA T Q+Y  GV+   +  C+  +  +NH V IVGY    R
Sbjct: 390 PISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVGYGKDGR 434


>gi|375073982|gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii]
          Length = 467

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 84/295 (28%), Positives = 135/295 (45%), Gaps = 26/295 (8%)

Query: 21  IPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE 78
           +P   +  + E+ L   F+ F+QRY + Y S +E   R   F K+L  ++       +P 
Sbjct: 21  VPAATASLHAEETLASQFADFKQRYGRVYKSAAEEAFRLSVFRKNL--LDAKLHAAANPH 78

Query: 79  SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG- 137
            A +G+T FSDL+ EEF++RH               H    H    ++     + +  G 
Sbjct: 79  -ATFGVTPFSDLTREEFRSRH---------------HSGAAHFAAGRKRARVPVDVGVGD 122

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
            P   DWR+ G +  V++Q  CG+CWAFS +   E    L    L+ LS Q ++ C    
Sbjct: 123 APAAVDWRDRGAVTPVKDQGQCGSCWAFSAIGNVEGQWFLAGNALTSLSEQMLVSC-DTM 181

Query: 198 NMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLI 256
           + GC GG   +  +W+ + +   +  E  Y     D   +   TS   V         L 
Sbjct: 182 DSGCDGGLMNSAFEWIVEHHNGTVYTEESYRYASGDGIAQPCRTSGRTVGAVITGHVKLP 241

Query: 257 PSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
           P E+ + T +A +GP+  AV+A +W +Y GGV+       L   +H V +VGY++
Sbjct: 242 PDEAKMATWLAANGPLAVAVDASSWMFYTGGVLTSCVSNEL---DHGVLLVGYND 293


>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 95/318 (29%), Positives = 153/318 (48%), Gaps = 43/318 (13%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFKNFE 61
           + N   +  L  LCF A  +   + N +  +     S+  +Y +SY   +E D +F+ F+
Sbjct: 3   IPNASLLAILGCLCFFASGLAARELNDDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFK 62

Query: 62  KSLDIIEELN-KNRQSPESARYGITEFSDLSEEEFK-TRHLRHSVNKHVLMSHHKHHDHH 119
            +   I+  N KN +       GI +F+D++ EEFK T+  +  ++  V  S    +++ 
Sbjct: 63  ANAAFIDSFNAKNHK----FWLGINQFADITNEEFKVTKTNKGFISNKVRASTGFSYEN- 117

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
                       ++I   +P   DWR  G +  V++Q  CG CWAFS V   E +  L  
Sbjct: 118 ------------VSIDA-LPATIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLST 164

Query: 180 GTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDA 233
           G L  LS QE++DC  +G + GC GG    L+D  D  K +     L  ES YP   +D 
Sbjct: 165 GKLVSLSEQELVDCDVHGEDQGCEGG----LMD--DAFKFIITNGGLTQESSYPYDAEDG 218

Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQY 291
            CK  + S     IKSY  D    +E +++  +A   PV  AV+   +T+Q+Y GGV+  
Sbjct: 219 KCKSGSKSAG--TIKSYE-DVPANNEGALMKAVANQ-PVSVAVDGGDMTFQFYSGGVMTG 274

Query: 292 NCDGSLANINHAVQIVGY 309
           +C     +++H +  +GY
Sbjct: 275 SCG---TDLDHGIAAIGY 289


>gi|18407961|ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
 gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol protease aleurain-like; Flags: Precursor
 gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70 [Arabidopsis thaliana]
 gi|332644500|gb|AEE78021.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
          Length = 358

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 90/278 (32%), Positives = 133/278 (47%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F  RY K Y S  E  +RF  F+++LD+I   NK   S    +  + +F+DL+ +EF
Sbjct: 59  FSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLS---YKLSLNQFADLTWQEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L   HK                 IT  T +P  KDWRE GI+  V+
Sbjct: 116 QRYKLGAAQNCSATLKGSHK-----------------ITEAT-VPDTKDWREDGIVSPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
            Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC GG      +++
Sbjct: 158 EQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYI 217

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP   KD  CK  A +  GV+++  + +  + +E  +   +    PV 
Sbjct: 218 KYNG-GLDTEEAYPYTGKDGGCKFSAKNI-GVQVRD-SVNITLGAEDELKHAVGLVRPVS 274

Query: 274 AAVNAL-TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
            A   +  +++Y  GV   N C  +  ++NHAV  VGY
Sbjct: 275 VAFEVVHEFRFYKKGVFTSNTCGNTPMDVNHAVLAVGY 312


>gi|343477225|emb|CCD11889.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 447

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 93/319 (29%), Positives = 157/319 (49%), Gaps = 34/319 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 67  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   + + TG  P   DWR+ G +  V++Q  CG+CWAFS +   E    +  
Sbjct: 111 ALKRPRKV---VNVSTGKAPPAVDWRKKGAVTPVKDQGACGSCWAFSAIGNIEGQWKVAG 167

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLL---KDAAC 235
             L+ LS Q ++ C    + GC GG     L W+   NK  +     YP      K   C
Sbjct: 168 HELTSLSEQMLVSC-DTTDYGCRGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPC 226

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
             K+    G KI  +    L   E++I   +A +GPV  AV+A ++  Y GGV+  +C  
Sbjct: 227 -NKSGKVVGAKISGHI--NLPKDENAIAEWLAKNGPVAIAVDATSFLGYKGGVLT-SCIS 282

Query: 296 SLANINHAVQIVGYDNYSR 314
               ++H V +VGYD+ S+
Sbjct: 283 K--GLDHDVLLVGYDDTSK 299


>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 369

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 89/284 (31%), Positives = 137/284 (48%), Gaps = 37/284 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F SF + + K Y S  E++ RF  F+ +L  ++ L      P +A +G+T FSDL+EEEF
Sbjct: 56  FESFMKDFGKVYHSVEEYEHRFGVFKSNL--LKALKHQALDP-TASHGVTMFSDLTEEEF 112

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
            +++L   + +  ++S               S      +PT  +P   DWRE G +G V+
Sbjct: 113 TSKYL--GLKRPSVLS---------------SAPQAPPLPTEDLPPNFDWREKGAVGPVK 155

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           +Q  CG+CWAFST    E  H L +G L  LS Q+++DC        A   + GC+GG  
Sbjct: 156 DQGGCGSCWAFSTTGAVEGAHFLNSGKLVSLSEQQLVDCDHQCDREEADACDAGCNGGFM 215

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
                +++     LE ES+YP   +D  CK  +   N V +K      +   E  +   +
Sbjct: 216 TNAYQYVEAAG-GLELESDYPYEGRDGKCKFDS---NKVAVKVSNFTNIPVDEDQVAAYL 271

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
              GP+   +NA   Q Y+ GV     C+    N++H V +VGY
Sbjct: 272 IKSGPLAIGINAEFMQTYIAGVSCPIFCNKR--NLDHGVLLVGY 313


>gi|1222694|gb|AAA92018.1| CP5 [Dictyostelium discoideum]
          Length = 344

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 86/301 (28%), Positives = 142/301 (47%), Gaps = 29/301 (9%)

Query: 13  LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
           L  LC L + V  +K      Q    F+ +   ++KSY+  E   R+  F  ++D +++ 
Sbjct: 4   LSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSEEFGARYNIFTANMDYVQQW 63

Query: 71  NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           N   +  E+   G+  F+D++ EE++  +L    +   L+   +   H ++         
Sbjct: 64  NS--KGSETV-LGLNNFADITNEEYRNTYLGTKFDASSLIGTQEEKVHTNSSA------- 113

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
                      KDWR  G +  V+NQ  CG CW+FST  + E  H    G L  LS Q +
Sbjct: 114 ---------ASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNL 164

Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
           IDC+   N GC GG      +++ +N   ++ ES YP   ++  C+ K+ +  G  + SY
Sbjct: 165 IDCSTE-NSGCDGGLMTYAFEYI-INNNGIDTESSYPYKAENGKCEYKSENS-GATLSSY 221

Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
              T   SESS+ + +  + PV  A++A   ++Q Y  G I Y  + S  N++H V  VG
Sbjct: 222 KTVT-AGSESSLESAVNVN-PVSVAIDASHQSFQLYTSG-IYYEPECSSENLDHGVLAVG 278

Query: 309 Y 309
           Y
Sbjct: 279 Y 279


>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
 gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
 gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
          Length = 381

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 84/287 (29%), Positives = 144/287 (50%), Gaps = 40/287 (13%)

Query: 37  FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+SF++R+ ++Y  + E   R   F  +L       ++++   +A +G+T+FSDL+  EF
Sbjct: 58  FASFERRFGRTYRDAGERAYRMSVFAANL---RRARRHQRLDPTATHGVTKFSDLTPGEF 114

Query: 96  KTRHL---RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIG 151
           + R L   R S+   V    H+                   +PT G+P   DWRE G +G
Sbjct: 115 RDRFLGLRRPSLEGLVGGEPHE----------------APILPTDGLPDDFDWREHGAVG 158

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSG 203
            V++Q +CG+CW+FST    E  H L  G L +LS Q+++DC        +   + GC+G
Sbjct: 159 PVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNG 218

Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
           G       ++ +    L+ E +YP   ++  CK    S    ++K+++  ++  +E  I 
Sbjct: 219 GLMTTAFSYL-MKSGGLQSEKDYPYAGRENTCKFD-KSKIVAQVKNFSVISV--NEDQIA 274

Query: 264 TDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            ++  HGP+  A+NA   Q Y+GGV   + C     +++H V +VGY
Sbjct: 275 ANLVKHGPLAIAINAAYMQTYIGGVSCPFICG---RHLDHGVLLVGY 318


>gi|72389861|ref|XP_845225.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389863|ref|XP_845226.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359933|gb|AAX80358.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359934|gb|AAX80359.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801760|gb|AAZ11666.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801761|gb|AAZ11667.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 158/316 (50%), Gaps = 30/316 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
           V+ V   V L+A+      V +   ++E+ LE+ F++F+++Y K Y  + E   RF+ FE
Sbjct: 7   VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E+      +   A +G+T FSD++ EEF+ R+              ++   +  
Sbjct: 67  ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +KR   T + + TG  P   DWRE G +  V++Q  CG+CWAFST+   E    +   
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q ++ C    + GC+GG      +W+ + N   +  E+ YP +  +   ++  
Sbjct: 169 PLVSLSEQMLVSC-DTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225

Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
              NG +I +   D   L   E +I   +A +GP+  AV+A ++  Y GG++  +C  + 
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282

Query: 298 ANINHAVQIVGYDNYS 313
             ++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298


>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
           1; Flags: Precursor
 gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
          Length = 380

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 150/312 (48%), Gaps = 27/312 (8%)

Query: 2   FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
           F   ++LF   L+ L        +++   ++   ++ S+  +Y KSY S  E + RF+ F
Sbjct: 7   FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           +++L  I+E N +     S + G+ +F+DL++EEF++ +LR +   +     +++     
Sbjct: 67  KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLRFTSGSNKTKVSNRYEPR-- 122

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                     G  +P+ +    DWR AG +  +++Q  CG CWAFS + T E ++ +  G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169

Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS QE+IDC    N  GC+GG       ++ +N   +  E  YP   +D  C    
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNVDL 228

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
            +   V I +Y  + +  +    L    T+ PV  A++A    ++ Y  G+    C  + 
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA- 285

Query: 298 ANINHAVQIVGY 309
             ++HAV IVGY
Sbjct: 286 --VDHAVTIVGY 295


>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
 gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
 gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
          Length = 344

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 86/301 (28%), Positives = 142/301 (47%), Gaps = 29/301 (9%)

Query: 13  LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
           L  LC L + V  +K      Q    F+ +   ++KSY+  E   R+  F+ ++D +++ 
Sbjct: 4   LSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSEEFGARYNIFKANMDYVQQW 63

Query: 71  NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           N   +  E+   G+  F+D++ EE++  +L    +   L+   +                
Sbjct: 64  NS--KGSETVL-GLNNFADITNEEYRNTYLGTKFDASSLIGTQEEK-------------- 106

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
                T     KDWR  G +  V+NQ  CG CW+FST  + E  H    G L  LS Q +
Sbjct: 107 --VFTTSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNL 164

Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
           IDC+   N GC GG      +++ +N   ++ ES YP   ++  C+ K+ + +G  + SY
Sbjct: 165 IDCSTE-NSGCDGGLMTYAFEYI-INNNGIDTESSYPYKAENGKCEYKSEN-SGATLSSY 221

Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
              T   SESS+ + +  + PV  A++A   ++Q Y  G I Y  + S  N++H V  VG
Sbjct: 222 KTVT-AGSESSLESAVNVN-PVSVAIDASHQSFQLYTSG-IYYEPECSSENLDHGVLAVG 278

Query: 309 Y 309
           Y
Sbjct: 279 Y 279


>gi|33333700|gb|AAQ11968.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 130/281 (46%), Gaps = 27/281 (9%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSE 92
           E +  F+  + K+Y S  E   RF  F+K+L  I+E NK   +  ES    +T+F+D++ 
Sbjct: 21  EEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTH 80

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF        V    L S+  H D+  +          I +     V  DWRE G +  
Sbjct: 81  EEFLDLLKLQGV--PALPSNAVHFDNSED----------IDMEEKDAV--DWREEGAVTP 126

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
            ++Q  CG+CWAFS V   E     KNGTL  LS QE++DCA    GN GC GG      
Sbjct: 127 AKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAF 186

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           D+  V    ++ E  YP   + ++CK+       VK   +  D     E  +   +A  G
Sbjct: 187 DF--VQDEGIQTEESYPYEGRRSSCKKSGEYVTKVKTYVFPLD-----EQEMARTVAAKG 239

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
           PV  A+ A    +Y  G++   C  S    ++NH V +VGY
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGY 280


>gi|66814630|ref|XP_641494.1| cysteine protease [Dictyostelium discoideum AX4]
 gi|118121|sp|P04989.1|CYSP2_DICDI RecName: Full=Cysteine proteinase 2; AltName: Full=Prestalk
           cathepsin; Flags: Precursor
 gi|167860|gb|AAA33240.1| pst-cathepsin [Dictyostelium discoideum]
 gi|1834417|emb|CAA27050.1| cysteine proteinase 2 [Dictyostelium discoideum]
 gi|60469522|gb|EAL67513.1| cysteine protease [Dictyostelium discoideum AX4]
 gi|225484|prf||1304284A cathepsin,prestalk
          Length = 376

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 86/281 (30%), Positives = 133/281 (47%), Gaps = 21/281 (7%)

Query: 32  QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLS 91
           Q    F+ +  ++ + YS SE   R+  F+ ++D ++  N N +       G+  F+D++
Sbjct: 31  QYRTAFTEWTLKFNRQYSSSEFSNRYSIFKSNMDYVD--NWNSKGDSQTVLGLNNFADIT 88

Query: 92  EEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
            EE++  +L   VN H            +N    R +     + T  P   DWR    + 
Sbjct: 89  NEEYRKTYLGTRVNAH-----------SYNGYDGREVLNVEDLQTN-PKSIDWRTKNAVT 136

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG-NGNMGCSGGDFCALL 210
            +++Q  CG+CW+FST  + E  HALK   L  LS Q ++DC+G   N GC GG      
Sbjct: 137 PIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAF 196

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           D++  NK + + ES YP   +  +      S  G  IK Y  +    SE S L + A HG
Sbjct: 197 DYIIKNKGI-DTESSYPYTAETGSTCLFNKSDIGATIKGY-VNITAGSEIS-LENGAQHG 253

Query: 271 PVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           PV  A++A   ++Q Y  G I Y    S   ++H V +VGY
Sbjct: 254 PVSVAIDASHNSFQLYTSG-IYYEPKCSPTELDHGVLVVGY 293


>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|118350314|ref|XP_001008438.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89290205|gb|EAR88193.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 389

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/324 (29%), Positives = 140/324 (43%), Gaps = 57/324 (17%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
           NL Q  +LFS F+  +KK Y+  E   RF+ F ++LDII ELN+  +   +A YGIT+FS
Sbjct: 32  NLTQVKQLFSKFKAEHKKFYNFLEEQRRFEIFRQNLDIISELNQVEEG--TAEYGITQFS 89

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
           D++ EEFK++ L  S       + +     +H   K         I    P   DWR+ G
Sbjct: 90  DMTTEEFKSQILIPST-----YARNFTGSRYHGFQK---------ISQDAPTSYDWRDHG 135

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-------AGNGNMGC 201
            +  V+NQ T G CW FST    E    L    L  LS ++++DC        G+ + G 
Sbjct: 136 AVTPVKNQGTVGTCWTFSTTGNIEGQWFLAGNPLVSLSEEQIVDCDGSQEPSTGHADCGV 195

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK------------------------- 236
            GG      D++ +N   L  E  YP  + +  C                          
Sbjct: 196 FGGWPYLAFDYV-INAGGLPSEETYPYCVGNGGCYPCPAPGYNETLCGPAVPYCNATAYP 254

Query: 237 -RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CD 294
            R+   P   KI+ +    L   E SI   +   GP+  A++A   Q+Y  G+     C 
Sbjct: 255 CRQGQVPIAAKIEDW--KALSKDEDSIKQQLFEIGPLSVALDASYLQFYKKGISAPKFC- 311

Query: 295 GSLANINHAVQIVGY--DNYSRTW 316
            S   +NHAV + GY  DN    W
Sbjct: 312 -SKTTLNHAVLLTGYGIDNGVEFW 334


>gi|19849|emb|CAA78361.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 95/331 (28%), Positives = 146/331 (44%), Gaps = 52/331 (15%)

Query: 8   LFIVALIALCFL--AIPVKVSKPNLEQKLEL------------FSSFQQRYKKSY-SKSE 52
           LF+++L+A      AI      P + Q +              FS F+ ++ K Y S+ E
Sbjct: 4   LFLLSLLAFVLFSSAIAFSDEDPLIRQVVSETDDSHLLNAEHHFSLFKSKFGKIYASEEE 63

Query: 53  HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
           HD RFK F+ +L        N+    SA +GIT+FSDL+  EF+  +L            
Sbjct: 64  HDHRFKVFKANL---RRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGL---------- 110

Query: 113 HKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
                  H    K +      +PT  +P   DWR+ G +  V+NQ +CG+CW+FST    
Sbjct: 111 -------HKPKPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAV 163

Query: 172 ESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPE 223
           E  H L  G L  LS Q+++DC            + GC GG +    ++  +    L+ E
Sbjct: 164 EGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGHYATAFEYT-LKAGGLQLE 222

Query: 224 SEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY 283
            +YP   KD  C     S     + +++   L   E  I  ++  HGP+   +NA   Q 
Sbjct: 223 KDYPYTGKDGKCHFD-KSKICAAVTNFSVIGL--DEDQIAANLVKHGPLAVGINAAWMQT 279

Query: 284 YLGGVIQYNCDG-SLANINHAVQIVGYDNYS 313
           Y+GGV   +C        +H V +VGY ++ 
Sbjct: 280 YVGGV---SCPLICFKRQDHGVLLVGYGSHG 307


>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
           Short=PPII; Flags: Precursor
 gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
          Length = 352

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 91/285 (31%), Positives = 133/285 (46%), Gaps = 28/285 (9%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++E+ ++LF S+  ++ K Y   +  I RF+ F  +L  I+E NK   S      G+  F
Sbjct: 40  SIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNS---YWLGLNGF 96

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DLS +EFK +++         + H  + D  + HV            T  P   DWR  
Sbjct: 97  ADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKHV------------TNYPQSIDWRAK 144

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG+CWAFST+ T E ++ +  G L  LS QE++DC  + + GC GG   
Sbjct: 145 GAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQT 203

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
             L ++  N V       YP   K   C  +AT   G K+K  T    +PS  E+S L  
Sbjct: 204 TSLQYVANNGV--HTSKVYPYQAKQYKC--RATDKPGPKVK-ITGYKRVPSNCETSFLGA 258

Query: 266 IATHG-PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A     V+       +Q Y  GV    C   L   +HAV  VGY
Sbjct: 259 LANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKL---DHAVTAVGY 300


>gi|79314271|ref|NP_001030812.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
 gi|332644501|gb|AEE78022.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
          Length = 357

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 90/278 (32%), Positives = 133/278 (47%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F  RY K Y S  E  +RF  F+++LD+I   NK   S    +  + +F+DL+ +EF
Sbjct: 59  FSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLS---YKLSLNQFADLTWQEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L   HK                 IT  T +P  KDWRE GI+  V+
Sbjct: 116 QRYKLGAAQNCSATLKGSHK-----------------ITEAT-VPDTKDWREDGIVSPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
            Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC GG      +++
Sbjct: 158 EQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYI 217

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP   KD  CK  A +  GV+++  + +  + +E  +   +    PV 
Sbjct: 218 KYNG-GLDTEEAYPYTGKDGGCKFSAKNI-GVQVRD-SVNITLGAEDELKHAVGLVRPVS 274

Query: 274 AAVNAL-TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
            A   +  +++Y  GV   N C  +  ++NHAV  VGY
Sbjct: 275 VAFEVVHEFRFYKKGVFTSNTCGNTPMDVNHAVLAVGY 312


>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 333

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 88/308 (28%), Positives = 155/308 (50%), Gaps = 30/308 (9%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
           F + L +LC   + +  + P  +Q L+  +  ++ +++++Y+ +E   R   +EK+L +I
Sbjct: 3   FYLCLASLC---LGLVAATPEFDQTLDSQWHQWKAQHRRTYAANEDGWRRATWEKNLKMI 59

Query: 68  EELNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           E  N    + + S + G+ +F D++ EEFK           V+      + ++ N  +KR
Sbjct: 60  EMHNLEYSAGKHSFQLGMNKFGDMTTEEFK----------QVM------NGYNSNGSQKR 103

Query: 127 SITTGITIP--TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
           +  +    P    +P   DWRE G +  V+NQ  CG+CWAFS   + E     K   L  
Sbjct: 104 TKGSLYREPLLAQLPKSVDWREKGYVTPVKNQGQCGSCWAFSATGSLEGQWFHKTKKLVS 163

Query: 185 LSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           LS Q ++DC+   GN GCSGG      +++  N   ++ E  YP L +D  CK +A   +
Sbjct: 164 LSEQNLVDCSTSEGNNGCSGGLMDNAFEYVK-NNGGIDTEQAYPYLGQDNECKYRAEC-S 221

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
           G  +  +  D    +E +++  +A  GP+  A++A   ++Q+Y  GV  Y    S + ++
Sbjct: 222 GANVTGFV-DIPSMNERALMKAVANVGPISVAIDAGNPSFQFYESGVY-YEPQCSSSQLD 279

Query: 302 HAVQIVGY 309
           H V +VGY
Sbjct: 280 HGVLVVGY 287


>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 463

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 85/264 (32%), Positives = 131/264 (49%), Gaps = 32/264 (12%)

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
           +E D RF+ F+ +L  I+E N    S    + G+T F+DL+ EE+++ +L     K VL 
Sbjct: 69  AEKDQRFEIFKDNLRFIDEHNTKNLS---YKLGLTRFADLTNEEYRSMYLGAKPTKRVL- 124

Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
              K  D +   V       G  +P  +    DWR+ G +  V++Q +CG+CWAFST+  
Sbjct: 125 ---KTSDRYQARV-------GDALPDSV----DWRKEGAVADVKDQGSCGSCWAFSTIGA 170

Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYP 227
            E ++ +  G L  LS QE++DC  + N GC+GG    L+D+     +    ++ E++YP
Sbjct: 171 VEGINKIVTGDLISLSEQELVDCDTSYNQGCNGG----LMDYAFEFIIKNGGIDTEADYP 226

Query: 228 LLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYL 285
               D  C +   +   V I SY  D    SE+S+   +A H P+  A+ A    +Q Y 
Sbjct: 227 YKAADGRCDQNRKNAKVVTIDSYE-DVPENSEASLKKALA-HQPISVAIEAGGRAFQLYS 284

Query: 286 GGVIQYNCDGSLANINHAVQIVGY 309
            GV    C   L   +H V  VGY
Sbjct: 285 SGVFDGLCGTEL---DHGVVAVGY 305


>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEL 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
          Length = 588

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 89/305 (29%), Positives = 146/305 (47%), Gaps = 32/305 (10%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           + L A C   + +  + P  +Q L+  +  ++  +++ Y  +E   R   +EK++ +IE 
Sbjct: 5   LVLAAFC---LGIASAAPKFDQNLDTQWYQWKATHRRLYGTNEEGWRRAVWEKNMKMIEL 61

Query: 70  LNKN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            N    Q        +  F D++ EEF+           V   + KH        K R +
Sbjct: 62  HNGEYSQGKHGFTMAMNAFGDMTNEEFR--------QVMVCFRNQKH--------KNRKV 105

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
             G  +   +P   DWR+ G +  V+NQ+ CG+CWAFS     E     K G L  LS Q
Sbjct: 106 FRGPLL-LNLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164

Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
            ++DC+   GN GC+GG       ++  N   L+ E+ YP + KD +CK K  +     +
Sbjct: 165 NLVDCSHPQGNQGCNGGFMNNAFQYVKENG-GLDSEASYPYVAKDGSCKYKPEN----SV 219

Query: 248 KSYTCDTLIPS-ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAV 304
            + T   +IP+ E  ++  +AT GP+  AV+A   ++Q+Y  G I +  D S  N++H V
Sbjct: 220 ANDTGFVVIPAHEKELMKAVATVGPISVAVDASHSSFQFYKSG-IYFEQDCSSKNLDHGV 278

Query: 305 QIVGY 309
            +VGY
Sbjct: 279 LVVGY 283


>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
          Length = 358

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 134/278 (48%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y  +E   +RF  F+++LD+I   NK R S    + G+ +F+DL+ +EF
Sbjct: 59  FARFTHRYGKKYQNAEEIKLRFSIFKENLDLIRSTNKKRLS---YKLGVNQFADLTWQEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L   HK  +                    +P  KDWRE GI+  V+
Sbjct: 116 QRNKLGAAQNCSATLKGSHKLTE------------------AALPETKDWREDGIVSPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC+GG      +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP   KD  CK  A +  GV++   + +  + +E  +   +    PV 
Sbjct: 218 KSNG-GLDTEEAYPYTGKDGTCKYSAENV-GVQVLD-SVNITLGAEDELKHAVGLVRPVS 274

Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            A   + +++ Y  GV    +C  +  ++NHAV  VGY
Sbjct: 275 IAFEVVKSFRLYKSGVYTDSHCGNTPMDVNHAVLAVGY 312


>gi|7381221|gb|AAF61441.1|AF138265_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
          Length = 366

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 84/284 (29%), Positives = 139/284 (48%), Gaps = 36/284 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F++R+ K Y S  EHD R   F+ ++       ++++   +A +G+T+FSDL+  EF
Sbjct: 49  FTVFKRRFGKVYASDEEHDYRLSVFKANM---RRAKQHQELDPAAVHGVTQFSDLTPTEF 105

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L   +N+ +                     T   +PT  +P   DWR+ G +  V+
Sbjct: 106 RRKFL--GLNRRLKFPADAK--------------TAPILPTDELPSDFDWRDHGAVTPVK 149

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ TCG+CW+FST    E  + L  G L  LS Q+++DC        AG+ + GC+GG  
Sbjct: 150 NQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM 209

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
            +  ++  +    L  E +YP    D    R   +    K+ +++  +L   E  I  ++
Sbjct: 210 NSAFEYT-LKAGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSL--DEDQIAANL 266

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             +GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 267 VKNGPLAVAINAVFVQTYIGGVSCPYICSKRL---DHGVLLVGY 307


>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
          Length = 603

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 91/309 (29%), Positives = 147/309 (47%), Gaps = 41/309 (13%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
           LF + L  LC      + + P  E   +L+  F+Q+YKK+Y   + + RF  F+++L   
Sbjct: 283 LFTLELWCLC-----ARTTTPEPENARQLYEEFKQKYKKTYVNDDDEYRFSVFKENLLRA 337

Query: 68  EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
            +L    Q   +A YG+T+F DL+ +EF+ ++L             K+ D      ++ S
Sbjct: 338 HQLQTMEQG--TAEYGVTQFFDLTSQEFQIQYL-----------GFKYEDMQD--TEEMS 382

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
            +T + +        DWR+ G +G V +Q  CG+CWAFST+   E    LK G L  LS 
Sbjct: 383 PSTRVVMDED---SFDWRDHGAVGPVLDQGKCGSCWAFSTIGNIEGQWFLKTGELLSLSE 439

Query: 188 QEVIDCAGNGNMGCSGG----DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           Q++IDC  N + GC+GG     + A+     +    LE  S+YP       C        
Sbjct: 440 QQLIDCD-NVDEGCNGGYPPKTYGAV-----IKMGGLELNSDYPYKALAEKCHMDRQ--- 490

Query: 244 GVKIKSYTCDTLI-PSESSILTD-IATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN-I 300
             K+K Y  D+++ P    +  + +   GP+ +A+NA   ++Y  G++           +
Sbjct: 491 --KLKVYINDSVVFPRNEHLQAEALKLMGPLSSALNANPLKFYKTGIMHLPVASCFPRAL 548

Query: 301 NHAVQIVGY 309
           NHAV  VGY
Sbjct: 549 NHAVLTVGY 557



 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 90/171 (52%), Gaps = 12/171 (7%)

Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCS 202
           DWR+ G +G V NQ  CG+CWAFS V   E    LK+G L  LSVQ+V+DC  + + GC+
Sbjct: 44  DWRQHGAVGPVWNQGPCGSCWAFSAVGNIEGQWFLKSGELLHLSVQQVLDCD-HVDHGCN 102

Query: 203 GGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESS 261
           GG    +  +  VN++  L+ +++Y        C    +     K ++Y   ++I S++ 
Sbjct: 103 GGYPPQV--YRQVNQMGGLQLDADYSYKAAVGKCHTDRS-----KFRAYVNSSVILSQNE 155

Query: 262 IL--TDIATHGPVIAAVNALTWQYYLGGVIQYNCDG-SLANINHAVQIVGY 309
                 + T GP+ + +NA T Q+Y  G++       +   +NHAV  VGY
Sbjct: 156 QFQANKLKTIGPLASTLNARTLQFYRKGIMHPTPSACNPGQLNHAVLTVGY 206


>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
          Length = 466

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 85/280 (30%), Positives = 141/280 (50%), Gaps = 28/280 (10%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           L+ S+   + KSY+   E D RF+ F+ +L  I+E  +N    +S + G+T+F+DL+ EE
Sbjct: 48  LYESWLIEHGKSYNALGEKDKRFQIFKDNLKYIDE--QNSVPNQSYKLGLTKFADLTNEE 105

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +++ +L             K         K +S      +   +P   DWR+ G++  V+
Sbjct: 106 YRSIYL-----------GTKSSGDRRKLSKNKSDRYLPKVGDSLPESVDWRDKGVLVGVK 154

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-- 212
           +Q +CG+CWAFS V   ES++A+  G L  LS QE++DC  + N GC GG    L+D+  
Sbjct: 155 DQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYNEGCDGG----LMDYAF 210

Query: 213 -MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
              +N   ++ E +YP   ++  C +   +   VKI SY  D  + +E ++   +A H P
Sbjct: 211 EFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSYE-DVPVNNEKALQKAVA-HQP 268

Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           V  A+ A     Q+Y  G+    C  +   ++H V   GY
Sbjct: 269 VSIAIEAGGRDLQHYKSGIFTGKCGTA---VDHGVVAAGY 305


>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 79/285 (27%), Positives = 141/285 (49%), Gaps = 31/285 (10%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           ++ + ++ S+  ++ KSY+   E + RF+ F+ +L  I+E   N +   S + G+  F+D
Sbjct: 44  DEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDE--HNAEENLSYKVGLNRFAD 101

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ EE+++ +L       +               K +S      +   +P   DWR  G 
Sbjct: 102 LTNEEYRSTYLGAKSKPKL--------------SKVKSDRYAPRVGDSLPESVDWRAKGA 147

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  +++Q +CG+CWAFSTV   E ++ +  G L  LS QE++DC  + N GC GG    L
Sbjct: 148 VAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGG----L 203

Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
           +D+     +N   ++ + +YP L +DA C +   +   V I SY  D  + +E ++   +
Sbjct: 204 MDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYE-DVPVNNEEALKKAV 262

Query: 267 ATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           A+  PV   +      +Q+Y  G+    C  +L   +H V +VGY
Sbjct: 263 ASQ-PVSVGIEGGGRAFQFYDSGIFTGKCGTAL---DHGVNVVGY 303


>gi|118365724|ref|XP_001016082.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297849|gb|EAR95837.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 336

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/317 (30%), Positives = 158/317 (49%), Gaps = 39/317 (12%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHD------IRFKNF 60
           +L I+ L+ LC LA  + V      +KL  ++ +  + ++ Y  +EH+      + F+NF
Sbjct: 6   LLSIIMLMPLC-LAQNINV------EKLLAYNQWSSQNQRVY-LNEHEKLFRQMVFFENF 57

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHS-VNKHVL--MSHHKHHD 117
           +K    I+E N +  +  S    + +FSD+++EEF  + L  S +  H++  +S    H+
Sbjct: 58  QK----IQEHNSDPNNTYSVH--LNQFSDMTKEEFAEKILMKSDLVDHLMKGISQEATHN 111

Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
             +N+  + S +  +T+   I    DWR  G +  V+NQ  CG+CW+FS     ES + +
Sbjct: 112 DTNNNETQLS-SNSLTLADSI----DWRTKGAVTSVKNQGGCGSCWSFSAAAVMESFNFI 166

Query: 178 KNGTLSLLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA 233
           +N  L   S Q+++DC     G  + GC+GG     LD+   +KV +    +YP +    
Sbjct: 167 QNKALVDFSEQQLVDCVIPANGYNSYGCNGGWPVQCLDY--ASKVGITTLDKYPYVAVQK 224

Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNC 293
            C    T  NG K KS+     IP+ S+ L       PV   V+A TW  Y  G+    C
Sbjct: 225 NCNVTGTD-NGFKPKSW---IQIPNTSNDLKSALNFSPVSVLVDASTWGNYYSGIFN-GC 279

Query: 294 DGSLANINHAVQIVGYD 310
           D +  ++NHAV  VGYD
Sbjct: 280 DQTHISLNHAVLAVGYD 296


>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
 gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
 gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
 gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
 gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|1163075|emb|CAA81061.1| cysteine proteinase [Trypanosoma congolense]
          Length = 442

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 93/319 (29%), Positives = 157/319 (49%), Gaps = 34/319 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYS-KSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 2   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 61

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 62  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 105

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   + + TG  P   DWR+ G +  V++Q  CG+CWAFS +   E    +  
Sbjct: 106 ALKRPRKV---VNVSTGKAPPAVDWRKKGAVTPVKDQGACGSCWAFSAIGNIEGQWKVAG 162

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLL---KDAAC 235
             L+ LS Q ++ C    + GC GG     L W+   NK  +     YP      K   C
Sbjct: 163 HELTSLSEQMLVSC-DTTDYGCRGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPC 221

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
             K+    G KI  +    L   E++I   +A +GPV  AV+A ++  Y GGV+  +C  
Sbjct: 222 -NKSGKVVGAKISGHI--NLPKDENAIAEWLAKNGPVAIAVDATSFLGYKGGVLT-SCIS 277

Query: 296 SLANINHAVQIVGYDNYSR 314
               ++H V +VGYD+ S+
Sbjct: 278 K--GLDHDVLLVGYDDTSK 294


>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
 gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
 gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
          Length = 344

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 95/314 (30%), Positives = 147/314 (46%), Gaps = 33/314 (10%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDYM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGLMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSR 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG    NC  
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTYDGNC-- 280

Query: 296 SLANINHAVQIVGY 309
               INHAV  +GY
Sbjct: 281 -ADQINHAVTAIGY 293


>gi|72389847|ref|XP_845218.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389849|ref|XP_845219.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389851|ref|XP_845220.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389857|ref|XP_845223.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359926|gb|AAX80351.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359927|gb|AAX80352.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359928|gb|AAX80353.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359931|gb|AAX80356.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801753|gb|AAZ11659.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801754|gb|AAZ11660.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801755|gb|AAZ11661.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801758|gb|AAZ11664.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 450

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 158/316 (50%), Gaps = 30/316 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
           V+ V   V L+A+      V +   ++E+ LE+ F++F+++Y K Y  + E   RF+ FE
Sbjct: 7   VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E+      +   A +G+T FSD++ EEF+ R+              ++   +  
Sbjct: 67  ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +KR   T + + TG  P   DWRE G +  V++Q  CG+CWAFST+   E    +   
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q ++ C    + GC+GG      +W+ + N   +  E+ YP +  +   ++  
Sbjct: 169 PLVSLSEQMLVSC-DTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225

Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
              NG +I +   D   L   E +I   +A +GP+  AV+A ++  Y GG++  +C  + 
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282

Query: 298 ANINHAVQIVGYDNYS 313
             ++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298


>gi|72389855|ref|XP_845222.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389865|ref|XP_845227.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389867|ref|XP_845228.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359930|gb|AAX80355.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359935|gb|AAX80360.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359936|gb|AAX80361.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801757|gb|AAZ11663.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801762|gb|AAZ11668.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801763|gb|AAZ11669.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 158/316 (50%), Gaps = 30/316 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
           V+ V   V L+A+      V +   ++E+ LE+ F++F+++Y K Y  + E   RF+ FE
Sbjct: 7   VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E+      +   A +G+T FSD++ EEF+ R+              ++   +  
Sbjct: 67  ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +KR   T + + TG  P   DWRE G +  V++Q  CG+CWAFST+   E    +   
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q ++ C    + GC+GG      +W+ + N   +  E+ YP +  +   ++  
Sbjct: 169 PLVSLSEQMLVSC-DTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225

Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
              NG +I +   D   L   E +I   +A +GP+  AV+A ++  Y GG++  +C  + 
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282

Query: 298 ANINHAVQIVGYDNYS 313
             ++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298


>gi|6967097|emb|CAB72480.1| cysteine protease-like protein [Arabidopsis thaliana]
          Length = 377

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 90/278 (32%), Positives = 133/278 (47%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F  RY K Y S  E  +RF  F+++LD+I   NK   S    +  + +F+DL+ +EF
Sbjct: 59  FSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLS---YKLSLNQFADLTWQEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L   HK                 IT  T +P  KDWRE GI+  V+
Sbjct: 116 QRYKLGAAQNCSATLKGSHK-----------------ITEAT-VPDTKDWREDGIVSPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
            Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC GG      +++
Sbjct: 158 EQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYI 217

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP   KD  CK  A +  GV+++  + +  + +E  +   +    PV 
Sbjct: 218 KYNG-GLDTEEAYPYTGKDGGCKFSAKNI-GVQVRD-SVNITLGAEDELKHAVGLVRPVS 274

Query: 274 AAVNAL-TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
            A   +  +++Y  GV   N C  +  ++NHAV  VGY
Sbjct: 275 VAFEVVHEFRFYKKGVFTSNTCGNTPMDVNHAVLAVGY 312


>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
          Length = 477

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 99/288 (34%), Positives = 134/288 (46%), Gaps = 35/288 (12%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F  R++K YS K E   RF+ F+K+  +I EL KN Q   SA YG T+FSD++  EF
Sbjct: 174 FLDFIDRHEKRYSNKREVLKRFRTFKKNAKVIRELQKNEQG--SAVYGFTKFSDMTTMEF 231

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           K   L +   + V        +             G+TI    +P   DWR+ G + +V+
Sbjct: 232 KQTMLPYQWEQPVYPMAEADFEKE-----------GVTISEDDLPDSFDWRDHGAVTQVK 280

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGG-DFCALLDWM 213
           NQ  CG+CWAFST    E    L    L  LS QE++DC  + + GC+GG    A  + M
Sbjct: 281 NQGNCGSCWAFSTTGNVEGAWYLAKKKLVSLSEQELVDC-DSVDQGCNGGLPSNAYKEIM 339

Query: 214 DVNKVVLEPESEYPLLLKDAAC----KRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            +    LEPE  YP   K   C    K  A   NG          L   E  I   + T 
Sbjct: 340 RMGG--LEPEDAYPYDGKGETCHIVRKDIAVYING-------SVELPHDEVKIQKWLVTK 390

Query: 270 GPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGYDNYSR 314
           GP+   +NA T Q+Y  GV+   +  C+  +  +NH V IVGY    R
Sbjct: 391 GPISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVGYGKDGR 436


>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
 gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
 gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|41323856|gb|AAS00027.1| cathepsin L-like cysteine proteinase [Taenia solium]
          Length = 339

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 85/297 (28%), Positives = 144/297 (48%), Gaps = 25/297 (8%)

Query: 19  LAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSP 77
           LA+ V+ S    E++L   ++ ++ ++ + YS  E   R   F ++L  I+  N+   + 
Sbjct: 16  LAVVVETSALLTERELSRQWAGWKLQHGRVYSGKEEAYRRGVFARNLLYIKGQNRRFNAG 75

Query: 78  -ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
            ES   G+ +F+DL   EF  R L       V               ++  I   +    
Sbjct: 76  LESYSTGLNQFADLESSEFSERFLGTRPESRVAG-------------RRGRIWKALASAA 122

Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-G 195
           G+P   DWR+  ++ +V+NQ  CG+CWAFS+    E   A K G L  LS Q+++DC+  
Sbjct: 123 GLPDTVDWRDKNLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLK 182

Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
           NGN GC+GG       +++ +   +EPES YP    D  C+   +   GV   +   D  
Sbjct: 183 NGNDGCNGGYMSYAFKYLEEH--FIEPESAYPYRATDGPCRYNESL--GVGTVTDIGDIP 238

Query: 256 IPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
             +E++++  +AT GP+  A++A  L + +Y  G+ + + C      +NH V  +GY
Sbjct: 239 EGNETALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSSKF--LNHGVLAIGY 293


>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
          Length = 351

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 91/304 (29%), Positives = 144/304 (47%), Gaps = 22/304 (7%)

Query: 12  ALIALCFLAIPVKVSK-PNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
           A++ +  +  P  + + PN     +L+  F+  ++++Y ++E   R + F  +L  IE  
Sbjct: 18  AMVPMTNILRPDTILRFPNQVPFEKLWQDFKTVHERNYGETEEMQRKEVFRNNLKKIEMH 77

Query: 71  NK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
           N  + Q   S R GI +F+D+  +EF +      VN   + +  K  DH H+H       
Sbjct: 78  NYLHSQGKSSYRMGINQFADMEVKEFAS-----VVNGFRMNNRTKVRDHLHSHY------ 126

Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
               IP  +P + DWR+ G +  +++Q  CG+CW+FST    E  H  K G L  LS Q 
Sbjct: 127 ISPAIPVSLPAEVDWRKEGYVTPIKDQGHCGSCWSFSTTGALEGQHFRKTGKLVSLSEQN 186

Query: 190 VIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
           +IDC+ + GN GC+GG       ++  N    + E  YP    D  C+ K     G    
Sbjct: 187 LIDCSTSYGNNGCNGGVMDYAFQYIKDNDGD-DTEDSYPYEAADGPCRFKKEYV-GATDT 244

Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINHAVQ 305
            YT D     E  +   +A  GPV  A++A   ++Q Y  GV  +  CD     ++H V 
Sbjct: 245 GYT-DLPKGDEEKMKEAVAMVGPVSVAIDASHTSFQMYQSGVYDEVECD--PEGLDHGVL 301

Query: 306 IVGY 309
           +VGY
Sbjct: 302 VVGY 305


>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
          Length = 329

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 87/309 (28%), Positives = 150/309 (48%), Gaps = 29/309 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKS 63
           ++ +  +V L A+C  +       P       +F+ + +   KSYS  E   R+  + ++
Sbjct: 1   MRAITILVLLAAICVASTLATTHDP----LTGVFAEWMRDNSKSYSNEEFVFRWNVWREN 56

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
             +IEE N+   S +++   + +F DL+  EF      + + K +   +  H        
Sbjct: 57  QQLIEEHNR---SNKTSFLAMNKFGDLTNAEF------NKLFKGLAFDYSFH-------A 100

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
            K +    +  P G+    DWR+ G +  V+NQ  CG+CW+FST  + E  + LK G L+
Sbjct: 101 NKAAAEKAVPAP-GLSADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLT 159

Query: 184 LLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
            LS Q +IDC+G+ GN GC+GG      +++ +N   ++ E+ YP       C+    + 
Sbjct: 160 SLSEQNLIDCSGSYGNNGCNGGLMDYAFEYI-INNKGIDTEASYPYQTAQYTCQYNPANS 218

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANI 300
            G  + SYT D     E+++L  +AT  P   A++A   ++Q+Y GGV  Y    S   +
Sbjct: 219 GG-SLTSYT-DVSSGDENALLNAVATE-PTSVAIDASHNSFQFYSGGVY-YESACSSTQL 274

Query: 301 NHAVQIVGY 309
           +H V  VG+
Sbjct: 275 DHGVLAVGW 283


>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
 gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
 gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
 gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
 gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|9931986|ref|NP_064680.1| cathepsin R precursor [Mus musculus]
 gi|23813621|sp|Q9JIA9.1|CATR_MOUSE RecName: Full=Cathepsin R; Flags: Precursor
 gi|9623188|gb|AAF90051.1|AF245399_1 cathepsin R [Mus musculus]
 gi|12837970|dbj|BAB24023.1| unnamed protein product [Mus musculus]
 gi|12852278|dbj|BAB29345.1| unnamed protein product [Mus musculus]
 gi|16445015|gb|AAK00507.1| cathepsin R precursor [Mus musculus]
 gi|71682221|gb|AAI00339.1| Cathepsin R [Mus musculus]
 gi|148709367|gb|EDL41313.1| cathepsin R [Mus musculus]
          Length = 334

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 90/306 (29%), Positives = 147/306 (48%), Gaps = 28/306 (9%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
           + A++ + FL + V    P L+  L+  +  ++ +Y KSYS  E  ++   +E+ L +I+
Sbjct: 1   MAAVVFIAFLYLGVASGVPVLDSSLDAEWQDWKIKYNKSYSLKEEKLKRVVWEEKLKMIK 60

Query: 69  ELNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
             N+ N          + EF D ++EEF+   +  SV  H               + KR 
Sbjct: 61  LHNRENSLGKNGFTMKMNEFGDQTDEEFRKMMIEISVWTH----------REGKSIMKRE 110

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
              G  +P  +    DWR+ G +  VR Q  C ACWAF+     E+    + G L+ LSV
Sbjct: 111 --AGSILPKFV----DWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSV 164

Query: 188 QEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
           Q ++DC+   GN GC GGD      ++ ++   LE E+ YP   KD  C+    +P   K
Sbjct: 165 QNLVDCSKPQGNNGCLGGDTYNAFQYV-LHNGGLESEATYPYEGKDGPCR---YNPKNSK 220

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHA 303
            +     +L  SE  ++  +AT GP+ A ++A   +++ Y GG+  + NC  S   + H 
Sbjct: 221 AEITGFVSLPQSEDILMAAVATIGPITAGIDASHESFKNYKGGIYHEPNC--SSDTVTHG 278

Query: 304 VQIVGY 309
           V +VGY
Sbjct: 279 VLVVGY 284


>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
 gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
          Length = 345

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 96/313 (30%), Positives = 148/313 (47%), Gaps = 30/313 (9%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F++++  IE +NK      S + G+ EF+D++ +EF    L      ++  S+       
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEF----LAKFTGLNIPNSYLSPSPMS 116

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
               KK +  +   +P+ +    DWRE+G + +V++Q  CG CWAFS V + E  + +  
Sbjct: 117 STEFKKINDLSDDYMPSNL----DWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIAT 172

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
           G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ + 
Sbjct: 173 GNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQE 230

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDGS 296
            +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG    NC   
Sbjct: 231 KTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTYDGNC--- 281

Query: 297 LANINHAVQIVGY 309
              INHAV  +GY
Sbjct: 282 ADRINHAVTAIGY 294


>gi|405966497|gb|EKC31775.1| Cathepsin L1 [Crassostrea gigas]
          Length = 305

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 95/308 (30%), Positives = 151/308 (49%), Gaps = 33/308 (10%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN-FEKSLDIIE 68
           +++ I L  L        P L+ +  L+   +Q Y+K Y  ++ +   ++ +E +LD I 
Sbjct: 4   LISYIYLAALIFSSLARVPELDTEWALY---KQEYRKQYLTADEETERRDIWEANLDYIN 60

Query: 69  ELNKN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
           + N   ++   S   G+ EF+DLS EEF           H+     +  D   +      
Sbjct: 61  QHNDEFKRGEHSYTLGLNEFADLSHEEFL----------HLYGGGIRPRDSGSS-----D 105

Query: 128 ITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
             T I + T G+P + DWR+ G +G V NQ  CG+CWAF+     E     K G L +LS
Sbjct: 106 PDTDIVVDTSGLPSEVDWRKEGWVGPVGNQFACGSCWAFTATGALEGQVRNKTGKLIVLS 165

Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           VQ+++DC+   GN GC GG   A   ++ DV  +  E  + YP    +  CK   ++   
Sbjct: 166 VQQMMDCSEKWGNHGCEGGLMDAAFKYIHDVGGI--ESNASYPYKPAEEKCKFNESAVV- 222

Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANIN 301
            K+K Y    L  SE S++  +AT GP+ AA++A   ++Q Y  GV    NC  S   ++
Sbjct: 223 AKVKGYK--DLPKSEESLMVAVATVGPISAALDASHSSFQLYKSGVYDDPNC--SSGQVD 278

Query: 302 HAVQIVGY 309
           H++ +VGY
Sbjct: 279 HSLVVVGY 286


>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
          Length = 375

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 90/285 (31%), Positives = 131/285 (45%), Gaps = 37/285 (12%)

Query: 37  FSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F ++Y K YS  E  + R   F K  +++         P +A +G+T FSDLSEEEF
Sbjct: 61  FRMFMEKYGKEYSSREEYVHRLGIFAK--NMVRAAEHQALDP-TALHGVTPFSDLSEEEF 117

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
           + R     V +               H+K     T   +   G+P   DWRE G + +V+
Sbjct: 118 E-RMFTGVVGR--------------PHMKGGVAETAAALEVDGLPESFDWREKGAVTEVK 162

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNM--------GCSGGDF 206
            Q TCG+CWAFST    E  H +    L  LS Q+++DC    ++        GC GG  
Sbjct: 163 MQGTCGSCWAFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLM 222

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
                ++ +    LE ES YP   K   CK K   P+ V ++      +  +E+ I  ++
Sbjct: 223 TNAYKYL-IEAGGLEEESSYPYTGKHGECKFK---PDRVAVRVVNFTEVPINENQIAANL 278

Query: 267 ATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN--INHAVQIVGY 309
             HGP+   +NA+  Q Y+GGV   +C        INH V +VGY
Sbjct: 279 VCHGPLAVGLNAIFMQTYIGGV---SCPLICPKRWINHGVLLVGY 320


>gi|72389853|ref|XP_845221.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359929|gb|AAX80354.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801756|gb|AAZ11662.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 158/316 (50%), Gaps = 30/316 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
           V+ V   V L+A+      V +   ++E+ LE+ F++F+++Y K Y  + E   RF+ FE
Sbjct: 7   VRFVRLPVVLLAIAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E+      +   A +G+T FSD++ EEF+ R+              ++   +  
Sbjct: 67  ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +KR   T + + TG  P   DWRE G +  V++Q  CG+CWAFST+   E    +   
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGN 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q ++ C    + GC+GG      +W+ + N   +  E+ YP +  +   ++  
Sbjct: 169 PLVSLSEQMLVSC-DTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225

Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
              NG +I +   D   L   E +I   +A +GP+  AV+A ++  Y GG++  +C  + 
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGILT-SC--TS 282

Query: 298 ANINHAVQIVGYDNYS 313
             ++H V +VGY++ S
Sbjct: 283 EQLDHGVLLVGYNDNS 298


>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
          Length = 318

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 100/301 (33%), Positives = 142/301 (47%), Gaps = 33/301 (10%)

Query: 12  ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEEL 70
           A I    L + V  S   LE     F SF+ ++ KSYS + E   R   F ++L  IEE 
Sbjct: 3   AFILASLLIVAVGAS---LENVGSTFQSFKLKHSKSYSNQVEEAKRLAIFTENLRDIEEH 59

Query: 71  NKNRQSP-ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
           N    +   S    + +F+DL+ +EFK     HS  K  L           N V    + 
Sbjct: 60  NALYAAGLVSYNKSVNQFTDLTIDEFKAYLTLHS--KPTL-----------NTVPY--VR 104

Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
           TG+ +PT +    DWR  G +  V++Q  CG+CWAFS V + E  +    G L  LS Q+
Sbjct: 105 TGLQVPTTL----DWRSQGYVTGVKDQGDCGSCWAFSVVGSTEGAYYKSTGKLVSLSEQQ 160

Query: 190 VIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           +IDC  N N GC GG       +  V +  L  ES YP   +D  C R + S    K+  
Sbjct: 161 LIDCTTNVNDGCDGGYLEETFPY--VQQTGLVSESSYPYTGRDGNC-RISESDVVTKVSK 217

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLANINHAVQIVG 308
           Y    L+  E+ +L  + + GPV  A++A     Y  GV + + C  SL ++NH V +VG
Sbjct: 218 Y---VLLGGEADLLEAVGSVGPVSVAMDATYIYSYASGVYESSLC--SLYSLNHGVLVVG 272

Query: 309 Y 309
           Y
Sbjct: 273 Y 273


>gi|449512065|ref|XP_002196301.2| PREDICTED: cathepsin O-like, partial [Taeniopygia guttata]
          Length = 193

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 64/152 (42%), Positives = 86/152 (56%), Gaps = 3/152 (1%)

Query: 159 CGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKV 218
           CG CWAFS V   ES +A+K  TL  LSVQ+VIDC+ N N GC+GG   + L W++  KV
Sbjct: 1   CGGCWAFSVVGGIESAYAIKRNTLEELSVQQVIDCSYN-NYGCNGGSTVSALSWLNQTKV 59

Query: 219 VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA 278
            L  +SEY    +   C     S  GV I  +        E  ++  + + GP+   V+A
Sbjct: 60  KLVRDSEYTFKAQTGLCHYFERSDFGVSITGFASYDFSGQEEEMMRMLVSWGPLAVTVDA 119

Query: 279 LTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           ++WQ YLGG+IQY+C    A  NHAV I G+D
Sbjct: 120 VSWQDYLGGIIQYHCSSGRA--NHAVLITGFD 149


>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
          Length = 471

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 85/271 (31%), Positives = 125/271 (46%), Gaps = 26/271 (9%)

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
            EH+ RF  F  +L  ++  N         R G+  F+DL+ EEF+   L   V +    
Sbjct: 68  GEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNEEFRATFLGAKVAERSRA 127

Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
           +  ++    H+ V++            +P   DWRE G +  V+NQ  CG+CWAFS V T
Sbjct: 128 AGERYR---HDGVEE------------LPESVDWREKGAVAPVKNQGQCGSCWAFSAVST 172

Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
            ES++ L  G +  LS QE+++C+ NG N GC+GG      D++ +    ++ E +YP  
Sbjct: 173 VESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFI-IKNGGIDTEDDYPYK 231

Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
             D  C     +   V I  +  D     E S+   +A H PV  A+ A    +Q Y  G
Sbjct: 232 AVDGKCDINRENAKVVSIDGFE-DVPQNDEKSLQKAVA-HQPVSVAIEAGGREFQLYHSG 289

Query: 288 VIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
           V    C  SL   +H V  VGY  DN    W
Sbjct: 290 VFSGRCGTSL---DHGVVAVGYGTDNGKDYW 317


>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
          Length = 345

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 98/314 (31%), Positives = 151/314 (48%), Gaps = 32/314 (10%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F++++  IE +NK      S + G+ EF+D++ +EF    L      ++  S+       
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEF----LAKFTGLNIPNSYLSPSPMS 116

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
               KK +  +   +P+ +    DWRE+G + +V++Q  CG CWAFS V + E  + +  
Sbjct: 117 STEFKKINDLSDDDMPSNL----DWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIAT 172

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
           G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ + 
Sbjct: 173 GKLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQE 230

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDGS 296
            +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DGS
Sbjct: 231 KTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DGS 280

Query: 297 LAN-INHAVQIVGY 309
            A+ INHAV  +GY
Sbjct: 281 CADRINHAVTAIGY 294


>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
          Length = 333

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 91/303 (30%), Positives = 146/303 (48%), Gaps = 32/303 (10%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
           L ALC   + +  + P L+Q L+  +  ++  + + Y  +E   R   +EK+L +IE  N
Sbjct: 7   LAALC---LGIVSALPKLDQTLDAQWDQWKAAHGRLYGLNEEGWRRAVWEKNLRMIELHN 63

Query: 72  KN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
               Q   S   G+  F D++ EEF+            +M+  +H  H    + +  +  
Sbjct: 64  GEYSQGRHSFTLGMNHFGDMTNEEFRQ-----------VMNGFQHQKHKTGKMYQEPLL- 111

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
            + +P  +    DWRE G + +V+NQ  CG+CWAFS   + E     K G L  LS Q +
Sbjct: 112 -LQLPKSV----DWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNL 166

Query: 191 IDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           +DC+   GN GC+GG       ++  NK  LE E  YP + KD  CK K      +   +
Sbjct: 167 VDCSRPQGNQGCNGGLMDFAFQYVKDNK-GLEAEKSYPYVGKDGECKYKPE----LSAAN 221

Query: 250 YTCDTLIPSESSILTD-IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQI 306
            T    +P    ++   +AT GP+  A++A   ++Q+Y  G I Y+   S  ++NH V +
Sbjct: 222 DTGFVDVPQREKVVQKALATVGPLSVAIDAGLQSFQFYKEG-IYYDPGCSSRDLNHGVLL 280

Query: 307 VGY 309
           VGY
Sbjct: 281 VGY 283


>gi|223049408|gb|ACM80348.1| cysteine proteinase [Solanum lycopersicum]
          Length = 368

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 86/284 (30%), Positives = 141/284 (49%), Gaps = 36/284 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F++R+ K+Y S  EH  RF  F+ +L       ++++   SA +G+T+FSD++ +EF
Sbjct: 54  FTLFKKRFGKTYASDEEHHYRFSVFKANL---RRAMRHQKLDPSAVHGVTQFSDMTPDEF 110

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
             + L   VN+ +            +   K  I     +PT  +P   DWRE G +  V+
Sbjct: 111 SQKFL--GVNRRLRFP---------SDANKAPI-----LPTEDLPSDFDWREHGAVTPVK 154

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FST    E  + L  G L  LS Q+++DC          + + GCSGG  
Sbjct: 155 NQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEKDSCDSGCSGGLM 214

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
            +  ++  +    L  E +YP    D A  +   +    K+ +++  +L   E  I  ++
Sbjct: 215 NSAFEYT-LKAGGLMREEDYPYTGTDKATCKFDNTKVAAKVANFSVVSL--DEEQIAANL 271

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             +GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 272 VKNGPLAVAINAVFMQTYVGGVSCPYICSKQL---DHGVLLVGY 312


>gi|118360450|ref|XP_001013459.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89295226|gb|EAR93214.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 320

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 86/282 (30%), Positives = 135/282 (47%), Gaps = 39/282 (13%)

Query: 36  LFSSFQQRYKKSYSKSEHD-IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           L+S+F+  Y K Y+  + +  R + F ++L II+   +N        +GIT+F DL++EE
Sbjct: 42  LWSTFKNSYNKKYADPDFEQYRIEVFTENLKIIDSNCQN--------FGITKFMDLTQEE 93

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           FK  +L     K++                   I   +   +   ++ DW   G +  V+
Sbjct: 94  FKQTYLTLKTKKYI-----------------EEIPETVFNDSNGDIEIDWTMKGAVTPVK 136

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
           +Q  CG+CW+FST    E  H L +  L  LS Q +IDC+ NGN GC+GG      D++ 
Sbjct: 137 DQGKCGSCWSFSTTGAVEGAHFLSSNELVSLSEQYLIDCSKNGNEGCNGGLMDTAFDFIA 196

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
            N +    E+ YP    D  CK   T P   KI SY     I S + +L+ +    P+  
Sbjct: 197 QNGI--PTENAYPYKALDGTCKM-TTGP--YKISSY---QNIISCNDLLSKLQKQ-PIAI 247

Query: 275 AVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRTW 316
           AV+A  +Q+Y  G+    C     N++H V +VGY +  + W
Sbjct: 248 AVDANNFQFYTKGIFS-KCG---KNLDHGVLLVGYSSKDKFW 285


>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
 gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
          Length = 333

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 87/277 (31%), Positives = 134/277 (48%), Gaps = 27/277 (9%)

Query: 37  FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEF 95
           F  + +++ ++YS  E   R++ F++++D I + N    S ES    G+T+F+DL+ EE+
Sbjct: 33  FIGWMRKHDRAYSHEEFTDRYQAFKENMDFIHKWN----SQESDTVLGLTKFADLTNEEY 88

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           K  +L   VN    +          N  +K       T P  I    DWRE G + +V++
Sbjct: 89  KKHYLGIKVNVKKNL----------NAAQKGLKFFKFTGPDSI----DWREKGAVSQVKD 134

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
           Q  CG+CW+FST    E  H +K+G +  LS Q ++DC+G  GN GC GG      +++ 
Sbjct: 135 QGQCGSCWSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYI- 193

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
           ++   +  ES YP       CK    S NG  I  Y    +   E   LT      PV  
Sbjct: 194 IDNGGIATESSYPYTAAQGRCKF-TKSMNGANIIGYK--EIPQGEEDSLTAALAKQPVSV 250

Query: 275 AVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           A++A  +++Q Y  GV       S A ++H V  VGY
Sbjct: 251 AIDASHMSFQLYSSGVYDEPACSSEA-LDHGVLAVGY 286


>gi|391328516|ref|XP_003738734.1| PREDICTED: cathepsin O-like [Metaseiulus occidentalis]
          Length = 247

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 72/184 (39%), Positives = 101/184 (54%), Gaps = 10/184 (5%)

Query: 131 GITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL--LSV 187
           G+ I T GIP   D+R    +  V+ Q  CGACWAF+ +E  E +  L+    S    SV
Sbjct: 25  GLEIATLGIPKVVDYRNVSSV--VKEQGACGACWAFAPLEAVELLSTLQGRAPSRASFSV 82

Query: 188 QEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEY-PLLLKDAACKRKATSPNGVK 246
           Q VIDC+ + + GCSGGD C  +D++  +K     E+ Y P       C+++A   + + 
Sbjct: 83  QHVIDCS-DISYGCSGGDICDAVDYLQTSKYHFVAEAAYFPYTEDKLECRKEAKYTSDIS 141

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQI 306
           I    C+     E  +L  +A  GPVIA V+A  W+ YLGG+I++NCD      NHAV I
Sbjct: 142 ITRSWCENYAGREGDLLRLVA-KGPVIATVDATVWRDYLGGIIRFNCDA--GEKNHAVVI 198

Query: 307 VGYD 310
           VGYD
Sbjct: 199 VGYD 202


>gi|29789900|gb|AAF21457.2|U56958_1 cysteine proteinase [Paragonimus westermani]
          Length = 272

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 114/234 (48%), Gaps = 26/234 (11%)

Query: 79  SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
           +ARYG+T+FSDL+ EEF  ++L   VN               N   KR   TG+      
Sbjct: 13  TARYGVTQFSDLTPEEFAAKYLSAPVN---------------NDQVKRVRPTGLK---AA 54

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
           P + DWR  G +  V NQ +CG+CWAFST    E    +K G L  LS Q+++DC    +
Sbjct: 55  PERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD 114

Query: 199 MGCSGG-DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
            GC+GG    + L+ M +    LE + +YP       C  +      +  K      L P
Sbjct: 115 -GCNGGWPASSYLEIMHMGG--LESQDDYPYAGVKEQCFMEKER---LLAKIDDSIALXP 168

Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG-SLANINHAVQIVGYD 310
           SE      +A HGP+   +NA+T QYY  G+I  +    S  ++NHAV  VGYD
Sbjct: 169 SEDDNAAYLAEHGPLSTLLNAITLQYYQSGIIHPSYXXCSPVDLNHAVLTVGYD 222


>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
          Length = 343

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 94/316 (29%), Positives = 147/316 (46%), Gaps = 39/316 (12%)

Query: 6   NVLFIVAL-IALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSY-SKSEHDIRFKNFE 61
           N L+ V+L +  C   + ++V+   L+     E    +   Y K Y +  E + R + F 
Sbjct: 5   NQLYHVSLALFFCLGLLAIQVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFT 64

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           ++L  IE  N N  + +  + GI +F+DL+ EEF             + S +K   H  +
Sbjct: 65  ENLKYIEASN-NAGNKKPYKLGINQFADLTNEEF-------------IASRNKFKGHMCS 110

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
            + +   TT     T +P   DWR+ G +  V+NQ  CG CWAFS +   E +H +  G 
Sbjct: 111 SIIR--TTTFKYENTSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGK 168

Query: 182 LSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLE-----PESEYPLLLKDAAC 235
           L  LS QE++DC  NG + GC GG    L+D  D  K +++      E+ YP    D  C
Sbjct: 169 LVSLSEQELVDCDTNGVDQGCEGG----LMD--DAFKFIIQNNGISTEAGYPYQGVDGTC 222

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
           K    S +   I  Y  D    +E+++   +A   P+  A++A    +Q+Y  GV   +C
Sbjct: 223 KANEASTSAATITGYE-DVPANNENALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSC 280

Query: 294 DGSLANINHAVQIVGY 309
              L   +H V  VGY
Sbjct: 281 GTEL---DHGVTAVGY 293


>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
          Length = 341

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 85/282 (30%), Positives = 131/282 (46%), Gaps = 30/282 (10%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
           E+  +  + + + YK +  KS+   R+K F+ ++  IE  NK     +S +  I EF+DL
Sbjct: 37  ERHEDWMAQYGRVYKDAGEKSK---RYKIFKDNVARIESFNKAMN--KSYKLSINEFADL 91

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           + EEF  R  R+    H+  +      + H                 +P   DWR+ G +
Sbjct: 92  TNEEF--RASRNRFKAHICSTEATSFKYEH--------------VXAVPSTVDWRKKGAV 135

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCAL 209
             +++Q  CG+CWAFS V   E +  L  G L  LS QE++DC  +G + GCSGG     
Sbjct: 136 TPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDA 195

Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
             +++ N   L  E+ YP    D  C RK  +    KI  Y  D    +E ++   +A H
Sbjct: 196 FKFIEQNH-GLTTEANYPYAGTDGTCNRKKAAHPAAKINGYE-DVPANNEKALQKAVA-H 252

Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            P+  A++A    +Q+Y  GV    C   L   +H V  VGY
Sbjct: 253 QPIAVAIDAGGFEFQFYSSGVFTGQCGTEL---DHGVSAVGY 291


>gi|348531523|ref|XP_003453258.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 341

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 78/241 (32%), Positives = 122/241 (50%), Gaps = 24/241 (9%)

Query: 75  QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI 134
           Q  +S R G+T+F+D+  EE+K       V++  L        H  N    R  +T   +
Sbjct: 73  QGLKSYRLGMTQFADMENEEYK-----RLVSQGCL--------HSFNSSLPRRGSTFFRL 119

Query: 135 PTG--IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVID 192
           P G  +P   DWR+ G +  V+NQ  CG+CWAFS   + E  H  K G L  LS Q+++D
Sbjct: 120 PKGTVLPDTVDWRDKGYVTNVQNQMDCGSCWAFSATGSLEGQHFRKTGKLVSLSKQQLVD 179

Query: 193 CAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
           C+G  GN GC+GG   +   ++  N  + + E  YP   +D  C+    S  G     Y 
Sbjct: 180 CSGEFGNEGCNGGLMDSAFQYIQANGGI-DTEESYPYEAEDGKCRYNPKS-TGATCTGYV 237

Query: 252 CDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHAVQIVG 308
            D    +E ++   +AT GP+  A++A   ++Q+Y  GV  + +C  ++  ++HAV  VG
Sbjct: 238 -DVQPANEETLKEAVATIGPISVAIDAFHPSFQFYESGVYDEPDCSSTM--LDHAVLAVG 294

Query: 309 Y 309
           Y
Sbjct: 295 Y 295


>gi|42516556|gb|AAS17989.1| cysteine proteinase CP2 [Paragonimus westermani]
          Length = 272

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 115/234 (49%), Gaps = 26/234 (11%)

Query: 79  SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
           +ARYG+T+FSDL+ EEF  ++L   VN               N   KR   TG+      
Sbjct: 13  TARYGVTQFSDLTPEEFAAKYLSAPVN---------------NDQVKRVRPTGLK---AA 54

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
           P + DWR  G +  V NQ +CG+CWAFST    E    +K G L  LS Q+++DC    +
Sbjct: 55  PERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVDCDRAAD 114

Query: 199 MGCSGG-DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
            GC+GG    + L+ M +    LE + +YP       C  +      +  K      L P
Sbjct: 115 -GCNGGWPASSYLEIMHMGG--LESQDDYPYAGVKEQCFMEKER---LLAKIDDSIALGP 168

Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG-SLANINHAVQIVGYD 310
           SE      +A HGP+   +NA+T QYY  G+I  + +  S  ++NHAV  VGYD
Sbjct: 169 SEDDNAAYLAEHGPLSTLLNAITLQYYQSGIIHPSYEECSPVDLNHAVLTVGYD 222


>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 361

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 92/309 (29%), Positives = 139/309 (44%), Gaps = 36/309 (11%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
           +  L+ + FLA  V           E    +  RY K Y    E + RF+ F+++++ IE
Sbjct: 30  LAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIE 89

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
             N    + +  +  I +F+DL+ EEF     R+    H+  S  +     + +V     
Sbjct: 90  AFNN--AANKRYKLAINQFADLTNEEFIAP--RNRFKGHMCSSIIRTTTFKYENV----- 140

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                  T +P   DWR+ G +  +++Q  CG CWAFS V   E +HAL +G L  LS Q
Sbjct: 141 -------TAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQ 193

Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKATSP 242
           E++DC   G + GC GG    L+D  D  K V     L  E+ YP    D  C     + 
Sbjct: 194 ELVDCDTKGVDQGCEGG----LMD--DAFKFVIQNHGLNTEANYPYKGVDGKCNANEAAN 247

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
           + V I  Y  D    +E ++   +A   PV  A++A    +Q+Y  GV   +C   L   
Sbjct: 248 DVVTITGYE-DVPANNEKALQKAVANQ-PVSVAIDASGSDFQFYKSGVFTGSCGTEL--- 302

Query: 301 NHAVQIVGY 309
           +H V  VGY
Sbjct: 303 DHGVTAVGY 311


>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
 gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
          Length = 471

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 84/286 (29%), Positives = 128/286 (44%), Gaps = 31/286 (10%)

Query: 32  QKLELFSSFQQRYKKSYSKS--EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           Q   ++  +  R+ K+ S +  EHD RF+ F  +L  ++  N  R      R GI  F+D
Sbjct: 47  QVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNA-RAGARGYRLGINRFAD 105

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+  EF+  +L          +      + H+ V+             +P   DWR+ G 
Sbjct: 106 LTNAEFRAAYLSAGARNGTATAATGER-YRHDGVEA------------LPEFVDWRQKGA 152

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGG---D 205
           +  V+NQ  CG+CWAFS V   E ++ +  G L  LS QE++DC+ NG N GC GG   D
Sbjct: 153 VAPVKNQGQCGSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDD 212

Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
             A +    V    ++ + +YP   +D  C     S + V I  +  + +  ++   L  
Sbjct: 213 AFAFI----VGNGGIDTDKDYPYTARDGKCDVAKRSRHVVSIDGF--EGVPRNDEKSLQK 266

Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
              H PV  A+ A    +Q Y  GV    C  SL   +H V  VGY
Sbjct: 267 AVAHQPVAVAIEAGGREFQLYQSGVFTGRCGTSL---DHGVVAVGY 309


>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  V  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITVFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
 gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
          Length = 343

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 92/299 (30%), Positives = 138/299 (46%), Gaps = 36/299 (12%)

Query: 21  IPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPE 78
             ++V+   L+  + E    +  +Y K Y  S E + RFK F ++++ IE  NK   + +
Sbjct: 21  FAIQVTSRTLQDDMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNN-K 79

Query: 79  SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
               G+ +F+DL+ +EF +             S +K   H  + +  R+ T      + I
Sbjct: 80  LYTLGVNQFADLTNDEFTS-------------SRNKFKGHMCSSIT-RTSTFKYENASAI 125

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG- 197
           P   DWR+ G +  V+NQ  CG CWAFS V   E +H L  G L  LS QE++DC   G 
Sbjct: 126 PSSVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGV 185

Query: 198 NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
           + GC GG    L+D  D  K +     L  E+ YP    D  C     S N V I  Y  
Sbjct: 186 DQGCEGG----LMD--DAFKFIIQNHGLNTEANYPYQGVDGTCNANKGSINAVTITGYE- 238

Query: 253 DTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           D    +E ++   +A   P+  A++A    +Q+Y  GV   +C   L   +H V  VGY
Sbjct: 239 DVPTNNEQALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSCGTEL---DHGVTAVGY 293


>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 83/283 (29%), Positives = 133/283 (46%), Gaps = 38/283 (13%)

Query: 37  FSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESAR------YGITEFSD 89
           F  F+  ++K Y   E + R F  F  +L  I      R + E+AR       G+ +F+D
Sbjct: 20  FDDFKTTFEKQYESPEEEARRFAIFADNLAFIA-----RHNAEAARGLHTHTVGVNQFAD 74

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ EE++  +LR    +  L+   +                 +  P    V  DWR+ G 
Sbjct: 75  LTNEEYRQLYLRPYPTE--LLGRERQE-------------VWLDGPNAGSV--DWRQKGA 117

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCA 208
           +  ++NQ  CG+CW+FST  + E  HA+  G L  LS Q+++DC+G+ GN GC+GG    
Sbjct: 118 VTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDN 177

Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
              ++ ++   L+ E +YP   +D  C +   S + V I  Y  D    +E  +   +  
Sbjct: 178 AFKYI-ISNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGYK-DVPQNNEDQLAAAV-E 234

Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            GPV  A+ A   ++Q Y  GV    C     N++H V +VGY
Sbjct: 235 KGPVSVAIEADQQSFQMYSSGVFSGPCG---TNLDHGVLVVGY 274


>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|156389068|ref|XP_001634814.1| predicted protein [Nematostella vectensis]
 gi|156221901|gb|EDO42751.1| predicted protein [Nematostella vectensis]
          Length = 276

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 86/258 (33%), Positives = 128/258 (49%), Gaps = 33/258 (12%)

Query: 57  FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
            K FE ++    ++ K      +A+YG T FSDLSEEEF+ + +     K +    ++  
Sbjct: 1   MKIFESNMRKAAKMQKMDSG--TAQYGPTIFSDLSEEEFRKQKMMPGWGKPL----YEMK 54

Query: 117 DHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
           D                IP G IP   DWR+ G++  V+NQ +CG+CWAFST    E  +
Sbjct: 55  DAE--------------IPLGDIPESVDWRDKGVVTPVKNQGSCGSCWAFSTTGNIEGQY 100

Query: 176 ALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAA 234
           A+K G L  LS QE++DC    + GC GG       +  + K+  LE ES+YP    D+ 
Sbjct: 101 AIKTGKLVSLSEQELVDCD-TIDKGCEGG--LPSNAYKQIEKLGGLESESDYPYKGADSK 157

Query: 235 CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI---QY 291
           CK        VK+   +   +   E  I   +A +GP+   +NA   Q+Y+GG+    + 
Sbjct: 158 CKFNKAE---VKVTINSSVVISKDEKEIAAWLAKNGPISIGINANAMQFYMGGIAHPWKI 214

Query: 292 NCDGSLANINHAVQIVGY 309
            C+ S  ++NH V IVGY
Sbjct: 215 FCNPS--SLNHGVLIVGY 230


>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ ++L  +  +   F +     S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENIKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGI-SSESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
 gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 151/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ ++L  +  +   F +     S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 473

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 88/308 (28%), Positives = 143/308 (46%), Gaps = 58/308 (18%)

Query: 19  LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSP 77
           LA+P K+        ++LFSS+  ++ K Y   E  + R++ F+++L  I E N+   S 
Sbjct: 38  LALPYKL--------VDLFSSWSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNRRNGS- 88

Query: 78  ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT- 136
                G+ +F+D++ EEFK+ +L                           + TG+  P  
Sbjct: 89  --YWLGLNQFADVAHEEFKSTYL--------------------------GLKTGMDGPAR 120

Query: 137 -----------GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
                       +P   DWR+ G +  V+NQ  CG+CWAFSTV   E ++ +  G L  L
Sbjct: 121 APTAFRYENSVNLPWSVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIATGKLESL 180

Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
           S QE++DC    + GC GG F        +  + +  + +YP L+++  CK K      V
Sbjct: 181 SEQELMDCDTTFDHGCGGG-FMDFAFAYIMGNLGIHTDDDYPYLMEEGYCKEKQPQSKVV 239

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHA 303
            I  Y  D    SE S+L  +A H P+   + A +  +Q+Y  GV + +C      ++HA
Sbjct: 240 TISGYE-DVPENSEVSLLKALA-HQPISVGIAAGSKDFQFYKRGVFEGSCG---TELDHA 294

Query: 304 VQIVGYDN 311
           +  VGY +
Sbjct: 295 LTAVGYGS 302


>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 324

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 87/280 (31%), Positives = 137/280 (48%), Gaps = 30/280 (10%)

Query: 37  FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F S++  +  SY+   E   R   +  +LD IE+ N    S + A   + +F+DL+  EF
Sbjct: 22  FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHSYKLA---VNKFADLTYPEF 78

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
             ++L    +               N  K  + +T +     +P   DWR AGI+  +++
Sbjct: 79  AAKYLGLRFDAT-------------NATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKD 125

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMD 214
           Q  CG+CW+FST  + E  HA K G L  LS Q ++DC +  GN GC+GG       ++ 
Sbjct: 126 QGQCGSCWSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYII 185

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
            N  + + ES YP   +D  C+  + +  G  + SY  D    SES +   +AT GP+  
Sbjct: 186 SNNGI-DTESSYPYTAQDGTCQFNSANV-GATVASYQ-DIASGSESDLQNAVATVGPISV 242

Query: 275 AVNAL--TWQYYLGGVIQYN---CDGSLANINHAVQIVGY 309
           A++A   ++Q+Y  GV  YN   C  S + ++H V  VGY
Sbjct: 243 AIDASQPSFQFYSSGV--YNEPAC--SSSQLDHGVLAVGY 278


>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
 gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
          Length = 365

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 87/308 (28%), Positives = 150/308 (48%), Gaps = 33/308 (10%)

Query: 11  VALIALCFLAIPVK-VSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
           +AL++  FL+I    +S+ +  +  E++  +  ++ K+Y+   E + RF+ F+++L  I+
Sbjct: 8   LALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLKFID 67

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKR 126
           + N   ++    + G+  F+DL+ EE++  +L  R    + V+ +      +  N++ + 
Sbjct: 68  DHNSENRT---YKVGLNMFADLTNEEYRALYLGTRSPPARRVMKAKTASRRYAVNNLDR- 123

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
                      +P   DWR  G +  V+NQ +CG+CWAFST+   E ++ +  G L  LS
Sbjct: 124 -----------LPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLS 172

Query: 187 VQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
            QE++ C    N GC+GG    L+D+     ++   L+ E +YP    D  C     +  
Sbjct: 173 EQELVSCDKKYNSGCNGG----LMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAK 228

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
            V I +Y  D     E S+   +A H PV  A+ A  L  Q Y  GV    C  +L   +
Sbjct: 229 VVSIDAYE-DVPANDEESLKKAVA-HQPVSVAIEASGLALQLYQSGVFTGKCGSAL---D 283

Query: 302 HAVQIVGY 309
           H V  VGY
Sbjct: 284 HGVVAVGY 291


>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
 gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
 gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEL 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|290980288|ref|XP_002672864.1| predicted protein [Naegleria gruberi]
 gi|284086444|gb|EFC40120.1| predicted protein [Naegleria gruberi]
          Length = 356

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 97/324 (29%), Positives = 149/324 (45%), Gaps = 42/324 (12%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNL---EQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEK 62
           ++ +V L+A   LAI       N     +  +LF+ F++++ K Y +K   D R++ F++
Sbjct: 4   LILVVLLVASFILAIEAAKGPFNALPESEMQQLFTQFRRKHVKLYGTKQVQDRRYQIFKQ 63

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV---NKHVLMSHHKHHDHH 119
           +   +E         E    G+T FSDL+ +EFK+  L  S        L+S  + +  +
Sbjct: 64  N---VERARFENYLTERDNMGVTRFSDLTPDEFKSMFLMKSYTPKQARELLSGMRQYPAN 120

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
                K+         +  P + DWRE   +  V++Q  CG+CW FST    E M+A K 
Sbjct: 121 AKLTMKQV--------SDAPKEFDWREHNAVTPVKDQGNCGSCWTFSTTGNVEGMYAAKT 172

Query: 180 GTLSLLSVQEVIDCAGN---------GNMGCSGGDFCALLDWMDVNKVV----LEPESEY 226
           G L  LS Q+++DC  N          N GC+GG     L W     ++    L  E  Y
Sbjct: 173 GKLISLSEQQLVDCDHNCVVWEGEKTCNAGCNGG-----LMWSSFEHIIKTGGLVTEESY 227

Query: 227 PLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLG 286
           P    D  C R   S   VKI ++T   +  +E  +   +A +GP+  A+NA   QYY  
Sbjct: 228 PYEAVDNRC-RFNVSNAVVKISNWT--FVSSNEDEMAAWLANNGPIAIAINADYLQYYRK 284

Query: 287 GVIQ-YNCDGSLANINHAVQIVGY 309
           G++    CD     +NH V IVGY
Sbjct: 285 GILNPSRCDPE--ELNHGVLIVGY 306


>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 93/323 (28%), Positives = 143/323 (44%), Gaps = 46/323 (14%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNL------EQKLELFSSFQQRYKKSYSKSEHD 54
           M  V    +I   +     A   + +  NL      E+  +  + + + YK +  KS+  
Sbjct: 1   MASVNQYQYICLALLFFLAAWASQATARNLLEASMYERHEDWMAQYGRVYKDADEKSK-- 58

Query: 55  IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
            R+K F+ ++  IE  NK     +S +  I EF+DL+ EEF  R  R+    H+  +   
Sbjct: 59  -RYKIFKDNVARIESFNKAMD--KSYKLSINEFADLTNEEF--RASRNRFKAHICSTEAT 113

Query: 115 HHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
              + H                 +P   DWR+ G +  +++Q  CG+CWAFS V   E +
Sbjct: 114 SFKYEH--------------VAAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGI 159

Query: 175 HALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPL 228
             L  G L  LS QE++DC  +G + GC+GG    L+D  D  K +     L  E+ YP 
Sbjct: 160 TQLSTGKLISLSEQELVDCDTSGEDQGCNGG----LMD--DAFKFIEQNHGLATEANYPY 213

Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLG 286
              D  C RK  +    KI  Y  D    +E ++   +A H P+  A++A    +Q+Y  
Sbjct: 214 AGTDGTCNRKKAAHPAAKINGYE-DVPANNEKALQKAVA-HQPIAVAIDAGGFEFQFYSS 271

Query: 287 GVIQYNCDGSLANINHAVQIVGY 309
           GV    C   L   +H V  VGY
Sbjct: 272 GVFTGQCGTEL---DHGVAAVGY 291


>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
          Length = 341

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 90/315 (28%), Positives = 139/315 (44%), Gaps = 30/315 (9%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSYSKS-EHDIRF 57
           M  V    +I   +     A   + +  NL +    E    +  +Y + Y  + E   R+
Sbjct: 1   MASVNQYQYICLALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRY 60

Query: 58  KNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
           K F+ ++  IE  NK     +S +  I EF+DL+ EEF  R  R+    H+  +      
Sbjct: 61  KIFKDNVARIESFNKAMD--KSYKLSINEFADLTNEEF--RASRNRFKAHICSTEATSFK 116

Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
           + +               T +P   DWR+ G +  +++Q  CG+CWAFS V   E +  L
Sbjct: 117 YEN--------------VTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQL 162

Query: 178 KNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
             G L  LS QE++DC  +G + GCSGG       +++ N   L  E+ YP    D  C 
Sbjct: 163 STGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNH-GLTTEANYPYAGTDGTCN 221

Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCD 294
           RK  +    KI  Y  D    +E ++   +A H P+  A++A    +Q+Y  GV    C 
Sbjct: 222 RKKAAHPAAKINGYE-DVPANNEKALQKAVA-HQPIAVAIDAGGSEFQFYSSGVFTGQCG 279

Query: 295 GSLANINHAVQIVGY 309
             L   +H V  VGY
Sbjct: 280 TEL---DHGVSAVGY 291


>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
 gi|255636658|gb|ACU18666.1| unknown [Glycine max]
          Length = 367

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 79/284 (27%), Positives = 134/284 (47%), Gaps = 29/284 (10%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNK-NRQSPESARYGITEFS 88
           E+ + ++  +  ++ K Y+   E + RF+ F+ +L+ IEE N  NR    + + G+  FS
Sbjct: 46  EEVMSIYEEWLVKHGKVYNAVEEKEKRFQIFKDNLNFIEEHNAVNR----TYKVGLNRFS 101

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
           DLS EE+++++L   ++   +M+                      +   +P   DWR+ G
Sbjct: 102 DLSNEEYRSKYLGTKIDPSRMMARPSRRYSPR-------------VADNLPESVDWRKEG 148

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
            + +V+NQ  C  CWAFS +   E ++ +  G L+ LS QE++DC    N GCSGG    
Sbjct: 149 AVVRVKNQSECEGCWAFSAIAAVEGINKIVTGNLTALSEQELLDCDRTVNAGCSGGLVDY 208

Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI-LTDIA 267
             +++ +N   ++ E +YP    D  C +   +   V I  Y     +P+   + L    
Sbjct: 209 AFEFI-INNGGIDTEEDYPFQGADGICDQYKINARAVTIDGY---ERVPAYDELALKKAV 264

Query: 268 THGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            + PV  A+ A    +Q Y  G+    C  S   I+H V  VGY
Sbjct: 265 ANQPVSVAIEAYGKEFQLYESGIFTGTCGTS---IDHGVTAVGY 305


>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
 gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
          Length = 360

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 83/315 (26%), Positives = 145/315 (46%), Gaps = 29/315 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKK-------SYSKSEHDIR 56
            K     +AL+AL FL+I   +  P  E+ L    S    Y+K       +    E + R
Sbjct: 2   AKPKFIALALVALSFLSIAQSI--PFTEKDLASEDSLWNLYEKWRTHHTVARDLDEKNRR 59

Query: 57  FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
           F  F++++  I E N+ + +P   +  + +F D++ +EF++++    +       HH+  
Sbjct: 60  FNVFKENVKFIHEFNQKKDAP--YKLALNKFGDMTNQEFRSKYAGSKIQ------HHRSQ 111

Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
                +          ++P       DWR  G +  V++Q  CG+CWAFST+ + E ++ 
Sbjct: 112 RGIQKNTGSFMYENVGSLPA---ASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQ 168

Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
           +K G L  LS QE++DC  + N GC+GG      +++  N +    E  YP   +D  C 
Sbjct: 169 IKTGELVSLSEQELVDCDTSYNEGCNGGLMDYAFEFIQKNGITT--EDSYPYAEQDGTCA 226

Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCD 294
               +   V I  +  D    +E++++  +A   P+  ++ A    +Q+Y  GV    C 
Sbjct: 227 SNLLNSPVVSIDGHQ-DVPANNENALMQAVANQ-PISVSIEASGYGFQFYSEGVFTGRCG 284

Query: 295 GSLANINHAVQIVGY 309
             L   +H V IVGY
Sbjct: 285 TEL---DHGVAIVGY 296


>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
 gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
           Precursor
 gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
 gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
 gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
 gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
          Length = 368

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 86/297 (28%), Positives = 143/297 (48%), Gaps = 34/297 (11%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
           V  ++P +    + FS F++++ K Y S  EHD RF  F+ +L       ++++   SA 
Sbjct: 37  VGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANL---RRARRHQKLDPSAT 93

Query: 82  YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
           +G+T+FSDL+  EF+ +HL        + S  K        + K +    I     +P  
Sbjct: 94  HGVTQFSDLTRSEFRKKHLG-------VRSGFK--------LPKDANKAPILPTENLPED 138

Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-------- 193
            DWR+ G +  V+NQ +CG+CW+FS     E  + L  G L  LS Q+++DC        
Sbjct: 139 FDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198

Query: 194 AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
           A + + GC+GG   +  ++  +    L  E +YP   KD    +   S     + +++  
Sbjct: 199 ADSCDSGCNGGLMNSAFEYT-LKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVI 257

Query: 254 TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           ++   E  I  ++  +GP+  A+NA   Q Y+GGV   Y C      +NH V +VGY
Sbjct: 258 SI--DEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYIC---TRRLNHGVLLVGY 309


>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 95/312 (30%), Positives = 141/312 (45%), Gaps = 29/312 (9%)

Query: 5   KNVLF---IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNF 60
           KN L+   +  L  + FLA  V           E  + +  RY K Y    E + RF+ F
Sbjct: 4   KNQLYHISLALLFCMGFLAFQVTCRTLQDASMYERHAQWMARYAKVYKDPQEREKRFRIF 63

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           +++++ IE  N      +S +  I +F+DL+ EEF     R+    H+  S  +     +
Sbjct: 64  KENVNYIETFNS--ADNKSYKLDINQFADLTNEEFIAP--RNRFKGHMCSSITRTTTFKY 119

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
            +V            T IP   DWR+ G +  +++Q  CG CWAFS V   E +HAL  G
Sbjct: 120 ENV------------TVIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNAG 167

Query: 181 TLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS QEV+DC   G + GC+GG       ++  N   L  E  YP    D  C  KA
Sbjct: 168 KLISLSEQEVVDCDTKGQDQGCAGGFMDGAFKFIIQNH-GLNTEPNYPYKAADGKCNAKA 226

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSL 297
            + +   I  Y  D  + +E ++   +A   PV  A++A    +Q+Y  GV   +C   L
Sbjct: 227 AANHAATITGYE-DVPVNNEKALQKAVANQ-PVSVAIDASGSDFQFYKSGVFTGSCGTEL 284

Query: 298 ANINHAVQIVGY 309
              +H V  VGY
Sbjct: 285 ---DHGVTAVGY 293


>gi|118397782|ref|XP_001031222.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89285547|gb|EAR83559.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 331

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 88/312 (28%), Positives = 147/312 (47%), Gaps = 31/312 (9%)

Query: 6   NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSL 64
           N   ++A+I   F+       + +L + L+ ++ F + Y + Y +++E D R   F ++ 
Sbjct: 2   NTKLLLAIIFSAFIC-SAYADQVSLVEALQAYNKFTRNYPRIYLNEAESDYRLAIFLENY 60

Query: 65  DIIEELNKNRQSPESA-RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
             I++ N N   PE+  + G+  FSD++++EF  + L+   N ++L        +  N+V
Sbjct: 61  QKIQDHNNN---PENTYQIGVNRFSDMTQQEFSQKILQ---NPNIL-------SNGKNYV 107

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
           +K+  +     P       DWR  G++  V+NQ  CG+CWAFS     ES +A+ N  L 
Sbjct: 108 QKQQASVNDVQPA---TSIDWRTKGVVTPVKNQGECGSCWAFSATAAMESYNAIHNKVLL 164

Query: 184 LLSVQEVIDCAGNGN-----MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
             S QE +DC    N      GC GG     + +  +  V  + E+EYP +    +C   
Sbjct: 165 RFSEQEFVDCTTEKNGGFYSFGCEGGVPGEAIRYASLYGV--KTEAEYPYVGIQGSCNTT 222

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
            ++    K  SY     +P  +  L     + PV  +++A     Y+ GV  Y+C     
Sbjct: 223 NSTTTNFKPVSYYS---LPETTEALKVALNNAPVSVSIDATLLGDYVSGV--YDCKNQTI 277

Query: 299 NINHAVQIVGYD 310
            INHAV  VGYD
Sbjct: 278 EINHAVLAVGYD 289


>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
          Length = 336

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 93/310 (30%), Positives = 148/310 (47%), Gaps = 32/310 (10%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
           +F V ++ALC  A    +S P+L+ +L E ++ ++  + K Y + E   R   +EK+L  
Sbjct: 1   MFPVVVLALCVTAA---LSAPSLDPQLDEHWNLWKDWHSKKYHEKEEGWRRMVWEKNLKK 57

Query: 67  IEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
           IE  N ++     +   G+  F D++ EEF     R  +N + L S             +
Sbjct: 58  IELHNLEHSMGKHTYSLGMNHFGDMTHEEF-----RQIMNGYKLKS-------------Q 99

Query: 126 RSITTGITIPTGI---PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
           R +   + +       P   DWR+ G +  V++Q  CG+CWAFST    E  H  K GTL
Sbjct: 100 RKLRGSLFMEPNFLEAPRSVDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTL 159

Query: 183 SLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
             LS Q ++DC+   GN GC+GG       ++  N   L+ E  YP L  D        S
Sbjct: 160 VSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNG-GLDSEESYPYLGTDEGPCHYDPS 218

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
            N      +  D    SE +++  +A+ GPV  A++A   ++Q+Y  G I Y+ + S   
Sbjct: 219 YNSANDTGFV-DVPSGSERALMKAVASVGPVSVAIDAGHESFQFYHSG-IYYDKECSSEE 276

Query: 300 INHAVQIVGY 309
           ++H V +VGY
Sbjct: 277 LDHGVLVVGY 286


>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 469

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 86/293 (29%), Positives = 139/293 (47%), Gaps = 31/293 (10%)

Query: 24  KVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNK-NRQSPESAR 81
           ++ K    + + ++ ++  ++ KSY+   E + RF+ F+ +L  IEE N  NR    + +
Sbjct: 41  RLEKRTDAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVNR----TYK 96

Query: 82  YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
            G+  F+DL+ EE+++R+L         +   +  D +       S   G  +P  +   
Sbjct: 97  VGLNRFADLTNEEYRSRYLGRRDETRRGLRASRVSDRY-------SFRAGEDLPESV--- 146

Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGC 201
            DWRE G +  V++Q  CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC
Sbjct: 147 -DWREKGAVVPVKDQGNCGSCWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGC 205

Query: 202 SGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
           +GG    L+D+     +N   ++ E +YP    D  C     +   V I  Y  D     
Sbjct: 206 NGG----LMDYAFEFIINNGGIDSEEDYPYRAADTTCDPNRKNARVVSIDGYE-DVPQND 260

Query: 259 ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           E S+   +A   PV  A+ A    +Q Y  GV    C   L   +H V  VGY
Sbjct: 261 ERSLKKAVANQ-PVSVAIEAGGRAFQLYQSGVFTGQCGTQL---DHGVVAVGY 309


>gi|354504282|ref|XP_003514206.1| PREDICTED: cathepsin J-like [Cricetulus griseus]
 gi|344250851|gb|EGW06955.1| Cathepsin J [Cricetulus griseus]
          Length = 334

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 89/305 (29%), Positives = 148/305 (48%), Gaps = 32/305 (10%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           V L  LCF    V ++ P L+  L+  +  ++++Y+KSYS+ E   +   +EK++ +I  
Sbjct: 5   VFLTILCF---GVALAAPVLDSSLDAEWQQWKKKYEKSYSQEEEVWKRAVWEKNMQMIRT 61

Query: 70  LN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            N ++ Q        +  F D++ EE++T      V   V               K +S+
Sbjct: 62  HNGEDGQGKHGFTVEMNAFGDMTGEEYRTFLTDIPVPAAV---------------KVKSV 106

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
              +     +P  +DW + G +  VR Q  CG+CWAF+ +   E     + G L+ LSVQ
Sbjct: 107 QNPLL--NDLPKSEDWTKKGFVTPVRKQGQCGSCWAFAAIGAIEGQMFWRTGNLTTLSVQ 164

Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
            ++DC+   GN GC  GD  +   ++ ++   LE E  YP   KD  C+    +PN  + 
Sbjct: 165 NLLDCSKPQGNNGCVRGDAYSAYQYV-LHNGGLEAEETYPYEAKDGPCRY---NPNNSRA 220

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHAV 304
                 +L   E  +L  ++  GPV AA++A   ++++Y GG+  + NC   L   NHAV
Sbjct: 221 YITEVVSLPAHEDYLLVAVSMIGPVAAAIDASHDSFRFYRGGIYHEPNCSSYL--TNHAV 278

Query: 305 QIVGY 309
            +VGY
Sbjct: 279 LVVGY 283


>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 IINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 91/311 (29%), Positives = 140/311 (45%), Gaps = 31/311 (9%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQK--------LELFSSFQQRYKKSYSKSEHDIRFKNF 60
           FIV  +ALC L +       +  +K         EL+  ++  +  + S  E   RF  F
Sbjct: 4   FIV--LALCMLMVLETTKSLDFHEKDVESEDSLWELYERWKSHHTIARSLEEKAKRFNVF 61

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           + ++  I E NK   S    +  + +F D++ EEF+  +   ++       HH+      
Sbjct: 62  KHNVKHIHETNKKENS---YKLKLNKFGDMTSEEFRRTYAGSNI------KHHRMFQGER 112

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
              K        T+PT +    DWR+ G +  V+NQ  CG+CWAFSTV   E ++ ++  
Sbjct: 113 QTTKSFMYANVDTLPTSV----DWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTK 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L+ LS QE++DC  N N GC+GG      +++   K  L  E  YP    D  C     
Sbjct: 169 KLTSLSEQELVDCDTNKNQGCNGGLMDLAFEFIK-EKGGLTSELVYPYKASDETCDTNKE 227

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLA 298
           +   V I  +  D    SE  ++  +A H PV  A++A    +Q+Y  GV    C   L 
Sbjct: 228 NAPVVSIDGHE-DVPKNSEVDLMKAVA-HQPVSVAIDAGGSDFQFYSEGVFTGRCGTEL- 284

Query: 299 NINHAVQIVGY 309
             NH V +VGY
Sbjct: 285 --NHGVAVVGY 293


>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
          Length = 331

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 94/304 (30%), Positives = 140/304 (46%), Gaps = 34/304 (11%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
           L ALC   + +  + P L Q L+  +S ++  + K Y ++E   R   +EK+L +I++ N
Sbjct: 7   LAALC---LGIVSAAPKLYQSLDARWSQWKAAHGKLYDENEEGWRRAVWEKNLKVIKQHN 63

Query: 72  KN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           +   Q   S    +  F DL+ EEFK            +M+  K       +V       
Sbjct: 64  QEYSQGKHSFTMAMNAFGDLTNEEFKQ-----------VMNGLKSQKRKEGNV------- 105

Query: 131 GITIP--TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
               P     P   DWR+ G +  V+NQ  CG+CWAFS     E     K   L  LS Q
Sbjct: 106 -FQAPPFAETPSSVDWRKKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTKRLVSLSEQ 164

Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
            ++DC+   GN GCSGG       ++  N   L+ E  YP   +D +CK K   P     
Sbjct: 165 NLVDCSQAEGNEGCSGGLMDYAFQYVKDNG-GLDSEESYPYRAQDESCKYK---PEQSAA 220

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
                  + P E S+   +AT GP+ AA++A   T+Q+Y  G I Y+ D S  N++H + 
Sbjct: 221 NDTGFMDIHPEEESLKLAVATVGPISAAIDASLSTFQFYHKG-IYYDPDCSSENLDHGIL 279

Query: 306 IVGY 309
           +VGY
Sbjct: 280 VVGY 283


>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
          Length = 380

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 150/312 (48%), Gaps = 27/312 (8%)

Query: 2   FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
           F   ++LF   L+ L        +++   ++   ++ S+  +Y KSY S  E + RF+ F
Sbjct: 7   FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           +++L  I+E N +     S + G+ +F+DL++EEF++ +L  +   +     +++     
Sbjct: 67  KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPRF- 123

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                     G  +P+ +    DWR AG +  +++Q  CG CWAFS + T E ++ +  G
Sbjct: 124 ----------GQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169

Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS QE+IDC    N  GC+GG       ++ +N   +  E  YP   +D  C    
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNLDL 228

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
            +   V I +Y  + +  +    L    T+ PV  A++A    +++Y  G+    C  + 
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTA- 285

Query: 298 ANINHAVQIVGY 309
             I+HAV IVGY
Sbjct: 286 --IDHAVTIVGY 295


>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
          Length = 343

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 94/316 (29%), Positives = 147/316 (46%), Gaps = 39/316 (12%)

Query: 6   NVLFIVAL-IALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSY-SKSEHDIRFKNFE 61
           N L+ V+L +  C   + ++V+   L+     E    +   Y K Y +  E + R + F 
Sbjct: 5   NQLYHVSLALFFCLGLLAIQVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFT 64

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           ++L  IE  N N  + +  + GI +F+DL+ EEF             + S +K   H  +
Sbjct: 65  ENLKYIEASN-NAGNNKPYKLGINQFADLTNEEF-------------IASRNKFKGHMCS 110

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
            + +   TT     T +P   DWR+ G +  V+NQ  CG CWAFS +   E +H +  G 
Sbjct: 111 SIIR--TTTFKYENTSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGK 168

Query: 182 LSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLE-----PESEYPLLLKDAAC 235
           L  LS QE++DC  NG + GC GG    L+D  D  K +++      E+ YP    D  C
Sbjct: 169 LVSLSEQELVDCDTNGVDQGCEGG----LMD--DAFKFIIQNNGISTEAGYPYQGVDGTC 222

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
           K    S +   I  Y  D    +E+++   +A   P+  A++A    +Q+Y  GV   +C
Sbjct: 223 KANEASTSAATITGYE-DVPANNENALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSC 280

Query: 294 DGSLANINHAVQIVGY 309
              L   +H V  VGY
Sbjct: 281 GTEL---DHGVTAVGY 293


>gi|328869030|gb|EGG17408.1| cysteine protease [Dictyostelium fasciculatum]
          Length = 379

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 83/275 (30%), Positives = 139/275 (50%), Gaps = 23/275 (8%)

Query: 43  RYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRH 102
           R++KSY   +   RF  F+ ++D + E N +++ P      + +F+D++ +E++  +L  
Sbjct: 45  RFEKSYESFDFLQRFAVFKTNMDYVHEWN-SKKLPTVLE--LNQFADITNQEYRRLYLGT 101

Query: 103 SVNKHVLMSHHKHHDHHHNHVK----KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQT 158
            +N   L+     H+  +N  K      S ++G T+        DWR  G +  ++NQ  
Sbjct: 102 RINARHLLGTPGTHEMSNNFGKVFGDDDSDSSGATV--------DWRAKGAVSPIKNQGQ 153

Query: 159 CGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNK 217
           CG+CW+FST  + E  H +  G +  LS Q ++DC+G+ GNMGC GG      D++  N+
Sbjct: 154 CGSCWSFSTTGSVEGAHYISTGKMVPLSEQNLVDCSGSEGNMGCQGGLMNLAFDYIIKNE 213

Query: 218 VVLEPESEYPLLLKDA-ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
            + + E  YP   +    C    T+  G  I SY  +     ES++   +   GPV  A+
Sbjct: 214 GI-DTEDSYPYSAETGKKCLFNKTNV-GATISSYK-NITSGDESNLADAVKNAGPVSVAI 270

Query: 277 NAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A   ++Q Y  G I Y  D S  N++H V +VGY
Sbjct: 271 DASHNSFQLYSHG-IYYEKDCSSVNLDHGVLVVGY 304


>gi|341886805|gb|EGT42740.1| hypothetical protein CAEBREN_23878 [Caenorhabditis brenneri]
          Length = 396

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 155/313 (49%), Gaps = 39/313 (12%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIE 68
           +  L+A  F     K+    L+Q+   F  F +++ + + S  E+ +RF+ F+K+L   E
Sbjct: 64  MTILMASVFRIRAEKLKSFGLQQQ---FKDFNKKFGREHKSLEEYKMRFEVFQKNLREFE 120

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKT-----RHLRHSVNKHVLMSHHKHHDHHHNHV 123
           ELN   Q   S +YGI +FSD +E E K      + L  S++   L +   + +      
Sbjct: 121 ELN---QKNPSVQYGINKFSDKTESELKNLLMDKKFLDSSLSNSTLKTLSSYRN------ 171

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
             R+I   +  P  I    DWR  G +  V++Q  CG+CWAF+TV   ES +A++ GTL 
Sbjct: 172 -PRNIIKNVQRPDYI----DWRNDGKVMSVKDQGQCGSCWAFATVAAVESQYAIRKGTLW 226

Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
            LS QE++DC G  + GC GG   + L ++  N   LE E +YP     +A K      N
Sbjct: 227 SLSEQELVDCDG-ASYGCGGGFLTSALGFILGNG--LETEDDYPY----SATKHDQCWIN 279

Query: 244 GVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVI---QYNC-DGS 296
           G K + +  +   L  SE  +   +A  GPV  A++   ++  Y  G+    ++ C D S
Sbjct: 280 GDKTRVWIDEGYQLTMSEDDVAEWVANVGPVSFAMSVPKSFPAYHDGIYSPSEHECKDES 339

Query: 297 LANINHAVQIVGY 309
           L    HA+ I+GY
Sbjct: 340 LG--YHAMAIIGY 350


>gi|19851|emb|CAA78365.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
          Length = 365

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 87/288 (30%), Positives = 131/288 (45%), Gaps = 38/288 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F+ ++ K Y S+ EHD RFK F+ +L        N+    SA +GIT+FSDL+  EF
Sbjct: 49  FSLFKSKFGKIYASEEEHDHRFKVFKANL---RRARLNQLLDPSAEHGITKFSDLTPSEF 105

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           +  +L                   H    K +      +PT  +P   DWR+ G +  V+
Sbjct: 106 RRTYLGL-----------------HKPKPKVNAEKAPILPTSDLPADYDWRDHGAVTGVK 148

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FST    E  H L  G L  LS Q+++DC          + + GC GG  
Sbjct: 149 NQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLM 208

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               ++  +    L+ E +YP   KD  C     S     + +++   L   E  I  ++
Sbjct: 209 TTAFEYT-LKAGGLQLEKDYPYTGKDGKCHFD-KSKIAAAVTNFSVIGL--DEDQIAANL 264

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGYDNYS 313
             HGP+   +NA   Q Y+GGV     C       +H V +VGY ++ 
Sbjct: 265 VKHGPLAVGINAAWMQTYVGGVSCPLIC---FKRQDHGVLLVGYGSHG 309


>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
          Length = 465

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 90/277 (32%), Positives = 129/277 (46%), Gaps = 28/277 (10%)

Query: 46  KSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV 104
           +SY+   EH+ RF+ F  +L   +  N  R      R G+  F+DL+ EEF+   L   V
Sbjct: 63  RSYNALGEHERRFRVFWDNLRFADAHNA-RADDHGFRLGMNRFADLTNEEFRATFLGAKV 121

Query: 105 NKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWA 164
              V  S      + H+ V++            +P   DWRE G +  V+NQ  CG+CWA
Sbjct: 122 ---VERSRAAGERYRHDGVEE------------LPESVDWREKGAVAPVKNQGQCGSCWA 166

Query: 165 FSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPE 223
           FS V T ES++ L  G +  LS QE+++C+ NG N GC+GG      D++ +    ++ E
Sbjct: 167 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFI-IKNGGIDTE 225

Query: 224 SEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTW 281
            +YP    D  C     +   V I  +  D     E S+   +A H PV  A+ A    +
Sbjct: 226 DDYPYKAVDGKCDINRENAKVVSIDGFE-DVPQNDEKSLQKAVA-HQPVSVAIEAGGREF 283

Query: 282 QYYLGGVIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
           Q Y  GV    C  SL   +H V  VGY  DN    W
Sbjct: 284 QLYHSGVFSGRCGTSL---DHGVVAVGYGTDNGKDYW 317


>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
          Length = 380

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 83/307 (27%), Positives = 148/307 (48%), Gaps = 27/307 (8%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
           +LF   L+ L        ++K   ++   ++ S+  +Y KSY S  E + RF+ F+++L 
Sbjct: 12  LLFFSTLLVLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKETLR 71

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
            I+E N +  +  S R G+ +F+D + EEF++ +L  +   + +   +++          
Sbjct: 72  FIDEHNAD--TNRSYRVGLNQFADQTNEEFQSTYLGFTSGSNKMKVSNRYEPR------- 122

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
                   +   +P   DWR AG +  +++Q  CG+CWAFS + T E ++ +  G L  L
Sbjct: 123 --------VGQVLPDYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISL 174

Query: 186 SVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           S QE++DC    N  GC GG       ++ +N   +  E+ YP   +D  C     +   
Sbjct: 175 SEQELVDCGRTQNTRGCDGGSITDGFQFI-INNGGINTEANYPYTAEDGQCNLDLQNEKY 233

Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINH 302
             I +Y  +    +E ++ T +A + PV  A+ A    +Q+Y  G+    C  +   ++H
Sbjct: 234 ASIDTYE-NVPYNNEWALQTAVA-YQPVSVALEAAGDAFQHYSSGIFTGPCGTA---VDH 288

Query: 303 AVQIVGY 309
           AV IVGY
Sbjct: 289 AVTIVGY 295


>gi|343471318|emb|CCD16236.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 92/317 (29%), Positives = 154/317 (48%), Gaps = 30/317 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRMFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +S+   E   +   +   A +G+T+FSD+S EEF+  +L  +          K++     
Sbjct: 67  QSM---ERAKEEAAANPYATFGVTQFSDMSPEEFRATYLNGA----------KYYAAALE 113

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +K      + + TG  P   DWR+ G +  V++Q +CG+CWAF+     E    +   
Sbjct: 114 RPRKV-----VNVSTGKAPPAVDWRKKGAVTPVKDQGSCGSCWAFAATGNIEGQWKIAGH 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK--R 237
            L+ LS Q ++ C    +  C GG       W+   NK  +  E  YP    D       
Sbjct: 169 ELTSLSEQMLVSCDTTED-NCRGGFADRAFKWIVSSNKGNVFTEESYPYASTDGYVPPCN 227

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
           K+    G KI  +    L   E++I   +A +GPV  AV+A T+  Y GGV+  +C  S 
Sbjct: 228 KSGKVVGAKISGHI--NLPKDENAIAEWLARNGPVAIAVDASTFLDYKGGVLT-SC--SS 282

Query: 298 ANINHAVQIVGYDNYSR 314
             ++H V +VGY++ S+
Sbjct: 283 EGLSHDVLLVGYNDTSK 299


>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
          Length = 394

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 85/283 (30%), Positives = 132/283 (46%), Gaps = 33/283 (11%)

Query: 37  FSSFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F +++ K YS +E H  RF  F+K+L    +  ++++    A +GI +FSDL+EEEF
Sbjct: 75  FAHFVKKFNKEYSGAEEHARRFSIFKKNL---HKALRHQKLDRDAIHGINKFSDLTEEEF 131

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
             ++L  +     L               +R+    I     +P   DWRE G +  V+N
Sbjct: 132 HEQYLGLTTPPRSL--------------SQRTQPAPILPTDDLPPDFDWRELGAVTPVKN 177

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
           Q  CG+CW FST    E  + +K G L  LS Q+++DC            + GC+GG   
Sbjct: 178 QGACGSCWTFSTTGAMEGANFMKTGKLISLSEQQLVDCDHECDSSEPDVCDSGCNGGLMT 237

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
               +  +    L+ E +YP    D +CK   T    V        T+   E  I  ++ 
Sbjct: 238 TAYQYA-LKAGGLQREEDYPYTGIDGSCKFDNTK---VAAMVANFSTVSIDEDQIAANLV 293

Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            +GP+   +NA   Q Y+GGV   Y C+    N++H V +VGY
Sbjct: 294 KNGPLAVGINAAFMQTYVGGVSCPYVCNKQ--NLDHGVLLVGY 334


>gi|118489556|gb|ABK96580.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 367

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 81/283 (28%), Positives = 134/283 (47%), Gaps = 34/283 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+SF+ ++ K+Y ++ EHD RF  F+ +L       K++    +A +G+T+FSDL+ +EF
Sbjct: 51  FTSFKSKFGKTYATQEEHDYRFGVFKANL---RRAKKHQMIDPTAAHGVTKFSDLTPKEF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           + + L       +    +K                 I   T +P   DWR+ G + +V++
Sbjct: 108 RRQFLGLKRRLRLPTDANK---------------APILPTTDLPTDYDWRDHGAVTEVKD 152

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDFC 207
           Q +CG+CW+FS     E  H L  G L+ LS Q+++DC         G  + GC GG   
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
              ++  +    LE E +YP    D    +   S     + +++  ++   E  I  ++ 
Sbjct: 213 NAFEYA-LKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSI--DEDQIAANLV 269

Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            HGP+  A+NA   Q Y+GGV   Y C       +H V +VGY
Sbjct: 270 KHGPLSVAINAAFMQTYVGGVSCPYICS---KRQDHGVLLVGY 309


>gi|5051468|emb|CAB44983.1| putative preprocysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 94/331 (28%), Positives = 145/331 (43%), Gaps = 52/331 (15%)

Query: 8   LFIVALIALCFL--AIPVKVSKPNLEQKLEL------------FSSFQQRYKKSY-SKSE 52
           LF+++L+A      AI      P + Q +              FS F+ ++ K Y S+ E
Sbjct: 4   LFLLSLLAFVLFSSAIAFSDEDPLIRQVVSETDDSHLLNAEHHFSLFKSKFGKIYASEEE 63

Query: 53  HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
           HD RFK F+ +L       +++    SA +GIT+FSDL+  EF+  +L            
Sbjct: 64  HDHRFKVFKANL---RRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGL---------- 110

Query: 113 HKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
                  H    K +      +PT  +P   DWR+ G +  V+NQ +CG+CW+FST    
Sbjct: 111 -------HKPKPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAV 163

Query: 172 ESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPE 223
           E  H L  G L  LS Q+++DC            + GC GG      ++  +    L+ E
Sbjct: 164 EGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYT-LKAGGLQLE 222

Query: 224 SEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY 283
            +YP   KD  C     S     + +++   L   E  I  ++  HGP+   +NA   Q 
Sbjct: 223 KDYPYTGKDGKCHFD-KSKIAAAVTNFSVIGL--DEDQIAANLVKHGPLAVGINAAWMQT 279

Query: 284 YLGGV-IQYNCDGSLANINHAVQIVGYDNYS 313
           Y+GGV     C       +H V +VGY ++ 
Sbjct: 280 YVGGVSCPLIC---FKRQDHGVLLVGYGSHG 307


>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
           Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
 gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
          Length = 380

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 150/312 (48%), Gaps = 27/312 (8%)

Query: 2   FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
           F   ++LF   L+ L        +++   ++   ++ S+  +Y KSY S  E + RF+ F
Sbjct: 7   FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           +++L  I+E N +     S + G+ +F+DL++EEF++ +L  +   +     +++     
Sbjct: 67  KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPR-- 122

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                     G  +P+ +    DWR AG +  +++Q  CG CWAFS + T E ++ +  G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169

Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS QE+IDC    N  GC+GG       ++ +N   +  E  YP   +D  C    
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNLDL 228

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
            +   V I +Y  + +  +    L    T+ PV  A++A    +++Y  G+    C  + 
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTA- 285

Query: 298 ANINHAVQIVGY 309
             I+HAV IVGY
Sbjct: 286 --IDHAVTIVGY 295


>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
          Length = 472

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 88/284 (30%), Positives = 145/284 (51%), Gaps = 38/284 (13%)

Query: 37  FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F +++K+ YS  +E   RFK + ++L  +E+L    +   +A YG+T+FSD+S EEF
Sbjct: 170 FLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKG--TAIYGVTQFSDMSPEEF 227

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +   L  S+    ++S+   +D     +KK ++T        +P + DWR  G++  V+N
Sbjct: 228 QKTML-PSLWWDRVVSNGVEYD-----LKKFNLTF-----NNLPEQFDWRTKGVVTPVKN 276

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
           Q +CG+CWAFS     E + A+K G L  LS QE+IDC    + GC+GG    +  + ++
Sbjct: 277 QGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDC-DRIDKGCNGG--LPINAFREI 333

Query: 216 NKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL-----IPSESSILTD-IAT 268
            ++  LEPE +YP   ++  C           I+S    T+     IP   +++   I  
Sbjct: 334 QRMGGLEPEDQYPYKARNGTCHL---------IRSAIAVTIDDAVEIPRNETVMKAWIVQ 384

Query: 269 HGPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGY 309
            GP+   ++A    YY  G++   +  C  S   I+H V I GY
Sbjct: 385 RGPLSVGIDAKLLAYYKSGILHPSRSRCPPS--GIDHGVLITGY 426


>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
          Length = 352

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 93/286 (32%), Positives = 135/286 (47%), Gaps = 30/286 (10%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++E+ ++LF S+  ++ K Y   +  I RF+ F  +L  I+E NK   S      G+  F
Sbjct: 40  SIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNS---YWLGLNGF 96

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DLS +EFK +++         + H  + D  + HV            T  P   DWR  
Sbjct: 97  ADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKHV------------TNYPQSIDWRAK 144

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG+CWAFST+ T E ++ +  G L  LS QE++DC  + + GC GG   
Sbjct: 145 GAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQT 203

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
             L ++  N V       YP   K   C  +AT   G K+K  T    +PS  E+S L  
Sbjct: 204 TSLQYVANNGV--HTSKVYPYQAKQYKC--RATDKPGPKVK-ITGYKRVPSNCETSFLGA 258

Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A   P+   V A    +Q Y  GV    C   L   +HAV  VGY
Sbjct: 259 LANQ-PLSFLVEAGGKPFQLYKSGVFDGPCGTKL---DHAVTAVGY 300


>gi|334332718|ref|XP_001367502.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 333

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 88/308 (28%), Positives = 151/308 (49%), Gaps = 30/308 (9%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
           F + L +LC   + +  + P  ++ L+  +  ++ ++ KSY  +E  +R   +EK+L +I
Sbjct: 3   FYLCLASLC---LGLAAAIPPFDRALDSQWHQWKAQHGKSYEANEDSLRRATWEKNLKMI 59

Query: 68  EELNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           E  N+   + + S +  + +F D+S EEFK            +M+ +K +       + +
Sbjct: 60  ERHNQEYSAGKHSFQLRMNKFGDMSTEEFKQ-----------VMNGYKSNGSQR---RTK 105

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
                 ++   +P   DWRE G +  V+ Q  CGACW+FS V   E     K G L  LS
Sbjct: 106 GSLYRESLLAQLPESVDWREKGYVTPVKEQGDCGACWSFSAVGAIEGQWFRKTGKLVSLS 165

Query: 187 VQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
           +Q +IDC    GN GC GG       ++  N  + + E  YP + +D  CK K    +G 
Sbjct: 166 IQNLIDCTIPEGNNGCDGGFMDNAFQYVQDNGGI-DTEECYPYVAQDTECKYKPEC-SGA 223

Query: 246 KIKSYTCDTLIPS--ESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANIN 301
            I  +     IPS  E +++  +AT GP+   +++   ++++Y  GV  Y  D S + ++
Sbjct: 224 NITGF---VDIPSMDERALMEAVATVGPISVGIDSANPSFKFYQSGVY-YEPDCSSSQLD 279

Query: 302 HAVQIVGY 309
           H V +VGY
Sbjct: 280 HGVLVVGY 287


>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 341

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 91/310 (29%), Positives = 149/310 (48%), Gaps = 33/310 (10%)

Query: 7   VLFIVALIALCFLAIPVKVSKPN---LEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKS 63
           +L IV  I LC  A+       +   +E+  +  + F + YK    K++   RF+ F+ +
Sbjct: 8   LLAIVGCICLCSSAVLSARELGDTAMVERHEQWMAKFNRVYKDGTEKAQ---RFEVFKAN 64

Query: 64  LDIIEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           +  IE  N +NR+       G+ +F+DL+ +EF+        NK + MS  +        
Sbjct: 65  VAFIESFNAENRK----FWLGVNQFTDLTNDEFRATK----TNKGLKMSGGRAPTGF--- 113

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
             K S  +   +PT +    DWR  G++  +++Q  CG CWAFS V   E +  L  G L
Sbjct: 114 --KYSNVSIDALPTAV----DWRTKGVVTPIKDQGQCGCCWAFSAVVATEGIVKLSTGKL 167

Query: 183 SLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
             LS QE++DC  +G + GC GG+      ++ +    L  E+ YP   +D  CK    S
Sbjct: 168 ISLSEQELVDCDVHGVDQGCEGGEMDDAFKFI-IKNGGLTTEANYPYTAQDGQCKTSIAS 226

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
            +   IK Y  D     ESS++  +A   PV  AV+   + +Q+Y GGV+  +C     +
Sbjct: 227 NSVATIKGYE-DVPANDESSLMKAVANQ-PVSVAVDGGDVIFQHYSGGVMTGSCG---TD 281

Query: 300 INHAVQIVGY 309
           ++H +  +GY
Sbjct: 282 LDHGIAAIGY 291


>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 341

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 90/306 (29%), Positives = 139/306 (45%), Gaps = 22/306 (7%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLD 65
           + F++A+I     +             +E    +  R+ + YS  SE   RF+ F+K+L 
Sbjct: 5   IFFLLAIILSSRTSGATSRGGLFEASAIEKHEQWMSRFHRVYSDDSEKTSRFEIFKKNLK 64

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
            +E  N N  + ++    + EFSDL++EEFK R+    V +   M+     D H   V  
Sbjct: 65  FVESFNMN--TNKTYTLDVNEFSDLTDEEFKARYTGLVVPEG--MTRMSTTDSHET-VSF 119

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
           R    G T  +      DWRE G +  V++QQ CG CWAFS V   E M  +  G L  L
Sbjct: 120 RYENVGETGES-----MDWREEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIAKGELVSL 174

Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
           S Q+++DC+   N GC GG      D++  N+ +   E  YP       C+    +    
Sbjct: 175 SEQQLLDCSTE-NDGCDGGIMWKAFDYIVENQGIT-AEDNYPYQGAQQTCESNHVAA--A 230

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY--YLGGVIQYNCDGSLANINHA 303
            I  Y  +T+  ++   L    +  PV  A+    +++  Y GG+    C     ++NHA
Sbjct: 231 TISGY--ETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECG---THLNHA 285

Query: 304 VQIVGY 309
           V IVGY
Sbjct: 286 VTIVGY 291


>gi|118485910|gb|ABK94801.1| unknown [Populus trichocarpa]
          Length = 367

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 81/283 (28%), Positives = 134/283 (47%), Gaps = 34/283 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+SF+ ++ K+Y ++ EHD RF  F+ +L       K++    +A +G+T+FSDL+ +EF
Sbjct: 51  FTSFKSKFGKTYATQEEHDYRFGVFKANL---RRAKKHQMIDPTAAHGVTKFSDLTPKEF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           + + L       +    +K                 I   T +P   DWR+ G + +V++
Sbjct: 108 RRQFLGLKRRLRLPTDANK---------------APILPTTDLPTDYDWRDHGAVTEVKD 152

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDFC 207
           Q +CG+CW+FS     E  H L  G L+ LS Q+++DC         G  + GC GG   
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
              ++  +    LE E +YP    D    +   S     + +++  ++   E  I  ++ 
Sbjct: 213 NAFEYA-LKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSI--DEDQIAANLV 269

Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            HGP+  A+NA   Q Y+GGV   Y C       +H V +VGY
Sbjct: 270 KHGPLSVAINAAFMQTYVGGVSCPYICS---KRQDHGVLLVGY 309


>gi|46309423|ref|YP_006313.1| ORF31 [Agrotis segetum granulovirus]
 gi|46200640|gb|AAS82707.1| ORF31 [Agrotis segetum granulovirus]
          Length = 327

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 92/304 (30%), Positives = 147/304 (48%), Gaps = 30/304 (9%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELN 71
           +  + FL   V+ +  NL    +LF  F Q+Y KSYS + E  I+F NF+ +   I  +N
Sbjct: 1   MFYILFLIGLVQGALYNLNDSEKLFEDFVQKYNKSYSSEEERQIKFDNFKNN---IRSIN 57

Query: 72  KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
           +      SA Y I  +SD+++ E   +     +N         + D   N    + +  G
Sbjct: 58  EKNSLSNSAVYDINFYSDMNKNELLRKQTGFKINLK-----KNNLDLSWNIKCNKKLING 112

Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
                 +P   DWR+  +I  V+NQ+ CG+CWAFST+   ES++A+K   L  LS Q+++
Sbjct: 113 -NPAVLLPDSFDWRDRHVITSVKNQRDCGSCWAFSTIANIESLYAIKYNKLLDLSEQQLV 171

Query: 192 DCAGNGNMGCSGGDFCALLDW-MD--VNKVVLEPESEYPLLLKDAACKRKA--TSPNGVK 246
           +C    N GC+GG    L+ W M+  + +  +  E+++P    D  CKRK    + NG  
Sbjct: 172 NCDEQNN-GCNGG----LMHWAMEEIIRQGGVSNETDFPYTASDGFCKRKQGFVNING-- 224

Query: 247 IKSYTCDTLIPSESSILTDIAT-HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQ 305
                C+  I S    L ++   +GP+  A++ +    Y  G I   C      +NHAV 
Sbjct: 225 -----CNQFILSNEDRLRELLIFNGPISIAIDVIDVIDYSQG-ISSTCRNDNG-LNHAVL 277

Query: 306 IVGY 309
           +VGY
Sbjct: 278 LVGY 281


>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
          Length = 372

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 84/283 (29%), Positives = 133/283 (46%), Gaps = 39/283 (13%)

Query: 39  SFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTR 98
           + Q R  +S    EH  RF+ F++++  I+ +NK +  P   + G+ +F+DLS EEFK  
Sbjct: 49  ALQHRSTRSLDSDEHARRFEIFKENVKHIDSVNK-KDGP--YKLGLNKFADLSNEEFKAM 105

Query: 99  HLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI---PTGIPVKKDWREAGIIGKVRN 155
           H+   + KH  +               R + +G  +      +P   DWR+ G +  V+N
Sbjct: 106 HMTTKMEKHKSLR------------GDRGVESGSFMYQNSKRLPASIDWRKKGAVTPVKN 153

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
           Q  CG+CWAFST+ + E ++ +K G L  LS Q+++DC+   N GC+GG       ++  
Sbjct: 154 QGQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDCSKE-NAGCNGGLMDNAFQYIID 212

Query: 216 NKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLI-------PSESSILTDIAT 268
           N  ++  E EYP   +   C          KI+S +  T+I        +    L     
Sbjct: 213 NGGIV-TEDEYPYTAEAGECST-------TKIESKSIATIIDGFEDVPANNEGALKKAVA 264

Query: 269 HGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           H PV  A+ A    +Q+Y  GV    C   L   +H V +VGY
Sbjct: 265 HQPVSIAIEASGHDFQFYSTGVFTGKCGTEL---DHGVVVVGY 304


>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
          Length = 437

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 88/284 (30%), Positives = 145/284 (51%), Gaps = 38/284 (13%)

Query: 37  FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F +++K+ YS  +E   RFK + ++L  +E+L    +   +A YG+T+FSD+S EEF
Sbjct: 135 FLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKG--TAIYGVTQFSDMSPEEF 192

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +   L  S+    ++S+   +D     +KK ++T        +P + DWR  G++  V+N
Sbjct: 193 QKTML-PSLWWDRVVSNGVEYD-----LKKFNLTF-----NNLPEQFDWRTKGVVTPVKN 241

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
           Q +CG+CWAFS     E + A+K G L  LS QE+IDC    + GC+GG    +  + ++
Sbjct: 242 QGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDC-DRIDKGCNGG--LPINAFREI 298

Query: 216 NKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL-----IPSESSILTD-IAT 268
            ++  LEPE +YP   ++  C           I+S    T+     IP   +++   I  
Sbjct: 299 QRMGGLEPEDQYPYKARNGTCHL---------IRSAIAVTIDDAVEIPRNETVMKAWIVQ 349

Query: 269 HGPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGY 309
            GP+   ++A    YY  G++   +  C  S   I+H V I GY
Sbjct: 350 RGPLSVGIDAKLLAYYKSGILHPSRSRCPPS--GIDHGVLITGY 391


>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 366

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 84/313 (26%), Positives = 146/313 (46%), Gaps = 31/313 (9%)

Query: 7   VLFIVALIALCF-LAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFKNFE 61
            L I  L+ L F L+  +  S        E+ + +++   +++K Y+   E D RF+ F+
Sbjct: 6   TLMISTLLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLGEKDKRFQVFK 65

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
            +L  I+E N N+ +  + + G+ +F+D++ EE++  +     +    +   K   H + 
Sbjct: 66  DNLGFIQEHNNNQNN--TYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYA 123

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
           +          +    +PV  DWR  G +  +++Q +CG+CWAFSTV T E+++ +  G 
Sbjct: 124 Y----------SAGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGK 173

Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRK 238
              LS QE++DC    N GC+GG    L+D+     +    ++ + +YP    D  C   
Sbjct: 174 FVSLSEQELVDCDRAYNQGCNGG----LMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPT 229

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
             +   V I  Y  + + P + + L       PV  A+ A     Q Y  GV    C  S
Sbjct: 230 KKNAKAVNIDGY--EDVPPYDENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTS 287

Query: 297 LANINHAVQIVGY 309
           L   +H V +VGY
Sbjct: 288 L---DHGVVVVGY 297


>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 148/315 (46%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S     YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGHVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGI-SSESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
 gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
          Length = 371

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 89/318 (27%), Positives = 150/318 (47%), Gaps = 33/318 (10%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKL---ELFSSFQQRYKKSYSKS------EHDIR- 56
           VL  V+L AL  LA P +   P  E+ L   E   +  ++++  Y  S      E D + 
Sbjct: 6   VLAAVSL-ALLVLAPPARAGIPFTEKDLASEESLRALYEQWRSHYMVSRPAGLQEQDDKA 64

Query: 57  --FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
             F  F++++  I E NK  +S    R  + +F+D++ +EF+          +   S  +
Sbjct: 65  RWFNVFKENVRYIHEANKKGRS---FRLALNKFADMTTDEFR--------RAYAAGSRTR 113

Query: 115 HHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
           HH    + +++    + +    G +P+  DWR+ G +  +++Q  CG+CWAFST+   E 
Sbjct: 114 HHRALSSGIRRHGDGSFMYAQAGNLPLAVDWRQRGAVTGIKDQGQCGSCWAFSTIAAVEG 173

Query: 174 MHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA 233
           ++ ++ G L  LS QE++DC    N GC+GG       ++  N  +   ES YP L +  
Sbjct: 174 INKIRTGKLVSLSEQELVDCDDVDNQGCNGGLMDYAFQYIKRNGGIT-TESNYPYLAEQR 232

Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQY 291
           +C +     + V I  Y  D    +E ++   +A   PV  A+ A    +Q+Y  GV   
Sbjct: 233 SCNKAKERSHDVTIDGYE-DVPANNEDALQKAVANQ-PVSIAIEASGQDFQFYSEGVFTG 290

Query: 292 NCDGSLANINHAVQIVGY 309
           +C   L   +H V  VGY
Sbjct: 291 SCGTEL---DHGVAAVGY 305


>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
 gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
          Length = 338

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 78/286 (27%), Positives = 136/286 (47%), Gaps = 33/286 (11%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           + ++ +RY + Y  + E ++RF  ++ ++  IE  N    S +        F+D++ EEF
Sbjct: 39  YETWLKRYGRHYRDREEWEVRFDIYQSNVQYIEFYNSQNYSYKLID---NRFADITNEEF 95

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           K+ +L + + +  + +  ++H H                   +P   DWR+ G +  V++
Sbjct: 96  KSTYLGY-LPRFRVQTEFRYHKHGE-----------------LPKSIDWRKKGAVTHVKD 137

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMD 214
           Q  CG+CWAFS V   E ++ +K   L  LS Q++IDC   +GN GC GGD     +++ 
Sbjct: 138 QGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDCDIKSGNEGCEGGDMYIAFNYIK 197

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
            +  +   + EYP   +D  C +     N V I  Y  +++      +L     H PV  
Sbjct: 198 KHGGIATAK-EYPYKGRDGNCNKSKAKNNAVTISGY--ESVPARNEKMLKAAVAHQPVSI 254

Query: 275 AVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
           A +A    +Q+Y  G+   +C     N+NH + IVGY  +N  + W
Sbjct: 255 ATDAGGYAFQFYSKGIFSGSCG---KNLNHGMTIVGYGEENGDKYW 297


>gi|440798540|gb|ELR19607.1| papain family cysteine protease subfamily protein [Acanthamoeba
           castellanii str. Neff]
          Length = 368

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 90/319 (28%), Positives = 149/319 (46%), Gaps = 30/319 (9%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLEL----------FSSFQQRYKKSYSKSEHDIR 56
           +L   A + LC   +   +S    E+   L          F+++ ++  +SY+  E   R
Sbjct: 11  MLMAAACVVLCLATLGSAISPRFDERGYTLLADTHAARSEFNAWARQNGRSYAAQEFGYR 70

Query: 57  FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRH--LRHSVNKHVLMSHHK 114
           +  +  +   +E  N N  +  S   G+ + +D++ +E    +  L  + N     S   
Sbjct: 71  YNVWRDNAAYVEHFNANANA--SFTVGLNDLADMTLDEVARVYTGLAPAANPFTDASSPA 128

Query: 115 HHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
                     +R       +   +P   DWR AG +  V+NQ +CGAC+ FS     E M
Sbjct: 129 APVVDDETELER-------VARQLPASYDWRNAGAVTPVKNQGSCGACYTFSANAAIEGM 181

Query: 175 HALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA 233
           + +  G L+ LS Q ++DCA G GN+GC+GG+      W+  N   +   + YP     +
Sbjct: 182 YKIAAGQLTSLSEQMLLDCAQGTGNLGCNGGNMEITYSWILNNGGGVNTLASYPWSGFRS 241

Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG-VIQ 290
            C+  A S NG  IK+Y   T   SE+ +LT +A+ GPV   +NA   ++ YY  G +I 
Sbjct: 242 TCRYSA-SNNGAVIKAYRRAT-SGSEAGLLT-LASRGPVSVGINASPRSFTYYRSGTLID 298

Query: 291 YNCDGSLANINHAVQIVGY 309
            +C  + A +NHAV +VG+
Sbjct: 299 SSC--TAAGMNHAVTVVGW 315


>gi|297816790|ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 368

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 94/304 (30%), Positives = 135/304 (44%), Gaps = 51/304 (16%)

Query: 27  KPNL-----EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESA 80
           +PNL     E K  +F S    Y K+YS  E  I R   F K  ++++        P +A
Sbjct: 39  RPNLLGTHTESKFRVFMS---DYGKNYSTREEYIHRLGIFAK--NVLKAAEHQMMDP-TA 92

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT---- 136
            +G+T+FSDL+EEEFK                 + +    +    R    G   P     
Sbjct: 93  VHGVTQFSDLTEEEFK-----------------RMYTGVADVGGSRGHAVGAEAPMVEVD 135

Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--- 193
           G+P   DWRE G + +V+NQ  CG+CWAFST   AE  H +  G L  LS Q+++DC   
Sbjct: 136 GLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQA 195

Query: 194 ------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
                     + GC GG      +++ +    LE E  YP   K   CK     P  V +
Sbjct: 196 VCDPKDKKACDNGCGGGLMTNAYEYL-MEAGGLEEERSYPYTGKRGHCK---FDPEKVAV 251

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD--GSLANINHAVQ 305
           +     T+   E  I  ++   GP+   +NA+  Q Y+GGV   +C    S   +NH V 
Sbjct: 252 RVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQTYIGGV---SCPLICSKRKVNHGVL 308

Query: 306 IVGY 309
           +VGY
Sbjct: 309 LVGY 312


>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 91/309 (29%), Positives = 138/309 (44%), Gaps = 36/309 (11%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
           +  L+ + FLA  V           E    +  RY K Y    E + RF+ F+++++ IE
Sbjct: 12  LAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIE 71

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
             N    + +  +  I +F+DL+ EEF     R+    H+  S  +     + +V     
Sbjct: 72  AFNN--AANKRYKLAINQFADLTNEEFIAP--RNRFKGHMCSSIIRTTTFKYENV----- 122

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                  T +P   DWR+ G +  +++Q  CG CWAFS V   E +HAL +G L  LS Q
Sbjct: 123 -------TAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQ 175

Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKATSP 242
           E++DC   G + GC GG    L+D  D  K V     L  E+ YP    D  C     + 
Sbjct: 176 ELVDCDTKGVDQGCEGG----LMD--DAFKFVIQNHGLNTEANYPYKGVDGKCNVNEAAN 229

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
           +   I  Y  D    +E ++   +A   PV  A++A    +Q+Y  GV   +C   L   
Sbjct: 230 DAATITGYE-DVPANNEKALQKAVANQ-PVSVAIDASGSDFQFYKSGVFTGSCGTEL--- 284

Query: 301 NHAVQIVGY 309
           +H V  VGY
Sbjct: 285 DHGVTAVGY 293


>gi|317106675|dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas]
          Length = 368

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 82/284 (28%), Positives = 135/284 (47%), Gaps = 37/284 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F++F+ ++ K+Y ++ EHD RFK F+ +L       K++    +A +G+T FSDL+  EF
Sbjct: 52  FTTFKAKFGKTYATQEEHDYRFKLFKANL---RRARKHQMMDPTAVHGVTMFSDLTPREF 108

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + ++L        L       D H   +          +PT  +P   DWR+ G +  V+
Sbjct: 109 RRQYLG-------LRRLRLPADAHEAPI----------LPTNDLPTDFDWRDHGAVTNVK 151

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
           NQ +CG+CW+FS     E  H L  G L  LS Q+++DC         G  + GC+GG  
Sbjct: 152 NQGSCGSCWSFSAAGALEGAHFLATGELVSLSEQQLVDCDHECDPEEYGACDSGCNGGLM 211

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               ++  +    LE E +YP    D    +   +     + +++  ++   E  I  ++
Sbjct: 212 TTAFEYT-LKAGGLEREEDYPYTGNDRGPCKFDRNKIVASVSNFSVVSI--DEDQIAANL 268

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             HGP+   +NA+  Q Y+GGV   Y C       +H V +VGY
Sbjct: 269 VKHGPLAVGINAVFMQTYMGGVSCPYICS---KRQDHGVLLVGY 309


>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
 gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
 gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
 gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
 gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
 gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
          Length = 466

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 85/271 (31%), Positives = 125/271 (46%), Gaps = 26/271 (9%)

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
            EH+ RF  F  +L  ++  N         R G+  F+DL+ EEF+   L   V +    
Sbjct: 69  GEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGAKVAERSRA 128

Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
           +  ++    H+ V++            +P   DWRE G +  V+NQ  CG+CWAFS V T
Sbjct: 129 AGERYR---HDGVEE------------LPESVDWREKGAVAPVKNQGQCGSCWAFSAVST 173

Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
            ES++ L  G +  LS QE+++C+ NG N GC+GG      D++ +    ++ E +YP  
Sbjct: 174 VESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFI-IKNGGIDTEDDYPYK 232

Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
             D  C     +   V I  +  D     E S+   +A H PV  A+ A    +Q Y  G
Sbjct: 233 AVDGKCDINRENAKVVSIDGFE-DVPQNDEKSLQKAVA-HQPVSVAIEAGGREFQLYHSG 290

Query: 288 VIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
           V    C  SL   +H V  VGY  DN    W
Sbjct: 291 VFSGRCGTSL---DHGVVAVGYGTDNGKDYW 318


>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1 [Vitis vinifera]
          Length = 341

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 90/320 (28%), Positives = 138/320 (43%), Gaps = 40/320 (12%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSYSKS-EHDIRF 57
           M  V    +I   +     A   + +  NL +    E    +  +Y + Y  + E   R+
Sbjct: 1   MASVNQYQYICLALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRY 60

Query: 58  KNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
           K F+ ++  IE  NK     +S +  I EF+DL+ EEF T   R+    H+  +      
Sbjct: 61  KIFKDNVARIESFNKAMD--KSYKLSINEFADLTNEEFGTS--RNRFKAHICSTEATSFK 116

Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
           + +               T +P   DWR+ G +  +++Q  CG+CWAFS V   E +  L
Sbjct: 117 YEN--------------VTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQL 162

Query: 178 KNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLK 231
             G L  LS QE++DC  +G + GC+GG    L+D  D  K +     L  E+ YP    
Sbjct: 163 STGKLISLSEQELVDCDTSGEDQGCNGG----LMD--DAFKFIKQNHGLTTEANYPYAGT 216

Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
           D  C RK  +    KI  Y  + +  +    L     H P+  A++A    +Q+Y  GV 
Sbjct: 217 DGTCNRKKAAHPAAKINGY--EDVPANNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVF 274

Query: 290 QYNCDGSLANINHAVQIVGY 309
              C   L   +H V  VGY
Sbjct: 275 TGQCGTEL---DHGVAAVGY 291


>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
          Length = 473

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 98/285 (34%), Positives = 141/285 (49%), Gaps = 34/285 (11%)

Query: 32  QKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
           Q L  F  F  +YKK YS + E + R + F+++L   E+L    Q   SA YG+T+FSDL
Sbjct: 170 QLLGQFKDFMVKYKKDYSSQEEAERRLQIFQENLKTAEKLQALDQG--SAEYGVTKFSDL 227

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           +EEEF++ +L      + L+S    H         R +       T  P   DWR+ G +
Sbjct: 228 TEEEFRSTYL------NPLLSQWTLH---------RGMKPAPPAKTPAPDSWDWRDHGAV 272

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
             V+NQ  CG+CWAFS     E    LKNGTL  LS QE++DC G  +  C GG      
Sbjct: 273 SPVKNQGMCGSCWAFSVTGNIEGQWFLKNGTLLSLSEQELVDCDGL-DQACRGGLPSNAY 331

Query: 211 DWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT--LIPSESSILTDIA 267
           +   + K+  LE E++Y         K+K    N  K+ +Y   +  L   E  I   +A
Sbjct: 332 E--AIEKLGGLESETDYSY----TGHKQKCDFTN-RKVAAYINSSVELPKDEREIAAWLA 384

Query: 268 THGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
            +GP+  A+NA   Q+Y  GV    +  C+  +  I+HAV +VGY
Sbjct: 385 ENGPISVALNAFAMQFYKKGVSHPWKIFCNPWM--IDHAVLLVGY 427


>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
 gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 84/314 (26%), Positives = 145/314 (46%), Gaps = 35/314 (11%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDI 66
           L I     L   A      + +    +E    +  ++ K Y   E  +R F+ F+ +++ 
Sbjct: 10  LLIALFFVLAMWADQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVEF 69

Query: 67  IEELNKNRQSPESARYGITEFSDLSEEEFKT--RHLRHSVNKHVLMSHHKHHDHHHNHVK 124
           IE  + N     S   GI  F+DL+ EEF+      +  ++   +++  K+ +       
Sbjct: 70  IE--SSNAAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDASRIVTPFKYEN------- 120

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
                      T +P   DWR  G +  +++Q+ CG+CWAFS V   E +H L+ G L  
Sbjct: 121 ----------VTALPYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVS 170

Query: 185 LSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           LS QE++DC   G + GC GG       ++  N  +   E+ Y    +D  C  K  + +
Sbjct: 171 LSEQELVDCDVKGEDKGCQGGLMEDAFKFIKRNGGIT-TEANYAYRGRDGKCDTKKEASH 229

Query: 244 GVKIKSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
             KI  Y    ++P  SE+++L  +A H PV  +++A  +++Q+Y  G+   +C    ++
Sbjct: 230 VAKITGY---QVVPENSEAALLKAVA-HQPVSVSIDAGSMSFQFYQSGIYAGSCG---SD 282

Query: 300 INHAVQIVGYDNYS 313
           +NH V  VGY   S
Sbjct: 283 LNHGVAAVGYGTSS 296


>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
          Length = 358

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 86/279 (30%), Positives = 133/279 (47%), Gaps = 31/279 (11%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y S  E  +RF  F ++L++I   N+ R  P   + GI  ++D+S EEF
Sbjct: 58  FARFAHRYGKRYQSVEEMKLRFAIFMENLELIRSTNR-RGLP--YKLGINRYADMSWEEF 114

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L  +HK  D                    +P  KDWRE GI+  V+
Sbjct: 115 RASRLGAAQNCSATLKGNHKMTDEL------------------LPKTKDWREDGIVSPVK 156

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
           +Q +CG+CW FST    E+ +    G    LS Q+++DCA    N GC+GG      +++
Sbjct: 157 DQGSCGSCWTFSTTGALEAAYTQATGKGISLSEQQLVDCAYAFNNFGCNGGLPSQAFEYI 216

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
             N   L+ E  YP    +  C  K   P  V +K   + +  + +E  +L  +    PV
Sbjct: 217 KYNG-GLDTEESYPYAGVNGFCHFK---PENVGVKVVESVNITLGAEDELLHAVGLVRPV 272

Query: 273 IAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
             A   ++ +++Y GGV   + C  +  ++NHAV  VGY
Sbjct: 273 SIAFEVVSGFRFYKGGVYTSDTCGRTQMDVNHAVLAVGY 311


>gi|343473370|emb|CCD14732.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 92/319 (28%), Positives = 157/319 (49%), Gaps = 34/319 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYS-KSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 67  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   + + TG  P   DWR+ G +  V++Q  CG+CWAFS +   E    +  
Sbjct: 111 ALKRPRKV---VNVSTGKAPEAVDWRKKGAVTPVKDQGACGSCWAFSAIGNIEGQWKVAG 167

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLL---KDAAC 235
             L+ LS Q ++ C    + GC GG     L W+   NK  +     YP      K   C
Sbjct: 168 HELTSLSEQMLVSC-DTTDYGCRGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPC 226

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
             K+    G KI  +    L   E++I   +A +GPV  AV+A ++  Y GGV+  +C  
Sbjct: 227 N-KSGKVVGAKISGHI--NLPKDENAIAEWLAKNGPVAIAVDATSFLGYKGGVLT-SCIS 282

Query: 296 SLANINHAVQIVGYDNYSR 314
               ++H V +VGY++ S+
Sbjct: 283 K--GLDHDVLLVGYNDTSK 299


>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
          Length = 335

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 85/284 (29%), Positives = 140/284 (49%), Gaps = 39/284 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS+F+ ++ K+Y +K EHD RF  F+ +   +     + +   SA +G+T+FSDL+  EF
Sbjct: 22  FSTFKSKFSKTYATKEEHDYRFGVFKSN---VRRAKLHAKLDPSAVHGVTKFSDLTPSEF 78

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
           + + L     K + +  H          +K  I     +PT  +P   DWR+ G +  V+
Sbjct: 79  RRQFLGL---KPLRLPEH---------AQKAPI-----LPTHDLPEDFDWRDKGAVTHVK 121

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
           NQ +CG+CWAFST    E  H L  G L  LS Q+++DC         G  + GC+GG  
Sbjct: 122 NQGSCGSCWAFSTTGALEGSHFLATGELVSLSDQQLVDCDHVCDPEQYGACDSGCNGGLM 181

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               +++ +    ++ E +YP   +D          N   + +++  +L   E  I  ++
Sbjct: 182 NNAFEYI-LESGGVQREEDYPYTGRDRG--PAIDEANAASVSNFSVVSL--DEDQISANL 236

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             +GP+   +NA+  Q Y+GGV   Y C     N++H V +VGY
Sbjct: 237 VKNGPLAIGINAVFMQTYIGGVSCPYICG---KNLDHGVLLVGY 277


>gi|118365756|ref|XP_001016098.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|161754|gb|AAA30114.1| cysteine protease [Tetrahymena thermophila]
 gi|89297865|gb|EAR95853.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 336

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 92/312 (29%), Positives = 152/312 (48%), Gaps = 30/312 (9%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
           +L I+ L+ LC LA  + V      +KL  ++ +  + +++Y ++ E   R   F ++L 
Sbjct: 7   ILSIIMLMPLC-LAQDISV------EKLLAYNKWSSQNQRAYLNEDEKLYRQIVFFENLQ 59

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHS--VNKHVL-MSHHKHHDHHHNH 122
            I+E N N  +  S    + +FSD++ EEF  + L     +N ++  +     H++ +N 
Sbjct: 60  KIKEHNSNPNNTYSIH--LNQFSDMTREEFAEKILMKQDLINDYMKGIGQQATHNNANNE 117

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
            +  S     T+   I    DWR  G +  V++Q  CG+CW+FS     ES + ++N  L
Sbjct: 118 TQMNS--QNHTLAASI----DWRTKGAVTSVKDQGQCGSCWSFSAAALMESFNFIQNKAL 171

Query: 183 SLLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
              S Q+++DC     G  + GC GG     LD+   +KV +    +YP +     C   
Sbjct: 172 VNFSEQQLVDCVTPENGYPSYGCKGGWPATCLDY--ASKVGITTLDKYPYVAVQKNCTVT 229

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
            T+ NG K+K +    +IP+ S+ L       PV   V+A  W YY  G+    C+ +  
Sbjct: 230 GTN-NGFKLKKW---IVIPNTSNDLKSALNFSPVSVLVDATNWDYYSSGIFN-GCNQTNI 284

Query: 299 NINHAVQIVGYD 310
           N+NHAV  VGYD
Sbjct: 285 NLNHAVLAVGYD 296


>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEL 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
          Length = 471

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 85/318 (26%), Positives = 152/318 (47%), Gaps = 35/318 (11%)

Query: 6   NVLFIVALIALCFLAI--------PVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIR 56
           + LF+V  ++L  ++I        P++ ++      ++++  +  ++ K+Y+   E + R
Sbjct: 13  SFLFMVFSLSLASMSIIDYDLPADPLQSTERTEAHMMKMYEHWLVKHGKNYNAIGEKERR 72

Query: 57  FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
           F+ F+ +L  ++E  +N     + + G+T+F+DL+ EE++  +L   + K   +   +  
Sbjct: 73  FEIFKDNLRFVDE--QNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKMEKKEKLRTERSQ 130

Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
            + H       +          P   DWRE G + +V++Q  CG+CWAFSTV + E ++ 
Sbjct: 131 RYLHKAGNDDDL----------PSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGINQ 180

Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDA 233
           +  G L  LS QE++DC    N GC+GG    L+D+     +    ++ E++YP    D 
Sbjct: 181 IVTGDLISLSEQELVDCDKAYNQGCNGG----LMDYAFEFIIKNGGIDSEADYPYRASDN 236

Query: 234 ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQY 291
            C     + + V I  Y  D     E S+   +A   PV  A+ A    +Q Y  GV   
Sbjct: 237 MCDSNRKNAHVVTIDGYE-DVPENDEESLKKAVANQ-PVSVAIEAGGREFQLYQSGVFTG 294

Query: 292 NCDGSLANINHAVQIVGY 309
            C     N++H V  VGY
Sbjct: 295 RCG---TNLDHGVVAVGY 309


>gi|118369234|ref|XP_001017822.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|18913076|gb|AAL79510.1| granule-biosynthesis induced protease Gip1p [Tetrahymena
           thermophila]
 gi|89299589|gb|EAR97577.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 345

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 94/320 (29%), Positives = 144/320 (45%), Gaps = 31/320 (9%)

Query: 6   NVLFIVALIALCFLAIPV--------KVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRF 57
           N L I ALI L  +A P          +S+    Q L  ++ ++  YK+ Y   E  I  
Sbjct: 2   NKLLISALICL-MIATPSVFCQDVENNISEDIKVQDLLAYNKWRFNYKRVYLNEEEQIY- 59

Query: 58  KNFEKSLDIIEELNKNRQSPESARY--GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKH 115
               + +   E L    + P    Y  G+ +FSD+++EEFK R L   ++K         
Sbjct: 60  ----RQIVFFENLASVNKHPSHKSYSKGLNQFSDMTKEEFKQRVLNKKISKKA------S 109

Query: 116 HDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
            +    ++      + +  PT  +P+  DWR+ G++  V+NQ TCG+CW F+T    ES 
Sbjct: 110 SNKGGRNLAADPAVSNLVFPTNNLPLSVDWRKRGVLNPVKNQGTCGSCWTFATAGILESF 169

Query: 175 HALKNGTLSLLSVQEVIDC---AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK 231
           + +KN  L   S Q+++DC   AG  + GC GG     + +     +V     +YP +  
Sbjct: 170 NQIKNKQLLKFSEQQLVDCVSLAGYDSDGCDGGFQEDGVRYAIEYGIV--QSYKYPYVGY 227

Query: 232 DAACKRKATSPNGVKIKSYTCD-TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQ 290
              C  K TSP    +  Y     L+    + L       PV  +VNA TW+ Y GGV  
Sbjct: 228 QGRC--KVTSPTSRSVGFYPQKFQLVNKTEADLKAALVFSPVSISVNADTWKEYYGGVFD 285

Query: 291 YNCDGSLANINHAVQIVGYD 310
                +  ++NHAV  VGYD
Sbjct: 286 ECGYTTEEDLNHAVIAVGYD 305


>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
 gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
          Length = 335

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 96/312 (30%), Positives = 143/312 (45%), Gaps = 37/312 (11%)

Query: 7   VLFIVALIALCFLAI-PVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLD 65
            L++VA  ALC   +     + P L+    L+ ++   +KKSY   E   R   +EK+L 
Sbjct: 2   ALYLVA-AALCLTTVFAAPTTDPALDDHWHLWKNW---HKKSYLPKEEGWRRVLWEKNLR 57

Query: 66  IIEELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
            IE  N +      S R G+ +F D++ EEF+            LM          N  K
Sbjct: 58  TIEFHNLDHSLGKHSYRLGMNQFGDMTNEEFRQ-----------LM----------NGYK 96

Query: 125 KRSITTGITI--PTGIPVKK--DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
            + +  G T   P      K  DWRE G +  V++Q  CG+CWAFST    E  H  K G
Sbjct: 97  NQKMIKGSTFLAPNNFEAPKTVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAG 156

Query: 181 TLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q ++DC+   GN GC+GG       ++  N  + + E  YP   KD       
Sbjct: 157 KLISLSEQNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGI-DSEDSYPYTAKDDQECHYD 215

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSL 297
            + N      +  D    SE  ++  +A+ GPV  AV+A   ++Q+Y  G I Y+ + S 
Sbjct: 216 PNYNSANDTGFV-DVPSGSEKDLMKAVASVGPVSVAVDAGHKSFQFYQSG-IYYDPECSS 273

Query: 298 ANINHAVQIVGY 309
            +++H V +VGY
Sbjct: 274 EDLDHGVLVVGY 285


>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
 gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
          Length = 380

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 150/312 (48%), Gaps = 27/312 (8%)

Query: 2   FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
           F   ++LF   L+ L        +++   ++   ++ S+  +Y KSY S  E + RF+ F
Sbjct: 7   FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           +++L  I+E N +     S + G+ +F+DL++EEF++ +L  +   +     +++     
Sbjct: 67  KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPR-- 122

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                     G  +P+ +    DWR AG +  +++Q  CG CWAFS + T E ++ +  G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169

Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS QE+IDC    N  GC+GG       ++ +N   +  E  YP   +D  C  + 
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNVEL 228

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
            +   V I +Y  + +  +    L    T+ PV  A++A    ++ Y  G+    C  + 
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA- 285

Query: 298 ANINHAVQIVGY 309
             I+HAV IVGY
Sbjct: 286 --IDHAVTIVGY 295


>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 151/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ ++L  +  +   F +     S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGQQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---KVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|118485796|gb|ABK94746.1| unknown [Populus trichocarpa]
          Length = 367

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 82/283 (28%), Positives = 135/283 (47%), Gaps = 34/283 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+SF+ ++ K+Y ++ EHD RF  F+ +L       K++    +A +GIT+FSDL+ +EF
Sbjct: 51  FTSFKSKFGKTYATQEEHDYRFGVFKANL---RRAKKHQMIDPTAAHGITKFSDLTPKEF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           + + L       +    +K                 I   T +P   DWR+ G + +V++
Sbjct: 108 RRQFLGLKRWLRLPTDANK---------------APILPTTDLPTDYDWRDHGAVTEVKD 152

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDFC 207
           Q +CG+CW+FS     E  H L  G L+ LS Q+++DC         G  + GC GG   
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
              ++  +    LE E++YP    D    +   S     + +++  ++   E  I  ++ 
Sbjct: 213 NAFEYA-LKAGGLEREADYPYTGTDGGTCKFDKSKVVASVSNFSVVSI--DEDQIAANLV 269

Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            HGP+  A+NA   Q Y+GGV   Y C       +H V +VGY
Sbjct: 270 KHGPLSVAINAAFMQTYVGGVSCPYICS---KRQDHGVLLVGY 309


>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
          Length = 361

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 93/286 (32%), Positives = 135/286 (47%), Gaps = 30/286 (10%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           ++E+ ++LF S+  ++ K Y   +  I RF+ F  +L  I+E NK   S      G+  F
Sbjct: 40  SIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNS---YWLGLNGF 96

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           +DLS +EFK +++         + H  + D  + HV            T  P   DWR  
Sbjct: 97  ADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKHV------------TNYPQSIDWRAK 144

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G +  V+NQ  CG+CWAFST+ T E ++ +  G L  LS QE++DC  + + GC GG   
Sbjct: 145 GAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQT 203

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
             L ++  N V       YP   K   C  +AT   G K+K  T    +PS  E+S L  
Sbjct: 204 TSLQYVANNGV--HTSKVYPCQAKQYKC--RATDKPGPKVK-ITGYKRVPSNCETSFLGA 258

Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A   P+   V A    +Q Y  GV    C   L   +HAV  VGY
Sbjct: 259 LANQ-PLSFLVEAGGKPFQLYKSGVFDGPCGTKL---DHAVTAVGY 300


>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
 gi|255639509|gb|ACU20049.1| unknown [Glycine max]
          Length = 366

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 86/284 (30%), Positives = 140/284 (49%), Gaps = 37/284 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS+F+ ++ K+Y ++ EHD RF+ F+ +L  +   +  +  P SA +G+T FSDL+  EF
Sbjct: 51  FSAFKTKFGKTYATQEEHDHRFRIFKNNL--LRAKSHQKLDP-SAVHGVTRFSDLTPAEF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L   +    L S            +K  I     +PT  +P   DWRE G +  V+
Sbjct: 108 RRQFL--GLKPLRLPS----------DAQKAPI-----LPTNDLPTDFDWREHGAVTGVK 150

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FS V   E  H L  G L  LS Q+++DC         G  + GC+GG  
Sbjct: 151 NQGSCGSCWSFSAVGALEGAHFLSTGELVSLSEQQLVDCDHECDPEERGACDSGCNGGLM 210

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               ++  +    L  E +YP   +D    +   S     + +++  +L   E  I  ++
Sbjct: 211 TTAFEYT-LQAGGLMREKDYPYTGRDRGPCKFDKSKVAASVANFSVVSL--DEEQIAANL 267

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             +GP+   +NA+  Q Y+GGV   Y C     +++H V +VGY
Sbjct: 268 VQNGPLAVGINAVFMQTYIGGVSCPYICG---KHLDHGVLLVGY 308


>gi|56682917|gb|AAW21813.1| cysteine protease [Triticum aestivum]
          Length = 377

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 84/294 (28%), Positives = 139/294 (47%), Gaps = 35/294 (11%)

Query: 30  LEQKLELFS---SFQQRYKKSYSKSE-HDIRFKNFEKSLDIIEELNKNRQSPESARYGIT 85
           L+  LEL S    F QR+ K+Y  +E H  R   F+ +L       +++    SA +G+T
Sbjct: 43  LDNDLELDSQLLGFVQRFGKTYRDAEEHAHRLSVFKANL---RRARRHQMLDPSAEHGVT 99

Query: 86  EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDW 144
           +FSDL+  EF+   L     +   +       H               +PT G+P   DW
Sbjct: 100 KFSDLTPAEFRRTFLGLKTTRRSFLREMAGSAH-----------DAPVLPTDGLPEDFDW 148

Query: 145 REAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGN 196
           R+ G +G V+NQ +C +CW+FS     E  + L  G + +LS Q+++DC          +
Sbjct: 149 RDHGAVGPVKNQGSCWSCWSFSASGALEGANYLATGKMEVLSEQQLVDCDHECDPAEPDS 208

Query: 197 GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLI 256
            + GC+GG   +   ++ +    LE E +YP   KD  CK +  S     +++++   + 
Sbjct: 209 CDAGCNGGLMTSAFSYL-LKSGGLEREKDYPYTGKDGTCKFE-KSKIAASVQNFS--VVA 264

Query: 257 PSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             E  I  ++  +GP+   +NA   Q Y+GGV   Y C     +++H V +VGY
Sbjct: 265 VDEEQIAANLVEYGPLAIGINAAYMQTYIGGVSCPYICG---RHLDHGVLLVGY 315


>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
          Length = 457

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 82/285 (28%), Positives = 140/285 (49%), Gaps = 30/285 (10%)

Query: 31  EQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           ++ + ++  +  ++ K+Y S  E + RF+ F+ +L  I+E N   ++    R G+  F+D
Sbjct: 36  DEVMAIYEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDEHNSENRT---YRVGLNRFAD 92

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ EE+++ +L                    N ++K S      +   +P   DWR+ G 
Sbjct: 93  LTNEEYRSMYL------------GALSGIRRNKLRKISDRYTPRVGDSLPDSVDWRKEGA 140

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  V++Q +CG+CWAFS V   E ++ +  G L  LS QE++DC  + N GC+GG    L
Sbjct: 141 VVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISLSEQELVDCDNSYNEGCNGG----L 196

Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
           +D+     +N   ++ E +YP L +D  C     +   V I SY  D  + +E+++   +
Sbjct: 197 MDYGFEFIINNGGIDSEEDYPYLARDGRCDTYRKNARVVSIDSYE-DVPVNNEAALQKAV 255

Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           A   PV  A+ A    +Q Y  GV    C  +L   +H V  VGY
Sbjct: 256 ANQ-PVSVAIEAGGRDFQLYSSGVFSGRCGTAL---DHGVVAVGY 296


>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
 gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
          Length = 363

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 89/279 (31%), Positives = 128/279 (45%), Gaps = 29/279 (10%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY KSY S +E   RF+ F +SL ++   N+   S    R GI  FSD+S EEF
Sbjct: 62  FARFAVRYGKSYESAAEVQKRFRIFSESLQLVRSTNRKGLS---YRLGINRFSDMSWEEF 118

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L  +H+                       +P  KDWRE GI+  V+
Sbjct: 119 RATRLGAAQNCSATLAGNHRMR----------------AAAVALPKTKDWREDGIVSPVK 162

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           NQ  CG+CW FST    E+ +    G    LS Q+++DC     N GC+GG      +++
Sbjct: 163 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGKPFNNFGCNGGLPSQAFEYI 222

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP    +  C  KA +  GVK+   + +  + +E  +   +A   PV 
Sbjct: 223 KYNG-GLDTEESYPYKGVNGICDFKAENV-GVKVLD-SVNITLGAEDELKDAVALVRPVS 279

Query: 274 AA---VNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            A   VN    QY  G     +C  +  ++NHAV  VGY
Sbjct: 280 VAFQVVNGFR-QYKSGVYTSDSCGNTPMDVNHAVLAVGY 317


>gi|343472970|emb|CCD15012.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 382

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 94/320 (29%), Positives = 156/320 (48%), Gaps = 36/320 (11%)

Query: 4   VKNVLFIVAL--IALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKN 59
            + + F V L  +A CF  +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ 
Sbjct: 7   TRTLRFSVGLHAVAACF--VPVALGVLHAEQSLQQQFAAFKQKYSRSYRDATEEAFRFRV 64

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F++++   E   +   +   A +G+T FSD+S EEF+              ++H   +++
Sbjct: 65  FKQNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYY 108

Query: 120 HNHVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
              +K+ R +   + + TG  P   DWR+ G +  V++Q  C + WAFS +   E    +
Sbjct: 109 AAALKRPRKV---VNVSTGKAPPAIDWRKKGAVTPVKDQGQCDSSWAFSAIGNIEGQWKV 165

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK 236
               L+ LS Q ++ C  N + GC GG       W+   NK  +  E  YP         
Sbjct: 166 AGHELTSLSEQMLVSCDTN-DFGCGGGFSDPAFKWIVSSNKGNVFTEQSYPYASGGGNVP 224

Query: 237 R--KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD 294
              K+    G KI+      L   E++I   +A +GPV  AV+A ++Q Y GGV+  +C 
Sbjct: 225 TCDKSGKVVGAKIRDRV--DLPRDENAIAEWLAKNGPVAIAVDATSFQSYTGGVLT-SCI 281

Query: 295 GSLANINHAVQIVGYDNYSR 314
                +N AV +VGYD+ S+
Sbjct: 282 SK--EMNSAVLLVGYDDTSK 299


>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
          Length = 359

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 89/308 (28%), Positives = 145/308 (47%), Gaps = 22/308 (7%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS----EHDIRFKNFEKS 63
           L +VA       A  + ++  +LE +  L++ ++ R++  ++ S    E   RF  F+++
Sbjct: 6   LILVASFLASVAATAIDIADKDLETEDSLWNLYE-RWRSHHTVSRDLDEKQKRFNVFKEN 64

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
              I + NK +  P   R  + +F+DL+  EF++ +    +N      HH+         
Sbjct: 65  PRYIHDFNKRKDIPYKLR--LNKFADLTNHEFRSTYAGSRIN------HHRSLRGSRRGG 116

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
              S          +P   DWR+ G +  V++Q  CG+CWAFSTV   E ++ +K   L 
Sbjct: 117 ATNSFMYQSLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLL 176

Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
            LS QE+IDC  + N GC+GG      D++  N  +   E+EYP   +D+ C  +  S +
Sbjct: 177 SLSEQELIDCDTDENNGCNGGLMDYAFDFIKKNGGI-SSEAEYPYAAEDSYCATEKKS-H 234

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
            V I  +  D     E S+L  +A   PV  A+ A    +Q+Y  GV       S   ++
Sbjct: 235 VVSIDGHE-DVPANDEDSLLKAVANQ-PVSIAIEASGYDFQFYSEGVF---TGRSGTELD 289

Query: 302 HAVQIVGY 309
           H V IVGY
Sbjct: 290 HGVAIVGY 297


>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
 gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
 gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
 gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 90/312 (28%), Positives = 146/312 (46%), Gaps = 38/312 (12%)

Query: 9   FIVALIALCFL--AIPVK------VSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNF 60
           F    +AL F+  A P K      +  P  E+  +  + + + YK     +E   R+  F
Sbjct: 7   FQFVCLALLFILGAWPSKSTARTLLDAPMYERHEQWMTQYGRVYKDD---NERATRYSIF 63

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           ++++  I+  N   Q+ +S + G+ +F+DL+ EEFK    R+    H  M   +     +
Sbjct: 64  KENVARIDAFNS--QTGKSYKLGVNQFADLTNEEFKAS--RNRFKGH--MCSPQAGPFRY 117

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
            +V            + +P   DWR+ G +  V++Q  CG CWAFS V   E ++ L  G
Sbjct: 118 ENV------------SAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTG 165

Query: 181 TLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS QEV+DC   G + GC+GG       +++ NK  L  E+ YP    D  C    
Sbjct: 166 KLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNK-GLTTEANYPYKGTDGTCNTNK 224

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSL 297
            + +  KI  +  D    SE++++  +A   PV  A++A    +Q+Y  G+   +CD  L
Sbjct: 225 AAIHAAKITGFE-DVPANSEAALMKAVAKQ-PVSVAIDAGGSDFQFYSSGIFTGSCDTQL 282

Query: 298 ANINHAVQIVGY 309
              +H V  VGY
Sbjct: 283 ---DHGVTAVGY 291


>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
          Length = 324

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 84/282 (29%), Positives = 141/282 (50%), Gaps = 32/282 (11%)

Query: 35  ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSEE 93
           E+++ F+  + K+Y+    D+R   +E+ L++I + N +      +   G+ E+ DL++ 
Sbjct: 22  EMWTLFKTTHSKTYATEAEDMRRFIWERHLNMINQHNIEADLGKHTFSLGMNEYGDLTQH 81

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK--DWREAGIIG 151
           E+              MS +K        + K S+ +    P  + V K  DWRE G + 
Sbjct: 82  EY------------AAMSGYK--------MAKSSVGSSFLEPENLQVPKTVDWREKGYVT 121

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALL 210
            V+NQ  CG+CWAFS+  + E     K G L  +S Q ++DC+ + GNMGCSGG      
Sbjct: 122 PVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGCSGGLMDNAF 181

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
            ++  N + ++ E  YP    D  C+ K +  + V   S   D     E+++ T +A+ G
Sbjct: 182 TYIKKN-MGIDSEKSYPYEAVDGECRYKKS--DSVTTDSGFVDIPHGDETALRTAVASVG 238

Query: 271 PVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           PV  A++A   ++Q+Y  GV  + NC  S   ++H V +VGY
Sbjct: 239 PVSVAIDASHTSFQFYKTGVYTEANC--SSTQLDHGVLVVGY 278


>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
 gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
          Length = 356

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 92/297 (30%), Positives = 139/297 (46%), Gaps = 37/297 (12%)

Query: 19  LAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSP 77
           LA+P ++        + LF S+  +++K Y S  E   R+  F+++L  I E N+   S 
Sbjct: 35  LALPNRL--------VNLFKSWSVKHRKIYVSPKEKLKRYGIFKQNLMHIAETNRKNGS- 85

Query: 78  ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG 137
                G+ +F+D++ EEFK  HL             K          +   T        
Sbjct: 86  --YWLGLNQFADITHEEFKANHL-----------GLKQGLSRMGAQTRTPTTFRYAAAAN 132

Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
           +P   DWR  G +  V+NQ  CG+CWAFS+V   E ++ +  G L  LS QE++DC    
Sbjct: 133 LPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSEQELMDCDTML 192

Query: 198 NMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT 254
           + GC GG    L+D+     +    +  E +YP L+++  CK K    N V I  Y  D 
Sbjct: 193 DHGCEGG----LMDFAFAYIMGSQGIHAEDDYPYLMEEGYCKEKQPYANVVTITGYE-DV 247

Query: 255 LIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
              SE S+L  +A H PV   + A +  +Q+Y GGV   +C   L   +HA+  VGY
Sbjct: 248 PENSEISLLKALA-HQPVSVGIAAGSRDFQFYKGGVFDGSCSDEL---DHALTAVGY 300


>gi|303275866|ref|XP_003057227.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461579|gb|EEH58872.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 329

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 91/295 (30%), Positives = 134/295 (45%), Gaps = 36/295 (12%)

Query: 37  FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR----YGITEFSDLSE 92
           F +F   + K+Y+        K + K L+I  E N  R    SAR    YG T F+DL+E
Sbjct: 8   FDAFVLEHGKTYASDA-----KEYAKRLEIFAE-NMARAKEMSARDGAEYGATPFADLTE 61

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIG 151
           +EF +  L         +   K H+         S      +PT  IP+  DWR  G + 
Sbjct: 62  DEFASSLLMREPIDAARVERLKRHE---------SSRVLPHLPTENIPLNFDWRALGAVT 112

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-------AGNG-NMGCSG 203
            V+NQ  CG+CW+FS     E  H +K+G L  LS Q+++DC       +G   + GC G
Sbjct: 113 PVKNQGMCGSCWSFSATGAVEGAHFVKSGALVSLSEQQLVDCDHTCDPDSGTACDSGCDG 172

Query: 204 GDFCALLDWMDVNKVVLEPESEYPLL--LKDAACKRKATSPNGVKIKSYTCDTLIPSESS 261
           G     + ++ V +  L+ E+ YP L    D  CK K   P    I +Y+   +   ES 
Sbjct: 173 GLPANAMAYV-VKRGGLDAEAAYPYLGARGDGRCKSKEDGPPAATITNYS--FVSADESQ 229

Query: 262 ILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGSLANINHAVQIVGYDNYSRT 315
           I   +  HGP+   ++A   Q Y  GV   + CD +   ++H V IVG+    R 
Sbjct: 230 IAAALVKHGPLSVGIDARWMQLYRRGVACPWACDKT--RLDHGVLIVGFGAEGRA 282


>gi|403376023|gb|EJY87990.1| Cathepsin L [Oxytricha trifallax]
          Length = 343

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 87/312 (27%), Positives = 146/312 (46%), Gaps = 36/312 (11%)

Query: 7   VLFIVALIA-LCFLAIPVKVSKPNL-----EQKLELFSSFQQRYKKSY-SKSEHDIRFKN 59
            L IV  +A +   AI    +  NL      Q    F+++  +Y KSY +K E   R++ 
Sbjct: 7   TLAIVGTVATVGLFAISEAPASTNLFAIEVTQDNVAFANYLAKYGKSYGTKEEFQFRYEQ 66

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           ++K++  + + N   Q+  + R GI +F+D + EE+K           VL+ +       
Sbjct: 67  YQKNMAKVAQYNG--QNGNTFRLGINKFTDYTPEEYK-----------VLLGYKPQS--- 110

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
               K  ++          P   DWRE G +  V++Q  CG+CWAFS     E  + + N
Sbjct: 111 ----KPMTLEASYLSEENTPASIDWREKGAVTPVKDQGQCGSCWAFSATGALEGHYQISN 166

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
             L  +S Q+++DC+ +GN GC+GG+     D+   NK  +E ES+Y    KD  C  +A
Sbjct: 167 NKLISISEQQLVDCSHDGNNGCNGGEMYLAFDYASKNK--MELESDYVYHAKDEKCSYEA 224

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSL 297
           +     K+++     +  +  + L     +GPV  A+ A    +Q Y GG++  N     
Sbjct: 225 SKG---KMEADHFQRVPKNSPAQLKAALANGPVSVAIEADNEVFQAYDGGIL--NSKECG 279

Query: 298 ANINHAVQIVGY 309
            N++H V  VG+
Sbjct: 280 TNLDHGVLAVGF 291


>gi|395545396|ref|XP_003774588.1| PREDICTED: cathepsin W [Sarcophilus harrisii]
          Length = 358

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 89/289 (30%), Positives = 139/289 (48%), Gaps = 43/289 (14%)

Query: 35  ELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           E F +FQ +Y KSY  + E + R K F  +L   ++L +  Q    A++G+T FSDL+EE
Sbjct: 42  ERFKAFQIQYNKSYPDAAEQECRLKIFADNLARAQQLTEEHQG--LAQFGVTRFSDLTEE 99

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK---DWREAGII 150
           EF+                  +     N++  R  T G   P    +K    DWR+A ++
Sbjct: 100 EFR----------------RLYQPSQPNYLGLRVKTEGGGYPRLQRLKTRSCDWRKARVL 143

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
             VR+Q+ C +CWA S V   E++ A+    L  LSVQE++DC   G  GC GG F    
Sbjct: 144 TPVRDQKNCNSCWAISAVGNVEALWAINYQQLFKLSVQELLDCRRCGQ-GCEGG-FVWDA 201

Query: 211 DWMDVNKVVLEPESEYPLLLK-DAACKRKATSPNGVKIKSYTCDTLI-------PSESSI 262
               +N+  L  E +YP   +    C++K       K +++  D L+       PS   +
Sbjct: 202 YMTILNQSGLAEEQDYPYRPQLSKGCQKK-------KKRAWIHDFLMLHKEENSPSPPDM 254

Query: 263 LTDIATHGPVIAAVNALTWQYYLGGVIQ--YNCDGSLANINHAVQIVGY 309
              +A  GP+   +N+   + Y+ GVI+   NCD     ++H VQ+VG+
Sbjct: 255 AQYLAEKGPITVTINSRLLKSYIRGVIKPGNNCDPKY--VDHVVQLVGF 301


>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
          Length = 380

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 149/312 (47%), Gaps = 27/312 (8%)

Query: 2   FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
           F   ++LF   L+ L        +++   ++   ++ S+  +Y KSY S  E + RF+ F
Sbjct: 7   FVSMSLLFFSTLLILSLAFNTKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           +++L  I+E   N  +  S + G+ +F+DL++EEF++ +L  +   +     +++     
Sbjct: 67  KETLRFIDE--HNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPR-- 122

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                     G  +P+ +    DWR AG +  +++Q  CG CWAFS + T E ++ +  G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169

Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS QE+IDC    N  GC+GG       ++ +N   +  E  YP   +D  C    
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNVDL 228

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
            +   V I +Y  + +  +    L    T+ PV  A++A    ++ Y  G+    C  + 
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA- 285

Query: 298 ANINHAVQIVGY 309
             I+HAV IVGY
Sbjct: 286 --IDHAVTIVGY 295


>gi|516865|emb|CAA52403.1| putative thiol protease [Arabidopsis thaliana]
          Length = 313

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 85/281 (30%), Positives = 137/281 (48%), Gaps = 36/281 (12%)

Query: 40  FQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTR 98
           F++++ K Y S  EH  RF  F+ +L  +  +   +  P SAR+G+T+FSDL+  EF+ +
Sbjct: 3   FKKKFGKVYGSIEEHYYRFSVFKANL--LRAMRHQKMDP-SARHGVTQFSDLTRSEFRRK 59

Query: 99  HLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQ 157
           HL       V        D +   +          +PT  +P + DWR+ G +  V+NQ 
Sbjct: 60  HL------GVKGGFKLPKDANQAPI----------LPTQNLPEEFDWRDRGAVTPVKNQG 103

Query: 158 TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCAL 209
           +CG+CW+FST    E  H L  G L  LS Q+++DC         G+ + GC+GG   + 
Sbjct: 104 SCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSA 163

Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
            ++  +    L  E +YP    D    +   S     + +++  ++  +E  I  ++  +
Sbjct: 164 FEYT-LKTGGLMREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSI--NEDQIAANLIKN 220

Query: 270 GPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           GP+  A+NA   Q Y+GGV   Y C   L   NH V +VGY
Sbjct: 221 GPLAVAINAAYMQTYIGGVSCPYICSRRL---NHGVLLVGY 258


>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
           [Cucumis sativus]
          Length = 314

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 83/279 (29%), Positives = 129/279 (46%), Gaps = 32/279 (11%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           + +  +  +Y + Y S+ E + RF  ++ ++  I+  N    S   A      F+DL+ E
Sbjct: 17  DRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAE---NNFADLTNE 73

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           EFK  +L +       +S       + N V              +P   DWR+ G +  +
Sbjct: 74  EFKATYLGYKT-----VSIPDTCFRYGNMVN-------------LPTNVDWRQEGAVTPI 115

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDW 212
           +NQ  CG+CWAFS V   E ++ +K G L  LS QE++DC   +GN GC+GG      ++
Sbjct: 116 KNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF 175

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
             + +  L  E EYP    ++AC  +      V I  Y     +  E S+   +A   PV
Sbjct: 176 --IKRTGLTTEIEYPYQGAESACNEQKEKYQFVSISGYE-KVPVNDEKSLKAAVANQ-PV 231

Query: 273 IAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A++A    +Q+Y GG+   NC   L   NH V IVGY
Sbjct: 232 SVAIDAEGNNFQFYSGGIFSGNCGNQL---NHGVAIVGY 267


>gi|229596051|ref|XP_001013456.3| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|225565626|gb|EAR93211.3| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 315

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 92/318 (28%), Positives = 150/318 (47%), Gaps = 46/318 (14%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKP----NLEQKLE-LFSSFQQRYKKSYSKSEHD-IRFK 58
           KN+LF +A +AL   A  + ++K       +Q ++ L+S+F+ +Y K Y+  + +  R +
Sbjct: 3   KNILFAIAGLALLATATTILLTKTHHNTQEDQNIQALWSAFKTKYNKKYADPDFERYRIE 62

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
            F ++L ++E   KN        YGIT+F D++ EEFK  +L   +   +  S     + 
Sbjct: 63  IFTENLKVVESNTKN--------YGITQFMDITREEFKQTYLTLKMKNGLKASPFAKFND 114

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
                       G+ I        DW   G +  V++Q  CG+CW+FST    E    L 
Sbjct: 115 -----------AGVEI--------DWTTKGAVTPVKDQGQCGSCWSFSTTGAVEGALFLS 155

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
              L+ LS Q ++DC+ +GN GC+GG      D+  +++  +  E+ YP    D  CK  
Sbjct: 156 TKKLTSLSEQYLVDCSKDGNEGCNGGLMDTAFDF--ISQHGIPTEAAYPYKAVDGTCKM- 212

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
            + P   KI S+   T I   + +L  I    P+  AV+A  +QYY   +   +C   L 
Sbjct: 213 TSGP--YKISSH---TDIQDCNDLLNKIQKQ-PIAIAVDANNFQYYQKDIFS-DCGTEL- 264

Query: 299 NINHAVQIVGYDNYSRTW 316
             +H V +VGY    + W
Sbjct: 265 --DHGVLLVGYSASGKYW 280


>gi|300121328|emb|CBK21708.2| unnamed protein product [Blastocystis hominis]
          Length = 318

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 97/306 (31%), Positives = 141/306 (46%), Gaps = 39/306 (12%)

Query: 14  IALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNK 72
           I     A+ + V+  N E     F+S+  +Y K+Y+  E    R + F  +L  I+E N 
Sbjct: 4   IFFVLFAVALSVNLRNSE-----FTSYMSKYGKTYAAPEEARYRLRVFNDNLLKIKEHNA 58

Query: 73  NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGI 132
            +  P +   G+ +F+D+S EEF  +    + +                   K   T   
Sbjct: 59  -KNLPWT--LGVNKFADVSAEEFAYKFCGCAKDP------------------KTRGTRQT 97

Query: 133 TIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVID 192
           T+   +P + DWRE G +  V+NQ  CG+CWAFST  T E  + LK G L  LS Q+++D
Sbjct: 98  TLVGDVPARVDWREQGAVTPVKNQGMCGSCWAFSTTGTTEGAYFLKTGNLVSLSEQQLVD 157

Query: 193 CAGNG---NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           CA +    N GCSGG   + +D+  V K  L  E +YP    DA CK  +     V ++S
Sbjct: 158 CARDPEYENFGCSGGWPWSAVDY--VTKHGLCTEEDYPYKGVDAECKESSCK---VAVQS 212

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
                L   +   L    +  PV   ++A   Q Y  G+I   C  S   INHAV  VGY
Sbjct: 213 VDKVQLPVGDEDSLAVAVSKTPVSIVLDATAMQLYDKGIIT-RCSES---INHAVLAVGY 268

Query: 310 DNYSRT 315
           D  + T
Sbjct: 269 DKDAET 274


>gi|281207557|gb|EFA81740.1| hypothetical protein PPL_05734 [Polysphondylium pallidum PN500]
          Length = 387

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 88/306 (28%), Positives = 147/306 (48%), Gaps = 27/306 (8%)

Query: 11  VALIALCFLAIPVKVSKPNL--EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
           V L+A     + V  +   L   Q  + F S+ Q +   Y+  E + R+  F+K+L+ + 
Sbjct: 6   VYLLACTVFMLAVLSANATLTERQYQDSFVSWMQTHNVKYTTQEFNHRYGVFKKNLNFVN 65

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHH--KHHDHHHNHVKKR 126
           + N       S   G+  F+DL+  E++  +L   ++   +M+ +  +  D  +N VK  
Sbjct: 66  QWNA---KGSSTVLGMNVFADLTNAEYQRIYLGSKIDTSSMMNANAARLFDRTYN-VKAL 121

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
           S T             DWR+ G +  ++NQQ CG+CW+FST  + E  H +  G L  LS
Sbjct: 122 SPTV------------DWRQKGAVTHIKNQQQCGSCWSFSTTGSIEGAHEIATGNLVSLS 169

Query: 187 VQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
            Q +IDC+   GN GC+GG      +++  N  + + E+ YP         R   + +G 
Sbjct: 170 EQNLIDCSTAEGNQGCNGGLMTNAFEYVIKNGGI-DTEASYPYSATGPNKCRYNPANSGA 228

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHA 303
            I SY  +  + SE++++   A  GPV  A++A   ++Q Y  G I Y    S   ++H 
Sbjct: 229 TISSYV-NVTVGSETALMA-AANIGPVSVAIDASHNSFQLYDSG-IYYESKCSTTQLDHG 285

Query: 304 VQIVGY 309
           V +VGY
Sbjct: 286 VLVVGY 291


>gi|228245|prf||1801240C Cys protease 3
          Length = 321

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 83/307 (27%), Positives = 152/307 (49%), Gaps = 41/307 (13%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEE 69
           VA + LC LA+    + P+ +        F+ +Y + Y  ++ ++ R + F+++  +IE+
Sbjct: 2   VAALFLCGLALAT--ASPSWDH-------FKTQYGRKYGDAKEELYRQRVFQQNEQLIED 52

Query: 70  LNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            NK  ++ E + +  + +F D++ EEF           + +M  +K           R  
Sbjct: 53  FNKKFENGEVTFKVAMNQFGDMTNEEF-----------NAVMKGYKK--------GSRGE 93

Query: 129 TTGITIPTGIPVKKD--WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
              +    G P+ +D  WR   ++  V++Q+ CG+CWAFS     E  H LKN  L  LS
Sbjct: 94  PKAVFTAEGRPMARDVDWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLS 153

Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
            Q+++DC+ + GN GC GG   +  D++  N  + + ES YP   +D +C+  A S   +
Sbjct: 154 EQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI-DTESSYPYEAEDRSCRFDANSIGAI 212

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINH 302
              S   + +  +E ++   ++  GP+  A++A   ++Q+Y  GV  + NC  +   ++H
Sbjct: 213 CTGS--VEIVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTF--LDH 268

Query: 303 AVQIVGY 309
            V  VGY
Sbjct: 269 GVLAVGY 275


>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
 gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
          Length = 333

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 86/294 (29%), Positives = 138/294 (46%), Gaps = 35/294 (11%)

Query: 25  VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
           ++ P   Q     +  ++  Y++ Y  +E + R   +EK++ +IE  N         ++G
Sbjct: 16  LATPKFNQTFNAQWHKWKSTYRRLYGTNEEEWRRAVWEKNMKMIELHNGEYSE---GKHG 72

Query: 84  IT----EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
            T     F D++ EEF+            L++ +KH  H    V +  +   + +P  + 
Sbjct: 73  YTMEMNAFGDMTNEEFRQ-----------LVNGYKHQKHRKGKVFQEPLM--LQLPKSV- 118

Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGN 198
              DWRE G +  V+NQ  CG+CWAFS     E    LK G L  LS Q ++DC+   GN
Sbjct: 119 ---DWREKGCVTPVKNQGQCGSCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSQAEGN 175

Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
            GC+GG       ++ +N   L+ E  YP   KD  CK K          + T    IP 
Sbjct: 176 QGCNGGLMDFAFQYV-LNNKGLDSEESYPYEAKDGTCKYKPE----FAAANDTGYVDIPQ 230

Query: 259 -ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            E +++  +AT GP+  A++A   ++Q+Y  G I Y  + S   ++H V +VGY
Sbjct: 231 LEKALMKAVATVGPIAIAIDASHPSFQFYSSG-IYYEPNCSSKELDHGVLVVGY 283


>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
           sativus]
          Length = 317

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 83/279 (29%), Positives = 129/279 (46%), Gaps = 32/279 (11%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           + +  +  +Y + Y S+ E + RF  ++ ++  I+  N    S   A      F+DL+ E
Sbjct: 17  DRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAE---NNFADLTNE 73

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           EFK  +L +       +S       + N V              +P   DWR+ G +  +
Sbjct: 74  EFKATYLGYKT-----VSIPDTCFRYGNMVN-------------LPTNVDWRQEGAVTPI 115

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDW 212
           +NQ  CG+CWAFS V   E ++ +K G L  LS QE++DC   +GN GC+GG      ++
Sbjct: 116 KNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF 175

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
             + +  L  E EYP    ++AC  +      V I  Y     +  E S+   +A   PV
Sbjct: 176 --IKRTGLTTEIEYPYQGAESACNEQKEKYQFVSISGYE-KVPVNDEKSLKAAVANQ-PV 231

Query: 273 IAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A++A    +Q+Y GG+   NC   L   NH V IVGY
Sbjct: 232 SVAIDAEGNNFQFYSGGIFSGNCGNQL---NHGVAIVGY 267


>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
          Length = 478

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 97/300 (32%), Positives = 137/300 (45%), Gaps = 34/300 (11%)

Query: 24  KVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY 82
           K+ KP        F  F  R++K Y +K E   RF+ F+++  +I EL KN Q   +A Y
Sbjct: 163 KIIKPRDYVVWNSFLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQG--TAVY 220

Query: 83  GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVK 141
           G T+FSD++  EFK   L +   + V M                    G+TI    +P  
Sbjct: 221 GFTKFSDMTTMEFKETMLPYQWEQPVPMDQANFEKE------------GVTISEEDLPDS 268

Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGC 201
            DWRE G + +V+NQ +CG+CWAFST    E    L    L  LS QE++DC  + + GC
Sbjct: 269 FDWREHGAVTQVKNQGSCGSCWAFSTTGNIEGAWFLAKKKLVSLSEQELVDCD-SVDQGC 327

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC----KRKATSPNGVKIKSYTCDTLIP 257
           +GG        + +    LEPE  YP   +   C    K  A   NG          L  
Sbjct: 328 NGGLPSNAYKEI-IRMGGLEPEDAYPYDGRGETCHLVRKDIAVYING-------SVELPH 379

Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGYDNYSR 314
            E  +   + T GP+   +NA T Q+Y  GV+   +  C+  +  +NH V IVGY    R
Sbjct: 380 DEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVGYGKDGR 437


>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
          Length = 321

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 90/285 (31%), Positives = 127/285 (44%), Gaps = 37/285 (12%)

Query: 37  FSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F ++Y K YS  E  + R   F K  +++         P  A +G+T FSDLSEEEF
Sbjct: 7   FRMFMEKYGKEYSSREEYVHRLGIFAK--NMVRAAEHQALDP-XALHGVTPFSDLSEEEF 63

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
           + R     V +               H+K     T   +   G+P   DWRE G + +V+
Sbjct: 64  E-RMFTGVVGR--------------PHMKGGVAETAAALEVDGLPESFDWREKGAVTEVK 108

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
            Q TCG+CWAFST    E  H +    L  LS Q+++DC            + GC GG  
Sbjct: 109 MQGTCGSCWAFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLM 168

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
                ++ +    LE ES YP   K   CK K   P+ V ++      +   E+ I  ++
Sbjct: 169 TNAYKYL-IEAGGLEEESSYPYTGKHGECKFK---PDRVAVRVVNFTEVPIBENQIAANL 224

Query: 267 ATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN--INHAVQIVGY 309
             HGP+   +NA   Q Y+GGV   +C        INH V +VGY
Sbjct: 225 VCHGPLAVGLNAXFMQTYIGGV---SCPLICPKRWINHGVLLVGY 266


>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F+K++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKKNMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGKLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y  G      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAEGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
          Length = 350

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 83/279 (29%), Positives = 132/279 (47%), Gaps = 21/279 (7%)

Query: 34  LELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           L+ F  +  R+ ++Y+ S E   RF+ + ++++++E  N      + A     +F+DL+ 
Sbjct: 29  LDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSNGYKLAD---NKFADLTN 85

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF+ + L      HV +          N         G +    +P   DWR+ G + +
Sbjct: 86  EEFRAKML--GFRPHVTIPQIS------NTCSADIAMPGESSDDILPKSVDWRKKGAVVE 137

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+NQ  CG+CWAFS V   E ++ +KNG L  LS QE++DC     +GC GG      ++
Sbjct: 138 VKNQGDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEF 196

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           + V    L  E+ YP    + AC+    + + V I  Y    + PS    L   A   PV
Sbjct: 197 V-VGNHGLTTEASYPYHAANGACQAAKLNQSAVAIAGYR--NVTPSSEPDLARAAAAQPV 253

Query: 273 IAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             AV+  +  +Q Y  GV    C    A++NH V +VGY
Sbjct: 254 SVAVDGGSFMFQLYGSGVYTGPC---TADVNHGVTVVGY 289


>gi|224066056|ref|XP_002302004.1| predicted protein [Populus trichocarpa]
 gi|222843730|gb|EEE81277.1| predicted protein [Populus trichocarpa]
          Length = 367

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 82/283 (28%), Positives = 134/283 (47%), Gaps = 34/283 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+SF+ ++ K+Y ++ EHD RF  F+ +L       K++    +A +GIT+FSDL+ +EF
Sbjct: 51  FTSFKSKFGKTYATQEEHDYRFGVFKANL---RRAKKHQMIDPTAAHGITKFSDLTPKEF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           + + L       +    +K                 I   T +P   DWR+ G + +V++
Sbjct: 108 RRQFLGLKRWLRLPTDANK---------------APILPTTDLPTDYDWRDHGAVTEVKD 152

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDFC 207
           Q +CG+CW+FS     E  H L  G L+ LS Q+++DC         G  + GC GG   
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
              ++  +    LE E +YP    D    +   S     + +++  ++   E  I  ++ 
Sbjct: 213 NAFEYA-LKAGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSI--DEDQIAANLV 269

Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            HGP+  A+NA   Q Y+GGV   Y C       +H V +VGY
Sbjct: 270 KHGPLSVAINAAFMQTYVGGVSCPYICS---KRQDHGVLLVGY 309


>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
 gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
 gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
          Length = 346

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 90/323 (27%), Positives = 150/323 (46%), Gaps = 40/323 (12%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--------SEHDIRFKN 59
           LF+    + CF    + +S+P L+ +L +    Q+R+ +  +K         E + R+  
Sbjct: 10  LFVAIFSSFCF---SITLSRP-LDNELIM----QKRHIEWMTKHGRVYADVKEENNRYVV 61

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR-HSVNKHVLMSHHKHHDH 118
           F+ +++ IE LN +  +  + +  + +F+DL+ +EF++ +     V+     S  K    
Sbjct: 62  FKNNVERIEHLN-SIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
            + +V   ++          PV  DWR+ G +  ++NQ +CG CWAFS V   E    +K
Sbjct: 121 RYQNVSSGAL----------PVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIK 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L  LS Q+++DC  N + GC GG      + +      L  ES YP   +DA C  K
Sbjct: 171 KGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIKATG-GLTTESNYPYKGEDATCNSK 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGS 296
            T+P    I  Y  D  +  E +++  +A H PV   +      +Q+Y  GV    C   
Sbjct: 229 KTNPKATSITGYE-DVPVNDEQALMKAVA-HQPVSVGIEGGGFDFQFYSSGVFTGECTTY 286

Query: 297 LANINHAVQIVGYD---NYSRTW 316
           L   +HAV  +GY    N S+ W
Sbjct: 287 L---DHAVTAIGYGESTNGSKYW 306


>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
 gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
          Length = 358

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 79/262 (30%), Positives = 125/262 (47%), Gaps = 28/262 (10%)

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVL 109
            E + RF+ F  + + IEE   NRQ  ++   G+  F+D++ +EFK  +    V   + +
Sbjct: 49  GEKERRFQIFRDNAEYIEE--HNRQVNQTYWLGLNNFADMTHDEFKALYFGTKVPLSNTI 106

Query: 110 MSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVE 169
            S  ++ D                  T +P+  DWR  G +  V+NQ  CG+CWAFSTV 
Sbjct: 107 KSGFRYED-----------------ATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTVA 149

Query: 170 TAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
             E ++ +  G L  LS QE++DC    N GC+GG   +  +++ +    L+ E++YP  
Sbjct: 150 AVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFI-IQNGGLDSEADYPYK 208

Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
               +C     + + V I  +  D    SE+ +L  +A   PV  A+ A    +Q Y GG
Sbjct: 209 AVSGSCDESRRNSHVVTIDGFE-DVPAESEADLLKAVANQ-PVSVAIEASGRNFQLYSGG 266

Query: 288 VIQYNCDGSLANINHAVQIVGY 309
           V   +C   L   +H V  VGY
Sbjct: 267 VYTGHCGYEL---DHGVVAVGY 285


>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ ++L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
          Length = 890

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 92/309 (29%), Positives = 139/309 (44%), Gaps = 36/309 (11%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
           +  L+ + FLA  V           E    +  RY K Y    E + RF+ F+++++ IE
Sbjct: 559 LAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIE 618

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
             N    + +  +  I +F+DL+ EEF     R+    H+  S  +     + +V     
Sbjct: 619 AFNN--AANKRYKLAINQFADLTNEEFIAP--RNRFKGHMCSSIIRTTTFKYENV----- 669

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
                  T +P   DWR+ G +  +++Q  CG CWAFS V   E +HAL +G L  LS Q
Sbjct: 670 -------TAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQ 722

Query: 189 EVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKATSP 242
           E++DC   G + GC GG    L+D  D  K V     L  E+ YP    D  C     + 
Sbjct: 723 ELVDCDTKGVDQGCEGG----LMD--DAFKFVIQNHGLNTEANYPYKGVDGKCNANEAAN 776

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
           + V I  Y  D    +E ++   +A   PV  A++A    +Q+Y  GV   +C   L   
Sbjct: 777 DVVTITGYE-DVPANNEKALQKAVANQ-PVSVAIDASGSDFQFYKSGVFTGSCGTEL--- 831

Query: 301 NHAVQIVGY 309
           +H V  VGY
Sbjct: 832 DHGVTAVGY 840


>gi|91092022|ref|XP_970951.1| PREDICTED: similar to cathepsin l [Tribolium castaneum]
 gi|270001246|gb|EEZ97693.1| cathepsin L precursor [Tribolium castaneum]
          Length = 343

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 93/315 (29%), Positives = 148/315 (46%), Gaps = 35/315 (11%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
           ++F+  ++A     + + V+  NL Q  E + +F+  Y KSY+  E +    NF + +  
Sbjct: 5   LVFVATVVAFAKSQLSIGVTLENLLQ--EEWMAFKLTYNKSYASPEEE----NFRREI-F 57

Query: 67  IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           IE  N+++ +  +  YG  ++S + +           +N    M HH+ H   +   +  
Sbjct: 58  IE--NRHKIARFNQEYGRGQWSFVQQ-----------LNNFADMLHHEFHRTLNGFNRTL 104

Query: 127 SITTGIT-----IPTG---IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
           S   GI      IP+     P   DWRE G +  V+NQ +C  CWAFS     E  +  K
Sbjct: 105 SARVGIPQSSTFIPSANVIFPDYVDWREVGAVTPVKNQGSCAGCWAFSAAGALEGHNFRK 164

Query: 179 NGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
            G L  LS Q +IDC+ N GN GCSGG      +++  N  + + E  YP   ++  C+ 
Sbjct: 165 TGRLVELSPQNLIDCSTNYGNDGCSGGLMNPAYEYVRTNPGI-DTEDSYPYEARNGPCRF 223

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYNCD 294
           +  +  G     Y  D     E  +   IAT GPV AA++A   ++Q+Y  G+     C 
Sbjct: 224 RPETV-GAYCTGYV-DIAEGDEQGLEAAIATLGPVSAAMDAGRQSFQFYSDGIYYDPQCG 281

Query: 295 GSLANINHAVQIVGY 309
               ++NHAV +VGY
Sbjct: 282 NRPDDVNHAVLVVGY 296


>gi|33333710|gb|AAQ11973.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 89/276 (32%), Positives = 128/276 (46%), Gaps = 27/276 (9%)

Query: 40  FQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSEEEFKT 97
           F+  + K+Y S  E   RF  F+K+L  I+E NK   +  ES    +T+F+D++ EEF  
Sbjct: 26  FKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFLD 85

Query: 98  RHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQ 157
                 V    L S+  H D+  +          I +     V  DWRE G +  V++Q 
Sbjct: 86  LLKLQGV--PALPSNAVHFDNFED----------IDMEEKDAV--DWREEGAVTPVKDQA 131

Query: 158 TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALLDWMDV 215
            CG+CWAFS V   E     KNGTL  LS QE++DCA    GN GC GG      D+  V
Sbjct: 132 NCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDF--V 189

Query: 216 NKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAA 275
               ++ E  YP   + ++CK+       VK   +  D     E  +   +A  GPV  A
Sbjct: 190 QDEGIQTEESYPYEGRRSSCKKSGEYVTKVKTYVFPLD-----EQEMARTVAAKGPVAVA 244

Query: 276 VNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
           + A    +Y  G++   C  S    ++N  V +VGY
Sbjct: 245 IEASQLSFYDKGIVDERCRCSNKREDLNPGVLVVGY 280


>gi|71482944|gb|AAZ32411.1| cysteine proteinase glycinain type [Nicotiana benthamiana]
          Length = 355

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 86/288 (29%), Positives = 132/288 (45%), Gaps = 38/288 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F+ ++ K Y S+ EHD RFK F+ +L       +++    SA +GIT+FSDL+  EF
Sbjct: 49  FSLFKSKFGKIYASEEEHDHRFKVFKANL---RRARRHQLLDPSAEHGITKFSDLTPSEF 105

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           +  +L                   H    K +      +PT  +P   DWR+ G +  V+
Sbjct: 106 RRTYLGL-----------------HKPKPKLNAEKAPILPTSDLPADYDWRDHGAVTGVK 148

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FST    E  H L  G L  LS Q+++DC          + + GCSGG  
Sbjct: 149 NQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDSCDAGCSGGLM 208

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               ++  +    L+ E +YP   K   C     S     + +++   L   E  I  ++
Sbjct: 209 TTAFEYT-LKAGGLQREKDYPYTGKXGKCHFD-KSKIAAAVTNFSVIGL--DEDQIAANL 264

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGYDNYS 313
             HGP+   +NA   Q Y+GGV     C       +H V +VGY ++ 
Sbjct: 265 VKHGPLAVGINAAWMQTYVGGVSCPLIC---FKRQDHGVLLVGYGSHG 309


>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
 gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
          Length = 326

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 92/310 (29%), Positives = 147/310 (47%), Gaps = 33/310 (10%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKL--ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
            ++ALI  CFL I    +     QK     F ++  +++KSY+  E   R+  F+ ++DI
Sbjct: 3   LVLALI-FCFLIINCCSAARIFSQKQYQTAFQNWMVKHQKSYTNDEFGSRYSVFQDNMDI 61

Query: 67  IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           + + N   Q   +   G+   +DL+ EEFK  +L    N                   K+
Sbjct: 62  VAKWN---QKGSNTILGLNVMADLTNEEFKKLYLGTKANV----------------TYKK 102

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
               G++   G+P   DWR  G +  V+NQ  CG C+AFST  + E +H + +  L  LS
Sbjct: 103 KTLVGVS---GLPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLS 159

Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
            Q+++DC+G+ GN GC GG      +++ +    L+ E+ YP   +   CK    +  G 
Sbjct: 160 EQQILDCSGSEGNNGCDGGLMTNSFEYI-IAVGGLDTEASYPYTGEVGKCKFNKKNI-GA 217

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
            I  Y  +    SES + T +A   PV  A++A   ++Q Y  GV  Y  + S   ++H 
Sbjct: 218 TITGYK-NVESGSESDLQTAVAAQ-PVSVAIDASQSSFQLYASGVY-YEPECSSTQLDHG 274

Query: 304 VQIVGYDNYS 313
           V  VGY + S
Sbjct: 275 VLAVGYGSQS 284


>gi|118156|sp|P14658.1|CYSP_TRYBB RecName: Full=Cysteine proteinase; Flags: Precursor
 gi|10393|emb|CAA34485.1| unnamed protein product [Trypanosoma brucei]
          Length = 450

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
           V+ V   V L+A+      V +   ++E+ LE+ F++F+++Y K Y  + E   RF+ FE
Sbjct: 7   VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E+      +   A +G+T FSD++ EEF+ R+              ++   +  
Sbjct: 67  ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109

Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             +KR   T + + TG  P   DWRE G +  V+ Q  CG+CWAFST+   E    +   
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGN 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q ++ C    + GC+GG      +W+ + N   +  E+ YP +  +   ++  
Sbjct: 169 PLVSLSEQMLVSC-DTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225

Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
              NG +I +   D   L   E +I   +A +GP+  AV+A ++  Y GG++  +C  + 
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDAESFMDYNGGILT-SC--TS 282

Query: 298 ANINHAVQIVGYDNYS 313
             ++H V +VGY++ S
Sbjct: 283 KQLDHGVLLVGYNDNS 298


>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y    +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGI-SRESDYEYQGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
 gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 75/262 (28%), Positives = 127/262 (48%), Gaps = 22/262 (8%)

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
           SE D RF+ F+ +L  I+E N   ++    + G+  F+DLS EE+++R+L   ++   +M
Sbjct: 70  SEKDKRFEIFKDNLKFIDEHNAENRT---YKVGLNRFADLSNEEYRSRYLGTKIDPIGMM 126

Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
                        K RS     ++   +P   DWR  G + +V++Q +CG+CWAFST+  
Sbjct: 127 ---------MARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAA 177

Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLL 230
            E ++ +  G L  LS QE++DC    N GC GG      +++ +N   ++ + +YP   
Sbjct: 178 VEGINKIVTGELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFI-INNGGIDSDEDYPYRG 236

Query: 231 KDAACKRKATSPNGVKIKSYTCDTLIPSESSI-LTDIATHGPVIAAVNA--LTWQYYLGG 287
            D  C +   +   V I  Y     +P+   + L     + P+  A+ A    +Q Y+ G
Sbjct: 237 VDGKCDQYKKNARVVSIDDY---EQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSG 293

Query: 288 VIQYNCDGSLANINHAVQIVGY 309
           +    C  +L   +H V  VGY
Sbjct: 294 IFTGKCGTAL---DHGVTAVGY 312


>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
          Length = 803

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 88/276 (31%), Positives = 135/276 (48%), Gaps = 29/276 (10%)

Query: 45  KKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHS 103
           ++SY  +E    RF+ F  ++   + L K  Q   +A+YG+T FSD+S +EFK       
Sbjct: 508 QRSYKTTEELKKRFRIFRANMKKADYLQKTEQG--TAKYGVTIFSDISSKEFK------- 558

Query: 104 VNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACW 163
             KH L    +  D     +K +     I   T +P + DWR    +  V+NQ  CG+CW
Sbjct: 559 --KHYLGLKKRTPD-----IKFKQEMAQIPNIT-LPEEYDWRNYNAVTPVKNQGMCGSCW 610

Query: 164 AFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPE 223
           AFS     E  +A+K G L  LS QE++DC    + GC GG F      ++     LE E
Sbjct: 611 AFSVTGNIEGQYAIKTGNLVSLSEQELVDCDKYDD-GCEGGLFETAYHAIE-ELGGLELE 668

Query: 224 SEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY 283
           S+YP   +D  C   ++    V++   +   +   E+ +   +  +GP+   +NA   Q+
Sbjct: 669 SDYPYSGRDNTCHFNSSE---VRVSITSSVNISNDETDMAKWLVANGPISIGINANAMQF 725

Query: 284 YLGGV---IQYNCDGSLANINHAVQIVGYDNYSRTW 316
           YLGGV   +++ CD     ++H V IVGY    RTW
Sbjct: 726 YLGGVSHPLKFLCDPK--TLDHGVLIVGY-GIHRTW 758


>gi|3377952|emb|CAA08906.1| cysteine proteinase [Cicer arietinum]
          Length = 362

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 83/286 (29%), Positives = 143/286 (50%), Gaps = 36/286 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F++F+ ++ KSY +K EHD RF  F+ +L   ++   +++   SA +G+T+FSDL+  EF
Sbjct: 47  FTTFKSKFSKSYATKEEHDYRFGVFKSNL---KKAKLHQKLDPSAEHGVTKFSDLTASEF 103

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L   + K + +  H          +K  I     +PT  +P   DWRE G +  V+
Sbjct: 104 RRQFL--GLKKRLRLPAH---------AQKAPI-----LPTNNLPEDFDWREKGAVTPVK 147

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
           +Q +CG+CWAFST    E  + L  G L  LS Q+++DC          + + GC+GG  
Sbjct: 148 DQGSCGSCWAFSTTGALEGANYLATGKLVSLSEQQLVDCDHVCDPDEYNSCDSGCNGGLM 207

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               +++  +  V+  E +Y    +D +CK   +    +         +   E  I  ++
Sbjct: 208 NNAFEYLLQSGGVVR-EQDYSYTGRDGSCKFDKSK---IAASVSNFSVVSVDEDQIAANL 263

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGYDN 311
             +GP+  A+NA   Q Y+ GV   Y C  + + ++H V +VG+ N
Sbjct: 264 VKNGPLAVAINAAWMQTYMSGVSCPYIC--AKSRLDHGVLLVGFGN 307


>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 385

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 91/307 (29%), Positives = 144/307 (46%), Gaps = 30/307 (9%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
           +L ++A+I L     P     PNL Q  E F +  +  KK  S  E  +R   FE++   
Sbjct: 58  LLAVLAVIGLASALSP----NPNLNQHWENFKA--EHNKKYESFPEELMRRLIFEENHQF 111

Query: 67  IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           IE+ N  ++       G+  F DL+ +E++ R+L              +    +   K  
Sbjct: 112 IEDHNSKKEF--DFYLGMNHFGDLTNKEYRERYL-------------GYRRPENTPSKAS 156

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
            I +       +P + DWR+ G +  V+NQ  CG+CWAFS V + E  H    G L  LS
Sbjct: 157 YIFSRAEKIEDVPDQIDWRDQGFVTPVKNQGQCGSCWAFSAVGSLEGQHFKSTGKLVSLS 216

Query: 187 VQEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
            Q ++DC+   GN GC+GG      +++  N  + + E  YP +  D +C  K  S  G 
Sbjct: 217 EQNLVDCSTPEGNSGCNGGWMDQAFEYVKDNHGI-DTEDSYPYVGTDGSCHFKNKSI-GA 274

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDG-SLANINH 302
            +K +  D     E ++   +   GPV  A++A  + +Q+Y GGV  YN    S + ++H
Sbjct: 275 TLKGFM-DVKEGDEEALRQAVGVAGPVSVAIDASSMLFQFYRGGV--YNVPWCSTSELDH 331

Query: 303 AVQIVGY 309
            V +VGY
Sbjct: 332 GVLVVGY 338


>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
 gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
          Length = 477

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 132/286 (46%), Gaps = 31/286 (10%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F  R++K Y+ K E   RF+ F+K+  +I EL KN Q   +A YG T+FSD++  EF
Sbjct: 174 FLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQG--TAVYGFTKFSDMTTMEF 231

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           K   L +   + V      + + H   + +  +          P   DWRE G + +V+N
Sbjct: 232 KKIMLPYQWEQPVYPMEQANFEKHDVTINEEDL----------PESFDWREKGAVTQVKN 281

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
           Q  CG+CWAFST    E    +    L  LS QE++DC  + + GC+GG        + +
Sbjct: 282 QGNCGSCWAFSTTGNVEGAWFIAKNKLVSLSEQELVDC-DSMDQGCNGGLPSNAYKEI-I 339

Query: 216 NKVVLEPESEYPLLLKDAAC----KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
               LEPE  YP   +   C    K  A   NG          L   E  +   + T GP
Sbjct: 340 RMGGLEPEDAYPYDGRGETCHLVRKDIAVYING-------SVELPHDEVEMQKWLVTKGP 392

Query: 272 VIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGYDNYSR 314
           +   +NA T Q+Y  GV+   +  C+  +  +NH V IVGY    R
Sbjct: 393 ISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVGYGKDGR 436


>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
          Length = 326

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 89/304 (29%), Positives = 142/304 (46%), Gaps = 30/304 (9%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIE 68
           +   I L F ++     K       E +  F+ R  KSY    E   RF  F+ SL  IE
Sbjct: 3   VFVFILLAFASVHALSDK-------EEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIE 55

Query: 69  ELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
             N K      + + G+T+F+DL+E+EF        +++    S  +             
Sbjct: 56  NHNDKYDHGLSTFKLGVTKFADLTEKEFSDML---GISRSTKSSRPR------------- 99

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
           +   +T    +P K DWRE G + +V++Q +CG+CW+FST  T E  + LK G L  LS 
Sbjct: 100 VIHSLTPVKDLPSKFDWREKGAVTEVKDQGSCGSCWSFSTTGTVEGAYFLKTGKLVSLSE 159

Query: 188 QEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
           Q ++DCA     GCSGG     L++++    ++  E++YP    D  C R  +S    KI
Sbjct: 160 QNLVDCAKEDCYGCSGGYMDKALEYIETAGGIM-SENDYPYEGIDDKC-RFDSSKVAAKI 217

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVI-QYNCDGSLANINHAVQ 305
            ++T       E  +   +   GP+  A++A   +Q Y  G++   +C     ++NH V 
Sbjct: 218 SNFTY-IKKNDEDDLKNAVIAKGPISVAIDASFNFQLYDSGILDDSSCYSDFNSLNHGVL 276

Query: 306 IVGY 309
           +VGY
Sbjct: 277 VVGY 280


>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
          Length = 461

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 87/277 (31%), Positives = 129/277 (46%), Gaps = 27/277 (9%)

Query: 46  KSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV 104
           +SY+   E + RF+ F  +L  ++  N         R G+  F+DL+ +EF++  L   V
Sbjct: 58  RSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEFRSTFLGAKV 117

Query: 105 NKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWA 164
              V  S      + H+ V++            +P   DWRE G +  V+NQ  CG+CWA
Sbjct: 118 ---VERSRAAGERYRHDGVEE------------LPESVDWREKGAVAPVKNQGQCGSCWA 162

Query: 165 FSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPE 223
           FS V T ES++ L  G +  LS QE+++C+ NG N GC+GG      D++ +    ++ E
Sbjct: 163 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFI-IKNGGIDTE 221

Query: 224 SEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTW 281
            +YP    D  C     +   V I  +  D     E S+   +A H PV  A+ A    +
Sbjct: 222 DDYPYKAVDGKCDINRENAKVVSIDGFE-DVPQNDEKSLQKAVA-HQPVSVAIEAGGREF 279

Query: 282 QYYLGGVIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
           Q Y  GV    C  SL   +H V  VGY  DN    W
Sbjct: 280 QLYHSGVFSGRCGTSL---DHGVVAVGYGTDNGKDYW 313


>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
          Length = 366

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 85/284 (29%), Positives = 140/284 (49%), Gaps = 37/284 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS+F+ ++ K+Y ++ EHD RF+ F+ +L  +   +  +  P SA +G+T FSDL+  EF
Sbjct: 51  FSAFKTKFAKTYATQEEHDHRFRIFKNNL--LRAKSHQKLDP-SAVHGVTRFSDLTPSEF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L   +    L S            +K  I     +PT  +P   DWR+ G +  V+
Sbjct: 108 RGQFL--GLKPLRLPS----------DAQKAPI-----LPTSDLPTDFDWRDHGAVTGVK 150

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FS V   E  H L  G L  LS Q+++DC         G  + GC+GG  
Sbjct: 151 NQGSCGSCWSFSAVGALEGAHFLSTGGLVSLSEQQLVDCDHECDPEERGACDSGCNGGLM 210

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               ++  +    L  E +YP   +D    +   S     + +++  +L   E  I  ++
Sbjct: 211 TTAFEYT-LKAGGLMREEDYPYTGRDRGPCKFDKSKIAASVANFSVVSL--DEEQIAANL 267

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             +GP+   +NA+  Q Y+GGV   Y C     +++H V +VGY
Sbjct: 268 VKNGPLAVGINAVFMQTYIGGVSCPYICG---KHLDHGVLLVGY 308


>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
          Length = 341

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 84/273 (30%), Positives = 128/273 (46%), Gaps = 30/273 (10%)

Query: 40  FQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRH 99
           + + YK +  KS+   R+K F+ ++  IE  NK     +S +  I EF+DL+ EEF  R 
Sbjct: 46  YGREYKDADEKSK---RYKIFKDNVARIESFNKAMD--KSYKLSINEFADLTNEEF--RA 98

Query: 100 LRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTC 159
            R+    H+  +      + +               T +P   DWR+ G +  +++Q  C
Sbjct: 99  SRNRFKAHICSTEATSFKYEN--------------VTAVPSTVDWRKKGAVTPIKDQGQC 144

Query: 160 GACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKV 218
           G+CWAFS V   E +  L  G L  LS QE++DC  +G + GCSGG       +++ N  
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNH- 203

Query: 219 VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA 278
            L  E+ YP    D  C RK  +    KI  Y  D    +E ++   +A H P+  A++A
Sbjct: 204 GLTTEANYPYAGTDGTCNRKKAAHPAAKINGYE-DVPANNEKALQKAVA-HQPIAVAIDA 261

Query: 279 --LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               +Q+Y  GV    C   L   +H V  VGY
Sbjct: 262 SGSEFQFYSSGVFTGQCGTEL---DHGVAAVGY 291


>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
 gi|255645733|gb|ACU23360.1| unknown [Glycine max]
          Length = 362

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 88/313 (28%), Positives = 150/313 (47%), Gaps = 33/313 (10%)

Query: 8   LFIVALIALCFLAIPVKVSK----PNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEK 62
            FIV +   C L++ +  ++     + E+  +LF ++Q+ +K+ Y   E    RF+ F+ 
Sbjct: 12  FFIVLVSFTCSLSLAMSSNQLEQFASEEEVFQLFQAWQKEHKREYGNQEEKAKRFQIFQS 71

Query: 63  SLDIIEELNKNRQSPESA-RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +L  I E+N  R+SP +  R G+ +F+D+S EEF   +L     K + M +        N
Sbjct: 72  NLRYINEMNAKRKSPTTQHRLGLNKFADMSPEEFMKTYL-----KEIEMPYS-------N 119

Query: 122 HVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
              ++ +  G       +P   DWR+ G + +VR+Q  C + WAFS     E ++ +  G
Sbjct: 120 LESRKKLQKGDDADCDNLPHSVDWRDKGAVTEVRDQGKCQSHWAFSVTGAIEGINKIVTG 179

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L  LSVQ+V+DC    + GC+GG +     ++  N  + + E+ YP   ++  CK  A 
Sbjct: 180 NLVSLSVQQVVDC-DPASHGCAGGFYFNAFGYVIENGGI-DTEAHYPYTAQNGTCKANAN 237

Query: 241 SPNGVKIKSYTCDTL---IPSESSILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGS 296
                  K  + D L   +  E ++L  ++   PV  +++A   Q+Y GGV    NC  +
Sbjct: 238 -------KVVSIDNLLVVVGPEEALLCRVSKQ-PVSVSIDATGLQFYAGGVYGGENCSKN 289

Query: 297 LANINHAVQIVGY 309
                    IVGY
Sbjct: 290 STKATLVCLIVGY 302


>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
 gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
          Length = 358

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 79/262 (30%), Positives = 125/262 (47%), Gaps = 28/262 (10%)

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVL 109
            E + RF+ F  + + IEE   NRQ  ++   G+  F+D++ +EFK  +    V   + +
Sbjct: 49  GEKERRFQIFRDNAEYIEE--HNRQVNQTYWLGLNNFADMTHDEFKALYFGTKVPLSNTI 106

Query: 110 MSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVE 169
            S  ++ D                  T +P+  DWR  G +  V+NQ  CG+CWAFSTV 
Sbjct: 107 KSGFRYKD-----------------ATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTVA 149

Query: 170 TAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
             E ++ +  G L  LS QE++DC    N GC+GG   +  +++ +    L+ E++YP  
Sbjct: 150 AVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFI-IQNGGLDSEADYPYK 208

Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
               +C     + + V I  +  D    SE+ +L  +A   PV  A+ A    +Q Y GG
Sbjct: 209 AVSGSCDESRRNSHVVTIDGFE-DVPAESEADLLKAVANQ-PVSVAIEASGRNFQLYSGG 266

Query: 288 VIQYNCDGSLANINHAVQIVGY 309
           V   +C   L   +H V  VGY
Sbjct: 267 VYTGHCGYEL---DHGVVAVGY 285


>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
           Precursor
 gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
          Length = 351

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 89/308 (28%), Positives = 146/308 (47%), Gaps = 31/308 (10%)

Query: 7   VLFIVALIALCFL-AIPVKVSK--PNLEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEK 62
           V  +   + LC + A P   S+  PN +  ++ F  +   Y + Y   +  +R F+ F+ 
Sbjct: 5   VQLVFLFLFLCAMWASPSAASRDEPN-DPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKN 63

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           ++  IE  N   ++  S   GI +F+D+++ EF  ++   S+  ++        D     
Sbjct: 64  NVKHIETFNSRNEN--SYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDD---- 117

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
                    + I + +P   DWR+ G + +V+NQ  CG+CW+F+ + T E ++ +K G L
Sbjct: 118 ---------VNI-SAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYL 167

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
             LS QEV+DCA   + GC GG      D++  N  V   E  YP L     C   +  P
Sbjct: 168 VSLSEQEVLDCA--VSYGCKGGWVNKAYDFIISNNGVTT-EENYPYLAYQGTCNANSF-P 223

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQYNCDGSLANIN 301
           N   I  Y+       E S++  ++   P+ A ++A   +QYY GGV    C  SL   N
Sbjct: 224 NSAYITGYSY-VRRNDERSMMYAVSNQ-PIAALIDASENFQYYNGGVFSGPCGTSL---N 278

Query: 302 HAVQIVGY 309
           HA+ I+GY
Sbjct: 279 HAITIIGY 286


>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
          Length = 478

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 137/301 (45%), Gaps = 34/301 (11%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
            K+ KP        F  F  R++K Y +K E   RF+ F+++  +I EL KN Q   +A 
Sbjct: 162 AKIIKPRDYVIWNSFLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQG--TAV 219

Query: 82  YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI-PTGIPV 140
           YG T+FSD++  EFK   L +   + V M                    G+TI    +P 
Sbjct: 220 YGFTKFSDMTTMEFKETMLPYQWEQPVPMDQANFEKE------------GVTISEEDLPD 267

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
             DWRE G + +V+NQ +CG+CWAFST    E    L    L  LS QE++DC  + + G
Sbjct: 268 SFDWREHGAVTQVKNQGSCGSCWAFSTTGNIEGAWFLAKKKLVSLSEQELVDCD-SVDQG 326

Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC----KRKATSPNGVKIKSYTCDTLI 256
           C+GG        + +    LEPE  YP   +   C    K  A   NG          L 
Sbjct: 327 CNGGLPSNAYKEI-IRMGGLEPEDAYPYDGRGETCHLVRKDIAVYING-------SVELP 378

Query: 257 PSESSILTDIATHGPVIAAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGYDNYS 313
             E  +   + T GP+   +NA T Q+Y  GV+   +  C+  +  +NH V IVGY    
Sbjct: 379 HDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFM--LNHGVLIVGYGKDG 436

Query: 314 R 314
           R
Sbjct: 437 R 437


>gi|224082940|ref|XP_002306900.1| predicted protein [Populus trichocarpa]
 gi|118481986|gb|ABK92924.1| unknown [Populus trichocarpa]
 gi|222856349|gb|EEE93896.1| predicted protein [Populus trichocarpa]
          Length = 367

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 83/284 (29%), Positives = 137/284 (48%), Gaps = 36/284 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F++F+ ++ K+Y ++ EHD RF  F+ +L       K++    +A +G+T+FSDL+ +EF
Sbjct: 51  FTTFKSKFGKNYATQEEHDYRFSVFKANL---LRAKKHQIMDPTAAHGVTKFSDLTPKEF 107

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L       +    +K                   +PTG +P   DWR+ G +  V+
Sbjct: 108 RRQLLGLKRRLRLPTDANK----------------APILPTGDLPTDFDWRDHGAVTSVK 151

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA--------GNGNMGCSGGDF 206
           +Q +CG+CW+FS     E  H L  G L  LS Q+++DC         G  + GCSGG  
Sbjct: 152 DQGSCGSCWSFSATGALEGAHYLATGELVSLSEQQLVDCDHECDPEEYGACDSGCSGGLM 211

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               ++  +    LE E +YP    D    +   S     + +++  +L   E  I  ++
Sbjct: 212 NNAFEYA-LKAGGLEREKDYPYTGNDRGACKFEKSKVAASVSNFSVVSL--DEDQIAANL 268

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             HGP+  A+NA+  Q Y+GGV   Y C     + +H V +VGY
Sbjct: 269 VKHGPLSVAINAVFMQTYIGGVSCPYICS---KHQDHGVLLVGY 309


>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 84/273 (30%), Positives = 128/273 (46%), Gaps = 30/273 (10%)

Query: 40  FQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRH 99
           + + YK +  KS+   R+K F+ ++  IE  NK     +S +  I EF+DL+ EEF  R 
Sbjct: 46  YGREYKDADEKSK---RYKIFKDNVARIESFNKAMD--KSYKLSINEFADLTNEEF--RA 98

Query: 100 LRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTC 159
            R+    H+  +      + +               T +P   DWR+ G +  +++Q  C
Sbjct: 99  SRNRFKAHICSTEATSFKYEN--------------VTAVPSTVDWRKKGAVTPIKDQGQC 144

Query: 160 GACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKV 218
           G+CWAFS V   E +  L  G L  LS QE++DC  +G + GCSGG       +++ N  
Sbjct: 145 GSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNH- 203

Query: 219 VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA 278
            L  E+ YP    D  C RK  +    KI  Y  D    +E ++   +A H P+  A++A
Sbjct: 204 GLTTEANYPYAGTDGTCNRKKAAHPAAKINGYE-DVPANNEKALQKAVA-HQPIAVAIDA 261

Query: 279 --LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               +Q+Y  GV    C   L   +H V  VGY
Sbjct: 262 SGSEFQFYSSGVFTGQCGTEL---DHGVAAVGY 291


>gi|194689248|gb|ACF78708.1| unknown [Zea mays]
 gi|414885653|tpg|DAA61667.1| TPA: cysteine protease2 [Zea mays]
          Length = 360

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 87/278 (31%), Positives = 129/278 (46%), Gaps = 27/278 (9%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY KSY S +E   RF+ F +SL ++   N+   S    R GI  F+D+S EEF
Sbjct: 59  FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLS---YRLGINRFADMSWEEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L  +H+                       +P  KDWRE GI+  V+
Sbjct: 116 RATRLGAAQNCSATLTGNHRMR----------------AAAVALPETKDWREDGIVSPVK 159

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
           NQ  CG+CW FST    E+ +    G    LS Q++IDC     N GC+GG      +++
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYI 219

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP    +  CK K  +  GVK+   + +  + +E  +   +    PV 
Sbjct: 220 KYNG-GLDTEESYPYQGVNGICKFKNENV-GVKVLD-SVNITLGAEDELKDAVGLVRPVS 276

Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
            A   +T ++ Y  GV   + C  +  ++NHAV  VGY
Sbjct: 277 VAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGY 314


>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
          Length = 362

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 91/298 (30%), Positives = 142/298 (47%), Gaps = 28/298 (9%)

Query: 19  LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSP 77
           L + VK S   L    E +  F+  + K Y   E +I RF  F  +L+ IEE N+     
Sbjct: 37  LKLQVKAS-TRLGPYHETWKEFKTLFGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMG 95

Query: 78  ESARY-GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
           + + Y G+ +FSD+S +E+    LRH+  +            +  + K     +      
Sbjct: 96  QKSYYMGVNQFSDMSHDEY----LRHNGLRR----------GNRKYSKGEGCDSYTKSGK 141

Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN 196
            +  K DWR+ G +  V+NQ  CG+CW+FST  + E  H  + G L  LS Q+++DC+G 
Sbjct: 142 QLDDKVDWRDKGYVTPVKNQGQCGSCWSFSTTGSLEGQHFRQTGKLISLSEQQLVDCSGT 201

Query: 197 -GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
            GN GC+GG      +++  +   LE E +YP   K   C  K +     K     C  +
Sbjct: 202 FGNEGCNGGLMDNAFEYIK-SIGGLEGEDDYPYTAKQGKCHLKKSL---FKANDTGCTDV 257

Query: 256 IPSESSILTD-IATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
              +   L D +A+ GP+  A++A   ++Q Y GGV  +  C  S  N++H V  VGY
Sbjct: 258 ESGDEDALKDALASVGPISVAIDASHASFQSYDGGVYDEEEC--SSQNLDHGVLTVGY 313


>gi|357621272|gb|EHJ73161.1| putative C1A cysteine protease precursor [Danaus plexippus]
          Length = 545

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 85/276 (30%), Positives = 133/276 (48%), Gaps = 19/276 (6%)

Query: 36  LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           +F+ F Q++ K+Y   EH+ R K FE +L  IEE N+   S ++ +  I +F+DL+ +E 
Sbjct: 242 VFAEFMQKHNKNYDGPEHEQRRKIFETNLRKIEEHNR---SNKNFKLAINKFADLTHKEM 298

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           + R     + +    S      +  + + + S T        +P + D R  G++  V++
Sbjct: 299 EKRK---GLKRRGKSSGAIPFPYSKSKIAEMSDT--------LPKEYDARMYGLVTSVKD 347

Query: 156 QQTCGACWAFSTVETAE-SMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
           QQ CG+CW F T    E ++  +  G L  L+ Q +IDCA G  N GC GG       WM
Sbjct: 348 QQDCGSCWTFGTTSAVEGALARINGGRLMRLANQALIDCAWGYENFGCDGGTDTGAYHWM 407

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
            +N  +   E   P + KD  C R        KIK +T  T    E ++   +  HGP+ 
Sbjct: 408 -LNYGMPTEEEYGPYVNKDGFC-RIHNMTQTYKIKGFTNVTPYSVE-ALKVALVNHGPLS 464

Query: 274 AAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +++A     Y  G I  + D S  N+NH V +VGY
Sbjct: 465 VSIDATDMLTYYNGGIYSDSDCSTTNLNHEVTLVGY 500


>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
 gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 91/325 (28%), Positives = 148/325 (45%), Gaps = 34/325 (10%)

Query: 2   FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFE 61
           F+ KN   I+ L  +  L   + +S   LE+  +      + YK +   +E + RF+ F+
Sbjct: 4   FNQKNQYNILTLFFILTLWTSLVISSRLLEKHEQWMEEHGKFYKDA---AEKEQRFQIFK 60

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           ++L+ IE  N            I +F D + +EFK  +L     K  L+          +
Sbjct: 61  ENLEFIESFNA--AGDNGFNLSINQFGDQTNDEFKANYLNGK--KKPLIGVGIAAIEEES 116

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
             +  ++T        +P   DWRE G +  +++Q  CG+CWAF+TV   E +H +  G 
Sbjct: 117 VFRYENVTE-------VPATMDWRERGAVTPIKHQHLCGSCWAFATVAAIEGIHQITTGR 169

Query: 182 LSLLSVQEVIDCA-GNGNMGCSGG---DFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
           L  LS QE++DC   N   GC+GG   D C  +    V K  +  E+ YP    D  C  
Sbjct: 170 LVSLSEQELVDCVKTNTTDGCNGGYVEDACDFI----VKKGGITSETNYPYTRVDGKCNV 225

Query: 238 KATSPNGVKIKSYTCDTLIPS--ESSILTDIATHG-PVIAAVNALTWQYYLGGVIQYNCD 294
           +  + N  KIK Y     +P+  E ++L  +A     V  A     +Q+Y  G+++  C 
Sbjct: 226 RKGTYNVAKIKGY---EHVPANNEKALLKAVANQPIAVYIAATKRAFQFYSSGILKGKCG 282

Query: 295 GSLANINHAVQIVGY---DNYSRTW 316
               +++H V IVGY   D+  + W
Sbjct: 283 ---IDLDHTVTIVGYGTSDDGVKYW 304


>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
          Length = 367

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 82/280 (29%), Positives = 136/280 (48%), Gaps = 28/280 (10%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           E+ ++LF+S+   + K Y   +  + RF+ F+ +L+ I+E NK   S    R G+ EF+D
Sbjct: 42  ERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNS---YRLGLNEFAD 98

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           LS +EF  +++   ++  +  S+ +             I   I     +P   DWR+ G 
Sbjct: 99  LSNDEFNEKYVGSLIDATIEQSYDEEF-----------INEDIV---NLPENVDWRKKGA 144

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  VR+Q +CG+CWAFS V T E ++ ++ G L  LS QE++DC    + GC GG     
Sbjct: 145 VTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSH-GCKGGYPPYA 203

Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
           L+++  N + L   S+YP   K   C+ K     G  +K+     + P+    L +    
Sbjct: 204 LEYVAKNGIHL--RSKYPYKAKQGTCRAKQVG--GPIVKTSGVGRVQPNNEGNLLNAIAK 259

Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
            PV   V +    +Q Y GG+ +  C      ++HAV  V
Sbjct: 260 QPVSVVVESKGRPFQLYKGGIFEGPCG---TKVDHAVTAV 296


>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
          Length = 361

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 88/294 (29%), Positives = 139/294 (47%), Gaps = 21/294 (7%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
           V  S+ +L     LF S+  ++ K Y S +E   R++ F+++L  I E N+   S     
Sbjct: 30  VGYSQEDLALPSSLFRSWSVKHGKLYASPTEKLERYEIFKQNLMHIAETNRKNGS---YW 86

Query: 82  YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
            G+ +F+D++ EEFK  +L     K  L              +  +   G      +P  
Sbjct: 87  LGLNQFADVAHEEFKASYLGL---KRALPRAGAPQTRTPTAFRYAAAAAG-----SLPWS 138

Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGC 201
            DWR  G +  V+NQ  CG+CWAFS+V   E ++ +  G L  LS QE++DC    + GC
Sbjct: 139 VDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSEQELVDCDTTLDHGC 198

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SE 259
            GG       +M +    +  E +YP L+++  CK K     G+  +  T    +P  SE
Sbjct: 199 EGGTMDLAFAYM-MGSQGIHAEDDYPYLMEEGYCKEKQPCVLGITEQDLTGFEDVPENSE 257

Query: 260 SSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
            S+L  +A H PV   + A +  +Q+Y GGV    C      ++HA+  VGY +
Sbjct: 258 ISLLKALA-HQPVSVGIAAGSRDFQFYRGGVFDGACS---VELDHALTAVGYGS 307


>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
 gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
          Length = 469

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 76/263 (28%), Positives = 128/263 (48%), Gaps = 29/263 (11%)

Query: 52  EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS 111
           E + RF+ F+ +L  I+E N   +S    + G+  F+DL+ EE+++ +L           
Sbjct: 70  EKERRFQVFKDNLRFIDEHNSENRS---YKVGLNRFADLTNEEYRSMYL----------- 115

Query: 112 HHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
                    N + + S      +   +P   DWR+ G + +V++Q +CG+CWAFST+   
Sbjct: 116 -GARSGAKRNRLSRSSNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAV 174

Query: 172 ESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPL 228
           E ++ +  G L  LS QE++DC  + N GC+GG    L+D+     +N   ++ E +YP 
Sbjct: 175 EGINKIVTGDLISLSEQELVDCDRSYNEGCNGG----LMDYAFQFIINNGGIDSEEDYPY 230

Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLG 286
           L +D  C     +   V I +Y  D  +  E ++   +A   PV  A+ A    +Q+Y  
Sbjct: 231 LARDGTCDTYRKNAKVVTIDNYE-DVPVNDEKALQKAVANQ-PVSVAIEAGGREFQFYQS 288

Query: 287 GVIQYNCDGSLANINHAVQIVGY 309
           G+    C  +L   +H V  VGY
Sbjct: 289 GIFTGRCGTAL---DHGVAAVGY 308


>gi|348531519|ref|XP_003453256.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 334

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 89/313 (28%), Positives = 155/313 (49%), Gaps = 32/313 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNF--- 60
           +K +L + A++A+   A    +S  +LE     F +++ +++KSY     +   K     
Sbjct: 1   MKLLLVVSAVLAVASCA---SISLEDLE-----FHAWKLKFEKSYDSESDEAHRKQVWLN 52

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
            +   ++  +  + Q  +S R G+T F+D+  EE+K            L+S    H  + 
Sbjct: 53  NRKFVLMHNILAD-QGLKSYRLGMTHFADMDNEEYKQ-----------LVSQGCLHTFNA 100

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
           +  ++ S   G+   T +P   DWR+ G + +V++Q+ CG+CWAFST    E  H  K G
Sbjct: 101 SLPERGSAFLGLPEGTALPDTVDWRDKGYVTEVKDQKQCGSCWAFSTTGVLEGQHFRKTG 160

Query: 181 TLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q+++DC+ + GN GC+GG     L ++  N  + + E+ YP   K   C+ K 
Sbjct: 161 KLVSLSEQQLMDCSHSFGNNGCNGGSVKRALQYIQANGGI-DTETSYPYKAKGQRCRYK- 218

Query: 240 TSPNGVKIKSYTCDTLIPS-ESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGS 296
             P+G+  K      + PS E ++   +AT GP+   ++A   ++Q+Y  GV   + D S
Sbjct: 219 --PDGIGAKCTGYVHVKPSNEETLKKAVATLGPISVGIDASRHSFQFYQSGVYD-DPDCS 275

Query: 297 LANINHAVQIVGY 309
              ++H    VGY
Sbjct: 276 KTVLDHGALAVGY 288


>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
          Length = 368

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 86/297 (28%), Positives = 142/297 (47%), Gaps = 34/297 (11%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
           V  ++P +    + FS F++++ K Y S  EHD RF  F+ +L       ++++   SA 
Sbjct: 37  VGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANL---RRARRHQKLDPSAT 93

Query: 82  YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
           +G+T+FSDL+  EF+ +HL        + S  K        + K +    I     +P  
Sbjct: 94  HGVTQFSDLTRSEFRKKHLG-------VRSGFK--------LPKDANKAPILPTENLPED 138

Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-------- 193
            DWR+ G +  V+NQ +CG+CW+FS     E  + L  G L  LS Q+++DC        
Sbjct: 139 FDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198

Query: 194 AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
           A + + GC+GG   +  +   +    L  E +YP   KD    +   S     + +++  
Sbjct: 199 ADSCDSGCNGGLMNSAFE-HTLKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVI 257

Query: 254 TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           ++   E  I  ++  +GP+  A+NA   Q Y+GGV   Y C      +NH V +VGY
Sbjct: 258 SI--DEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYIC---TRRLNHGVLLVGY 309


>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
          Length = 333

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 84/282 (29%), Positives = 138/282 (48%), Gaps = 42/282 (14%)

Query: 40  FQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGIT----EFSDLSEEEF 95
           ++  +++ Y  +E + R   +EK++ +IE  N         ++G T     F D++ EEF
Sbjct: 32  WKSTHRRLYDTNEEEWRRAVWEKNMKMIELHNGEYSE---GKHGFTMEMNAFGDMTNEEF 88

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +            L++ +KH  H    + +  +   + +P  +    DWRE G +  V+N
Sbjct: 89  RQ-----------LVNGYKHQKHRKGKLFQEPLM--LQLPKSV----DWREKGCVTPVKN 131

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMD 214
           Q  CG+CWAFS     E    LK G L  LS Q ++DC+ G GN GC+GG    L+D+  
Sbjct: 132 QGQCGSCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSRGEGNQGCNGG----LMDFAF 187

Query: 215 ---VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS-ESSILTDIATHG 270
              +N   L+ E  YP   KD  CK K          + T    IP  E +++  +AT G
Sbjct: 188 QYVLNNKGLDSEESYPYEAKDGTCKYKPE----FAAANDTGYVDIPQLEKALMKAVATVG 243

Query: 271 PVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           P+  A++A   ++Q+Y  G+  + NC  S  +++H V ++GY
Sbjct: 244 PIAVAIDASHPSFQFYSSGIYFEPNC--SSKDLDHGVLVIGY 283


>gi|146147376|gb|ABQ01982.1| cathepsin [Fasciola gigantica]
          Length = 326

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 88/311 (28%), Positives = 147/311 (47%), Gaps = 34/311 (10%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
           LFI+A++A+  L               +L+  +++ Y K Y+ ++ + R   +E+++  I
Sbjct: 3   LFILAVLAVGVLG-----------SNDDLWHQWKRMYNKEYNGADDEHRRNIWEENVKHI 51

Query: 68  EELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           +E N ++     +   G+ +F+D++ EEFK ++L        ++SH   ++ ++  V   
Sbjct: 52  QEHNLRHYLGFVTYTLGLNQFTDMTFEEFKAKYLTEMPRASDILSHGIPYEANNRAV--- 108

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
                       P K DWRE+G + +V++Q  CG+CWAFST  T E  +     T    S
Sbjct: 109 ------------PDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFS 156

Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
            Q+++DC+G  GNMGC GG      +++   +  LE ES YP    +  C+         
Sbjct: 157 EQQLVDCSGPWGNMGCMGGLMENAYEYL--KQFGLETESSYPYTAVEGQCRYNRQLGVAK 214

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVIQYNCDGSLANINHAV 304
               YT  +   SE  +   +   GP   AV+  + +  Y GG+ Q     SL  +NHAV
Sbjct: 215 VTDYYTVHS--GSEVELKNLVGAEGPAAVAVDVESDFMMYSGGIYQSRTCSSL-RVNHAV 271

Query: 305 QIVGYDNYSRT 315
             VGY   S T
Sbjct: 272 LAVGYGTQSGT 282


>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 364

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 148/314 (47%), Gaps = 37/314 (11%)

Query: 8   LFIVALIALCFL---AIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKS 63
           + I  L+ L F    A  + +   +  + ++++  +  +++K Y+   E + RF+ F+ +
Sbjct: 4   MLIPTLLLLSFTFSHATAMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDN 63

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHN 121
           L  I++ N    +      G+ +F+D++ EE++  +L  R    + V+ + +  H + +N
Sbjct: 64  LGFIQDHNAQNNT---YTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHRYAYN 120

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
              +            +PV  DWR  G +G +++Q  CG+CWAFSTV   E ++ +  G 
Sbjct: 121 SGDQ------------LPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGE 168

Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRK 238
              LS QE++DC    + GC+GG    L+D+     +    ++ E +YP    D  C + 
Sbjct: 169 FVSLSEQELVDCDREYDEGCNGG----LMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQT 224

Query: 239 ATSPNGVKIKSYTCDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDG 295
                 V+I  Y     +PS + + L    +H PV  A+ A     Q Y  GV    C  
Sbjct: 225 KKKTKVVQIDGYED---VPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGT 281

Query: 296 SLANINHAVQIVGY 309
           +L   +H V +VGY
Sbjct: 282 AL---DHGVVVVGY 292


>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 94/330 (28%), Positives = 150/330 (45%), Gaps = 47/330 (14%)

Query: 4   VKNVLFIVAL-IALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSYSK-SEHDIRFKN 59
            KN  + V+  + LC      +VS   L+     E    +  RY + Y    E + RF  
Sbjct: 3   TKNQFYQVSFALVLCLGLWAFQVSSRTLQDASMQERHEQWMARYGRVYKDLQEKEKRFSI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F+++++ IE  N     P   + G+ +F+DL+ EEF             + + +K   H 
Sbjct: 63  FKENVNYIEASNNAGDKP--YKLGVNQFADLTNEEF-------------IATRNKFKGHM 107

Query: 120 HNHVKKRSI--TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
            + + + +      +T P+ +    DWR+ G +  V+NQ TCG CWAFS V   E +H L
Sbjct: 108 SSSITRTTTFKYENVTAPSTV----DWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKL 163

Query: 178 KNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLK 231
             G L  LS QE++DC  +G + GC GG    L+D  D  K +     L  E++YP    
Sbjct: 164 STGNLVSLSEQELVDCDTSGADQGCQGG----LMD--DAFKFIIQNGGLNTEAQYPYQGV 217

Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
           D  C     + +   I  Y  D    +E ++   +A   P+  A++A    +Q Y  GV 
Sbjct: 218 DGTCNTNEEATHVATITGYE-DVPSNNEQALQQAVANQ-PISIAIDASGSDFQNYQSGVF 275

Query: 290 QYNCDGSLANINHAVQIVGY---DNYSRTW 316
             +C   L   +H V +VGY   D+ ++ W
Sbjct: 276 TGSCGTQL---DHGVAVVGYGVSDDGTKYW 302


>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
          Length = 320

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 83/305 (27%), Positives = 153/305 (50%), Gaps = 38/305 (12%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEE 69
           VA + LC LA+    + P+ +        F+ +Y + Y  ++ ++ R + F+++  +IE+
Sbjct: 2   VAALFLCGLALAT--ASPSWDH-------FKTQYGRKYGDAKEELYRQRVFQQNEQLIED 52

Query: 70  LNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            NK  ++ E + +  + +F D++ EEF           + +M  +K      +  + +++
Sbjct: 53  FNKKFENGEVTFKVAMNQFGDMTNEEF-----------NAVMKGYKKG----SRGEPKAV 97

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
            T    P    V  DWR   ++  V++Q+ CG+CWAFS     E  H LKN  L  LS Q
Sbjct: 98  FTAEAGPMAADV--DWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQ 155

Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
           +++DC+ + GN GC GG   +  D++  N  + + ES YP   +D +C+  A S   +  
Sbjct: 156 QLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI-DTESSYPYEAEDRSCRFDANSIGAICT 214

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINHAV 304
            S     +  +E ++   ++  GP+  A++A   ++Q+Y  GV  + NC  +   ++H V
Sbjct: 215 GSV---EVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTF--LDHGV 269

Query: 305 QIVGY 309
             VGY
Sbjct: 270 LAVGY 274


>gi|17384029|emb|CAD12392.1| cysteine proteinase [Leishmania infantum]
          Length = 354

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 92/285 (32%), Positives = 141/285 (49%), Gaps = 32/285 (11%)

Query: 37  FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGIT-EFSDLSEEE 94
           +  F++R+ K + + +E   RF  F++++     LN +      A Y ++ +F+DL+ +E
Sbjct: 42  YGRFKKRHGKPFGEDAEEGRRFNAFKQNMQTAYFLNAHN---PHAHYDVSGKFADLTPQE 98

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F   +L    N +    H K +  H  HV   S+ +G+       +  DWRE G++  V+
Sbjct: 99  FAKLYL----NPNYYARHGKDYKEH-VHVDD-SVRSGV-------MSVDWREKGVVTPVK 145

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM- 213
           NQ  CG+CWAF+T    E   ALKN +L  LS Q ++ C  N + GC+GG     + W+ 
Sbjct: 146 NQGMCGSCWAFATTGNIEGQWALKNHSLVSLSEQVLVSCD-NIDDGCNGGLMQQAMQWII 204

Query: 214 -DVNKVVLEPESEYPLLLKDAACKRKATSPN---GVKIKSYTCDTLIPSESSILTDIATH 269
            D N  V   E  YP     A   R     N   G KIK Y   +L   E  I   +  +
Sbjct: 205 NDHNGTV-PTEDSYP--YTSAGGTRPPCHDNGTVGAKIKGYM--SLPHDEEEIAAYVGKN 259

Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSR 314
           GPV  AV+A T Q Y GGV+   C G   ++NH V +VG++  ++
Sbjct: 260 GPVAVAVDATTRQLYFGGVVTL-CFG--LSLNHGVLVVGFNRQAK 301


>gi|391328550|ref|XP_003738751.1| PREDICTED: cathepsin K-like [Metaseiulus occidentalis]
          Length = 320

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 86/281 (30%), Positives = 134/281 (47%), Gaps = 26/281 (9%)

Query: 34  LELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY--GITEFSDL 90
           L  +S F++++ K Y S S  +I   NF ++   I E NK     +S  Y   +  +SD 
Sbjct: 15  LAEWSQFKEQFGKEYRSTSAEEIALLNFGRNSRTITEHNKRLHDGDSPSYRMAVNPWSDK 74

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           S EEF+  +  +        S+    D   N+V +R          G P   DW +AG +
Sbjct: 75  SHEEFRQYYGLYGD------SYDFTSDRILNYVPER----------GTPANVDWNKAGFV 118

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
              R+Q+ CG+CWAF+ V   E+  +   G L+ LSVQ +IDC+ + N GCSGG    +L
Sbjct: 119 TPSRDQKGCGSCWAFAAVGAIEARVSKSTGNLTALSVQNLIDCS-DTNFGCSGG--SPIL 175

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
              D+  + L     YP L +D  C R   S    +I  +  +    SE  +   +A  G
Sbjct: 176 ALRDLLSIGLHTADSYPYLARDGICHR-VNSSRLYQISGFYREEYYLSEERLKEMVAIIG 234

Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           PV A ++A    + +Y  G+  Y+   +  + NHAV +VG+
Sbjct: 235 PVTATIDASPFGFMHYRDGIF-YDPACNPDSPNHAVLVVGF 274


>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 463

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 84/264 (31%), Positives = 131/264 (49%), Gaps = 32/264 (12%)

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
           +E D RF+ F+ +L  I+E N    S    + G+T F+DL+ +E+++ +L     K VL 
Sbjct: 69  AEKDQRFEIFKDNLRYIDEHNTKNLS---YKLGLTRFADLTNDEYRSMYLGAKPVKRVL- 124

Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
              K  D +   V       G  +P  +    DWR+ G +  V++Q +CG+CWAFST+  
Sbjct: 125 ---KTSDRYEARV-------GDALPDSV----DWRKEGAVADVKDQGSCGSCWAFSTIGA 170

Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYP 227
            E ++ +  G L  LS QE++DC  + N GC+GG    L+D+     +    ++ E++YP
Sbjct: 171 VEGINKIVTGDLISLSEQELVDCDTSYNQGCNGG----LMDYAFEFIIKNGGIDTEADYP 226

Query: 228 LLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYL 285
               D  C +   +   V I SY  D    SE+S+   +A H P+  A+ A    +Q Y 
Sbjct: 227 YKAADGRCDQNRKNAKVVTIDSYE-DVPENSEASLKKALA-HQPISVAIEAGGRAFQLYS 284

Query: 286 GGVIQYNCDGSLANINHAVQIVGY 309
            GV    C   L   +H V  VGY
Sbjct: 285 SGVFDGICGTEL---DHGVVAVGY 305


>gi|50657027|emb|CAH04631.1| cathepsin H [Suberites domuncula]
          Length = 335

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 96/319 (30%), Positives = 147/319 (46%), Gaps = 38/319 (11%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKN 59
           M  +K  L +  L+ +C   IP+ +  P L +  + F  +Q+++ K YS + E   R K 
Sbjct: 1   MSAMKLFLGLCVLVHVCSAFIPLVLPIPGLYE--DYFKEWQEKHGKVYSTEEESQSRLKV 58

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F K++  I+  NK   S E     + E++D++ +EFK ++L     +H   +H    D  
Sbjct: 59  FMKNVIYIDNHNKQGHSYELE---VNEYADMTLDEFKDQYLMEP--QHCSATHSLKSDPP 113

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
                 ++I              DWR  G +  V+NQ  CG+CW FST    ES H LK 
Sbjct: 114 KYRDPPKAI--------------DWRSKGAVTPVKNQGQCGSCWTFSTTGCLESHHFLKT 159

Query: 180 GTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC--- 235
           G L  LS Q+++DCA    N GC+GG      +++  N   L+ E  YP    D  C   
Sbjct: 160 GQLVSLSEQQLVDCAQAFNNNGCNGGLPSQAFEYIHYNG-GLDSEESYPYRAHDEKCHFV 218

Query: 236 --KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVN-ALTWQYYLGGVIQY- 291
             +  AT  N V I S         E  +   + T GPV  A + +  +++Y  GV +  
Sbjct: 219 PSEVSATVSNVVNITS-------KDEMQLYNAVGTVGPVSIAYDVSADFRFYKKGVYKSK 271

Query: 292 NCDGSLANINHAVQIVGYD 310
            C     ++NHAV  VGY+
Sbjct: 272 ECKTDPEHVNHAVLAVGYN 290


>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
 gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
          Length = 343

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 79/270 (29%), Positives = 130/270 (48%), Gaps = 22/270 (8%)

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
           +E + R+  F+++++ IE LN+  Q   + +  + +F+DL+ EEF++ +  +  N  VL 
Sbjct: 52  NEKNNRYVVFKRNVESIERLNE-VQYGLTFKLAVNQFADLTNEEFRSMYTGYKGNS-VLS 109

Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
           S  K     + HV   ++          P+  DWR+ G +  +++Q +CG+CWAFS V  
Sbjct: 110 SRTKPTSFRYQHVSSDAL----------PISVDWRKKGAVTPIKDQGSCGSCWAFSAVAA 159

Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLL 230
            E +  +K G L  LS QE++DC  N + GC GG   +  ++  +    L  ES YP   
Sbjct: 160 IEGVAQIKKGKLISLSEQELVDCDTNDD-GCMGGYMNSAFNYT-MTTGGLTSESNYPYKS 217

Query: 231 KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVI 289
            D  C    T      IK +  D     E +++  +A H   I      T +Q+Y  GV 
Sbjct: 218 TDGTCNINKTKQIATSIKGFE-DVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGVF 276

Query: 290 QYNCDGSLANINHAVQIVGY---DNYSRTW 316
              C     +++H V +VGY    N S+ W
Sbjct: 277 SGECS---THLDHGVAVVGYGKSSNGSKYW 303


>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 349

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 82/279 (29%), Positives = 133/279 (47%), Gaps = 21/279 (7%)

Query: 34  LELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           L+ F  +  R+ ++Y+ + E   RF+ + ++++++E  N      + A     +F+DL+ 
Sbjct: 28  LDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLAD---NKFADLTN 84

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF+ + L      HV +          N         G +    +P   DWR+ G + +
Sbjct: 85  EEFRAKML--GFRPHVTIPQIS------NTCSADIAMPGESSDDILPKSVDWRKKGAVVE 136

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+NQ  CG+CWAFS V   E ++ +KNG L  LS QE++DC  +  +GC GG      ++
Sbjct: 137 VKNQGDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDC-DDEAVGCGGGYMSWAFEF 195

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           + V    L  E+ YP    + AC+    + + V I  Y    + PS    L   A   PV
Sbjct: 196 V-VGNHGLTTEASYPYHAANGACQAAKLNQSAVAIAGYR--NVTPSSEPDLARAAAAQPV 252

Query: 273 IAAVN--ALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             AV+  +  +Q Y  GV    C    A++NH V +VGY
Sbjct: 253 SVAVDGGSFMFQLYGSGVYTGPC---TADVNHGVTVVGY 288


>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
 gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 81/259 (31%), Positives = 124/259 (47%), Gaps = 30/259 (11%)

Query: 56  RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKH 115
           RF+ F+ ++  IE  N      +S   GI +F+DL+ EEF+     +   K  L +  K 
Sbjct: 59  RFQIFKSNVVFIESFNT--AGNKSYMLGINKFADLTNEEFRAFWNGY---KRPLGASRKI 113

Query: 116 HDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
               + +V            T +P   DWR  G +  +++Q  CG+CWAFS V   E +H
Sbjct: 114 TPFKYENV------------TALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIH 161

Query: 176 ALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAA 234
            L+ G L  LS QE++DC   G + GC GG       ++  +   +  E+ YP   +D  
Sbjct: 162 KLRTGKLVSLSEQELVDCDVKGQDKGCQGGLMVDAFKFIKRHG-GMTSEANYPYQGRDGK 220

Query: 235 CKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQ 290
           C  K  +   VKI  Y     +P  SE+++L  +A   PV  A++A  L++Q+Y  G+  
Sbjct: 221 CDTKKEASRAVKITGYQA---VPKNSEAALLKAVANQ-PVSVAIDAGSLSFQFYRSGIFT 276

Query: 291 YNCDGSLANINHAVQIVGY 309
             C     +INH V  VGY
Sbjct: 277 GICG---KDINHGVAAVGY 292


>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
          Length = 324

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 89/283 (31%), Positives = 142/283 (50%), Gaps = 33/283 (11%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNK-NRQSPESARYGITEFSDLSEEE 94
           + S++ +Y KSY  + E  +R + +E +L I+++ N    Q   + R G+  ++DL  EE
Sbjct: 19  WESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEE 78

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F    L+ S    +L +  K        +       G+T+P+ +    DWR  G +  V+
Sbjct: 79  FMA--LKGS--GGLLQAKDKSSTQTFKPL------VGVTLPSSV----DWRNQGYVTPVK 124

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q  CG+CW FS   + E  H  K G L  LS Q+++DCAG  GN GC+GG   +  D++
Sbjct: 125 DQGQCGSCWTFSATGSLEGQHFAKTGNLLSLSEQQLVDCAGRYGNYGCNGGLMESAYDYI 184

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD--TLIP--SESSILTDIATH 269
                V E ES YP   +D  CK   +     K+ + TC    +IP   E +++  + T 
Sbjct: 185 KGVGGV-ELESAYPYTARDGRCKFDRS-----KVVA-TCKGYVVIPVGDEQALMQAVGTI 237

Query: 270 GPVIAAVNA--LTWQYYLGGVIQY-NCDGSLANINHAVQIVGY 309
           GPV  +++A   ++Q Y  GV  +  C  S  N++H V  VGY
Sbjct: 238 GPVAVSIDASGYSFQLYESGVYDFRRC--SSTNLDHGVLAVGY 278


>gi|1706261|sp|Q10717.1|CYSP2_MAIZE RecName: Full=Cysteine proteinase 2; Flags: Precursor
 gi|644490|dbj|BAA08245.1| cysteine proteinase [Zea mays]
          Length = 360

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 129/278 (46%), Gaps = 27/278 (9%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY KSY S +E   RF+ F +SL ++   N+   S    R GI  F+D+S EEF
Sbjct: 59  FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLS---YRLGINRFADMSWEEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L  +H+                       +P  KDWRE GI+  V+
Sbjct: 116 RATRLGAAQNCSATLTGNHRMR----------------AAAVALPETKDWREDGIVSPVK 159

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
           NQ  CG+CW FST    E+ +    G    LS Q+++DC     N GC+GG      +++
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYI 219

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP    +  CK K  +  GVK+   + +  + +E  +   +    PV 
Sbjct: 220 KYNG-GLDTEESYPYQGVNGICKFKNENV-GVKVLD-SVNITLGAEDELKDAVGLVRPVS 276

Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
            A   +T ++ Y  GV   + C  +  ++NHAV  VGY
Sbjct: 277 VAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGY 314


>gi|343473977|emb|CCD14279.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 88/319 (27%), Positives = 157/319 (49%), Gaps = 34/319 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYRDATEEAFRFRVFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 67  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   + + TG  P   DWR+ G +  V++Q  C + WAF+ +   E    +  
Sbjct: 111 ALKRPRKV---VNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIGNIEGQWKIAG 167

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDA---AC 235
             L+ LS Q ++ C  N ++GC  G       W+   N   +  E  YP         AC
Sbjct: 168 HELTSLSEQMLVSCDTN-DLGCRAGFMDTAFKWIVSPNDGNVFTEQSYPYASGGGNVPAC 226

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
             K+    G  I+ +    ++ +E++I   +A +GPV  AV+A ++Q Y GGV+  +C  
Sbjct: 227 N-KSGKVVGANIRDHV--HILDNENAIAEWLAKNGPVAIAVDATSFQRYTGGVLT-SCIS 282

Query: 296 SLANINHAVQIVGYDNYSR 314
               +N A  +VGYD+ S+
Sbjct: 283 K--EVNSAALLVGYDDTSK 299


>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
 gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
          Length = 347

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 88/320 (27%), Positives = 148/320 (46%), Gaps = 39/320 (12%)

Query: 11  VALIALCFLAIP---------VKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFE 61
           + LI LC L IP         + +       K+      +Q  +K  +K E+ +RF  + 
Sbjct: 12  LMLITLCTLWIPSIARSEIHSLPIDSAPTAMKVRYDKWLEQYGRKYDTKDEYLLRFGIYH 71

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
            ++  IE +N    S    +    +F+DL+ +EF + +L + +  +      ++  H H 
Sbjct: 72  SNIQFIEYINSQNLS---FKLTDNKFADLTNDEFNSIYLGYQIRSY----KRRNLSHMHE 124

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
           +             T +P   DWRE G +  +++Q  CG+CWAFS V   E ++ +K G 
Sbjct: 125 N------------STDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGN 172

Query: 182 LSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
           L  LS QE++DC  NG N GC+GG       ++  +   L  E++YP    D +C++  T
Sbjct: 173 LVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIK-SIGGLTTENDYPYKGTDGSCEKAKT 231

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY--YLGGVIQYNCDGSLA 298
             + V I  Y  +T+  +  + L    +  PV  A++A  +++  Y  GV    C     
Sbjct: 232 DNHAVIIGGY--ETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGVFSGYCG---I 286

Query: 299 NINHAVQIVGY--DNYSRTW 316
            +NH V IVGY  +N  + W
Sbjct: 287 QLNHGVTIVGYGDNNGQKYW 306


>gi|37911662|gb|AAR05023.1| cathepsin L-like protein [Tenebrio molitor]
          Length = 336

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 97/305 (31%), Positives = 139/305 (45%), Gaps = 25/305 (8%)

Query: 16  LCFLAIPVKVSKPNLEQKL--ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN- 71
              LAI +  +   L      E + +F+  Y +SY +  E   R + F+K L+  EE N 
Sbjct: 4   FIILAIAIYGASAALPSTFVAEKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNE 63

Query: 72  KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK-KRSITT 130
           K RQ   S   G+  F+D++ EE K          H L+      D H N +  K     
Sbjct: 64  KYRQGLVSYTLGVNLFTDMTPEEMKAY-------THGLI---MPADLHKNGIPIKTREDL 113

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL--SLLSVQ 188
           G+      P   DWR+ G++  V+NQ +CG+CWAFS+    ES   + NG    S +S Q
Sbjct: 114 GLNASVRYPASFDWRDQGMVSPVKNQGSCGSCWAFSSTGAIESQMKIANGAGYDSSVSEQ 173

Query: 189 EVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
           +++DC  N  +GCSGG       ++  N  + + E  YP  + D  C      PN V  +
Sbjct: 174 QLVDCVPNA-LGCSGGWMNDAFTYVAQNGGI-DSEGAYPYEMADGNCHY---DPNQVAAR 228

Query: 249 SYTCDTLIPSESSILTD-IATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQI 306
                 L   + ++L D +AT GPV  A +A   +  Y GGV  YN         HAV I
Sbjct: 229 LSGYVYLSGPDENMLADMVATKGPVAVAFDADDPFGSYSGGVY-YNPTCETNKFTHAVLI 287

Query: 307 VGYDN 311
           VGY N
Sbjct: 288 VGYGN 292


>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
 gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
          Length = 335

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 93/309 (30%), Positives = 148/309 (47%), Gaps = 27/309 (8%)

Query: 6   NVLFIVALIALCFLAIPVKV---SKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFE 61
           N++ IV L ALC  +    V   +  NL++  + F SF + Y K+Y+   E + R+  F+
Sbjct: 2   NIIVIVTL-ALCAASSRAAVVAETAYNLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFK 60

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
            +L  I   N N     +A YGI +FSDLS+ E   +    S+ +              N
Sbjct: 61  DNLHEINAKNGNATDGPTATYGINKFSDLSKSELIAKFTGLSIPQRA-----------SN 109

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
             K   +      P   P+  DWRE   +  ++NQ  CGACWAF+T+ + ES  A+++  
Sbjct: 110 FCKTIVLNQP---PDKGPLHFDWREQNKVTSIKNQGACGACWAFATLASVESQFAMRHNR 166

Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
           L  LS Q++IDC  + +MGC+GG      + + +    ++ E +YP + +D  C      
Sbjct: 167 LVDLSEQQLIDC-DSVDMGCNGGLLHTAFEEI-IRMGGVQAELDYPFVGRDRRCGVDRHR 224

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATH-GPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
           P  V +    C   +      L D+    GP+  A++A     Y  GVI  +C+ +   +
Sbjct: 225 PYVVSLVG--CYRYVMVNEEKLKDLLRAVGPIPMAIDAADIVNYYRGVIS-SCENN--GL 279

Query: 301 NHAVQIVGY 309
           NHAV +VGY
Sbjct: 280 NHAVLLVGY 288


>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 485

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 79/270 (29%), Positives = 130/270 (48%), Gaps = 32/270 (11%)

Query: 45  KKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV 104
           K   S +E D RF+ F+ +L  I+E N    S    R G+T+F+DL+ +E+++ +L   +
Sbjct: 57  KAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLS---YRLGLTKFADLTNDEYRSMYLGSRL 113

Query: 105 NKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWA 164
            +    S  ++                + +   IP   DWR+ G + +V++Q +CG+CWA
Sbjct: 114 KRKATKSSLRYE---------------VRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWA 158

Query: 165 FSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLE 221
           FST+   E ++ +  G L  LS QE++DC  + N GC+GG    L+D+     +N   ++
Sbjct: 159 FSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGG----LMDYAFEFIINNGGID 214

Query: 222 PESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NAL 279
            E +YP    D  C +   +   V I  Y  D    SE S L    +H P+  A+     
Sbjct: 215 TEEDYPYKGVDGRCDQTRKNAKVVTIDLYE-DVPANSEES-LKKALSHQPISVAIEGGGR 272

Query: 280 TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +Q Y  G+    C     +++H V  VGY
Sbjct: 273 AFQLYDSGIFDGICG---TDLDHGVVAVGY 299


>gi|118125|sp|P25784.1|CYSP3_HOMAM RecName: Full=Digestive cysteine proteinase 3; Flags: Precursor
          Length = 321

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 83/305 (27%), Positives = 153/305 (50%), Gaps = 38/305 (12%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEE 69
           VA + LC LA+    + P+ +        F+ +Y + Y  ++ ++ R + F+++  +IE+
Sbjct: 3   VAALFLCGLALAT--ASPSWDH-------FKTQYGRKYGDAKEELYRQRVFQQNEQLIED 53

Query: 70  LNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            NK  ++ E + +  + +F D++ EEF           + +M  +K      +  + +++
Sbjct: 54  FNKKFENGEVTFKVAMNQFGDMTNEEF-----------NAVMKGYKKG----SRGEPKAV 98

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
            T    P    V  DWR   ++  V++Q+ CG+CWAFS     E  H LKN  L  LS Q
Sbjct: 99  FTAEAGPMAADV--DWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQ 156

Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
           +++DC+ + GN GC GG   +  D++  N  + + ES YP   +D +C+  A S   +  
Sbjct: 157 QLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI-DTESSYPYEAEDRSCRFDANSIGAICT 215

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINHAV 304
            S     +  +E ++   ++  GP+  A++A   ++Q+Y  GV  + NC  +   ++H V
Sbjct: 216 GSV---EVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTF--LDHGV 270

Query: 305 QIVGY 309
             VGY
Sbjct: 271 LAVGY 275


>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
          Length = 366

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 76/282 (26%), Positives = 135/282 (47%), Gaps = 27/282 (9%)

Query: 34  LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           + ++  +  +++K Y+   E D RF+ F+ +L  I+E N N+ +  + + G+ +F+D++ 
Sbjct: 37  MTMYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFIQEHNNNQNN--TYKLGLNQFADMTN 94

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EE++  +     +    +   K   H + +          +    +PV  DWR  G +  
Sbjct: 95  EEYRVMYFGTKSDAKRRLMKTKSTGHRYAY----------SAGDRLPVHVDWRVKGAVAP 144

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           +++Q +CG+CWAFSTV T E+++ +  G    LS QE++DC    N GC+GG    L+D+
Sbjct: 145 IKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRAYNEGCNGG----LMDY 200

Query: 213 ---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
                +    ++ + +YP    D  C     +   V I  +  + + P + + L     H
Sbjct: 201 AFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGF--EDVPPYDENALKKAVAH 258

Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            PV  A+ A     Q Y  GV    C  SL   +H V +VGY
Sbjct: 259 QPVSIAIEASGRDLQLYQSGVFTGKCGTSL---DHGVVVVGY 297


>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
          Length = 441

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 79/270 (29%), Positives = 130/270 (48%), Gaps = 32/270 (11%)

Query: 45  KKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV 104
           K   S +E D RF+ F+ +L  I+E N    S    R G+T+F+DL+ +E+++ +L   +
Sbjct: 51  KAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLS---YRLGLTKFADLTNDEYRSMYLGSRL 107

Query: 105 NKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWA 164
            +    S  ++                + +   IP   DWR+ G + +V++Q +CG+CWA
Sbjct: 108 KRKATKSSLRYE---------------VRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWA 152

Query: 165 FSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLE 221
           FST+   E ++ +  G L  LS QE++DC  + N GC+GG    L+D+     +N   ++
Sbjct: 153 FSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGG----LMDYAFEFIINNGGID 208

Query: 222 PESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NAL 279
            E +YP    D  C +   +   V I  Y  D    SE S L    +H P+  A+     
Sbjct: 209 TEEDYPYKGVDGRCDQTRKNAKVVTIDLYE-DVPANSEES-LKKALSHQPISVAIEGGGR 266

Query: 280 TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +Q Y  G+    C     +++H V  VGY
Sbjct: 267 AFQLYDSGIFDGICG---TDLDHGVVAVGY 293


>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
          Length = 382

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 81/273 (29%), Positives = 127/273 (46%), Gaps = 29/273 (10%)

Query: 34  LELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           LE F ++Q  Y ++Y+  E    RF  + +++  I+ +N+   +  S   G  +F+DL+E
Sbjct: 61  LERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQ-LSTGSSYELGENQFTDLTE 119

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI---------PVKKD 143
           EEFK  +L                D      +    T G     G+         P   D
Sbjct: 120 EEFKDTYL-------------MKLDEQPPAAEAMPPTVGTMSTAGMSNGNNTGEAPNSVD 166

Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN-MGCS 202
           WR  G + +V++QQ CG+CWAF+TV + E +H +K G L  LS QE++DC   GN  GC 
Sbjct: 167 WRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCR 226

Query: 203 GGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI 262
           GG   + ++W+  N   L  ES+YP +     C       +  +I+ Y       +E+ +
Sbjct: 227 GGSPRSAMEWVTRNG-GLTTESDYPYVGSQRQCMSGKLGHHAARIRGYQA-VQRNNEAEL 284

Query: 263 LTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCD 294
              +A   PV   V+A   +Q+Y  GV    CD
Sbjct: 285 ERAVAGQ-PVAVFVDASRAFQFYKSGVFSGPCD 316


>gi|67605684|ref|XP_666697.1| cryptopain precursor [Cryptosporidium hominis TU502]
 gi|54657738|gb|EAL36466.1| cryptopain precursor [Cryptosporidium hominis]
          Length = 401

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 86/296 (29%), Positives = 133/296 (44%), Gaps = 23/296 (7%)

Query: 21  IPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPES 79
           +P     P   +  + F  F+++Y K+YS   E + RF+ ++++++ I+  N    S   
Sbjct: 70  VPGDYVDPATREYRKSFEEFKKKYNKTYSSMEEENQRFEIYKQNMNFIKTTNSQGFS--- 126

Query: 80  ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
               + EF DLS+EEF  R        ++  S         + V    +      P  I 
Sbjct: 127 YVLEMNEFGDLSKEEFMARF-----TGYIKDSKDDERVFKSSRVSASELEEEFVPPNSI- 180

Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH-ALKNGTLSLLSVQEVIDCAG-NG 197
              +W EAG +  +RNQ+ CG+CWAFS V   E    A  N  L  LS Q+ +DC+  NG
Sbjct: 181 ---NWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTNRGLPSLSEQQFVDCSKQNG 237

Query: 198 NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
           N GC GG       +   NK  L    +YP   ++  C   +   N ++I       + P
Sbjct: 238 NFGCDGGTMGLAFQYAIKNK-YLCTNDDYPYFAEEKTC-MDSFCENYIEIPVKAYKYVFP 295

Query: 258 SESSIL-TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
              + L T +A +GP+  A+ A    +Q+Y  GV    C      +NH V +VGYD
Sbjct: 296 RNINTLKTALAKYGPISVAIQADQTPFQFYKSGVFDAPCG---TKVNHGVVLVGYD 348


>gi|171854651|dbj|BAG16515.1| putative cysteine proteinase [Capsicum chinense]
          Length = 367

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 84/284 (29%), Positives = 131/284 (46%), Gaps = 38/284 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F+ ++ K Y ++ EHD R K F+ +L       +++    +A +GIT+FSDL+  EF
Sbjct: 50  FSLFKSKFGKIYATQEEHDHRLKVFKANL---RRARRHQLLDPTAEHGITKFSDLTPSEF 106

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           +  +L                   H    K S T    +PT  +P   DWRE G +  V+
Sbjct: 107 RRTYLGL-----------------HKPKPKLSTTKAPILPTSDLPEDFDWREKGAVTGVK 149

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FST    E  H L  G L  LS Q+++DC            + GC GG  
Sbjct: 150 NQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEQKSECDAGCGGGLM 209

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               ++  +    L+ E +YP   ++  C     S     + +Y+   L   E  I  ++
Sbjct: 210 TTAFEYT-LKAGGLQREKDYPYTGRNGQCHFD-KSKIAASVTNYSVVGL--DEDQIAANL 265

Query: 267 ATHGPVIAAVNALTWQYYLGGVIQYNCD-GSLANINHAVQIVGY 309
             HGP+   +N+   Q Y+GGV   +C      + +H V +VGY
Sbjct: 266 VKHGPLAVGINSAWMQTYIGGV---SCPLVCFKHQDHGVLLVGY 306


>gi|407844577|gb|EKG02025.1| cysteine peptidase, putative,cysteine peptidase, clan CA, family
           C1, cathepsin L-like, putative, partial [Trypanosoma
           cruzi]
          Length = 308

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 83/289 (28%), Positives = 130/289 (44%), Gaps = 27/289 (9%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSL 64
            L + A++ +    +P   +  + E+ L   F+ F+Q++ + Y S +E   R   F  +L
Sbjct: 35  ALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYGSAAEEAFRLSVFRANL 94

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
             +  L+    +   A +G+T FSDL+ EEF++R+              ++   H    +
Sbjct: 95  -FLARLHA--AANPHANFGVTPFSDLTREEFRSRY--------------QNGAAHFAAAQ 137

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
           +R+         G P  KDWRE G +  V+NQ  CG+CWAF+ +   E    L    L+ 
Sbjct: 138 ERARVPVDVEVVGAPAAKDWREEGAVTAVKNQGMCGSCWAFAAIGNIEGQWFLAGNPLTR 197

Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPL---LLKDAACKRKAT 240
           LS Q ++ C  N N GC GG       W+ D N   +  E  YP    +     CK  + 
Sbjct: 198 LSEQMLVSC-DNTNSGCGGGSPFRAFKWIVDRNNGAVYTEDSYPYHSCIGIKLPCK-DSD 255

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI 289
              G  I  Y   T+   E  I   +A  GP+  AV+A +W +Y GGV 
Sbjct: 256 RTVGATISGYV--TIPSDEKRIAAVLAVKGPLSVAVDASSWMHYTGGVF 302


>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
          Length = 332

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 79/270 (29%), Positives = 130/270 (48%), Gaps = 22/270 (8%)

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
           +E + R+  F+++++ IE LN+  Q   + +  + +F+DL+ EEF++ +  +  N  VL 
Sbjct: 46  NEKNNRYVVFKRNVESIERLNE-VQYGLTFKLAVNQFADLTNEEFRSMYTGYKGNS-VLS 103

Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
           S  K     + HV   ++          P+  DWR+ G +  +++Q +CG+CWAFS V  
Sbjct: 104 SRTKPTSFRYQHVSSDAL----------PISVDWRKKGAVTPIKDQGSCGSCWAFSAVAA 153

Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLL 230
            E +  +K G L  LS QE++DC  N + GC GG   +  ++  +    L  ES YP   
Sbjct: 154 IEGVAQIKKGKLISLSEQELVDCDTNDD-GCMGGYMNSAFNYT-MTTGGLTSESNYPYKS 211

Query: 231 KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVI 289
            D  C    T      IK +  D     E +++  +A H   I      T +Q+Y  GV 
Sbjct: 212 TDGTCNINKTKQIATSIKGFE-DVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGVF 270

Query: 290 QYNCDGSLANINHAVQIVGY---DNYSRTW 316
              C     +++H V +VGY    N S+ W
Sbjct: 271 SGECS---THLDHGVAVVGYGKSSNGSKYW 297


>gi|297793593|ref|XP_002864681.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310516|gb|EFH40940.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y    E  +RF  F+++LD+I   NK   S    + G+ +F+DL+ +EF
Sbjct: 59  FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADLTWQEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L   HK                       +P  KDWRE GI+  V+
Sbjct: 116 QRTKLGAAQNCSATLKGSHK------------------LTEAALPETKDWREDGIVSPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC+GG      +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAYNNYGCNGGLPSQAFEYI 217

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP + KD  CK  A +  GV++   + +  + +E  +   +    PV 
Sbjct: 218 KSNG-GLDTEEAYPYIGKDGTCKFSAENV-GVQVLD-SVNITLGAEDELKHAVGLVRPVS 274

Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            A   + +++ Y  GV    +C  +  ++NHAV  VGY
Sbjct: 275 IAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312


>gi|73946536|ref|XP_541257.2| PREDICTED: cathepsin L1 [Canis lupus familiaris]
          Length = 333

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 87/302 (28%), Positives = 138/302 (45%), Gaps = 30/302 (9%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
           L ALC   + +  + P  +  L+  +S +++ + K Y K E   R   +E+++++IE+ N
Sbjct: 7   LAALC---LGIASAAPQQDHSLDAHWSQWKEAHGKLYDKDEEGWRRTVWERNMEMIEQHN 63

Query: 72  KN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           +   Q   S    +  F D++ EEFK       + KH                 K+    
Sbjct: 64  QEYSQGEHSFTLAMNAFGDMTNEEFKQVLNDFKIQKH-----------------KKGKVF 106

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
              +   +P   DWRE G +  V++Q  C  CWAFS     E     K G L  LS Q +
Sbjct: 107 PAPLFAEVPSSVDWREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNL 166

Query: 191 IDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           +DC+   GN GC+GG       ++  N   L+ E  YP L ++  CK +   P       
Sbjct: 167 VDCSWSQGNRGCNGGLMEYAFQYVKDNG-GLDSEESYPYLARNEPCKYR---PEKSAANV 222

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
                ++  E  ++T +AT GPV AAV++   ++Q+Y  G I Y+   S   +NH V +V
Sbjct: 223 TAFWPILNEEDGLMTTVATVGPVSAAVDSSPQSFQFYKKG-IYYDPKCSNKLLNHGVLVV 281

Query: 308 GY 309
           GY
Sbjct: 282 GY 283


>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
          Length = 437

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 79/286 (27%), Positives = 140/286 (48%), Gaps = 30/286 (10%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           ++ + +++S+  ++ KSY+   E + RF+ F+ +L  I+  N N     S   G+  F+D
Sbjct: 43  DEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYID--NHNADPDRSYELGLNRFAD 100

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ EE++ ++L          S             + +   G  +P  I    DWRE G 
Sbjct: 101 LTNEEYRAKYLG-------TKSRESRPKLSKGPSDRYAPVEGEELPDSI----DWREKGA 149

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  V++Q +CG+CWAFS +   E ++ +  G L  LS QE++DC  + N GC GG    L
Sbjct: 150 VAAVKDQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGG----L 205

Query: 210 LDWMDVNKVV----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
           +D+   N ++    ++ + +YP   +D  C +   +   V I SY  D  +  E + L  
Sbjct: 206 MDYA-FNFIIKNGGIDSDLDYPYTGRDGTCNQNKENAKVVTIDSYE-DVPVYDEKA-LQK 262

Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            A + P+  A+ A  + +Q Y+ G+    C  +   ++H V +VGY
Sbjct: 263 AAANQPISVAIEAGGMDFQLYVSGIFTGKCGTA---VDHGVVVVGY 305


>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
 gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
          Length = 306

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 83/279 (29%), Positives = 135/279 (48%), Gaps = 28/279 (10%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           E    +  +Y + Y   +E   R+  F++++  I+  N   Q+ +S + G+ +F+DL+ E
Sbjct: 3   ERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNS--QTGKSYKLGVNQFADLTNE 60

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           EFK    R+    H  M   +     + +V            + +P   DWR+ G +  V
Sbjct: 61  EFKAS--RNRFKGH--MCSPQAGPFRYENV------------SAVPSTVDWRKEGAVTPV 104

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDW 212
           ++Q  CG CWAFS V   E ++ L  G L  LS QEV+DC   G + GC+GG       +
Sbjct: 105 KDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKF 164

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           ++ NK  L  E+ YP    D  C  K ++ +  KI  +  D    SE++++  +A   PV
Sbjct: 165 IEQNK-GLTTEANYPYKGTDGTCNTKKSAIHAAKITGFE-DVPANSEAALMKAVAKQ-PV 221

Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A++A    +Q+Y  G+   +CD  L   +H V  VGY
Sbjct: 222 SVAIDAGGSDFQFYSSGIFTGSCDTQL---DHGVTAVGY 257


>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
           Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
           Precursor
 gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
 gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
 gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 90/311 (28%), Positives = 139/311 (44%), Gaps = 31/311 (9%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQK--------LELFSSFQQRYKKSYSKSEHDIRFKNF 60
           FIV  +ALC L +       +   K         EL+  ++  +  + S  E   RF  F
Sbjct: 4   FIV--LALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLEEKAKRFNVF 61

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           + ++  I E NK     +S +  + +F D++ EEF+  +   ++       HH+      
Sbjct: 62  KHNVKHIHETNK---KDKSYKLKLNKFGDMTSEEFRRTYAGSNI------KHHRMFQGEK 112

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
              K        T+PT +    DWR+ G +  V+NQ  CG+CWAFSTV   E ++ ++  
Sbjct: 113 KATKSFMYANVNTLPTSV----DWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTK 168

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L+ LS QE++DC  N N GC+GG      +++   K  L  E  YP    D  C     
Sbjct: 169 KLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIK-EKGGLTSELVYPYKASDETCDTNKE 227

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLA 298
           +   V I  +  D    SE  ++  +A   PV  A++A    +Q+Y  GV    C   L 
Sbjct: 228 NAPVVSIDGHE-DVPKNSEDDLMKAVANQ-PVSVAIDAGGSDFQFYSEGVFTGRCGTEL- 284

Query: 299 NINHAVQIVGY 309
             NH V +VGY
Sbjct: 285 --NHGVAVVGY 293


>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
          Length = 346

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 90/323 (27%), Positives = 150/323 (46%), Gaps = 40/323 (12%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--------SEHDIRFKN 59
           LF+    + CF    + +S+P L+ +L +    Q+R+ +  +K         E + R+  
Sbjct: 10  LFVAIFSSFCF---SITLSRP-LDNELIM----QKRHIEWMTKHGRVYADVKEENNRYVV 61

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR-HSVNKHVLMSHHKHHDH 118
           F+ +++ IE LN +  +  + +  + +F+DL+ +EF + +     V+     S  K    
Sbjct: 62  FKNNVERIEHLN-SIPAGRTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQTKMSPF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
            + +V   ++          PV  DWR+ G +  ++NQ +CG CWAFS V   E    +K
Sbjct: 121 RYQNVSSGAL----------PVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIK 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L  LS Q+++DC  N + GC GG      + +      L  ES+YP   +DA C  K
Sbjct: 171 KGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIKATG-GLTTESDYPYKGEDATCNSK 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGS 296
            T+P    I  Y  D  +  E +++  +A H PV   +      +Q+Y  GV    C   
Sbjct: 229 KTNPKATSITGYE-DVPVNDEQALMKAVA-HQPVSVGIEGGGFDFQFYSSGVFTGECTTY 286

Query: 297 LANINHAVQIVGYD---NYSRTW 316
           L   +HAV  +GY    N S+ W
Sbjct: 287 L---DHAVTAIGYGESTNGSKYW 306


>gi|28192375|gb|AAK07731.1| CPR2-like cysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 144/331 (43%), Gaps = 52/331 (15%)

Query: 8   LFIVALIALCFL--AIPVKVSKPNLEQKLEL------------FSSFQQRYKKSY-SKSE 52
           LF+++L+A      AI      P + Q +              FS F+ ++ K Y S+ E
Sbjct: 4   LFLLSLLAFVLFSSAIAFSDEDPLIRQVVSETDDSHLLNAEHHFSLFKSKFGKIYASEEE 63

Query: 53  HDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
           HD RFK F+ +        +++    SA +GIT+FSDL+  EF+  +L            
Sbjct: 64  HDHRFKVFKANR---RRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGL---------- 110

Query: 113 HKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
                  H    K +      +PT  +P   DWR+ G +  V+NQ +CG+CW+FST    
Sbjct: 111 -------HKPKPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAV 163

Query: 172 ESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPE 223
           E  H L  G L  LS Q+++DC            + GC GG      ++  +    L+ E
Sbjct: 164 EGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLMTTAFEYT-LKAGGLQLE 222

Query: 224 SEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY 283
            +YP   KD  C     S     + +++   L   E  I  ++  HGP+   +NA   Q 
Sbjct: 223 KDYPYTGKDGKCHFD-KSKIAAAVTNFSVIGL--DEDQIAANLVKHGPLAVGINAAWMQT 279

Query: 284 YLGGV-IQYNCDGSLANINHAVQIVGYDNYS 313
           Y+GGV     C       +H V +VGY ++ 
Sbjct: 280 YVGGVSCPLIC---FKRQDHGVLLVGYGSHG 307


>gi|66816665|ref|XP_642342.1| hypothetical protein DDB_G0278401 [Dictyostelium discoideum AX4]
 gi|60470393|gb|EAL68373.1| hypothetical protein DDB_G0278401 [Dictyostelium discoideum AX4]
          Length = 337

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 90/305 (29%), Positives = 146/305 (47%), Gaps = 32/305 (10%)

Query: 13  LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
           L  LC L I V  +K  L   Q  + F+ +    +KSYS SE   R+  F+ + D IEE 
Sbjct: 4   LSVLCALLITVATAKQELSESQYRDAFTDWMISNQKSYSSSEFITRYNIFKTNFDYIEEW 63

Query: 71  NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           N   +  E+   G+ + +D++ EE+++ +L    +   L+   +                
Sbjct: 64  NS--KGSETV-LGLNKMADITNEEYRSLYLGKPFDASSLIGTKEE--------------- 105

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL-KNGTLSLLSV-- 187
            I          DWR+ G +  V+NQQ+C  CW+FS     E  H L  NGT  L+S+  
Sbjct: 106 -ILFSNKFSSTVDWRKKGAVTHVKNQQSCSGCWSFSATGATEGAHKLANNGTNELVSLSE 164

Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
           Q +IDC+   GN GC+GG      +++  N  + + E  YP    D  C+ K+ + +G  
Sbjct: 165 QNLIDCSTPFGNTGCNGGVITYAFEYIISNGGI-DTEKSYPFEGTDGTCRYKSEN-SGAT 222

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAV 304
           I SY  +    SESS+ + +  + PV  +++A   ++ +Y  G I +    S  N++H V
Sbjct: 223 ISSYV-NVTFGSESSLESAVNVN-PVACSIDASHSSFLFYKSG-IYFEPACSRTNLDHGV 279

Query: 305 QIVGY 309
            +VGY
Sbjct: 280 LVVGY 284


>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
          Length = 501

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 88/311 (28%), Positives = 153/311 (49%), Gaps = 25/311 (8%)

Query: 7   VLFIVALIALCFLAIPVKV-----SKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNF 60
           VLFI A +A    ++P +         + E+  ELF  +++R+K+ Y  +E    RF+ F
Sbjct: 11  VLFIWASLACLSSSLPTEFYITGEEFASEERVRELFHLWKERHKRVYKHAEETAKRFEIF 70

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           +++L  + E N           G+ +F+D+S EEFK ++L          +++       
Sbjct: 71  KENLKYVIERNSKGHR---HTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLRR---- 123

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
             ++++  T     P+ +    DWR+ G++  +++Q  CG+CWAFS+    E ++A+  G
Sbjct: 124 -SMQQKKGTASCEAPSSL----DWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTG 178

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
            L  LS QE++DC    N GC GG      +W+ ++   ++ ES+YP    D  C     
Sbjct: 179 DLISLSEQELVDC-DTTNYGCEGGYMDYAFEWV-ISNGGIDSESDYPYTGTDGTCNTTKE 236

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVN--ALTWQYYLGGVIQYNCDGSLA 298
               V I  Y    +  S+S++L   A + P+   ++  AL +Q Y  G+   +C     
Sbjct: 237 DTKVVSIDGYK--DVDESDSALLC-AAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPD 293

Query: 299 NINHAVQIVGY 309
           +I+HAV IVGY
Sbjct: 294 DIDHAVLIVGY 304


>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 96/323 (29%), Positives = 143/323 (44%), Gaps = 39/323 (12%)

Query: 6   NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSL 64
           N + +  L+ + FLA  V           E    +  RY K Y    E + RF+ F++++
Sbjct: 8   NHISLAMLLCMTFLAFQVTCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVFKENV 67

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
           + IE  N    + +S + GI +F+DL+ +EF     R+    H+  S             
Sbjct: 68  NYIEAFN--NAANKSYKLGINQFADLTNKEFIAP--RNGFKGHMCSS------------I 111

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
            R+ T      T  P   DWR+ G +  +++Q  CG CWAFS V   E +HAL  G L  
Sbjct: 112 IRTTTFKFENVTATPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLIS 171

Query: 185 LSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRK 238
           LS QE++DC   G + GC GG    L+D  D  K +     L  E+ YP    D  C   
Sbjct: 172 LSEQELVDCDTKGVDQGCEGG----LMD--DAFKFIIQNHGLNTEANYPYKGVDGKCNAN 225

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
             + N   I  Y  D    +E ++   +A   PV  A++A    +Q+Y  GV   +C   
Sbjct: 226 EAAKNAATITGYE-DVPANNEMALQKAVANQ-PVSVAIDASGSDFQFYKSGVFTGSCGTE 283

Query: 297 LANINHAVQIVGY---DNYSRTW 316
           L   +H V  VGY   D+ +  W
Sbjct: 284 L---DHGVTAVGYGVSDDGTEYW 303


>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
          Length = 361

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 94/302 (31%), Positives = 142/302 (47%), Gaps = 44/302 (14%)

Query: 30  LEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFS 88
           L Q+  LFS F + Y K+Y  K EH+ RF  F+ +L  I   N+  +   +A YG+TEFS
Sbjct: 27  LSQERSLFSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEG--TAHYGLTEFS 84

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
           DLS  EF+    RH +     ++ HK         + + I  G  +   +P   DWR  G
Sbjct: 85  DLSPSEFE----RHYLGLKKDLAEHK--------AEVKPIKVG-PVNEPLPDLFDWRTKG 131

Query: 149 IIGKVRNQQTCG------------------ACWAFSTVETAESMHALKNGTLSLLSVQEV 190
            + +V+NQ  CG                  +CWAFS     E    L    L  LS QE+
Sbjct: 132 AVTEVKNQGMCGSCWAFSXXTEVKNQGMCGSCWAFSVTGNVEGQWFLSRSKLLSLSEQEL 191

Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
           +DC  +G+ GC GG     +  + +    LE ESEYP    D  C+   T     +++S+
Sbjct: 192 VDCD-HGDHGCKGGYMGQAMKAV-IEMGGLETESEYPYKGVDGTCEFNKTESK-ARVQSF 248

Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIV 307
               L  +E+ +   +  HGPV   +NA   Q+Y GG+    ++ C  S  +++H V +V
Sbjct: 249 V--GLPQNETELAYWLMKHGPVSIGINANAMQFYFGGISHPWKFLC--SPTDLDHGVLLV 304

Query: 308 GY 309
           G+
Sbjct: 305 GF 306


>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
 gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
          Length = 362

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 77/268 (28%), Positives = 133/268 (49%), Gaps = 36/268 (13%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
           E+  +  +S+ + YK +   +E  +R+K F++++  I+  N   +S +S +  + +F+DL
Sbjct: 37  ERHEQWMASYARVYKDA---NEKQMRYKIFKENVQRIDSFNS--ESDKSYKLAVNQFADL 91

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           + EEFK+  LR+    H+  +   H  + +               T +P   DWR+ G +
Sbjct: 92  TNEEFKS--LRNGFKGHMCSAQAGHFRYEN--------------VTAVPASIDWRKKGAV 135

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCAL 209
            +++ Q  CG+CWAFS V   E +  +K G L  LS QE++DC  N  + GC GG    L
Sbjct: 136 TQIKEQGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSEDQGCQGG----L 191

Query: 210 LDWMDVNKVV----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
           +D  D  K +    L  E+ YP    D+ CK K  +    KI  Y  + +  ++ + L +
Sbjct: 192 MD--DAFKFIEQHGLASEATYPYDAADSTCKTKEEAKPSAKITGY--EDVPANDEAALKN 247

Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQY 291
              + PV  A++A    +Q+Y  G+  Y
Sbjct: 248 AVANQPVSVAIDAGGFEFQFYSSGIEWY 275


>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
 gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
 gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
 gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 88/318 (27%), Positives = 147/318 (46%), Gaps = 46/318 (14%)

Query: 7   VLFIVALIALCFLAIPV------KVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKN 59
           +  ++ L   C +A         K    ++E   + F  + +R+ + Y    E ++RF  
Sbjct: 10  IFILLMLCNTCVIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHNDEREVRFGI 69

Query: 60  FEKSLDIIEELNKNRQSPESARYGITE--FSDLSEEEFKTRHLRHSVNKHVLMSHHK--H 115
           ++ ++  I+  N  + S     Y +T+  F+DL+ EEF++ ++  S     L SH+    
Sbjct: 70  YQANVQYIQCKNAQKNS-----YNLTDNKFADLTNEEFQSTYMGLSTR---LRSHNTGFR 121

Query: 116 HDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
           +D H +                +P  KDWR+ G + ++ +Q  CG CWAF+ V   E ++
Sbjct: 122 YDEHGD----------------LPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGIN 165

Query: 176 ALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAA 234
            +K+G L  LS QE+IDC   +GN GC GG       ++ +    L  E +YP    D  
Sbjct: 166 KIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFI-IENGGLTTEQDYPYEGVDGT 224

Query: 235 CKRKATSPNGVKIKSYTCDTLIPSESSI-LTDIATHGPVIAAVNA--LTWQYYLGGVIQY 291
           CK +  +     I  Y     +P+++   L   A H PV  A++A   ++Q+Y  GV   
Sbjct: 225 CKMEKAAHYAASISGY---EEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSG 281

Query: 292 NCDGSLANINHAVQIVGY 309
            C   L   NH V +VGY
Sbjct: 282 ICGKQL---NHGVTVVGY 296


>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
          Length = 359

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 88/326 (26%), Positives = 152/326 (46%), Gaps = 46/326 (14%)

Query: 6   NVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSL 64
           +V+ ++  +AL   +IP        E+ L  L+  ++  +  S    + D RF  F++++
Sbjct: 10  SVVLVLGSVALA-QSIPFDEKDLASEESLWSLYEKWRAHHAVSRDLDDTDKRFNVFKENV 68

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM---------SHHKH 115
             I E N+ + +  + +  + +F D++ +EF++ +    ++ H+ +         S+ K 
Sbjct: 69  KFIHEFNQKKDA--TYKLALNKFGDMTNQEFRSTYAGSKIDHHMTLRGVKDAGEFSYEKF 126

Query: 116 HDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH 175
           HD                    +P   DWRE G +  V++Q  CG+CWAFSTV   E ++
Sbjct: 127 HD--------------------LPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGIN 166

Query: 176 ALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
            +K   L  LS Q+++DC    N GC+GG      D++  N   L  E  YP L +  +C
Sbjct: 167 QIKTNELVSLSEQQLVDC-DTKNSGCNGGLMDYAFDFIK-NNGGLSSEDSYPYLAEQKSC 224

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNC 293
             +A S   V I  Y  D    +E++++  +A   PV  A+ A    +Q+Y  GV   +C
Sbjct: 225 GSEANSAV-VTIDGYQ-DVPRNNEAALMKAVANQ-PVSVAIEASGYAFQFYSQGVFSGHC 281

Query: 294 DGSLANINHAVQIVGY---DNYSRTW 316
              L   +H V  VGY   D+  + W
Sbjct: 282 GTEL---DHGVAAVGYGVDDDGKKYW 304


>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
 gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
          Length = 356

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 79/273 (28%), Positives = 125/273 (45%), Gaps = 29/273 (10%)

Query: 34  LELFSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           LE F ++Q  Y ++Y+  E    RF  + +++  I+ +N+   +  S   G  +F+DL+E
Sbjct: 35  LERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQ-LSTGSSYELGENQFTDLTE 93

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI---------PVKKD 143
           EEFK  +L                D      +    T G     G+         P   D
Sbjct: 94  EEFKDTYL-------------MKLDEQPPAAEAMGPTVGTMSTAGMSNGNNTGEAPNSVD 140

Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN-MGCS 202
           WR  G + +V++QQ CG+CWAF+TV + E +H +K G L  LS QE++DC   GN  GC 
Sbjct: 141 WRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCR 200

Query: 203 GGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI 262
           GG   + ++W+  N   L  ES+YP +     C       +  +I+ Y    +  +  + 
Sbjct: 201 GGSPRSAMEWVTRNG-GLTTESDYPYVGSQRQCMSGKLGHHAARIRGY--QAVQRNNEAE 257

Query: 263 LTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCD 294
           L       PV   ++A   +Q+Y  GV    CD
Sbjct: 258 LERAVAERPVAVFIDASRAFQFYKSGVFSGPCD 290


>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
 gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
          Length = 325

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 88/308 (28%), Positives = 139/308 (45%), Gaps = 42/308 (13%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
           L +  LIA CF       S+ + +++   +  F   + K+Y+  E D+R   +  +L+I+
Sbjct: 8   LLVAVLIAQCF-------SELSQDRQWHAWKDF---HGKTYTGEEEDLRRAIWNDNLEIV 57

Query: 68  EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
           ++ N    S    +  +  F+DL+  EFK R + +                        S
Sbjct: 58  KKHNAENHS---YKLDMNHFADLTVTEFKQRFMGY-------------------RAASNS 95

Query: 128 ITTGITIPTG---IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
                 +P     +P + DWR+ G +  V+NQ  CG+CWAFS+  + E  H  K G L  
Sbjct: 96  TGGSTFLPLSNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVS 155

Query: 185 LSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           LS Q ++DC+   GN GC GG       ++  N  + + E  YP   +D  C  K  S  
Sbjct: 156 LSEQNLVDCSKKYGNNGCEGGLMDYAFKYIKNNDGI-DTEQSYPYTARDGQCHFKPGSV- 213

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
           G  +  YT D    SE  + + +AT GP+  A++A   ++Q Y  GV     D S   ++
Sbjct: 214 GATVTGYT-DVQRGSEGDLQSAVATVGPISVAIDAGHSSFQLYKTGVYS-EPDCSSTQLD 271

Query: 302 HAVQIVGY 309
           H V  VGY
Sbjct: 272 HGVLAVGY 279


>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 80/289 (27%), Positives = 145/289 (50%), Gaps = 37/289 (12%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           ++   LF S+   + KSY+   E + RF+ F+ +L  I+E  +N       + G+ +F+D
Sbjct: 39  DEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDE--QNLVEDRGFKLGLNKFAD 96

Query: 90  LSEEEFKTRHL---RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
           L+ EE+++++       + K V     ++           +  +G ++P  +    DWRE
Sbjct: 97  LTNEEYRSKYTGIKSKDLRKKVSAKSGRY-----------ATLSGESLPESV----DWRE 141

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
           +G +  V++Q +CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG  
Sbjct: 142 SGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGG-- 199

Query: 207 CALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI- 262
             L+D+     +N   ++ + +YP   +D  C +   +   V I SY     +P+   + 
Sbjct: 200 --LMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSY---EDVPAYDELA 254

Query: 263 LTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           L   A + P+  A+ A    +Q+Y  G+    C  +L   +H V +VGY
Sbjct: 255 LKKAAANQPISVAIEASGRDFQFYDSGIFTGKCGIAL---DHGVVVVGY 300


>gi|145476403|ref|XP_001424224.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124391287|emb|CAK56826.1| unnamed protein product [Paramecium tetraurelia]
          Length = 312

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 85/274 (31%), Positives = 126/274 (45%), Gaps = 29/274 (10%)

Query: 37  FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFK 96
           F S++ +Y KSY+  +   RF NF+ +L+ +   N +       R  + +FSDLSEEEF 
Sbjct: 29  FQSWKTKYGKSYTGEQEVFRFLNFQINLNKVNSHNSDETKTYKMR--MNQFSDLSEEEFA 86

Query: 97  TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQ 156
             +L H  N   ++   +  D   + +KK            I    DWR    I +V++Q
Sbjct: 87  LLYLTH-YNSDEIIEQQQITDDKESSIKKND---------NIKTSVDWRS---ITQVKDQ 133

Query: 157 QTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVN 216
             CG CWAF  V   E+   +KN T  +LS Q++IDC    + GC+GG     L +  V 
Sbjct: 134 GKCGGCWAFGAVGAVEAWFQVKNKTQVVLSEQQLIDCDTQ-SFGCNGGYQNLALKY--VA 190

Query: 217 KVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
              L   + YP   K ++  +  + P       Y  +      SS    + T  P++  V
Sbjct: 191 NHGLNDANVYPYTQKQSSACQYNSGP-------YKTNGAQGVSSSNFKSLLTEYPLVVVV 243

Query: 277 NALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           +A  WQ Y GGV    C  S   +NHAV  VG+D
Sbjct: 244 DASNWQLYGGGVFN-ECSKS---VNHAVLAVGFD 273


>gi|328875652|gb|EGG24016.1| counting factor associated protein [Dictyostelium fasciculatum]
          Length = 529

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 87/288 (30%), Positives = 150/288 (52%), Gaps = 26/288 (9%)

Query: 37  FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F++++ K+Y  + EH+ RF  +++ L  +   N +  S  + +  +  F D+S+EEF
Sbjct: 222 FDQFKKQFGKTYENTLEHNTRFATYKQMLHRVATHNAHN-SESTYKLAMNHFGDMSDEEF 280

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +   + H V++       + HD+                 + +P   DWR +G +  V++
Sbjct: 281 RKFIIPH-VDRDENNGASEVHDNED--------------VSALPASLDWRTSGCVTPVKD 325

Query: 156 QQTCGACWAFSTVETAESMHALK-NGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWM 213
           Q  CG+CW F ++ + E++  LK N  L  LS QE++DCA  G +MGC+GG F +     
Sbjct: 326 QGVCGSCWTFGSLASLETVACLKHNKDLISLSEQELVDCAYVGQSMGCNGG-FASNAYQY 384

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
            +N   +  ES+YP L+++A CK      +GV+++SY   T   SE+++   +AT G V 
Sbjct: 385 IMNAGGIATESDYPYLMQNAYCKASTVQNSGVRVQSYVNVTAF-SEAALQNAVATVGVVA 443

Query: 274 AAVNALT--WQYYLGGVIQYN-CDGSLANINHAVQIVGY--DNYSRTW 316
            A++A    ++YY  GV     C   L  ++H V ++GY  DN  + W
Sbjct: 444 VAIDASAPDFRYYSSGVYYSTVCQSGLDYLDHEVAVLGYGTDNGQQYW 491


>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
          Length = 333

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 89/303 (29%), Positives = 145/303 (47%), Gaps = 31/303 (10%)

Query: 14  IALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN- 71
           + L  L + +  + P  +Q L E ++ +   + K YS  E  +R   +EK+L +IE+ N 
Sbjct: 5   LFLTILCLGIASAAPTHDQSLDEQWNQWTAEHGKVYSTGEESLRRAVWEKNLKMIEQHNL 64

Query: 72  KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
           +  Q   +   G+  F D++ E+F+            +M+  ++  ++   V +      
Sbjct: 65  EYSQGKHTFTMGMNAFGDMTNEDFRQ-----------MMTGFQNQKYNKGEVFQPPQ--- 110

Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
              P  +P   DWRE G +  V+NQ  CG+CWAFS     E     K G L  LS Q ++
Sbjct: 111 ---PLEVPESVDWREKGYVTPVKNQHRCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLV 167

Query: 192 DCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
           DC+    N GC GG       ++  N   L+ E  YP    ++ C+    SP G    + 
Sbjct: 168 DCSQPQHNSGCKGGLVIKAFQYVKDNG-GLDSEESYPYEEMESTCRY---SP-GNSAATV 222

Query: 251 TCDTLIPSESSILTD-IATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINHAVQI 306
           T    IP+E   L   +A+ GP+  A++A   ++Q+Y GG++ + NC  S   +NHAV +
Sbjct: 223 TGFKHIPAEEKALEKAVASVGPISVAIDAHHHSFQFYTGGILHEPNC--SPKWLNHAVLV 280

Query: 307 VGY 309
           VGY
Sbjct: 281 VGY 283


>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
          Length = 459

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 153/316 (48%), Gaps = 31/316 (9%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKN 59
           +F + +++FIV+  AL    I    ++P+ ++   L+ ++  ++ K+Y+   E  +RF  
Sbjct: 8   IFLLFSIIFIVSSSALDLSIIDRAFNRPD-DEIASLYETWLVKHGKNYNGLGEKQLRFNI 66

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDH 118
           F+ +L  ++E N    S    + G+  F+DL+ EE+++ +L     +  V  S     D 
Sbjct: 67  FKDNLRFVDERNSENLS---FKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSGRSKSDR 123

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
           +       +   G T+P  +    DWR+ G +  +++Q +CG+CWAFS +   E ++ + 
Sbjct: 124 Y-------AFRAGDTLPESV----DWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIV 172

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAAC 235
            G L  LS QE+++C  + N GC GG    L+D+     +    ++ + +YP   +D  C
Sbjct: 173 TGDLISLSEQELVECDTSYNDGCDGG----LMDYAFEFIIKNEGIDSDEDYPYTGRDGRC 228

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNC 293
                +   V I  Y  D+ +  E S+   +A   PV  A+      +Q Y  GV    C
Sbjct: 229 DTNRKNAKVVTIDDYE-DSPVYDEKSLQKAVANQ-PVSVAIEGGGRDFQLYDSGVFTGKC 286

Query: 294 DGSLANINHAVQIVGY 309
             +L   +H V +VGY
Sbjct: 287 GTAL---DHGVAVVGY 299


>gi|42564163|gb|AAS20593.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 324

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 82/279 (29%), Positives = 135/279 (48%), Gaps = 25/279 (8%)

Query: 35  ELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSE 92
           E + +F+  + KSY    E   RF  F  +L  IEE N+N  +   +   G+ +F+DL+ 
Sbjct: 21  EKWQNFKINFSKSYQNVVEEKRRFNIFLSNLLRIEEHNQNFSRGLSTYEMGVNKFADLTP 80

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF  R       K   +S     +   +                +P + DW + G + +
Sbjct: 81  EEFMERFRPLRKTKPKFLSEQAKFNFDGD----------------LPAEVDWTKQGAVTE 124

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V++Q +CG+CWAFST  + ES + +K G L  LS Q+++DC  N N GC+GG     L++
Sbjct: 125 VKSQGSCGSCWAFSTTGSVESHNFIKTGKLISLSEQQLVDCVKN-NSGCAGGWMDIALEY 183

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           ++ + ++   E +YP   ++  C R   S   V+IKSY        E  +   +A  GPV
Sbjct: 184 IEADGIM--SEDDYPYEERNTTC-RFNNSKAAVQIKSYKA-IKKNDEIDLQKAVALEGPV 239

Query: 273 IAAVN-ALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
             A+   + +Q Y  G++    C  +  ++ HAV + GY
Sbjct: 240 SVAIEVTIAFQLYARGILNDPQCKNTEGDLTHAVLVTGY 278


>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
          Length = 476

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 89/282 (31%), Positives = 137/282 (48%), Gaps = 32/282 (11%)

Query: 34  LELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           L LF  F  +Y K YS + E D R + F+++L   E++    +   SA YG+T+FSDL+E
Sbjct: 175 LGLFKEFMTKYNKVYSSQEEADRRLQIFKENLKTAEKIQSLDEG--SAEYGVTKFSDLTE 232

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF+  +L   +++  L               +R +       +  P   DWR+ G +  
Sbjct: 233 EEFRLTYLNPLLSQWTL---------------RRPMKPASPARSPAPASWDWRDHGAVSP 277

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+NQ  CG+CWAFS     E    LK+G L  LS QE++DC G  +  C GG      + 
Sbjct: 278 VKNQGLCGSCWAFSVTGNIEGQWFLKHGKLLSLSEQELVDCDGL-DHACRGGLPSNAYEA 336

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL-IPS-ESSILTDIATHG 270
           ++     LE E++Y        C          K+ +Y   ++ +PS E+ +   +A +G
Sbjct: 337 IE-GLGGLEAENDYTYSGHKQKCSFATE-----KVAAYINSSVELPSDENEMAAWLAENG 390

Query: 271 PVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
           PV  A+NA   Q+Y  GV       C+  +  I+HAV +VGY
Sbjct: 391 PVSVALNAFAMQFYKKGVSHPWMILCNPWM--IDHAVLLVGY 430


>gi|4581057|gb|AAD24589.1|AF139913_1 cysteine protease [Trypanosoma congolense]
          Length = 440

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 93/320 (29%), Positives = 156/320 (48%), Gaps = 36/320 (11%)

Query: 4   VKNVLFIVAL--IALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKN 59
            + + F V L  +A CF  +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ 
Sbjct: 7   TRTLGFSVGLHAVAACF--VPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRV 64

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F++++   E   +   +   A +G+T FSD+S EEF+              ++H   +++
Sbjct: 65  FKQNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYY 108

Query: 120 HNHVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
              +K+ R +   + + TG  P   DWR+ G +  V++Q  C + WAFS +   E    +
Sbjct: 109 AAALKRPRKV---VNVSTGKAPPAIDWRKKGAVTPVKDQGQCHSSWAFSAIGNIEGQWKI 165

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK 236
               L+ LS Q ++ C  N + GC GG       W+   NK  +  E  YP         
Sbjct: 166 AGHELTSLSEQMLVSCDTN-DFGCGGGFSDPAFKWIVSSNKGNVFTEQSYPYASGGGNVP 224

Query: 237 R--KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD 294
              K+    G KI+      L   E++I   +A  GPV  AV+A ++Q Y GGV+  +C 
Sbjct: 225 TCDKSGKVVGAKIRDRV--DLPRDENAIAEWLAKKGPVAIAVDATSFQSYTGGVLT-SCI 281

Query: 295 GSLANINHAVQIVGYDNYSR 314
               +++H V +VGYD+ S+
Sbjct: 282 SE--HLDHGVLLVGYDDTSK 299


>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
          Length = 371

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 85/304 (27%), Positives = 148/304 (48%), Gaps = 31/304 (10%)

Query: 22  PVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESA 80
           P   S  + ++ + L+ S+  ++ K+Y+   E + RF+ F+ +L  I+E N N  +  + 
Sbjct: 30  PSSSSWRSDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNT--TY 87

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
           + G+ +F+DL+ +E++ + L    +    +   K     + H        G  +P  +  
Sbjct: 88  KLGLNKFADLTNQEYRAKFLGTRTDPRRRLMKSKIPSSRYAH------RAGDNLPDSV-- 139

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
             DWR+ G +  V++Q +CG+CWAFST+ T E ++ + +G L  LS QE++DC  + + G
Sbjct: 140 --DWRDHGAVSPVKDQGSCGSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDAG 197

Query: 201 CSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
           C+GG    L+D+     ++   ++ E +YP L  +  C     +   V I  Y     +P
Sbjct: 198 CNGG----LMDYAFQFIMDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYED---VP 250

Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY---DNY 312
           +  + L     H PV  A+ A    +Q Y  GV    C   LA ++H V  VGY   DN 
Sbjct: 251 NNENALKKAVAHQPVSIAIEAGGRAFQLYESGVFNGEC--GLA-LDHGVVAVGYGTDDNG 307

Query: 313 SRTW 316
              W
Sbjct: 308 QDYW 311


>gi|30142040|gb|AAN34825.1| cysteine proteinase [Leishmania amazonensis]
 gi|30142042|gb|AAN34826.1| cysteine proteinase [Leishmania amazonensis]
 gi|30142572|gb|AAP21894.1| cysteine proteinase [Leishmania amazonensis]
          Length = 354

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 90/316 (28%), Positives = 149/316 (47%), Gaps = 32/316 (10%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSYS-KSEHDIRFKNFEKS 63
            + +  L  +C+ +  +  + P ++  +    + SF++R+ K++   +E   RF  F+++
Sbjct: 10  AIVVTILFVVCYGSALIAQTPPAVDNFVASAHYGSFKKRHSKAFGGDAEEGHRFNAFKQN 69

Query: 64  LDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           +     LN   Q+P  A Y ++ +F+DL+ +EF   +L    N     SH K H      
Sbjct: 70  MQTAYFLNT--QNPH-AHYDVSGKFADLTPQEFAKLYL----NPDYYTSHLKDH------ 116

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
             K  +    + P+G+ +  DWR+ G +  V+NQ  CG+CWAFS +   E   A    +L
Sbjct: 117 --KEDVHVDDSAPSGV-MSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSL 173

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAA---CKRK 238
             LS Q ++ C  N + GC+GG     ++W M  +   +  E+ YP          C  +
Sbjct: 174 VSLSEQMLVSC-DNVDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCHDE 232

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
                G KI  +   +L   E  I   +   GPV  AV+A TWQ Y GGV+      SL 
Sbjct: 233 GEV--GAKITGFL--SLPHDEERIADWVEKRGPVAVAVDATTWQLYFGGVVSLCLAWSL- 287

Query: 299 NINHAVQIVGYDNYSR 314
             NH V IVG++  ++
Sbjct: 288 --NHGVLIVGFNKNAK 301


>gi|343477207|emb|CCD11901.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 89/320 (27%), Positives = 156/320 (48%), Gaps = 36/320 (11%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYRDATEEAFRFRVFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 67  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   + + TG  P+  DWR+ G +  V++Q  C + WAFS +   E    +  
Sbjct: 111 ALKRPRKV---VNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSAIGNIEGQWKIAG 167

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDA---AC 235
             L+ LS Q ++ C  + + GC GG       W +  NK  +  E  YP          C
Sbjct: 168 HELTSLSEQMLVSCDTD-DFGCRGGFSDPAFKWILWSNKGNVFTEQSYPYASGGGNVPTC 226

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTD-IATHGPVIAAVNALTWQYYLGGVIQYNCD 294
           K       G KI +      +P +  ++T+ +A  GPV  AV+A ++Q Y GGV+  +C 
Sbjct: 227 KMSGKV-VGAKISN---RLYLPEDEDMITEWLARKGPVAIAVDATSFQSYTGGVLT-SCI 281

Query: 295 GSLANINHAVQIVGYDNYSR 314
                +N+   +VGYD+ S+
Sbjct: 282 SK--EMNYGALLVGYDDTSK 299


>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
 gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
 gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
 gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
          Length = 334

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 87/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)

Query: 25  VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
           ++ P  +Q     +  ++  +++ Y  +E + R   +EK++ II+  N    + +     
Sbjct: 16  LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRIIQLHNGEYSNGQHGFSM 75

Query: 83  GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
            +  F D++ EEF     R  VN +           H  H K R     + +   IP   
Sbjct: 76  EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118

Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
           DWRE G +  V+NQ  CG+CWAFS     E    LK G L  LS Q ++DC+   GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
           +GG       ++  N   L+ E  YP   KD +CK +A       + + T    IP  E 
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233

Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +++  +AT GP+  A++A   + Q+Y  G I Y  + S  N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283


>gi|157868354|ref|XP_001682730.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
 gi|68126185|emb|CAJ07238.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
          Length = 354

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 92/325 (28%), Positives = 154/325 (47%), Gaps = 49/325 (15%)

Query: 4   VKNVLFIV----ALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFK 58
           V  +LF+V    AL+A   L +   ++  +       +  F++R+ KS+ + ++   RF 
Sbjct: 12  VVTILFVVCYGSALVAQTPLGVDNFIASAH-------YGRFKERHGKSFGEDADEGHRFN 64

Query: 59  NFEKSLDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
            F++++     LN +      A Y ++ +F+DL+ +EF   +L      H    + +H  
Sbjct: 65  AFKQNMQTAYFLNTHN---PHAHYDVSGKFADLTPQEFAKLYLNPDYYAHRGKDYKEH-- 119

Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
               HV    ++  +++        DWRE G +  V+NQ  CG+CWAFS +   ES  AL
Sbjct: 120 ---VHVDDSVLSGAMSV--------DWREKGAVTPVKNQGMCGSCWAFSAIGNIESQWAL 168

Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAACK 236
           KN +L  LS Q ++ C  + + GC+GG     ++W +  +   +  E  YP         
Sbjct: 169 KNHSLVSLSEQMLVSC-DDIDDGCNGGLMDQAMEWIIQHHNGTVPTEKSYPY------AS 221

Query: 237 RKATSPN-------GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI 289
              TSP        G +I  Y   +L   E +I   +   GPV  AV+A TWQ Y GGV+
Sbjct: 222 AGGTSPPCHDKGEFGARISGYM--SLPHDEKAIAAYVEKKGPVAVAVDATTWQLYFGGVV 279

Query: 290 QYNCDGSLANINHAVQIVGYDNYSR 314
              C G   ++NH V +VG++  ++
Sbjct: 280 TL-CFG--LSLNHGVLVVGFNKRAK 301


>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
 gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
          Length = 456

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 78/285 (27%), Positives = 136/285 (47%), Gaps = 31/285 (10%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           ++ + ++  +  ++ K+Y+   E + RF+ F+ +L  I++ N   ++      G+  F+D
Sbjct: 36  DEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNSENRT---YTVGLNRFAD 92

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ EEF++ +L                  H   + K S      +   +P   DWR+ G 
Sbjct: 93  LTNEEFRSMYLGTRTG-------------HKKRLPKTSDRYAPRVGDSLPDSVDWRKEGA 139

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           + +V++Q  CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG    L
Sbjct: 140 VAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGG----L 195

Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
           +D+     +N   ++ E +YP L +D  C     +   V I SY  + +  ++ + L   
Sbjct: 196 MDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSY--EDVPENDETALKKA 253

Query: 267 ATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             + PV  A+      +Q Y  GV    C  SL   +H V  VGY
Sbjct: 254 VANQPVSVAIEGGGRNFQLYNSGVFTGECGTSL---DHGVAAVGY 295


>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
          Length = 338

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 92/315 (29%), Positives = 156/315 (49%), Gaps = 34/315 (10%)

Query: 9   FIVALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
            +V L++LC+ LA+   +    L++  +L+ ++ Q   KSY ++E   R   +E++L  I
Sbjct: 3   LLVCLVSLCWGLAVSAPLGDSELDRHWKLWKNWHQ---KSYHEAEEGWRRTVWEENLKAI 59

Query: 68  EELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           +  N  +     + R G+ +F DL+ EEF+            +++  +H     N +   
Sbjct: 60  QLHNLEQSLGLHTYRLGMNQFGDLTNEEFQE-----------ILTGERHFSKG-NRINGS 107

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
           +      +   +P   DWR+ G +  V+NQ  CG+CWAFST    E     K+G L  LS
Sbjct: 108 AFLEANFVQ--VPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLISLS 165

Query: 187 VQEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAA-CKRK---ATS 241
            Q ++DC+   GN GC GG       ++  N+ + + E  YP   KD A C  K   AT+
Sbjct: 166 EQNLVDCSWQQGNQGCHGGIVDLAFQYILQNQGI-DSEDCYPYTAKDTAQCTFKPECATA 224

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
           P    +  +  D    SE +++  +AT GPV   ++A   ++++Y  G+  Y+   S  +
Sbjct: 225 P----VTGFV-DIPPHSEEALMKAVATVGPVSVGIDASSTSFRFYQSGIF-YDPKCSSES 278

Query: 300 INHAVQIVGYDNYSR 314
           ++HAV +VGY  Y R
Sbjct: 279 LDHAVLVVGY-GYER 292


>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
 gi|255636729|gb|ACU18700.1| unknown [Glycine max]
          Length = 341

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 86/282 (30%), Positives = 138/282 (48%), Gaps = 30/282 (10%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           E    +  +Y K Y   +E + RF+ F+ ++  IE  N     P +    I +F+DL +E
Sbjct: 33  ERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFN--LSINQFADLHDE 90

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           EFK   L +   K   +         + +V K            IP   DWR+ G +  +
Sbjct: 91  EFKAL-LNNVQKKASRVETATETSFRYENVTK------------IPSTMDWRKRGAVTPI 137

Query: 154 RNQQ-TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           ++Q  TCG+CWAF+TV T ES+H +  G L  LS QE++DC    + GC GG      ++
Sbjct: 138 KDQGYTCGSCWAFATVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEF 197

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHG 270
           +  NK  +  E+ YP   KD +CK K  +    +I  Y     +P  SE ++L  +A   
Sbjct: 198 I-ANKGGITSEAYYPYKGKDRSCKVKKETHGVARIIGYES---VPSNSEKALLKAVANQ- 252

Query: 271 PVIAAVN--ALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
           PV   ++  A+ +++Y  G+ +  NC     +++HAV +VGY
Sbjct: 253 PVSVYIDAGAIAFKFYSSGIFEARNCG---THLDHAVAVVGY 291


>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 333

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 93/311 (29%), Positives = 149/311 (47%), Gaps = 35/311 (11%)

Query: 6   NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLD 65
           +VL ++AL   C LA   K     L Q  +L+   ++   K YS +E  +R   +E +L 
Sbjct: 5   SVLAVLALAFSCTLAFDAK-----LNQHWKLW---KEANNKRYSDAEEHVRRATWEGNLQ 56

Query: 66  IIEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
            ++E N +      +   G+ +++D++  EF    ++     +  M   +  D H     
Sbjct: 57  KVQEHNLQADLGVHTYWLGMNKYADMTVTEF----VKVMNGYNATMRGQRTQDRH----- 107

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
             S  + I +P  +    DWR+ G +  V++Q  CG+CWAFST    E  H  + G L  
Sbjct: 108 TFSFNSKIALPDTV----DWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVS 163

Query: 185 LSVQEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           LS Q ++DC+G  GNMGC+GG      +++  N  + + E  YP    D  C+ KA +  
Sbjct: 164 LSEQNLVDCSGKQGNMGCNGGLMDQAFEYIKENNGI-DTEDSYPYEAVDNQCRFKAANV- 221

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN---CDGSLA 298
           G     +T D     ES++   +AT GP+  A++A   ++Q Y  GV  YN   C  S  
Sbjct: 222 GATDTGFT-DITSKDESALQQAVATVGPISVAIDAGHTSFQLYKHGV--YNEPFC--SQT 276

Query: 299 NINHAVQIVGY 309
            ++H V  VGY
Sbjct: 277 RLDHGVLAVGY 287


>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 457

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 81/293 (27%), Positives = 137/293 (46%), Gaps = 23/293 (7%)

Query: 19  LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPE 78
           + +P  V K  L      F+++  ++ K YS +E   R   F    D +E + ++ +   
Sbjct: 29  IRMPTDVGKDQLLAGQ--FAAWAHKHGKVYSAAEE--RAHRFLVWKDNLEYIQRHSEKNL 84

Query: 79  SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
           S   G+T+F+DL+ EEF+ ++    +++   +   ++      +    +           
Sbjct: 85  SYWLGLTKFADLTNEEFRRQYTGTRIDRSRRLKKGRNATGSFRYANSEA----------- 133

Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
           P   DWRE G +  V++Q +CG+CWAFS V + E ++A++ G    LSVQE++DC    N
Sbjct: 134 PKSIDWREKGAVTSVKDQGSCGSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKKYN 193

Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
            GC+GG      D++ +    ++ E +YP    D  C     +   V I SY  D     
Sbjct: 194 QGCNGGLMDYAFDFV-IQNGGIDTEKDYPYQGYDGRCDVNKMNARVVTIDSYE-DVPEND 251

Query: 259 ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           E ++   +A   PV  A+ A    +Q Y GGV    C     +++H V  VGY
Sbjct: 252 EEALKKAVAGQ-PVSVAIEAGGRDFQLYSGGVFTGRCG---TDLDHGVLAVGY 300


>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
          Length = 368

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 85/315 (26%), Positives = 148/315 (46%), Gaps = 37/315 (11%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFKNFEK 62
           +L      +L   ++ + +  P      E+ + +++   +++K Y+   E D RF+ F+ 
Sbjct: 6   ILPFFLFFSLITFSLALDIQLPTGRSNDEVMTMYEEWLVKHQKVYNGLREKDQRFQIFKD 65

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHH 120
           +L+ I+E N    +      G+ +F+D++ EE++  +L  R  + + ++ +    H + +
Sbjct: 66  NLNFIDEHNAQNYT---YIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHRYAY 122

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
           N   +            +PV  DWR  G I  +++Q +CG+CWAFST+ T E+++ +  G
Sbjct: 123 NSGDR------------LPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTG 170

Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKR 237
            L  LS QE++DC    N GC+GG    L+D+     +    ++ +  YP    +  C  
Sbjct: 171 KLVSLSEQELVDCDRAFNEGCNGG----LMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDP 226

Query: 238 KATSPNGVKIKSYTCDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCD 294
                  V I  Y     +PS + + L     H PV  A+ A     Q Y  GV    C 
Sbjct: 227 TRKKAKIVSIDGYED---VPSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCG 283

Query: 295 GSLANINHAVQIVGY 309
            SL   +HAV IVGY
Sbjct: 284 TSL---DHAVVIVGY 295


>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 149/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ N+L  +  +   F       S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEL 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DW E+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+Y GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFYAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|255550445|ref|XP_002516273.1| cysteine protease, putative [Ricinus communis]
 gi|223544759|gb|EEF46275.1| cysteine protease, putative [Ricinus communis]
          Length = 358

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 134/278 (48%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F  R+ K Y S+ E  +RF  F ++LD I   N+   S   A   + +F+DL+ +EF
Sbjct: 59  FSRFVYRHGKRYQSEDEMKMRFAIFSENLDFIRSTNRKGLSYTLA---VNDFADLTWQEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N       +HK               TG+ +P      KDWRE GI+  V+
Sbjct: 116 QKHRLGAAQNCSATTKGNHK--------------LTGVALPD----TKDWREVGIVSPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           NQ  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC GG      +++
Sbjct: 158 NQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNFGCHGGLPSQAFEYI 217

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   LE E  YP   +D ACK  + +  G+++   + +  + +E  +   +    PV 
Sbjct: 218 KYNG-GLETEEAYPYTGEDGACKFSSENV-GIQVLD-SVNITLGAEDELKEAVGLVRPVS 274

Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
            A   ++ +++Y  GV   + C  +  ++NHAV  VGY
Sbjct: 275 VAFEVVSGFRFYKSGVYTSDTCGSTPMDVNHAVLAVGY 312


>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
            castaneum]
          Length = 1726

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 90/287 (31%), Positives = 147/287 (51%), Gaps = 37/287 (12%)

Query: 31   EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
            E  L LF+ F ++Y K Y K E+  RF  F ++L  I  LN   Q   +A YGIT F+D+
Sbjct: 1417 EYHLSLFTDFLKKYNKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQG--TATYGITRFADM 1474

Query: 91   SEEEF-KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAG 148
            +++EF ++  LR  +                   +  +      IP   +P + DWR+  
Sbjct: 1475 TQKEFSRSLGLRTDLRN-----------------ENETPFAQAKIPNIELPKEFDWRKKN 1517

Query: 149  IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
            ++ +V+NQ+ CG+CWAFS     E  +AL++G L   S QE++DC  + + GC+GG    
Sbjct: 1518 VVTEVKNQEQCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTD-DQGCNGG---- 1572

Query: 209  LLD--WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
            L+D  +  + K+  LE E +YP   +D  C    T     +++      +  +E+ +   
Sbjct: 1573 LMDTAYRSIEKIGGLETEQDYPYDAEDEKCHFNRTL---ARVQVTGALNISHNETDMAKW 1629

Query: 266  IATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
            +  +GP+  A+NA   Q+Y+GGV    ++ C  S  N++H V IVGY
Sbjct: 1630 LVANGPISIAINANAMQFYMGGVSHPFKFLC--SPKNLDHGVLIVGY 1674


>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
          Length = 465

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 78/285 (27%), Positives = 136/285 (47%), Gaps = 31/285 (10%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           ++ + ++  +  ++ K+Y+   E + RF+ F+ +L  I++ N   ++      G+  F+D
Sbjct: 45  DEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNSENRT---YTVGLNRFAD 101

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ EEF++ +L                  H   + K S      +   +P   DWR+ G 
Sbjct: 102 LTNEEFRSMYLGTRTG-------------HKKRLPKTSDRYAPRVGDSLPDSVDWRKEGA 148

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           + +V++Q  CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG    L
Sbjct: 149 VAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGG----L 204

Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
           +D+     +N   ++ E +YP L +D  C     +   V I SY  + +  ++ + L   
Sbjct: 205 MDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSY--EDVPENDETALKKA 262

Query: 267 ATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             + PV  A+      +Q Y  GV    C  SL   +H V  VGY
Sbjct: 263 VANQPVSVAIEGGGRNFQLYNSGVFTGECGTSL---DHGVAAVGY 304


>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
          Length = 443

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 83/280 (29%), Positives = 126/280 (45%), Gaps = 31/280 (11%)

Query: 40  FQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQS--PESARYGITEFSDLSEEEFK 96
           +  +Y + YS + E   RF+ F+ ++ +IE +N        E+ R     F+DL+++EF+
Sbjct: 44  WMAKYDRVYSDAAEKARRFEVFKANMALIESVNAGNHKFWLEANR-----FADLTDDEFR 98

Query: 97  TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT----GIPVKKDWREAGIIGK 152
                          +        +  + R+ TTG          +P   DWR  G +  
Sbjct: 99  A----------TWTGYRPKTAAASSKGRSRTATTGFKYANVSLDDVPASVDWRTKGAVTP 148

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLD 211
           ++NQ  CG CWAFS V + E +  L  G L  LS QE++DC  NG + GC GG+     D
Sbjct: 149 IKNQGECGCCWAFSAVASMEGVVKLSTGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFD 208

Query: 212 WMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGP 271
           ++ V    L  ES YP    D  C     S +   IK Y  D     E+S+   +A   P
Sbjct: 209 FI-VGNGGLTTESRYPYTASDGTCNSNEASGDAASIKGYE-DVPANDEASLRKAVANQ-P 265

Query: 272 VIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           V  AV+     +++Y GGV+   C   L   +H +  VGY
Sbjct: 266 VSVAVDGGDSHFRFYKGGVLSGACGTEL---DHGIAAVGY 302


>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
 gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 94/310 (30%), Positives = 143/310 (46%), Gaps = 36/310 (11%)

Query: 6   NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSL 64
           N L+I AL  L    I V     +     +LF ++ + + KSY S+ E   R K FE + 
Sbjct: 2   NFLYIFALTLL----ISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNY 57

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
           D + + N    S  S    +  F+DL+  EFKT  L   ++   L   H++ +       
Sbjct: 58  DFVTKHNSKGNSSYS--LALNAFADLTHHEFKTSRL--GLSAAPLNLAHRNLE------- 106

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
                TG+     IP   DWR  G++  V++Q +CGACW+FS     E ++ +  G+L  
Sbjct: 107 ----ITGVV--GDIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVS 160

Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATS 241
           LS QE+I+C  + N GC GG    L+D+     +N   ++ E +YP   +D  C +    
Sbjct: 161 LSEQELIECDKSYNDGCGGG----LMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMK 216

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLAN 299
              V I  Y  D    +E  +L  +A   PV   +  +   +Q Y  G+    C  SL  
Sbjct: 217 RRVVTIDKYV-DVPENNEKQLLQAVAAQ-PVSVGICGSERAFQMYSKGIFTGPCSTSL-- 272

Query: 300 INHAVQIVGY 309
            +HAV IVGY
Sbjct: 273 -DHAVLIVGY 281


>gi|145506497|ref|XP_001439209.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406393|emb|CAK71812.1| unnamed protein product [Paramecium tetraurelia]
          Length = 349

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 82/284 (28%), Positives = 139/284 (48%), Gaps = 32/284 (11%)

Query: 34  LELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSP-ESARYGITEFSDLSE 92
           ++++ ++Q+ + K Y++ E+  RF  F+K+   I+E  +  ++  E+   G+ +F+DLS 
Sbjct: 37  MKVYQNWQKEHGKRYTQFENSHRFGIFKKNYQYIQEHQQRVEAGLETFELGLNDFADLSV 96

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF+ ++L++                  N V +R   TG  +P  + ++KD    G++ +
Sbjct: 97  EEFEAKYLKYRSTPR----------EQTNQVYRR---TGKQVPIEVDLRKD----GVVSE 139

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLS--LLSVQEVIDCAGNGNM---GCSGGDFC 207
           V+NQ +CG+CWAFS V   E+  AL+ G +    LS QE++DCA        GC GG+  
Sbjct: 140 VKNQGSCGSCWAFSAVAALET--ALRQGGVKNVELSEQELVDCAVKDEFESEGCDGGEMY 197

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
               +   +K  +   SEYP    D  C  K T     +   Y    + P  +    + A
Sbjct: 198 DGFQY--ASKYGIAIRSEYPYAGVDQKCAAKQTKTR-YQFAGYV--DVEPLSAQAYVEAA 252

Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +   +   +NA  + +Q Y  G+    CDGS   +NH V  VGY
Sbjct: 253 SEHALSIGINASGINFQLYKKGIYSAKCDGSKPALNHGVTNVGY 296


>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
          Length = 316

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 143/313 (45%), Gaps = 48/313 (15%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKL---ELFSSFQQRYKKSYSKSEHDIRFKNF 60
           +K++ F++  +AL            NL       +LF +F+ +Y K+Y  SE + R K  
Sbjct: 1   MKSIFFVLFAVALSL----------NLHSDAYYEKLFQTFEAKYGKNYLSSEREYRKKVL 50

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
             ++D IE+ N +  S      G+T F+D++  EF T  L   + K +            
Sbjct: 51  AYNMDWIEKFNSDEHS---FTLGMTPFADMTNTEFATSKLCGCMKKPL------------ 95

Query: 121 NHVKKRSITTGITIPTGIPVKK-DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
           NH + R +         + V+  DWRE G +  V+NQ +CG+CWAFS     E  + +  
Sbjct: 96  NHKQARVLNN-------MAVESIDWREKGAVTPVKNQGSCGSCWAFSATGALEGGNFVAT 148

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
           G L  LS Q+++DC    + GC GG      ++  V K  L  E +YP   KD  CK   
Sbjct: 149 GKLVSLSEQQLVDCDTE-DAGCGGGFMDTAFEY--VMKKGLCTEEDYPYHAKDEDCKDDQ 205

Query: 240 TSPNGVKIKSYTCDTLIPSESSI-LTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGS 296
            +     + S T    +P+   + L    T  PV  A+ A    +Q Y GGV+  +  G+
Sbjct: 206 CT----SVISITGYEDVPANDGVALKQALTKAPVSVAIQADSFVFQMYTGGVLDSDMCGT 261

Query: 297 LANINHAVQIVGY 309
             ++NH V  VGY
Sbjct: 262 --SLNHGVLAVGY 272


>gi|118365720|ref|XP_001016080.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297847|gb|EAR95835.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 335

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 95/311 (30%), Positives = 152/311 (48%), Gaps = 28/311 (9%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN--FEKSL 64
           +L I+ L+ LC LA  + V      +KL  ++ +  +Y++ Y  +EH+  F+   F + L
Sbjct: 6   LLSIIMLMPLC-LAQDINV------EKLLAYNKWSSQYQRVY-LNEHEKLFRQMIFFEKL 57

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHV 123
             ++E N N  +  S    + +FSD+++EEF  + L +  +  H+     +   H+  + 
Sbjct: 58  QKMKEHNSNPNNTYSIH--LNQFSDMTKEEFTQKILMKQDLVGHLTKGASQEATHNDVNS 115

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
           + +  +   T+   I    DWR  G +  V+NQ  CG+CW+FS     ES + +KN  L 
Sbjct: 116 EAQLNSKSPTLAASI----DWRTKGAVTSVKNQGNCGSCWSFSAAGLMESFNFIKNKALV 171

Query: 184 LLSVQEVIDC--AGNG-NM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
             S Q+++DC  A NG N+ GC GG     +D+   +KV +     YP +     C    
Sbjct: 172 DFSEQQLLDCVIAANGYNIHGCDGGWPAYCVDY--ASKVGITTLKNYPYVGVQNKCNVTG 229

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
           T+ NG K K +     +P+ S+ L       PV   V+A  W  Y  G+    CD SL  
Sbjct: 230 TN-NGFKPKQW---NQVPNTSNDLKMALNFSPVSVLVDANNWDGYQSGIFN-GCDQSLII 284

Query: 300 INHAVQIVGYD 310
           +NHAV  VGYD
Sbjct: 285 LNHAVLAVGYD 295


>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
 gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
          Length = 372

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 78/276 (28%), Positives = 129/276 (46%), Gaps = 22/276 (7%)

Query: 36  LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           L+  ++ R+  +    +   RF  F++++ +I + N+ R  P   R  +  F D++ +EF
Sbjct: 46  LYERWRGRHAVARDLGDKARRFNVFKENVRLIHDFNQ-RDEPYKLR--LNRFGDMTADEF 102

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +         +H   S   HH       +  + +        +P   DWR+ G +  V++
Sbjct: 103 R---------RHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKGAVTDVKD 153

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
           Q  CG+CWAFST+   E ++A+K   L+ LS Q+++DC   GN GC GG       ++  
Sbjct: 154 QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDYAFQYIAK 213

Query: 216 NKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAA 275
           +  V   E  YP   + A+CK K+ +P  V I  Y  + +  ++ S L     H PV  A
Sbjct: 214 HGGVA-AEDAYPYKARQASCK-KSPAP-AVTIDGY--EDVPANDESALKKAVAHQPVSVA 268

Query: 276 VNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           + A    +Q+Y  GV    C   L   +H V  VGY
Sbjct: 269 IEASGSHFQFYSEGVFAGRCGTEL---DHGVTAVGY 301


>gi|395509415|ref|XP_003758993.1| PREDICTED: cathepsin L1-like, partial [Sarcophilus harrisii]
          Length = 323

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 86/287 (29%), Positives = 140/287 (48%), Gaps = 32/287 (11%)

Query: 28  PNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY-GITE 86
           P L+ + ELF S    Y+K+Y++ E   R + +EK++  I + N   +  + + Y G+  
Sbjct: 2   PELDSEWELFKS---TYEKNYTEKEESFRKQVWEKNMKFINDQNLLYKEGKLSYYLGMNN 58

Query: 87  FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
             DL+++EFK       +N  +L             V++ + T   +I + +P   DWRE
Sbjct: 59  LGDLTDKEFKIM-----LNPSMLQ-----------RVRRDTTTKNFSIFSHLPKSVDWRE 102

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
            G I  VR Q  CG+CWAFS     E    LK G L  LS Q +IDC+     GC GG  
Sbjct: 103 KGFITPVRQQGRCGSCWAFSATGAVEGQLFLKTGKLVELSKQNLIDCS--KFQGCHGGTV 160

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILT 264
            +   ++  N+ ++  E  YP + K  +     +    VKI+ Y    ++P  +E  ++ 
Sbjct: 161 TSAFKYIKKNEGIVSEEC-YPYVAKKNSLCSYRSECAAVKIRDY---VVLPYGNEEILME 216

Query: 265 DIATHGPVIAAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            +A  GPV  ++NA  +  +Y GG+ ++  C       NHA+ +VGY
Sbjct: 217 AVAIVGPVSVSLNAQKSLHFYKGGIYVEPKCKPRYT--NHALLLVGY 261


>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
 gi|194701798|gb|ACF84983.1| unknown [Zea mays]
 gi|194704800|gb|ACF86484.1| unknown [Zea mays]
 gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
 gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
          Length = 470

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 79/267 (29%), Positives = 124/267 (46%), Gaps = 28/267 (10%)

Query: 49  SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV---N 105
            + E D RF  F  +L  ++  N+ R      R G+ +F+DL+ +EF+  +L   V    
Sbjct: 74  GEGERDRRFLVFWDNLRFVDAHNE-RAGARGFRLGMNQFADLTNDEFRAAYLGAMVPAAR 132

Query: 106 KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAF 165
           +  ++     HD                    +P   DWRE G +  V+NQ  CG+CWAF
Sbjct: 133 RGAVVGERYRHDGAAEE---------------LPESVDWREKGAVAPVKNQGQCGSCWAF 177

Query: 166 STVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPES 224
           S V + ES++ +  G +  LS QE+++C+ + GN GC+GG   A  D++ +    ++ E 
Sbjct: 178 SAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFI-IKNGGIDTED 236

Query: 225 EYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQ 282
           +YP    D  C     +   V I  +  D     E S+   +A H PV  A+ A    +Q
Sbjct: 237 DYPYRAVDGKCDMNRKNARVVSIDGFE-DVPENDEKSLQKAVA-HQPVSVAIEAGGREFQ 294

Query: 283 YYLGGVIQYNCDGSLANINHAVQIVGY 309
            Y  GV   +C     N++H V  VGY
Sbjct: 295 LYKSGVFSGSC---TTNLDHGVVAVGY 318


>gi|449469923|ref|XP_004152668.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
 gi|449520697|ref|XP_004167370.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 371

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 85/286 (29%), Positives = 145/286 (50%), Gaps = 40/286 (13%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F+ ++ K+Y+   EHD RF+ F+ +L    +  ++++    A +G+T FSDL+E EF
Sbjct: 58  FQDFKLKFGKTYTTDEEHDYRFRVFKANL---RKAKRHQKLDPDAVHGVTRFSDLTESEF 114

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
           +   +   +N+  L +     D H   +          +PT  +    DWR+ G +  V+
Sbjct: 115 RENFV--GLNRLRLPA-----DAHQAPI----------LPTDNLASDFDWRDQGAVTPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           +Q +CG+CW+FS V   E  + L  G L  LS Q+++DC        AG  + GC+GG  
Sbjct: 158 DQGSCGSCWSFSAVGALEGANFLSTGKLISLSEQQLVDCDHECDPEEAGACDAGCNGGLM 217

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDTLIPSESS-ILT 264
            +  +++ V    LE E +YP    D  +CK +    NG    S    ++I +++  I  
Sbjct: 218 TSAFEYI-VKAGGLEREEDYPYTGTDRGSCKFQ----NGKIAASAANFSVISNDADQIAA 272

Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           ++  +GP+   +NA+  Q Y+ G+   Y C  S  N++H V +VGY
Sbjct: 273 NLVKNGPLAIGINAVFMQTYMKGISCPYIC--SKRNLDHGVLLVGY 316


>gi|23397070|gb|AAN31820.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
          Length = 358

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y    E  +RF  F+++LD+I   NK   S    + G+ +F+DL+ +EF
Sbjct: 59  FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADLTWQEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L   HK                       +P  KDWRE GI+  V+
Sbjct: 116 QRTKLGAAQNCSATLKGSHK------------------VTEAALPETKDWREDGIVSPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC+GG      +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP   KD  CK  A +  GV++ + + +  + +E  +   +    PV 
Sbjct: 218 KSNG-GLDTEKAYPYTGKDETCKFSAENV-GVQVLN-SVNITLGAEDELKHAVGLVRPVS 274

Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            A   + +++ Y  GV    +C  +  ++NHAV  VGY
Sbjct: 275 IAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312


>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
          Length = 1761

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 90/287 (31%), Positives = 147/287 (51%), Gaps = 37/287 (12%)

Query: 31   EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
            E  L LF+ F ++Y K Y K E+  RF  F ++L  I  LN   Q   +A YGIT F+D+
Sbjct: 1452 EYHLSLFTDFLKKYNKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQG--TATYGITRFADM 1509

Query: 91   SEEEF-KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAG 148
            +++EF ++  LR  +                   +  +      IP   +P + DWR+  
Sbjct: 1510 TQKEFSRSLGLRTDLRN-----------------ENETPFAQAKIPNIELPKEFDWRKKN 1552

Query: 149  IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
            ++ +V+NQ+ CG+CWAFS     E  +AL++G L   S QE++DC  + + GC+GG    
Sbjct: 1553 VVTEVKNQEQCGSCWAFSVTGNVEGQYALRHGKLLEFSEQELVDCDTD-DQGCNGG---- 1607

Query: 209  LLD--WMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
            L+D  +  + K+  LE E +YP   +D  C    T     +++      +  +E+ +   
Sbjct: 1608 LMDTAYRSIEKIGGLETEQDYPYDAEDEKCHFNRTL---ARVQVTGALNISHNETDMAKW 1664

Query: 266  IATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
            +  +GP+  A+NA   Q+Y+GGV    ++ C  S  N++H V IVGY
Sbjct: 1665 LVANGPISIAINANAMQFYMGGVSHPFKFLC--SPKNLDHGVLIVGY 1709


>gi|343477446|emb|CCD11725.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 88/319 (27%), Positives = 156/319 (48%), Gaps = 34/319 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYRDATEEAFRFRVFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 67  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   + + TG  P   DWR+ G +  V++Q  C + WAF+ +   E    +  
Sbjct: 111 ALKRPRKV---VNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIGNIEGQWKIAG 167

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDA---AC 235
             L+ LS Q ++ C  N ++GC  G       W+   N   +  E  YP         AC
Sbjct: 168 HELTSLSEQMLVSCDTN-DLGCRAGFMDTAFKWIVSPNDGNVFTEQSYPYASGGGNVPAC 226

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDG 295
             K+    G  I  +    ++ +E++I   +A +GPV  AV+A ++Q Y GGV+  +C  
Sbjct: 227 N-KSGKVVGANIDDHV--HILDNENAIAEWLAKNGPVAIAVDATSFQRYTGGVLT-SCIS 282

Query: 296 SLANINHAVQIVGYDNYSR 314
               +N A  +VGYD+ S+
Sbjct: 283 K--EVNSAALLVGYDDTSK 299


>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
 gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
 gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 378

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 85/301 (28%), Positives = 131/301 (43%), Gaps = 27/301 (8%)

Query: 21  IPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDI---------RFKNFEKSLDIIEEL 70
           IP   S  + E+ L  L+  ++ RY  S   +   +         RF  F ++   I E 
Sbjct: 25  IPFTESDLSSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEA 84

Query: 71  NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           N+    P   R  + +F+D++ +EF+  +       H  +S  +         +  S   
Sbjct: 85  NRRGGRP--FRLALNKFADMTTDEFRRTYAGSRARHHRSLSGGRGG-------EGGSFRY 135

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
           G      +P   DWRE G +  +++Q  CG+CWAFSTV   E ++ +K G L  LS QE+
Sbjct: 136 GGDDEDNLPPAVDWRERGAVTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQEL 195

Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
           +DC    N GC GG       ++  N  +   ES YP   +   C +   S + V I  Y
Sbjct: 196 VDCDTGDNQGCDGGLMDYAFQFIKRNGGIT-TESNYPYRAEQGRCNKAKASSHDVTIDGY 254

Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
             D     ES++   +A   PV  AV A    +Q+Y  GV    C     +++H V  VG
Sbjct: 255 E-DVPANDESALQKAVANQ-PVAVAVEASGQDFQFYSEGVFTGECG---TDLDHGVAAVG 309

Query: 309 Y 309
           Y
Sbjct: 310 Y 310


>gi|18424347|ref|NP_568921.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|71152227|sp|Q8H166.2|ALEU_ARATH RecName: Full=Thiol protease aleurain; Short=AtALEU; AltName:
           Full=Senescence-associated gene product 2; Flags:
           Precursor
 gi|7230640|gb|AAF43041.1|AF233883_1 AALP protein [Arabidopsis thaliana]
 gi|13430722|gb|AAK25983.1|AF360273_1 putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|9757740|dbj|BAB08221.1| AALP protein [Arabidopsis thaliana]
 gi|21617934|gb|AAM66984.1| cysteine proteinase AALP [Arabidopsis thaliana]
 gi|23397068|gb|AAN31819.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|23397074|gb|AAN31822.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|24417304|gb|AAN60262.1| unknown [Arabidopsis thaliana]
 gi|222423506|dbj|BAH19723.1| AT5G60360 [Arabidopsis thaliana]
 gi|222424411|dbj|BAH20161.1| AT5G60360 [Arabidopsis thaliana]
 gi|332009930|gb|AED97313.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 358

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y    E  +RF  F+++LD+I   NK   S    + G+ +F+DL+ +EF
Sbjct: 59  FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADLTWQEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L   HK                       +P  KDWRE GI+  V+
Sbjct: 116 QRTKLGAAQNCSATLKGSHK------------------VTEAALPETKDWREDGIVSPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC+GG      +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP   KD  CK  A +  GV++ + + +  + +E  +   +    PV 
Sbjct: 218 KSNG-GLDTEKAYPYTGKDETCKFSAENV-GVQVLN-SVNITLGAEDELKHAVGLVRPVS 274

Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            A   + +++ Y  GV    +C  +  ++NHAV  VGY
Sbjct: 275 IAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312


>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 95/315 (30%), Positives = 150/315 (47%), Gaps = 35/315 (11%)

Query: 3   DVKNVLFIVALIALCFLAIPVKVSKPNLE--QKLELFSSFQQR-YKKSYSKSEHDIRFKN 59
           D+ ++L  +  +   F +     S+P L   ++ EL+ S   R YK    K E   RF  
Sbjct: 6   DLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGE---RFMI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV-NKHVLMSHHKHHDH 118
           F++++  IE +NK      S + G+ EF+D++ +EF  +    ++ N ++  S     + 
Sbjct: 63  FKENMKFIESVNK--AGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEF 120

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
             N +    +          P   DWRE+G + +V++Q  CG CWAFS V + E  + + 
Sbjct: 121 KINDLSDDDM----------PSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
            G L   S QE++DC  N N GC+GG      D++  N  +   ES+Y  L +   C+ +
Sbjct: 171 TGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGI-SRESDYEYLGEQYTCRSQ 228

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV---IAAVNALTWQYYLGGVIQYNCDG 295
             +   V+I SY    ++P   + L    T  PV   IAA   L  Q+  GG      DG
Sbjct: 229 EKTA-AVQISSY---QVVPEGETSLLQAVTKQPVSIGIAASQDL--QFCAGGTY----DG 278

Query: 296 SLAN-INHAVQIVGY 309
           S A+ INHAV  +GY
Sbjct: 279 SCADRINHAVTAIGY 293


>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
 gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
          Length = 334

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 92/310 (29%), Positives = 145/310 (46%), Gaps = 32/310 (10%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSF---QQRYKKSYSKSEHDIRFKNFEKSL 64
           L +  +++L  L+I V  +  NL       +SF    +++ K+Y   E + +++ F+ ++
Sbjct: 3   LAVFLIVSLVILSINV-CAATNLFSAQTYQTSFLGWMKKHNKAYHHHEFNDKYQTFKDNM 61

Query: 65  DIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
           D I   N    S ES    G+  F+DL+ EE+K  +L  S+N ++            N V
Sbjct: 62  DFIHNWN----SKESDTVLGLNRFADLTNEEYKKTYLGMSINVNL----------RANQV 107

Query: 124 KKRSIT-TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
               +     T P+ I    DWR+ G +  V++Q  CG+CWAF+T    E  H +K G +
Sbjct: 108 PMNGLNFERFTGPSSI----DWRQNGAVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNM 163

Query: 183 SLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
              S Q ++DC+G  GN GC GG   +   ++ ++   +  E  YP       C    T 
Sbjct: 164 VTFSEQHLVDCSGRYGNNGCDGGLMTSAFKYI-IDNDGIATEEAYPYTATQNRCVYNTTM 222

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
             G  I  Y  D    SES++   I+   PV  A++A  +T+Q Y  GV Q     S   
Sbjct: 223 L-GTAISGYK-DVPRGSESALTAAISKQ-PVAVAIDASPITFQLYKSGVYQ-EATCSSYR 278

Query: 300 INHAVQIVGY 309
           +NH V  VGY
Sbjct: 279 LNHGVLAVGY 288


>gi|79331505|ref|NP_001032106.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|332009931|gb|AED97314.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 357

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y    E  +RF  F+++LD+I   NK   S    + G+ +F+DL+ +EF
Sbjct: 59  FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADLTWQEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L   HK                       +P  KDWRE GI+  V+
Sbjct: 116 QRTKLGAAQNCSATLKGSHK------------------VTEAALPETKDWREDGIVSPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC+GG      +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP   KD  CK  A +  GV++ + + +  + +E  +   +    PV 
Sbjct: 218 KSNG-GLDTEKAYPYTGKDETCKFSAENV-GVQVLN-SVNITLGAEDELKHAVGLVRPVS 274

Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            A   + +++ Y  GV    +C  +  ++NHAV  VGY
Sbjct: 275 IAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312


>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 86/285 (30%), Positives = 133/285 (46%), Gaps = 29/285 (10%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F ++Y KSY ++ E+  RF  F K+L  I         P +A +G+T+FSDLSEEEF
Sbjct: 89  FVMFMEKYGKSYPTRKEYLHRFGIFVKNL--IRAAEHQALDP-TAVHGVTQFSDLSEEEF 145

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +           + M               +++        G+P + DWR+ G + +V+ 
Sbjct: 146 E----------RMFMGVRGGAGGEGLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKM 195

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
           Q TCG+CWAFST    E  + +  G L  LS Q+++DC            N GC+GG   
Sbjct: 196 QGTCGSCWAFSTCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMT 255

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
               ++ +    LE ES YP   +   C  ++     VK+ ++T  T+   E+ I   + 
Sbjct: 256 NAYKYL-IQSGGLEEESSYPYTGRSGQCNFQSDKI-AVKVSNFT--TIPIDENQIAAHLV 311

Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGYDN 311
             GP+   +NA+  Q Y+GGV     C      +NH V +VGY +
Sbjct: 312 RSGPLAVGLNAVFMQTYIGGVSCPLICGKRF--VNHGVLMVGYGD 354


>gi|407838603|gb|EKG00105.1| cysteine peptidase, putative,cysteine peptidase, clan CA, family
           C1, cathepsin L-like, putative, partial [Trypanosoma
           cruzi]
          Length = 326

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 85/314 (27%), Positives = 140/314 (44%), Gaps = 24/314 (7%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
           M      L + A++ +    +P   +  + E+ L   F+ F+Q++ + Y S +E   R  
Sbjct: 34  MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYGSAAEEAFRLS 93

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
            F  +L  +  L+    +   A +G+T FSDL+ EEF++R+             H    H
Sbjct: 94  VFRANL-FLARLHA--AANPHATFGVTPFSDLTREEFRSRY-------------HNGAAH 137

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
                ++  +   + +  G P  KDWR  G +  V++Q  CG+CWAFS +   E    L 
Sbjct: 138 FAAAQERARVPVDVEV-VGAPAAKDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLA 196

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR 237
              L+ LS Q ++ C    + GC GG      +W+   N   +  E  YP    +     
Sbjct: 197 GHPLTNLSEQMLVSC-DKTDSGCGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPP 255

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
             TS + V         L   E+ I   +A +GPV  AV+A +W  Y GGV+  +C    
Sbjct: 256 CTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT-SCVSE- 313

Query: 298 ANINHAVQIVGYDN 311
             ++H V +VGY++
Sbjct: 314 -QLDHGVLLVGYND 326


>gi|145334857|ref|NP_001078774.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|332009932|gb|AED97315.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 361

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y    E  +RF  F+++LD+I   NK   S    + G+ +F+DL+ +EF
Sbjct: 59  FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADLTWQEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L   HK                       +P  KDWRE GI+  V+
Sbjct: 116 QRTKLGAAQNCSATLKGSHK------------------VTEAALPETKDWREDGIVSPVK 157

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC+GG      +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP   KD  CK  A +  GV++ + + +  + +E  +   +    PV 
Sbjct: 218 KSNG-GLDTEKAYPYTGKDETCKFSAENV-GVQVLN-SVNITLGAEDELKHAVGLVRPVS 274

Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            A   + +++ Y  GV    +C  +  ++NHAV  VGY
Sbjct: 275 IAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312


>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
 gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
          Length = 489

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 82/288 (28%), Positives = 135/288 (46%), Gaps = 28/288 (9%)

Query: 30  LEQKLELFSSFQQ---RYKKSYSKS--EHDIRFKNFEKSLDIIEELNKNRQSPESARYGI 84
           L+ K    ++FQQ   +Y K+Y+    E + RF  + ++L+ I   N    S       +
Sbjct: 35  LDAKANPMAAFQQWMMQYTKAYANDIKELETRFSVWLENLNYILAYNARTTS---HWLHL 91

Query: 85  TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDW 144
             F+DL+ +EF+ R           + +        N ++             +P + DW
Sbjct: 92  NAFADLTTDEFRNR-----------LGYDFKARQASNRLQSSPFIYDNVDANQLPTEIDW 140

Query: 145 REAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGG 204
           R+ G + +V+NQ  CG+CWAF+T  + E ++A+  G L+ LS QE++DC  + + GCSGG
Sbjct: 141 RKKGAVTEVKNQGQCGSCWAFATTGSVEGINAIVTGELASLSEQELVDCDTDEDRGCSGG 200

Query: 205 DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI-L 263
                  W+ +    L+ E +YP   +D  C     +   V I  Y     IP    + L
Sbjct: 201 LMDYAYQWI-IKNGGLDTEDDYPYTAEDGVCVAAKKNRRVVTIDGY---VDIPENDEVAL 256

Query: 264 TDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
              A H P+  A+  +A ++Q Y GGV  Y+      ++NH V +VGY
Sbjct: 257 KKAAAHQPIAVAIEADAKSFQLYGGGV--YDDPTCGTSLNHGVLVVGY 302


>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
 gi|1582621|prf||2119193B cathepsin L-related Cys protease
          Length = 313

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 80/280 (28%), Positives = 136/280 (48%), Gaps = 37/280 (13%)

Query: 40  FQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSEEEFKT 97
           F+ +Y + Y  ++ ++ R + F+++  ++E  NK  ++ E + +  + +F D++ EEF  
Sbjct: 15  FKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQFGDMTNEEF-- 72

Query: 98  RHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKD--WREAGIIGKVRN 155
                    + +M  +K           R   T +    G P+  D  WR  G +  V++
Sbjct: 73  ---------NAVMKGYKK--------GSRGEPTTVFTAEGRPMAADVDWRTKGAVTPVKD 115

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
           Q  CG+CWAFS   + E  H LKN  L  LS QE++DC+   GN GC GG   +  D++ 
Sbjct: 116 QGQCGSCWAFSATGSLEGQHFLKNNELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYIK 175

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP---SESSILTDIATHGP 271
            N  + + ES YP   +D +C+  A S         TC   +    +E ++   ++  GP
Sbjct: 176 DNGGI-DTESSYPYEAQDRSCRFDANSIGA------TCTGFVEVQHTEEALHEAVSDIGP 228

Query: 272 VIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +  A++A   ++Q+Y  GV  Y    S  N++H V  VGY
Sbjct: 229 ISVAIDASHFSFQFYSSGVY-YEKKCSPTNLDHGVLAVGY 267


>gi|61200410|gb|AAX39778.1| cathepsin R [Mus musculus]
          Length = 335

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 90/307 (29%), Positives = 147/307 (47%), Gaps = 29/307 (9%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
           + A++ + FL + V    P L+  L+  +  ++ +Y KSYS  E  ++   +E+ L +I+
Sbjct: 1   MAAVVFIAFLYLGVASGVPVLDSSLDAEWQDWKIKYNKSYSLKEEKLKRVVWEEKLKMIK 60

Query: 69  ELNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
             N+ N          + EF D ++EEF+   +  SV  H               + KR 
Sbjct: 61  LHNRENSLGKNGFTMKMNEFGDQTDEEFRKMMIEISVWTH----------REGKSIMKRE 110

Query: 128 ITTGITIPTGIPVKKDWR-EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
              G  +P  +    DWR + G +  VR Q  C ACWAF+     E+    + G L+ LS
Sbjct: 111 --AGSILPKFV----DWRTKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLS 164

Query: 187 VQEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
           VQ ++DC+   GN GC GGD      ++ ++   LE E+ YP   KD  C+    +P   
Sbjct: 165 VQNLVDCSKPQGNNGCLGGDTYNAFQYV-LHNGGLESEATYPYEGKDGPCRY---NPKNS 220

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINH 302
           K +     +L  SE  ++  +AT GP+ A ++A   +++ Y GG+  + NC  S   + H
Sbjct: 221 KAEITGFVSLPQSEDILMAAVATIGPITAGIDASHESFKNYKGGIYHEPNC--SSDTVTH 278

Query: 303 AVQIVGY 309
            V +VGY
Sbjct: 279 GVLVVGY 285


>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
 gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 76/260 (29%), Positives = 122/260 (46%), Gaps = 27/260 (10%)

Query: 52  EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS 111
           E + R+  F+++++ IE  N    S    + G+ +F+DL+ EEF+  +  +      LMS
Sbjct: 21  EKEKRYLIFKENIERIEAFNNG--SDRGYKLGVNKFADLTNEEFRAMYHGYKRQSSKLMS 78

Query: 112 HHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
               +++  +                IP   DWR  G +  V++Q TCG CWAFSTV   
Sbjct: 79  SSFRYENLSD----------------IPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAI 122

Query: 172 ESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLK 231
           E +  L+ G L  LS Q+++DC   GN GC GG       ++ +    L  E  YP    
Sbjct: 123 EGIIKLQTGNLISLSEQQLVDCTA-GNKGCQGGLMDTAFQYI-IRNGGLTSEDNYPYQGV 180

Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVI 289
           D  C  +  +    +I  Y  D    +E+++L  +A   PV  AV+     +++Y  GV 
Sbjct: 181 DGTCSSEKAASTEAQITGYE-DVPQNNENALLQAVAKQ-PVSVAVDGGGNDFRFYKSGVF 238

Query: 290 QYNCDGSLANINHAVQIVGY 309
           + +C     N+NH V  +GY
Sbjct: 239 EGDCG---TNLNHGVTAIGY 255


>gi|255543801|ref|XP_002512963.1| cysteine protease, putative [Ricinus communis]
 gi|223547974|gb|EEF49466.1| cysteine protease, putative [Ricinus communis]
          Length = 373

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 92/296 (31%), Positives = 148/296 (50%), Gaps = 39/296 (13%)

Query: 26  SKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGI 84
           + PNL      FS F++++KK+Y S+ EHD RFK F+ +L   E   ++++   +A +G+
Sbjct: 47  ANPNLLGAEHHFSLFKKKFKKTYASQEEHDYRFKIFKSNLRRAE---RHQKLDPTATHGV 103

Query: 85  TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKD 143
           T+FSDL+  EF+ + L   + +  L                +       +PT  +P   D
Sbjct: 104 TQFSDLTHSEFRRQFL--GLRRLRL---------------PKDANEAPMLPTNDLPADFD 146

Query: 144 WREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AG 195
           WRE G +  V+NQ +CG+CW+FST    E  + L  G L  LS Q+++DC         G
Sbjct: 147 WREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATGKLVSLSEQQLVDCDHECDPAEEG 206

Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD-AACKRKATSPNGVKIKSYTCDT 254
             + GC+GG   +  ++  +    L  E +YP    D  AC+   T     K+ +++  +
Sbjct: 207 ACDSGCNGGLMNSAFEYT-LKAGGLMREEDYPYTGTDRGACQFDKTKI-AAKVANFSVVS 264

Query: 255 LIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           L   E  I  ++  +GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 265 L--DEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 315


>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
          Length = 362

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 77/281 (27%), Positives = 138/281 (49%), Gaps = 21/281 (7%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
           E   +L+  ++  +  S S +E   RF  F++++  +   NK     +  +  + +F+D+
Sbjct: 34  ESLWDLYERWRSHHTVSRSLTEKHKRFNVFKENVMHVHNTNK---MDKPYKLKLNKFADM 90

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           +  EF++ +    VN H +    +H +    + K  S+          P   DWR+ G +
Sbjct: 91  TNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVGSV----------PASVDWRKKGAV 140

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
             V++Q  CG+CWAFSTV   E ++ +K   L  LS QE++DC    N GC+GG   +  
Sbjct: 141 TDVKDQGQCGSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAF 200

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           +++   K  +  ES YP   ++  C     +   V I  +  +  +  E+++L  +A   
Sbjct: 201 EFIK-QKGGITTESNYPYTAQEGTCDASKVNDLAVSIDGHE-NVPVNDENALLKAVANQ- 257

Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           PV  A++A    +Q+Y  GV+  +C+    ++NH V IVGY
Sbjct: 258 PVSVAIDAGGSDFQFYSEGVLTGDCN---TDLNHGVAIVGY 295


>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
 gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
 gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
          Length = 345

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 92/306 (30%), Positives = 139/306 (45%), Gaps = 39/306 (12%)

Query: 16  LCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFKNFEKSLDIIEELN 71
            C     ++V+   L+    ++   +Q    Y K Y    E + R K F+++++ IE  N
Sbjct: 17  FCLGLFAIQVTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASN 76

Query: 72  KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
            N  + +  + GI +F+DL+ EEF             + S +K   H  + + K S  T 
Sbjct: 77  -NAGNNKLYKLGINQFADLTNEEF-------------IASRNKFKGHMCSSITKTS--TF 120

Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
                 +P   DWR+ G +  V+NQ  CG CWAFS V   E +H L  G L  LS QE++
Sbjct: 121 KYENASVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELV 180

Query: 192 DCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKATSPNGV 245
           DC   G + GC GG    L+D  D  K +     L  E++YP    D  C     S + V
Sbjct: 181 DCDTKGVDQGCEGG----LMD--DAFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAV 234

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
            I  Y  D    +E ++   +A   P+  A++A    +Q+Y  GV   +C   L   +H 
Sbjct: 235 TITGYE-DVPANNEQALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSCGTEL---DHG 289

Query: 304 VQIVGY 309
           V  VGY
Sbjct: 290 VTAVGY 295


>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 86/285 (30%), Positives = 133/285 (46%), Gaps = 29/285 (10%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F ++Y KSY ++ E+  RF  F K+L  I         P +A +G+T+FSDLSEEEF
Sbjct: 89  FVMFMEKYGKSYPTRKEYLHRFGIFVKNL--IRAAEHQALDP-TAVHGVTQFSDLSEEEF 145

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +           + M               +++        G+P + DWR+ G + +V+ 
Sbjct: 146 E----------RMFMGVRGGAGGEGLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKM 195

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
           Q TCG+CWAFST    E  + +  G L  LS Q+++DC            N GC+GG   
Sbjct: 196 QGTCGSCWAFSTCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMT 255

Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
               ++ +    LE ES YP   +   C  ++     VK+ ++T  T+   E+ I   + 
Sbjct: 256 NAYKYL-IQSGGLEEESSYPYTGRSGQCNFQSDKI-AVKVSNFT--TIPIDENQIAAHLV 311

Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGYDN 311
             GP+   +NA+  Q Y+GGV     C      +NH V +VGY +
Sbjct: 312 RSGPLAVGLNAVFMQTYIGGVSCPLICGKRF--VNHGVLMVGYGD 354


>gi|125547724|gb|EAY93546.1| hypothetical protein OsI_15336 [Oryza sativa Indica Group]
          Length = 348

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 81/262 (30%), Positives = 131/262 (50%), Gaps = 37/262 (14%)

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL---RHSVNKHVLMSHHKHHD 117
           +  LD + EL   R  P +A +G+T+FSDL+  EF+ R L   R S+   V    H+   
Sbjct: 48  DAQLDGLRELRAARLDP-TATHGVTKFSDLTPGEFRDRLLGLRRPSLEGLVGGEPHE--- 103

Query: 118 HHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
                           +PT G+P   DWRE G +G V++Q +CG+CW+FST    E  H 
Sbjct: 104 -------------APILPTDGLPDDFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHF 150

Query: 177 LKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPL 228
           L  G L +LS Q+++DC        +   + GC+GG       ++ +    L+ E +YP 
Sbjct: 151 LATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFSYL-MKSGGLQSEKDYPY 209

Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV 288
             ++  CK    S    ++K+++  ++  +E  I  ++  HGP+  A+NA   Q Y+GGV
Sbjct: 210 AGRENTCKFD-KSKIVAQVKNFSVISV--NEDQIAANLVKHGPLAIAINAAYMQTYIGGV 266

Query: 289 -IQYNCDGSLANINHAVQIVGY 309
              + C     +++H V +VGY
Sbjct: 267 SCPFICG---RHLDHGVLLVGY 285


>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
          Length = 443

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 86/288 (29%), Positives = 135/288 (46%), Gaps = 28/288 (9%)

Query: 28  PNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQ-SPESARYGITE 86
           P L+   +L+ S+   ++K Y + E   R   +EK+L +IE  N +      S + G+ +
Sbjct: 128 PELDGHWQLWKSW---HRKDYHEREEGWRRVVWEKNLKMIEIHNLDHALGKHSYKLGMNQ 184

Query: 87  FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI--PVKKDW 144
           F D++ EEF     R  +N +V           H   +++   +    P  +  P   DW
Sbjct: 185 FGDMTTEEF-----RQLMNGYV-----------HKKSERKYRGSQFLEPNFLEAPRSVDW 228

Query: 145 REAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSG 203
           RE G +  V++Q  CG+CWAFST    E  H  K G L  LS Q ++DC+   GN GC+G
Sbjct: 229 REKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNG 288

Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
           G       ++  N  + + E  YP   KD    R     N      +  D     E +++
Sbjct: 289 GLMDQAFQYVQDNGGI-DSEESYPYTAKDDEDCRYKAEYNAANDTGFV-DIPQGHERALM 346

Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             +A  GPV  A++A   ++Q+Y  G I Y  D S  +++H V +VGY
Sbjct: 347 KAVAAVGPVSVAIDAGHSSFQFYQSG-IYYEPDCSSEDLDHGVLVVGY 393


>gi|195624522|gb|ACG34091.1| thiol protease aleurain precursor [Zea mays]
          Length = 360

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 128/278 (46%), Gaps = 27/278 (9%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY KSY S +E   RF+ F +SL ++   N+   S    R GI  F+D+S EEF
Sbjct: 59  FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLS---YRLGINRFADMSWEEF 115

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L  +H+                       +P  KDWRE GI+  V+
Sbjct: 116 RATRLGAAQNCSATLTGNHRMR----------------AAAVALPETKDWREDGIVSPVK 159

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
           NQ  CG+CW FST    E+ +    G    LS Q++IDC     N GC+GG      +++
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLIDCGFAFNNFGCNGGLPSQAFEYI 219

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP    +  CK K  +  G K+   + +  + +E  +   +    PV 
Sbjct: 220 KYNG-GLDTEESYPYQGVNGICKFKNENV-GFKVLD-SVNITLGAEDELKDAVGLVRPVS 276

Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
            A   +T ++ Y  GV   + C  +  ++NHAV  VGY
Sbjct: 277 VAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGY 314


>gi|403300987|ref|XP_003941193.1| PREDICTED: cathepsin L2 [Saimiri boliviensis boliviensis]
          Length = 333

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 87/309 (28%), Positives = 149/309 (48%), Gaps = 40/309 (12%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           + L A C   + +  + P  +Q L+  +  ++  +++ YS +E   R   +EK++ +IE 
Sbjct: 5   LVLTAFC---LGIASAAPKFDQNLDTQWYQWKATHRRLYSTNEEGWRRAVWEKNMKMIEL 61

Query: 70  LNKNRQSPESARYGIT----EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
            N         ++G T     F D++ EEF+            +M   ++  H +  V +
Sbjct: 62  HNGEY---SRGKHGFTMAMNAFGDMTNEEFRQ-----------VMVCFRNQKHKNGKVFR 107

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
             +   + +P  +    DWR+ G +  V+NQ+ CG+CWAFS     E     K G L  L
Sbjct: 108 GPLL--LDLPKSV----DWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSL 161

Query: 186 SVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           S Q ++DC+   GN GC+GG       ++  N   L+ E+ YP   KD  CK K  +   
Sbjct: 162 SEQNLVDCSRPQGNQGCNGGFMNYAFRYVKENG-GLDSEASYPYEAKDGICKYKPEN--- 217

Query: 245 VKIKSYTCDTLIPS-ESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYNCDGSLANI 300
             + + T   +IP+ E  ++  +AT GP+  AV+A   ++Q+Y  G+  +  C  S  N+
Sbjct: 218 -SVANDTGFVVIPTHEKELMKAVATVGPISVAVDASHSSFQFYKSGIYFEKKC--SSKNL 274

Query: 301 NHAVQIVGY 309
           +H V +VGY
Sbjct: 275 DHGVLVVGY 283


>gi|111073719|dbj|BAF02548.1| triticain gamma [Triticum aestivum]
          Length = 365

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 86/277 (31%), Positives = 129/277 (46%), Gaps = 26/277 (9%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY KSY S +E   RF+ F +SL   EE+    +   S R GI  FSD+S EEF
Sbjct: 64  FARFAVRYGKSYESAAEVRRRFRIFSESL---EEVRSTNRKGLSYRLGINRFSDMSWEEF 120

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +   L  +      ++         NH+ + +          +P  KDWRE GI+  V++
Sbjct: 121 QATRLGAAQTCSATLAG--------NHLMRDA--------AALPETKDWREDGIVSPVKD 164

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
           Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GCSGG      +++ 
Sbjct: 165 QSHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCSGGLPSQAFEYIK 224

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
            N  + + E  YP    +  C  KA   N V     + +  + +E  +   +    PV  
Sbjct: 225 YNGGI-DTEESYPYKGVNGVCHYKA--ENAVVQVLDSVNITLNAEDELKNAVGLVRPVSV 281

Query: 275 AVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
           A   +  ++ Y  GV   + C  +  ++NHAV  VGY
Sbjct: 282 AFEVINGFRQYKSGVYSSDHCGTTPDDVNHAVLAVGY 318


>gi|3097321|dbj|BAA25899.1| Bd 30K [Glycine max]
 gi|84371705|gb|ABC56139.1| 34 kDa maturing seed protein [Glycine max]
 gi|195957142|gb|ACG59282.1| major allergen Gly m Bd 30K [Glycine max]
 gi|223452512|gb|ACM89583.1| maturing seed protein [Glycine max]
 gi|226432468|gb|ACO55749.1| Gly m Bd 30K allergen [Glycine max]
 gi|320090153|gb|ADW08728.1| P34 allergen [Glycine max]
 gi|320090155|gb|ADW08729.1| P34 allergen [Glycine max]
 gi|320090157|gb|ADW08730.1| P34 allergen [Glycine max]
 gi|320090159|gb|ADW08731.1| P34 allergen [Glycine max]
          Length = 379

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 81/284 (28%), Positives = 137/284 (48%), Gaps = 28/284 (9%)

Query: 36  LFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           LF  ++  + + Y   E +  R + F+ +L+ I ++N NR+SP S R G+ +F+D++ +E
Sbjct: 43  LFQLWKSEHGRVYHNHEEEAKRLEIFKNNLNYIRDMNANRKSPHSHRLGLNKFADITPQE 102

Query: 95  FKTRHLR--HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           F  ++L+    V++ + M++ K        +KK   +         P   DWR+ G+I +
Sbjct: 103 FSKKYLQAPKDVSQQIKMANKK--------MKKEQYSCDHP-----PASWDWRKKGVITQ 149

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+ Q  CG+ WAFS     E+ HA+  G L  LS QE++DC      GC  G      +W
Sbjct: 150 VKYQGGCGSGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEESE-GCYNGWHYQSFEW 208

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT---- 268
           + +    +  + +YP   K+  CK      + V I  Y  +TLI S+ S  ++       
Sbjct: 209 V-LEHGGIATDDDYPYRAKEGRCKANKIQ-DKVTIDGY--ETLIMSDESTESETEQAFLS 264

Query: 269 ---HGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
                P+  +++A  +  Y GG+       S   INH V +VGY
Sbjct: 265 AILEQPISVSIDAKDFHLYTGGIYDGENCTSPYGINHFVLLVGY 308


>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 83/289 (28%), Positives = 137/289 (47%), Gaps = 40/289 (13%)

Query: 32  QKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
           + + ++  +   + K+Y+   E + RF+ F+ +L  ++E N       S R G+  F+DL
Sbjct: 42  EAMAIYEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHNA---VAGSYRVGLNRFADL 98

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT-----GITIPTGIPVKKDWR 145
           + EE+++  L  ++                  +K+RS +T            +P   DWR
Sbjct: 99  TNEEYRSMFLGGNM-----------------EMKERSASTKSDRYAFRAGDKLPGSVDWR 141

Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
           E G +  V++Q  CG+CWAFST+   E ++ +  G L  LS QE++DC  + NMGC+GG 
Sbjct: 142 EKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISLSEQELVDCDKSYNMGCNGG- 200

Query: 206 FCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSI 262
              L+D+     +N   ++ E +YP    D  C +   +   V I  Y  D     E+S+
Sbjct: 201 ---LMDYGFQFIINNGGIDTEEDYPYRAVDGTCDQFRKNARVVSINGYE-DVPEDDENSL 256

Query: 263 LTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
              +A   PV  A+ A    +Q Y  GV   +C     N++H V  VGY
Sbjct: 257 KKAVANQ-PVSVAIEAGGRAFQLYESGVFTGHCG---TNLDHGVVAVGY 301


>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
          Length = 465

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 79/278 (28%), Positives = 130/278 (46%), Gaps = 22/278 (7%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           L+  +   + ++Y+   E D RF+ F  +L  ++  N+ R +    R G+ +F+DL+ +E
Sbjct: 51  LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNE-RAAEHGFRLGMNQFADLTNDE 109

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F+  +L   +             + H    +            +P   DWRE G +  V+
Sbjct: 110 FRAAYLGARIPASRRRGTAVGERYRHGGGAEE-----------LPESVDWREKGAVAPVK 158

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           NQ  CG+CWAFS V + ES++ +  G +  LS QE+++C+ + GN GC+GG   A  D++
Sbjct: 159 NQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFI 218

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
            +    ++ E +YP    D  C     +   V I  +  D     E S+   +A H PV 
Sbjct: 219 -IKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFE-DVPENDEKSLQKAVA-HQPVS 275

Query: 274 AAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            A+ A    +Q Y  GV    C     N++H V  VGY
Sbjct: 276 VAIEAGGREFQLYKAGVFTGTC---TTNLDHGVVAVGY 310


>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
 gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
          Length = 356

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 85/279 (30%), Positives = 132/279 (47%), Gaps = 31/279 (11%)

Query: 37  FSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y  +E   +RF  F +SL++I+  NK   S    + G+ +F+D + EEF
Sbjct: 57  FARFAHRYGKKYETAEEMKLRFGIFLESLELIKSTNKQGLS---YKLGVNQFADWTWEEF 113

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N        HK  D                  T +P  KDWR+ GI+  V+
Sbjct: 114 RKHRLGAAQNCSATTKGSHKLTD------------------TALPESKDWRKDGIVSPVK 155

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST    E+ +A  +G    LS Q+++DC  G  N GC+GG      +++
Sbjct: 156 DQGHCGSCWTFSTTGALEAAYAQAHGKGISLSEQQLVDCGRGFNNFGCNGGLPSQAFEYI 215

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
             N   L+ E  YP    D +CK     P  V ++   + +  + +E  +   +A   PV
Sbjct: 216 KYNG-GLDTEEAYPYTGVDGSCK---FVPENVGVQVIDSVNITLGAEDELKHAVAFVRPV 271

Query: 273 IAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
             A   ++ ++ Y  GV   N C  +  ++NHAV  VGY
Sbjct: 272 SVAFEVVSGFRLYSKGVYTSNSCGSTPMDVNHAVLAVGY 310


>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
          Length = 359

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 87/307 (28%), Positives = 144/307 (46%), Gaps = 35/307 (11%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELN 71
             +L  L++ +  S  + E+ + ++  +  ++ K Y+   E D RF+ F+ +L  I+E N
Sbjct: 11  FFSLITLSLAMDTSMRSNEEVMTMYEEWLVKHHKVYNGLGEKDQRFEIFKDNLGFIDEHN 70

Query: 72  KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
               +    + G+ +F+D + EE++  +L               +D   N V K  ITTG
Sbjct: 71  AQNYT---YKVGLNKFADTTNEEYRNMYL------------GTKNDAKRN-VMKIKITTG 114

Query: 132 --ITIPTG--IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
                 +G  +PV  DWR  G +  +++Q +CG+CWAFST+ T E+++ +  G L  LS 
Sbjct: 115 HRYAFNSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSE 174

Query: 188 QEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           QE++DC    N GC+GG    L+D+     V    ++ E +YP    +  C     +   
Sbjct: 175 QELVDCDRAFNEGCNGG----LMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKV 230

Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINH 302
           V I  Y  + +     + L     H PV  A+ A     Q Y  GV    C     N++H
Sbjct: 231 VSIDGY--EDVPAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCG---TNLDH 285

Query: 303 AVQIVGY 309
            V +VGY
Sbjct: 286 GVVVVGY 292


>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
          Length = 459

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 82/291 (28%), Positives = 135/291 (46%), Gaps = 24/291 (8%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSP-ESA 80
           V   + + E+   L++ ++  + KSY+   E + R+  F  +L  I+E N    +   S 
Sbjct: 27  VSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSF 86

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
           R G+  F+DL+ EE++  +L             ++       V  R +         +P 
Sbjct: 87  RLGLNRFADLTNEEYRDTYL-----------GLRNKPRRERKVSDRYLAAD---NEALPE 132

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
             DWR  G + ++++Q  CG+CWAFS +   E ++ +  G L  LS QE++DC  + N G
Sbjct: 133 SVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEG 192

Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
           C+GG      D++ +N   ++ E +YP   KD  C     +   V I SY  D    SE+
Sbjct: 193 CNGGLMDYAFDFI-INNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYE-DVTPNSET 250

Query: 261 SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           S+   +A   PV  A+ A    +Q Y  G+    C  +L   +H V  VGY
Sbjct: 251 SLQKAVANQ-PVSVAIEAGGRAFQLYSSGIFTGKCGTAL---DHGVAAVGY 297


>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
          Length = 340

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 89/282 (31%), Positives = 139/282 (49%), Gaps = 33/282 (11%)

Query: 44  YKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARY--GITEFSDLSEEEFKTRHL 100
           Y ++Y   +E + RFK F+++++ IE +N    S  + RY   I EF+D + EEFK    
Sbjct: 43  YGRTYKDIAEKERRFKIFKENVEYIESVN----SAGNRRYKLSINEFADQTNEEFKAS-- 96

Query: 101 RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCG 160
           R+  N        +     + +V              +P   DWR+ G +  +++Q  CG
Sbjct: 97  RNGYNMSSRPRSSEITSFRYENV------------AAVPSSMDWRKKGAVTPIKDQGQCG 144

Query: 161 ACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV 219
            CWAFS V   E +  LK G L  LS QE++DC  +G + GC GG   +  +++ +    
Sbjct: 145 CCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEFI-IGNGG 203

Query: 220 LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA- 278
           L  E+ YP    DA C +K  + +  KIK+Y  D    SE+++L  +A H PV  A++A 
Sbjct: 204 LTTEANYPYKGVDATCNKKKAASSAAKIKNYE-DVPANSEAALLKAVAQH-PVSVAIDAG 261

Query: 279 -LTWQYYLGGVIQYNCDGSLANINHAVQIVGY---DNYSRTW 316
              +Q+Y  GV    C   L   +H V  VGY   D+ ++ W
Sbjct: 262 GSDFQFYSSGVFTGQCGTEL---DHGVTAVGYGKTDDGTKYW 300


>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
 gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
          Length = 462

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 79/278 (28%), Positives = 130/278 (46%), Gaps = 22/278 (7%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           L+  +   + ++Y+   E D RF+ F  +L  ++  N+ R +    R G+ +F+DL+ +E
Sbjct: 48  LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNE-RAAEHGFRLGMNQFADLTNDE 106

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F+  +L   +             + H    +            +P   DWRE G +  V+
Sbjct: 107 FRAAYLGARIPAARRRGTAVGERYRHGGGAEE-----------LPESVDWREKGAVAPVK 155

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           NQ  CG+CWAFS V + ES++ +  G +  LS QE+++C+ + GN GC+GG   A  D++
Sbjct: 156 NQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFI 215

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
            +    ++ E +YP    D  C     +   V I  +  D     E S+   +A H PV 
Sbjct: 216 -IKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFE-DVPENDEKSLQKAVA-HQPVS 272

Query: 274 AAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            A+ A    +Q Y  GV    C     N++H V  VGY
Sbjct: 273 VAIEAGGREFQLYKAGVFSGTC---TTNLDHGVVAVGY 307


>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
          Length = 358

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 84/283 (29%), Positives = 144/283 (50%), Gaps = 20/283 (7%)

Query: 36  LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSEE 93
           ++++F+ ++ KSY +K E  +RF+ F  +  +IE+ N   ++ + S    + +F+D++  
Sbjct: 42  VWTNFKLKHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNA 101

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           EF+ R     +N   L +  K        +K+  +   +     IP   DWR+ G + KV
Sbjct: 102 EFRQR-----MNGFKLPAKRKLAKSQP--LKEDGMIFEMPDNVTIPDSVDWRKEGYVTKV 154

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDW 212
           ++Q +CG+CWAFS   + E  H  + G L  LS Q ++DC  NG + GC+GG       +
Sbjct: 155 KDQGSCGSCWAFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQY 214

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD--IATHG 270
           ++ NK + + E+ YP   +D  C+ K+           T    IP  +  L +  IAT G
Sbjct: 215 VETNKGI-DTEASYPYKGRDGRCRFKSEDVGATD----TGFVDIPEGNETLLEAAIATVG 269

Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
           PV  A++A    +Q+Y  GV  Y+   S   ++H V  VGY++
Sbjct: 270 PVSVAIDAASFKFQFYSHGVY-YDRSCSPEYLDHGVLAVGYNS 311


>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
 gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
          Length = 458

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 82/291 (28%), Positives = 135/291 (46%), Gaps = 24/291 (8%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSP-ESA 80
           V   + + E+   L++ ++  + KSY+   E + R+  F  +L  I+E N    +   S 
Sbjct: 26  VSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSF 85

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
           R G+  F+DL+ EE++  +L             ++       V  R +         +P 
Sbjct: 86  RLGLNRFADLTNEEYRDTYL-----------GLRNKPRRERKVSDRYLAAD---NEALPE 131

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
             DWR  G + ++++Q  CG+CWAFS +   E ++ +  G L  LS QE++DC  + N G
Sbjct: 132 SVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEG 191

Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
           C+GG      D++ +N   ++ E +YP   KD  C     +   V I SY  D    SE+
Sbjct: 192 CNGGLMDYAFDFI-INNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYE-DVTPNSET 249

Query: 261 SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           S+   +A   PV  A+ A    +Q Y  G+    C  +L   +H V  VGY
Sbjct: 250 SLQKAVANQ-PVSVAIEAGGRAFQLYSSGIFTGKCGTAL---DHGVAAVGY 296


>gi|118365718|ref|XP_001016079.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297846|gb|EAR95834.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 336

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 92/312 (29%), Positives = 156/312 (50%), Gaps = 29/312 (9%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN--FEKSL 64
           +L I+ L+ LCF A  + V      +KL  ++ +  ++++ Y  +EH+  F+   F +++
Sbjct: 6   LLSIIMLMPLCF-AQDISV------EKLLAYNKWSSQHQRVY-LNEHEKLFRQMVFFENM 57

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHV 123
             I+E N +  +  S    + +FSD+++EEF  + L +  +  H +   ++   H  ++ 
Sbjct: 58  QKIQEHNSDPNNTYSTH--LNQFSDMTKEEFVEKILMKQDLVDHFMKGINQETTHSDSNN 115

Query: 124 KKRSITT-GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
           K+  + +  +T+   I    DWR  G +  V+NQ  CG+CW FS     ES + +KN  L
Sbjct: 116 KETQLNSKSLTLADSI----DWRTKGAVTSVKNQGDCGSCWTFSAAGLMESFNFIKNNVL 171

Query: 183 SLLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
              S Q+++DC     G  + GCSGG     L++   +K+ +    +YP +     C+  
Sbjct: 172 VDFSEQQLLDCVYFTRGYNSYGCSGGWPDQCLNY--ASKIGITTLDKYPYVGVMTNCRGS 229

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
            T+ NG K KS+     IP+ S+ L       PV   V+A T   Y  G+    CD S  
Sbjct: 230 GTN-NGFKPKSW---IQIPNTSNDLKSALNFSPVSVLVDASTLGIYKSGIFN-GCDQSNI 284

Query: 299 NINHAVQIVGYD 310
           ++NHAV  VGYD
Sbjct: 285 SLNHAVLAVGYD 296


>gi|218202220|gb|EEC84647.1| hypothetical protein OsI_31538 [Oryza sativa Indica Group]
          Length = 363

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 85/279 (30%), Positives = 129/279 (46%), Gaps = 30/279 (10%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  R+ K Y   +E   RF+ F +SL+++   N+ R  P   R GI  F+D+S EEF
Sbjct: 63  FARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNR-RGLPY--RLGINRFADMSWEEF 119

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L  +H+  D                    +P  KDWRE GI+  V+
Sbjct: 120 QASRLGAAQNCSATLAGNHRMRD-----------------AAALPETKDWREDGIVSPVK 162

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST  + E+ +    G    LS Q+++DCA    N GCSGG      +++
Sbjct: 163 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 222

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
             N   L+ E  YP    +  C  K   P  V +K   + +  + +E  +   +    PV
Sbjct: 223 KYNG-GLDTEEAYPYTGVNGICHYK---PENVGVKVLDSVNITLGAEDELKNAVGLVRPV 278

Query: 273 IAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
             A   +  ++ Y  GV   + C  S  ++NHAV  VGY
Sbjct: 279 SVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGY 317


>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 86/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)

Query: 25  VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
           ++ P  +Q     +  ++  +++ Y  +E + R   +EK++ +I+  N    + +     
Sbjct: 16  LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75

Query: 83  GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
            +  F D++ EEF     R  VN +           H  H K R     + +   IP   
Sbjct: 76  EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118

Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
           DWRE G +  V+NQ  CG+CWAFS     E    LK G L  LS Q ++DC+   GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
           +GG       ++  N   L+ E  YP   KD +CK +A       + + T    IP  E 
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANGTGFVDIPQQEK 233

Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +++  +AT GP+  A++A   + Q+Y  G I Y  + S  N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283


>gi|62945374|ref|NP_001017509.1| uncharacterized protein LOC498688 precursor [Rattus norvegicus]
 gi|60552853|gb|AAH91563.1| Similar to cathepsin R [Rattus norvegicus]
 gi|149039732|gb|EDL93848.1| similar to cathepsin R [Rattus norvegicus]
          Length = 334

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 87/306 (28%), Positives = 147/306 (48%), Gaps = 28/306 (9%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
           +   + +  L + V    P L+  L+  +  ++++Y KSYS  E ++R   +E++L +I+
Sbjct: 1   MTPAVFIAILCLGVASGAPILDPSLDAEWQEWKKKYDKSYSLEEEELRRAVWEENLKMIK 60

Query: 69  ELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
             N +N          I EF D + EEF+   +   V  H               + KR+
Sbjct: 61  LHNGENGLGKNGFTMEINEFGDTTGEEFRKMMVEFPVQTH----------REGKSIMKRA 110

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
              G   P  +    DWR+ G +  VR Q  C ACWAFS     E+    ++G L  LSV
Sbjct: 111 --AGSIFPKFV----DWRKKGYVTPVRRQGNCNACWAFSVTGAIEAQTIWQSGKLIPLSV 164

Query: 188 QEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
           Q ++DC+   GN GC GGD      ++ ++   L+ E+ YP   KD  C+    + +  +
Sbjct: 165 QNLVDCSKPQGNNGCLGGDTYNAFQYV-LHNGGLQSEATYPYEGKDGPCRYNPKN-SSAE 222

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHA 303
           I  +   +L  SE  ++  +AT GP+ A ++A   ++++Y  G+  + NC  S  ++ H 
Sbjct: 223 ITGFV--SLPESEDILMVAVATIGPISAGIDASHESFKFYKKGIYHEPNC--SSNSVTHG 278

Query: 304 VQIVGY 309
           V +VGY
Sbjct: 279 VLVVGY 284


>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
          Length = 473

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 92/298 (30%), Positives = 144/298 (48%), Gaps = 34/298 (11%)

Query: 20  AIPVKVSKPNLE--QKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQS 76
           A+P+  SKP  E  + L +F +F   Y ++YS + E + R + F++++   + L    Q 
Sbjct: 156 AVPLTHSKPMKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQG 215

Query: 77  PESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
             SA YGIT+FSDL+E+EF+  +L   +++  L               K+ +   I    
Sbjct: 216 --SAEYGITKFSDLTEDEFRMMYLNPMLSQWSL---------------KKEMKPAIPASA 258

Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN 196
             P   DWR+ G +  V+NQ  CG+CWAFS     E     K G L  LS QE++DC   
Sbjct: 259 PAPDTWDWRDHGAVSPVKNQGMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQELVDC-DK 317

Query: 197 GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT-- 254
            +  C GG      + ++ N   LE E++Y       +C          K+ +Y   +  
Sbjct: 318 LDQACGGGLPSNAYEAIE-NLGGLETETDYSYTGHKQSCDFSTG-----KVAAYINSSVE 371

Query: 255 LIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
           L   E  I   +A +GPV AA+NA   Q+Y  GV   ++  C+  +  I+HAV +VG+
Sbjct: 372 LPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLLVGF 427


>gi|4574304|gb|AAD23996.1|AF112566_1 cathepsin [Fasciola gigantica]
          Length = 326

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 85/305 (27%), Positives = 145/305 (47%), Gaps = 34/305 (11%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
           LFI+A++ +  L               +L+  +++ Y K Y+ ++ + R   +E+++  I
Sbjct: 3   LFILAVLTVGVLG-----------SNDDLWHQWKRMYNKEYNGADDEHRRNIWEENVKHI 51

Query: 68  EELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           +E N ++     +   G+ +F+D++ EEFK ++L        ++SH   ++ ++  V   
Sbjct: 52  QEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAKYLTEMPRASDILSHGIPYEANNRAV--- 108

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
                       P K DWRE+G + ++++Q  CG+CWAFST  T E  +     T    S
Sbjct: 109 ------------PDKIDWRESGYVTELKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFS 156

Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
            Q+++DC+G  GNMGCSGG      +++   +  LE ES YP    +  C+         
Sbjct: 157 EQQLVDCSGPWGNMGCSGGLMENAYEYL--KQFGLETESSYPYTAVEGQCRYNRQLGVAK 214

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVIQYNCDGSLANINHAV 304
               YT  +   SE  +   +   GP   AV+  + +  Y GG+ Q     SL  +NHAV
Sbjct: 215 VTDYYTVHS--GSEVELKNLVGAEGPAAVAVDVESDFMMYSGGIYQSRTCSSL-RVNHAV 271

Query: 305 QIVGY 309
             VGY
Sbjct: 272 LAVGY 276


>gi|115479391|ref|NP_001063289.1| Os09g0442300 [Oryza sativa Japonica Group]
 gi|115510968|sp|P25778.2|ORYC_ORYSJ RecName: Full=Oryzain gamma chain; Flags: Precursor
 gi|51535997|dbj|BAD38077.1| putative oryzain gamma chain precursor [Oryza sativa Japonica
           Group]
 gi|113631522|dbj|BAF25203.1| Os09g0442300 [Oryza sativa Japonica Group]
 gi|215694919|dbj|BAG90110.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 362

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 85/279 (30%), Positives = 129/279 (46%), Gaps = 30/279 (10%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  R+ K Y   +E   RF+ F +SL+++   N+ R  P   R GI  F+D+S EEF
Sbjct: 62  FARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNR-RGLPY--RLGINRFADMSWEEF 118

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L  +H+  D                    +P  KDWRE GI+  V+
Sbjct: 119 QASRLGAAQNCSATLAGNHRMRD-----------------AAALPETKDWREDGIVSPVK 161

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST  + E+ +    G    LS Q+++DCA    N GCSGG      +++
Sbjct: 162 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 221

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
             N   L+ E  YP    +  C  K   P  V +K   + +  + +E  +   +    PV
Sbjct: 222 KYNG-GLDTEEAYPYTGVNGICHYK---PENVGVKVLDSVNITLGAEDELKNAVGLVRPV 277

Query: 273 IAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
             A   +  ++ Y  GV   + C  S  ++NHAV  VGY
Sbjct: 278 SVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGY 316


>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
 gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
          Length = 455

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 80/263 (30%), Positives = 129/263 (49%), Gaps = 31/263 (11%)

Query: 52  EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS 111
           E D RF+ F+ +L  I++ NK   S    R G+T F+DL+ +E+++++L   + K     
Sbjct: 61  EKDRRFEIFKDNLRFIDDHNKKNLS---YRLGLTRFADLTNDEYRSKYLGAKMEKKGERR 117

Query: 112 HHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
             + ++             G  +P  I    DWR+ G + +V++Q +CG+CWAFST+   
Sbjct: 118 TSQRYEAR----------VGDELPESI----DWRKKGAVAEVKDQGSCGSCWAFSTIGAV 163

Query: 172 ESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPL 228
           E ++ +  G L  LS QE++DC  + N GC+GG    L+D+     +    ++ + +YP 
Sbjct: 164 EGINQIVTGDLITLSEQELVDCDTSYNEGCNGG----LMDYAFEFIIKNGGIDTDKDYPY 219

Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLG 286
              D  C +   +   V I SY  D    SE S+   +A H PV  A+ A    +Q Y  
Sbjct: 220 KGVDGTCDQIRKNAKVVTIDSYE-DVPTYSEESLKKAVA-HQPVSVAIEAGGRAFQLYDS 277

Query: 287 GVIQYNCDGSLANINHAVQIVGY 309
           G+    C   L   +H V  VGY
Sbjct: 278 GIFDGTCGTQL---DHGVVAVGY 297


>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
 gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
          Length = 473

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 92/298 (30%), Positives = 144/298 (48%), Gaps = 34/298 (11%)

Query: 20  AIPVKVSKPNLE--QKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQS 76
           A+P+  SKP  E  + L +F +F   Y ++YS + E + R + F++++   + L    Q 
Sbjct: 156 AVPLTHSKPMKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQG 215

Query: 77  PESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
             SA YGIT+FSDL+E+EF+  +L   +++  L               K+ +   I    
Sbjct: 216 --SAEYGITKFSDLTEDEFRMMYLNPMLSQWSL---------------KKEMKPAIPASA 258

Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN 196
             P   DWR+ G +  V+NQ  CG+CWAFS     E     K G L  LS QE++DC   
Sbjct: 259 PAPDTWDWRDHGAVSPVKNQGMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQELVDC-DK 317

Query: 197 GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT-- 254
            +  C GG      + ++ N   LE E++Y       +C          K+ +Y   +  
Sbjct: 318 LDQACGGGLPSNAYEAIE-NLGGLETETDYSYTGHKQSCDFSTG-----KVAAYINSSVE 371

Query: 255 LIPSESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
           L   E  I   +A +GPV AA+NA   Q+Y  GV   ++  C+  +  I+HAV +VG+
Sbjct: 372 LPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVSHPLKIFCNPWM--IDHAVLLVGF 427


>gi|27960477|gb|AAO27843.1|AF456459_1 cathepsin R [Rattus norvegicus]
          Length = 334

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 90/306 (29%), Positives = 145/306 (47%), Gaps = 28/306 (9%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
           +   + +  L + V    P L+  L+  +  ++++Y KSYS  E ++R   +E++L +I+
Sbjct: 1   MTPAVFIAILCLGVASGAPILDPSLDAEWQEWKKKYDKSYSLEEEELRRAVWEENLKMIK 60

Query: 69  ELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
             N +N          I EF D + EEF+   +   V  H               + KR+
Sbjct: 61  LHNGENGLGKNGFTMEINEFGDTTGEEFRKMMVEFPVQTH----------REGKSIMKRA 110

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
              G   P  +    DWR+ G +  VR Q  C ACWAFS     E+    + G L  LSV
Sbjct: 111 --AGSIFPKFV----DWRKKGYVTPVRRQGNCNACWAFSVTGAIEAQTIWQTGKLIPLSV 164

Query: 188 QEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
           Q ++DC+   GN GC  GD     +++ +N   LE E+ YP   K+  C+    +P   K
Sbjct: 165 QNLVDCSKSQGNEGCQWGDPHIAYEYV-LNNGGLEAEATYPYKGKEGVCRY---NPKHSK 220

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHA 303
            +     +L  SE  ++  +AT GP+  AV+A   ++ +Y  G+  + NC  S   +NH+
Sbjct: 221 AEITGFVSLPESEDILMEAVATIGPISVAVDASFNSFGFYKKGLYDEPNC--SNNTVNHS 278

Query: 304 VQIVGY 309
           V +VGY
Sbjct: 279 VLVVGY 284


>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 323

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 84/276 (30%), Positives = 135/276 (48%), Gaps = 26/276 (9%)

Query: 40  FQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSEEEFKT- 97
           +++ + KSY   E   R + F KS+  I   N ++     + R G+ +F+D++ EEF+  
Sbjct: 22  YKKVHGKSYGHDEEHFRRQLFYKSVAKINAHNLRHDLGLTTYRMGLNKFTDMTSEEFRNF 81

Query: 98  RHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQ 157
           + L+    K              N  + +    G  +PT +    DWRE G +  V+NQ 
Sbjct: 82  KGLKFDATKT-----------KRNGTRFQKELLGEALPTQV----DWREKGYVTPVKNQG 126

Query: 158 TCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG-NGNMGCSGGDFCALLDWMDVN 216
            CG+CWAFST  + E  H    G L  LS Q ++DC+   GN GC+GG       ++  N
Sbjct: 127 QCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTYIQQN 186

Query: 217 KVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV 276
             + + E  YP   KD  C     S  G ++K +  D     E+++   +A+ GPV  A+
Sbjct: 187 GGI-DTEESYPYTGKDGDCAFNENSV-GARVKGFV-DVPQRDEAALQAAVASVGPVSVAI 243

Query: 277 NAL--TWQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
           +A   ++QYY  GV  + +C  S + ++H V +VGY
Sbjct: 244 DASNDSFQYYKEGVYDEPSC--SFSQLDHGVLVVGY 277


>gi|47227478|emb|CAG04626.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 175

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 57/144 (39%), Positives = 80/144 (55%), Gaps = 21/144 (14%)

Query: 70  LNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSIT 129
           LN     P+SA+YGI +FSDLSE EFK  +LR S ++  + +  K               
Sbjct: 11  LNSFSTEPQSAKYGINQFSDLSEREFKDLYLRASADRAPVFTGQKIK------------- 57

Query: 130 TGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQE 189
                  G+P + DWR+  ++G V+NQQ CG+CWAFS V   +S+HA+ +  L  LSVQ+
Sbjct: 58  -------GLPARFDWRDNAVVGPVQNQQACGSCWAFSVVGAVQSVHAIGSSPLVELSVQQ 110

Query: 190 VIDCAGNGNMGCSGGDFCALLDWM 213
           V+DC+   N GC GG     L W+
Sbjct: 111 VLDCSFQNN-GCDGGTPINALKWL 133


>gi|328870624|gb|EGG18997.1| cysteine proteinase [Dictyostelium fasciculatum]
          Length = 521

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 89/306 (29%), Positives = 145/306 (47%), Gaps = 26/306 (8%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
           F+   +A+ F+A+    +    +Q  + F+++  +  +SY  +E   RF  F+K++D + 
Sbjct: 4   FLFVCLAV-FMALQAANAAFTEKQYRDAFTNWMIKNDRSYQSAEFGNRFNVFKKNMDYVN 62

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
           E N   +  E+    +T F+D+S EE++  +L   ++    +        ++N       
Sbjct: 63  EWNS--KGSETV-LDLTIFADISNEEYQRIYLGTKIDATQKLIDAARITMNNNFAAAPVF 119

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
              +          DWR+ G +  ++NQ  CG+CW+FST  + E  H L  G L  LS Q
Sbjct: 120 NATV----------DWRQKGAVTPIKNQGQCGSCWSFSTTGSTEGAHFLSTGNLVSLSEQ 169

Query: 189 EVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN--GV 245
            ++DC+G  GN GC+GG       ++  NK + + ES YP       C   A +P   G 
Sbjct: 170 NLVDCSGPEGNDGCNGGLMDQAFTYIIKNKGI-DTESSYPYKAVQGKC---AFNPKNIGA 225

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHA 303
            +  YT D    SES  L   A  GPV  A++A   ++Q Y  GV  Y    S   ++H 
Sbjct: 226 TLTGYT-DVKSGSESD-LEAKANTGPVSVAIDASHNSFQLYGSGVY-YEPKCSATQLDHG 282

Query: 304 VQIVGY 309
           V +VGY
Sbjct: 283 VLVVGY 288


>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 86/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)

Query: 25  VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
           ++ P  +Q     +  ++  +++ Y  +E + R   +EK++ +I+  N    + +     
Sbjct: 16  LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75

Query: 83  GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
            +  F D++ EEF     R  VN +           H  H K R     + +   IP   
Sbjct: 76  EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118

Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
           DWRE G +  V+NQ  CG+CWAFS     E    LK G L  LS Q ++DC+   GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
           +GG       ++  N   L+ E  YP   KD +CK +A       + + T    IP  E 
Sbjct: 179 NGGLMDYAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233

Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +++  +AT GP+  A++A   + Q+Y  G I Y  + S  N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283


>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 86/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)

Query: 25  VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
           ++ P  +Q     +  ++  +++ Y  +E + R   +EK++ +I+  N    + +     
Sbjct: 16  LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75

Query: 83  GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
            +  F D++ EEF     R  VN +           H  H K R     + +   IP   
Sbjct: 76  EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118

Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
           DWRE G +  V+NQ  CG+CWAFS     E    LK G L  LS Q ++DC+   GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
           +GG       ++  N   L+ E  YP   KD +CK +A       + + T    IP  E 
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEE 233

Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +++  +AT GP+  A++A   + Q+Y  G I Y  + S  N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283


>gi|351724281|ref|NP_001237820.1| cysteine protease-like precursor [Glycine max]
 gi|149393486|gb|ABR26679.1| putative cysteine protease [Glycine max]
          Length = 355

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 90/278 (32%), Positives = 133/278 (47%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  R+ KSY S+ E   R++ F ++L  I   NKNR  P +    +  F+D + EEF
Sbjct: 55  FARFMSRFGKSYRSEEEMRERYEIFSQNLRFIRSHNKNRL-PYT--LSVNHFADWTWEEF 111

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           K   L  + N    L  +HK             +T  +  PT     KDWR+ GI+  V+
Sbjct: 112 KRHRLGAAQNCSATLNGNHK-------------LTDAVLPPT-----KDWRKEGIVSDVK 153

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q +CG+CW FST    E+  A   G    LS Q+++DCAG   N GC+GG      +++
Sbjct: 154 DQGSCGSCWTFSTTGALEAACAQAFGKSISLSEQQLVDCAGRFNNFGCNGGLPSQAFEYI 213

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   LE E  YP   KD  CK  A +     I S   +  + +E+ +   +A   PV 
Sbjct: 214 KYNG-GLETEEAYPYTGKDGVCKFSAENVAVQVIDS--VNITLGAENELKHAVAFVRPVS 270

Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
            A   +  + +Y  GV   + C  +  ++NHAV  VGY
Sbjct: 271 VAFQVVNGFHFYENGVYTSDICGSTSQDVNHAVLAVGY 308


>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
 gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
          Length = 463

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 88/299 (29%), Positives = 146/299 (48%), Gaps = 41/299 (13%)

Query: 21  IPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIR----FKNFEKSLDIIEELNKNRQS 76
           +P    +  + + L LF  F   Y K YS  E   R    F    K   +I+E+++    
Sbjct: 150 VPSSELEDEMLKTLTLFKDFVTTYNKKYSDQEEAARRLQIFSQNLKKAQMIQEMDQG--- 206

Query: 77  PESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
             +A YG+T++SDL+E+EF++ +L   ++   L            +  K++I   ++ P 
Sbjct: 207 --TAEYGVTKYSDLTEDEFRSLYLNPLLSSKPL------------YQMKKAIVPNMSAPD 252

Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN 196
               + DWR+ G + +V+NQ  CG+CWAFS +   E    LK G+L  LS QE++DC G 
Sbjct: 253 ----QWDWRDHGAVTEVKNQGMCGSCWAFSVIGNIEGQWFLKKGSLVSLSEQELVDCDGV 308

Query: 197 GNMGCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
            +  C+GG      +   + K+  +E E EY        C    +     K+ +Y   ++
Sbjct: 309 -DHACAGGLPSNAYE--AIEKLGGIETEQEYSYEGHKNTCSFSTS-----KVSAYINSSV 360

Query: 256 -IP-SESSILTDIATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
            IP  E+ I   +A +GP+  A+NA   Q+Y  G+    +  C+  +  I+HAV +VGY
Sbjct: 361 EIPKDENEIAAWLAQNGPISIALNAFAMQFYRKGISHPFRILCNPWM--IDHAVLLVGY 417


>gi|149392541|gb|ABR26073.1| oryzain gamma chain precursor [Oryza sativa Indica Group]
          Length = 367

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 85/279 (30%), Positives = 129/279 (46%), Gaps = 30/279 (10%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  R+ K Y   +E   RF+ F +SL+++   N+ R  P   R GI  F+D+S EEF
Sbjct: 67  FARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNR-RGLPY--RLGINRFADMSWEEF 123

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L  +H+  D                    +P  KDWRE GI+  V+
Sbjct: 124 QASRLGAAQNCSATLAGNHRMRD-----------------AAALPETKDWREDGIVSPVK 166

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST  + E+ +    G    LS Q+++DCA    N GCSGG      +++
Sbjct: 167 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 226

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
             N   L+ E  YP    +  C  K   P  V +K   + +  + +E  +   +    PV
Sbjct: 227 KYNG-GLDTEEAYPYTGVNGICHYK---PENVGVKVLDSVNITLGAEDELKNAVGLVRPV 282

Query: 273 IAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
             A   +  ++ Y  GV   + C  S  ++NHAV  VGY
Sbjct: 283 SVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGY 321


>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
 gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
           Short=MEP; AltName: Full=p39 cysteine proteinase;
           Contains: RecName: Full=Cathepsin L1 heavy chain;
           Contains: RecName: Full=Cathepsin L1 light chain; Flags:
           Precursor
 gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
 gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
 gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
 gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
 gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
 gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
 gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
 gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
 gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
 gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
 gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
 gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
 gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
          Length = 334

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 86/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)

Query: 25  VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
           ++ P  +Q     +  ++  +++ Y  +E + R   +EK++ +I+  N    + +     
Sbjct: 16  LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75

Query: 83  GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
            +  F D++ EEF     R  VN +           H  H K R     + +   IP   
Sbjct: 76  EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118

Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
           DWRE G +  V+NQ  CG+CWAFS     E    LK G L  LS Q ++DC+   GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
           +GG       ++  N   L+ E  YP   KD +CK +A       + + T    IP  E 
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233

Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +++  +AT GP+  A++A   + Q+Y  G I Y  + S  N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283


>gi|387915132|gb|AFK11175.1| cathspsin H [Callorhinchus milii]
          Length = 330

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 96/317 (30%), Positives = 144/317 (45%), Gaps = 32/317 (10%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
           ++    L+A+  L     VS+    Q++  F ++  ++ K YS  E+  R + F ++   
Sbjct: 1   MVLSATLLAIALLGGVCCVSEFTF-QEIVSFKTWMTQHNKHYSSEEYSYRLRTFIQNKRK 59

Query: 67  IEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           +EE N  R S    R G+ +FSD++  EFK  +L        L           NHV   
Sbjct: 60  VEEHNSGRHS---YRMGLNQFSDMTFSEFKKLYL--------LREPQNCSATRGNHV--- 105

Query: 127 SITTGITIPTGIPVKKDWREAG-IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
            ++ G       P   DWR  G  +  V+NQ  CG+CW FST    ES  A+K G L  L
Sbjct: 106 -LSMGP-----YPDFVDWRTKGNYVTPVKNQGGCGSCWTFSTTGCLESAIAIKTGKLLSL 159

Query: 186 SVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN- 243
           + Q+++DCAG   N GC+GG      +++  N   LE E +YP   +D  C+ +   PN 
Sbjct: 160 AEQQLVDCAGAYKNHGCNGGLPSQAFEYIKYNG-GLEAEKDYPYTAQDQHCQYQ---PNK 215

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSLANIN 301
            V       +     E+ I+  +A   PV  A       +QY  G     NCD +   +N
Sbjct: 216 AVAFVKEVVNITQYDENGIVDAVARLNPVSIAFEVTDDFFQYEGGVYSNSNCDSTPDKVN 275

Query: 302 HAVQIVGY--DNYSRTW 316
           HAV  VGY   N ++ W
Sbjct: 276 HAVLAVGYGVQNGTKYW 292


>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
          Length = 333

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 86/303 (28%), Positives = 144/303 (47%), Gaps = 32/303 (10%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
           L A+C+    +  + P  +Q L+  +  ++  +K+ Y  +E   R   +EK++ +IE  N
Sbjct: 7   LAAVCW---GIASAIPKFDQNLDTQWYQWKATHKRLYGLNEEGWRRAVWEKNMRMIELHN 63

Query: 72  KN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
               Q       G+  + D++ EEF+            +M+  ++  H    + +  +  
Sbjct: 64  GEYSQGKHGFTMGMNAYGDMTNEEFRQ-----------VMNGFQNQKHKKGKMFRDPLL- 111

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
            +  P  +    DWRE G +  V+NQ  CG+CWAFS     E     K G L  LS Q +
Sbjct: 112 -LQYPKSV----DWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFQKTGKLISLSEQNL 166

Query: 191 IDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           +DC+   GN GC+GG       ++  N   L+ E  YP    D  CK K        + +
Sbjct: 167 VDCSHPQGNQGCNGGLMDYAFQYVKDNS-GLDSEESYPYEGMDGTCKYKPE----CSVAN 221

Query: 250 YTCDTLIPS-ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQI 306
            T    IP  E ++L  +AT GP+ AA++A  +++Q+Y  G I Y+ D S  +++H + +
Sbjct: 222 DTGFVDIPGHEKALLRAVATVGPISAAIDAGHMSFQFYKSG-IYYDPDCSSKDLDHGILV 280

Query: 307 VGY 309
           VGY
Sbjct: 281 VGY 283


>gi|74213650|dbj|BAE35627.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 86/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)

Query: 25  VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
           ++ P  +Q     +  ++  +++ Y  +E + R   +EK++ +I+  N    + +     
Sbjct: 16  LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75

Query: 83  GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
            +  F D++ EEF     R  VN +           H  H K R     + +   IP   
Sbjct: 76  EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118

Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
           DWRE G +  V+NQ  CG+CWAFS     E    LK G L  LS Q ++DC+   GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
           +GG       ++  N   L+ E  YP   KD +CK +A       + + T    IP  E 
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233

Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +++  +AT GP+  A++A   + Q+Y  G I Y  + S  N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283


>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
          Length = 336

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 91/309 (29%), Positives = 144/309 (46%), Gaps = 30/309 (9%)

Query: 8   LFIVALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
           +  +A++ALC   A+      P L+   EL+ S+   + K Y + E   R   +EK+L  
Sbjct: 1   MLPLAVVALCLSAALSAPSLDPQLDDHWELWKSW---HSKKYHEKEEGWRRMVWEKNLKK 57

Query: 67  IEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
           IE  N ++     S R G+  F D++ EEF+            LM+ +K         + 
Sbjct: 58  IELHNLEHSMGTHSYRLGMNHFGDMTHEEFRQ-----------LMNGYK------RKAET 100

Query: 126 RSITTGITIPTGI--PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
           ++  +    P  +  P   DWR+ G +  V++Q  CG+CWAFST    E  H  K G L 
Sbjct: 101 KARGSLFLEPNFLEAPKSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLV 160

Query: 184 LLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
            LS Q ++DC+   GN GC+GG       ++  N+  L+ E  YP L  D        + 
Sbjct: 161 SLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQ-GLDSEDSYPYLGTDDQPCHYDPTY 219

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
           N V    +  D     E +++  +A  GPV  A++A   ++Q+Y  G I Y  + S   +
Sbjct: 220 NSVNDTGFV-DIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSG-IYYEKECSSEEL 277

Query: 301 NHAVQIVGY 309
           +H V +VGY
Sbjct: 278 DHGVLVVGY 286


>gi|388491952|gb|AFK34042.1| unknown [Lotus japonicus]
          Length = 352

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 90/302 (29%), Positives = 147/302 (48%), Gaps = 38/302 (12%)

Query: 22  PVKVSKPNLEQKLEL---------FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN 71
           P+++     EQ L++         F+ F  +Y K Y S  E   RF+ F ++L++I+  N
Sbjct: 29  PIRLVSDLEEQVLQVIGQTRHAASFARFASKYGKRYDSVEEIQHRFRIFSENLELIKSTN 88

Query: 72  KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITT 130
           K R S    + G+  F+DLS +EF+T+ L  + N    L+ +HK  D             
Sbjct: 89  KKRLS---YKLGLNHFADLSWDEFRTQKLGAAQNCSATLIGNHKLTD------------- 132

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
                  +  +KDWR+  I+ +V++Q  CG+CW FST    E+ +A  +G    LS Q++
Sbjct: 133 -----AVLSAEKDWRKESIVSEVKDQAHCGSCWTFSTTGALEAAYAQAHGKNISLSEQQL 187

Query: 191 IDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           +DCAG   N GC+GG      +++  N  +   E EYP   KD A K  A +   V++  
Sbjct: 188 VDCAGAFNNFGCNGGLPSQAFEYIKYNGGI-ALEKEYPYTAKDEASKFTAENV-AVRVLD 245

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIV 307
            + +  + +E  +   +A   PV  A   +  ++ Y  GV   + C  +  ++NHAV  V
Sbjct: 246 -SVNITLGAEDELKHAVAFARPVSVAFQVVDGFRLYKEGVYTSDTCGNTPMDVNHAVLAV 304

Query: 308 GY 309
           GY
Sbjct: 305 GY 306


>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
          Length = 363

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 85/279 (30%), Positives = 126/279 (45%), Gaps = 30/279 (10%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY KSY S +E   RF+ F +SL   EE+    Q   S R GI  +SD+S EEF
Sbjct: 62  FARFAVRYGKSYESAAEVQRRFRIFSESL---EEVRSTNQKGLSYRLGINRYSDMSWEEF 118

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  +      L  +H+  D +                  +P  KDWRE GI+  V+
Sbjct: 119 QASRLGAAQTCSATLRGNHRMQDAN-----------------ALPETKDWREDGIVSPVK 161

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC+GG      +++
Sbjct: 162 DQSHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYI 221

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
             N   L+ E  YP    +  C  K   P    ++   + +  + +E  +   +    PV
Sbjct: 222 KYNG-GLDTEESYPYKGVNGVCHYK---PENAAVQVLDSVNITLNAEDELQNAVGLVRPV 277

Query: 273 IAAVNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A   +    QY  G     +C  +  ++NHAV  VGY
Sbjct: 278 SVAFEVINGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGY 316


>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
          Length = 380

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 84/312 (26%), Positives = 148/312 (47%), Gaps = 27/312 (8%)

Query: 2   FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
           F   ++LF   L+ L        +++   ++   ++ S+  +Y KSY S  E + RF+ F
Sbjct: 7   FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           +++L  I+E N +     S + G+ +F+DL++EEF++ +L  +   +     +++     
Sbjct: 67  KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPR-- 122

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
                     G  +P+ +    DWR AG +  +++Q  CG CWAFS + T E ++ +  G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169

Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS QE+IDC    N  GC+G        ++ +N   +  E  YP   +D  C    
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGSYITDGFPFI-INNGGINTEENYPYTAQDGECNVDL 228

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
            +   V I +Y  + +  +    L    T+ PV  A++A    ++ Y  G+    C  + 
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA- 285

Query: 298 ANINHAVQIVGY 309
             I+HAV IVGY
Sbjct: 286 --IDHAVTIVGY 295


>gi|66815893|ref|XP_641963.1| cysteine protease 4 [Dictyostelium discoideum AX4]
 gi|166201984|sp|P54639.2|CYSP4_DICDI RecName: Full=Cysteine proteinase 4; Flags: Precursor
 gi|60469981|gb|EAL67962.1| cysteine protease 4 [Dictyostelium discoideum AX4]
          Length = 442

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 90/306 (29%), Positives = 151/306 (49%), Gaps = 34/306 (11%)

Query: 13  LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
           L  LC L +    +K      Q    F+++ Q ++++YS  E + R++ F+ ++D + + 
Sbjct: 4   LSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQIFKSNMDYVHQW 63

Query: 71  NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           N   +  E+   G+  F+D++ +E++T +L    +   L+   +         +K   T 
Sbjct: 64  NS--KGGETV-LGLNVFADITNQEYRTTYLGTPFDGSALIGTEE---------EKIFSTP 111

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT---LSLLSV 187
             T+        DWR  G +  ++NQ  CG CW+FST  + E  H + +GT   L  LS 
Sbjct: 112 APTV--------DWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSE 163

Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA-ACKRKATSPNGV 245
           Q +IDC+ + GN GC GG      +++ +N   ++ ES YP   +D   CK K TS  G 
Sbjct: 164 QNLIDCSKSYGNNGCEGGLMTLAFEYI-INNKGIDTESSYPYTAEDGKECKFK-TSNIGA 221

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHA 303
           +I SY  +    SE+S L   + + PV  A++A   ++Q Y  G I Y    S   ++H 
Sbjct: 222 QIVSYQ-NVTSGSEAS-LQSASNNAPVSVAIDASNESFQLYESG-IYYEPACSPTQLDHG 278

Query: 304 VQIVGY 309
           V +VGY
Sbjct: 279 VLVVGY 284


>gi|33333714|gb|AAQ11975.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 323

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 89/281 (31%), Positives = 133/281 (47%), Gaps = 30/281 (10%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE-SARYGITEFSDLSE 92
           E +  F+  + K+Y S  E   RF  F+K+L  I+E NK  +  E S    +T+F+D++ 
Sbjct: 21  EEWQQFKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTH 80

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF        V    L S   + +            T I     +    DWR+ G +  
Sbjct: 81  EEFLDLLKLQGV--PALPSDAVYFEE-----------TDIEEKDAV----DWRKEGAVTP 123

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
           V+NQ  CG+CWAFS V   E     KNGTL  LS QE++DCA    GN GC+GG      
Sbjct: 124 VKNQGHCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEYYGNEGCNGGLMGQAF 183

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           D+++   +  + E  YP   K + C+         K+K+Y    L+ +E  I   ++  G
Sbjct: 184 DFVEDEGI--QTEESYPYKAKRSICQMNGEYV--TKVKTY---HLLLNEQEIARAVSAKG 236

Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGS--LANINHAVQIVGY 309
           PV  A++A    +Y  G++   C  S    ++NH V +VGY
Sbjct: 237 PVAVAIDASQLSFYDQGIVDEKCKCSKKREDLNHGVLVVGY 277


>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 459

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 82/311 (26%), Positives = 153/311 (49%), Gaps = 35/311 (11%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQK-----LELFSSFQQRYKKSYSK--SEHDIRFKNFEK 62
           I+AL+   F+A+        + Q+     + L+  ++ ++ K ++   +E + RF  F+ 
Sbjct: 9   IMALLFFLFIALSAASPSSIIPQRTDDEVMALYDQWRAKHGKLHNNLGAEPENRFHIFKD 68

Query: 63  SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           +L  I+E+N         R G+  F+DL+ EE+++R+L          S  + +   + +
Sbjct: 69  NLKFIDEINAQNLP---YRLGLNVFADLTNEEYRSRYLGGK-----FASGSRRNRTSNRY 120

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
           + +     G  +P  I    DWR  G +  V++Q +CG+CWAFSTV + E+++ +  G L
Sbjct: 121 LPR----LGDDLPDSI----DWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDL 172

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKA 239
             LS QE++DC  + N GC+GG    L+D+     +    L+ E +YP    D++C +  
Sbjct: 173 IALSEQELVDCDRSYNEGCNGG----LMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYK 228

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA-AVNALTWQYYLGGVIQYNCDGSLA 298
            +   V I SY  D  + +E ++   ++     +A      ++Q Y  G+    C     
Sbjct: 229 KNAKVVAIDSYE-DVPVNNEKALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCG---T 284

Query: 299 NINHAVQIVGY 309
           +++H V +VGY
Sbjct: 285 DLDHGVNVVGY 295


>gi|118373813|ref|XP_001020099.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89301866|gb|EAR99854.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 332

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 147/316 (46%), Gaps = 37/316 (11%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHD-----IRFKN 59
           K +L  +AL+   +LA  V +      +KL  ++ +  +  +++   E       + F+N
Sbjct: 4   KFILLSIALLMPIYLAQNVSI------EKLLAYNKWSTQNLRAFLSDEEKLFRQLVFFEN 57

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
            +K  D       N Q   +    + +FSD++EEEF  + L  S   HV+  H K    +
Sbjct: 58  LQKVKD------HNSQDHHTYSLDLNQFSDMTEEEFVEKVLMKS---HVVDLHIKQATSN 108

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
           ++     S +T         V  DWR  G +  V+NQ  CG+CW FS     ES + +KN
Sbjct: 109 NSTSSASSNSTSNNAT----VTVDWRTKGAVTSVKNQGQCGSCWTFSAAGLMESFNFIKN 164

Query: 180 GTLSLLSVQEVIDCA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
             L+  S Q+++DC     G G+ GC+GG   + LD+    K  +     YP +     C
Sbjct: 165 KNLTNFSEQQLVDCVNSANGYGSNGCNGGWPASCLDYSS--KFGITTLQNYPYVGVQKKC 222

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CD 294
               T+ NG K KS+     IP+ S  L +     PV   V+A TW +Y  GV  YN C+
Sbjct: 223 NITGTN-NGFKPKSW---KQIPNTSKDLQNALNFSPVSVVVDASTWSHYRSGV--YNGCN 276

Query: 295 GSLANINHAVQIVGYD 310
            +   +NHAV  VGYD
Sbjct: 277 QTKIQLNHAVLAVGYD 292


>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 79/281 (28%), Positives = 142/281 (50%), Gaps = 31/281 (11%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           L+ S+  ++ K+Y+   E D RF+ F+ +L  I+E N    +    + G+ +F+DL+ EE
Sbjct: 51  LYESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDHT---YKLGLNKFADLTNEE 107

Query: 95  FKTRHLR-HSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           ++  +    +++    +S  K   + +         +G ++P  +    DWRE G +  V
Sbjct: 108 YRMTYTGIKTIDDKKKLSKMKSDRYAYR--------SGDSLPEYV----DWREQGAVTDV 155

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW- 212
           ++Q +CG+CWAFST  + E ++ +  G L  +S QE+++C  + N GC+GG    L+D+ 
Sbjct: 156 KDQGSCGSCWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTSYNQGCNGG----LMDYA 211

Query: 213 --MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
               +    ++ E +YP   KD  C +   +   V I SY  D  +  ESS+   ++   
Sbjct: 212 FEFIIKNGGIDTEEDYPYTGKDGKCDKNKKNAKVVTIDSYE-DVPVNDESSLKKAVSNQ- 269

Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           PV  A+ A    +Q+Y  G+   +C  +L   +H V   GY
Sbjct: 270 PVAVAIEAGGRDFQFYTSGIFTGSCGTAL---DHGVLAAGY 307


>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 325

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 94/319 (29%), Positives = 153/319 (47%), Gaps = 42/319 (13%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLD 65
            L  V+LI LCF  I   + KP  E    ++   +  + K+YS +SE ++R+  ++ +++
Sbjct: 3   ALIFVSLITLCFGYI---IEKPIRESSWYVW---KMAHNKAYSHESEENVRYAIWKDNMN 56

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
            I E N   ++       +  F D++  EF+ +     +N  +L   HKH +        
Sbjct: 57  RITEYNSKSKN---VILRMNHFGDMTNTEFRAK-----MNGLLL---HKHQN-------- 97

Query: 126 RSITTGITIP--TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
               +   +P  T  P   DWR  G +  V+NQ  CG+CWAFS+    E  H  K G L 
Sbjct: 98  ---GSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGALEGQHFKKTGRLV 154

Query: 184 LLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
            LS Q ++DC+ + GN GC+GG       ++  N  + + E+ YP   +D  C R + S 
Sbjct: 155 SLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGI-DTETGYPYEGQDGTC-RYSKSS 212

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLAN 299
            G     +  D     E ++   +AT GPV  A++A  +++Q+Y  GV  +  C  S + 
Sbjct: 213 IGADDTGFV-DIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSGVYDEPQC--SPSA 269

Query: 300 INHAVQIVGY--DNYSRTW 316
           ++H V +VGY  DN    W
Sbjct: 270 LDHGVLVVGYGTDNGKDYW 288


>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
          Length = 365

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 89/325 (27%), Positives = 159/325 (48%), Gaps = 47/325 (14%)

Query: 1   MFDVKNVLFIVAL-IALCFLAIPVKVSKPNL----EQKLELFSSFQQRYKKSYSK-SEHD 54
           +F +  +LF+ +   A+    I  K  K +     E+  E++  +  ++ K YS   E++
Sbjct: 4   LFIISILLFLASFSYAMDISTIEYKYDKSSAWRTDEEVKEIYELWLAKHDKVYSGLVEYE 63

Query: 55  IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLR------HSVNKHV 108
            RF+ F+ +L  I+E N    +    + G+T ++DL+ EEF+  +L       H + + +
Sbjct: 64  KRFEIFKDNLKFIDEHNSENHT---YKMGLTPYTDLTNEEFQAIYLGTRSDTIHRLKRTI 120

Query: 109 LMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTV 168
            +S    ++   N                +P + DWR+ G +  V+NQ  CG+CWAFSTV
Sbjct: 121 NISERYAYEAGDN----------------LPEQIDWRKKGAVTPVKNQGKCGSCWAFSTV 164

Query: 169 ETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPL 228
            T ES++ ++ G L  LS Q+++DC    N GC GG F     ++ ++   ++ E+ YP 
Sbjct: 165 STVESINQIRTGNLISLSEQQLVDC-NKKNHGCKGGAFVYAYQYI-IDNGGIDTEANYPY 222

Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHGPVIAAVNALT--WQYY 284
                 C+    +   V+I  Y     +P  +E+++   +A+  P + A++A +  +Q+Y
Sbjct: 223 KAVQGPCR---AAKKVVRIDGYKG---VPHCNENALKKAVASQ-PSVVAIDASSKQFQHY 275

Query: 285 LGGVIQYNCDGSLANINHAVQIVGY 309
             G+    C   L   NH V IVGY
Sbjct: 276 KSGIFSGPCGTKL---NHGVVIVGY 297


>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
 gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
          Length = 346

 Score =  114 bits (284), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 90/329 (27%), Positives = 151/329 (45%), Gaps = 39/329 (11%)

Query: 2   FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK--------SEH 53
           F    +   VA+ +  + +I   +S+P L+ +L +    Q+R+ +  +K         E 
Sbjct: 3   FKHMQIFLFVAIFSSFYFSI--SLSRP-LDNELIM----QKRHIEWMTKHGRVYADVKEK 55

Query: 54  DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRH-SVNKHVLMSH 112
             R+  F+ +++ IE LN N  +  + +  + +F+DL+ +EF++ +     V+     S 
Sbjct: 56  SNRYVVFKSNVERIEHLN-NIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSSQSQ 114

Query: 113 HKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAE 172
            K     + +V   ++          P+  DWR  G +  ++NQ +CG CWAFS V   E
Sbjct: 115 TKTTSFRYQNVSSGAL----------PISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIE 164

Query: 173 SMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKD 232
               +K G L  LS Q+++DC  N + GC GG      + + +    L  ES YP   +D
Sbjct: 165 GATQIKKGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHI-MATGGLTTESNYPYKGED 222

Query: 233 AACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQ 290
           A C  K T+P    I  Y  D  +  E +++  +A H PV   +      +Q+Y  GV  
Sbjct: 223 ATCNSKKTNPKATSITGYE-DVPVNDEQALMKAVA-HQPVSVGIEGGGFDFQFYSSGVFT 280

Query: 291 YNCDGSLANINHAVQIVGYD---NYSRTW 316
             C   L   +HAV  +GY    N S+ W
Sbjct: 281 GECTTYL---DHAVTAIGYGQSTNGSKYW 306


>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
 gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
          Length = 364

 Score =  114 bits (284), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 147/314 (46%), Gaps = 37/314 (11%)

Query: 8   LFIVALIALCFL---AIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKS 63
           + I  L+ L F    A  + +   +  + ++++  +  +++K Y+   E + RF+ F+ +
Sbjct: 4   MLIPTLLLLSFTFSHATAMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDN 63

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHN 121
           L  I++ N    +      G+ +F+D++ +E++  +L  R    + V+ + +  H + +N
Sbjct: 64  LGFIQDHNAQNNT---YTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHRYAYN 120

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
              +            +PV  DWR  G +G +++Q  CG+CWAFSTV   E ++ +  G 
Sbjct: 121 SGDQ------------LPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGE 168

Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRK 238
              LS QE++DC    + GC+GG    L+D+     +    ++ E +YP    D  C   
Sbjct: 169 FVSLSEQELVDCDREYDEGCNGG----LMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDET 224

Query: 239 ATSPNGVKIKSYTCDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDG 295
                 V+I  Y     +PS + + L    +H PV  A+ A     Q Y  GV    C  
Sbjct: 225 KKKTKVVQIDGYED---VPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGT 281

Query: 296 SLANINHAVQIVGY 309
           +L   +H V +VGY
Sbjct: 282 AL---DHGVVVVGY 292


>gi|343475823|emb|CCD12886.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  113 bits (283), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 88/316 (27%), Positives = 150/316 (47%), Gaps = 28/316 (8%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 67  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   + + TG  P+  DWR+ G +  V++Q  C + WAFS +   E    +  
Sbjct: 111 ALKRPRKV---VNVSTGRPPMTVDWRKKGAVTPVKDQGKCDSSWAFSAIGNIEGQWKIAG 167

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAACKRK 238
             L+ LS Q ++ C  N ++GC  G       W +  NK  +  E  YP           
Sbjct: 168 HELTSLSEQMLVSCDTN-DLGCELGLKDPAFQWILWSNKGNVFTEQSYPYASGGGNVPTC 226

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
             S   V  K      L   E +I   +A  GPV  AV+A ++Q Y GGV+  +C     
Sbjct: 227 DMSGKVVGAKISNMRYLPLDEDTIAEWLARKGPVAIAVDATSFQRYTGGVLT-SCISR-- 283

Query: 299 NINHAVQIVGYDNYSR 314
            +N+   +VGYD+ S+
Sbjct: 284 RLNYGALLVGYDDTSK 299


>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
          Length = 484

 Score =  113 bits (283), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 79/276 (28%), Positives = 129/276 (46%), Gaps = 19/276 (6%)

Query: 36  LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           L+  ++ R+  +    +   RF  F+ ++ +I E N+ R  P   R  +  F D++ +EF
Sbjct: 155 LYERWRGRHALARDLGDKARRFNVFKANVRLIHEFNR-RDEPYKLR--LNRFGDMTADEF 211

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +    RH     V  +HH+            + +        +P   DWR+ G +  V++
Sbjct: 212 R----RHYAGSRV--AHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKD 265

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
           Q  CG+CWAFST+   E ++A+K   L+ LS Q+++DC    N GC+GG       ++  
Sbjct: 266 QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAK 325

Query: 216 NKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAA 275
           +  V   E  YP   + A+CK K+ +P  V I  Y  + +  ++ S L     H PV  A
Sbjct: 326 HGGVAA-EDAYPYRARQASCK-KSPAPV-VTIDGY--EDVPANDESALKKAVAHQPVSVA 380

Query: 276 VNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           + A    +Q+Y  GV    C   L   +H V  VGY
Sbjct: 381 IEASGSHFQFYSEGVFSGRCGTEL---DHGVAAVGY 413


>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
          Length = 356

 Score =  113 bits (283), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 83/308 (26%), Positives = 145/308 (47%), Gaps = 32/308 (10%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLD 65
           V+F+   + + + +     +    +  ++ F  +   Y + Y  ++  +R F+ F+ +++
Sbjct: 7   VVFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVN 66

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
            IE  N   ++  S   GI +F+D++  EF              ++ +        ++++
Sbjct: 67  HIETFNSRNEN--SYTLGINQFTDMTNNEF--------------IAQYTGGISRPLNIER 110

Query: 126 RSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
             + +   +  + +P   DWR+ G +  V+NQ  CGACWAF+ + T ES++ +K G L  
Sbjct: 111 EPVVSFDDVDISAVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEP 170

Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           LS Q+V+DCA     GC GG      +++  NK V    + YP       CK     PN 
Sbjct: 171 LSEQQVLDCA--KGYGCKGGWEFRAFEFIISNKGVAS-GAIYPYKAAKGTCKTNGV-PNS 226

Query: 245 VKIKSYTCDTLIP--SESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANIN 301
             I  Y     +P  +ESS++  ++   P+  AV+A   +QYY  GV    C  SL   N
Sbjct: 227 AYITGY---ARVPRNNESSMMYAVSKQ-PITVAVDANANFQYYKSGVFNGPCGTSL---N 279

Query: 302 HAVQIVGY 309
           HAV  +GY
Sbjct: 280 HAVTAIGY 287


>gi|13491752|gb|AAK27969.1|AF242373_1 cysteine protease [Ipomoea batatas]
          Length = 366

 Score =  113 bits (283), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 82/284 (28%), Positives = 138/284 (48%), Gaps = 36/284 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F++R+ K Y S  EHD R   F+ ++       ++++   +A +G+T+FSDL+  EF
Sbjct: 49  FTVFKRRFGKVYASDEEHDYRLSEFKANM---RRAKQHQELDPAAVHGVTQFSDLTPTEF 105

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           + + L   +N+ +                     T   +PT  +P   DWR+ G +  V+
Sbjct: 106 RRKFL--GLNRRLKFPADAK--------------TAPILPTDELPSDFDWRDHGAVTPVK 149

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ TCG+C +FST    E  + L  G L  LS Q+++DC        AG+ + GC+GG  
Sbjct: 150 NQGTCGSCCSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLM 209

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
            +  ++  +    L  E ++P    D    R   +    K+ +++  +L   E  I  ++
Sbjct: 210 NSAFEYT-LKAGGLMREEDHPYTGNDLQVCRFDKTKIAAKVANFSVVSL--DEDQIAANL 266

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             +GP+  A+NA+  Q Y+GGV   Y C   L   +H V +VGY
Sbjct: 267 VKNGPLAVAINAVFMQTYIGGVSCPYICSKRL---DHGVLLVGY 307


>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
 gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  113 bits (283), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 86/303 (28%), Positives = 140/303 (46%), Gaps = 33/303 (10%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELN 71
           L A+  L + V  S        +LF ++ ++Y K+YS  E    R K FE++   + +  
Sbjct: 5   LWAVSILILAVHSSVSEASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQ-- 62

Query: 72  KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
            N  +  S    +  F+DL+  EFK   L  S  +   +               RS+ T 
Sbjct: 63  HNSMANASYTLALNAFADLTHHEFKASRLGFSPGRAQSI---------------RSVGTP 107

Query: 132 ITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVI 191
           +     +P   DWR++G +  V++Q  CG CW+FST    E ++ +  G+L  LS QE++
Sbjct: 108 VQ-ELHVPPAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELV 166

Query: 192 DCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
           DC  + N GC GG    L+D+     +    ++ E++YP +  D  C ++    + V I 
Sbjct: 167 DCDRSYNSGCEGG----LMDYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTID 222

Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAV--NALTWQYYLGGVIQYNCDGSLANINHAVQI 306
            YT   + P++   L  +    PV   +  +  T+Q Y  GV    C  +L   +HAV I
Sbjct: 223 GYT--DIPPNDEKQLLQVVAKQPVSVGICGSEKTFQLYSKGVYTGPCSSTL---DHAVLI 277

Query: 307 VGY 309
           VGY
Sbjct: 278 VGY 280


>gi|388521567|gb|AFK48845.1| unknown [Medicago truncatula]
          Length = 343

 Score =  113 bits (283), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 146/313 (46%), Gaps = 34/313 (10%)

Query: 7   VLFIVALIA--LCFL-AIPVKVSKPNLEQKLELF--SSFQQRYKKSY-SKSEHDIRFKNF 60
           V F VA  A  L F  + P+++     EQ L++   S F  RY K Y +  E   RFK F
Sbjct: 9   VFFCVATAAAGLSFHDSNPIRMVSDMEEQLLQVIGESRFANRYGKRYDTVDEMKRRFKIF 68

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVN-KHVLMSHHKHHDHH 119
            ++L +I+  NK R        G+  F+D + EEF++  L  + N    L  +H+  D  
Sbjct: 69  SENLQLIKSTNKKRLG---YTLGVNHFADWTWEEFRSHRLGAAQNCSATLKGNHRITD-- 123

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
                             +P +KDWR+ GI+ +V++Q  CG+CW FST    ES +A   
Sbjct: 124 ----------------VVLPAEKDWRKEGIVSEVKDQGHCGSCWTFSTTGALESAYAQAF 167

Query: 180 GTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK 238
           G    LS Q+++DCAG   N GC+GG      +++  N   LE E  YP   ++  C  K
Sbjct: 168 GKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNG-GLETEEVYPYTGQNGLC--K 224

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQ-YNCDGS 296
            TS N       + +  + +E  +   +A   PV  A   +  ++ Y  GV     C  +
Sbjct: 225 FTSENVAVQVLGSVNITLGAEDELKHAVAFARPVSVAFQVVDDFRLYKKGVYTGTTCGST 284

Query: 297 LANINHAVQIVGY 309
             ++NHAV  VGY
Sbjct: 285 PMDVNHAVLAVGY 297


>gi|354504701|ref|XP_003514412.1| PREDICTED: cathepsin R-like [Cricetulus griseus]
 gi|344245862|gb|EGW01966.1| Cathepsin R [Cricetulus griseus]
          Length = 333

 Score =  113 bits (283), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 89/311 (28%), Positives = 147/311 (47%), Gaps = 29/311 (9%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
           ++  + L FL + V  + P  +  L+  +  +++ Y K+YS+ E   +   +E ++ +I+
Sbjct: 1   MILAVLLGFLYLGVASAAPTPDYSLDAEWEEWKKSYDKTYSQEEERQKRAVWEDNVKMIK 60

Query: 69  ELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
            L+ +N     +    + EF DL+ EE K    +   +  VL   +  H      VK   
Sbjct: 61  LLSMENGLGMNNFTVEMNEFGDLTGEEMK----KMMTDSSVLTLRNGKHMQRLGDVK--- 113

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
                     IP   DWR  G +G VR Q  CGACWAF+   + ES    K G ++ LSV
Sbjct: 114 ----------IPKTLDWRTQGYVGPVRKQNGCGACWAFAVAASIESQLFKKTGKMTQLSV 163

Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
           Q +IDCA +    GC GG       ++  NK  LE E+ YP   K+  C+ +A   + VK
Sbjct: 164 QNLIDCARSYSTYGCKGGLVYGAFLYVKNNK-GLEAEATYPYEAKEGRCRYRAER-SVVK 221

Query: 247 IKSYTCDTLIP-SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
           I  +    ++P +E +++  + THGP+   ++A   ++  Y GG I +       N  H 
Sbjct: 222 ITRF---LVVPRNEEALMNALVTHGPIAVGIDAGHESFTNYAGG-IYHEPKCKTDNPTHG 277

Query: 304 VQIVGYDNYSR 314
           + +VG+    R
Sbjct: 278 LLLVGFGYEGR 288


>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
 gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
          Length = 479

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 85/294 (28%), Positives = 138/294 (46%), Gaps = 39/294 (13%)

Query: 31  EQKLE-LFSSFQQRYKKSYSKS---------EHDIRFKNFEKSLDIIEELNKNRQSPESA 80
           E++L+ LF S+  ++ KSY+ +         E   R+  F+ +L  I   N+  Q     
Sbjct: 50  EERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGENEKNQG---Y 106

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
             G+  F+DL+ EEF+ +  RH            H +  +  V+ + +          P 
Sbjct: 107 FLGLNAFADLTNEEFRAQ--RHGGRFDRSRERTSHEEFRYGSVQLKDL----------PD 154

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
             DWRE G +  V++Q +CG+CWAFS V   E ++ L  G L  LS QE++DC    + G
Sbjct: 155 SIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEG 214

Query: 201 CSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
           C+GG    L+D+     +    L+ E++YP       C R   +   V I  Y  D  + 
Sbjct: 215 CNGG----LMDYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYE-DVPVN 269

Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            E+++L  +A H PV  A++A   + Q+Y  G+    C     +++H V  VGY
Sbjct: 270 DETALLKAVA-HQPVSVAIDAGGSSMQFYRSGIFTGRCG---TDLDHGVTNVGY 319


>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
 gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
          Length = 352

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/283 (28%), Positives = 136/283 (48%), Gaps = 30/283 (10%)

Query: 34  LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           + ++  +  ++ K+Y+   E   RF+ F+ +L  I+E N    +    + G+T+F+DL+ 
Sbjct: 1   MSMYKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQNHT---YKVGLTKFADLTN 57

Query: 93  EEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
           EE++   L   S  K  LM      + +       +   G  +P  +    DWR  G + 
Sbjct: 58  EEYRAMFLGTRSDAKRRLMKSKSPSERY-------AFKAGDKLPESV----DWRAKGAVN 106

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD 211
            +++Q +CG+CWAFSTV   E ++ +  G L  LS QE++DC    N GC+GG    L+D
Sbjct: 107 PIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGG----LMD 162

Query: 212 W---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
           +     +N   L+ E +YP +  D  C +       V I  +  + ++P +   L     
Sbjct: 163 YAFQFIINNGGLDTEKDYPYVGDDDKCDKDKMKTKAVSIDGF--EDVLPYDEKALQKAVA 220

Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           H PV  A+ A  +  Q+Y  GV    C  +L   +H V +VGY
Sbjct: 221 HQPVSVAIEASGMALQFYQSGVFTGECGTAL---DHGVVVVGY 260


>gi|9630063|ref|NP_046281.1| cathepsin [Orgyia pseudotsugata MNPV]
 gi|2499880|sp|O10364.1|CATV_NPVOP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|7435821|pir||T10394 cathepsin - Orgyia pseudotsugata nuclear polyhedrosis virus
 gi|1911371|gb|AAC59124.1| cathepsin [Orgyia pseudotsugata MNPV]
          Length = 324

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 149/301 (49%), Gaps = 32/301 (10%)

Query: 14  IALCFLAIPV-KVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELN 71
           I LC L   V   +  +L +    F  F  ++ K+YS +SE   RFK F+ +L+  E +N
Sbjct: 4   IMLCLLVCGVVHAATYDLLKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLE--EIIN 61

Query: 72  KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS-HHKHHDHHHNHVKKRSITT 130
           KN Q+  +A+Y I +FSDLS+EE        +++K+  +S  H+  +     +  R    
Sbjct: 62  KN-QNDSTAQYEINKFSDLSKEE--------AISKYTGLSLPHQTQNFCEVVILDRPPDR 112

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
           G       P++ DWR+   +  V+NQ  CGACWAF+T+ + ES  A+K   L  LS Q+ 
Sbjct: 113 G-------PLEFDWRQFNKVTSVKNQGVCGACWAFATLGSLESQFAIKYNRLINLSEQQF 165

Query: 191 IDCAGNGNMGCSGGDF-CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           IDC    N GC GG    A    M++  V +  ES+YP    +  C+    +PN   +  
Sbjct: 166 IDC-DRVNAGCDGGLLHTAFESAMEMGGVQM--ESDYPYETANGQCR---INPNRFVVGV 219

Query: 250 YTCDTLIPSESSILTDIATH-GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
            +C   I      L D+    GP+  A++A     Y  G+++   +  L   NHAV +VG
Sbjct: 220 RSCRRYIVMFEEKLKDLLRAVGPIPVAIDASDIVNYRRGIMRQCANHGL---NHAVLLVG 276

Query: 309 Y 309
           Y
Sbjct: 277 Y 277


>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
 gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
          Length = 446

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 133/281 (47%), Gaps = 29/281 (10%)

Query: 37  FSSFQQRYKK--SYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           ++S+  ++ K  + S S  D RF+ F+++   IEE   NR    S R G+ +FSDL+ EE
Sbjct: 13  YASWCAKFGKECASSNSLGDRRFETFKENFRYIEE--HNRAGKHSYRLGLNQFSDLTSEE 70

Query: 95  FKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           F+ R L  R  +    ++   +  D          I  G      +P   DWR+ G +  
Sbjct: 71  FRQRFLGLRPDLIDSPVLKMPRDSD----------IEEGFQ-NVDLPASVDWRKHGAVTA 119

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
            ++Q +CG CWAF+T    E ++ +  G L  LS QE+IDC    + GC GG       +
Sbjct: 120 PKDQGSCGGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKKADKGCDGGLMENAYQF 179

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHG 270
           + V    L+ E++YP    ++ C  K  +   V I  Y     IP   E ++L  +A   
Sbjct: 180 I-VENGGLDTETDYPYHASESHCNMKKLNSRVVAIDGY---EAIPDGDEQALLRAVAKQ- 234

Query: 271 PVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           PV  A+   +  +Q+Y  GV   +C      INH V IVGY
Sbjct: 235 PVSVAIEGASKDFQHYASGVFTGHCG---EEINHGVLIVGY 272


>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
          Length = 458

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 78/283 (27%), Positives = 131/283 (46%), Gaps = 24/283 (8%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSP-ESARYGITEFS 88
           E+   L++ ++  + KSY+   E + R+  F  +L  I+E N    +   S R G+  F+
Sbjct: 34  EEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFA 93

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
           DL+ EE++  +L             ++       V  R +         +P   DWR  G
Sbjct: 94  DLTNEEYRDTYL-----------GLRNKPRRERKVSDRYLAAD---NEALPESVDWRTKG 139

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
            + ++++Q  CG+CWAFS +   E ++ +  G L  LS QE++DC  + N GC+GG    
Sbjct: 140 AVAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDY 199

Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
             D++ +N   ++ E +YP   KD  C     +   V I SY  + + P+  + L     
Sbjct: 200 AFDFI-INNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSY--EDVTPNSETSLQKAVR 256

Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           + PV  A+ A    +Q Y  G+    C  +L   +H V  VGY
Sbjct: 257 NQPVSVAIEAGGRAFQLYSSGIFTGKCGTAL---DHGVAAVGY 296


>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
 gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
          Length = 376

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 79/278 (28%), Positives = 128/278 (46%), Gaps = 24/278 (8%)

Query: 36  LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           L+  ++ R+  +    +   RF  F+ ++ +I E N+ R  P   R  +  F D++ +EF
Sbjct: 48  LYERWRGRHALARDLGDKARRFNVFKANVRLIHEFNR-RDEPYKLR--LNRFGDMTADEF 104

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG--IPVKKDWREAGIIGKV 153
           +         +H   S   HH       +  S +          +P   DWR+ G +  V
Sbjct: 105 R---------RHYAGSRVAHHRMFRGDRQGSSASASFMYADARDVPASVDWRQKGAVTDV 155

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM 213
           ++Q  CG+CWAFST+   E ++A+K   L+ LS Q+++DC    N GC+GG       ++
Sbjct: 156 KDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYI 215

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             +  V   E  YP   + A+CK K+ +P  V I  Y  + +  ++ S L     H PV 
Sbjct: 216 AKHGGVA-AEDAYPYRARQASCK-KSPAPV-VTIDGY--EDVPANDESALKKAVAHQPVS 270

Query: 274 AAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            A+ A    +Q+Y  GV    C   L   +H V  VGY
Sbjct: 271 VAIEASGSHFQFYSEGVFSGRCGTEL---DHGVTAVGY 305


>gi|294883334|ref|XP_002770714.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239873999|gb|EER02719.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 330

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 87/308 (28%), Positives = 145/308 (47%), Gaps = 40/308 (12%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIE 68
           I++ + L FL +   + +  +E     F  FQ ++ K+Y   E ++ R   F+ +L +IE
Sbjct: 4   IISFVLLSFLPLVKCLDEGTVELA---FMGFQHKFGKNYESKEEEVKRNAIFQANLHLIE 60

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEF---KTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
           ++N    S    + G+ E++DL+ EEF   K   L+    +H  +S     D        
Sbjct: 61  QVNAKNLS---YKLGVNEYADLTHEEFAALKLGTLKMRPAEHASLSLFVSAD-------- 109

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
                     T +P   DWR   ++  V++Q +CG+CWAFS     E+ +A+  G L  L
Sbjct: 110 ---------TTQLPTSVDWRNKSVLSPVKDQGSCGSCWAFSAAGALEAQYAIATGKLRPL 160

Query: 186 SVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           S Q+++DC+   G  GC GG       +  +    L+ ES YP    +  C+ +    +G
Sbjct: 161 SEQQLVDCSHKYGTNGCFGGFMADAYKY--IKSAGLDQESTYPYKGVNEPCRPREKKADG 218

Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANIN 301
           + ++ +  DT   +E S++  +A   PV  A+ A    +  YL GV     C+G    I+
Sbjct: 219 IPVR-FVLDT--KTEQSLMKALA-DAPVSVAMYASDFLFHLYLSGVYSSTTCNG---EID 271

Query: 302 HAVQIVGY 309
           HAV  VGY
Sbjct: 272 HAVVAVGY 279


>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 460

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 78/263 (29%), Positives = 126/263 (47%), Gaps = 32/263 (12%)

Query: 52  EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS 111
           E D RF+ F+ +L  I+E N    S    + G+T F+DL+ EE+++ +L     K VL +
Sbjct: 69  EKDQRFEIFKDNLRFIDEHNNKNLS---YKLGLTRFADLTNEEYRSIYLGAKSKKRVLKT 125

Query: 112 HHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
             ++                  +   IP   DWR+ G +  V++Q +CG+CWAFST+   
Sbjct: 126 SDRYQPR---------------VGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAV 170

Query: 172 ESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPL 228
           E ++ +  G L  LS QE++DC  + N GC+GG    L+D+     +    ++ E +YP 
Sbjct: 171 EGINKIVTGDLISLSEQELVDCDTSYNQGCNGG----LMDYAFEFIIKNGGIDTEEDYPY 226

Query: 229 LLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLG 286
              D  C +   +   V I +Y  D    +E+++   +A   P+  A+ A    +Q Y  
Sbjct: 227 KAADGRCDQTRKNAKVVTIDAYE-DVPENNEAALKKTLANQ-PISVAIEAGGRAFQLYSS 284

Query: 287 GVIQYNCDGSLANINHAVQIVGY 309
           GV    C   L   +H V  VGY
Sbjct: 285 GVFDGICGTEL---DHGVVAVGY 304


>gi|4757570|gb|AAD29084.1|AF082181_1 cysteine proteinase precursor [Solanum melongena]
          Length = 363

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 84/284 (29%), Positives = 128/284 (45%), Gaps = 38/284 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F+ +Y K Y S+ EHD R K F+ +L       +++    +A +GIT+FSDL+  EF
Sbjct: 47  FSLFKSKYGKIYASQEEHDHRLKVFKANL---RRARRHQLLDPTAEHGITQFSDLTPSEF 103

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTG-IPVKKDWREAGIIGKVR 154
           +  +L                   H    K +      +PT  +P   DWRE G +  V+
Sbjct: 104 RRTYLGL-----------------HKPRPKLNAQKAPILPTSDLPEDFDWREKGAVTGVK 146

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
           NQ +CG+CW+FST    E  H L  G L  LS Q+++DC            + GC+GG  
Sbjct: 147 NQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEEKSECDAGCNGGLM 206

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               ++  +    L+ E +YP   +D  C     S     + +++   L   E  I  ++
Sbjct: 207 TTAFEYT-LKAGGLQREKDYPYTGRDGKCHFD-KSKIAASVANFSVIGL--DEDQIAANL 262

Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
             HGP+   +NA   Q Y+ GV     C       +H V +VGY
Sbjct: 263 VKHGPLAVGINAAWMQTYMRGVSCPLIC---FKRQDHGVLLVGY 303


>gi|218478060|dbj|BAH03396.1| cathepsin L-like cysteine peptidase [Taenia saginata]
          Length = 338

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 87/307 (28%), Positives = 147/307 (47%), Gaps = 26/307 (8%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
           F++ LI +  LA  V+ S    E++L   +  ++ ++ + YS+ E   R   F ++L  I
Sbjct: 6   FLLLLI-IHPLAAVVETSALLTERELSRQWIGWKLQHGRVYSEKEEAYRRGIFARNLLYI 64

Query: 68  EELNKNRQSP-ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           +  N+   +  ES   G+ +F+DL   EF  R L                       K+ 
Sbjct: 65  KGQNRRFNAGLESYSTGLNQFADLESSEFSERFLGTRPGSRAAG-------------KRG 111

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
            I   +     +P   DWR+  ++ +V+NQ  CG+CWAFS+    E   A K G L  LS
Sbjct: 112 RIWKALASAADLPDTVDWRDKNLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLS 171

Query: 187 VQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
            Q+++DC+  NGN GC+GG       +++ + +  EPES YP    D  C+   +   GV
Sbjct: 172 EQQLVDCSLKNGNDGCNGGYMSYAFKYLEEHSI--EPESAYPYRATDGPCRYNESL--GV 227

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN-CDGSLANINH 302
              +   D    +E++++  +AT GP+  A++A  L + +Y  G+ + + C      +NH
Sbjct: 228 GTVTDIGDIPEGNETALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSSKF--LNH 285

Query: 303 AVQIVGY 309
            V  +GY
Sbjct: 286 GVLAIGY 292


>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
 gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
          Length = 479

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 84/294 (28%), Positives = 134/294 (45%), Gaps = 39/294 (13%)

Query: 31  EQKLE-LFSSFQQRYKKSYSKS---------EHDIRFKNFEKSLDIIEELNKNRQSPESA 80
           E++L+ LF S+  ++ KSY+++         E   R+  F+ +L  I   N+  Q     
Sbjct: 50  EERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGENEKNQG---Y 106

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
             G+  F+DL+ EEF+ +             H    D             G      +P 
Sbjct: 107 FLGLNAFADLTNEEFRAQR------------HGGRFDRSRERTSYEEFRYGSVQLKDLPD 154

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
             DWRE G +  V++Q +CG+CWAFS V   E ++ L  G L  LS QE++DC    + G
Sbjct: 155 SIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEG 214

Query: 201 CSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
           C+GG    L+D+     +    L+ E++YP       C R   +   V I  Y  D  + 
Sbjct: 215 CNGG----LMDYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYE-DVPVN 269

Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            E+++L  +A H PV  A++A   + Q+Y  G+    C     +++H V  VGY
Sbjct: 270 DETALLKAVA-HQPVSVAIDAGGSSMQFYRSGIFTGRCG---TDLDHGVTNVGY 319


>gi|222628593|gb|EEE60725.1| hypothetical protein OsJ_14236 [Oryza sativa Japonica Group]
          Length = 364

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/286 (28%), Positives = 137/286 (47%), Gaps = 55/286 (19%)

Query: 37  FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFK 96
           F+SF++R+ ++Y                       + R+   +A +G+T+FSDL+  EF+
Sbjct: 58  FASFERRFGRTYPGP-------------------RRARRLDPTATHGVTKFSDLTPGEFR 98

Query: 97  TRHL---RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGK 152
            R L   R S+   V    H+                   +PT G+P   DWRE G +G 
Sbjct: 99  DRFLGLRRPSLEGLVGGEPHE----------------APILPTDGLPDDFDWREHGAVGP 142

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGG 204
           V++Q +CG+CW+FST    E  H L  G L +LS Q+++DC        +   + GC+GG
Sbjct: 143 VKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGG 202

Query: 205 DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
                  ++ +    L+ E +YP   ++  CK    S    ++K+++  ++  +E  I  
Sbjct: 203 LMTTAFSYL-MKSGGLQSEKDYPYAGRENTCKFD-KSKIVAQVKNFSVISV--NEDQIAA 258

Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           ++  HGP+  A+NA   Q Y+GGV   + C     +++H V +VGY
Sbjct: 259 NLVKHGPLAIAINAAYMQTYIGGVSCPFICG---RHLDHGVLLVGY 301


>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 93/330 (28%), Positives = 148/330 (44%), Gaps = 47/330 (14%)

Query: 4   VKNVLFIVAL-IALCFLAIPVKVSKPNLEQKL--ELFSSFQQRYKKSYSK-SEHDIRFKN 59
            KN  + ++  + LC      +VS   L+     E    +  RY K Y    E + RF  
Sbjct: 3   TKNQFYQISFALVLCLGLWAFQVSSRTLQDASMHERHEQWMARYGKVYKDLQEKEKRFNI 62

Query: 60  FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
           F++++  IE  N     P   + G+ +F+DL+ +EF             + + +K   H 
Sbjct: 63  FQENVKYIEASNNAGNKP--YKLGVNQFTDLTNKEF-------------IATRNKFKGHM 107

Query: 120 HNHVKKRSI--TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
            + + + +      +T P+ +    DWR+ G +  V+NQ TCG CWAFS V   E +H L
Sbjct: 108 SSSITRTTTFKYENVTAPSTV----DWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKL 163

Query: 178 KNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLK 231
             G L  LS QE++DC  +G + GC GG    L+D  D  K +     L  E++YP    
Sbjct: 164 STGNLVSLSEQELVDCDTSGADQGCQGG----LMD--DAFKFIIQNGGLNTEAQYPYQGV 217

Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
           D  C       +   I  Y  D    +E ++   +A   P+  A++A    +Q Y  GV 
Sbjct: 218 DGTCNTNEEVTHVATITGYE-DVPSNNEQALQQAVANQ-PISVAIDASGSDFQNYQSGVF 275

Query: 290 QYNCDGSLANINHAVQIVGY---DNYSRTW 316
             +C   L   +H V +VGY   D+ ++ W
Sbjct: 276 TGSCGTQL---DHGVAVVGYGVSDDGTKYW 302


>gi|218185|dbj|BAA14404.1| oryzain gamma precursor [Oryza sativa Japonica Group]
          Length = 362

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 87/278 (31%), Positives = 129/278 (46%), Gaps = 28/278 (10%)

Query: 37  FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  R+ K Y   +E   RF+ F +SL+++   N+ R  P   R GI  F+D+S EEF
Sbjct: 62  FARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNR-RGLPY--RLGINRFADMSWEEF 118

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L  +H+  D                    +P  KDWRE GI+  V+
Sbjct: 119 QASRLGAAQNCSATLAGNHRMRD-----------------APALPETKDWREDGIVSPVK 161

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST  + E+ +    G    LS Q++ DCA    N GCSGG      +++
Sbjct: 162 DQGHCGSCWPFSTTGSLEARYTQATGPPVSLSEQQLADCATRYNNFGCSGGLPSQAFEYI 221

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP    +  C  K  +  GVK+      TL+ +E  +   +    PV 
Sbjct: 222 KYNG-GLDTEEAYPYTGVNGICHYKPENA-GVKVLDSVNITLV-AEDELKNAVGLVRPVS 278

Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
            A   +  ++ Y  GV   + C  S  ++NHAV  VGY
Sbjct: 279 VAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGY 316


>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
          Length = 522

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 79/278 (28%), Positives = 130/278 (46%), Gaps = 22/278 (7%)

Query: 36  LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           L+  +   + ++Y+   E D RF+ F  +L  ++  N+ R +    R G+ +F+DL+ +E
Sbjct: 108 LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNE-RAAEHGFRLGMNQFADLTNDE 166

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F+  +L   +             + H    +            +P   DWRE G +  V+
Sbjct: 167 FRAAYLGARIPASRRRGTAVGERYRHGGGAEE-----------LPESVDWREKGAVAPVK 215

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           NQ  CG+CWAFS V + ES++ +  G +  LS QE+++C+ + GN GC+GG   A  D++
Sbjct: 216 NQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFI 275

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
            +    ++ E +YP    D  C     +   V I  +  D     E S+   +A H PV 
Sbjct: 276 -IKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFE-DVPENDEKSLQKAVA-HQPVS 332

Query: 274 AAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            A+ A    +Q Y  GV    C     N++H V  VGY
Sbjct: 333 VAIEAGGREFQLYKAGVFTGTC---TTNLDHGVVAVGY 367


>gi|357446993|ref|XP_003593772.1| Cysteine proteinase [Medicago truncatula]
 gi|355482820|gb|AES64023.1| Cysteine proteinase [Medicago truncatula]
          Length = 339

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/285 (28%), Positives = 133/285 (46%), Gaps = 19/285 (6%)

Query: 28  PNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITE 86
           P  ++ +E+F  + + + + Y    E   +F  F  +L  I E N  R+S      G+T 
Sbjct: 9   PTQDKTIEIFQLWMKEHGRVYKDLDEMAKKFDIFISNLKYITETNAKRKSSNGFLLGLTN 68

Query: 87  FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE 146
           F+D S EEF+ R+L H+++    +   K +D H          +  + P+ +    DWR 
Sbjct: 69  FTDWSSEEFQERYL-HNIDMPTDIDTMKVNDVH---------LSSCSAPSSL----DWRS 114

Query: 147 AGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF 206
            G++  +++Q+ CG+CWAFS V   E ++A+  G L  LS QE++DC      GC+ G  
Sbjct: 115 KGVVSDIKDQKNCGSCWAFSAVGAIEGINAITTGKLINLSEQELLDCDPISG-GCNSGWV 173

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
               DW+  NK V   +++YP   +   CK     PN       T   +  S+  +L  +
Sbjct: 174 NKAFDWVIRNKGV-ALDNDYPYTAEKGVCKASQI-PNSAISSINTYHHVEQSDQGLLCAV 231

Query: 267 ATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGYD 310
           A     +       + +Y  G+    NC  +  + NH V IVGYD
Sbjct: 232 AKQPVSVCLYAPQDFHHYSSGIYDGPNCPVNSKDTNHCVLIVGYD 276


>gi|391333957|ref|XP_003741376.1| PREDICTED: cathepsin S-like [Metaseiulus occidentalis]
          Length = 333

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 93/311 (29%), Positives = 148/311 (47%), Gaps = 36/311 (11%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
           V+    L++ C   +  K+ K  L  K   ++S++  +KKSYS +E  +R  N+  +  +
Sbjct: 5   VVCAALLVSACQAEVSPKLMKAALRAK---WTSYKAAHKKSYSAAEESLRMANYLDNTRV 61

Query: 67  IEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSV--NKHVLMSHHKHHDHHHNHV 123
           IEE N +  Q  ES   G  E SDL+ EE K+  +   +  N   + ++   H       
Sbjct: 62  IEEHNARFHQGLESYELGHNELSDLTLEEIKSTRMGLVLPPNAAEIAANASRH------- 114

Query: 124 KKRSITTGITIPTGI--PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
                      P+ I  P   DWR    +  V+NQ +CG+C++FS +   E+ +  K+G 
Sbjct: 115 ---------FAPSDIVAPGSVDWRSKRCVQYVKNQGSCGSCYSFSALGALETSYCNKHGQ 165

Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
           L  L+ Q ++DCAG    GCSGG    + +++  N   ++ +  YP   K   CK+    
Sbjct: 166 LPDLAEQHLVDCAGR---GCSGGWMHDMFNYLQSNGGAID-QRRYPYTGKVEQCKQDRM- 220

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQ--YYLGGVIQY-NCDGSLA 298
           P    + +Y       +E+ ++  IAT G V  A NA T Q  YY GG++   NC  +  
Sbjct: 221 PKAAGVATYK-QISRGNENELMQAIATVGTVSIAYNAGTQQHSYYRGGILDVPNCGNT-- 277

Query: 299 NINHAVQIVGY 309
              HAV +VGY
Sbjct: 278 -PTHAVLLVGY 287


>gi|311247276|ref|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa]
          Length = 367

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 92/284 (32%), Positives = 140/284 (49%), Gaps = 34/284 (11%)

Query: 35  ELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           E+F+ FQ +Y +SYS  +EH  R   F ++L   + L +      +A +G+T FSDL+EE
Sbjct: 40  EVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLG--TAEFGVTPFSDLTEE 97

Query: 94  EFKTRHLRH-SVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE-AGIIG 151
           EF   H  H    K   M            +K  S  +G T+P       DWR+  G+I 
Sbjct: 98  EFGQLHGHHWGAGKAPSMG-----------IKVGSEESGETVPQSC----DWRKKPGVIS 142

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGG-DFCALL 210
            +++Q+ C  CWA + V+  E+  A+K      LSVQ+V+DC   GN GC+GG  + A L
Sbjct: 143 AIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRCGN-GCNGGFVWDAFL 201

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTDIAT 268
             ++ + +  E +  Y   +K   C  K       +  ++  D L+    E SI   +AT
Sbjct: 202 TVLNTSGLASEQDYPYKGTVKTHRCLAKQH-----RKVAWIQDFLMLQFCEQSIARYLAT 256

Query: 269 HGPVIAAVNALTWQYYLGGVIQ---YNCDGSLANINHAVQIVGY 309
            GP+   +NA   Q Y  GVI+     CD  L  +NH+V +VG+
Sbjct: 257 EGPITVTINAGLLQQYKRGVIRATPATCDPHL--VNHSVLLVGF 298


>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
          Length = 359

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 88/312 (28%), Positives = 148/312 (47%), Gaps = 30/312 (9%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPNL---EQKLELFSSFQQRYKKSYSKSEHDIRFKNFE 61
           K  LF V L  +   A+ +++++ +L   E   +L+  ++  +  S   SE   RF  F+
Sbjct: 5   KAFLFAVVLAVILVAAMSMEITERDLASEESLWDLYERWRSHHTVSRDLSEKRKRFNVFK 64

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
            ++  I ++N   Q  +  +  +  F+D++  EF  R    S  KH  M H    +    
Sbjct: 65  ANVHHIHKVN---QKDKPYKLKLNSFADMTNHEF--REFYSSKVKHYRMLHGSRANTGFM 119

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
           H K  S+          P   DWR+ G +  V+NQ  CG+CWAFSTV   E ++ +K G 
Sbjct: 120 HGKTESL----------PASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQ 169

Query: 182 LSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
           L  LS QE++DC  + N GC+GG      +++  +  +   E  YP   +D +C     +
Sbjct: 170 LVSLSEQELVDCETD-NEGCNGGLMENAYEFIKKSGGIT-TERLYPYKARDGSCDSSKMN 227

Query: 242 PNGVKIKSYTCDTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSL 297
              V I  +    ++P+  E++++  +A   PV  A++A     Q+Y  GV  Y  D   
Sbjct: 228 APAVTIDGH---EMVPANDENALMKAVANQ-PVSVAIDASGSDMQFYSEGV--YAGDSCG 281

Query: 298 ANINHAVQIVGY 309
             ++H V +VGY
Sbjct: 282 NELDHGVAVVGY 293


>gi|21263041|gb|AAM44832.1|AF510856_1 cathepsin L2 [Fasciola gigantica]
          Length = 326

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 84/305 (27%), Positives = 145/305 (47%), Gaps = 34/305 (11%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
           LFI+A++ +  L               +L+  +++ Y K Y+ ++ + R   +E+++  I
Sbjct: 3   LFILAVLTVGVLG-----------SNDDLWHQWKRMYNKEYNGADDEHRRNIWEENVKHI 51

Query: 68  EELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           +E N ++     +   G+ +F+D++ EEFK ++L        ++SH   ++ ++      
Sbjct: 52  QEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAKYLTEMPRASDILSHGIPYEANNR----- 106

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
                      +P K DWRE+G + +V++Q  CG+CWAFST  T E  +     T    S
Sbjct: 107 ----------AVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFS 156

Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
            Q+++DC+G  GNMGC GG      +++   +  LE ES YP    +  C+         
Sbjct: 157 EQQLVDCSGPWGNMGCMGGLMENAYEYL--KQFGLETESSYPYTAVEGQCRYNRQLGVAK 214

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGVIQYNCDGSLANINHAV 304
               YT  +   SE  +   +   GP   AV+  + +  Y GG+ Q     SL ++NHAV
Sbjct: 215 VTDYYTVHS--GSEVELKNLVGAEGPAAVAVDVESDFMMYSGGIYQSRTCSSL-HVNHAV 271

Query: 305 QIVGY 309
             VGY
Sbjct: 272 LAVGY 276


>gi|218478062|dbj|BAH03397.1| cathepsin L-like cysteine peptidase [Taenia asiatica]
          Length = 338

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 87/307 (28%), Positives = 147/307 (47%), Gaps = 26/307 (8%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
           F++ LI +  LA  V+ S    E++L   +  ++ ++ + YS+ E   R   F ++L  I
Sbjct: 6   FLLLLI-IHPLAAVVETSALLTERELSRQWIGWKLQHGRVYSEKEEAYRRGIFARNLLYI 64

Query: 68  EELNKNRQSP-ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           +  N+   +  ES   G+ +F+DL   EF  R L                       K+ 
Sbjct: 65  KGQNRRFNAGLESYSTGLNQFADLESSEFSERFL-------------GTRPESRAAGKRG 111

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
            I   +     +P   DWR+  ++ +V+NQ  CG+CWAFS+    E   A K G L  LS
Sbjct: 112 RIWKALASAADLPDTVDWRDKNLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLS 171

Query: 187 VQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
            Q+++DC+  NGN GC+GG       +++ + +  EPES YP    D  C+   +   GV
Sbjct: 172 EQQLVDCSLKNGNDGCNGGYMSYAFKYLEEHSI--EPESAYPYRATDGPCRYNESL--GV 227

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN-CDGSLANINH 302
              +   D    +E++++  +AT GP+  A++A  L + +Y  G+ + + C      +NH
Sbjct: 228 GTVTDIGDIPEGNETALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSSKF--LNH 285

Query: 303 AVQIVGY 309
            V  +GY
Sbjct: 286 GVLAIGY 292


>gi|6649575|gb|AAF21461.1|U69120_1 cysteine proteinase PWCP1 [Paragonimus westermani]
          Length = 427

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 94/287 (32%), Positives = 144/287 (50%), Gaps = 46/287 (16%)

Query: 36  LFSSFQQRYKKSYSKSEHDIRFKNFEKSL---DIIEELNKNRQSPESARYGITEFSDLSE 92
           LF  FQ++++KSYS S+   R+  F+ +L    +I+ L K      +A YGIT+FSDLS 
Sbjct: 126 LFEEFQRKFRKSYS-SDTAKRYALFKYNLLKMQLIQRLEKG-----TANYGITKFSDLSA 179

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EEF     RHS      +++ K      + ++     T I     +P   DWR  G + +
Sbjct: 180 EEF-----RHS------LANMKRRKSKGSQMETAIFPTTIQ---SLPPSFDWRANGAVTE 225

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V++Q  CG+CWAF+T    E     K   L  LS Q+++DC    +  C+GG    L +W
Sbjct: 226 VKDQGMCGSCWAFATTGNIEGQWFRKTNKLISLSEQQLLDC-DTKDEACNGG----LPEW 280

Query: 213 MDVNKVV----LEPESEYPL-LLKDAACKRKATSPNGVKIKSYT--CDTLIPSESSILTD 265
              +++V    L  E +YP   +K+ +C  +   PN   I +Y     TL   E+ +   
Sbjct: 281 A-YDEIVKMGGLMSEKDYPYEAMKEQSCHLR--RPN---ISAYINGSATLPSDEAKLAAW 334

Query: 266 IATHGPVIAAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
           +  +GP+   VNA   Q+YLGG+       C  S A ++HAV +VGY
Sbjct: 335 LVQNGPISVGVNANFLQFYLGGISHPPHMLC--SEAGLDHAVLLVGY 379


>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
           supertexta]
          Length = 347

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 84/261 (32%), Positives = 128/261 (49%), Gaps = 36/261 (13%)

Query: 60  FEKSLDIIEELNKNRQSPESARY-GITEFSDLSEEEFKTRH-LRHSVNKHVLMSHHKHHD 117
           F+++L  IEE NK     + + Y GI +F+D+  EEF+  + LR   N       +    
Sbjct: 66  FKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFRMYNGLRRDYN-------YSREV 118

Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
              NH+    +          P + DWR+ G +  V+NQ  CG+CW+FST  + E  H  
Sbjct: 119 QCSNHLTPEYLVA--------PDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGSLEGQHFH 170

Query: 178 KNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
           K+G L  LS Q+++DC+G  GN GC+GG      +++  N  + E E EYP   +   C 
Sbjct: 171 KSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGI-ETEEEYPYDARQERCH 229

Query: 237 RK-----ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
            K     AT+   V +KS         E+ +   +A  GPV  A++A   ++Q Y GGV 
Sbjct: 230 FKKSEVAATASGCVDVKS-------GDETDLKNSVAEVGPVSIAIDASHQSFQLYSGGVY 282

Query: 290 -QYNCDGSLANINHAVQIVGY 309
            +  C  S   ++H V +VGY
Sbjct: 283 DEPKC--SSTELDHGVLVVGY 301


>gi|268581031|ref|XP_002645498.1| Hypothetical protein CBG22748 [Caenorhabditis briggsae]
          Length = 379

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 94/309 (30%), Positives = 143/309 (46%), Gaps = 30/309 (9%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYS-KSEHDIRFKNFEKSL 64
           V  I  L + C  A+     +    Q+ E LF  F  ++ + YS + E+  R+  F  ++
Sbjct: 49  VFLIFVLFSSC--ALREMGKRKTATQRYEVLFDEFLYKFNRLYSSQEEYKYRYHIFVHNV 106

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
              EE  + R+ P    + I EF+D SEEE +         K ++   +   + +    +
Sbjct: 107 REFEE--EERKHP-GLDFDINEFTDWSEEELR---------KMIVDKKNVKEEKNAVRFE 154

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
              +++GI  P  I    DWR+ G +  ++NQ  CG+CWAF+TV   E+ HA+K G L  
Sbjct: 155 GSVLSSGIKRPASI----DWRDQGKLTPIKNQGQCGSCWAFATVAAIEAQHAIKKGILVS 210

Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPL-LLKDAACKRKATSPN 243
           LS QE++DC G  N GCSGG     + ++  N   LE E  YP   LK   C       N
Sbjct: 211 LSEQEMVDCDGRNN-GCSGGYRPYAMRFVKENG--LETEKSYPYSALKHDQC---MLHQN 264

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY-YLGGVIQYNCD--GSLANI 300
             K+       L  SE +I   + T GPV   +N +   Y Y  G+   + +     +  
Sbjct: 265 DTKVYIDDYRMLSTSEENIADWVGTKGPVTFGMNVVKAMYSYRSGIFNPSAEDCAEKSMG 324

Query: 301 NHAVQIVGY 309
            HA+ IVGY
Sbjct: 325 AHALTIVGY 333


>gi|319976406|gb|ADV90878.1| cysteine proteinase B [Leishmania donovani]
          Length = 332

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 77/236 (32%), Positives = 115/236 (48%), Gaps = 23/236 (9%)

Query: 80  ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
           AR+GIT+F DLSE EF  R+L    N     +  K H   H    +  ++        +P
Sbjct: 3   ARFGITKFFDLSEAEFAARYL----NGAAYFAAAKQHAGQHYRKARADLSA-------VP 51

Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNM 199
              DWRE G +  V+NQ  CG+CWAFS V   ES  A     L  LS Q+++ C    N 
Sbjct: 52  DAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDN- 110

Query: 200 GCSGGDFCALLDWMDVNKV-VLEPESEYPLLLKD---AACKRKATSPNGVKIKSYTCDTL 255
           GC+GG      +W+  +   ++  E  YP    +   A C   +    G +I  Y    +
Sbjct: 111 GCNGGLMLQAFEWLLRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGY---VM 167

Query: 256 IPSESSILTD-IATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
           IPS  +++   +A +GP+  AV+A ++  Y  GV+  +C G    +NH V +VGY+
Sbjct: 168 IPSNETVMAAWLAENGPIAIAVDASSFMSYQSGVLT-SCAGDA--LNHGVLLVGYN 220


>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
          Length = 467

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 80/285 (28%), Positives = 137/285 (48%), Gaps = 28/285 (9%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           ++ + ++ ++  ++ K+Y+   E + RF  F+ +L  I+E N    +    R G+  F+D
Sbjct: 43  DEVMAMYEAWLVKHGKAYNALGEKEKRFGIFKDNLRFIDEHNSQNLT---YRLGLNRFAD 99

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ EE+++ +L   V         K        V ++S      +   +P   DWR+ G 
Sbjct: 100 LTNEEYRSMYL--GVKPGATRVTRK--------VSRKSDRFAARVGDALPDFIDWRKEGA 149

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  V++Q +CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG    L
Sbjct: 150 VVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGG----L 205

Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
           +D+     +N   ++ E +YP    D  C +   + N V I  Y  + +  ++ + L   
Sbjct: 206 MDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANVVSIDGY--EDVPENDEAALKKA 263

Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               PV  A+ A    +Q Y  GV    C  SL   +H V  VGY
Sbjct: 264 VAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSL---DHGVAAVGY 305


>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 431

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 80/270 (29%), Positives = 129/270 (47%), Gaps = 32/270 (11%)

Query: 45  KKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSV 104
           K   S +E D RF+ F+ +L  I+E N    S    R G+T+F+DL+ +E+++ +L   +
Sbjct: 51  KAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLS---YRLGLTKFADLTNDEYRSMYLGSRL 107

Query: 105 NKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWA 164
            +                  K S+     +   IP   DWR+ G + +V++Q +CG+CWA
Sbjct: 108 KRKA---------------TKTSLRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGSCWA 152

Query: 165 FSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLE 221
           FST+   E ++ +  G L  LS QE++DC  + N GC+GG    L+D+     +    ++
Sbjct: 153 FSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGG----LMDYAFEFIIKNGGID 208

Query: 222 PESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAV--NAL 279
            E +YP    D  C +   +   V I SY  D    SE S L    +H P+  A+     
Sbjct: 209 TEEDYPYKGVDGRCDQTRKNAKVVTIDSYE-DVPANSEES-LKKALSHQPISVAIEGGGR 266

Query: 280 TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +Q Y  G+    C     +++H V  VGY
Sbjct: 267 AFQLYDSGIFDGICG---TDLDHGVVAVGY 293


>gi|74222595|dbj|BAE38161.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 87/292 (29%), Positives = 138/292 (47%), Gaps = 31/292 (10%)

Query: 25  VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
           ++ P  +Q     +  ++  +++ Y  +E + R   +EK++ +I+  N    + +     
Sbjct: 16  LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75

Query: 83  GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
            +  F D++ EEF     R  VN +           H  H K R     + +   IP   
Sbjct: 76  EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118

Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
           DWRE G +  V+NQ  CG+CWAFS     E    LK G L  LS Q ++DC+   GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
           +GG       ++  N   L+ E  YP   KD +CK +A       + + T    IP  E 
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233

Query: 261 SILTDIATHGPVIAAVNAL--TWQYY-LGGVIQYNCDGSLANINHAVQIVGY 309
           +++  +AT GP+  A++A   + Q+Y LG   + NC  S  N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSLGIYYEPNC--SSKNLDHGVLLVGY 283


>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
           Full=Papaya proteinase III; Short=PPIII; AltName:
           Full=Papaya proteinase omega; Flags: Precursor
 gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
          Length = 348

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 79/280 (28%), Positives = 134/280 (47%), Gaps = 28/280 (10%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           E+ ++LF+S+   + K Y   +  + RF+ F+ +L+ I+E NK   S      G+ EF+D
Sbjct: 42  ERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNS---YWLGLNEFAD 98

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           LS +EF  +++   ++  +  S+ +   +                   +P   DWR+ G 
Sbjct: 99  LSNDEFNEKYVGSLIDATIEQSYDEEFINEDT--------------VNLPENVDWRKKGA 144

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  VR+Q +CG+CWAFS V T E ++ ++ G L  LS QE++DC    + GC GG     
Sbjct: 145 VTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSH-GCKGGYPPYA 203

Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
           L+++  N + L   S+YP   K   C+ K     G  +K+     + P+    L +    
Sbjct: 204 LEYVAKNGIHL--RSKYPYKAKQGTCRAKQVG--GPIVKTSGVGRVQPNNEGNLLNAIAK 259

Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
            PV   V +    +Q Y GG+ +  C      ++HAV  V
Sbjct: 260 QPVSVVVESKGRPFQLYKGGIFEGPCG---TKVDHAVTAV 296


>gi|126021|sp|P25775.1|LMCPA_LEIME RecName: Full=Cysteine proteinase A; Flags: Precursor
 gi|9573|emb|CAA44094.1| cysteine proteinase [Leishmania mexicana]
          Length = 354

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 149/316 (47%), Gaps = 32/316 (10%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSYS-KSEHDIRFKNFEKS 63
            + +  L  +C+ +  +  + P ++  +    + SF++R+ K++   +E   RF  F+++
Sbjct: 10  AIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAFKQN 69

Query: 64  LDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           +     LN   Q+P  A Y ++ +F+DL+ +EF   +L    N      H K+H      
Sbjct: 70  MQTAYFLNT--QNPH-AHYDVSGKFADLTPQEFAKLYL----NPDYYARHLKNH------ 116

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
             K  +    + P+G+ +  DWR+ G +  V+NQ  CG+CWAFS +   E   A    +L
Sbjct: 117 --KEDVHVDDSAPSGV-MSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSL 173

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAA---CKRK 238
             LS Q ++ C  N + GC+GG     ++W M  +   +  E+ YP          C  +
Sbjct: 174 VSLSEQMLVSC-DNIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCHDE 232

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
                G KI  +   +L   E  I   +   GPV  AV+A TWQ Y GGV+      SL 
Sbjct: 233 GEV--GAKITGFL--SLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSLCLAWSL- 287

Query: 299 NINHAVQIVGYDNYSR 314
             NH V IVG++  ++
Sbjct: 288 --NHGVLIVGFNKNAK 301


>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
          Length = 361

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 85/317 (26%), Positives = 147/317 (46%), Gaps = 32/317 (10%)

Query: 2   FDVKNVLFIVALIALCF-LAIPVKVSKPNLEQK---LELFSSFQQRYKKSYSKSEHDIRF 57
            +VK V F+    AL   +A   + ++ +LE +    +L+  ++  +  S S  E   RF
Sbjct: 1   MEVKKVFFVALSFALVLRVAESFEFNEKDLESEEGLWDLYERWRSHHTVSRSLDEKHNRF 60

Query: 58  KNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
             F+ ++  +   NK     +  +  +  F+D++  EF++ +    VN            
Sbjct: 61  NVFKGNVMHVHSSNK---MDKPYKLKLNRFADMTNHEFRSIYAGSKVN------------ 105

Query: 118 HHHNHVKKRSITTGITIPTGI---PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESM 174
            HH   +      G  +   +   P   DWR+ G +  V++Q  CG+CWAFST+   E +
Sbjct: 106 -HHRMFRGTPRGNGTFMYQNVDRVPSSVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGI 164

Query: 175 HALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAA 234
           + +K   L  LS QE++DC    N GC+GG   +  ++  + +  +   S YP   KD  
Sbjct: 165 NQIKTHKLVPLSEQELVDCDTTQNQGCNGGLMESAFEF--IKQYGITTASNYPYEAKDGT 222

Query: 235 CKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN 292
           C     +   V I  +  +  + +E+++L  +A H PV  A+ A  + +Q+Y  GV   N
Sbjct: 223 CDASKVNEPAVSIDGHE-NVPVNNEAALLKAVA-HQPVSVAIEAGGIDFQFYSEGVFTGN 280

Query: 293 CDGSLANINHAVQIVGY 309
           C  +L   +H V IVGY
Sbjct: 281 CGTAL---DHGVAIVGY 294


>gi|330793420|ref|XP_003284782.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
 gi|325085276|gb|EGC38686.1| hypothetical protein DICPUDRAFT_28222 [Dictyostelium purpureum]
          Length = 347

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 79/281 (28%), Positives = 135/281 (48%), Gaps = 27/281 (9%)

Query: 32  QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLS 91
           Q    F+++  + ++ Y+  E   R+  F+ ++D ++E N   +  E+   G+  F+D++
Sbjct: 25  QYRNAFTNWMIQNQRHYASEEFAARYNIFKANMDYVQEWNS--KGSETV-LGLNTFADIT 81

Query: 92  EEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
            +EF++ +L    +   +++                      I        DWR  G + 
Sbjct: 82  NQEFRSIYLGTPFDGSSIINTETEK-----------------IFAAPAASIDWRTKGAVT 124

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALL 210
            ++NQQ CG CW+FST  + E   A+  G L  LS Q +IDC+G+ GN GC+GG      
Sbjct: 125 PIKNQQQCGGCWSFSTTGSTEGATAIAKGNLPSLSEQNLIDCSGSYGNNGCNGGLMTLAF 184

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           +++ +N   ++ ES YP   KD    +   +  G  + SY+ +    SE S L   A  G
Sbjct: 185 EYI-INNKGIDTESSYPYTAKDGKTCKYNPANIGATLSSYS-NVTSGSEPS-LESAANIG 241

Query: 271 PVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           PV  A++A   ++Q Y  G I Y    S  +++H V +VGY
Sbjct: 242 PVSVAIDASHNSFQLYSSG-IYYEPACSTTSLDHGVLVVGY 281


>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
 gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
          Length = 425

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 87/281 (30%), Positives = 133/281 (47%), Gaps = 29/281 (10%)

Query: 37  FSSFQQRYKK--SYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
           ++S+  ++ K  + S S  D RF+ F+++   IEE   NR    S R G+ +FSDL+ EE
Sbjct: 13  YASWCAKFGKECASSNSLGDHRFETFKENFRYIEE--HNRAGKHSYRLGLNQFSDLTSEE 70

Query: 95  FKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           F+ R L  R  +    ++   +  D          I  G      +P   DWR+ G +  
Sbjct: 71  FRQRFLGLRPDLIDSPVLKMPRDSD----------IEEGFQ-NVDLPASVDWRQHGAVTA 119

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
            ++Q +CG CWAF+T    E ++ +  G L  LS QE+IDC    + GC GG       +
Sbjct: 120 PKDQGSCGGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKKADKGCDGGLMENAYQF 179

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHG 270
           + V    L+ E++YP    ++ C  K  +   V I  Y     IP   E ++L  +A   
Sbjct: 180 I-VENGGLDTETDYPYHASESHCNMKKLNSRVVAIDGY---KAIPEGDEQALLLAVAKQ- 234

Query: 271 PVIAAVNALT--WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           PV  A+   +  +Q+Y  GV   +C      INH V IVGY
Sbjct: 235 PVSVAIEGASKDFQHYASGVFTGHCG---EEINHGVLIVGY 272


>gi|77379397|gb|ABA71355.1| cysteine protease [Brassica napus]
          Length = 359

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 85/278 (30%), Positives = 134/278 (48%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y  +E   +RF  F+++LD+I   NK   S    + G+ +F+D++ +EF
Sbjct: 60  FARFTHRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFTDMTWQEF 116

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L   HK               TG      +P  KDWRE GI+  V+
Sbjct: 117 QRTKLGAAQNCSATLKGTHK--------------LTG----EALPETKDWREDGIVSPVK 158

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC+GG      +++
Sbjct: 159 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 218

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP   +D  CK  A +  GV++   + +  + +E  +   +    PV 
Sbjct: 219 KSNG-GLDTEEAYPYTGEDGTCKYSAENV-GVQVLD-SVNITLGAEDELKHAVGLLRPVS 275

Query: 274 AAVNAL-TWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
            A   + +++ Y  GV    +C  +  ++NHAV  VGY
Sbjct: 276 IAFEVIHSFRLYKSGVYSDSHCGQTPMDVNHAVLAVGY 313


>gi|89272015|emb|CAJ83143.1| cathepsin L2 [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 89/310 (28%), Positives = 142/310 (45%), Gaps = 32/310 (10%)

Query: 7   VLFIVALIALCFLAI-PVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLD 65
           +   + + A+C   +     + P L+    L+ ++   +KKSY+  E   R   +EK+L 
Sbjct: 1   MALYLGIAAICLTTVFAAPTTDPALDNHWNLWKNW---HKKSYAPKEEGWRRVLWEKNLR 57

Query: 66  IIEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
           +IE  N ++     S   G+ +F D++ EEF+            LM+ +K      N  K
Sbjct: 58  MIEFHNLEHSLGKHSHSLGMNQFGDMTNEEFRQ-----------LMNGYK------NQKK 100

Query: 125 KRSITTGITIPTGI--PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
            R  T     P     P   DWR+ G +  V++Q  CG+CWAFST    E  H    G +
Sbjct: 101 IRGST--FLAPNNFESPKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKM 158

Query: 183 SLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
             LS Q ++DC+   GN GC+GG       ++  N  + + E  YP   KD        +
Sbjct: 159 ISLSEQNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGI-DSEDSYPYTAKDDQECHYDPN 217

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
            N      +  D    SE  ++  +A+ GPV  AV+A   ++Q+Y  G I Y  + S  +
Sbjct: 218 YNSANDTGFV-DVTSESEKDLMNAVASVGPVSVAVDAGHQSFQFYKSG-IYYEPECSSED 275

Query: 300 INHAVQIVGY 309
           ++H V +VGY
Sbjct: 276 LDHGVLVVGY 285


>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
          Length = 329

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 84/282 (29%), Positives = 131/282 (46%), Gaps = 29/282 (10%)

Query: 40  FQQRYKKSYSKSEHDIRFKN-FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTR 98
           F+ ++ +SY+  E +   K  F +++ +I E N    +      G+ +F+DL+ EEF   
Sbjct: 22  FKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHT---YTLGVNQFADLTVEEFSKT 78

Query: 99  HLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQT 158
           ++             K+ D  +     R +  G  +PT +    DW   G +  V+NQ  
Sbjct: 79  YMGFK------KPAQKYGDAAY---LGRHVYNGEALPTSV----DWSSQGAVTPVKNQGQ 125

Query: 159 CGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNK 217
           CG+CW+FST  + E  + +  G L  LS Q+ +DCAG  GN GC+GG   +   + + N 
Sbjct: 126 CGSCWSFSTTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKYAEAN- 184

Query: 218 VVLEPESEYPLLLKDAACKRKATSPNGVK--IKSYTCDTLIPSESSILTDIATHGPVIAA 275
             L  E  YP    D +C+  + S    K  +  Y  D    SE  +++ +A   PV  A
Sbjct: 185 -ALCTEQSYPYKGTDGSCQASSCSTGLAKGSVSGYK-DVSSDSEQDMMSAVAQQ-PVSIA 241

Query: 276 VNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
           + A    +Q Y GGV+   C  SL   +H V  VGY   S T
Sbjct: 242 IEADKSVFQLYSGGVLTGACGASL---DHGVLAVGYGTLSGT 280


>gi|218478069|dbj|BAH03395.1| cathepsin L-like cysteine peptidase [Taenia solium]
          Length = 346

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 85/304 (27%), Positives = 142/304 (46%), Gaps = 32/304 (10%)

Query: 19  LAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSP 77
           LA+ V+ S    E++L   ++ ++ ++ + YS  E   R   F ++L  I+  N+   + 
Sbjct: 16  LAVVVETSALLTERELSRQWAGWKLQHGRVYSGKEEAYRRGIFARNLLYIKGQNRRFNAG 75

Query: 78  -ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
            ES   G+ +F+DL   EF  R L       V               ++  I   +    
Sbjct: 76  LESYSTGLNQFADLESSEFSERFLGTRPESRVAG-------------RRGRIWKALASAA 122

Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-G 195
           G+P   DWR+  ++ +V+NQ  CG+CWAFS+    E   A K G L  LS Q+++DC+  
Sbjct: 123 GLPDTVDWRDKNLVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKLISLSEQQLVDCSLK 182

Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
           NGN GC+GG       +++ +   +EPES YP    D  C+   +   GV   +   D  
Sbjct: 183 NGNDGCNGGYMSYAFKYLEEH--FIEPESAYPYRATDGPCRYNESL--GVGTVTDIGDIP 238

Query: 256 IPSESSILTDIATHGPVIAAVNA--LTWQYYL--------GGVIQYNCDGSLANINHAVQ 305
             +E++++  +AT GP+  A++A  L + +Y         G    + C      +NH V 
Sbjct: 239 EGNETALMEAVATVGPISIAIDASSLGFMFYRQVATNPHHGIYKSHWCSSKF--LNHGVL 296

Query: 306 IVGY 309
            +GY
Sbjct: 297 AIGY 300


>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
          Length = 364

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 87/276 (31%), Positives = 134/276 (48%), Gaps = 24/276 (8%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+SF +R+ K Y ++SE   RF  F+++L+II    +N +   +A YGI +F+DLS EEF
Sbjct: 64  FTSFIERHDKVYRNESEALKRFGIFKRNLEIIRSAQENDKG--TAIYGINQFADLSPEEF 121

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           K  HL H+          K  DH +  V   +   G+     +P   DWRE G + KV+ 
Sbjct: 122 KKTHLPHT---------WKQPDHPNRIVDLAA--EGVDPKEPLPESFDWREHGAVTKVKT 170

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
           +  C ACWAFS     E    L    L  LS Q+++DC    + GC+GG    L  + ++
Sbjct: 171 EGHCAACWAFSVTGNIEGQWFLAKKKLVSLSAQQLLDC-DVVDEGCNGG--FPLDAYKEI 227

Query: 216 NKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
            ++  LEPE +YP   K   C+     P+ + +       L   E  +   +   GP+  
Sbjct: 228 VRMGGLEPEDKYPYEAKAEQCR---LVPSDIAVYINGSVELPHDEEKMRAWLVKKGPISI 284

Query: 275 AVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
            +     Q+Y GGV +   C   L+++ H   +VGY
Sbjct: 285 GITVDDIQFYKGGVSRPTTC--RLSSMIHGALLVGY 318


>gi|2352469|gb|AAC00067.1| cysteine protease [Trypanosoma cruzi]
          Length = 471

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 82/309 (26%), Positives = 138/309 (44%), Gaps = 24/309 (7%)

Query: 5   KNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKSEHDIRFKNFEKS 63
           + VL    L+ +  L +P   +  + E+ L   F+ F+Q++ + Y  +   +    F ++
Sbjct: 6   RFVLLAAVLVVMACL-VPAATASLHAEETLTSQFAEFKQKHGRVYESAARRLPLSVFREN 64

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
           L +      +  +   A +G+T FSDL+ EEF++R+             H    H     
Sbjct: 65  LFLAR---LHAAANPHATFGVTPFSDLTREEFRSRY-------------HNGAAHFAAAQ 108

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
           ++  +   + +  G P   DWR  G +  V++Q  CG+CWAFS +   E    L    L+
Sbjct: 109 ERARVPVKVEV-VGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLT 167

Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKATSP 242
            LS Q ++ C    + GCSGG      +W+   N   +  E  YP    +       TS 
Sbjct: 168 NLSEQMLVSC-DKTDFGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSG 226

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINH 302
           + V         L   E+ I   +A +GPV  AV+A +W  Y GGV+  +C      ++H
Sbjct: 227 HTVGATITGHVELPQDEAQIAACVAVNGPVAVAVDASSWMTYTGGVMT-SCVSE--QLDH 283

Query: 303 AVQIVGYDN 311
            V +VGY++
Sbjct: 284 GVLLVGYND 292


>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 368

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 85/308 (27%), Positives = 134/308 (43%), Gaps = 21/308 (6%)

Query: 15  ALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-----EHDIRFKNFEKSLDIIEE 69
           A+ FL      S+P  E    +   F+ R+K  +S++     E   R + + +++  IE 
Sbjct: 17  AVFFLHGSSATSRPATEDADPMAQRFR-RWKAEHSRTYATPEEERHRLRVYARNMRYIEA 75

Query: 70  LNKNRQSPESARYGITEFSDLSEEEFKTRH------LRHSVNKHVLMSHHKHHDHHHNHV 123
            N +  +  +   G T ++DL+ +EF   +      L    +   +              
Sbjct: 76  TNGDAGAGLTYELGETAYTDLTSDEFTAMYTSRAPPLSDDDDDLPMTMITTRAGPVAAAG 135

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
               +   +    G P   DWRE G +  V+NQ  CG+CWAFSTV   E +H +K G L+
Sbjct: 136 GGGWLQVYVNESAGAPASVDWRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLA 195

Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
            LS QE++DC    + GC+GG     L W+  N  +   + +YP   KD  C  K  S +
Sbjct: 196 SLSEQELVDC-DKLDHGCNGGVSYRALQWITSNGGITS-QDDYPYTAKDDTCDTKKLSHH 253

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
              I  +       SE S+   +A   PV  ++ A    +Q+Y  GV    C      +N
Sbjct: 254 AASISGFQ-RVATRSELSLTNAVAMQ-PVAVSIEAGGANFQHYRNGVYNGPCG---TRLN 308

Query: 302 HAVQIVGY 309
           H V +VGY
Sbjct: 309 HGVTVVGY 316


>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
          Length = 334

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 95/289 (32%), Positives = 142/289 (49%), Gaps = 42/289 (14%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNK-NRQSPESARYGITEFSDLSEEE 94
           F +++ ++ +SY S SE D R + + ++ +I+   N    Q   + R G+T ++DL  EE
Sbjct: 26  FHAWKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLEHEE 85

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT--GIPVKKDWREAGIIGK 152
           FK      +V    L S         N  K R  ++ + +     +P   DWR+ G +  
Sbjct: 86  FK-----QTVFGVCLGSF--------NASKPRGGSSFLKMHRFYNLPQTIDWRQWGFVTP 132

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLD 211
           V+NQ +CG+CW+FS+    E  +  K G L  LS QE++DC+GN GN GC+GG       
Sbjct: 133 VKNQGSCGSCWSFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGG------- 185

Query: 212 WMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
           WMD      VNK  +  E  YP   +   C R      G     Y  D    +E ++   
Sbjct: 186 WMDNAFRYIVNKGGIHTEDSYPYEGQVGQC-RANYGEIGATCTGYY-DIPSGNEHALKEA 243

Query: 266 IATHGPVIAAVNA--LTWQYYLGGVIQYN---CDGSLANINHAVQIVGY 309
           +AT GPV  A++A   ++Q Y  GV  YN   C G+   ++HAV IVGY
Sbjct: 244 VATFGPVSVAIHASDQSFQLYHSGV--YNNPYCSGTA--LDHAVLIVGY 288


>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
          Length = 354

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 77/291 (26%), Positives = 143/291 (49%), Gaps = 20/291 (6%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNK-NRQSPESAR 81
           VK  +  +++  +L+  +++ + KSY+K E +   + F K++  I+E N+ +R   ++  
Sbjct: 33  VKSLRQKIDEAFKLWDDYKEAFGKSYNKDEENDYMEAFVKNVIHIDEHNQEHRLGRKTFE 92

Query: 82  YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
            G+   +DL   +++             ++ ++H  +  + ++             IP  
Sbjct: 93  MGLNSIADLPFSQYRK------------LNGYRHRRNFGDSMQSNGTKWLAPFNVEIPDS 140

Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMG 200
            DWR+ G++  V+NQ  CG+CWAFS     E  HA  +G +  LS Q ++DC+   GN G
Sbjct: 141 VDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHG 200

Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
           C+GG      +++  N  + + E  YP + ++  C  K     G + K +  D     E 
Sbjct: 201 CNGGLMDLAFEYIKDNHGI-DTEESYPYVGRETKCHFKKKDI-GAEDKGFV-DLPEGDEE 257

Query: 261 SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           ++   +AT GP+  A++A   T+Q Y  GV  Y+ + S   ++H V +VGY
Sbjct: 258 ALKVAVATQGPISIAIDAGHRTFQLYKKGVY-YDEECSSEELDHGVLLVGY 307


>gi|281207567|gb|EFA81750.1| cysteine protease 4 [Polysphondylium pallidum PN500]
          Length = 432

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 88/312 (28%), Positives = 142/312 (45%), Gaps = 24/312 (7%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNF 60
           M+ +   L    +  L  L+     ++    Q  + F S+ Q     Y   E + R+  F
Sbjct: 1   MYRLSAYLLACTVFMLAVLSANAAFTE---RQYQDSFVSWMQTNNVKYDGKEFNHRYGVF 57

Query: 61  EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
           +K++D +++ N       S   G+  F+DL+  E++  +L   ++   L+          
Sbjct: 58  KKNMDYVQQWNAK---GSSTVLGMNIFADLTNAEYQRIYLGTKIDASGLL---------- 104

Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
           N    R+      I    P   DWR  G +  ++NQ  CG+CW+FST  + E  H +  G
Sbjct: 105 NVAAARAFDRNFNIKALNPTV-DWRAKGAVTPIKNQAQCGSCWSFSTTGSVEGAHEISTG 163

Query: 181 TLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
            L  LS Q +IDC+   GN GC+GG   A ++++  N  + + ES YP         R  
Sbjct: 164 NLVALSEQNLIDCSVPEGNQGCNGGLMWAAMEYIIKNGGI-DTESSYPYTATGPNKCRYN 222

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
           ++ +G KI SY  +    SE+S L   A   PV  A++A   ++Q Y  G I Y    S 
Sbjct: 223 SANSGAKISSYV-NVTSGSETS-LASAANVNPVSVAIDASHNSFQLYSSG-IYYEPACST 279

Query: 298 ANINHAVQIVGY 309
             ++H V +VGY
Sbjct: 280 TQLDHGVLVVGY 291


>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
 gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
          Length = 467

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 77/262 (29%), Positives = 127/262 (48%), Gaps = 23/262 (8%)

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
            EHD RF+ F  +L  ++  N+ R      R G+ +F+DL+ +EF+  +L   +      
Sbjct: 72  GEHDSRFRVFWDNLRFVDAHNE-RAGEHGFRLGMNQFADLTNDEFRAAYLGARI-PAARS 129

Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
            +     + H+  ++            +P   DWRE G +  V+NQ  CG+CWAFS V +
Sbjct: 130 GNAVGEMYRHDGAEE------------LPESVDWREKGAVAPVKNQGQCGSCWAFSAVSS 177

Query: 171 AESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
            ES++ +  G +  LS QE+++C+ + GN GC+GG   A  +++ +    ++ E +YP  
Sbjct: 178 VESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFI-IKNGGIDTEDDYPYK 236

Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
             D  C     +   V I ++  D     E S+   +A H PV  A+ A    +Q Y  G
Sbjct: 237 AVDGKCDINRRNAKVVSIDAFE-DVPENDEKSLQKAVA-HQPVSVAIEAGGRQFQLYKSG 294

Query: 288 VIQYNCDGSLANINHAVQIVGY 309
           V   +C     N++H V  VGY
Sbjct: 295 VFSGSC---TTNLDHGVVAVGY 313


>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
 gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
 gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
 gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 91/301 (30%), Positives = 138/301 (45%), Gaps = 39/301 (12%)

Query: 21  IPVKVSKPNLEQKLELFSSFQQ---RYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQS 76
             ++V+   L+    ++   +Q    Y K Y    E + R K F+++++ IE  N N  +
Sbjct: 22  FAIQVTSRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASN-NAGN 80

Query: 77  PESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT 136
            +  + GI +F+DL+ EEF             + S +K   H  + + K S  T      
Sbjct: 81  NKLYKLGINQFADLTNEEF-------------IASRNKFKGHMCSSITKTS--TFKYENA 125

Query: 137 GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN 196
            +P   DWR+ G +  V+NQ  CG CWAFS V   E +H L  G L  LS QE++DC   
Sbjct: 126 SVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTK 185

Query: 197 G-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
           G + GC GG    L+D  D  K +     L  E++YP    D  C     S + V I  Y
Sbjct: 186 GVDQGCEGG----LMD--DAFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGY 239

Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
             D    +E ++   +A   P+  A++A    +Q+Y  GV   +C   L   +H V  VG
Sbjct: 240 E-DVPANNEQALQKAVANQ-PISVAIDASGSDFQFYKSGVFTGSCGTEL---DHGVTAVG 294

Query: 309 Y 309
           Y
Sbjct: 295 Y 295


>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
 gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 84/311 (27%), Positives = 144/311 (46%), Gaps = 33/311 (10%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDII 67
           FI+ L    +     ++ +P++  + E    + + + K Y+ + E + RF+ F+ +++ I
Sbjct: 13  FILILGMWAYEVASRELQEPSMSARHE---QWMETFGKVYADAAEKERRFEIFKDNVEYI 69

Query: 68  EELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
           E  N     P   +  + +F+DL+ EE K    R+   + +     K     + +V    
Sbjct: 70  ESFNTAGNKP--YKLSVNKFADLTNEELKV--ARNGYRRPLQTRPMKVTSFKYENV---- 121

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
                   T +P   DWR+ G +  +++Q  CG+CWAFSTV   E ++ L  G L  LS 
Sbjct: 122 --------TAVPATMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSE 173

Query: 188 QEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
           QE++DC   G + GC GG      +++  N  +   E+ YP    D  C  K  +    K
Sbjct: 174 QELVDCDTQGEDQGCEGGLMEDGFEFIIKNHGIT-TEANYPYQAADGTCNSKKEASRIAK 232

Query: 247 IKSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINH 302
           I  Y     +P  SE+++L  +A+  P+  +++A    +Q+Y  GV    C   L   +H
Sbjct: 233 ITGYES---VPANSEAALLKAVASQ-PISVSIDAGGSDFQFYSSGVFTGQCGTEL---DH 285

Query: 303 AVQIVGYDNYS 313
            V  VGY   S
Sbjct: 286 GVTAVGYGETS 296


>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
          Length = 354

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 77/291 (26%), Positives = 143/291 (49%), Gaps = 20/291 (6%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNK-NRQSPESAR 81
           VK  +  +++  +L+  +++ + KSY+K E +   + F K++  I+E N+ +R   ++  
Sbjct: 33  VKSLRQKIDEAFKLWDDYKESFGKSYNKDEENDYMEAFVKNVIHIDEHNQEHRLGRKTFE 92

Query: 82  YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
            G+   +DL   +++             ++ ++H  +  + ++             IP  
Sbjct: 93  MGLNSIADLPFSQYRK------------LNGYRHRRNFGDSMQSNGTKWLAPFNVEIPDS 140

Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMG 200
            DWR+ G++  V+NQ  CG+CWAFS     E  HA  +G +  LS Q ++DC+   GN G
Sbjct: 141 VDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHG 200

Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
           C+GG      +++  N  + + E  YP + ++  C  K     G + K +  D     E 
Sbjct: 201 CNGGLMDLAFEYIKDNHGI-DTEESYPYVGRETKCHFKKKDI-GAEDKGFV-DLPEGDEE 257

Query: 261 SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           ++   +AT GP+  A++A   T+Q Y  GV  Y+ + S   ++H V +VGY
Sbjct: 258 ALKVAVATQGPISIAIDAGHRTFQLYKKGVY-YDEECSSEELDHGVLLVGY 307


>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 325

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 91/308 (29%), Positives = 143/308 (46%), Gaps = 40/308 (12%)

Query: 10  IVALIALCFL---AIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
           +   +A+C     AIP+K      +   E + SF    KK +++ E D R   F +++  
Sbjct: 4   LSVFLAICLAVVSAIPLK------DPSWEAWKSFHG--KKYHNQGEDDFRHYVFLQNIKT 55

Query: 67  IEELNKNRQSPESARYGITEFSDLSEEEFKTRH--LRHSVNKHVLMSHHKHHDHHHNHVK 124
           I   N    +  + +  I EFSDL+ +EF   +   R S+ K                  
Sbjct: 56  IAAHN----AKSTFKMAINEFSDLTRKEFVKTYNGYRLSMKKST---------------- 95

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
            +  T    + T +P + DWR+ G +  ++NQ  CG+CWAFST  + E  H  K G L  
Sbjct: 96  NKPSTFMAPLNTNMPTEVDWRKEGYVTPIKNQGRCGSCWAFSTTGSLEGQHFRKTGKLVS 155

Query: 185 LSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           LS Q +IDC A  GN GC GG      +++ +N  + + E+ YP   +D  C+ K T  N
Sbjct: 156 LSEQNLIDCSAAEGNDGCGGGFMDDAFEYIKLNNGI-DTEASYPYEGRDDICRYKKT--N 212

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANIN 301
              I +   D    SE  +   +AT GP+  A++A   ++  Y  GV  +  + S   ++
Sbjct: 213 KGAIDTGYMDIKQYSEDDLKAAVATVGPISVAIDASHKSFHMYHTGVY-HEPECSQTVLD 271

Query: 302 HAVQIVGY 309
           H V +VGY
Sbjct: 272 HGVLVVGY 279


>gi|118373823|ref|XP_001020104.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89301871|gb|EAR99859.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 337

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 92/316 (29%), Positives = 146/316 (46%), Gaps = 31/316 (9%)

Query: 6   NVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLD 65
           N  FI+  IAL    +P+ +++    +KL  ++ +  +  +++  +E  +      + L 
Sbjct: 2   NSKFILLSIALL---MPLYLAQNVFIEKLIAYNKWSSKNLRTFLNNEEKLF-----RQLV 53

Query: 66  IIEELNK----NRQSPESARYGITEFSDLSEEEFKTRHLRHS--VNKHVLMSHHKHHDHH 119
             E L K    N Q   +    + +FSD++EEEF  + L  S  V+ H+     +   H+
Sbjct: 54  FFENLQKVNYHNAQDHHTYSLALNQFSDMTEEEFAEKILMQSDLVDLHI----QQTASHN 109

Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
                    T+  +      V  DWR  G +  V+ Q  C +CWAFS     ES + +KN
Sbjct: 110 STSSTTGGSTSSNSTSNNATVTVDWRSKGAVTPVKQQGYCSSCWAFSAAGLMESFNFIKN 169

Query: 180 GTLSLLSVQEVIDCAGNGN----MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
             L+  S Q+++DC  + N     GCSGG   + +D+    K  +     YP +     C
Sbjct: 170 KNLTDFSEQQLVDCVNSANGYSSKGCSGGWPASAIDYSS--KFGITTLQNYPYIGVQKKC 227

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CD 294
               T+ NG K KS+     IP+ S  L +   + PV  AV+A TW +Y  GV  YN C+
Sbjct: 228 NITGTN-NGFKPKSW---KQIPNTSKDLQNALNYSPVSIAVDASTWSHYKSGV--YNGCN 281

Query: 295 GSLANINHAVQIVGYD 310
            +   INH V  +GYD
Sbjct: 282 QTDIKINHGVLAIGYD 297


>gi|343474209|emb|CCD14094.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
          Length = 307

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/318 (27%), Positives = 153/318 (48%), Gaps = 32/318 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 67  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   + + TG  P   DWR+ G +  V++Q  C + WAF+ +   E    +  
Sbjct: 111 ALKRPRKV---VNVSTGKAPKTVDWRKKGAVTPVKDQGKCDSSWAFAAIGNIEGQWKIAG 167

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK-- 236
             L+ LS Q ++ C  N ++GC  G       W+   N   +  E  YP           
Sbjct: 168 HELTSLSEQMLVSCDTN-DLGCRAGFLDTAFKWIVSSNNGNVFTEQSYPYASGGGNVPTC 226

Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
            K+    G  I  +    ++ +E++I   +A  GPV  AV+A ++Q Y GGV+  +C   
Sbjct: 227 NKSGKVVGANIDDHV--HILDNENAIAEWLAKKGPVAIAVDATSFQSYTGGVLT-SCISK 283

Query: 297 LANINHAVQIVGYDNYSR 314
              +N A  +VGYD+ S+
Sbjct: 284 --EVNSAALLVGYDDTSK 299


>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 82/309 (26%), Positives = 145/309 (46%), Gaps = 37/309 (11%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEE 69
           V+L A   ++I V   + + E+   +++ +   +  +Y+   E + RF+ F  +L  I++
Sbjct: 18  VSLAAAADMSI-VSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQ 76

Query: 70  LNKNRQSP-ESARYGITEFSDLSEEEFKTRHLRHSV---NKHVLMSHHKHHDHHHNHVKK 125
            N    +   S R G+  F+DL+ EE+++ +L        +  L + ++  D+       
Sbjct: 77  HNAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAADNDE----- 131

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
                       +P   DWR+ G +G V++Q  CG+CWAFS +   E ++ +  G +  L
Sbjct: 132 ------------LPESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPL 179

Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSP 242
           S QE++DC  + N GC+GG    L+D+     +N   ++ E +YP   +D  C     + 
Sbjct: 180 SEQELVDCDTSYNQGCNGG----LMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNA 235

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
             V I  Y  D  + SE S+   +A   P+  A+ A    +Q Y  G+    C  +L   
Sbjct: 236 KVVTIDGYE-DVPVNSEKSLQKAVANQ-PISVAIEAGGRAFQLYKSGIFTGTCGTAL--- 290

Query: 301 NHAVQIVGY 309
           +H V  VGY
Sbjct: 291 DHGVAAVGY 299


>gi|121531598|gb|ABM55484.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
          Length = 326

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 91/311 (29%), Positives = 145/311 (46%), Gaps = 38/311 (12%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIE 68
            +A+ A   +A+    ++       + + +F+Q + K+Y    E   RF  F+++L  I+
Sbjct: 3   FLAIFATVLIAVTASTNE-------DQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIK 55

Query: 69  ELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
           E N +  +  E+   G+T F+DL+ EEFK           +L    K+        K R 
Sbjct: 56  EHNARYDKGEETYLLGVTRFADLTHEEFK----------DILKGQIKN--------KPRL 97

Query: 128 ITTGITIPTG--IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
             T    P    +P   DW E G + +V++Q  CG+CWAFS     +  +A+ N     L
Sbjct: 98  NATPTVFPEDLEVPDSIDWTEKGAVLEVKDQNPCGSCWAFSATGALKGQNAILNNVKISL 157

Query: 186 SVQEVIDC-AGNGNMGC-SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           S Q+++DC A  GN  C  GGD  A  D+  V    ++ E  YP + K   C+  A S  
Sbjct: 158 SEQQLLDCSAAYGNGNCKEGGDMSAAFDY--VRDYGIQSEKSYPYIRKQTECQYDA-SKT 214

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHA 303
            +KIK Y    +  SE  +   + T GP+  A+N+   Q Y  G I  +  G   +++H 
Sbjct: 215 ILKIKGYK--NVTTSEEGLRKAVGTIGPISIAMNSDPLQLYYSGTI--SGKGCSHDLDHG 270

Query: 304 VQIVGYDNYSR 314
           V +VGY   S+
Sbjct: 271 VLVVGYGKASQ 281


>gi|52345644|ref|NP_001004869.1| cathepsin L2 precursor [Xenopus (Silurana) tropicalis]
 gi|49522051|gb|AAH74718.1| MGC69486 protein [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 89/310 (28%), Positives = 142/310 (45%), Gaps = 32/310 (10%)

Query: 7   VLFIVALIALCFLAI-PVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLD 65
           +   + + A+C   +     + P L+    L+ ++   +KKSY+  E   R   +EK+L 
Sbjct: 1   MALYLGIAAICLTTVFAAPTTDPALDNHWNLWKNW---HKKSYAPKEEGWRRVLWEKNLR 57

Query: 66  IIEELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
           +IE  N ++     S   G+ +F D++ EEF+            LM+ +K      N  K
Sbjct: 58  MIEFHNLEHSLGKHSHSLGMNQFGDMTNEEFRQ-----------LMNGYK------NQKK 100

Query: 125 KRSITTGITIPTGI--PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
            R  T     P     P   DWR+ G +  V++Q  CG+CWAFST    E  H    G +
Sbjct: 101 IRGST--FLAPNNFESPKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKM 158

Query: 183 SLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATS 241
             LS Q ++DC+   GN GC+GG       ++  N  + + E  YP   KD        +
Sbjct: 159 ISLSEQNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGI-DSEDSYPYTAKDDQECHYDPN 217

Query: 242 PNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
            N      +  D    SE  ++  +A+ GPV  AV+A   ++Q+Y  G I Y  + S  +
Sbjct: 218 YNSANDTGFV-DVTSGSEKDLMNAVASVGPVSVAVDAGHQSFQFYKSG-IYYEPECSSED 275

Query: 300 INHAVQIVGY 309
           ++H V +VGY
Sbjct: 276 LDHGVLVVGY 285


>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
          Length = 458

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 81/291 (27%), Positives = 135/291 (46%), Gaps = 24/291 (8%)

Query: 23  VKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSP-ESA 80
           V   + + E+   L++ ++  + K+Y+   E + R+  F  +L  I+E N    +   S 
Sbjct: 26  VSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSF 85

Query: 81  RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
           R G+  F+DL+ EE++  +L             ++       V  R +         +P 
Sbjct: 86  RLGLNRFADLTNEEYRDTYL-----------GLRNKPRRERKVSDRYLAAD---NEALPE 131

Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
             DWR  G + ++++Q  CG+CWAFS +   E ++ +  G L  LS QE++DC  + N G
Sbjct: 132 SVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEG 191

Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
           C+GG      D++ +N   ++ E +YP   KD  C     +   V I SY  D    SE+
Sbjct: 192 CNGGLMDYAFDFI-INNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYE-DVTPNSET 249

Query: 261 SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           S+   +A   PV  A+ A    +Q Y  G+    C  +L   +H V  VGY
Sbjct: 250 SLQKAVANQ-PVSVAIEAGGRAFQLYSSGIFTGKCGTAL---DHGVAAVGY 296


>gi|94420703|gb|ABF18679.1| cysteine protease [Medicago sativa]
          Length = 350

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 129/278 (46%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y    E   RFK F ++L +IE  NK R        G+  F+D + EEF
Sbjct: 51  FARFANRYGKRYDTVDEMKRRFKIFSENLQLIESTNKKRLG---YTLGVNHFADWTWEEF 107

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           ++  L  + N    L  +H+  D                    +P +KDWR+ GI+ +V+
Sbjct: 108 RSHRLGAAQNCSATLKGNHRITD------------------VVLPAEKDWRKEGIVSEVK 149

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST    ES +A   G    LS Q+++DCAG   N GC+GG      +++
Sbjct: 150 DQGHCGSCWTFSTTGALESAYAQAFGKNISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYI 209

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   LE E  YP   ++  C  K TS +       + +  + +E  +   +A   PV 
Sbjct: 210 KYNG-GLETEEAYPYTGQNGPC--KFTSEDVAVQVLGSVNITLGAEDELKHAVAFARPVS 266

Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
            A   +  ++ Y  GV     C  +  ++NHAV  VGY
Sbjct: 267 VAFEVVDDFRLYKKGVYTSTTCGNTPMDVNHAVLAVGY 304


>gi|118365742|ref|XP_001016091.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297858|gb|EAR95846.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 335

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 152/313 (48%), Gaps = 32/313 (10%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDI 66
           +L I+ L+ LC LA  + V      +KL  ++ +Q ++++ Y  +EH+  ++     +  
Sbjct: 6   LLSIIMLMPLC-LAQNITV------EKLLAYNQWQSQHQRIY-LNEHEKLYR----QMVF 53

Query: 67  IEELNK--NRQSPESARYGI--TEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHN 121
            E+L K     +  +  Y I   +FSD+++EEF  + L +  +  H++ +  +   H+  
Sbjct: 54  FEKLQKINEHNNNSNNTYSIHLNQFSDMTKEEFTQKILMKQDLADHLMKAGSQEATHNDV 113

Query: 122 HVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT 181
           +++ +  +   T+ T I    DWR  G +  V+NQ  CG+CW+FS     ES + ++N  
Sbjct: 114 NIEAKLNSKNSTLATSI----DWRTKGAVTSVKNQGNCGSCWSFSATGLMESFNFIQNKA 169

Query: 182 LSLLSVQEVIDCAGNGN----MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
           L   S Q+++DC    N     GC GG     +D+   +KV L    +YP +     C  
Sbjct: 170 LVEFSEQQLLDCVTPANGYRIHGCDGGWPAYCVDY--ASKVGLTTLKKYPYVGVQNNCNV 227

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
             T+ NG K K +     +P+ S+ L       PV   V+A  W  Y  G+    CD SL
Sbjct: 228 TGTN-NGFKPKKW---NQVPNTSNDLKTALNFSPVSVLVDANNWDGYQSGIFN-GCDQSL 282

Query: 298 ANINHAVQIVGYD 310
             +NHAV  VGYD
Sbjct: 283 IILNHAVLAVGYD 295


>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
 gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
          Length = 345

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 87/313 (27%), Positives = 146/313 (46%), Gaps = 28/313 (8%)

Query: 2   FDVKNVLFIVALIALCFLAIPVKVSKPNLEQ--KLELFSSFQQRYKKSYSKS-EHDIRFK 58
           F +KN+  ++ L ++  L  P  V+  NL++   LE   ++   + + Y    E + RFK
Sbjct: 5   FFLKNITVVLLLFSILSL-YPFIVTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHRFK 63

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
            F+++++ IE  NKN    +  +  + +++DL+ EEF T  +    +   L+S  +    
Sbjct: 64  TFKENVEFIESFNKN--GTQRYKLAVNKYADLTTEEFTTSFMGLDTS---LLSQQES-TA 117

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
                K  S+T        +P   DWR+ G +  V++Q  CG CWAFS     E  + + 
Sbjct: 118 TTTSFKYDSVTE-------VPNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIA 170

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAACKR 237
           N  L  LS Q+++DC+   N GC GG      D+ +  N   +  E+ YP       CK 
Sbjct: 171 NNELISLSEQQLLDCSTQ-NKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVCKT 229

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGS 296
           +   P  V I  Y    ++PS+ S L     + P+   + A   +  Y  G+   +C+  
Sbjct: 230 E--QPAAVTINGY---EVVPSDESSLLKAVVNQPISVGIAANDEFHMYGSGIYDGSCNSR 284

Query: 297 LANINHAVQIVGY 309
           L   NHAV ++GY
Sbjct: 285 L---NHAVTVIGY 294


>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
          Length = 358

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/331 (25%), Positives = 151/331 (45%), Gaps = 50/331 (15%)

Query: 3   DVKNVLFIVALIALCFLAI-----PVKVSKPNLEQKLELF-SSFQQRYK------KSYSK 50
           +V   L I+  + + + A      P++    ++E++ E +     +RYK      + +  
Sbjct: 9   NVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRHFGI 68

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
            + ++RF N+  + +    L  N            +F+D++ EE+K  ++        L 
Sbjct: 69  YQSNVRFINYINAQNFSFTLTDN------------QFADMTNEEYKALYMG-------LG 109

Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
           +      +  +  ++RS          +P+  DWR+ G +  VRNQ  CG+CWAFSTV  
Sbjct: 110 TSETSRKNQSSFKRERSKV--------LPISVDWRKMGAVTPVRNQGECGSCWAFSTVAA 161

Query: 171 AESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
            E ++ ++ G L  LS QE++DC   +GN GC+GG       ++  N  +    + YP +
Sbjct: 162 VEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARN-YPYI 220

Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
            +   C +   + + VKI  Y  +T+ P+   IL       PV  A++A    +Q Y  G
Sbjct: 221 GEQGICNKDKAANHVVKISGY--ETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKG 278

Query: 288 VIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
           +    C   L   NHAV ++GY  DN  + W
Sbjct: 279 IFNGFCGKQL---NHAVTVIGYGEDNGKKYW 306


>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 467

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 83/287 (28%), Positives = 143/287 (49%), Gaps = 34/287 (11%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           E  + ++ ++  ++ KSY+   E + RF+ F+ +L  I+E N   ++    + G+  F+D
Sbjct: 45  EDVMAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRT---YKVGLNRFAD 101

Query: 90  LSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           L+ EE+++ +L  R +  +    S +K  D +   V       G ++P  +    DWR+ 
Sbjct: 102 LTNEEYRSMYLGTRTAAKRR---SSNKISDRYAFRV-------GDSLPESV----DWRKK 147

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G + +V++Q +CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG   
Sbjct: 148 GAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGG--- 204

Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
            L+D+     +N   ++ E +YP    D  C +   +   V I  Y  D     E S+  
Sbjct: 205 -LMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAKVVTIDGYE-DVPENDEKSLEK 262

Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +A   PV  A+ A    +Q Y  G+    C  +L   +H V  VGY
Sbjct: 263 AVANQ-PVSVAIEAGGREFQLYQSGIFTGRCGTAL---DHGVTAVGY 305


>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
          Length = 328

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 91/303 (30%), Positives = 145/303 (47%), Gaps = 30/303 (9%)

Query: 14  IALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELN 71
           +A+CFLA    +S   L     E + +F+ ++ KSY  S E   R   ++++   I+E N
Sbjct: 3   VAICFLAF-FAISHTALHDYFPEEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHN 61

Query: 72  KNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           K  ++ E S +  +  F DL + EFK             ++  K      N  +    T 
Sbjct: 62  KRYENGEVSYKLKMNHFGDLMQHEFKA------------LNKLKRSAKQQNSGEVFRATG 109

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
           G      +P K DWR+ G +  V++   CG+CWAFS+  +      LKN  L  LS Q++
Sbjct: 110 GK-----LPAKVDWRQKGAVTPVKDPGQCGSCWAFSSTGSLGGQLFLKNKKLVSLSEQQL 164

Query: 191 IDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
           +DC+GN GN GC GG       ++  N  + + E  YP   +D  C+ K  S  G   K 
Sbjct: 165 VDCSGNYGNDGCDGGIMVQAFQYIKGNGGI-DTEGSYPYEAEDDKCRYKTKSVAGTD-KG 222

Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINHAVQI 306
           Y  D     E+++   +A  GP+  A++A  L++Q+Y  G+  +  C  S   ++H V +
Sbjct: 223 YV-DIAQGDENALKEAVAEIGPISVAIDAGNLSFQFYSEGIYDEPFC--SNTELDHGVLV 279

Query: 307 VGY 309
           VGY
Sbjct: 280 VGY 282


>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
 gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 85/307 (27%), Positives = 139/307 (45%), Gaps = 31/307 (10%)

Query: 8   LFIVALIALCFLAIPVKVSKPNLEQK--LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSL 64
           +F+  L+ L   A  +   +P  EQ+  L+    +  ++ + Y    E + R+  F++++
Sbjct: 10  IFLPFLLILAAWATKI-ACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENI 68

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
           + IE  N    S    + G+ +F+DL+ EEF+  +  +      LMS    +++  +   
Sbjct: 69  ERIEAFNNG--SDRGYKLGVNKFADLTNEEFRAMYHGYKRQSSKLMSSSFRYENLSD--- 123

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
                        IP   DWR  G +  V++Q TCG CWAFSTV   E +  L+ G L  
Sbjct: 124 -------------IPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLIS 170

Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           LS Q+++DC   GN GC GG       ++ +    L  E  YP    D  C  +  +   
Sbjct: 171 LSEQQLVDCTA-GNKGCQGGLMDTAFQYI-IRNGGLTSEDNYPYQGVDGTCSSEKAASTE 228

Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINH 302
            +I  Y  D    +E+++L  +A   PV   V+     +Q+Y  GV   +C       NH
Sbjct: 229 AQITGYE-DVPQNNENALLQAVAKQ-PVSVGVDGGGNDFQFYKSGVFNGDCG---TQQNH 283

Query: 303 AVQIVGY 309
           AV  +GY
Sbjct: 284 AVTAIGY 290


>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
          Length = 335

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 77/247 (31%), Positives = 123/247 (49%), Gaps = 20/247 (8%)

Query: 75  QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI 134
           Q  +S R G+T+F+D+  EE+K+           L+S       + +  ++ S    +  
Sbjct: 67  QGIKSYRLGMTQFADMDNEEYKS-----------LISLGCLRAFNTSAPRRGSAFFRLAE 115

Query: 135 PTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA 194
            T +P   DWR+ G +  V++Q+ CG+CWAFS   + E  +  K G L  LS Q+++DC+
Sbjct: 116 GTHLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGSLEGQNFRKTGKLVSLSEQQLVDCS 175

Query: 195 GN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
           G+ GNMGC+GG       ++  N  + + E  YP   +D  C+ K  +  G K   Y  D
Sbjct: 176 GDYGNMGCNGGLMDYAFKYIQENGGI-DTEKSYPYEAEDGQCRFKPENV-GAKCTGYV-D 232

Query: 254 TLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY-- 309
             +  E ++   +AT GPV   ++A   ++Q Y  GV     D S  +++H V  VGY  
Sbjct: 233 VTVGDEDALKEAVATIGPVSVGIDASHSSFQLYDSGVYDEQ-DCSSQDLDHGVLAVGYGT 291

Query: 310 DNYSRTW 316
           DN    W
Sbjct: 292 DNGQDYW 298


>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
          Length = 340

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 93/325 (28%), Positives = 146/325 (44%), Gaps = 44/325 (13%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLELFSSFQ--------QRYKKSYSKSEHDI-----RF 57
           +  ++ CF  + V V+   L  +L    S Q        + +  SY +   DI     R+
Sbjct: 1   MGFVSQCFCLV-VMVTLGALASQLAAARSLQDASMRERHEEWMASYGRVYKDINEKQKRY 59

Query: 58  KNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHD 117
           K FE+++ +IE  NK+   P   +  + +F+DL+ EEFK    R+    H+  +  K   
Sbjct: 60  KIFEENVALIESSNKDANKP--YKLSVNQFADLTNEEFKAS--RNRFKGHICST--KSTS 113

Query: 118 HHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
             + +V            + +P   DWR  G +  V++Q  CG CWAFS V   E +  L
Sbjct: 114 FKYGNV------------SAVPSAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKL 161

Query: 178 KNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
             G L  LS QE++DC  +G + GC GG       ++  N   L  E+ YP    D  C 
Sbjct: 162 TTGELISLSEQELVDCDTSGVDQGCEGGLMDNAFTFIQHNH-GLASEANYPYKGVDGTCN 220

Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCD 294
               + +  +I  +  D    SE ++L  +A H PV  A++A    +Q+Y  GV    C 
Sbjct: 221 TNKQAIHAAEINGFE-DVPANSEEALLNAVA-HQPVSVAIDAGGSGFQFYSKGVFIGACG 278

Query: 295 GSLANINHAVQIVGY---DNYSRTW 316
             L   +H V  VGY   D+ ++ W
Sbjct: 279 TQL---DHGVTAVGYGTSDDGTKYW 300


>gi|74200292|dbj|BAE22939.1| unnamed protein product [Mus musculus]
          Length = 308

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 84/275 (30%), Positives = 131/275 (47%), Gaps = 28/275 (10%)

Query: 40  FQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RYGITEFSDLSEEEFKTR 98
           ++  +++ Y  +E + R   +EK++ +I+  N    + +      +  F D++ EEF   
Sbjct: 6   WKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEF--- 62

Query: 99  HLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQT 158
             R  VN +           H  H K R     + +   IP   DWRE G +  V+NQ  
Sbjct: 63  --RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSVDWREKGCVTPVKNQGQ 108

Query: 159 CGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNK 217
           CG+CWAFS     E    LK G L  LS Q ++DC+   GN GC+GG       ++  N 
Sbjct: 109 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENG 168

Query: 218 VVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SESSILTDIATHGPVIAAV 276
             L+ E  YP   KD +CK +A       + + T    IP  E +++  +AT GP+  A+
Sbjct: 169 -GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEKALMKAVATVGPISVAM 223

Query: 277 NAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +A   + Q+Y  G I Y  + S  N++H V +VGY
Sbjct: 224 DASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 257


>gi|375073976|gb|AFA34855.1| cathepsin L-like protein [Trypanosoma cruzi]
          Length = 467

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 84/314 (26%), Positives = 140/314 (44%), Gaps = 24/314 (7%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
           M      L + A++ +    +P   +  + E+ L   F+ F+Q++ + Y S +E   R  
Sbjct: 1   MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYGSAAEEAFRLS 60

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
            F ++L  +  L+    +   A +G+T FSDL+ EEF++R+             H    H
Sbjct: 61  VFRENL-FLARLHA--AANPHATFGVTPFSDLTREEFRSRY-------------HNGAAH 104

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
                ++  +   + +  G P   DWR  G +  V++Q  CG+CWAFS +   E    L 
Sbjct: 105 FAAAQERARVPVNVEV-VGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLA 163

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR 237
              L+ LS Q ++ C    + GC GG      +W+   N   +  E  YP    +     
Sbjct: 164 GHPLTNLSEQMLVSC-DKTDSGCGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPP 222

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
             TS + V         L   E+ I   +A +GPV  AV+A +W  Y GGV+  +C    
Sbjct: 223 CTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT-SCVSE- 280

Query: 298 ANINHAVQIVGYDN 311
             ++H V +VGY++
Sbjct: 281 -QLDHGVLLVGYND 293


>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
          Length = 319

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 84/288 (29%), Positives = 132/288 (45%), Gaps = 24/288 (8%)

Query: 26  SKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQ-SPESARYGI 84
           + P L+   +L+ S+   + K Y + E   R   +EK+L +IE  N +      S + G+
Sbjct: 2   ADPELDGHWQLWKSW---HNKDYHEREESWRRVVWEKNLKMIELHNLDHTLGKHSYKLGM 58

Query: 85  TEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDW 144
            +F D++ EEF+            LM+ + H      +   + +          P   DW
Sbjct: 59  NQFGDMTTEEFRQ-----------LMNGYAHKKSERKYRGSQFLEPSFLE---APRSVDW 104

Query: 145 REAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSG 203
           RE G +  V++Q  CG+CWAFST    E  H  K G L  LS Q ++DC+   GN GC+G
Sbjct: 105 REKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNG 164

Query: 204 GDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSIL 263
           G       ++  N  + + E  YP   KD    R     N      +  D     E +++
Sbjct: 165 GLMDQAFQYVQDNGGI-DSEESYPYTAKDDEDCRYKAEYNAANDTGFV-DIPQGHERALM 222

Query: 264 TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             +A  GPV  A++A   ++Q+Y  G I Y  D S  +++H V +VGY
Sbjct: 223 KAVAAVGPVSVAIDAGHSSFQFYQSG-IYYEPDCSSEDLDHGVLVVGY 269


>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 136/279 (48%), Gaps = 23/279 (8%)

Query: 34  LELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           +  + S+  ++ KSY+   E + RF+ F+ +   I+E N  +    S + G+  F+DL+ 
Sbjct: 41  MAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKD--RSFKLGLNRFADLTN 98

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           EE+++        K+  +             ++ +   G ++P  +    DWRE G +  
Sbjct: 99  EEYRS--------KYTGIRTKDSRKKVSGKSQRYASLAGESLPESV----DWREHGAVAS 146

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V++Q  CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG       +
Sbjct: 147 VKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQF 206

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           + +N   ++ +++YP   +D  C +   +   V I SY  + +   +   L   A + P+
Sbjct: 207 I-INNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSY--EDVPEYDEKALQKAAANQPI 263

Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A+ A    +Q+Y  G+    C     +++H V +VGY
Sbjct: 264 SVAIEASGRDFQFYDSGIFTGKCG---TDLDHGVVVVGY 299


>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
          Length = 340

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 89/308 (28%), Positives = 146/308 (47%), Gaps = 30/308 (9%)

Query: 7   VLFIVALIALCFL-AIPVKVSKPN-LEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKS 63
           V  +   + LC + A P   S+    +  ++ F  +   Y + Y  ++  +R F+ F+ +
Sbjct: 5   VQLVFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNN 64

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
           ++ IE  N NR    S   GI +F+D++  EF T++   S+  +         D  +   
Sbjct: 65  VNHIETFN-NRNG-NSYTLGINKFTDMTNNEFVTQYTGVSLPLNFKREPVVSFDDVNISA 122

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
             +SI              DWR+ G + +V++Q  CG+CWAFS + T E ++ +  G L 
Sbjct: 123 VGQSI--------------DWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLV 168

Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
            LS QEV+DCA +   GC GG      D++  N  V   E++YP    +  C   +  PN
Sbjct: 169 SLSEQEVLDCAVSN--GCDGGFVDNAYDFIISNNGVAS-EADYPYQAYEGDCTANSW-PN 224

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANIN 301
              I  Y+   +  ++ S +     + P+ AA++A    +QYY GGV    C  SL   N
Sbjct: 225 SAYITGYS--YVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSL---N 279

Query: 302 HAVQIVGY 309
           HA+ I+GY
Sbjct: 280 HAITIIGY 287


>gi|6978723|ref|NP_037288.1| cathepsin L1 preproprotein [Rattus norvegicus]
 gi|55888|emb|CAA68691.1| prepro-cathepsin L [Rattus norvegicus]
          Length = 334

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 87/294 (29%), Positives = 139/294 (47%), Gaps = 35/294 (11%)

Query: 25  VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
           ++ P  +Q     +  ++  +++ Y  +E + R   +EK++ +I+  N       + ++G
Sbjct: 16  LATPKFDQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEY---SNGKHG 72

Query: 84  IT----EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
            T     F D++ EEF     R  VN +           H  H K R     + +   IP
Sbjct: 73  FTMEMNAFGDMTNEEF-----RQIVNGY----------RHQKHKKGRLFQEPLMLQ--IP 115

Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGN 198
              DWRE G +  V+NQ  CG+CWAFS     E    LK G L  LS Q ++DC+   GN
Sbjct: 116 KTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGN 175

Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP- 257
            GC+GG       ++  N   L+ E  YP   KD +CK +A       + + T    IP 
Sbjct: 176 QGCNGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----YAVANDTGFVDIPQ 230

Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            E +++  +AT GP+  A++A   + Q+Y  G I Y  + S  +++H V +VGY
Sbjct: 231 QEKALMKPVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKDLDHGVLVVGY 283


>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
 gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
 gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
 gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
 gi|1096153|prf||2111244A Cys protease
          Length = 380

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 89/304 (29%), Positives = 140/304 (46%), Gaps = 38/304 (12%)

Query: 19  LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPE 78
           +A  +K+    L +  + F  F + Y +SYS  E  +R      + +++         P 
Sbjct: 36  IARKLKLGDNELLRTEKKFKVFMENYGRSYSTEEEYLRRLGI-FAQNMVRAAEHQALDP- 93

Query: 79  SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP--- 135
           +A +G+T+FSDL+E+EF+   L   VN     S++                 GI  P   
Sbjct: 94  TAVHGVTQFSDLTEDEFE--KLYTGVNGGFPSSNNA--------------AGGIAPPLEV 137

Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
            G+P   DWRE G + +V+ Q  CG+CWAFST  + E  + L  G L  LS Q+++DC  
Sbjct: 138 DGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDN 197

Query: 196 NGNM--------GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
             ++        GC+GG      +++ +    LE ES YP   +   CK     P  + +
Sbjct: 198 KCDITEKTSCDNGCNGGLMTNAYNYL-LESGGLEEESSYPYTGERGECK---FDPEKIAV 253

Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCD--GSLANINHAVQ 305
           K      +   E+ I   +  +GP+   VNA+  Q Y+GGV   +C    S   +NH V 
Sbjct: 254 KITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQTYIGGV---SCPLICSKKRLNHGVL 310

Query: 306 IVGY 309
           +VGY
Sbjct: 311 LVGY 314


>gi|13124011|sp|Q9YWK4.1|CATV_NPVBS RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|3882976|gb|AAC77812.1| cathepsin [Buzura suppressaria NPV]
          Length = 331

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 90/278 (32%), Positives = 137/278 (49%), Gaps = 28/278 (10%)

Query: 35  ELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           + F +F   Y K Y+  SE + RF  F+++L   EE+N   +  +SA Y I +F+DLS+ 
Sbjct: 29  DYFETFLANYNKMYNDTSEKERRFSIFQQTL---EEINYKNRLNDSAVYQINKFADLSKN 85

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI-PVKKDWREAGIIGK 152
           E  +++    +N  V  +         N  K    T  I  P G  P+  DWR+   +  
Sbjct: 86  EIISKYT--GLNMPVQTT---------NFCK----TIVIDQPPGKGPLNFDWRQQNKVTS 130

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           ++NQ+ CGACWAF+T+ + ES +A+KN     LS Q++IDC    +MGC GG      + 
Sbjct: 131 IKNQKACGACWAFATLASIESQYAIKNNVHIDLSEQQMIDC-DYVDMGCDGGLLHTAFEQ 189

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH-GP 271
           M +    L  E EYP    +  C+ +      VK+K   C   +      L D+    GP
Sbjct: 190 M-IQMGELVQEHEYPYAGVNKPCELRGDETGVVKVKG--CYRYVVFREEKLKDLLRAVGP 246

Query: 272 VIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +  A++A     Y  G+I Y C+     +NHAV +VGY
Sbjct: 247 IPMAIDASGIVNYHHGIIHY-CENY--GLNHAVLLVGY 281


>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
 gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
          Length = 414

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 79/279 (28%), Positives = 137/279 (49%), Gaps = 23/279 (8%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARY-GITEFSDLSE 92
           +LF  + Q++ K+Y S+ E ++R K F  + + +++ N   ++ E   + G+   +DL++
Sbjct: 66  DLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLADLTK 125

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
           +EFK            ++ ++         V   +       P   P + DW  +G +  
Sbjct: 126 DEFKK-----------MLGYNAALRASRAPVDASTWEYADVTP---PEEIDWVASGAVTP 171

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
           V+NQ+ CG+CWAFST    E ++A+K G L  LS +E+I C+ NGNMGC+GG      +W
Sbjct: 172 VKNQKQCGSCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEW 231

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           + VN   ++ E  +  + K+  C         V I  +  D     E S++  ++   PV
Sbjct: 232 I-VNNRGIDTEDGWEYVAKEEKCGFFRRHHRAVAIDGFK-DVPSNDEDSLMKAVSQQ-PV 288

Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A+ A   ++Q Y GGV  Y+       ++H V +VGY
Sbjct: 289 SVAIEADHQSFQLYAGGV--YSAKDCGTELDHGVLLVGY 325


>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
          Length = 469

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 83/287 (28%), Positives = 143/287 (49%), Gaps = 34/287 (11%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           E  + ++ ++  ++ KSY+   E + RF+ F+ +L  I+E N   ++    + G+  F+D
Sbjct: 47  EDVMAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRT---YKVGLNRFAD 103

Query: 90  LSEEEFKTRHL--RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           L+ EE+++ +L  R +  +    S +K  D +   V       G ++P  +    DWR+ 
Sbjct: 104 LTNEEYRSMYLGTRTAAKRR---SSNKISDRYAFRV-------GDSLPESV----DWRKK 149

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G + +V++Q +CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG   
Sbjct: 150 GAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGG--- 206

Query: 208 ALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
            L+D+     +N   ++ E +YP    D  C +   +   V I  Y  D     E S+  
Sbjct: 207 -LMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAXVVTIDGYE-DVPENDEKSLEK 264

Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            +A   PV  A+ A    +Q Y  G+    C  +L   +H V  VGY
Sbjct: 265 AVANQ-PVSVAIEAGGREFQLYQSGIFTGRCGTAL---DHGVTAVGY 307


>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
 gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 334

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 77/241 (31%), Positives = 121/241 (50%), Gaps = 24/241 (9%)

Query: 75  QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI 134
           Q  +S R G+T F+D+  EE+K       +++  L        H  N    R  +T   +
Sbjct: 66  QGLKSYRLGMTYFADMENEEYK-----RVISQGCL--------HSFNASLPRRGSTFFRL 112

Query: 135 PTG--IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVID 192
           P G  +P   DWR+ G +  V++Q+ CG+CWAFS   + E  H  K GTL  LS Q+++D
Sbjct: 113 PEGTDLPDAVDWRDKGYVTDVKDQKQCGSCWAFSATGSLEGQHFRKTGTLVSLSEQQLVD 172

Query: 193 CAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
           C+G+ GNMGC GG       ++  N  + + E  YP   ++  C+    +  G     YT
Sbjct: 173 CSGDYGNMGCMGGLMDYAFQYIQANGGI-DTEESYPYEAENGKCRYNPDNI-GATSTGYT 230

Query: 252 CDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYN-CDGSLANINHAVQIVG 308
            +     E ++   +AT GP+   ++A  +++Q+Y  GV  YN  D S   ++H V  VG
Sbjct: 231 -EVSQGDEDALKEAVATIGPISVGIDASQMSFQFYESGV--YNEPDCSSLELDHGVLAVG 287

Query: 309 Y 309
           Y
Sbjct: 288 Y 288


>gi|401419663|ref|XP_003874321.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|1706259|sp|P35591.2|CYSP1_LEIPI RecName: Full=Cysteine proteinase 1; AltName: Full=Amastigote
           cysteine proteinase A-1; Flags: Precursor
 gi|1220383|gb|AAA91859.1| cysteine proteinase [Leishmania pifanoi]
 gi|322490556|emb|CBZ25817.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 354

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 148/316 (46%), Gaps = 32/316 (10%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSYS-KSEHDIRFKNFEKS 63
            + +  L  +C+ +  +  + P ++  +    + SF++R+ K++   +E   RF  F+++
Sbjct: 10  AIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAFKQN 69

Query: 64  LDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           +     LN   Q+P  A Y ++ +F+DL+ +EF   +L    N      H K H      
Sbjct: 70  MQTAYFLNT--QNPH-AHYDVSGKFADLTPQEFAKLYL----NPDYYARHLKDH------ 116

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
             K  +    + P+G+ +  DWR+ G +  V+NQ  CG+CWAFS +   E   A    +L
Sbjct: 117 --KEDVHVDDSAPSGV-MSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSL 173

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAA---CKRK 238
             LS Q ++ C  N + GC+GG     ++W M  +   +  E+ YP          C  +
Sbjct: 174 VSLSEQMLVSC-DNIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCHDE 232

Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
                G KI  +   +L   E  I   +   GPV  AV+A TWQ Y GGV+      SL 
Sbjct: 233 GEV--GAKITGFL--SLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSLCLAWSL- 287

Query: 299 NINHAVQIVGYDNYSR 314
             NH V IVG++  ++
Sbjct: 288 --NHGVLIVGFNKNAK 301


>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 436

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 82/309 (26%), Positives = 145/309 (46%), Gaps = 37/309 (11%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEE 69
           V+L A   ++I V   + + E+   +++ +   +  +Y+   E + RF+ F  +L  I++
Sbjct: 18  VSLAAAADMSI-VSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQ 76

Query: 70  LNKNRQSP-ESARYGITEFSDLSEEEFKTRHLRHSV---NKHVLMSHHKHHDHHHNHVKK 125
            N    +   S R G+  F+DL+ EE+++ +L        +  L + ++  D+       
Sbjct: 77  HNAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAADNDE----- 131

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
                       +P   DWR+ G +G V++Q  CG+CWAFS +   E ++ +  G +  L
Sbjct: 132 ------------LPESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPL 179

Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDW---MDVNKVVLEPESEYPLLLKDAACKRKATSP 242
           S QE++DC  + N GC+GG    L+D+     +N   ++ E +YP   +D  C     + 
Sbjct: 180 SEQELVDCDTSYNQGCNGG----LMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNA 235

Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
             V I  Y  D  + SE S+   +A   P+  A+ A    +Q Y  G+    C  +L   
Sbjct: 236 KVVTIDGYE-DVPVNSEKSLQKAVANQ-PISVAIEAGGRAFQLYKSGIFTGTCGTAL--- 290

Query: 301 NHAVQIVGY 309
           +H V  VGY
Sbjct: 291 DHGVAAVGY 299


>gi|443685370|gb|ELT89004.1| hypothetical protein CAPTEDRAFT_95613, partial [Capitella teleta]
          Length = 295

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 83/257 (32%), Positives = 127/257 (49%), Gaps = 22/257 (8%)

Query: 57  FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
           F+N  K + +   L++  +SP +   GI +FSD+ E+EF T      +N   + +  K  
Sbjct: 11  FRNNIKKIQMHNYLHEQGKSPFTM--GINQFSDMDEKEFST-----IMNGFRMNNRTKVR 63

Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
           DH H+H           IP  +P + DWR+ G +  V+NQ  CG+CWAFS +   E  H 
Sbjct: 64  DHLHSHY------ISPAIPVSVPAEVDWRKKGYVTPVKNQGQCGSCWAFSAIGALEGQHF 117

Query: 177 LKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAAC 235
            K G L  LS Q ++DC+ + GN GC+GG       ++  N    + E+ YP    D  C
Sbjct: 118 RKTGKLVSLSEQNLVDCSKSYGNNGCNGGVMDYAFKYIKDNDGD-DTEACYPYEAVDGMC 176

Query: 236 KRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYN 292
           + K     G   + YT D    +E  +   +A  GPV  A++A   ++  Y GGV ++  
Sbjct: 177 RFKRECV-GATCRGYT-DLPWGNEVKMKEAVALVGPVSVAIDASHSSFMSYKGGVYVEKE 234

Query: 293 CDGSLANINHAVQIVGY 309
           C  S   ++H V +VGY
Sbjct: 235 C--SPYQLDHGVLVVGY 249


>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
          Length = 567

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 94/308 (30%), Positives = 138/308 (44%), Gaps = 42/308 (13%)

Query: 18  FLAIPVKVSKPNLEQKLEL---FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKN 73
            LA P   S P +   +EL   F  F   Y KSY+  +E   R   F ++L++  +L + 
Sbjct: 248 LLAEPHSSSLPRMGDSVELISLFKDFLTTYNKSYANATETQRRLGIFARNLELAHKLQEL 307

Query: 74  RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGIT 133
            Q   SA+YG+T+FSDL+EEEF+  +L      + L+S           +  R++     
Sbjct: 308 DQG--SAQYGVTKFSDLTEEEFRMFYL------NPLLSS----------LPGRALRPAPR 349

Query: 134 IPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC 193
                P   DWR+ G +   +NQ  CG+CWAFS     E    L+ G L  LS QE++DC
Sbjct: 350 ARGPAPASWDWRDHGALTAAKNQGMCGSCWAFSVTGNVEGQWFLRRGALLTLSEQELVDC 409

Query: 194 AGNGNMGCSGGDFCALLDWMDVNKVV-------LEPESEYPLLLKDAACKRKATSPNGVK 246
               +  C GG        +  N          LE E +Y    +   C   + SP+  +
Sbjct: 410 -DTLDQACGGG--------LPSNAYTAIETLGGLETEKDYSYEGRKERC---SFSPDKAR 457

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGSLANINHAVQ 305
               +   L   E  I   +A +GPV  A+NA   Q+Y  GV   +    S   I+HAV 
Sbjct: 458 AYINSSVDLSRDEQEIAAWLAENGPVSIALNAFAMQFYRRGVSHPFRPLCSPWFIDHAVL 517

Query: 306 IVGYDNYS 313
           +VGY + S
Sbjct: 518 LVGYGDRS 525


>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase; AltName:
           Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
           RecName: Full=Vignain-1; Contains: RecName:
           Full=Vignain-2; Flags: Precursor
 gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
 gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
          Length = 362

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 76/281 (27%), Positives = 134/281 (47%), Gaps = 21/281 (7%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
           E   +L+  ++  +  S S  E   RF  F+ ++  +   NK     +  +  + +F+D+
Sbjct: 34  ESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANVMHVHNTNK---MDKPYKLKLNKFADM 90

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           +  EF++ +    VN H +    +H      + K  S+          P   DWR+ G +
Sbjct: 91  TNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSV----------PASVDWRKKGAV 140

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
             V++Q  CG+CWAFST+   E ++ +K   L  LS QE++DC    N GC+GG   +  
Sbjct: 141 TDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAF 200

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           +++   K  +  ES YP   ++  C     +   V I  +  +  +  E+++L  +A   
Sbjct: 201 EFIK-QKGGITTESNYPYTAQEGTCDESKVNDLAVSIDGHE-NVPVNDENALLKAVANQ- 257

Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           PV  A++A    +Q+Y  GV   +C+    ++NH V IVGY
Sbjct: 258 PVSVAIDAGGSDFQFYSEGVFTGDCN---TDLNHGVAIVGY 295


>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
          Length = 354

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/331 (25%), Positives = 151/331 (45%), Gaps = 50/331 (15%)

Query: 3   DVKNVLFIVALIALCFLAI-----PVKVSKPNLEQKLELF-SSFQQRYK------KSYSK 50
           +V   L I+  + + + A      P++    ++E++ E +     +RYK      + +  
Sbjct: 5   NVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRHFGI 64

Query: 51  SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
            + ++RF N+  + +    L  N            +F+D++ EE+K  ++        L 
Sbjct: 65  YQSNVRFINYINAQNFSFTLTDN------------QFADMTNEEYKALYMG-------LG 105

Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
           +      +  +  ++RS          +P+  DWR+ G +  VRNQ  CG+CWAFSTV  
Sbjct: 106 TSETSRKNQSSFKRERSKV--------LPISVDWRKMGAVTPVRNQGECGSCWAFSTVAA 157

Query: 171 AESMHALKNGTLSLLSVQEVIDC-AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
            E ++ ++ G L  LS QE++DC   +GN GC+GG       ++  N  +    + YP +
Sbjct: 158 VEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARN-YPYI 216

Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
            +   C +   + + VKI  Y  +T+ P+   IL       PV  A++A    +Q Y  G
Sbjct: 217 GEQGICNKDKAANHVVKISGY--ETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKG 274

Query: 288 VIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
           +    C   L   NHAV ++GY  DN  + W
Sbjct: 275 IFNGFCGKQL---NHAVTVIGYGEDNGKKYW 302


>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 359

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 85/278 (30%), Positives = 134/278 (48%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSYSKSEH-DIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y  +E   +RF  F+++LD+I   NK   S    + G+ +F+D++ +EF
Sbjct: 60  FARFAHRYGKRYENAEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADMTWQEF 116

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N    L   HK               TG      +P  KDWRE GI+  V+
Sbjct: 117 QRTKLGAAQNCSATLKGTHK--------------LTG----EALPETKDWREDGIVSPVK 158

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC+GG      +++
Sbjct: 159 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 218

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP   +D  CK  A +  GV++   + +  + +E  +   +    PV 
Sbjct: 219 KSNG-GLDTEEAYPYTGEDGTCKYSAENV-GVEVLD-SVNITLGAEDELKHAVGLVRPVS 275

Query: 274 AAVNAL-TWQYYLGGVI-QYNCDGSLANINHAVQIVGY 309
            A   + +++ Y  GV    +C  +  ++NHAV  VGY
Sbjct: 276 IAFEVIHSFRLYKSGVYSDSHCGQTPMDVNHAVLAVGY 313


>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
 gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
          Length = 343

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 83/307 (27%), Positives = 152/307 (49%), Gaps = 30/307 (9%)

Query: 9   FIVALIALCFLAIPVKVSKP--NLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
            ++ L+ +    +   + +P  N E   E    +  R+ ++Y   +E + RF+ F+ +LD
Sbjct: 10  LVITLLMILGTWVSQAMPRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERRFQIFKNNLD 69

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
            IE  NK     ++ + G+ +FSDLSEEEF T +  + +   +  +         N   K
Sbjct: 70  YIENFNKAFN--KTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPTA---------NTTVK 118

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
            +  +       +P   DWRE G++  V+NQ  CG CWAFS V   E +     G  + L
Sbjct: 119 PTFFSNYYNQDEVPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGIA----GNGASL 174

Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGV 245
           S Q+++DC G+ N GC GG      +++  N+ ++  +++YP       C  ++ S    
Sbjct: 175 SAQQLLDCVGD-NSGCGGGTMIKAFEYIVQNQGIVS-DTDYPYEQTQEMC--RSGSNVAA 230

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT---WQYYLGGVIQYNCDGSLANINH 302
           +I  Y  +++I SE ++   +A   P+  A++A +   ++ Y+ GV  ++ +    ++ H
Sbjct: 231 RITGY--ESVIQSEEALKRAVAKQ-PISVAIDASSGPNFKSYISGV--FSAEDCGTHLTH 285

Query: 303 AVQIVGY 309
           AV +VGY
Sbjct: 286 AVTLVGY 292


>gi|1136308|gb|AAB41119.1| cruzipain [Trypanosoma cruzi]
          Length = 467

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 84/314 (26%), Positives = 140/314 (44%), Gaps = 24/314 (7%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
           M      L + A++ +    +P   +  + E+ L   F+ F+Q++ + Y S +E   R  
Sbjct: 1   MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYGSAAEEAFRLS 60

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
            F ++L  +  L+    +   A +G+T FSDL+ EEF++R+             H    H
Sbjct: 61  VFRENL-FLARLHA--AANPHATFGVTAFSDLTREEFRSRY-------------HNGAAH 104

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
                ++  +   + +  G P   DWR  G +  V++Q  CG+CWAFS +   E    L 
Sbjct: 105 FAAAQERARVPVNVEV-VGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLA 163

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR 237
              L+ LS Q ++ C    + GC GG      +W+   N   +  E  YP    +     
Sbjct: 164 GHPLTNLSEQMLVSC-DKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPP 222

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
             TS + V         L   E+ I   +A +GPV  AV+A +W  Y GGV+  +C    
Sbjct: 223 CTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT-SCVSE- 280

Query: 298 ANINHAVQIVGYDN 311
             ++H V +VGY++
Sbjct: 281 -QLDHGVLLVGYND 293


>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
          Length = 394

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/279 (30%), Positives = 131/279 (46%), Gaps = 29/279 (10%)

Query: 37  FSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  FQ+ + K Y+  E  + R+  F+ +L  I   N N Q   S    + +F DL+ EEF
Sbjct: 89  FYQFQRDHNKFYATEEERLKRYAIFKNNLTYIH--NHNMQG-YSYVLKMNKFGDLTLEEF 145

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           + R+L +   K  L +  +  D     V+   I          P   DWR+ G +  V++
Sbjct: 146 RQRYLGY--KKPDLRTPPREVDTTLESVEDNDI----------PTHVDWRQRGCVTSVKD 193

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
           Q  CG+CWAFS     E ++  K G L  LS Q+++DC+   GN GC GG      +++ 
Sbjct: 194 QGDCGSCWAFSATGAMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVV 253

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHGPV 272
            N  +   E+ YP + KD  CK    S     + + T    +P  SE S+ T +A   PV
Sbjct: 254 ENGGICSGEN-YPYMRKDGVCK----SSQCTSVATITGYRSVPRRSEKSMKTALALRSPV 308

Query: 273 IAAV--NALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             A+  N   +Q+Y  G+    C     N++H V +VGY
Sbjct: 309 SVAIQANQAAFQFYYDGIFDAPCG---TNLDHGVLLVGY 344


>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
 gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
          Length = 330

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 89/309 (28%), Positives = 149/309 (48%), Gaps = 30/309 (9%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           ++ L+A   +A    +S    E + E   +F+ ++ K YS+ E   R   F+ +L  IE 
Sbjct: 3   LLVLLACVAMATAASLS---FESQWE---AFKIKHDKVYSEKEEYARRLIFQDNLKTIES 56

Query: 70  LNKNRQSPESARY-GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            N+   + + + + G+ +F+D++  E+    L   +   ++ S         N  K  S 
Sbjct: 57  HNQEADTGKHSYWLGVNQFADMTHAEY----LNQVIGGCLITS---------NLTKTGSR 103

Query: 129 TTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
            T   +P   +    DWR+ G++  +++Q  CG+CWAFST  + E  HA   GTL  LS 
Sbjct: 104 ATYRYMPNMQVNDTVDWRDKGLVTDIKDQGQCGSCWAFSTTGSLEGQHAKATGTLVSLSE 163

Query: 188 QEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
           Q ++DC+   GN GC GGD      ++  NK + + E  YP   K+  CK    S  G  
Sbjct: 164 QNLVDCSRQEGNKGCEGGDMDQGFQYIIQNKGI-DTEQCYPYKAKNHRCKFD-NSCIGAT 221

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINHA 303
           + S+T D     E ++    A  GP+   ++A   ++Q+Y  GV  ++ C  S   ++H 
Sbjct: 222 MSSFT-DVTSGDEDALKQACANIGPISVGIDASHQSFQFYSSGVYNEFEC--SSTKLDHG 278

Query: 304 VQIVGYDNY 312
           V +VGY  Y
Sbjct: 279 VLVVGYGTY 287


>gi|71666430|ref|XP_820174.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70885508|gb|EAN98323.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 467

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 84/314 (26%), Positives = 140/314 (44%), Gaps = 24/314 (7%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
           M      L + A++ +    +P   +  + E+ L   F+ F+Q++ + Y S +E   R  
Sbjct: 1   MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYESAAEEAFRLS 60

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
            F ++L  +  L+    +   A +G+T FSDL+ EEF++R+             H    H
Sbjct: 61  VFRENL-FLARLHA--AANPHATFGVTPFSDLTREEFRSRY-------------HNGAAH 104

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
                ++  +   + +  G P   DWR  G +  V++Q  CG+CWAFS +   E    L 
Sbjct: 105 FAAAQERARVPVNVEV-VGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLA 163

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR 237
              L+ LS Q ++ C    + GC GG      +W+   N   +  E  YP    +     
Sbjct: 164 GHPLTNLSEQMLVSC-DKTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPP 222

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
             TS + V         L   E+ I   +A +GPV  AV+A +W  Y GGV+  +C    
Sbjct: 223 CTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT-SCVSE- 280

Query: 298 ANINHAVQIVGYDN 311
             ++H V +VGY++
Sbjct: 281 -QLDHGVLLVGYND 293


>gi|66475996|ref|XP_627814.1| cryptopain - cysteine proteinase secreted, possible transmembrane
           domain near N-terminus [Cryptosporidium parvum Iowa II]
 gi|32399065|emb|CAD98305.1| cryptopain precursor [Cryptosporidium parvum]
 gi|46229218|gb|EAK90067.1| cryptopain - cysteine proteinase secreted, possible transmembrane
           domain near N-terminus [Cryptosporidium parvum Iowa II]
 gi|76160841|gb|ABA40395.1| cryptopain-1 [Cryptosporidium parvum]
          Length = 401

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/296 (29%), Positives = 131/296 (44%), Gaps = 23/296 (7%)

Query: 21  IPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPES 79
           +P     P   +  + F  F+++Y K YS   E + RF+ ++++++ I+  N    S   
Sbjct: 70  VPGDYVDPATREYRKSFEEFKKKYHKVYSSMEEENQRFEIYKQNMNFIKTTNSQGFS--- 126

Query: 80  ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
               + EF DLS+EEF  R        ++  S         + V           P  I 
Sbjct: 127 YVLEMNEFGDLSKEEFMAR-----FTGYIKDSKDDERVFKSSRVSASESEEEFVPPNSI- 180

Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMH-ALKNGTLSLLSVQEVIDCAG-NG 197
              +W EAG +  +RNQ+ CG+CWAFS V   E    A  N  L  LS Q+ +DC+  NG
Sbjct: 181 ---NWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATCAQTNRGLPSLSEQQFVDCSKQNG 237

Query: 198 NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
           N GC GG       +   NK  L    +YP   ++  C   +   N ++I       + P
Sbjct: 238 NFGCDGGTMGLAFQYAIKNK-YLCTNDDYPYFAEEKTC-MDSFCENYIEIPVKAYKYVFP 295

Query: 258 SESSIL-TDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
              + L T +A +GP+  A+ A    +Q+Y  GV    C      +NH V +VGYD
Sbjct: 296 RNINALKTALAKYGPISVAIQADQTPFQFYKSGVFDAPCG---TKVNHGVVLVGYD 348


>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
          Length = 369

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 78/273 (28%), Positives = 127/273 (46%), Gaps = 23/273 (8%)

Query: 39  SFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTR 98
           + Q R  +S    EH  RF+ F++++  I+ +NK + SP   + G+ +F+DLS EEFK  
Sbjct: 50  ALQHRSSRSLDSEEHAERFEIFKENVKYIDSVNK-KDSP--YKLGLNKFADLSNEEFKAI 106

Query: 99  HLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQT 158
           ++             K        V+  S     + P  +P   DWR+ G +  V+NQ  
Sbjct: 107 YM-----------GTKMDLRGDREVQSGSFMYQNSEP--LPASIDWRQKGAVAAVKNQGH 153

Query: 159 CGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKV 218
           CG+CWAFSTV + E ++ +  G L  LS Q+++DC+   N GC+GG       ++ +N  
Sbjct: 154 CGSCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTE-NSGCNGGLMDTAFQYI-INNG 211

Query: 219 VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA 278
            +  E  YP   +   C     +    ++     + +  +    L +   H PV  A+ A
Sbjct: 212 GIVTEDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEA 271

Query: 279 --LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               +Q+Y  GV    C  +L   +H V  VGY
Sbjct: 272 SGQDFQFYSTGVFTGKCGTAL---DHGVVAVGY 301


>gi|121531600|gb|ABM55485.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
          Length = 326

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 90/311 (28%), Positives = 146/311 (46%), Gaps = 38/311 (12%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIE 68
           ++A+ A   +A+    ++       + + +F+Q + K+Y    E   RF  F+++L  I+
Sbjct: 3   LLAIFATVLIAVTASTNE-------DQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIK 55

Query: 69  ELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
           E N +  +  E+   G+T F+DL+ EEFK           +L    K+        K R 
Sbjct: 56  EHNARYDKGEETYLLGVTRFADLTHEEFK----------DILKGQIKN--------KPRL 97

Query: 128 ITTGITIPTG--IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
             T    P    +P   DW E G + +V++Q  CG+CWAFS     E  +A+ N     L
Sbjct: 98  NATPTVFPEDLEVPDSIDWTEKGAVLEVKDQNPCGSCWAFSATGALEGQNAILNNVKISL 157

Query: 186 SVQEVIDC-AGNGNMGC-SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           S Q+++DC A  GN  C  GGD  A  ++  V    ++ E  YP + K   C+  A S  
Sbjct: 158 SEQQLLDCSAAYGNGNCKEGGDMSAAFEY--VRDYGIQSEKSYPYIRKQTECQYDA-SKT 214

Query: 244 GVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHA 303
            +KIK Y    +  SE  +   +   GP+  A+N+   Q Y  G+I  +  G   +++H 
Sbjct: 215 ILKIKGYK--NVTTSEEGLRKAVGAIGPISIAMNSDPLQLYYSGII--SGKGCSHDLDHG 270

Query: 304 VQIVGYDNYSR 314
           V +VGY   S+
Sbjct: 271 VLVVGYGKASQ 281


>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
          Length = 360

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 83/277 (29%), Positives = 132/277 (47%), Gaps = 25/277 (9%)

Query: 36  LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           LF+ F  RY K Y   E +I+ + FE  LD ++ +  + +   S + G+ EF+D++ +EF
Sbjct: 60  LFARFAHRYGKRYETVE-EIK-QRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDITWDEF 117

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +   L  + N                ++K  ++         +P  KDWREAGI+  V+N
Sbjct: 118 RRDRLGAAQNCSATT---------KGNLKLTNVV--------LPETKDWREAGIVSPVKN 160

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
           Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC+GG      +++ 
Sbjct: 161 QGKCGSCWTFSTTGALEAAYGQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIK 220

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
            N   L+ E  YP   K+  CK  + +  GVK+   + +  + +E  +   +A   PV  
Sbjct: 221 SNG-GLDTEEAYPYTGKNGLCKFSSENV-GVKVID-SVNITLGAEDELKYAVALVRPVSI 277

Query: 275 AVNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
           A   +    QY  G      C  +  ++NHAV  VGY
Sbjct: 278 AFEVIKGFKQYKSGVYTSTECGNTPMDVNHAVLAVGY 314


>gi|441593109|ref|XP_003260582.2| PREDICTED: cathepsin L2 isoform 1 [Nomascus leucogenys]
          Length = 334

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 88/319 (27%), Positives = 148/319 (46%), Gaps = 39/319 (12%)

Query: 11  VALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
           + L A C L I   V  P  +Q L+  +  ++  +++ Y  +E   R   +EK++ +IE 
Sbjct: 5   LVLAAFC-LGIASAV--PKFDQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIEL 61

Query: 70  LNKN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
            N    Q        +  F D++ EEF+            +M   ++       V +  +
Sbjct: 62  HNGEYSQGKHGFTMAMNAFGDMTNEEFRQ-----------MMGCFRNQKFRKGKVFREPL 110

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
              + +P  +    DWR+ G +  V+NQ+ CG+CWAFS     E     K G L  LS Q
Sbjct: 111 F--LDLPKSV----DWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164

Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
            ++DC+   GN GC+GG       ++  N   L+ E  YP +  D  CK +  +     +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMGKAFQYVKENG-GLDSEESYPYVAMDEICKYRPEN----SV 219

Query: 248 KSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
            + T  T++P   E +++  +AT GP+  A++A   ++Q+Y  G I +  D S  N++H 
Sbjct: 220 ANDTGFTVVPPGKEKALMKAVATVGPISVAMDAGHSSFQFYNQG-IYFEPDCSSENLDHG 278

Query: 304 VQIVGY------DNYSRTW 316
           V +VGY       N S+ W
Sbjct: 279 VLVVGYGFEGANSNNSKYW 297


>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
          Length = 433

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 78/284 (27%), Positives = 141/284 (49%), Gaps = 34/284 (11%)

Query: 34  LELFSSFQQRYKKSYSKS---EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
           + ++ ++  ++ K+ S++   E D RF+ F+ +L  ++E N+   S    R G+T F+DL
Sbjct: 47  MSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLS---YRLGLTRFADL 103

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           + +E+++++L   + K                 ++ S+     +   +P   DWR+ G +
Sbjct: 104 TNDEYRSKYLGAKMEK--------------KGERRTSLRYEARVGDELPESIDWRKKGAV 149

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
            +V++Q  CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG    L+
Sbjct: 150 AEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGG----LM 205

Query: 211 DW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
           D+     +    ++ + +YP    D  C +   +   V I SY  D    SE S+   +A
Sbjct: 206 DYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYE-DVPTYSEESLKKAVA 264

Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            H P+  A+ A    +Q Y  G+   +C      ++H V  VGY
Sbjct: 265 -HQPISIAIEAGGRAFQLYDSGIFDGSCG---TQLDHGVVAVGY 304


>gi|74142447|dbj|BAE31977.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 85/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)

Query: 25  VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
           ++ P  +Q     +  ++  +++ Y  +E + R   +EK++ +I+  N    + +     
Sbjct: 16  LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75

Query: 83  GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
            +  F D++ EEF     R  VN +           H  H K R     + +   IP   
Sbjct: 76  EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118

Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
           DWRE G +  V+N+  CG+CWAFS     E    LK G L  LS Q ++DC+   GN GC
Sbjct: 119 DWREKGCVTPVKNKGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178

Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
           +GG       ++  N   L+ E  YP   KD +CK +A       + + T    IP  E 
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233

Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           +++  +AT GP+  A++A   + Q+Y  G I Y  + S  N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283


>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 336

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 82/285 (28%), Positives = 128/285 (44%), Gaps = 31/285 (10%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
           E    +   Y K Y   +E D RF+ F+ +++ IE  N +   P   + G+   +DL+ E
Sbjct: 36  ERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKP--YKLGVNHLADLTVE 93

Query: 94  EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
           EFK                 + H+      K  ++T        IP   DWR  G +  +
Sbjct: 94  EFKASR----------NGFKRPHEFSTTTFKYENVTA-------IPAAIDWRTKGAVTPI 136

Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDW 212
           ++Q  CG+CWAFST+   E +H +  G L  LS QE++DC   G + GC GG      ++
Sbjct: 137 KDQGQCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEF 196

Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
           +  N  +   E+ YP    D  C  KATSP   +IK Y  + + P+  + L     + PV
Sbjct: 197 IIKNGGITS-ETNYPYKAVDGKC-NKATSPV-AQIKGY--EKVPPNSETALQKAVANQPV 251

Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYSRT 315
             +++A    + +Y  G+    C   L   +H V  VGY   + T
Sbjct: 252 SVSIDADGAGFMFYSSGIYNGECGTEL---DHGVTAVGYGTANGT 293


>gi|154336052|ref|XP_001564262.1| cysteine peptidase A (CPA) [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134061296|emb|CAM38321.1| cysteine peptidase A (CPA) [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 479

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 90/311 (28%), Positives = 146/311 (46%), Gaps = 28/311 (9%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSYSKSE-HDIRFKNFEKS 63
            +    L ALC+ +  +  +   ++ ++    F  F++++ KS+ +      RF  F+++
Sbjct: 10  AMVATVLFALCYCSTVIARTLHGIDDEVASAHFMHFKKQHGKSFGEEAVEGHRFNAFKEN 69

Query: 64  LDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
           +     LN   Q+P  A Y ++ +F+ L+ +EF  ++L        L +H K   H +  
Sbjct: 70  MQTAVYLNA--QNPH-AHYDVSGKFAALTPQEFAKQYLNPDYYTRQLKAH-KERAHVYEG 125

Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
           V+            G     DWRE G + +V++Q  CG+CWAFS +   E   AL   TL
Sbjct: 126 VR------------GGLSAVDWREKGAVTEVKDQGLCGSCWAFSAIGNIEGQWALSGNTL 173

Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVN-KVVLEPESEYPLLLKDAACKR-KAT 240
             LS Q ++ C    +MGC+GG       W+  N    +  E  YP    D +     +T
Sbjct: 174 VSLSEQMLVSC-DTVDMGCNGGLMDQAWAWIIKNHSGAVYTEVSYPYTSGDGSTASCLST 232

Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
              G +I      +L   E +I   +  +GP+  AV+A TWQ Y GGV+  NC     N+
Sbjct: 233 GKVGARISGQV--SLPQDEDAIEAWLEKNGPISIAVDATTWQLYFGGVVS-NCFAY--NL 287

Query: 301 NHAVQIVGYDN 311
           NH V +VGY+N
Sbjct: 288 NHGVLLVGYNN 298


>gi|375073978|gb|AFA34856.1| cathepsin L-like protein [Trypanosoma cruzi]
          Length = 467

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 84/314 (26%), Positives = 140/314 (44%), Gaps = 24/314 (7%)

Query: 1   MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFK 58
           M      L + A++ +    +P   +  + E+ L   F+ F+Q++ + Y S +E   R  
Sbjct: 1   MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFAEFKQKHGRVYESAAEEAFRLS 60

Query: 59  NFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
            F ++L  +  L+    +   A +G+T FSDL+ EEF++R+             H    H
Sbjct: 61  VFRENL-FLARLHA--AANPHATFGVTPFSDLTREEFRSRY-------------HNGAAH 104

Query: 119 HHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALK 178
                ++  +   + +  G P   DWR  G +  V++Q  CG+CWAFS +   E    L 
Sbjct: 105 FAAAQERARVPVNVEV-VGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLA 163

Query: 179 NGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKR 237
              L+ LS Q ++ C    + GCSGG      +W+   N   +  E  YP    +     
Sbjct: 164 GHPLTNLSEQMLVSC-DKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPP 222

Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
             TS + V         L   E+ I   +A +GPV   V+A +W  Y GGV+  +C    
Sbjct: 223 CTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVGVDASSWMTYTGGVMT-SCVSE- 280

Query: 298 ANINHAVQIVGYDN 311
             ++H V +VGY++
Sbjct: 281 -QLDHGVLLVGYND 293


>gi|326492229|dbj|BAK01898.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 87/322 (27%), Positives = 148/322 (45%), Gaps = 30/322 (9%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQ---RYKKSYSKSEHDIRFKNFEKS 63
           ++ +   +A+C LA      K   E  +  ++ F+    +  K Y   E  IRF N++ +
Sbjct: 6   LITLTVFLAICSLAASSNTFKNPQEDVVLFYNIFKDWITQSNKQYGIEEMAIRFFNWKNN 65

Query: 64  LDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
            D ++E   N Q+  + R  + +++D++ EEF   H+   +N  +L +  K+        
Sbjct: 66  FDFVQE--HNAQAGLTFRLEMNDYADMTAEEFSALHM--GLNTELLAASKKNKTAPAAAK 121

Query: 124 KKRSITTGI------------TIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETA 171
           K  + T                  TG+P   D R+ G +  V+NQ TCG C+AF+     
Sbjct: 122 KANNTTNSTNATNAGFNKNSSAADTGLPKSVDCRKTGAVSGVKNQGTCGGCYAFAAAGAL 181

Query: 172 ESMHALKNGTLSLLSVQEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLL 230
           E ++A+KN  L+ +SVQ++IDC+G  GN GC GG       +  +  V  E ES Y    
Sbjct: 182 EGLYAIKNKKLTDISVQQMIDCSGFFGNKGCDGGLMTTTFGFTQMFGV--EAESTYGYAA 239

Query: 231 KDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV 288
               C++   + + +  ++   + +  +++  L       PV   + A  L  Q +  GV
Sbjct: 240 ALGECRQ---NTDNIVFRNSGYEEVPQNDTLALKKAVARQPVSVGIEASSLAVQLFKSGV 296

Query: 289 IQYNCDGSLANINHAVQIVGYD 310
           +   C  +L   NHAV IVGYD
Sbjct: 297 LTGGCGTAL---NHAVLIVGYD 315


>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
           Short=CP-2; AltName: Full=Major excreted protein;
           Short=MEP; Contains: RecName: Full=Procathepsin L;
           Contains: RecName: Full=Cathepsin L1 heavy chain;
           Contains: RecName: Full=Cathepsin L1 light chain; Flags:
           Precursor
 gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
 gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
 gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
          Length = 334

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 87/294 (29%), Positives = 139/294 (47%), Gaps = 35/294 (11%)

Query: 25  VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
           ++ P  +Q     +  ++  +++ Y  +E + R   +EK++ +I+  N       + ++G
Sbjct: 16  LATPKFDQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEY---SNGKHG 72

Query: 84  IT----EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
            T     F D++ EEF     R  VN +           H  H K R     + +   IP
Sbjct: 73  FTMEMNAFGDMTNEEF-----RQIVNGY----------RHQKHKKGRLFQEPLMLQ--IP 115

Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGN 198
              DWRE G +  V+NQ  CG+CWAFS     E    LK G L  LS Q ++DC+   GN
Sbjct: 116 KTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGN 175

Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP- 257
            GC+GG       ++  N   L+ E  YP   KD +CK +A       + + T    IP 
Sbjct: 176 QGCNGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----YAVANDTGFVDIPQ 230

Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            E +++  +AT GP+  A++A   + Q+Y  G I Y  + S  +++H V +VGY
Sbjct: 231 QEKALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKDLDHGVLVVGY 283


>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
          Length = 474

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 84/285 (29%), Positives = 133/285 (46%), Gaps = 27/285 (9%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           +Q L L+ S+  ++ K+Y+   E + RF  F+ ++  ++  N  R   +S + G+ +F+D
Sbjct: 54  DQLLSLYESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRN--QSYKLGLNKFAD 111

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
           L+ +E+++ +L   + K              N    RS          +P   DWR+ G 
Sbjct: 112 LTNDEYRSLYLSGKMMKR----------ERKNEDGFRSDRFVFEDGDHLPESVDWRDRGA 161

Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
           +  V++Q  CG+CWAFSTV   E ++ +  G L  LS QE++DC    N GC+GG    L
Sbjct: 162 VAPVKDQGQCGSCWAFSTVGAVEGINKIVTGELISLSEQELVDCDNGYNQGCNGG----L 217

Query: 210 LDW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
           +D+     V    ++ E +YP    D  C +   +   V I  Y  D     E S+   +
Sbjct: 218 MDYAFEFIVKNGGIDTEDDYPYKGVDGLCDQNRKNAKVVTINGYE-DVPHNDEKSLKKAV 276

Query: 267 ATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           A H PV  A+ A    +Q Y  GV    C   L   +H V  VGY
Sbjct: 277 A-HQPVSVAIEAGGRAFQLYESGVFTGQCGTEL---DHGVVAVGY 317


>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
          Length = 458

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 80/283 (28%), Positives = 132/283 (46%), Gaps = 24/283 (8%)

Query: 31  EQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSP-ESARYGITEFS 88
           E+   L++ ++  + KSY+   E + R+  F  +L  I+E N    +   S R G+  F+
Sbjct: 34  EEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFA 93

Query: 89  DLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAG 148
           DL+ EE++  +L             ++       V  R +         +P   DWR  G
Sbjct: 94  DLTNEEYRDTYL-----------GLRNKPRRERKVSDRYLAAD---NEALPESVDWRTKG 139

Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCA 208
            + ++++Q+  G+CWAFS +   E ++ +  G L  LS QE++DC  + N GC+GG    
Sbjct: 140 AVAEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDY 199

Query: 209 LLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
             D++ +N   ++ E +YP   KD  C     +   V I SY  D    SE+S+   +A 
Sbjct: 200 AFDFI-INNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYE-DVTPNSETSLQKAVAN 257

Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             PV  A+ A    +Q Y  G+    C  +L   +H V  VGY
Sbjct: 258 Q-PVSVAIEAGGRAFQLYSSGIFTGKCGTAL---DHGVAAVGY 296


>gi|330796919|ref|XP_003286511.1| hypothetical protein DICPUDRAFT_77394 [Dictyostelium purpureum]
 gi|325083492|gb|EGC36943.1| hypothetical protein DICPUDRAFT_77394 [Dictyostelium purpureum]
          Length = 325

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 151/313 (48%), Gaps = 38/313 (12%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
           + +  I L FL   V  +K   E   +  F S+     K Y   +   R++ F+ ++D I
Sbjct: 4   YFIGFILLIFLNQNVFCNKLFTEIIYQNKFISWANENNKFYETLDFKKRYEIFKYNMDFI 63

Query: 68  EELNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
              NK N Q+      G+ +++DLS EE+K+  L  ++                N+++  
Sbjct: 64  YSWNKGNSQTI----LGLNKYADLSNEEYKSLFLGSNI-------------KTQNYIRIN 106

Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
           S    I      P   DWR  G +  V+NQ  C + +AFS + + ES + ++NG L  LS
Sbjct: 107 SSRYDI------PTTFDWRLKGAVTPVKNQGFCNSGYAFSAIGSLESSNKIENGQLIRLS 160

Query: 187 VQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEP-ESEYPLLLKDAACKRKATSPNG 244
            Q +IDC+G+ GN GC GG      +++  ++    P ES YP   + + C+ K     G
Sbjct: 161 EQNLIDCSGSEGNRGCDGGTVVNSFNYLFKHQNGKIPKESSYPYEAQKSKCRFKDQFI-G 219

Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI-- 300
             + ++    LI  ES+I   +AT GPV  A++A  + +Q Y GGV     D   +NI  
Sbjct: 220 ATLNNFA--NLISDESTIQNAVATKGPVSVAIDASSIFFQLYFGGVYD---DLFCSNIYT 274

Query: 301 NHAVQIVGY-DNY 312
           NH V IVGY +NY
Sbjct: 275 NHFVLIVGYTENY 287


>gi|313235882|emb|CBY11269.1| unnamed protein product [Oikopleura dioica]
          Length = 371

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 88/279 (31%), Positives = 140/279 (50%), Gaps = 25/279 (8%)

Query: 37  FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFK 96
           F +F   + K YS+ E   RF+ F ++L  I+  N   Q   SA+YG+TEF+DLS+ EF+
Sbjct: 50  FENFLLEHPKMYSEQESHSRFQTFWENLKRIKFHNHIEQG--SAKYGVTEFADLSDFEFR 107

Query: 97  TRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQ 156
             +L   +   + + + K ++      K R+ +  +     +    DW E G + +V+NQ
Sbjct: 108 RHYL--GLKPELKIPNRKKYER-----KSRNSSKKLKFAKTVDETFDWVEKGAVTEVKNQ 160

Query: 157 QTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD--WMD 214
             CG+CWAFST    E       G L  LS QE++DC    + GC+GG    L+D  + +
Sbjct: 161 GMCGSCWAFSTTGNIEGAWFKATGDLVSLSEQELVDCD-QKDSGCNGG----LMDQAFEE 215

Query: 215 VNKV-VLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
           V ++  LE E +YP       C  +  S + V+I  +    +   E  I   +  HGP+ 
Sbjct: 216 VIRIGGLETEQQYPYDGVQETCNFEK-SLSKVQIDDFM--DIGEDEEEIAEALEEHGPLS 272

Query: 274 AAVNALTWQYYLGGV---IQYNCDGSLANINHAVQIVGY 309
            A+NA   Q+Y GG+   + + C  S   ++H V +VGY
Sbjct: 273 IAINAFGMQFYRGGISHPLSFLC--SQDGLDHGVLMVGY 309


>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
 gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
          Length = 496

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 89/280 (31%), Positives = 132/280 (47%), Gaps = 34/280 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F  F + +KK Y S+ E   R+  F+ ++  +E L KN Q   +A YG+T F+DL+ EEF
Sbjct: 196 FKEFLKTFKKWYLSEKELLKRYDIFKVNMKTVEMLQKNEQG--TAVYGVTFFADLTPEEF 253

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +  +L     +  L              +K SI  G      I  + DWRE   + +V+N
Sbjct: 254 RKFYLSPQWKRDQLPQ------------RKASIPKG-----KIEDRWDWREHNAVTEVKN 296

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDV 215
           Q  CG+CWAF+T+   E + A+K G L  LS QE++DC    + GCSGG        + +
Sbjct: 297 QGMCGSCWAFATIANVEGVWAVKKGELVSLSEQELVDC-DTLDQGCSGGYPSNAYKEI-I 354

Query: 216 NKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVI 273
               L  E+ Y        C+ K  +      K Y  D  +L   E+ I   I  +GPV 
Sbjct: 355 RLGGLTTETNYSYDGNQGTCRFKTQNA-----KVYINDSVSLPEDETEIAAYIRENGPVA 409

Query: 274 AAVNALTWQYYLGGVI---QYNCDGSLANINHAVQIVGYD 310
             +NA    +Y  G+    ++ C  S   ++H V IVGYD
Sbjct: 410 VGINAFAMMFYRHGIAHPWRFLC--SPDALDHGVAIVGYD 447


>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 463

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 86/283 (30%), Positives = 136/283 (48%), Gaps = 37/283 (13%)

Query: 34  LELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
           L++F  + + + + Y S SE   RF+ F+++   I   NK ++S      G+ +FSDL+ 
Sbjct: 46  LDVFHQWLETHSRVYRSLSEKHHRFQIFKENFLYIHAHNKQQKS---YWLGLNKFSDLTH 102

Query: 93  EEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
           +EF+ ++L    VN+    ++  + D                       K DWR  G + 
Sbjct: 103 QEFRAQYLGTKPVNRQRKEANFMYED------------------VEAEPKVDWRLKGAVT 144

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD 211
            V++Q  CG+CWAFS V + E ++A+K G L  LS QE++DC    N GC+GG    L+D
Sbjct: 145 DVKDQGACGSCWAFSAVGSVEGVNAIKTGELVSLSEQELVDCDRKQNQGCNGG----LMD 200

Query: 212 W---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIAT 268
           +     +    ++ E +YP   +D  C     +   V I  Y  D    SES+++  + T
Sbjct: 201 YAFEFIIKNGGIDTEKDYPYKARDGRCDEGRRNSKVVVIDDYQ-DVPTQSESALMKAL-T 258

Query: 269 HGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
             PV  A+ A    +Q+Y GGV    C   L   +H V  VGY
Sbjct: 259 KNPVSVAIEAGGRDFQHYQGGVFTGPCGSEL---DHGVLAVGY 298


>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
          Length = 378

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 85/303 (28%), Positives = 130/303 (42%), Gaps = 31/303 (10%)

Query: 21  IPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDI---------RFKNFEKSLDIIEEL 70
           IP   S  + E+ L  L+  ++ RY  S   +   +         RF  F ++   I E 
Sbjct: 25  IPFTESDLSSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEA 84

Query: 71  NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH--HHNHVKKRSI 128
           N+    P   R  + +F+D++ +EF+         +    S  +HH         +  S 
Sbjct: 85  NRRGGRP--FRLALNKFADMTTDEFR---------RTYAGSRARHHRSLRGGRGGEGGSF 133

Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
             G      +P   DWRE G +  +++Q  CG+CWAFS V   E ++ +K G L  LS Q
Sbjct: 134 RYGGDDEDNLPPAVDWRERGAVTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQ 193

Query: 189 EVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIK 248
           E++DC    N GC GG       ++  N  +   ES YP   +   C +   S + V I 
Sbjct: 194 ELVDCDTGDNQGCDGGLMDYAFQFIKRNGGIT-TESNYPYRAEQGRCNKAKASSHDVTID 252

Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQI 306
            Y  D     ES++   +A   PV  AV A    +Q+Y  GV    C     +++H V  
Sbjct: 253 GYE-DVPANDESALQKAVANQ-PVAVAVEASGQDFQFYSEGVFTGECG---TDLDHGVAA 307

Query: 307 VGY 309
           VGY
Sbjct: 308 VGY 310


>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
 gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
 gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 341

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 90/307 (29%), Positives = 136/307 (44%), Gaps = 30/307 (9%)

Query: 14  IALCFLAIPVKVSKPNLEQKLELFSS--------FQQRYKKSYSK-SEHDIRFKNFEKSL 64
           I    LAI +      +  +  LF +        +  R+ + YS  SE   RF+ F  +L
Sbjct: 4   IVFFLLAILLSSRTSGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNL 63

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
             +E +N N  + ++    + EFSDL++EEFK R+    V +   M+     D H   V 
Sbjct: 64  KFVESINMN--TNKTYTLDVNEFSDLTDEEFKARYTGLVVPEG--MTRISTTDSHET-VS 118

Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
            R    G T  +      DW + G +  V++QQ CG CWAFS V   E M  + NG L  
Sbjct: 119 FRYENVGETGES-----MDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVS 173

Query: 185 LSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
           LS Q+++DC+   N GC GG      D++  N+ +   E  YP       C+    +   
Sbjct: 174 LSEQQLLDCSTENN-GCGGGIMWKAFDYIKENQGIT-TEDNYPYQGAQQTCESNHLAA-- 229

Query: 245 VKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQY--YLGGVIQYNCDGSLANINH 302
             I  Y  +T+  ++   L    +  PV  A+    +++  Y GG+    C   L    H
Sbjct: 230 ATISGY--ETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLT---H 284

Query: 303 AVQIVGY 309
           AV IVGY
Sbjct: 285 AVTIVGY 291


>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
 gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
          Length = 462

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 78/284 (27%), Positives = 141/284 (49%), Gaps = 34/284 (11%)

Query: 34  LELFSSFQQRYKKSYSKS---EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
           + ++ ++  ++ K+ S++   E D RF+ F+ +L  ++E N+   S    R G+T F+DL
Sbjct: 47  MSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLS---YRLGLTRFADL 103

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           + +E+++++L   + K                 ++ S+     +   +P   DWR+ G +
Sbjct: 104 TNDEYRSKYLGAKMEKK--------------GERRTSLRYEARVGDELPESIDWRKKGAV 149

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
            +V++Q  CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG    L+
Sbjct: 150 AEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGG----LM 205

Query: 211 DW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
           D+     +    ++ + +YP    D  C +   +   V I SY  D    SE S+   +A
Sbjct: 206 DYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYE-DVPTYSEESLKKAVA 264

Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            H P+  A+ A    +Q Y  G+   +C      ++H V  VGY
Sbjct: 265 -HQPISIAIEAGGRAFQLYDSGIFDGSCG---TQLDHGVVAVGY 304


>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
 gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
           Precursor
 gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
 gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
 gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
          Length = 462

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 79/284 (27%), Positives = 141/284 (49%), Gaps = 34/284 (11%)

Query: 34  LELFSSFQQRYKKSYSKS---EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
           + ++ ++  ++ K+ S++   E D RF+ F+ +L  ++E N+   S    R G+T F+DL
Sbjct: 47  MSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLS---YRLGLTRFADL 103

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           + +E+++++L   + K                 ++ S+     +   +P   DWR+ G +
Sbjct: 104 TNDEYRSKYLGAKMEKK--------------GERRTSLRYEARVGDELPESIDWRKKGAV 149

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
            +V++Q  CG+CWAFST+   E ++ +  G L  LS QE++DC  + N GC+GG    L+
Sbjct: 150 AEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGG----LM 205

Query: 211 DW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
           D+     +    ++ + +YP    D  C +   +   V I SY  D    SE S+   +A
Sbjct: 206 DYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYE-DVPTYSEESLKKAVA 264

Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
            H P+  A+ A    +Q Y  G+   +C   L   +H V  VGY
Sbjct: 265 -HQPISIAIEAGGRAFQLYDSGIFDGSCGTQL---DHGVVAVGY 304


>gi|440797325|gb|ELR18416.1| cathepsin Llike cysteine protease [Acanthamoeba castellanii str.
           Neff]
          Length = 345

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 86/291 (29%), Positives = 134/291 (46%), Gaps = 36/291 (12%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           + SF+ +Y KSY +  E   R   F +++  I   +    +       I EF+DL+ +EF
Sbjct: 27  WESFKAKYGKSYPTPHEEAHRRAVFHRNVAFIAAHHDPLYT-----VAINEFADLTFDEF 81

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
            TR +          S       H      R           +P + DWRE G++ +V+N
Sbjct: 82  STRKMGLLPPPLPSSSSSSPGAAHLLEAATR-----------LPTQVDWREKGVVTRVKN 130

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWMD 214
           Q  CG+CWAFS     E   AL+ G L  LS + +IDC+   G+MGC GG       ++ 
Sbjct: 131 QLDCGSCWAFSAAGAIEGQQALRTGRLVDLSEENLIDCSWAQGDMGCGGGLPSQAFQYVI 190

Query: 215 VNKVVLEPESEYPL---LLKDAACKR-------KATSPNGVKIKSYTCDTLIP--SESSI 262
            NK + + E+ YPL    + D            ++    G  + SYT    +P  SE+++
Sbjct: 191 DNKGI-DTEARYPLASVWISDCTAPELCPCTYNRSAGAVGAVVASYTS---LPAGSEAAL 246

Query: 263 LTDIATHGPVIAAVNA-LTWQYYLGGVI-QYNCDGSLANINHAVQIVGYDN 311
              +AT GP+   ++A    Q+Y GGV    +C  +  ++NHAV  VGY +
Sbjct: 247 AHALATVGPISVCIDAEQGLQFYSGGVFSSRSCGSARTDLNHAVLAVGYGS 297


>gi|357139514|ref|XP_003571326.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
           [Brachypodium distachyon]
          Length = 363

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 87/305 (28%), Positives = 131/305 (42%), Gaps = 39/305 (12%)

Query: 25  VSKPNLEQKLEL---FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNR------ 74
             KP  +   EL   +S +Q +Y K Y S  E + RF  F  + + I   +  +      
Sbjct: 28  AGKPAADDDSELRQRWSKWQAKYSKRYPSHEEQEKRFGVFRDNSNSIGAFSAPQTTTSAV 87

Query: 75  -------QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
                  Q+  + R G+  F DL   E   +    +    VL +       HH+      
Sbjct: 88  VGSFGAPQTVTTVRVGMNRFGDLQPREVLDQFTGFNNTAAVLKTPPPTRLPHHSRK---- 143

Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
                      P   DWR +G +  V+ Q +C +CWAF+ V   E M+ ++ GTL  LS 
Sbjct: 144 -----------PCCVDWRSSGAVTGVKFQGSCQSCWAFAAVAAIEGMNKIRTGTLVSLSE 192

Query: 188 QEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK-RKATSPNGVK 246
           Q+++DC  NG+ GC+GG     LD +     +   E  Y     +  CK  K    +G  
Sbjct: 193 QQLVDCD-NGSSGCAGGRTDTALDLVARRGGITSGE-RYAYGGFNGRCKVDKLLFDHGAA 250

Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNALTW--QYYLGGVIQYNCDGSLANINHAV 304
           +  +    + P++   L       PV A V+A TW  Q+Y GG+ +  C G  A +NHAV
Sbjct: 251 VGGF--KAVPPNDEHQLAMAVARQPVTAYVDASTWEFQFYSGGIFRGPCSGDPARVNHAV 308

Query: 305 QIVGY 309
            IVGY
Sbjct: 309 TIVGY 313


>gi|343470212|emb|CCD17026.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 86/318 (27%), Positives = 153/318 (48%), Gaps = 32/318 (10%)

Query: 4   VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSY-SKSEHDIRFKNFE 61
            + + F V L+A+    +PV +   + EQ L+  F++F+Q+Y +SY   +E   RF+ F+
Sbjct: 7   TRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFK 66

Query: 62  KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
           +++   E   +   +   A +G+T FSD+S EEF+              ++H   +++  
Sbjct: 67  QNM---ERAKEEAAANPYATFGVTRFSDMSPEEFRA-------------TYHNGAEYYAA 110

Query: 122 HVKK-RSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
            +K+ R +   + + TG  P   DWR+ G +  V++Q  C + WAF+ +   E    +  
Sbjct: 111 ALKRPRKV---VNVSTGKAPEAVDWRKKGAVTPVKDQGKCDSSWAFTVIGNIEGQWKIAG 167

Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACK-- 236
             L+ LS Q ++ C  N ++GC  G       W+   N   +  E  YP           
Sbjct: 168 HELTSLSEQMLVSCDTN-DLGCRAGFMDTAFKWIVSSNNGNVFTEQSYPYASGGGNVPTC 226

Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
            K+    G  I  +    ++ +E++I   +A  GPV  AV+A ++Q Y GGV+  +C   
Sbjct: 227 NKSGKVVGANIDDHV--HILDNENAIAEWLAKKGPVAIAVDATSFQSYTGGVLT-SCISK 283

Query: 297 LANINHAVQIVGYDNYSR 314
              +N A  +VGYD+ S+
Sbjct: 284 --EVNSAALLVGYDDTSK 299


>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 439

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 91/322 (28%), Positives = 141/322 (43%), Gaps = 45/322 (13%)

Query: 10  IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIE 68
           +  L+   FLA  V           E    +  R+ K Y    E + RF+ F ++++ +E
Sbjct: 108 LAMLLCTAFLAFQVTCCTLQDASMYERHEQWMTRHGKVYKDPREREKRFRIFNENVNYVE 167

Query: 69  ELNKNRQSPESARYGITEFSDLSEEEF---KTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
             N     P   + GI +F DL+ +EF   + R   H  +  +  +  K+ +        
Sbjct: 168 AFNNAANKP--YKLGINQFXDLTNQEFIAPRNRFKGHMCSSIIRTTTFKYEN-------- 217

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
             +TT       +P   DWR+ G +  V++Q  CG CWAFS V   E +HAL  G L  L
Sbjct: 218 --VTT-------VPSTVDWRQNGAVTPVKDQGQCGCCWAFSAVAATEGIHALSGGKLISL 268

Query: 186 SVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVV-----LEPESEYPLLLKDAACKRKA 239
           S QE++DC   G + GC GG    L+D  D  K +     L  E+ YP    D  C    
Sbjct: 269 SEQELVDCDTKGVDQGCEGG----LMD--DAYKFIIQNHGLNTEANYPYKGVDGKCNANE 322

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT--WQYYLGGVIQYNCDGSL 297
            + +   I  Y  D    +E ++   +A   PV  A++A +  +Q+Y  G    +C   L
Sbjct: 323 AANHAATITGYE-DVPANNEKALQKAVANQ-PVSVAIDASSSDFQFYKSGAFTGSCGTEL 380

Query: 298 ANINHAVQIVGY---DNYSRTW 316
              +H V  VGY   D+ ++ W
Sbjct: 381 ---DHGVTAVGYGVSDHGTKYW 399


>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
          Length = 537

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 90/290 (31%), Positives = 138/290 (47%), Gaps = 41/290 (14%)

Query: 32  QKLELFSSFQQRYKKSYSKS--EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
           Q  +LF +F   YK  Y     E   RF+ F++++  I ELN + +   +  Y +T F+D
Sbjct: 226 QAEQLFFNFITTYKPEYINDHVEMTKRFEIFKENVKKIHELNTHERG--TGVYAVTRFTD 283

Query: 90  LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT--GIPVKKDWREA 147
           L+ EEFK+++L  + N               N +  R       IP    +P   DWR  
Sbjct: 284 LTYEEFKSKYLGLNPNLK-----------KPNQIPMRQAE----IPKVHQLPASFDWRPL 328

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
           G + +V++Q  CG+CWAFS     E    LK G L  LS QE++DC    + GC GG   
Sbjct: 329 GAVTEVKDQGACGSCWAFSVTGNIEGQWKLKTGKLLSLSEQELVDCDKMDD-GCDGG--- 384

Query: 208 ALLDWMD-VNKVV-----LEPESEYPLLLKDAACK-RKATSPNGVKIKSYTCDTLIPSES 260
               +MD   + +     LE E EYP   +D  C   K+ S    K++      +  +E+
Sbjct: 385 ----YMDNAYRAIEQLGGLETEEEYPYEAEDDKCSFNKSLS----KVQISGAVNISSNET 436

Query: 261 SILTDIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
           ++   +  +GP+   +NA   Q+Y+GGV   +    +  NI+H V IVGY
Sbjct: 437 NMAKWLVHNGPISIGINANAMQFYVGGVSHPWKALCNPKNIDHGVLIVGY 486


>gi|300175245|emb|CBK20556.2| unnamed protein product [Blastocystis hominis]
          Length = 325

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 89/310 (28%), Positives = 151/310 (48%), Gaps = 41/310 (13%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLD 65
           +LF +  I+LC        +K  L  +L+ F++F++++ K+Y  + E   R   F  +L 
Sbjct: 2   ILFALIFISLC-------TAKDTLSVELQ-FAAFEKKFGKTYVGEEERRFRMSVFSNNLK 53

Query: 66  IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
           I++  N ++QS  S   GIT F DLS +EF+ R   ++       +  K      +   +
Sbjct: 54  IVDYYN-SKQS--SFVLGITPFIDLSNDEFRERFASNT-------AFEKKAKSVESSSSQ 103

Query: 126 RSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLL 185
           ++     ++P  I    DWR    +  V++Q+ CGACWAF+ V + E ++A K G +   
Sbjct: 104 QTSQDYSSLPRSI----DWRAKNTVSSVKDQKNCGACWAFAAVASIEGVYAQKTGKILDF 159

Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRK--ATSPN 243
           S Q+++DC    ++GCSGG      +++  N + L  ES+YP      +CK+    TS  
Sbjct: 160 SPQQLVDC-DYSSLGCSGGLMTYAYEYVMNNGISL--ESDYPYKASQGSCKKVDFVTSIM 216

Query: 244 GVKIKSYTCDTLIPSESSI-LTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANI 300
           G           +P  S+  L    T  PV  A+ A  + +Q Y  G++     G+   +
Sbjct: 217 GYY--------EVPVGSTYELLKATTKNPVSVAIGADSIFFQLYTSGILAEELCGT--TL 266

Query: 301 NHAVQIVGYD 310
           NH V +VGY+
Sbjct: 267 NHGVLLVGYE 276


>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
          Length = 324

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 83/285 (29%), Positives = 137/285 (48%), Gaps = 28/285 (9%)

Query: 27  KPNLEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEKSLDIIEELNKNRQSPESARYGIT 85
           +PN +  ++ F  +   Y + Y  ++  +R F+ F+ ++  IE  N   ++  S   GI 
Sbjct: 1   EPN-DPMMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNS--RNGNSYTLGIN 57

Query: 86  EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWR 145
           +F+D+++ EF  ++   S+  ++        D              + I + +P   DWR
Sbjct: 58  QFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDD-------------VNI-SAVPQSIDWR 103

Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
           + G + +V+NQ  CG+CWAF+ + T E ++ +K G L  LS QEV+DCA   + GC GG 
Sbjct: 104 DYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCA--VSYGCKGGW 161

Query: 206 FCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD 265
                D++  N  V   E  YP       C   +  PN   I  Y+       E S++  
Sbjct: 162 VNKAYDFIISNNGVTT-EENYPYQAYQGTCNANSF-PNSAYITGYSY-VRRNDERSMMYA 218

Query: 266 IATHGPVIAAVNAL-TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           ++   P+ A ++A   +QYY GGV    C  SL   NHA+ I+GY
Sbjct: 219 VSNQ-PIAALIDASENFQYYNGGVFSGPCGTSL---NHAITIIGY 259


>gi|113603|sp|P05167.1|ALEU_HORVU RecName: Full=Thiol protease aleurain; Flags: Precursor
 gi|19021|emb|CAA28804.1| aleurain [Hordeum vulgare]
          Length = 362

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 128/277 (46%), Gaps = 26/277 (9%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY KSY S +E   RF+ F +SL   EE+    +     R GI  FSD+S EEF
Sbjct: 61  FARFAVRYGKSYESAAEVRRRFRIFSESL---EEVRSTNRKGLPYRLGINRFSDMSWEEF 117

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +   L  +      ++         NH+ + +          +P  KDWRE GI+  V+N
Sbjct: 118 QATRLGAAQTCSATLAG--------NHLMRDA--------AALPETKDWREDGIVSPVKN 161

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
           Q  CG+CW FST    E+ +    G    LS Q+++DCAG   N GC+GG      +++ 
Sbjct: 162 QAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIK 221

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
            N  + + E  YP    +  C  KA +   V++   + +  + +E  +   +    PV  
Sbjct: 222 YNGGI-DTEESYPYKGVNGVCHYKAENA-AVQVLD-SVNITLNAEDELKNAVGLVRPVSV 278

Query: 275 AVNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
           A   +    QY  G     +C  +  ++NHAV  VGY
Sbjct: 279 AFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGY 315


>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 387

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 86/286 (30%), Positives = 139/286 (48%), Gaps = 38/286 (13%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           FS F++R+ KSY ++ EHD RFK F+ ++   E   +++    SA +G+T+FSDL+  EF
Sbjct: 59  FSLFKRRFGKSYATEEEHDRRFKIFKANMRRAE---RHQSFDPSAIHGVTQFSDLTPFEF 115

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGIT--IPT-GIPVKKDWREAGIIGK 152
           +   L   +  H L               +  + T     +PT  +P+  DWR+ G + +
Sbjct: 116 RKAFL--GLRGHRL---------------RLPVDTNAAPILPTENLPIDFDWRQHGGVTR 158

Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGG 204
           V+NQ +CG+CW+FST    E  + L  G L  LS Q+++DC            + GC+GG
Sbjct: 159 VKNQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEEDACDSGCNGG 218

Query: 205 DFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
              +  ++  +    L  E +YP    D        S     I +++    I  E  I  
Sbjct: 219 LMNSAFEYT-LKAGGLMKEQDYPYAGIDRNTCNFDKSKIAASIANFSVVNSI-DEDQIAA 276

Query: 265 DIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
           ++  +GP+  A+NA+  Q Y+GGV   + C   L   +H V +VGY
Sbjct: 277 NLVKNGPLAIAINAVFMQTYIGGVSCPFICSKRL---DHGVLLVGY 319


>gi|1222695|gb|AAA92019.1| CP4 [Dictyostelium discoideum]
          Length = 442

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 89/306 (29%), Positives = 151/306 (49%), Gaps = 34/306 (11%)

Query: 13  LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
           L  LC L +    +K      Q    F+++ Q ++++YS  E + R++ F+ ++D + + 
Sbjct: 4   LSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQIFKSNMDYVHQW 63

Query: 71  NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           N   +  E+   G+  F+D++ +E++T +L    +   L+   +         +K   T 
Sbjct: 64  NS--KGGETV-LGLNVFADITNQEYRTTYLGTPFDGSALIGTEE---------EKIFSTP 111

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT---LSLLSV 187
             T+        DWR  G +  ++NQ  CG CW+FST  + E  H + +GT   L  LS 
Sbjct: 112 APTV--------DWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSE 163

Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA-ACKRKATSPNGV 245
           Q +IDC+ + GN GC GG      +++ +N   ++ ES YP   +D   CK K TS  G 
Sbjct: 164 QNLIDCSKSYGNNGCEGGLMTLGFEYI-INNKGIDTESSYPYTAEDGKECKFK-TSNIGA 221

Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHA 303
           +I SY  +    SE+S L   + + PV  A++A   ++Q Y  G I Y    +   ++H 
Sbjct: 222 QIVSYQ-NVTSGSEAS-LQSASNNAPVSVAIDASNESFQLYESG-IYYEPACTPTQLDHG 278

Query: 304 VQIVGY 309
           V +VGY
Sbjct: 279 VLVVGY 284


>gi|344271616|ref|XP_003407633.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
          Length = 334

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 89/304 (29%), Positives = 141/304 (46%), Gaps = 33/304 (10%)

Query: 13  LIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
           L ALC   + V  + P L+Q L++ ++ ++  YKK Y+ +E D R   +EK++ +IE  N
Sbjct: 7   LAALC---LGVASAAPKLDQSLDVQWNQWRSTYKKPYAVNEEDWRRAVWEKNVKMIERHN 63

Query: 72  KN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
           +   Q        +  F D++ EEF+            +M+  ++  H    +    +  
Sbjct: 64  QEYSQGKHGFTMAMNAFGDMTNEEFRQ-----------VMNGFQNQKHKKGKLFYEPVFG 112

Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
            I      P   DW + G +  V+NQ  CG+CWAFS     E     K G L  LS Q +
Sbjct: 113 HI------PTSVDWTQKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166

Query: 191 IDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA-ACKRKATSPNGVKIK 248
           +DC+   GN GC+GG       ++  N   L+ E  YP L  D   C  K          
Sbjct: 167 VDCSRREGNEGCNGGLMDNAFQYVQDNG-GLDSEESYPYLATDTHTCNYKPE----CSAA 221

Query: 249 SYTCDTLIPS-ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
           + T    IP  E +++  +AT GP+  A++A   ++Q+Y  G I Y    S  +++H V 
Sbjct: 222 NDTGFVDIPQREKALMKAVATVGPISVAIDAGHESFQFYKSG-IYYEPGCSSKDLDHGVL 280

Query: 306 IVGY 309
           +VGY
Sbjct: 281 LVGY 284


>gi|445927|prf||1910332A Cys endopeptidase
          Length = 362

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 76/281 (27%), Positives = 134/281 (47%), Gaps = 21/281 (7%)

Query: 31  EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
           E   +L+  ++  +  S S  E   RF  F+ ++  +   NK     +  +  + +F+D+
Sbjct: 34  ESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANVMHVHNTNK---MDKPYKLKLNKFADM 90

Query: 91  SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
           +  EF++ +    VN H +    +H      + K  S+          P   DWR+ G +
Sbjct: 91  TNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSV----------PASVDWRKKGAV 140

Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
             V++Q  CG+CWAFST+   E ++ +K   L  LS QE++DC    N GC+GG   +  
Sbjct: 141 TDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAF 200

Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
           +++   K  +  ES YP   ++  C     +   V I  +  +  +  E+++L  +A   
Sbjct: 201 EFIK-QKGGITTESNYPYKAQEGTCDESKVNDLAVSIDGHE-NVPVNDENALLKAVANQ- 257

Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
           PV  A++A    +Q+Y  GV   +C+    ++NH V IVGY
Sbjct: 258 PVSVAIDAGGSDFQFYSEGVFTGDCN---TDLNHGVAIVGY 295


>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
 gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
          Length = 356

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 137/284 (48%), Gaps = 25/284 (8%)

Query: 29  NLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
           NL++  + F SF + Y K+Y+   E + R+  F+ +L  I   N N     +A Y I +F
Sbjct: 48  NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKF 107

Query: 88  SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
           SDLS+ E   +    S+ + V            N  K   +      P   P+  DWRE 
Sbjct: 108 SDLSKSELIAKFTGLSIPERV-----------SNFCKTIILNQP---PDKGPLHFDWREQ 153

Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF- 206
             +  ++NQ  CGACWAF+T+ + ES  A+++  L  LS Q++IDC  + +MGC+GG   
Sbjct: 154 NKVTSIKNQGACGACWAFATLASVESQFAMRHNRLIDLSEQQLIDC-DSVDMGCNGGLLH 212

Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
            A  + M +  V  + E +YP + ++  C      P  V +    C   +      L D+
Sbjct: 213 TAFEEIMRMGGV--QTELDYPFVGRNRRCGLDRHRPYVVSLVG--CYRYVMVNEEKLKDL 268

Query: 267 ATH-GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
               GP+  A++A     Y  GVI  +C+ +   +NHAV +VGY
Sbjct: 269 LRAVGPIPMAIDAADIVNYYRGVIS-SCENN--GLNHAVLLVGY 309


>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 84/275 (30%), Positives = 135/275 (49%), Gaps = 36/275 (13%)

Query: 52  EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNK---HV 108
           E ++R+K F++++  IE  N      +S + G+ +F+DL+EEEFK      ++NK   ++
Sbjct: 55  EKELRYKIFQQNVKGIEGFN--NAGNKSHKLGVNQFADLTEEEFK------AINKLKGYM 106

Query: 109 LMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQ-TCGACWAFST 167
                +     + HV K            +P   DWR+ G +  +++Q   CG+CWAF+ 
Sbjct: 107 WSKISRTSTFKYEHVTK------------VPATLDWRQKGAVTPIKSQGLKCGSCWAFAA 154

Query: 168 VETAESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEY 226
           V   E +  L  G L  LS QE+IDC  NG N GC  G       ++  NK  L  E+ Y
Sbjct: 155 VAATEGITKLTTGELISLSEQELIDCDTNGDNGGCKWGIIQEAFKFIVQNK-GLATEASY 213

Query: 227 PLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYY 284
           P    D  C  K  S +   IK Y  D    +E+++L  +A   PV   V++    +++Y
Sbjct: 214 PYQAVDGTCNAKVESKHVASIKGYE-DVPANNETALLNAVANQ-PVSVLVDSSDYDFRFY 271

Query: 285 LGGVIQYNCDGSLANINHAVQIVGY---DNYSRTW 316
             GV+  +C  +    +HAV +VGY   D+ ++ W
Sbjct: 272 SSGVLSGSCGTTF---DHAVTVVGYGVSDDGTKYW 303


>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
          Length = 361

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 133/278 (47%), Gaps = 29/278 (10%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F +RY K Y S  E  +RF  F K+LD+I   N    S    R G+ +F+D S EEF
Sbjct: 62  FARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGLS---YRLGLNKFADWSWEEF 118

Query: 96  KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           +   L  + N       +HK             +T  +     +P  KDWRE+GI+  V+
Sbjct: 119 QRHRLGAAQNCSATTKGNHK-------------LTADV-----LPETKDWRESGIVSPVK 160

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
           +Q  CG+CW FST  + E+ +    G    LS Q+++DCA    N GC+GG      +++
Sbjct: 161 DQGHCGSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYI 220

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N   L+ E  YP   KD  CK  + +  GV++   + +  + +E  +   +    PV 
Sbjct: 221 KYNG-GLDTEEAYPYTGKDGVCKFSSENV-GVQVLD-SVNITLGAEDELQHAVGLVRPVS 277

Query: 274 AAVNAL-TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
            A   +  +++Y  GV     C  +  ++NHAV  VGY
Sbjct: 278 VAFEVVDGFRFYKSGVYSSTKCGNTPMDVNHAVVAVGY 315


>gi|47230018|emb|CAG10432.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 294

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 79/248 (31%), Positives = 123/248 (49%), Gaps = 22/248 (8%)

Query: 75  QSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITI 134
           Q  +S R G+T+F+D+  EE+K            L+S       + +  +K S    +  
Sbjct: 26  QGIKSYRLGMTQFADMDNEEYKR-----------LISLGCLGAFNASAPRKGSAFFRLAE 74

Query: 135 PTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA 194
            T +P   DWR+ G +  V++Q+ CG+CWAFS   + E  +  K G L  LS Q+++DC+
Sbjct: 75  GTPLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGSLEGQNYRKTGKLVSLSEQQLVDCS 134

Query: 195 GN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
           G+ GNMGC GG   +   ++  N  + + E  YP   +D  C+ K  +  G K   Y  D
Sbjct: 135 GDYGNMGCGGGLMDSAFKYIQENGGI-DTEESYPYEAEDGKCRFKPQNI-GAKCTGYV-D 191

Query: 254 TLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI-QYNCDGSLANINHAVQIVGY- 309
                E ++   +AT GPV  A++A   ++Q Y  GV  +  C  S  +++H V  VGY 
Sbjct: 192 VTAGDEDALKEAVATIGPVSVAIDASHSSFQLYESGVYDELEC--SSEDLDHGVLAVGYG 249

Query: 310 -DNYSRTW 316
            DN    W
Sbjct: 250 TDNGQDYW 257


>gi|118365710|ref|XP_001016075.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297842|gb|EAR95830.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 335

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 93/312 (29%), Positives = 154/312 (49%), Gaps = 30/312 (9%)

Query: 7   VLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKN--FEKSL 64
           +L I+ L+ LC LA  + V      +KL  ++ +  ++++ Y  +EH+  F+   F ++L
Sbjct: 6   LLSIIMLMPLC-LAQDISV------EKLLAYNKWSSQHQRVY-LNEHEKLFRQMVFFENL 57

Query: 65  DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHL-RHSVNKHVLMSHHKHHDHHHNHV 123
             ++E N N  +  S   G+  FSD++++EF  + L +  +  H + S  +   H+  ++
Sbjct: 58  QKVKEHNSNPNNTYSI--GLNLFSDMTKQEFAEKILMKQDLVDHYMKSISQKETHNDVNI 115

Query: 124 KKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
           + +  +  +T+ T I    DWR  G +  V+ Q  CG+CW+F+     ES + ++N  L 
Sbjct: 116 ETQLNSKNLTLATSI----DWRTQGAVTSVKYQGNCGSCWSFAGAALMESFNFIQNKVLV 171

Query: 184 LLSVQEVIDC--AGNG--NMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
             S Q+++DC  + NG  + GC+GG     LD+   +KV +     YP +     C    
Sbjct: 172 DFSEQQLVDCVISANGYQSEGCNGGFSFETLDY--ASKVGITTLDNYPYVEVQKKCNMTG 229

Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYN-CDGSLA 298
           T+ NG K K +     +PS S+ L       PV   VNA  W  Y  G+  YN  D S  
Sbjct: 230 TN-NGFKPKQW---IQVPSTSNDLKHALNFSPVSVYVNAYNWVSYQSGI--YNGSDQSNI 283

Query: 299 NINHAVQIVGYD 310
             NH V  VGYD
Sbjct: 284 VFNHEVLAVGYD 295


>gi|334332714|ref|XP_001367224.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 335

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 88/310 (28%), Positives = 154/310 (49%), Gaps = 34/310 (10%)

Query: 9   FIVALIALCFLAIPVKVSKPNLEQKLE-LFSSFQQRYKKSYSKSEHDIRFKNFEKSLDII 67
           F + L +LC   + +  + P  ++ L+  +  ++ ++ KSY+ +E   R   +EK+L +I
Sbjct: 3   FYLCLASLC---LGLAAAIPPFDRALDSQWHQWKAQHGKSYAANEDSWRRATWEKNLKMI 59

Query: 68  EELNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
           E  N+   + + S +  + +F D+S EEFK            +M+ +K      N  +KR
Sbjct: 60  ERHNQEYSAGKHSFQLRMNKFGDMSTEEFKQ-----------VMNGYKS-----NGSQKR 103

Query: 127 SITTGI--TIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL 184
           +  +    ++   +P   DWRE G +  V+ Q+ C +CWAFS     E     K G L  
Sbjct: 104 TKGSLYRESLLAQLPESVDWREKGYVTPVKEQRGCYSCWAFSAAGAIEGQWFRKTGKLVS 163

Query: 185 LSVQEVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPN 243
           LSVQ ++DC+   GN GC GG       ++  N  + + E  YP + +D  CK +    +
Sbjct: 164 LSVQNLVDCSIPEGNNGCDGGLMGNAFQYVQDNGGI-DTEECYPYVAQDNECKYQPEC-S 221

Query: 244 GVKIKSYTCDTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLAN 299
           G  +  +     IPS  E +++  +A  GP+  A++A   ++++Y  GV  Y+   S + 
Sbjct: 222 GANVTGF---VKIPSTDERALMKAVANVGPISVAIDAGNPSFKFYQSGVY-YDPQCSSSQ 277

Query: 300 INHAVQIVGY 309
           +NH V +VGY
Sbjct: 278 LNHGVLVVGY 287


>gi|405958752|gb|EKC24846.1| Cathepsin L1 [Crassostrea gigas]
          Length = 290

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 86/261 (32%), Positives = 130/261 (49%), Gaps = 27/261 (10%)

Query: 55  IRFKNFEKSLDIIEELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHH 113
           IR   +E +LD I + N   Q    S   G+ EF+DLS EEF           H+     
Sbjct: 4   IRRGIWEANLDYINQHNDEFQRGAHSYTLGLNEFADLSHEEFL----------HLYGGGI 53

Query: 114 KHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
           +  D     V     T  +   +G+P++ DWR+ G +G + NQ  CG+CWAF+     E 
Sbjct: 54  RPRDS----VSSDPDTDIVVDTSGLPLEVDWRKEGWVGPIGNQFACGSCWAFTATGALEG 109

Query: 174 MHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLK 231
               K G L +LSVQ+++DC+   GN GC GG   A   ++ DV  +  E  + YP    
Sbjct: 110 QVRNKTGKLIVLSVQQMMDCSEKWGNHGCEGGLMDAAFKYIHDVGGI--ESNASYPYKPA 167

Query: 232 DAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVI 289
           +  CK   ++    K+K Y    L  SE S++  +AT GP+ AA++A   ++Q Y  GV 
Sbjct: 168 EEKCKFNKSAVV-AKVKGYK--DLPKSEESLMVAVATVGPISAALDASHSSFQLYKSGVY 224

Query: 290 -QYNCDGSLANINHAVQIVGY 309
            + NC  S   ++H++ +VGY
Sbjct: 225 DEPNC--SSGQVDHSLVVVGY 243


>gi|374414520|pdb|3QJ3|A Chain A, Structure Of Digestive Procathepsin L2 Proteinase From
           Tenebrio Molitor Larval Midgut
 gi|374414521|pdb|3QJ3|B Chain B, Structure Of Digestive Procathepsin L2 Proteinase From
           Tenebrio Molitor Larval Midgut
          Length = 331

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 92/284 (32%), Positives = 132/284 (46%), Gaps = 23/284 (8%)

Query: 35  ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSE 92
           E + +F+  Y +SY +  E   R + F+K L+  EE N K RQ   S   G+  F+D++ 
Sbjct: 20  EKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTP 79

Query: 93  EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK-KRSITTGITIPTGIPVKKDWREAGIIG 151
           EE K          H L+      D H N +  K     G+      P   DWR+ G++ 
Sbjct: 80  EEMKAY-------THGLI---MPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQGMVS 129

Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTL--SLLSVQEVIDCAGNGNMGCSGGDFCAL 209
            V+NQ +CG+ WAFS+    ES   + NG    S +S Q+++DC  N  +GCSGG     
Sbjct: 130 PVKNQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVPNA-LGCSGGWMNDA 188

Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTD-IAT 268
             ++  N  + + E  YP  + D  C      PN V  +      L   + ++L D +AT
Sbjct: 189 FTYVAQNGGI-DSEGAYPYEMADGNCHY---DPNQVAARLSGYVYLSGPDENMLADMVAT 244

Query: 269 HGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
            GPV  A +A   +  Y GGV  YN         HAV IVGY N
Sbjct: 245 KGPVAVAFDADDPFGSYSGGVY-YNPTCETNKFTHAVLIVGYGN 287


>gi|19698255|dbj|BAB86770.1| cathepsin L-like [Engraulis japonicus]
          Length = 324

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 80/280 (28%), Positives = 132/280 (47%), Gaps = 22/280 (7%)

Query: 37  FSSFQQRYKKSYSKSEHDIRFKN-FEKSLDIIEELNK-NRQSPESARYGITEFSDLSEEE 94
           F+ ++ ++ KSY   E +   K  +  +   I+  N+   Q   S R G+ +FSD+  EE
Sbjct: 22  FNEWKAKFGKSYPSLEKEAHRKGLWLANHQKIQAHNQLADQGVHSYRQGLNQFSDMDHEE 81

Query: 95  FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
           F+         + VL       ++       R++  G+          DWR +G +  ++
Sbjct: 82  FR---------QTVLTKMDPPKNNRGASEPFRALNVGLAASV------DWRTSGCVSPIK 126

Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
           NQ  CG+CW+FS     ES   L+ G L  LS Q+++DC+G+ GN GC+GG       ++
Sbjct: 127 NQGQCGSCWSFSATGALESQTCLRRGYLPSLSEQQLVDCSGSYGNYGCNGGWPDQAFQYI 186

Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
             N  + + ES YP   +   C    ++ +      Y   T + SES++   +A  GP+ 
Sbjct: 187 QANGGI-DSESYYPYQARVGTCHYN-SAYSAATCSGYQDVTPVGSESALQYYVANVGPLS 244

Query: 274 AAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDNYS 313
            A++A  WQ Y  GV  +N        +HAV +VGY  Y+
Sbjct: 245 IAIDASGWQSYQSGV--FNDPSCSQTADHAVLLVGYGTYN 282


>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
          Length = 360

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 86/277 (31%), Positives = 130/277 (46%), Gaps = 27/277 (9%)

Query: 37  FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
           F+ F  RY K Y S  E   RF+ F  +L +I   NK   S    + G+ EF+DL+ +EF
Sbjct: 61  FARFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLS---YKLGVNEFTDLTWDEF 117

Query: 96  KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
           +   L  + N                ++K  ++         +P  KDWREAGI+  V+N
Sbjct: 118 RRDRLGAAQNCSATT---------KGNLKVTNVV--------LPETKDWREAGIVSPVKN 160

Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
           Q  CG+CW FST    E+ ++   G    LS Q+++DCAG   N GC+GG      +++ 
Sbjct: 161 QGKCGSCWTFSTTGALEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIK 220

Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
            N   L+ E  YP   K+  CK  + +  GVK+   + +  + +E  +   +A   PV  
Sbjct: 221 SNG-GLDTEEAYPYTGKNGLCKFSSENV-GVKVID-SVNITLGAEDELKYAVALVRPVSI 277

Query: 275 AVNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
           A   +    QY  G      C  +  ++NHAV  VGY
Sbjct: 278 AFEVIKGFKQYKSGVYTSTECGNTPMDVNHAVLAVGY 314


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.133    0.406 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,913,491,364
Number of Sequences: 23463169
Number of extensions: 196878336
Number of successful extensions: 791882
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4656
Number of HSP's successfully gapped in prelim test: 1818
Number of HSP's that attempted gapping in prelim test: 777070
Number of HSP's gapped (non-prelim): 8164
length of query: 317
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 175
effective length of database: 9,027,425,369
effective search space: 1579799439575
effective search space used: 1579799439575
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)