BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 017419
         (372 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
 gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score =  563 bits (1452), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 263/359 (73%), Positives = 310/359 (86%), Gaps = 2/359 (0%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           +++STL+FLFF + SSA DMSI+S+++ H H SSWR+D+EV+++Y  WLAKH KT N +G
Sbjct: 5   ISLSTLLFLFF-TLSSAWDMSILSHNHGHHHQSSWRSDNEVISMYNWWLAKHSKTYNKLG 63

Query: 68  HNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
             EKRF+IFK+NLRFIDEHN S NRTYKVGL +FADLTNEEYRA +LGT+SD KRRLMKS
Sbjct: 64  EREKRFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRAKFLGTKSDPKRRLMKS 123

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           K  SQRYA KAGD LPES+DWR+ GAV+ +KDQGSCGSCWAFST+AAVEG+NKIVTGELI
Sbjct: 124 KNPSQRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFSTIAAVEGVNKIVTGELI 183

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           SLSEQELVDCDR  NAGCNGGLMD AFQFII NGG+D+++DYPY   + KCD ++   K 
Sbjct: 184 SLSEQELVDCDRSYNAGCNGGLMDNAFQFIINNGGIDTDKDYPYQAVDGKCDTTKVKNKA 243

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V+IDG+EDV  FDEM+L+KAVA QPVSVAIEA G A Q Y+SGVFTGECGSALDHGVV V
Sbjct: 244 VTIDGFEDVMAFDEMALQKAVAHQPVSVAIEASGMALQFYQSGVFTGECGSALDHGVVIV 303

Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
           GYGTE+G+DYWLVRNSWG DWGENGY+K+QRN++DT TGKCGIAME+SYP+KN+QN  K
Sbjct: 304 GYGTEDGIDYWLVRNSWGRDWGENGYIKMQRNVVDTFTGKCGIAMESSYPIKNTQNPVK 362


>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
 gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
          Length = 356

 Score =  559 bits (1441), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 262/342 (76%), Positives = 298/342 (87%), Gaps = 2/342 (0%)

Query: 27  MSIISYDNNH--DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID 84
           MSI ++D+NH     SSWR+DDEVM+IY+ WL KHGK  N +G   KRF+IFK+NLRFID
Sbjct: 1   MSIFNHDDNHLSHDQSSWRSDDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFID 60

Query: 85  EHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPES 144
           EHNS NRTYKVGL KFADLTN+EYRAM+LGTRSD KRRLMKSK  S+RYA KAGD+LPES
Sbjct: 61  EHNSQNRTYKVGLTKFADLTNQEYRAMFLGTRSDPKRRLMKSKNPSERYAYKAGDKLPES 120

Query: 145 VDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGC 204
           VDWR KGAVNP+KDQGSCGSCWAFSTVAAVEGIN+IVTGELISLSEQELVDCDR  NAGC
Sbjct: 121 VDWRGKGAVNPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGC 180

Query: 205 NGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLK 264
           NGGLMDYAFQFII NGG+D+E+DYPYLG ++ CD  +   K VSIDG+EDV PFDE +L+
Sbjct: 181 NGGLMDYAFQFIINNGGLDTEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQ 240

Query: 265 KAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWG 324
           KAVA QPVSVAIEA G A Q Y+SGVFTGECG+ALDHGVV VGYGTE G+DYWLVRNSWG
Sbjct: 241 KAVAHQPVSVAIEASGMALQFYQSGVFTGECGTALDHGVVVVGYGTEKGLDYWLVRNSWG 300

Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
           ++WGE+GY+K+QRN+ DT TG+CGIAME+SYPVKN QN+AKP
Sbjct: 301 TEWGEHGYIKMQRNVRDTYTGRCGIAMESSYPVKNGQNTAKP 342


>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
          Length = 376

 Score =  548 bits (1411), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 251/344 (72%), Positives = 295/344 (85%), Gaps = 1/344 (0%)

Query: 24  AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFI 83
           AA MSII Y+ N +H SS RTD+EVM IY  WLAKHGK  NG+G  E+RF+IFKDNL+F+
Sbjct: 19  AAHMSIIDYNTNPNHKSSSRTDEEVMGIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFV 78

Query: 84  DEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPE 143
           DEHNS NR+YKVGLN+FADLTNEEYR+M+LGT++D+KRR MKSK AS+RYA +  D LPE
Sbjct: 79  DEHNSENRSYKVGLNRFADLTNEEYRSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPE 138

Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAG 203
           SVDWRE GAV P+KDQGSCGSCWAFSTVAAVEG+N+I TGE+I LSEQELVDCDR  +AG
Sbjct: 139 SVDWRESGAVAPIKDQGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAG 198

Query: 204 CNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSL 263
           CNGGLMDYAF+FII NGG+D+E+DYPY G +  CDP R+N KVVSI+ YEDV P+DEM+L
Sbjct: 199 CNGGLMDYAFEFIINNGGIDTEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMAL 258

Query: 264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSW 323
           KKAVA QPVSVAIEA GRAFQ Y SGVFTGECG ALDHGVV VGYGT+NG D+W+VRNSW
Sbjct: 259 KKAVAHQPVSVAIEASGRAFQLYLSGVFTGECGRALDHGVVVVGYGTDNGADHWIVRNSW 318

Query: 324 GSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA-KP 366
           G+ WGENGY++++RN++D   GKCGIAM+ASYP+KN +N A KP
Sbjct: 319 GTSWGENGYIRMERNVVDNFGGKCGIAMQASYPIKNGENPANKP 362


>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
          Length = 371

 Score =  544 bits (1401), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 256/347 (73%), Positives = 303/347 (87%), Gaps = 4/347 (1%)

Query: 22  SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLR 81
           SSA  +S ++ + NH  SSSWR+DDEVM +Y++W+ +HGK  NG+G  EKRF+IFKDNLR
Sbjct: 15  SSATYISTLTLNQNHPSSSSWRSDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLR 74

Query: 82  FIDEHNSLNR-TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE 140
           FIDEHNS N  TYK+GLNKFADLTN+EYRA +LGTR+D +RRLMKSK+ S RYA +AGD 
Sbjct: 75  FIDEHNSNNNTTYKLGLNKFADLTNQEYRAKFLGTRTDPRRRLMKSKIPSSRYAHRAGDN 134

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           LP+SVDWR+ GAV+PVKDQGSCGSCWAFST+A VEGINKIV+GEL+SLSEQELVDCDR  
Sbjct: 135 LPDSVDWRDHGAVSPVKDQGSCGSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRSY 194

Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
           +AGCNGGLMDYAFQFI+ NGG+D+E+DYPYLG  N+CDP+++NAKVVSIDGYEDV P +E
Sbjct: 195 DAGCNGGLMDYAFQFIMDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDV-PNNE 253

Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLV 319
            +LKKAVA QPVS+AIEAGGRAFQ YESGVF GECG ALDHGVVAVGYGT +NG DYW+V
Sbjct: 254 NALKKAVAHQPVSIAIEAGGRAFQLYESGVFNGECGLALDHGVVAVGYGTDDNGQDYWIV 313

Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
           RNSWGS+WGENGY++++RN ++ NTGKCGIAMEASYPVKN  N  +P
Sbjct: 314 RNSWGSNWGENGYIRMERN-INANTGKCGIAMEASYPVKNGANIIQP 359


>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
           [Vitis vinifera]
          Length = 374

 Score =  543 bits (1398), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 258/361 (71%), Positives = 305/361 (84%), Gaps = 12/361 (3%)

Query: 5   SMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSN 64
           +M L+ISTL+FLFF++SS+A              SSSWR+++EVM +YQ W+AKHGK  N
Sbjct: 11  AMALSISTLLFLFFVASSAAD------------LSSSWRSEEEVMGMYQWWMAKHGKAYN 58

Query: 65  GMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
           G+G  EKRF+IFKDNL+FIDEHN+ NRTYKVGLN+FADLTNEEYRA+YLGTRSD KRR  
Sbjct: 59  GLGEKEKRFEIFKDNLKFIDEHNAQNRTYKVGLNRFADLTNEEYRAIYLGTRSDPKRRFA 118

Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
           K K AS RYA   G+ LPESVDWRE GAVNPVKDQ SCGSCWAFSTVAAVEGIN+IVTGE
Sbjct: 119 KLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGE 178

Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           LISLSEQELVDCD + + GCNGGLMDYAF FII+NGG+D+E+DYPY G + +C+ S +++
Sbjct: 179 LISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSS 238

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
           KVVSIDGYEDV PFDE +L+KAVA QPVSVA+EAGGRA Q Y SG+FTGECG+ALDHG+V
Sbjct: 239 KVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIV 298

Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           AVGYGTENG DYW+VRNSWGS WGENGY++++RN+ D  +GKCGIAMEASYP+KN +N +
Sbjct: 299 AVGYGTENGTDYWIVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPIKNGENPS 358

Query: 365 K 365
           K
Sbjct: 359 K 359


>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
          Length = 469

 Score =  538 bits (1387), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 254/354 (71%), Positives = 294/354 (83%), Gaps = 5/354 (1%)

Query: 11  STLVFLFFI---SSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           S  VFLF +   +S+SA DMSII YD  H   SSWRTD++VM +Y+ WLAKHGK+ N +G
Sbjct: 9   SMAVFLFLLLGLASASAXDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALG 68

Query: 68  HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
             E+RFQIFKDNLRFIDEHN+ NRTYKVGLN+FADLTNEEYR+MYLGTR+ AKRR   S 
Sbjct: 69  EKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKRR--SSN 126

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
             S RYA + GD LPESVDWR+KGAV  VKDQGSCGSCWAFST+AAVEGINKIVTG LIS
Sbjct: 127 KISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLIS 186

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           LSEQELVDCD   N GCNGGLMDYAF+FII NGG+DSE+DYPY  ++ +CD  R+NA VV
Sbjct: 187 LSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAXVV 246

Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
           +IDGYEDV   DE SL+KAVA+QPVSVAIEAGGR FQ Y+SG+FTG CG+ALDHGV AVG
Sbjct: 247 TIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVG 306

Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           YGTENGVDYW+V+NSWG+ WGE GY++++R+L  + TGKCGIAMEASYP+K  Q
Sbjct: 307 YGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPIKKGQ 360


>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 467

 Score =  538 bits (1386), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 254/352 (72%), Positives = 293/352 (83%), Gaps = 3/352 (0%)

Query: 11  STLVFLFFISS-SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           S  VFLF +   +SA DMSII YD  H   SSWRTD++VM +Y+ WLAKHGK+ N +G  
Sbjct: 9   SMAVFLFLLLGLASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALGEK 68

Query: 70  EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
           E+RFQIFKDNLRFIDEHN+ NRTYKVGLN+FADLTNEEYR+MYLGTR+ AKRR   S   
Sbjct: 69  ERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKRR--SSNKI 126

Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
           S RYA + GD LPESVDWR+KGAV  VKDQGSCGSCWAFST+AAVEGINKIVTG LISLS
Sbjct: 127 SDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLS 186

Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
           EQELVDCD   N GCNGGLMDYAF+FII NGG+DSE+DYPY  ++ +CD  R+NAKVV+I
Sbjct: 187 EQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAKVVTI 246

Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
           DGYEDV   DE SL+KAVA+QPVSVAIEAGGR FQ Y+SG+FTG CG+ALDHGV AVGYG
Sbjct: 247 DGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVGYG 306

Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           TENGVDYW+V+NSWG+ WGE GY++++R+L  + TGKCGIAMEASYP+K  Q
Sbjct: 307 TENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPIKKGQ 358


>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
          Length = 372

 Score =  526 bits (1355), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 254/348 (72%), Positives = 301/348 (86%), Gaps = 5/348 (1%)

Query: 22  SSAADMSIISYDNNH-DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNL 80
           SSA  +S ++ + NH   SSSWR+DDEVM +Y++W+ +HGK  NG+G  EKRF+IFKDNL
Sbjct: 15  SSATYISTLTLNQNHPSSSSSWRSDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNL 74

Query: 81  RFIDEHNSLNRT-YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD 139
           RFIDEHNS N T YK+GLNKFADLTN+EYRA +LGTR+D +RRLMKSK+ S RYA +AGD
Sbjct: 75  RFIDEHNSNNNTTYKLGLNKFADLTNQEYRAKFLGTRTDPRRRLMKSKIPSSRYAHRAGD 134

Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK 199
            LP+SV+WR+ GAV+ VKDQGSCGSCWAFS +AAVEGINKIV+GELISLSEQELVDCDR 
Sbjct: 135 NLPDSVNWRDHGAVSRVKDQGSCGSCWAFSAIAAVEGINKIVSGELISLSEQELVDCDRS 194

Query: 200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
            +AGCNGGLMDYAFQFII NGG+D+E+DYPYLG  N+CDP+++NAKVVSIDGYEDV P +
Sbjct: 195 YDAGCNGGLMDYAFQFIIDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDV-PNN 253

Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWL 318
           E +LKKAVA QPVS+AIEAGGRAFQ YESGVF GECG ALDHGVVAVGYG+ +NG DYW+
Sbjct: 254 ENALKKAVAHQPVSIAIEAGGRAFQLYESGVFNGECGLALDHGVVAVGYGSDDNGQDYWI 313

Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
           VRNSWG +WGENGY++++RN ++ NTGKCGIAMEASYPVKN  N  +P
Sbjct: 314 VRNSWGGNWGENGYIRMERN-INANTGKCGIAMEASYPVKNGANIIQP 360


>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
 gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
          Length = 352

 Score =  525 bits (1351), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 246/318 (77%), Positives = 283/318 (88%)

Query: 49  MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
           M++Y+ WLAKHGK  NG+G   +RF+IFK+NLRFIDEHNS N TYKVGL KFADLTNEEY
Sbjct: 1   MSMYKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQNHTYKVGLTKFADLTNEEY 60

Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
           RAM+LGTRSDAKRRLMKSK  S+RYA KAGD+LPESVDWR KGAVNP+KDQGSCGSCWAF
Sbjct: 61  RAMFLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWAF 120

Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           STVAAVEGIN+IVTGELISLSEQELVDCDR  NAGCNGGLMDYAFQFII NGG+D+E+DY
Sbjct: 121 STVAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGGLMDYAFQFIINNGGLDTEKDY 180

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY+G ++KCD  +   K VSIDG+EDV P+DE +L+KAVA QPVSVAIEA G A Q Y+S
Sbjct: 181 PYVGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQFYQS 240

Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           GVFTGECG+ALDHGVV VGY +ENG+DYWLVRNSWG++WGE+GY+K+QRN+ DT TG+CG
Sbjct: 241 GVFTGECGTALDHGVVVVGYASENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTGRCG 300

Query: 349 IAMEASYPVKNSQNSAKP 366
           IAME+SYPVKN +N+AKP
Sbjct: 301 IAMESSYPVKNGENTAKP 318


>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 469

 Score =  523 bits (1346), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 245/337 (72%), Positives = 288/337 (85%), Gaps = 5/337 (1%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSIISY +  +     RTD EVM +Y+ WL KHGK+ N +G  E+RF+IFKDNLRFI+E
Sbjct: 32  DMSIISYGDRLEK----RTDAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEE 87

Query: 86  HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
           HN++NRTYKVGLN+FADLTNEEYR+ YLG R + +R L  S+V S RY+ +AG++LPESV
Sbjct: 88  HNAVNRTYKVGLNRFADLTNEEYRSRYLGRRDETRRGLRASRV-SDRYSFRAGEDLPESV 146

Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
           DWREKGAV PVKDQG+CGSCWAFST+AAVEGIN+I TG+LISLSEQELVDCD+  N GCN
Sbjct: 147 DWREKGAVVPVKDQGNCGSCWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCN 206

Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
           GGLMDYAF+FII NGG+DSE+DYPY  A+  CDP+R+NA+VVSIDGYEDV   DE SLKK
Sbjct: 207 GGLMDYAFEFIINNGGIDSEEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKK 266

Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGS 325
           AVA+QPVSVAIEAGGRAFQ Y+SGVFTG+CG+ LDHGVVAVGYGTEN VDYW+VRNSWG 
Sbjct: 267 AVANQPVSVAIEAGGRAFQLYQSGVFTGQCGTQLDHGVVAVGYGTENSVDYWIVRNSWGP 326

Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           +WGE+GY+KL+RNL  T TGKCGIA+E SYP+KN QN
Sbjct: 327 NWGESGYIKLERNLAGTETGKCGIAIEPSYPIKNGQN 363


>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 366

 Score =  520 bits (1340), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 248/360 (68%), Positives = 288/360 (80%), Gaps = 13/360 (3%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           M L ISTL+FL F  S +    +I +Y           TD+EVMT+Y+ WL KH K  NG
Sbjct: 5   MTLMISTLLFLSFTLSCAIDTSTITNY-----------TDNEVMTMYEEWLVKHQKVYNG 53

Query: 66  MGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
           +G  +KRFQ+FKDNL FI EHN+  N TYK+GLNKFAD+TNEEYR MY GT+SDAKRRLM
Sbjct: 54  LGEKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLM 113

Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
           K+K    RYA  AGD+LP  VDWR KGAV P+KDQGSCGSCWAFSTVA VE INKIVTG+
Sbjct: 114 KTKSTGHRYAYSAGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGK 173

Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
            +SLSEQELVDCDR  N GCNGGLMDYAF+FIIQNGG+D+++DYPY G +  CDP+++NA
Sbjct: 174 FVSLSEQELVDCDRAYNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNA 233

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
           K V+IDGYEDV P+DE +LKKAVA QPVS+AIEA GRA Q Y+SGVFTGECG++LDHGVV
Sbjct: 234 KAVNIDGYEDVPPYDENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVV 293

Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
            VGYG+ENGVDYWLVRNSWG+ WGE+GY K+QRN + T TGKCGI MEASYPVKN  NSA
Sbjct: 294 VVGYGSENGVDYWLVRNSWGTGWGEDGYFKMQRN-VRTPTGKCGITMEASYPVKNGLNSA 352


>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
 gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
          Length = 365

 Score =  520 bits (1340), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 254/365 (69%), Positives = 289/365 (79%), Gaps = 17/365 (4%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MATA+  LA   L+  FF+S S++A               S R+D EV  IY  WLAKHG
Sbjct: 1   MATATTSLA---LLSFFFLSISASA--------------LSRRSDGEVREIYDLWLAKHG 43

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K  NG+   EKRFQIFK+NL+FID+HNS NRTYKVGLN FADLTNEEYRA+YLGTRS   
Sbjct: 44  KAYNGIDEREKRFQIFKENLKFIDDHNSENRTYKVGLNMFADLTNEEYRALYLGTRSPPA 103

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           RR+MK+K AS+RYA    D LPES+DWR +GAV PVK+QGSCGSCWAFST+AAVEGIN+I
Sbjct: 104 RRVMKAKTASRRYAVNNLDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQI 163

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           VTGELISLSEQELV CD+K N+GCNGGLMDYAFQFII NGG+D+E+DYPY   + +CDP+
Sbjct: 164 VTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPT 223

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           R+NAKVVSID YEDV   DE SLKKAVA QPVSVAIEA G A Q Y+SGVFTG+CGSALD
Sbjct: 224 RKNAKVVSIDAYEDVPANDEESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALD 283

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           HGVVAVGYG ENGVDYWLVRNSWG+ WGE+GY KL+RN+     GKCGIAM+ASYPVKN 
Sbjct: 284 HGVVAVGYGKENGVDYWLVRNSWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPVKND 343

Query: 361 QNSAK 365
            N  K
Sbjct: 344 NNPTK 348


>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 455

 Score =  520 bits (1339), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 242/349 (69%), Positives = 291/349 (83%), Gaps = 3/349 (0%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           L+F  F + SSA DMSIISYDN H   ++WRTD+EV ++Y+ WL KHGK  N +G  +KR
Sbjct: 2   LLFALF-ALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKR 60

Query: 73  FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
           FQIFKDNLRFID+ N+ NRTYK+GLN+FADLTNEEYRA YLGT+ D  RRL   +  S R
Sbjct: 61  FQIFKDNLRFIDQQNAENRTYKLGLNRFADLTNEEYRARYLGTKIDPNRRL--GRTPSNR 118

Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
           YA + G+ LP+SVDWR++GAV PVKDQ SCGSCWAFS + AVEGINKIVTG+LISLSEQE
Sbjct: 119 YAPRVGETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQE 178

Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
           LVDCD   N GCNGGLMDYAF+FII+NGG+DSE+DYPY G + +CD  R+NAKVVSIDGY
Sbjct: 179 LVDCDTGYNMGCNGGLMDYAFEFIIKNGGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGY 238

Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
           EDV+ +DE++LKKAVA+QPVSVA+E GGR FQ Y SGVFTG CG+ALDHGVVAVGYGT+N
Sbjct: 239 EDVNTYDELALKKAVANQPVSVAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYGTDN 298

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           G D+W+VRNSWG+DWGE GY++L+RNL ++ +GKCGIA+E SYP+K  Q
Sbjct: 299 GHDFWIVRNSWGADWGEEGYIRLERNLGNSRSGKCGIAIEPSYPIKTGQ 347


>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score =  518 bits (1334), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 249/363 (68%), Positives = 291/363 (80%), Gaps = 7/363 (1%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M   S  +AI+ L  LF   +SSA DMSII+YD  H   SSWRTDDEVM +Y++WL KHG
Sbjct: 1   MKLLSPSMAIALLFALFV--ASSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHG 58

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +G  EKRFQIFKDNLRFIDEHN+  N +YKVGLN+FADLTNEEYR+ YLG +S  
Sbjct: 59  KSYNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKP 118

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           K     SKV S RYA + GD LPESVDWR KGAV P+KDQGSCGSCWAFSTV AVEGIN+
Sbjct: 119 KL----SKVKSDRYAPRVGDSLPESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQ 174

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           IVTGELI+LSEQELVDCD+  N GC+GGLMDY F+FII NGG+D+++DYPYLG + +CD 
Sbjct: 175 IVTGELITLSEQELVDCDKSYNEGCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQ 234

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
            R+NAKVV+ID YEDV   +E +LKKAVA QPVSV IE GGRAFQ Y+SG+FTG+CG+AL
Sbjct: 235 YRKNAKVVTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTAL 294

Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           DHGV  VGYGTE G DYW+VRNSWGS WGE GY++++RNL  T+ GKCGIAME SYP+KN
Sbjct: 295 DHGVNVVGYGTEKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKN 354

Query: 360 SQN 362
            QN
Sbjct: 355 GQN 357


>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
 gi|255640677|gb|ACU20623.1| unknown [Glycine max]
          Length = 366

 Score =  517 bits (1331), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 249/367 (67%), Positives = 293/367 (79%), Gaps = 16/367 (4%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA+ +M   I TL+FL F  S +    +II+Y           TD+EVM +Y+ WL +H 
Sbjct: 1   MASMTM---IYTLLFLSFTLSYAIKTSTIINY-----------TDNEVMAMYEEWLVRHQ 46

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K  N +G  +KRFQ+FKDNL FI EHN+ LN TYK+GLNKFAD+TNEEYRAMYLGT+S+A
Sbjct: 47  KGYNELGKKDKRFQVFKDNLGFIQEHNNNLNNTYKLGLNKFADMTNEEYRAMYLGTKSNA 106

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           KRRLMK+K    RYA  A D LP  VDWR KGAV P+KDQGSCGSCWAFSTVA VE INK
Sbjct: 107 KRRLMKTKSTGHRYAFSARDRLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINK 166

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           IVTG+ +SLSEQELVDCDR  N GCNGGLMDYAF+FIIQNGG+D+++DYPY G +  CDP
Sbjct: 167 IVTGKFVSLSEQELVDCDRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDP 226

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
           +++NAKVV+IDGYEDV P+DE +LKKAVA QPVSVAIEA GRA Q Y+SGVFTG+CG++L
Sbjct: 227 TKKNAKVVNIDGYEDVPPYDENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSL 286

Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           DHGVV VGYG+ENGVDYWLVRNSWG+ WGE+GY K+QRN + T+TGKCGI MEASYPVKN
Sbjct: 287 DHGVVVVGYGSENGVDYWLVRNSWGTGWGEDGYFKMQRN-VRTSTGKCGITMEASYPVKN 345

Query: 360 SQNSAKP 366
             NSA P
Sbjct: 346 GLNSAVP 352


>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
 gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
          Length = 469

 Score =  516 bits (1330), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 249/345 (72%), Positives = 286/345 (82%), Gaps = 8/345 (2%)

Query: 22  SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT---SNGMGHNEKRFQIFKD 78
           SSA DMSI+SYD  H   SSWRTDDEVM IY+ WL K+GK    +N +G  E+RFQ+FKD
Sbjct: 21  SSALDMSIVSYDQTHLTKSSWRTDDEVMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKD 80

Query: 79  NLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR-RLMKSKVASQRYACKA 137
           NLRFIDEHNS NR+YKVGLN+FADLTNEEYR+MYLG RS AKR RL +S   S RY  + 
Sbjct: 81  NLRFIDEHNSENRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRS---SNRYLPRV 137

Query: 138 GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD 197
           GD LP+SVDWR++GAV  VKDQGSCGSCWAFST+AAVEGINKIVTG+LISLSEQELVDCD
Sbjct: 138 GDSLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCD 197

Query: 198 RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSP 257
           R  N GCNGGLMDYAFQFII NGG+DSE+DYPYL  +  CD  R+NAKVV+ID YEDV  
Sbjct: 198 RSYNEGCNGGLMDYAFQFIINNGGIDSEEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPV 257

Query: 258 FDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYW 317
            DE +L+KAVA+QPVSVAIEAGGR FQ Y+SG+FTG CG+ALDHGV AVGYGTENG DYW
Sbjct: 258 NDEKALQKAVANQPVSVAIEAGGREFQFYQSGIFTGRCGTALDHGVAAVGYGTENGKDYW 317

Query: 318 LVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           +VRNSWG  WGE+GY++++RN+  T TGKCGIA+E SYP+K  QN
Sbjct: 318 IVRNSWGKSWGESGYIRMERNIA-TATGKCGIAIEPSYPIKKGQN 361


>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 496

 Score =  516 bits (1328), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 246/352 (69%), Positives = 292/352 (82%), Gaps = 6/352 (1%)

Query: 8   LAISTLVFLFFI-SSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGM 66
           +A++T++ LF + + SSA DMSIISYDN H  +S  R+D+E+M++Y+ WL KHGK  N +
Sbjct: 36  MAMATILLLFTVFAVSSALDMSIISYDNAHAATS--RSDEELMSMYEQWLVKHGKVYNAL 93

Query: 67  GHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
           G  EKRFQIFKDNLRFID+HNS  +RTYK+GLN+FADLTNEEYRA YLGT+ D  RRL  
Sbjct: 94  GEKEKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTNEEYRAKYLGTKIDPNRRL-- 151

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
            K  S RYA + GD+LPESVDWR++GAV PVKDQG CGSCWAFS + AVEGINKIVTGEL
Sbjct: 152 GKTPSNRYAPRVGDKLPESVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGEL 211

Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           ISLSEQELVDCD   N GCNGGLMDYAF+FII NGG+DSE+DYPY G + +CD  R+NAK
Sbjct: 212 ISLSEQELVDCDTGYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRGVDGRCDTYRKNAK 271

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
           VVSID YEDV  +DE++LKKAVA+QPVSVAIE GGR FQ Y SGVFTG CG+ALDHGVVA
Sbjct: 272 VVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVA 331

Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           VGYGT NG DYW+VRNSWG  WGE+GY++L+RNL ++ +GKCGIA+E SYP+
Sbjct: 332 VGYGTANGHDYWIVRNSWGPSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 383


>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
          Length = 374

 Score =  515 bits (1327), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 248/370 (67%), Positives = 297/370 (80%), Gaps = 10/370 (2%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRT---DDEVMTIYQTWLAKHGKTSNGM 66
           I+TL+F  F S S A DMSII Y NNH ++  W     +D+V   Y+ WLA+HG+  N +
Sbjct: 6   ITTLLFALFSSLSYAIDMSIIDYKNNH-YARKWTLQSDEDQVKNRYEMWLAEHGRAYNAL 64

Query: 67  GHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
           G  EKRF+IFKDNLRFI+ HN S NRTYKVGLN+FADLTNEEYR MYLGT+SDA+RR +K
Sbjct: 65  GEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADLTNEEYRTMYLGTKSDARRRFVK 124

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
           SK  SQRYA +  + +P SVDWR++GAV P+K+QGSCGSCWAFSTVAAVEGIN+IVTGE+
Sbjct: 125 SKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFSTVAAVEGINQIVTGEM 184

Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           I+LSEQELVDCDR  N+GCNGGLMDYAF+FII NGGMD+E+ YPY G E +CDP R+N K
Sbjct: 185 ITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGMDTEKHYPYRGVEGRCDPVRKNYK 244

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
           VVSIDGYEDV P +E +L+KAVA QPV VAIEA GRAFQ Y SGVFTGECG  +DHGVV 
Sbjct: 245 VVSIDGYEDV-PRNERALQKAVAHQPVCVAIEASGRAFQLYSSGVFTGECGEEVDHGVVV 303

Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK----NSQ 361
           VGYG+E+GVDYW+VRNSWG+ WGENGYVK++RN+  ++ GKCGI  EASYP K    N +
Sbjct: 304 VGYGSEDGVDYWIVRNSWGTKWGENGYVKMERNVKKSHLGKCGIMTEASYPTKDSAINKR 363

Query: 362 NSAKPKPHSS 371
           N++K +  SS
Sbjct: 364 NTSKEEKISS 373


>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
 gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
          Length = 456

 Score =  514 bits (1324), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 246/350 (70%), Positives = 286/350 (81%), Gaps = 5/350 (1%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           L+FL F + SSA DMSIISY   H   SSWRTDDEVM +Y+ WL KHGK  N +G  EKR
Sbjct: 4   LLFLVF-ALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKR 62

Query: 73  FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
           F+IFKDNL FID+HNS NRTY VGLN+FADLTNEE+R+MYLGTR+  K+RL K+   S R
Sbjct: 63  FEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKT---SDR 119

Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
           YA + GD LP+SVDWR++GAV  VKDQG CGSCWAFST+AAVEGINKIVTG+LI+LSEQE
Sbjct: 120 YAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQE 179

Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
           LVDCD   N GCNGGLMDYAF+FII NGG+D+E DYPYLG + +CD  R+NAKVVSID Y
Sbjct: 180 LVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSY 239

Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
           EDV   DE +LKKAVA+QPVSVAIE GGR FQ Y SGVFTGECG++LDHGV AVGYGTE 
Sbjct: 240 EDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGTEK 299

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           G DYW+VRNSWG  WGE+GY++++RN+  + TGKCGIA+E SYP+K  QN
Sbjct: 300 GKDYWIVRNSWGKSWGESGYIRMERNIA-SPTGKCGIAIEPSYPIKKGQN 348


>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
          Length = 359

 Score =  514 bits (1323), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 242/358 (67%), Positives = 287/358 (80%), Gaps = 14/358 (3%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           + I++L+F   I+ S A D S+             R+++EVMT+Y+ WL KH K  NG+G
Sbjct: 4   ITITSLLFFSLITLSLAMDTSM-------------RSNEEVMTMYEEWLVKHHKVYNGLG 50

Query: 68  HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
             ++RF+IFKDNL FIDEHN+ N TYKVGLNKFAD TNEEYR MYLGT++DAKR +MK K
Sbjct: 51  EKDQRFEIFKDNLGFIDEHNAQNYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVMKIK 110

Query: 128 VAS-QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           + +  RYA  +GD LP  VDWR KGAV  +KDQGSCGSCWAFST+A VE INKIVTG+L+
Sbjct: 111 ITTGHRYAFNSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLV 170

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           SLSEQELVDCDR  N GCNGGLMDYAF+FI++NGG+D+EQDYPY G E +CDP+R+NAKV
Sbjct: 171 SLSEQELVDCDRAFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKV 230

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           VSIDGYEDV  ++E +LKKAV  QPVSVAIEAGGRA Q Y+SGVFTG CG+ LDHGVV V
Sbjct: 231 VSIDGYEDVPAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVV 290

Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           GYG ENGVDYWLVRNSWG++WGE+GY KL+RN+   NTGKCGIAM+ASYPVK  QNSA
Sbjct: 291 GYGFENGVDYWLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPVKYGQNSA 348


>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
          Length = 467

 Score =  514 bits (1323), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 239/347 (68%), Positives = 282/347 (81%)

Query: 12  TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEK 71
           +L  L   ++SSA DMSI+SYD  H   SSWRTDDEVM +Y+ WL KHGK  N +G  EK
Sbjct: 9   SLFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKEK 68

Query: 72  RFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
           RF IFKDNLRFIDEHNS N TY++GLN+FADLTNEEYR+MYLG +  A R   K    S 
Sbjct: 69  RFGIFKDNLRFIDEHNSQNLTYRLGLNRFADLTNEEYRSMYLGVKPGATRVTRKVSRKSD 128

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
           R+A + GD LP+ +DWR++GAV  VKDQGSCGSCWAFST+AAVEGIN+IVTG+LISLSEQ
Sbjct: 129 RFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQ 188

Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
           ELVDCD   N GCNGGLMDYAF+FII NGG+DSE+DYPY  A+ KCD  R+NA VVSIDG
Sbjct: 189 ELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANVVSIDG 248

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
           YEDV   DE +LKKAVA QPVSVAIEAGGRAFQ Y+SGVFTG+CG++LDHGV AVGYGTE
Sbjct: 249 YEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAVGYGTE 308

Query: 312 NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           NG DYW+V NSWG +WGE+GY++++RNL  +++GKCGIA+  SYP+K
Sbjct: 309 NGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGIAIGPSYPIK 355


>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 476

 Score =  513 bits (1322), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 245/358 (68%), Positives = 294/358 (82%), Gaps = 5/358 (1%)

Query: 2   ATASMFLAISTLVFLFFISSSSAADMSIISYDNNH-DHSSSWRTDDEVMTIYQTWLAKHG 60
           + A+M +A   L+F  F + SSA DMSIISYD+ H D +++ RT++E+M++Y+ WL KHG
Sbjct: 9   SPATMTMAAIVLLFTVF-AVSSALDMSIISYDSAHADKAATLRTEEELMSMYEQWLVKHG 67

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K  N +G  EKRFQIFKDNLRFID+HNS  +RTYK+GLN+FADLTNEEYRA YLGT+ D 
Sbjct: 68  KVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNEEYRAKYLGTKIDP 127

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
            RRL   K  S RYA + GD+LP+SVDWR++GAV PVKDQG CGSCWAFS + AVEGINK
Sbjct: 128 NRRL--GKTPSNRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINK 185

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           IVTGELISLSEQELVDCD   N GCNGGLMDYAF+FII NGG+DS++DYPY G + +CD 
Sbjct: 186 IVTGELISLSEQELVDCDTGYNQGCNGGLMDYAFEFIINNGGIDSDEDYPYRGVDGRCDT 245

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
            R+NAKVVSID YEDV  +DE++LKKAVA+QPVSVAIE GGR FQ Y SGVFTG CG+AL
Sbjct: 246 YRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTAL 305

Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           DHGVVAVGYGT  G DYW+VRNSWGS WGE+GY++L+RNL ++ +GKCGIA+E SYP+
Sbjct: 306 DHGVVAVGYGTAKGHDYWIVRNSWGSSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 363


>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
          Length = 467

 Score =  511 bits (1315), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 249/362 (68%), Positives = 292/362 (80%), Gaps = 10/362 (2%)

Query: 2   ATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGK 61
           ++A+MF+    L+FL F + SSA+DMSIISYD  H   SSWRTDDEVM IY+ WL K GK
Sbjct: 7   SSAAMFV----LLFLSF-TLSSASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGK 61

Query: 62  TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR 121
             N +G  EKRFQ+FKDNLRFIDEHNS NRTYK+GLN FADLTNEEYR+ YLG R   KR
Sbjct: 62  VYNALGEREKRFQVFKDNLRFIDEHNSENRTYKLGLNGFADLTNEEYRSTYLGARGGMKR 121

Query: 122 -RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
            RL K+   S RYA + G+ LP+SVDWR++GAV  VKDQGSCGSCWAFST+AAVEGINKI
Sbjct: 122 NRLRKT---SDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKI 178

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           VTG+LISLSEQELVDCD   N GCNGGLMDYAF+FII NGG+D+E+DYPYL  + +CD  
Sbjct: 179 VTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDTY 238

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           R+NAKVV+ID YEDV    E +L+KAVA+QPVSVAIEAGGR FQ Y SG+F+G CG+ LD
Sbjct: 239 RKNAKVVTIDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGRCGTQLD 298

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           HGV AVGYGTENG DYW+VRNSWG  WGENGY+++ R+ +++ TG CGIAMEASYP+K  
Sbjct: 299 HGVAAVGYGTENGKDYWIVRNSWGKSWGENGYLRMARS-INSPTGICGIAMEASYPIKKG 357

Query: 361 QN 362
           QN
Sbjct: 358 QN 359


>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
          Length = 465

 Score =  511 bits (1315), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 242/341 (70%), Positives = 280/341 (82%), Gaps = 4/341 (1%)

Query: 22  SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLR 81
           SSA DMSIISY   H   SSWRTDDEVM +Y+ WL KHGK  N +G  EKRF+IFKDNL 
Sbjct: 21  SSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLM 80

Query: 82  FIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           FID+HNS NRTY VGLN+FADLTNEE+R+MYLGTR+  K+RL K+   S RYA + GD L
Sbjct: 81  FIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKT---SDRYAPRVGDSL 137

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           P+SVDWR++GAV  VKDQG CGSCWAFST+AAVEGINKIVTG+LI+LSEQELVDCD   N
Sbjct: 138 PDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYN 197

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMDYAF+FII NGG+D+E DYPYLG + +CD  R+NAKVVSID YEDV   DE 
Sbjct: 198 EGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDET 257

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           +LKKAVA+QPVSVAIE GGR FQ Y SGVFTGECG++LDHGV AVGYGTE G DYW+VRN
Sbjct: 258 ALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGTEKGKDYWIVRN 317

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           SWG  WGE+GY++++RN+  + TGKCGIA+E SYP+K  QN
Sbjct: 318 SWGKSWGESGYIRMERNIA-SPTGKCGIAIEPSYPIKKGQN 357


>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
          Length = 457

 Score =  510 bits (1313), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 245/351 (69%), Positives = 285/351 (81%), Gaps = 4/351 (1%)

Query: 13  LVFLFFISS-SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEK 71
            + LFF S+ SSA+D+SIISYD +H   SSWRTDDEVM IY+ WL KHGK  N +G  E+
Sbjct: 2   FMLLFFASTLSSASDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLVKHGKAYNSLGEKER 61

Query: 72  RFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
           RF++FKDNLRFIDEHNS NRTY+VGLN+FADLTNEEYR+MYLG  S  +R   K +  S 
Sbjct: 62  RFEVFKDNLRFIDEHNSENRTYRVGLNRFADLTNEEYRSMYLGALSGIRRN--KLRKISD 119

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
           RY  + GD LP+SVDWR++GAV  VKDQGSCGSCWAFS VAAVEGINKIVTG+LISLSEQ
Sbjct: 120 RYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISLSEQ 179

Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
           ELVDCD   N GCNGGLMDY F+FII NGG+DSE+DYPYL  + +CD  R+NA+VVSID 
Sbjct: 180 ELVDCDNSYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRCDTYRKNARVVSIDS 239

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
           YEDV   +E +L+KAVA+QPVSVAIEAGGR FQ Y SGVF+G CG+ALDHGVVAVGYGTE
Sbjct: 240 YEDVPVNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGVFSGRCGTALDHGVVAVGYGTE 299

Query: 312 NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           NG DYW+VRNSWG  WGE+GY+++ RN+    TG CGIAMEASYP+K  QN
Sbjct: 300 NGQDYWIVRNSWGKSWGESGYLRMARNIRKP-TGICGIAMEASYPIKKGQN 349


>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
 gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
          Length = 471

 Score =  510 bits (1313), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 243/357 (68%), Positives = 291/357 (81%), Gaps = 10/357 (2%)

Query: 10  ISTLVFL-FFISSSSAA-------DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGK 61
           +++L FL FFI  S          DMSI+ Y+  H      RTD +V  +Y+ WL +HGK
Sbjct: 1   MASLKFLAFFILFSGLLSSFSSALDMSIVDYNIKHGTKYPLRTDSQVRRMYEMWLVEHGK 60

Query: 62  TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR 121
             N +G  EKRF+IFKDNLRFIDEHNS++R+YKVGLN+FADLTNEEY+AM+LGT+ + K 
Sbjct: 61  AYNALGEKEKRFEIFKDNLRFIDEHNSVDRSYKVGLNRFADLTNEEYKAMFLGTKMERKN 120

Query: 122 RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
           R + ++  SQRY  K GD+LPE+VDWREKGAV PVKDQG CGSCWAFSTV AVEGIN+IV
Sbjct: 121 RFLGTR--SQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGSCWAFSTVGAVEGINQIV 178

Query: 182 TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
           TGELISLSEQELVDCD+  N GCNGGLMDYAF+FII NGG+D+E+DYPY  ++N CDP+R
Sbjct: 179 TGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDTEEDYPYKASDNICDPNR 238

Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
           +NAKVV+IDGYEDV   DE SLKKAVA QPVSVAIEAGGRAFQ Y+SGVFTG CG+ LDH
Sbjct: 239 KNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAFQLYKSGVFTGRCGTELDH 298

Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           GVVAVGYGTENGV+YW+VRNSWGS WGE+GY++++RN+ +T TGKCGIA++ SYP K
Sbjct: 299 GVVAVGYGTENGVNYWIVRNSWGSAWGESGYIRMERNVANTKTGKCGIAIQPSYPTK 355


>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
 gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  509 bits (1312), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 240/356 (67%), Positives = 288/356 (80%), Gaps = 3/356 (0%)

Query: 4   ASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS 63
           AS++ + + L   +F+S   A DMSII Y+  H      RT+ E + +Y+ WL K+GK  
Sbjct: 2   ASLYRSFAFLATFYFLSVCLAIDMSIIDYNLKHGQVPE-RTEAETLRLYEMWLVKYGKAY 60

Query: 64  NGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
           N +G  E+RF+IFKDNL+F+D+HNS+ N +YK+GLNKFADL+NEEYRA YLGTR D KRR
Sbjct: 61  NALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRR 120

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
           L+     S RY  K GD+LPESVDWREKGAV PVKDQG CGSCWAFSTV AVEGIN+IVT
Sbjct: 121 LLGGP-KSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVT 179

Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
           G L SLSEQELVDCD+  N GCNGGLMDYAF+FI++NGG+D+E+DYPY   ++ CDP+R+
Sbjct: 180 GNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDSMCDPNRK 239

Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
           NA+VV+IDGYEDV   DE SL+KAVA+QPVSVAIEAGGRAFQ Y+SGVFTG CG+ LDHG
Sbjct: 240 NARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGVFTGSCGTQLDHG 299

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           VVAVGYGTENGVDYW+VRNSWG  WGENGY++++RN+  T TGKCGIAMEASYP K
Sbjct: 300 VVAVGYGTENGVDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGIAMEASYPTK 355


>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
 gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
          Length = 457

 Score =  506 bits (1302), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 238/355 (67%), Positives = 286/355 (80%), Gaps = 5/355 (1%)

Query: 13  LVFLFFISS---SSAADMSIISYDNNH-DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
           L+ +  ISS   S A DMSIISYD  H D S+S RT+ EV+T+Y+ WL KHGK+ NG+G 
Sbjct: 12  LMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGLGE 71

Query: 69  NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK-SK 127
            +KRF+IFKDNL+FIDEHN LN TY++GL +FADLTNEEYR+ +LGT+ D  RR+ K   
Sbjct: 72  KDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKLGG 131

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
             S RYA + GD+LPESVDWR++GAV  VKDQ SCGSCWAFS +AAVEGINKIVTG+LIS
Sbjct: 132 SKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLIS 191

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           LSEQELVDCD   N GCNGGLMDYAF+FII NGG+DSE DYPY   + +CD +R+NAKVV
Sbjct: 192 LSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVV 251

Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
           +ID YEDV  +DE++L+KAVA+QP++VA+E GGR FQ YE GVFTG CG+ALDHGV AVG
Sbjct: 252 TIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAAVG 311

Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           YGTENG DYW+VRNSWG  WGE GY++L+RNL  +  GKCGIA+E SYP+KN QN
Sbjct: 312 YGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIKNGQN 366


>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
 gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
 gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  505 bits (1301), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 238/355 (67%), Positives = 286/355 (80%), Gaps = 5/355 (1%)

Query: 13  LVFLFFISS---SSAADMSIISYDNNH-DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
           L+ +  ISS   S A DMSIISYD  H D S+S RT+ EV+T+Y+ WL KHGK+ NG+G 
Sbjct: 12  LMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGLGE 71

Query: 69  NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK-SK 127
            +KRF+IFKDNL+FIDEHN LN TY++GL +FADLTNEEYR+ +LGT+ D  RR+ K   
Sbjct: 72  KDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKLGG 131

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
             S RYA + GD+LPESVDWR++GAV  VKDQ SCGSCWAFS +AAVEGINKIVTG+LIS
Sbjct: 132 SKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLIS 191

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           LSEQELVDCD   N GCNGGLMDYAF+FII NGG+DSE DYPY   + +CD +R+NAKVV
Sbjct: 192 LSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVV 251

Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
           +ID YEDV  +DE++L+KAVA+QP++VA+E GGR FQ YE GVFTG CG+ALDHGV AVG
Sbjct: 252 TIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAAVG 311

Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           YGTENG DYW+VRNSWG  WGE GY++L+RNL  +  GKCGIA+E SYP+KN QN
Sbjct: 312 YGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIKNGQN 366


>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
          Length = 366

 Score =  505 bits (1300), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 241/358 (67%), Positives = 283/358 (79%), Gaps = 13/358 (3%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           L  STL+FL F  S +    +I +Y           TD+EVMT+Y+ WL KH K  NG+ 
Sbjct: 7   LVTSTLLFLSFTLSCAIDTSTITNY-----------TDNEVMTMYEEWLVKHQKVYNGLR 55

Query: 68  HNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
             +KRFQ+FKDNL FI EHN+  N TYK+GLN+FAD+TNEEYR MY GT+SDAKRRLMK+
Sbjct: 56  EKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKT 115

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           K    RYA  AGD LP  VDWR KGAV P+KDQGSCGSCWAFSTVA VE INKIVTG+ +
Sbjct: 116 KSTGHRYAYSAGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFV 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           SLSEQELVDCDR  N GCNGGLMDYAF+FIIQNGG+D+++DYPY G +  CDP+++NAKV
Sbjct: 176 SLSEQELVDCDRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKV 235

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V+IDG+EDV P+DE +LKKAVA QPVS+AIEA GR  Q Y+SGVFTG+CG++LDHGVV V
Sbjct: 236 VNIDGFEDVPPYDENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVV 295

Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           GYG+ENGVDYWLVRNSWG+ WGE+GY K+QRN + T TGKCGI MEASYPVKN   SA
Sbjct: 296 GYGSENGVDYWLVRNSWGTGWGEDGYFKMQRN-VRTPTGKCGITMEASYPVKNGLISA 352


>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 364

 Score =  503 bits (1295), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 239/359 (66%), Positives = 282/359 (78%), Gaps = 12/359 (3%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           + I TL+ L F  S + A MSII+Y  N           EVM +Y+ WL KH K  NG+ 
Sbjct: 4   MLIPTLLLLSFTFSHATA-MSIINYSEN-----------EVMDMYEEWLVKHRKVYNGLD 51

Query: 68  HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
             EKRFQ+FKDNL FI +HN+ N TY +GLNKFAD+TNEEYRAMYLGTR+DAKRR+MK++
Sbjct: 52  EKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQ 111

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
               RYA  +GD+LP  VDWR KGAV P+KDQG+CGSCWAFSTVAAVEGIN IVTGE +S
Sbjct: 112 NTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVS 171

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           LSEQELVDCDR+ + GCNGGLMDYAFQFIIQNGG+D+E+DYPY G +  CD +++  KVV
Sbjct: 172 LSEQELVDCDREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQTKKKTKVV 231

Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
            IDGYEDV   +E +LKKAV+ QPVSVAIEA GRA Q Y+SGVFTG+CG+ALDHGVV VG
Sbjct: 232 QIDGYEDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVG 291

Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
           YGTENGVDYWLVRNSWG+ WGE+GY K++RN+  T+ GKCGIAM+ SYPVK   NSA P
Sbjct: 292 YGTENGVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVKYGLNSAVP 350


>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
          Length = 374

 Score =  503 bits (1295), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 241/356 (67%), Positives = 288/356 (80%), Gaps = 10/356 (2%)

Query: 24  AADMSIISYDNNHDHSSSWRT---DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNL 80
           A DMSII Y NNH ++  W     +D+V   Y+ WLA+HG+  N +G  EKRF+IFKDNL
Sbjct: 20  AIDMSIIDYKNNH-YARKWTLQSDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNL 78

Query: 81  RFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD 139
           RFI+EHN S NRTYKVGLN+FADLTNEEYR MYLGT+SDA+RR +KSK  SQRYA +  +
Sbjct: 79  RFIEEHNNSGNRTYKVGLNQFADLTNEEYRTMYLGTKSDARRRFVKSKNPSQRYASRPNE 138

Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK 199
            +P SVDWR++GAV P+K+QGSCGSCWAFSTVAAV GIN+IVTGE+I+LSEQELVDCDR 
Sbjct: 139 LMPHSVDWRKRGAVAPIKNQGSCGSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCDRV 198

Query: 200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
            N+GCNGGLMDYAF+FII NGGMD+E+ YPY G E +CDP R+N KVVSIDGYEDV P +
Sbjct: 199 QNSGCNGGLMDYAFEFIISNGGMDTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDV-PRN 257

Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
           E +L+KAVA QPV VAIEA GRAFQ Y SGVFTGECG  +DHGVV VGYG+E+GVDYW+V
Sbjct: 258 ERALQKAVAHQPVCVAIEASGRAFQLYSSGVFTGECGEEVDHGVVVVGYGSEDGVDYWIV 317

Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK----NSQNSAKPKPHSS 371
           RNSWG+ WGENGYVK++RN+  ++ GKCGI  EASYP K    N +N++K +  SS
Sbjct: 318 RNSWGTKWGENGYVKMERNVKKSHLGKCGIMTEASYPTKDSAINKRNTSKEEKISS 373


>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
          Length = 368

 Score =  502 bits (1293), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 236/352 (67%), Positives = 283/352 (80%), Gaps = 12/352 (3%)

Query: 15  FLFF--ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           FLFF  I+ S A D+ +           + R++DEVMT+Y+ WL KH K  NG+   ++R
Sbjct: 10  FLFFSLITFSLALDIQL----------PTGRSNDEVMTMYEEWLVKHQKVYNGLREKDQR 59

Query: 73  FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
           FQIFKDNL FIDEHN+ N TY VGLNKFAD+TNEEYR MYLGTRSD KRR+MK+K+   R
Sbjct: 60  FQIFKDNLNFIDEHNAQNYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHR 119

Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
           YA  +GD LP  VDWR KGA+  +KDQGSCGSCWAFST+A VE INKIVTG+L+SLSEQE
Sbjct: 120 YAYNSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQE 179

Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
           LVDCDR  N GCNGGLMDYAF+FII NGG+D++Q YPY G E +CDP+R+ AK+VSIDGY
Sbjct: 180 LVDCDRAFNEGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVSIDGY 239

Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
           EDV   +E +LKKAVA QPVSVAIEA GRA Q Y+SGVFTG+CG++LDH VV VGYG+EN
Sbjct: 240 EDVPSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYGSEN 299

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           G+DYWLVRNSWG++WGE+GY K++RN+  T+TGKCGIA+EASYPVK  +NSA
Sbjct: 300 GLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPVKYGKNSA 351


>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
 gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  501 bits (1291), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 236/357 (66%), Positives = 290/357 (81%), Gaps = 4/357 (1%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           + +  +VF  F +++ A DMSIISYD  H   SS R+D EV  IY+ W  KHGK +N + 
Sbjct: 10  MLVILIVFTLF-TATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNNNID 68

Query: 68  HNEK--RFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM- 124
            +EK  RF+IFKDNL+FIDEHN+ NRTYKVGLN+FADL+NEEYR+ YLGT+ D    +M 
Sbjct: 69  GSEKDKRFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMMMA 128

Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
           ++K  S RYA   GD+LP+SVDWR +GAV  VKDQGSCGSCWAFST+AAVEGINKIVTGE
Sbjct: 129 RTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVTGE 188

Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           L+SLSEQELVDCDR +NAGC+GGLM+YAF+FII NGG+DS++DYPY G + KCD  ++NA
Sbjct: 189 LVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVDGKCDQYKKNA 248

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
           +VVSID YE V  +DE++LKKAVA+QP+SVAIEAGGR FQ Y SG+FTG+CG+ALDHGV 
Sbjct: 249 RVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGKCGTALDHGVT 308

Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           AVGYGTENGVDYW+VRNSWG  WGE+GYV+++RNL  +  GKCGI M++SYP+K  Q
Sbjct: 309 AVGYGTENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGIVMQSSYPIKKGQ 365


>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
 gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
          Length = 364

 Score =  501 bits (1290), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 238/359 (66%), Positives = 282/359 (78%), Gaps = 12/359 (3%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           + I TL+ L F  S + A MSII+Y  N           EVM +Y+ WL KH K  NG+ 
Sbjct: 4   MLIPTLLLLSFTFSHATA-MSIINYSEN-----------EVMDMYEEWLVKHRKVYNGLD 51

Query: 68  HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
             EKRFQ+FKDNL FI +HN+ N TY +GLNKFAD+TN+EYRAMYLGTR+DAKRR+MK++
Sbjct: 52  EKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQ 111

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
               RYA  +GD+LP  VDWR KGAV P+KDQG+CGSCWAFSTVAAVEGIN IVTGE +S
Sbjct: 112 NTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVS 171

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           LSEQELVDCDR+ + GCNGGLMDYAFQFIIQNGG+D+E+DYPY G +  CD +++  KVV
Sbjct: 172 LSEQELVDCDREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDETKKKTKVV 231

Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
            IDGYEDV   +E +LKKAV+ QPVSVAIEA GRA Q Y+SGVFTG+CG+ALDHGVV VG
Sbjct: 232 QIDGYEDVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVG 291

Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
           YGTENGVDYWLVRNSWG+ WGE+GY K++RN+  T+ GKCGIAM+ SYPVK   NSA P
Sbjct: 292 YGTENGVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVKYGLNSAVP 350


>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score =  501 bits (1289), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 239/361 (66%), Positives = 291/361 (80%), Gaps = 8/361 (2%)

Query: 7   FLAISTLVFLFFISSSSAADMSIISYDNNH----DHSSSWRTDDEVMTIYQTWLAKHGKT 62
            + ++TL F   IS  SA DMSII+YD  H      S+  RTDDEV  +Y++WL KHGKT
Sbjct: 3   LIPMATLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKT 62

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS-DAKR 121
            N +G  ++RFQIFKDNLRFIDEHNS + TYK+GLNKFADLTNEEYR  Y G ++ D K+
Sbjct: 63  YNALGEKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMTYTGIKTIDDKK 122

Query: 122 RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
           +L  SK+ S RYA ++GD LPE VDWRE+GAV  VKDQGSCGSCWAFST  +VEG+NKIV
Sbjct: 123 KL--SKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIV 180

Query: 182 TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
           TG+LIS+SEQELV+CD   N GCNGGLMDYAF+FII+NGG+D+E+DYPY G + KCD ++
Sbjct: 181 TGDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKCDKNK 240

Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
           +NAKVV+ID YEDV   DE SLKKAV++QPV+VAIEAGGR FQ Y SG+FTG CG+ALDH
Sbjct: 241 KNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIFTGSCGTALDH 300

Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           GV+A GYGTE+G DYWLV+NSWG++WGE GY+K++RN+ D  +GKCGIAMEASYP+KN  
Sbjct: 301 GVLAAGYGTEDGKDYWLVKNSWGAEWGEGGYLKMERNIAD-KSGKCGIAMEASYPIKNGD 359

Query: 362 N 362
           N
Sbjct: 360 N 360


>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
          Length = 471

 Score =  500 bits (1288), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 245/366 (66%), Positives = 288/366 (78%), Gaps = 7/366 (1%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDH-SSSWRTDDEVMTIYQTWLAKH 59
           MA     L I+ + FLF + S S A MSII YD   D   S+ RT+  +M +Y+ WL KH
Sbjct: 1   MAPPPFRLCIA-ISFLFMVFSLSLASMSIIDYDLPADPLQSTERTEAHMMKMYEHWLVKH 59

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
           GK  N +G  E+RF+IFKDNLRF+DE NS+  RTYK+GL KFADLTNEEYRAMYLG + +
Sbjct: 60  GKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKME 119

Query: 119 AKRRLMKSKVASQRYACKAG--DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
            K +L   +  SQRY  KAG  D+LP  VDWREKGAV  VKDQG CGSCWAFSTV +VEG
Sbjct: 120 KKEKLRTER--SQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEG 177

Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
           IN+IVTG+LISLSEQELVDCD+  N GCNGGLMDYAF+FII+NGG+DSE DYPY  ++N 
Sbjct: 178 INQIVTGDLISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKNGGIDSEADYPYRASDNM 237

Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
           CD +R+NA VV+IDGYEDV   DE SLKKAVA+QPVSVAIEAGGR FQ Y+SGVFTG CG
Sbjct: 238 CDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQSGVFTGRCG 297

Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           + LDHGVVAVGYGTENG+DYW+VRNSWG  WGE+GY++++RN+  T+TGKCGIAMEASYP
Sbjct: 298 TNLDHGVVAVGYGTENGIDYWIVRNSWGPKWGESGYIRMERNVASTDTGKCGIAMEASYP 357

Query: 357 VKNSQN 362
            K  QN
Sbjct: 358 TKKGQN 363


>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score =  500 bits (1287), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 242/350 (69%), Positives = 282/350 (80%), Gaps = 7/350 (2%)

Query: 9   AISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
           +++ L+FL F + SSA DMSIISYD  H      RTD E M IY+ WL  HGK  N +G 
Sbjct: 8   SVACLLFLCF-AFSSALDMSIISYDQTH---PPQRTDAEAMAIYEKWLTTHGKAYNAIGE 63

Query: 69  NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            E+RF+IFKDNLRF+DEHN++  +Y+VGLN+FADLTNEEYR+M+LG   + K R   +K 
Sbjct: 64  KERRFEIFKDNLRFVDEHNAVAGSYRVGLNRFADLTNEEYRSMFLGGNMEMKERSASTK- 122

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
            S RYA +AGD+LP SVDWREKGAV+PVKDQG CGSCWAFST++AVEGIN+IVTGELISL
Sbjct: 123 -SDRYAFRAGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISL 181

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           SEQELVDCD+  N GCNGGLMDY FQFII NGG+D+E+DYPY   +  CD  R+NA+VVS
Sbjct: 182 SEQELVDCDKSYNMGCNGGLMDYGFQFIINNGGIDTEEDYPYRAVDGTCDQFRKNARVVS 241

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
           I+GYEDV   DE SLKKAVA+QPVSVAIEAGGRAFQ YESGVFTG CG+ LDHGVVAVGY
Sbjct: 242 INGYEDVPEDDENSLKKAVANQPVSVAIEAGGRAFQLYESGVFTGHCGTNLDHGVVAVGY 301

Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           GTENGVDYW VRNSWG  WGENGY+KL+RN ++  +GKCGIA  ASYP K
Sbjct: 302 GTENGVDYWTVRNSWGPKWGENGYIKLERN-INATSGKCGIASMASYPTK 350


>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
          Length = 474

 Score =  493 bits (1270), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 236/367 (64%), Positives = 288/367 (78%), Gaps = 6/367 (1%)

Query: 2   ATASMFLAISTLVFLF----FISSSSAADMSIISYDNNHD-HSSSWRTDDEVMTIYQTWL 56
           ++A+M      LV  F    F+  SSA+DMSII+YD  H  +S   RT D+++++Y++WL
Sbjct: 5   SSATMSPRPQCLVLFFSLASFLMLSSASDMSIITYDETHGLNSPPLRTHDQLLSLYESWL 64

Query: 57  AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGT 115
            KH K  N +G  E RF IFKDN+ F+D HNS+ N++YK+GLNKFADLTN+EYR++YL  
Sbjct: 65  VKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRSLYLSG 124

Query: 116 RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVE 175
           +   + R  +    S R+  + GD LPESVDWR++GAV PVKDQG CGSCWAFSTV AVE
Sbjct: 125 KMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCWAFSTVGAVE 184

Query: 176 GINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
           GINKIVTGELISLSEQELVDCD   N GCNGGLMDYAF+FI++NGG+D+E DYPY G + 
Sbjct: 185 GINKIVTGELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKNGGIDTEDDYPYKGVDG 244

Query: 236 KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGEC 295
            CD +R+NAKVV+I+GYEDV   DE SLKKAVA QPVSVAIEAGGRAFQ YESGVFTG+C
Sbjct: 245 LCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGVFTGQC 304

Query: 296 GSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
           G+ LDHGVVAVGYG+ENG DYW+VRNSWG DWGE+GY++L+RN+  T+TGKCGIAM+ASY
Sbjct: 305 GTELDHGVVAVGYGSENGKDYWIVRNSWGPDWGESGYIRLERNVASTSTGKCGIAMQASY 364

Query: 356 PVKNSQN 362
           P K   N
Sbjct: 365 PTKTGDN 371


>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
          Length = 462

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 238/357 (66%), Positives = 292/357 (81%), Gaps = 4/357 (1%)

Query: 7   FLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGM 66
            L +S  V     S+S++ADMSII+YD  H      R+D+EVM +Y++WL +HGK+ NG+
Sbjct: 4   LLILSLFVLAAVSSASASADMSIITYDEEHPAKGLSRSDEEVMALYESWLVEHGKSYNGL 63

Query: 67  G-HNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
           G   +KRF+IFKDNLR+IDE NS  +R+YK+GLN+FADLTNEEYR+ YLG ++DA+RR+ 
Sbjct: 64  GGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNRFADLTNEEYRSTYLGAKTDARRRIA 123

Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
           K+K + +RYA KAG  LP+S+DWREKGAV  VKDQGSCGSCWAFST+AAVEGIN+IVTGE
Sbjct: 124 KTK-SDRRYAPKAGGSLPDSIDWREKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGE 182

Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           LISLSEQELVDCD   N GCNGGLMDYAF+FII+NGG+D+E DYPY G   +CD +R+NA
Sbjct: 183 LISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTEADYPYTGRYGRCDQTRKNA 242

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
           KVVSIDGYEDV+P+DE +LK+AVA QPVSVAIEAGGR FQ Y SG+FTG CG+ LDHGV 
Sbjct: 243 KVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEAGGRDFQLYSSGIFTGSCGTDLDHGVT 302

Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           AVGYGTENGVDYW+V+NSW + WGE GY+++QRN+ D N G CGIA+E SYP K  +
Sbjct: 303 AVGYGTENGVDYWIVKNSWAASWGEKGYLRMQRNVKDKN-GLCGIAIEPSYPTKTGE 358


>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
 gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
 gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
          Length = 466

 Score =  490 bits (1261), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 234/359 (65%), Positives = 288/359 (80%), Gaps = 6/359 (1%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA  S  L IS L+ L F + SSA+DMSIISYD  H H    RTDDEV  +Y++WL +HG
Sbjct: 1   MAAHSSTLTISILLMLIFSTLSSASDMSIISYDETHIHR---RTDDEVSALYESWLIEHG 57

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +G  +KRFQIFKDNLR+IDE NS+ N++YK+GL KFADLTNEEYR++YLGT+S  
Sbjct: 58  KSYNALGEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSG 117

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
            R+ + SK  S RY  K GD LPES+DWREKG +  VKDQGSCGSCWAFS VAA+E IN 
Sbjct: 118 DRKKL-SKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINA 176

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           IVTG LISLSEQELVDCDR  N GC+GGLMDYAF+F+I+NGG+D+E+DYPY      CD 
Sbjct: 177 IVTGNLISLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQ 236

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
            R+NAKVV ID YEDV   +E +L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+
Sbjct: 237 YRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAV 296

Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           DHGVV  GYGTENG+DYW+VRNSWG++WGENGY+++QRN+  +++G CG+A+E SYPVK
Sbjct: 297 DHGVVIAGYGTENGMDYWIVRNSWGANWGENGYLRVQRNVA-SSSGLCGLAIEPSYPVK 354


>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
 gi|255636658|gb|ACU18666.1| unknown [Glycine max]
          Length = 367

 Score =  490 bits (1261), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 222/366 (60%), Positives = 284/366 (77%), Gaps = 3/366 (0%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M T    +A   +V    ++ SSA DMSIISYD +H   S W++D+EVM+IY+ WL KHG
Sbjct: 1   MGTNRSLMATILIVLFTVLAVSSALDMSIISYDRSHADKSGWKSDEEVMSIYEEWLVKHG 60

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K  N +   EKRFQIFKDNL FI+EHN++NRTYKVGLN+F+DL+NEEYR+ YLGT+ D  
Sbjct: 61  KVYNAVEEKEKRFQIFKDNLNFIEEHNAVNRTYKVGLNRFSDLSNEEYRSKYLGTKIDPS 120

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           R + +    S+RY+ +  D LPESVDWR++GAV  VK+Q  C  CWAFS +AAVEGINKI
Sbjct: 121 RMMAR---PSRRYSPRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKI 177

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           VTG L +LSEQEL+DCDR +NAGC+GGL+DYAF+FII NGG+D+E+DYP+ GA+  CD  
Sbjct: 178 VTGNLTALSEQELLDCDRTVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADGICDQY 237

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           + NA+ V+IDGYE V  +DE++LKKAVA+QPVSVAIEA G+ FQ YESG+FTG CG+++D
Sbjct: 238 KINARAVTIDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTGTCGTSID 297

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           HGV AVGYGTENG+DYW+V+NSWG +WGE GYV ++RN+ +   GKCGIA+   YP+K  
Sbjct: 298 HGVTAVGYGTENGIDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGIAILTLYPIKIG 357

Query: 361 QNSAKP 366
           QN + P
Sbjct: 358 QNPSNP 363


>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
 gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  489 bits (1259), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 231/335 (68%), Positives = 273/335 (81%), Gaps = 3/335 (0%)

Query: 27  MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
           MSII Y+  H      RT+ E   IY+ WL KHG+  N +G  E+RF+IFKDNL+FIDEH
Sbjct: 1   MSIIDYNIKHGQVPE-RTEAETRRIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEH 59

Query: 87  NSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
           NS+ N +YK+GLNKFADL+N+EYR++YLGTR D K RL+     S+RY  K GD+LPE+V
Sbjct: 60  NSVGNPSYKLGLNKFADLSNDEYRSVYLGTRMDGKGRLLGGP-KSERYLFKEGDDLPETV 118

Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
           DWREKGAV PVKDQG CGSCWAFSTV AVEGIN+IVTG L SLSEQELVDCD+  N GCN
Sbjct: 119 DWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCN 178

Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
           GGLMDYAF FII+NGG+D+E+DYPY   ++ CDP+R+NA+VV+IDGYEDV   DE SLKK
Sbjct: 179 GGLMDYAFDFIIENGGIDTEEDYPYKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKK 238

Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGS 325
           AVA+QPVSVAIEAGGR FQ Y+SGVFTG CG+ LDHGVV VGYGTE+GVDYW+VRNSWG 
Sbjct: 239 AVANQPVSVAIEAGGRGFQLYQSGVFTGSCGTQLDHGVVTVGYGTEHGVDYWIVRNSWGP 298

Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
            WGENGY++++R++  T TGKCGIAMEASYP K S
Sbjct: 299 AWGENGYIRMERDVASTETGKCGIAMEASYPTKKS 333


>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 466

 Score =  489 bits (1258), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 236/345 (68%), Positives = 274/345 (79%), Gaps = 21/345 (6%)

Query: 24  AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFI 83
           A DMSII YD +H              +Y+ WL KHGK  N +G  E+RF+IFKDNLRFI
Sbjct: 31  AMDMSIIDYDESHTRH-----------VYEAWLVKHGKAYNALGEKERRFKIFKDNLRFI 79

Query: 84  DEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA-----SQRYACKA 137
           +EHN   +++YK+GLNKFADLTNEEYRAM+LGTR+    R  K+K A     + RYA +A
Sbjct: 80  EEHNGAGDKSYKLGLNKFADLTNEEYRAMFLGTRT----RGPKNKAAVVAKKTDRYAYRA 135

Query: 138 GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD 197
           G+ELP  VDWREKGAV P+KDQG CGSCWAFSTV AVEGIN+IVTG L SLSEQELVDCD
Sbjct: 136 GEELPAMVDWREKGAVTPIKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCD 195

Query: 198 RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSP 257
           R  N GCNGGLMDYAF+FI+QNGG+D+E+DYPY   +N CDP+R+NA+VV+IDGYEDV  
Sbjct: 196 RGYNMGCNGGLMDYAFEFIVQNGGIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPT 255

Query: 258 FDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYW 317
            DE SL KAVA+QPVSVAIEAGG  FQ Y+SGVFTG CG+ LDHGVVAVGYGTENG DYW
Sbjct: 256 NDEKSLMKAVANQPVSVAIEAGGMEFQLYQSGVFTGRCGTNLDHGVVAVGYGTENGTDYW 315

Query: 318 LVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           LVRNSWGS WGENGY+KL+RN+ +T TGKCGIA+EASYP+KN  N
Sbjct: 316 LVRNSWGSAWGENGYIKLERNVQNTETGKCGIAIEASYPIKNGAN 360


>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
          Length = 461

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 242/361 (67%), Positives = 289/361 (80%), Gaps = 9/361 (2%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M ++  F A++ L+     + SSA DMSII         SS RTDDEVM +Y++WL KHG
Sbjct: 1   MDSSRSFTAMALLLLFSLFALSSALDMSIIG------ELSSSRTDDEVMAMYESWLVKHG 54

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K+ N +G  EKRFQIFKDNLRFIDEHN+ +RTYKVGLN+FADLTN+EYR+MYLG R+ ++
Sbjct: 55  KSYNAIGEKEKRFQIFKDNLRFIDEHNAESRTYKVGLNRFADLTNDEYRSMYLGARTGSR 114

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           RRL   K  S RY   AG+ LP+SVDWREKGAV  VKDQGSCGSCWAFST+AAVEGIN+I
Sbjct: 115 RRLSTQK-RSDRYVPVAGESLPDSVDWREKGAVVGVKDQGSCGSCWAFSTIAAVEGINQI 173

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           VTG+LISLSEQELVDCD   N GCNGGLMDYAF+FII+NGG+D+E+DYPY   + +CD  
Sbjct: 174 VTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYNARDGRCDQY 233

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           R+NAKVV+ID YEDV   +E +L+KAVA+QPVSVAIEA G AFQ YESGVFTG CG+ALD
Sbjct: 234 RKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSVAIEASGMAFQFYESGVFTGNCGTALD 293

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           HGV AVGYGTEN VDYW+V+NSWGS WGE+GY++++RN     TGKCGIA+E SYP+K S
Sbjct: 294 HGVTAVGYGTENSVDYWIVKNSWGSSWGESGYIRMERNT--GATGKCGIAVEPSYPIKTS 351

Query: 361 Q 361
           Q
Sbjct: 352 Q 352


>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 431

 Score =  484 bits (1247), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 228/351 (64%), Positives = 283/351 (80%), Gaps = 7/351 (1%)

Query: 11  STLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
           + ++FL  I  SSA DMSIISYD NH H+ S R+D EV  +Y+ W+ KHGK  N +   +
Sbjct: 2   TVILFLAMIVVSSAMDMSIISYDKNH-HTVSSRSDVEVSRLYEEWVVKHGKAQNSLTEKD 60

Query: 71  KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
           +RF+IFKDNLRFIDEHN  N +Y++GL KFADLTN+EYR+MYLG+R   KR+  K+   S
Sbjct: 61  RRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSR--LKRKATKT---S 115

Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
            RY  + GD +PESVDWR++GAV  VKDQGSCGSCWAFST+ AVEGINKIVTG+LISLSE
Sbjct: 116 LRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSE 175

Query: 191 QELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
           QELVDCD   N GCNGGLMDYAF+FII+NGG+D+E+DYPY G + +CD +R+NAKVV+ID
Sbjct: 176 QELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTID 235

Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
            YEDV    E SLKKA++ QP+SVAIE GGRAFQ Y+SG+F G CG+ LDHGVVAVGYGT
Sbjct: 236 SYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT 295

Query: 311 ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           ENG DYW+V+NSWG+ WGE+GY++++RN+  ++ GKCGIA+E SYP+KN Q
Sbjct: 296 ENGKDYWIVKNSWGTSWGESGYIRMERNIA-SSAGKCGIAVEPSYPIKNGQ 345


>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
          Length = 441

 Score =  484 bits (1247), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 229/351 (65%), Positives = 282/351 (80%), Gaps = 7/351 (1%)

Query: 11  STLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
           + ++FL  I  SSA DMSIISYD NH H+ S R+D EV  +Y+ WL KHGK  N +   +
Sbjct: 2   TVILFLTMIVVSSAMDMSIISYDKNH-HTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKD 60

Query: 71  KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
           +RF+IFKDNLRFIDEHN  N +Y++GL KFADLTN+EYR+MYLG+R   KR+  KS   S
Sbjct: 61  RRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSR--LKRKATKS---S 115

Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
            RY  + GD +PESVDWR++GAV  VKDQGSCGSCWAFST+ AVEGINKIVTG+LI+LSE
Sbjct: 116 LRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSE 175

Query: 191 QELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
           QELVDCD   N GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID
Sbjct: 176 QELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTID 235

Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
            YEDV    E SLKKA++ QP+SVAIE GGRAFQ Y+SG+F G CG+ LDHGVVAVGYGT
Sbjct: 236 LYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT 295

Query: 311 ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           ENG DYW+V+NSWG+ WGE+GY++++RN+  ++ GKCGIA+E SYP+KN Q
Sbjct: 296 ENGKDYWIVKNSWGTSWGESGYIRMERNIA-SSAGKCGIAVEPSYPIKNGQ 345


>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 485

 Score =  484 bits (1246), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 229/351 (65%), Positives = 282/351 (80%), Gaps = 7/351 (1%)

Query: 11  STLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
           + ++FL  I  SSA DMSIISYD NH H+ S R+D EV  +Y+ WL KHGK  N +   +
Sbjct: 8   TVILFLTMIVVSSAMDMSIISYDKNH-HTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKD 66

Query: 71  KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
           +RF+IFKDNLRFIDEHN  N +Y++GL KFADLTN+EYR+MYLG+R   KR+  KS   S
Sbjct: 67  RRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSR--LKRKATKS---S 121

Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
            RY  + GD +PESVDWR++GAV  VKDQGSCGSCWAFST+ AVEGINKIVTG+LI+LSE
Sbjct: 122 LRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSE 181

Query: 191 QELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
           QELVDCD   N GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID
Sbjct: 182 QELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTID 241

Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
            YEDV    E SLKKA++ QP+SVAIE GGRAFQ Y+SG+F G CG+ LDHGVVAVGYGT
Sbjct: 242 LYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT 301

Query: 311 ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           ENG DYW+V+NSWG+ WGE+GY++++RN+  ++ GKCGIA+E SYP+KN Q
Sbjct: 302 ENGKDYWIVKNSWGTSWGESGYIRMERNIA-SSAGKCGIAVEPSYPIKNGQ 351


>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
          Length = 466

 Score =  484 bits (1245), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 232/363 (63%), Positives = 286/363 (78%), Gaps = 6/363 (1%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA  S  L IS L+ L F + SSA+DMSIISYD  H H    R+DDEV  +Y++WL +HG
Sbjct: 1   MAAHSSTLTISLLLMLIFSTLSSASDMSIISYDETHIHH---RSDDEVSALYESWLIEHG 57

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +G  +KRFQIFKDNL++IDE NS+ N++YK+GL KFADLTNEEYR++YLGT+S  
Sbjct: 58  KSYNALGEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSG 117

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
            RR + SK  S RY  K GD LPESVDWR+KG +  VKDQGSCGSCWAFS VAA+E IN 
Sbjct: 118 DRRKL-SKNKSDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINA 176

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           IVTG LISLSEQELVDCD+  N GC+GGLMDYAF+F+I NGG+D+E+DYPY    + CD 
Sbjct: 177 IVTGNLISLSEQELVDCDKSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQ 236

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
            R+NAKVV ID YEDV   +E +L+KAVA QPVS+AIEAGGR  QHY+SG+FTG+CG+A+
Sbjct: 237 YRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAV 296

Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           DHGVVA GYG+ENG+DYW+VRNSWG+ WGE GY+++QRN+  +++G CG+A E SYPVK 
Sbjct: 297 DHGVVAAGYGSENGMDYWIVRNSWGAKWGEKGYLRVQRNVA-SSSGLCGLATEPSYPVKT 355

Query: 360 SQN 362
             N
Sbjct: 356 GAN 358


>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
          Length = 525

 Score =  484 bits (1245), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 229/340 (67%), Positives = 278/340 (81%), Gaps = 7/340 (2%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSIISYD  H      R+++E+  +Y+ WLAKHG+  N +G  E+RF+IFKDN+RFID 
Sbjct: 24  DMSIISYDEAHGVQGLERSEEEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDA 83

Query: 86  HN----SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           HN    S +R++++GLN+FAD+TNEEYR +YLGTR  + RR  ++++ S RY   AG+EL
Sbjct: 84  HNAAADSGHRSFRLGLNRFADMTNEEYRTVYLGTRPASHRR--RARLGSDRYRYNAGEEL 141

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PESVDWR+KGAV  VKDQGSCGSCWAFST+AAVEGINKIVTG+LISLSEQELVDCD   N
Sbjct: 142 PESVDWRDKGAVTTVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQN 201

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMDYAF+FII NGG+D+E+DYPY   + KCD  R+NAKVVSIDGYEDV   DE 
Sbjct: 202 QGCNGGLMDYAFEFIINNGGIDTEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEK 261

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           +L+KAVA+QPVSVAIEAGGR FQ Y SG+FTG CG+ LDHGVVAVGYGTENG DYW+VRN
Sbjct: 262 ALQKAVANQPVSVAIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRN 321

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           SWG DWGE+GY++++RN ++ +TGKCGIAME+SYP K  Q
Sbjct: 322 SWGGDWGESGYIRMERN-VNASTGKCGIAMESSYPTKKGQ 360


>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
          Length = 464

 Score =  483 bits (1244), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 224/357 (62%), Positives = 274/357 (76%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           M   ++ L      + S A DM IISYD  H   S+ RT+D+V+T+Y+ WL KHGK  N 
Sbjct: 1   MLSKLTILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKNYNA 60

Query: 66  MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
           +G  EKRF+IFKDNL FIDEHNS N ++++GLN+FADLTNEEYR  +LGTR +  RR  K
Sbjct: 61  LGEKEKRFEIFKDNLGFIDEHNSKNLSFRLGLNRFADLTNEEYRTRFLGTRINPNRRNRK 120

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
               + RYA + GD+LPESVDWR++GAV  VKDQGSCGSCWAFS +AAVEG+NK+ TG+L
Sbjct: 121 VNSQTNRYATRVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLATGDL 180

Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           ISLSEQELVDCD   N GCNGGLMDYAF+FII    +  E+DYPY   + +CD +R+NAK
Sbjct: 181 ISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRCDQNRKNAK 240

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
           VVSID YEDV  +DE +LKKAVA+Q ++VA+E GGR FQ Y+SGVFTG CG+ALDHGV A
Sbjct: 241 VVSIDQYEDVPAYDEGALKKAVANQVIAVAVEGGGREFQLYDSGVFTGRCGTALDHGVAA 300

Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           VGYGTENG DYW+VRNSWG  WGE GY++L+RNL  + +GKCGIA+E SYP+KN  N
Sbjct: 301 VGYGTENGKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPSYPIKNGLN 357


>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 479

 Score =  483 bits (1243), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 226/320 (70%), Positives = 264/320 (82%), Gaps = 2/320 (0%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
           R D+EV  +Y++WL  HGK  N +G  E+RF+IFKDNLRFIDEHN  +RTYKVGL +FAD
Sbjct: 53  RPDEEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFAD 112

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           LTNEEYRA +LG R   K RL  +K  S RYA   GD+LP+ VDWR+KGAV  VKDQG C
Sbjct: 113 LTNEEYRARFLGGRFSRKPRLSAAK--SGRYAAALGDDLPDDVDWRKKGAVATVKDQGQC 170

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           GSCWAFS+VAAVEGIN+IVTGELI LSEQELVDCD+  N GCNGGLMDYAFQFII NGG+
Sbjct: 171 GSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGI 230

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
           D+E+DYPY G +  CDP+R+NAKVV+IDGYEDV   DE SLKKAVA+QPVSVAIEAGGRA
Sbjct: 231 DTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRA 290

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           FQ Y+SGVFTG CG+ LDHGVVAVGYGT+NG DYW+VRNSWG DWGE+GY++L+RN+ + 
Sbjct: 291 FQLYQSGVFTGRCGTDLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIRLERNVANI 350

Query: 343 NTGKCGIAMEASYPVKNSQN 362
            TGKCGIA++ SYP K+  N
Sbjct: 351 TTGKCGIAVQPSYPTKSGAN 370


>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
          Length = 470

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 229/341 (67%), Positives = 276/341 (80%), Gaps = 7/341 (2%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSIISYD  H      R+++E+  +Y+ WLAKHG+  N +G  E+RF+IFKDN+ FID 
Sbjct: 24  DMSIISYDEAHGVRGLERSEEEMRILYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDA 83

Query: 86  HNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           HN+     +R++++GLN+FAD+TNEEYRA+YLGTR    RR  +++V S RY   AG++L
Sbjct: 84  HNAAADAGHRSFRLGLNRFADMTNEEYRAVYLGTRPAGHRR--RARVGSDRYRYNAGEDL 141

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PESVDWR KGAV  VKDQGSCGSCWAFSTVAAVEGINKIVTG+LISLSEQELVDCD   N
Sbjct: 142 PESVDWRAKGAVAAVKDQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYN 201

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMDY F+FII NGG+D+E+DYPY   + KCD  R+NAKVVSIDGYEDV   DE 
Sbjct: 202 QGCNGGLMDYGFEFIINNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEK 261

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           +L+KAVA+QPVSVAIEAGGR FQ Y SG+FTG CG+ LDHGVVAVGYGTENG DYW+VRN
Sbjct: 262 ALQKAVANQPVSVAIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRN 321

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           SWG DWGE+GY++++RN ++T+TGKCGIA+E SYP K  QN
Sbjct: 322 SWGGDWGESGYIRMERN-VNTSTGKCGIAIEPSYPTKKGQN 361


>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 463

 Score =  480 bits (1236), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 235/361 (65%), Positives = 285/361 (78%), Gaps = 12/361 (3%)

Query: 7   FLAISTLVFLF-FISSSSAADMSIISYDNNHDHSS-SWRTDDEVMTIYQTWLAKHGK--- 61
           FL +S ++ L   I  S A DMSIISYD NH  ++ + R+D EV  IY+ W+ +HGK   
Sbjct: 3   FLKLSPMILLLAMIGVSYAMDMSIISYDENHHITTETSRSDSEVERIYEAWMVEHGKKKM 62

Query: 62  TSNGMG-HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
             NG+G   ++RF+IFKDNLRFIDEHN+ N +YK+GL +FADLTNEEYR+MYLG +    
Sbjct: 63  NQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEYRSMYLGAK--PT 120

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           +R++K+   S RY  + GD LP+SVDWR++GAV  VKDQGSCGSCWAFST+ AVEGINKI
Sbjct: 121 KRVLKT---SDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKI 177

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           VTG+LISLSEQELVDCD   N GCNGGLMDYAF+FII+NGG+D+E DYPY  A+ +CD +
Sbjct: 178 VTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQN 237

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           R+NAKVV+ID YEDV    E SLKKA+A QP+SVAIEAGGRAFQ Y SGVF G CG+ LD
Sbjct: 238 RKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVFDGLCGTELD 297

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           HGVVAVGYGTENG DYW+VRNSWG+ WGE+GY+K+ RN ++  TGKCGIAMEASYP+K  
Sbjct: 298 HGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARN-IEAPTGKCGIAMEASYPIKKG 356

Query: 361 Q 361
           Q
Sbjct: 357 Q 357


>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
 gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
          Length = 455

 Score =  479 bits (1233), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 226/353 (64%), Positives = 281/353 (79%), Gaps = 8/353 (2%)

Query: 12  TLVFLFFISSSSAADMSIISYDNNHDHSSSW-RTDDEVMTIYQTWLAKHGKTSN--GMGH 68
            ++FL  ++ +SA DMSIISYD  H  S++  R+D EVM+IY+ WL KHGK  N   +  
Sbjct: 2   VILFLAMVAVASAVDMSIISYDEKHGVSTTGGRSDAEVMSIYEAWLVKHGKAQNQNSLVE 61

Query: 69  NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            ++RF+IFKDNLRFID+HN  N +Y++GL +FADLTN+EYR+ YLG + + K      + 
Sbjct: 62  KDRRFEIFKDNLRFIDDHNKKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKK----GERR 117

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
            SQRY  + GDELPES+DWR+KGAV  VKDQGSCGSCWAFST+ AVEGIN+IVTG+LI+L
Sbjct: 118 TSQRYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVTGDLITL 177

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           SEQELVDCD   N GCNGGLMDYAF+FII+NGG+D+++DYPY G +  CD  R+NAKVV+
Sbjct: 178 SEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVT 237

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
           ID YEDV  + E SLKKAVA QPVSVAIEAGGRAFQ Y+SG+F G CG+ LDHGVVAVGY
Sbjct: 238 IDSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYDSGIFDGTCGTQLDHGVVAVGY 297

Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           GTENG DYW+VRNSWG  WGE+GY+K+ RN+  +++GKCGIA+E SYP+KN +
Sbjct: 298 GTENGKDYWIVRNSWGKSWGESGYLKMARNIA-SSSGKCGIAIEPSYPIKNGE 349


>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 460

 Score =  479 bits (1233), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 233/359 (64%), Positives = 286/359 (79%), Gaps = 11/359 (3%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSS-SWRTDDEVMTIYQTWLAKHGKT--SN 64
           + ++ L+    I  S AADMSIISYD  H  ++ + R+D EV  IY+ W+ KHGK   SN
Sbjct: 4   VKVTILLLAMMIGVSYAADMSIISYDEKHHITAENERSDAEVARIYEAWMEKHGKKAQSN 63

Query: 65  GMGHNEK--RFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
           G+   EK  RF+IFKDNLRFIDEHN+ N +YK+GL +FADLTNEEYR++YLG +S  K+R
Sbjct: 64  GLVGEEKDQRFEIFKDNLRFIDEHNNKNLSYKLGLTRFADLTNEEYRSIYLGAKS--KKR 121

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
           ++K+   S RY  + GD +P+SVDWR++GAV  VKDQGSCGSCWAFST+ AVEGINKIVT
Sbjct: 122 VLKT---SDRYQPRVGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVT 178

Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
           G+LISLSEQELVDCD   N GCNGGLMDYAF+FII+NGG+D+E+DYPY  A+ +CD +R+
Sbjct: 179 GDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQTRK 238

Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
           NAKVV+ID YEDV   +E +LKK +A+QP+SVAIEAGGRAFQ Y SGVF G CG+ LDHG
Sbjct: 239 NAKVVTIDAYEDVPENNEAALKKTLANQPISVAIEAGGRAFQLYSSGVFDGICGTELDHG 298

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           VVAVGYGTENG DYW+VRNSWG  WGE+GY+K+ RN+ +  TGKCGIAMEASYP+K  Q
Sbjct: 299 VVAVGYGTENGKDYWIVRNSWGGSWGESGYIKMARNIAEP-TGKCGIAMEASYPIKKGQ 356


>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 463

 Score =  478 bits (1230), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 235/361 (65%), Positives = 285/361 (78%), Gaps = 12/361 (3%)

Query: 7   FLAISTLVFLF-FISSSSAADMSIISYDNNHDHSS-SWRTDDEVMTIYQTWLAKHGK--- 61
           FL +S ++ L   I  S A DMSIISYD NH  S+ S R+D EV  IY+ W+ +HGK   
Sbjct: 3   FLKLSPMILLLAMIGVSYAIDMSIISYDENHHISTVSSRSDAEVERIYEAWMVEHGKKKM 62

Query: 62  TSNGMG-HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
             NG+G   ++RF+IFKDNLR+IDEHN+ N +YK+GL +FADLTN+EYR+MYLG +    
Sbjct: 63  NQNGLGAEKDQRFEIFKDNLRYIDEHNTKNLSYKLGLTRFADLTNDEYRSMYLGAK--PV 120

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           +R++K+   S RY  + GD LP+SVDWR++GAV  VKDQGSCGSCWAFST+ AVEGINKI
Sbjct: 121 KRVLKT---SDRYEARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKI 177

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           VTG+LISLSEQELVDCD   N GCNGGLMDYAF+FII+NGG+D+E DYPY  A+ +CD +
Sbjct: 178 VTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQN 237

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           R+NAKVV+ID YEDV    E SLKKA+A QP+SVAIEAGGRAFQ Y SGVF G CG+ LD
Sbjct: 238 RKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVFDGICGTELD 297

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           HGVVAVGYGTENG DYW+VRNSWG+ WGE+GY+K+ RN+ +  TGKCGIAMEASYP+K  
Sbjct: 298 HGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNIAEP-TGKCGIAMEASYPIKKG 356

Query: 361 Q 361
           Q
Sbjct: 357 Q 357


>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
          Length = 422

 Score =  478 bits (1229), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 221/311 (71%), Positives = 257/311 (82%)

Query: 49  MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
           M++Y+ WL KHGK  N +G  +KRF IFKDNLRFID+HN+ NRTYK+GLN+FADLTNEEY
Sbjct: 1   MSLYEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNADNRTYKLGLNRFADLTNEEY 60

Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
           RA YLGTR D  RR +K+K  S RYA + GD LPESVDWR + AV PVKDQG+CGSCWAF
Sbjct: 61  RARYLGTRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWAF 120

Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           ST+ AVEGINKIVTG+LISLSEQELVDCD   N GCNGGLMDYA++FII NGG+DSE+DY
Sbjct: 121 STIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGIDSEEDY 180

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY   +  CD  R+NAKVV+ID YEDV   DE++LKKAVA+QPVSVAIE GGR FQ Y S
Sbjct: 181 PYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYVS 240

Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           GVFTG CG+ALDHGVVAVGYG+  G DYW+VRNSWG+ WGE GYV+L+RNL  + +GKCG
Sbjct: 241 GVFTGRCGTALDHGVVAVGYGSVKGHDYWIVRNSWGASWGEEGYVRLERNLAKSRSGKCG 300

Query: 349 IAMEASYPVKN 359
           IA+E SYP+KN
Sbjct: 301 IAIEPSYPIKN 311


>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
          Length = 473

 Score =  476 bits (1224), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 231/344 (67%), Positives = 279/344 (81%), Gaps = 6/344 (1%)

Query: 24  AADMSIISYDNNHDH-SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRF 82
           A DMSIISYD+NH+   SS R+DDEVM IY++WL +H K  N +G  EKRF IFKDNL F
Sbjct: 24  AVDMSIISYDHNHNLLPSSSRSDDEVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEF 83

Query: 83  IDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLG----TRSDAKRRLMKSKVASQRYACKA 137
           ID+HNS + +T+KVGLNKFADLTNEE+R++YLG    + S       KSKV S RY  K 
Sbjct: 84  IDQHNSDDSQTFKVGLNKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKE 143

Query: 138 GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD 197
           GDELPE+VDWR+ GAV  VKDQG CGSCWAFST+AAVEGIN+IVTGEL+SLSEQELVDCD
Sbjct: 144 GDELPEAVDWRKNGAVAKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCD 203

Query: 198 RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSP 257
              N+GC+GGLMDYA++FII NGG+D++ DYPY   + KCD  R+NAKVV+ID +EDV  
Sbjct: 204 TSYNSGCDGGLMDYAYEFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPE 263

Query: 258 FDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYW 317
            DE +L+KAVA QPVSVAIEAGG  FQ Y+SGVFTG+CG+ LDHGVVAVGYG+++G DYW
Sbjct: 264 NDEKALQKAVAHQPVSVAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYGSDDGKDYW 323

Query: 318 LVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           +VRNSWG+DWGE+GY++++RNL    TGKCGIA+E SYP+KNSQ
Sbjct: 324 IVRNSWGADWGESGYIRMERNLETVKTGKCGIAIEPSYPIKNSQ 367


>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
 gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
          Length = 462

 Score =  474 bits (1220), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 221/352 (62%), Positives = 280/352 (79%), Gaps = 8/352 (2%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSW-RTDDEVMTIYQTWLAKHGK--TSNGMGHN 69
           ++FL  ++ SSA DMSIISYD  H  S++  R++ EVM+IY+ WL KHGK  + N +   
Sbjct: 10  ILFLAMVTVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEK 69

Query: 70  EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
           ++RF+IFKDNLRF+DEHN  N +Y++GL +FADLTN+EYR+ YLG + + K      +  
Sbjct: 70  DRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKK----GERRT 125

Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
           S RY  + GDELPES+DWR+KGAV  VKDQG CGSCWAFST+ AVEGIN+IVTG+LI+LS
Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185

Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
           EQELVDCD   N GCNGGLMDYAF+FII+NGG+D+++DYPY G +  CD  R+NAKVV+I
Sbjct: 186 EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245

Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
           D YEDV  + E SLKKAVA QP+S+AIEAGGRAFQ Y+SG+F G CG+ LDHGVVAVGYG
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG 305

Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           TENG DYW+VRNSWG  WGE+GY+++ RN+  +++GKCGIA+E SYP+KN +
Sbjct: 306 TENGKDYWIVRNSWGKSWGESGYLRMARNIA-SSSGKCGIAIEPSYPIKNGE 356


>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
 gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
           Precursor
 gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
 gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
 gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
          Length = 462

 Score =  474 bits (1220), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 221/352 (62%), Positives = 280/352 (79%), Gaps = 8/352 (2%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSW-RTDDEVMTIYQTWLAKHGK--TSNGMGHN 69
           ++FL  ++ SSA DMSIISYD  H  S++  R++ EVM+IY+ WL KHGK  + N +   
Sbjct: 10  ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEK 69

Query: 70  EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
           ++RF+IFKDNLRF+DEHN  N +Y++GL +FADLTN+EYR+ YLG + + K      +  
Sbjct: 70  DRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKK----GERRT 125

Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
           S RY  + GDELPES+DWR+KGAV  VKDQG CGSCWAFST+ AVEGIN+IVTG+LI+LS
Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185

Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
           EQELVDCD   N GCNGGLMDYAF+FII+NGG+D+++DYPY G +  CD  R+NAKVV+I
Sbjct: 186 EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245

Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
           D YEDV  + E SLKKAVA QP+S+AIEAGGRAFQ Y+SG+F G CG+ LDHGVVAVGYG
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG 305

Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           TENG DYW+VRNSWG  WGE+GY+++ RN+  +++GKCGIA+E SYP+KN +
Sbjct: 306 TENGKDYWIVRNSWGKSWGESGYLRMARNIA-SSSGKCGIAIEPSYPIKNGE 356


>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
          Length = 433

 Score =  473 bits (1218), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 221/352 (62%), Positives = 280/352 (79%), Gaps = 8/352 (2%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSW-RTDDEVMTIYQTWLAKHGK--TSNGMGHN 69
           ++FL  ++ SSA DMSIISYD  H  S++  R++ EVM+IY+ WL KHGK  + N +   
Sbjct: 10  ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEK 69

Query: 70  EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
           ++RF+IFKDNLRF+DEHN  N +Y++GL +FADLTN+EYR+ YLG + + K      +  
Sbjct: 70  DRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKK----GERRT 125

Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
           S RY  + GDELPES+DWR+KGAV  VKDQG CGSCWAFST+ AVEGIN+IVTG+LI+LS
Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185

Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
           EQELVDCD   N GCNGGLMDYAF+FII+NGG+D+++DYPY G +  CD  R+NAKVV+I
Sbjct: 186 EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245

Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
           D YEDV  + E SLKKAVA QP+S+AIEAGGRAFQ Y+SG+F G CG+ LDHGVVAVGYG
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG 305

Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           TENG DYW+VRNSWG  WGE+GY+++ RN+  +++GKCGIA+E SYP+KN +
Sbjct: 306 TENGKDYWIVRNSWGKSWGESGYLRMARNIA-SSSGKCGIAIEPSYPIKNGE 356


>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score =  473 bits (1217), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 222/342 (64%), Positives = 276/342 (80%), Gaps = 6/342 (1%)

Query: 22  SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLR 81
           ++A DMSII+YD  H  +  ++TDDE  T++++WL  HGK+ N +G  EKRFQIFK+NLR
Sbjct: 17  AAATDMSIITYDETH--AVGFKTDDEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLR 74

Query: 82  FIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE 140
           +IDE N + +R +K+GLNKFADLTNEEYR+ Y G +S   R+ + +K  S RYA  +G+ 
Sbjct: 75  YIDEQNLVEDRGFKLGLNKFADLTNEEYRSKYTGIKSKDLRKKVSAK--SGRYATLSGES 132

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           LPESVDWRE GAV  VKDQGSCGSCWAFST++AVEGIN+I TG+LI+LSEQELVDCDR  
Sbjct: 133 LPESVDWRESGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSY 192

Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
           N GCNGGLMDYAF+FII NGG+D++ DYPY G + KCD  R+NAKVV+ID YEDV  +DE
Sbjct: 193 NEGCNGGLMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDE 252

Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
           ++LKKA A+QP+SVAIEA GR FQ Y+SG+FTG+CG ALDHGVV VGYGTENG DYW+VR
Sbjct: 253 LALKKAAANQPISVAIEASGRDFQFYDSGIFTGKCGIALDHGVVVVGYGTENGKDYWIVR 312

Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           NSWG+DWGENGY++++R  + + TG CGIA+E SYPVK   N
Sbjct: 313 NSWGADWGENGYLRMERG-ISSKTGICGIAIEPSYPVKTGVN 353


>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 474

 Score =  473 bits (1217), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 234/358 (65%), Positives = 282/358 (78%), Gaps = 10/358 (2%)

Query: 8   LAISTLVFLFFISSSSAA-----DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
           +A  +++F F  +  SAA     DMSII+YD  H      R++DEV  ++++WL KHGK+
Sbjct: 1   MARPSILFTFLFAVVSAAAAAAEDMSIITYDQQHPAKGLVRSEDEVKEMFESWLVKHGKS 60

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR 121
            N +   +KRF+IF+DNL++IDE NSL NR+YK+GLN+FAD+TNEEYR  YLG + DA R
Sbjct: 61  YNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADITNEEYRTGYLGAKRDASR 120

Query: 122 RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
            ++KSK  S RYA  AGD LP+S+DWREKGAV  VKDQGSCGSCWAFST+AAVEG+N++ 
Sbjct: 121 NMVKSK--SDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGSCWAFSTIAAVEGVNQLA 178

Query: 182 TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
           TG LISLSEQELVDCDRKIN GCNGG M YAFQFII+NGG+DSE+DYPY G + KCD  R
Sbjct: 179 TGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFIIKNGGIDSEEDYPYTGKDGKCDSYR 238

Query: 242 R-NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           + NAKV SIDGYE+V   +E SL+KAVA+QPVSVAIEAGG  FQ Y SG+FTG CG+ LD
Sbjct: 239 QNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGGYDFQLYSSGIFTGSCGTDLD 298

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           HGV AVGYGTENGVDYW+V+NSWG  WGE GYV++QRN +   TG CGIAMEASYP K
Sbjct: 299 HGVAAVGYGTENGVDYWIVKNSWGDYWGEKGYVRMQRN-VKAKTGLCGIAMEASYPTK 355


>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
 gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
          Length = 376

 Score =  472 bits (1215), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 227/356 (63%), Positives = 277/356 (77%), Gaps = 7/356 (1%)

Query: 23  SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH----NEKRFQIFKD 78
           ++ D SII+          WRTD+EV +IY  W A+HGKT+N         +KRF IFKD
Sbjct: 20  ASGDESIINDHLQLPSDGKWRTDEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKD 79

Query: 79  NLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY-AC 135
           NLRFID HN  N+  TYK+GL KF DLTN+EYR +YLG R++  RR+ K+K  +Q+Y A 
Sbjct: 80  NLRFIDLHNENNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAA 139

Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
             G E+PE+VDWR+KGAVNP+KDQG+CGSCWAFST AAVEGINKIVTGELISLSEQELVD
Sbjct: 140 VNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVD 199

Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
           CD+  N GCNGGLMDYAFQFI++NGG+++E+DYPY G   KC+   +N++VVSIDGYEDV
Sbjct: 200 CDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDV 259

Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
              DE +LKKA++ QPVSVAIEAGGR FQHY+SG+FTG CG+ LDH VVAVGYG+ENGVD
Sbjct: 260 PTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVD 319

Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
           YW+VRNSWG  WGE GY++++RNL  + +GKCGIA+EASYPVK S N  +    SS
Sbjct: 320 YWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVKYSPNPVRGNTISS 375


>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
           Precursor
 gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
 gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  472 bits (1214), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 227/356 (63%), Positives = 277/356 (77%), Gaps = 7/356 (1%)

Query: 23  SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH----NEKRFQIFKD 78
           ++ D SII+          WRTD+EV +IY  W A+HGKT+N         +KRF IFKD
Sbjct: 20  ASGDESIINDHLQLPSDGKWRTDEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKD 79

Query: 79  NLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY-AC 135
           NLRFID HN  N+  TYK+GL KF DLTN+EYR +YLG R++  RR+ K+K  +Q+Y A 
Sbjct: 80  NLRFIDLHNEDNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAA 139

Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
             G E+PE+VDWR+KGAVNP+KDQG+CGSCWAFST AAVEGINKIVTGELISLSEQELVD
Sbjct: 140 VNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVD 199

Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
           CD+  N GCNGGLMDYAFQFI++NGG+++E+DYPY G   KC+   +N++VVSIDGYEDV
Sbjct: 200 CDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDV 259

Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
              DE +LKKA++ QPVSVAIEAGGR FQHY+SG+FTG CG+ LDH VVAVGYG+ENGVD
Sbjct: 260 PTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVD 319

Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
           YW+VRNSWG  WGE GY++++RNL  + +GKCGIA+EASYPVK S N  +    SS
Sbjct: 320 YWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVKYSPNPVRGNTISS 375


>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
          Length = 365

 Score =  471 bits (1213), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 219/323 (67%), Positives = 260/323 (80%), Gaps = 3/323 (0%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGL 97
           +S+ RTD+EV   Y+ WLA+HGKT N +G  E RF+IF DNL+FIDEHN S NR+YKVGL
Sbjct: 23  TSNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVGL 82

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRL--MKSKVASQRYACKAGDELPESVDWREKGAVNP 155
           N+FADLTNEEYR+MYLGT+ D  RR+  M+    S+RYA +  +  P  VDWRE+GAV+P
Sbjct: 83  NQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGAVSP 142

Query: 156 VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQF 215
           VK+QG CGSCWAFSTVA+VEGINKIVTG+LISLSEQELVDCD K N+GCNGG MDYAFQF
Sbjct: 143 VKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYAFQF 202

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
           I+ NGG+DSE DYPY G    CDP R  AK+VSIDGYEDV P +E +L KAVA QPVSV 
Sbjct: 203 IVSNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPVSVG 262

Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
           IEA GRAFQ Y SGV TG CG+ LDHGVV VGYG+ENG DYW+VRNSWG +WGE+GY+++
Sbjct: 263 IEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYGSENGKDYWIVRNSWGPEWGEDGYIRM 322

Query: 336 QRNLLDTNTGKCGIAMEASYPVK 358
           +RN++DT  G CGI + ASYP+K
Sbjct: 323 ERNMVDTPVGMCGITLMASYPIK 345


>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
 gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
          Length = 375

 Score =  471 bits (1211), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 226/350 (64%), Positives = 273/350 (78%), Gaps = 7/350 (2%)

Query: 23  SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH----NEKRFQIFKD 78
           ++ D SII+        S WRTD+EV +IY  W A HGKT+N         +KRF IFKD
Sbjct: 20  ASGDESIINDHLQLPSDSWWRTDEEVRSIYLQWSADHGKTNNNNNGIINDQDKRFNIFKD 79

Query: 79  NLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
           NLRFID HN  N+  TYK+GL KF DLTNEEYR++YLG R++  RR+ K+K  +Q+Y+  
Sbjct: 80  NLRFIDLHNEKNKNATYKLGLTKFTDLTNEEYRSLYLGARTEPVRRIAKAKNVNQKYSAA 139

Query: 137 A-GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
             G E+PE+VDWR KGAVNP+KDQG+CGSCWAFST AAVEGINKIVTGELISLSEQELVD
Sbjct: 140 VDGKEVPETVDWRLKGAVNPIKDQGTCGSCWAFSTAAAVEGINKIVTGELISLSEQELVD 199

Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
           CD   N GCNGGLMDYAFQFI++NGG+ +E+DYPY G   KC+   +NAKVVSIDGYEDV
Sbjct: 200 CDNSYNQGCNGGLMDYAFQFIMKNGGLKTEKDYPYRGFGGKCNSFLKNAKVVSIDGYEDV 259

Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
              DE +LK+A++ QPVSVAIEAGGR FQHY++G+FTG CG+ LDH VVAVGYG+ENGVD
Sbjct: 260 PTKDETALKRAISLQPVSVAIEAGGRIFQHYQTGIFTGNCGTNLDHAVVAVGYGSENGVD 319

Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
           YW+VRNSWG  WGE GY++++RNL  + +GKCGIA+EASYPVK S N  +
Sbjct: 320 YWIVRNSWGPRWGEEGYIRMERNLASSKSGKCGIAVEASYPVKYSPNPVR 369


>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  470 bits (1210), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 225/357 (63%), Positives = 276/357 (77%), Gaps = 7/357 (1%)

Query: 23  SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH----NEKRFQIFKD 78
           ++ D SII+          WRTD+EV +IY  W A+HGKT+N         +KRF IFKD
Sbjct: 20  ASGDESIINDHLQLPSDGKWRTDEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKD 79

Query: 79  NLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
           NLRFID HN  N+  TYK+GL KF DLTN+EYR +YLG R++  RR+ K+K  +Q+Y+  
Sbjct: 80  NLRFIDLHNENNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAA 139

Query: 137 A-GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
             G E+PE+VDWR+KGAVNP+KDQG+CGSCWAFST AAVEGINKIVTGELISLSEQELVD
Sbjct: 140 VNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVD 199

Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
           CD+  N GCNGGLMDYAFQFI++NGG+++E+DYPY G   KC+   +N++VVSIDGYEDV
Sbjct: 200 CDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDV 259

Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
              DE +LKKA++ QPV VAIEAGGR FQHY+SG+FTG CG+ LDH VVAVGYG+ENGVD
Sbjct: 260 PTKDETALKKAISYQPVRVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVD 319

Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSSA 372
           YW+VRNSWG  WGE GY++++RNL  + +GKCGIA+EASYPVK S N  +    SS 
Sbjct: 320 YWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVKYSPNPVRGNTISSV 376


>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
          Length = 459

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 226/360 (62%), Positives = 279/360 (77%), Gaps = 11/360 (3%)

Query: 6   MFLAISTLVFLF---FISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
           M  + ST+  LF   FI SSSA D+SII    N       R DDE+ ++Y+TWL KHGK 
Sbjct: 1   MSTSKSTIFLLFSIIFIVSSSALDLSIIDRAFN-------RPDDEIASLYETWLVKHGKN 53

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
            NG+G  + RF IFKDNLRF+DE NS N ++K+GLN+FADLTNEEYR++YLGTR  +   
Sbjct: 54  YNGLGEKQLRFNIFKDNLRFVDERNSENLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAV 113

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
               +  S RYA +AGD LPESVDWR+KGAV  +KDQGSCGSCWAFS +AAVEG+N+IVT
Sbjct: 114 ARSGRSKSDRYAFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVT 173

Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
           G+LISLSEQELV+CD   N GC+GGLMDYAF+FII+N G+DS++DYPY G + +CD +R+
Sbjct: 174 GDLISLSEQELVECDTSYNDGCDGGLMDYAFEFIIKNEGIDSDEDYPYTGRDGRCDTNRK 233

Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
           NAKVV+ID YED   +DE SL+KAVA+QPVSVAIE GGR FQ Y+SGVFTG+CG+ALDHG
Sbjct: 234 NAKVVTIDDYEDSPVYDEKSLQKAVANQPVSVAIEGGGRDFQLYDSGVFTGKCGTALDHG 293

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           V  VGYGTE+G+DYW+VRNSWG  WGE GY+++QRN     +G CGIA+E SYP+K+  N
Sbjct: 294 VAVVGYGTEDGLDYWIVRNSWGDTWGEGGYIRMQRN-TKLPSGICGIAIEPSYPIKSGLN 352


>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
          Length = 437

 Score =  467 bits (1201), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 223/337 (66%), Positives = 272/337 (80%), Gaps = 4/337 (1%)

Query: 24  AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFI 83
           A+DMSII+YD  H +S   RTDDEVMT+Y +WL KHGK+ N +G  E RFQIFKDNLR+I
Sbjct: 22  ASDMSIINYDQTHTNSLI-RTDDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYI 80

Query: 84  DEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
           D HN+  +R+Y++GLN+FADLTNEEYRA YLGT+S   R  + SK  S RYA   G+ELP
Sbjct: 81  DNHNADPDRSYELGLNRFADLTNEEYRAKYLGTKSRESRPKL-SKGPSDRYAPVEGEELP 139

Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA 202
           +S+DWREKGAV  VKDQGSCGSCWAFS + AVEGIN+I TGELI+LSEQELVDCDR  N 
Sbjct: 140 DSIDWREKGAVAAVKDQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNE 199

Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
           GC GGLMDYAF FII+NGG+DS+ DYPY G +  C+ ++ NAKVV+ID YEDV  +DE +
Sbjct: 200 GCEGGLMDYAFNFIIKNGGIDSDLDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKA 259

Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
           L+KA A+QP+SVAIEAGG  FQ Y SG+FTG+CG+A+DHGVV VGYG+E G+DYW+VRNS
Sbjct: 260 LQKAAANQPISVAIEAGGMDFQLYVSGIFTGKCGTAVDHGVVVVGYGSEEGMDYWIVRNS 319

Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           WG+ WGE GY+K+QRN +  ++G CGI +E SYPVKN
Sbjct: 320 WGAAWGEAGYLKMQRN-VGKSSGLCGITIEPSYPVKN 355


>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 436

 Score =  464 bits (1194), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 218/354 (61%), Positives = 275/354 (77%), Gaps = 21/354 (5%)

Query: 19  ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKD 78
           +S ++AADMSI+SY          R+++EV  +Y  W+A+HG T N +G  E+RF+ F+D
Sbjct: 18  VSLAAAADMSIVSYGE--------RSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRD 69

Query: 79  NLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRS--DAKRRLMKSKVASQR 132
           NLR+ID+HN+       ++++GLN+FADLTNEEYR+ YLG R+  D +R+L      S R
Sbjct: 70  NLRYIDQHNAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKL------SAR 123

Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
           Y     DELPESVDWR+KGAV  VKDQG CGSCWAFS +AAVEGIN+IVTG++I LSEQE
Sbjct: 124 YQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQE 183

Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
           LVDCD   N GCNGGLMDYAF+FII NGG+DSE+DYPY   +N+CD +++NAKVV+IDGY
Sbjct: 184 LVDCDTSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGY 243

Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
           EDV    E SL+KAVA+QP+SVAIEAGGRAFQ Y+SG+FTG CG+ALDHGV AVGYGTEN
Sbjct: 244 EDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTEN 303

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
           G DYWLVRNSWGS WGE+GY++++RN +  ++GKCGIA+E SYP K ++    P
Sbjct: 304 GKDYWLVRNSWGSVWGEDGYIRMERN-IKASSGKCGIAVEPSYPTKTARTPLTP 356


>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
          Length = 325

 Score =  459 bits (1182), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 217/321 (67%), Positives = 255/321 (79%), Gaps = 2/321 (0%)

Query: 49  MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
           MT+Y+ WL KH K  NG+G  + RFQIFKDNLRFIDEHN+ N +YKVGLNKFAD+ NEEY
Sbjct: 1   MTMYEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQNYSYKVGLNKFADINNEEY 60

Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
           R MYLGT+SDAKRR+MK+K+   R    +   +   VDWR KGAV  +KDQGSCGSCWAF
Sbjct: 61  RDMYLGTKSDAKRRVMKTKITGHRITYNS-VIVTVKVDWRLKGAVTHIKDQGSCGSCWAF 119

Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           ST+A VE INKIVTG+ +SLSEQELVDCDR  N GCNGGLMDYAF+FII+NGG+D++QDY
Sbjct: 120 STIATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGIDTDQDY 179

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY G E KCDP+++NAKVVSIDGYEDV  +   +LKKAVA QPVSVAI   GRA Q Y+S
Sbjct: 180 PYNGFERKCDPTKKNAKVVSIDGYEDVPSYMN-ALKKAVAHQPVSVAIAGLGRALQLYQS 238

Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           GVFTG+CG+ LDHGVV VGYG+ENGVDYWLVRNSWG++WGE+GY K+    + +   KCG
Sbjct: 239 GVFTGKCGTDLDHGVVVVGYGSENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRKCG 298

Query: 349 IAMEASYPVKNSQNSAKPKPH 369
           IAMEASYPVK  QN+    P 
Sbjct: 299 IAMEASYPVKYGQNTNSAAPQ 319


>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
          Length = 454

 Score =  459 bits (1181), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 223/354 (62%), Positives = 266/354 (75%), Gaps = 8/354 (2%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           + L  + L       S+S AD SIISYD     S     DD +M +Y+ WLA+H K  NG
Sbjct: 3   ILLLFAVLALSAMAGSASRADFSIISYD-----SQDLIGDDAIMELYELWLAQHKKAYNG 57

Query: 66  MGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
           +   +K+F +FKDN  +I +HN+  N +YK+GLN+FADL++EE++A YLGT+ DAK+RL 
Sbjct: 58  LDEKQKKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKAAYLGTKLDAKKRLS 117

Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
           +S   S RY    G++LPES+DWREKGAV  VK+QGSCGSCWAFSTVAAVEGIN+IVTG 
Sbjct: 118 RS--PSPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGN 175

Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           L SLSEQELVDCD   N GCNGGLMDYAFQFII NGG+DSE DYPY      CD  R+NA
Sbjct: 176 LTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIISNGGLDSEDDYPYKANNGSCDAYRKNA 235

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
            VV+ID YEDV   DE SLKKA A+QP+SVAIEA GRAFQ YESGVFT  CG+ LDHGV 
Sbjct: 236 HVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSNCGTQLDHGVT 295

Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
            VGYG+E+G+DYWLV+NSWG+ WGE G++KLQRNL   +TG CGIAMEASYPVK
Sbjct: 296 LVGYGSESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCGIAMEASYPVK 349


>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  459 bits (1180), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 218/341 (63%), Positives = 267/341 (78%), Gaps = 7/341 (2%)

Query: 23  SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRF 82
           +AADMSII+YD  H   S   TDD +M  Y++WL KHGK+ N +G  E+RFQIFKDN  +
Sbjct: 18  TAADMSIITYDQTHAVGS---TDDVIMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLY 74

Query: 83  IDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           IDE N+  +R++K+GLN+FADLTNEEYR+ Y G R+   R+ +  K  SQRYA  AG+ L
Sbjct: 75  IDEQNAAKDRSFKLGLNRFADLTNEEYRSKYTGIRTKDSRKKVSGK--SQRYASLAGESL 132

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PESVDWRE GAV  VKDQG CGSCWAFST++AVEGIN+I TG+LI+LSEQELVDCDR  N
Sbjct: 133 PESVDWREHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYN 192

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMD AFQFII NGG+DS+ DYPY G + +CD  R+NAKVV+ID YEDV  +DE 
Sbjct: 193 EGCNGGLMDDAFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEK 252

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           +L+KA A+QP+SVAIEA GR FQ Y+SG+FTG+CG+ LDHGVV VGYGTENG DYW+VRN
Sbjct: 253 ALQKAAANQPISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYGTENGKDYWIVRN 312

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           SWG+DWGE GY++++R  + +  G CGI  E SYPVK+  N
Sbjct: 313 SWGADWGEKGYLRMERG-ISSKAGICGITSEPSYPVKSGVN 352


>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
          Length = 469

 Score =  456 bits (1174), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 211/324 (65%), Positives = 263/324 (81%), Gaps = 5/324 (1%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLN 98
           R+DDEV  +YQ W A+H ++ N +  +E+R +IF+DNLRFID+HN+       ++++GL 
Sbjct: 38  RSDDEVHRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLT 97

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           +FADLTNEEYR+ YLG R+   RR   S V S RY  ++ D+LP+S+DWR+KGAV  VKD
Sbjct: 98  RFADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKD 157

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QGSCGSCWAFST+AAVEGIN IVTG+LISLSEQELVDCD   N GCNGGLMDYAF+FII 
Sbjct: 158 QGSCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIIS 217

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           NGG+D+++DYPY G +  CD  R+NA VV+ID YEDV   DE SL+KAVA+QPVSVAIEA
Sbjct: 218 NGGIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEA 277

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
           GGRAFQ YESG+FTG CG+ LDHGV A+GYG+ENG  YW+V+NSWGSDWGE+GY++++RN
Sbjct: 278 GGRAFQLYESGIFTGYCGTELDHGVTAIGYGSENGKYYWIVKNSWGSDWGESGYIRMERN 337

Query: 339 LLDTNTGKCGIAMEASYPVKNSQN 362
            +++ TGKCGIAMEASYP+KN QN
Sbjct: 338 -INSATGKCGIAMEASYPIKNGQN 360


>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  456 bits (1173), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 218/350 (62%), Positives = 274/350 (78%), Gaps = 21/350 (6%)

Query: 19  ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKD 78
           +S ++AADMSI+SY          R+++EV  +Y  W+A+HG T N +G  E+RF+ F+D
Sbjct: 18  VSLAAAADMSIVSYGE--------RSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRD 69

Query: 79  NLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRS--DAKRRLMKSKVASQR 132
           NLR+ID+HN+       ++++GLN+FADLTNEEYR+ YLG R+  D +R+L      S R
Sbjct: 70  NLRYIDQHNAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKL------SAR 123

Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
           Y     DELPESVDWR+KGAV  VKDQG CGSCWAFS +AAVEGIN+IVTG++I LSEQE
Sbjct: 124 YQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQE 183

Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
           LVDCD   N GCNGGLMDYAF+FII NGG+DSE+DYPY   +N+CD +++NAKVV+IDGY
Sbjct: 184 LVDCDTSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGY 243

Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
           EDV    E SL+KAVA+QP+SVAIEAGGRAFQ Y+SG+FTG CG+ALDHGV AVGYGTEN
Sbjct: 244 EDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTEN 303

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           G DYWLVRNSWGS WGE+GY++++RN +  ++GKCGIA+E SYP K  +N
Sbjct: 304 GKDYWLVRNSWGSVWGEDGYIRMERN-IKASSGKCGIAVEPSYPTKTGEN 352


>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
          Length = 463

 Score =  455 bits (1170), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 219/354 (61%), Positives = 262/354 (74%), Gaps = 8/354 (2%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           + L  + L       S+S AD SII YD     S   R DD +M +Y+ WLA+H K  NG
Sbjct: 3   ILLLFAVLALSAMAGSASRADFSIIGYD-----SKDLREDDAIMELYELWLAQHKKAYNG 57

Query: 66  MGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
           +G  + RF +FKDN  +I +HN+  N +YK+GLN+FADL++EE++A YLG + D K+RL 
Sbjct: 58  LGEKQNRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLS 117

Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
            S   S RY    G++LPES+DWREKGAV  VKDQGSCGSCWAFSTVAAVEGIN+IVTG 
Sbjct: 118 NS--PSPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGN 175

Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           L SLSEQELVDCD   N GCNGGLMDYAFQFII NGG+DSE DYPY   +  CD  R+NA
Sbjct: 176 LTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDGSCDAYRKNA 235

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
            VV+ID YEDV   DE SLKKA A+QP+SVAIEA GRAFQ YESGVFT  CG+ LDHGV 
Sbjct: 236 HVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSTCGTQLDHGVT 295

Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
            VGYG+E+G DYW+V+NSWG  WGE G+++LQRN+   +TG CGIAMEASYP+K
Sbjct: 296 LVGYGSESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVSTGMCGIAMEASYPLK 349


>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
 gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
          Length = 452

 Score =  454 bits (1167), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 216/338 (63%), Positives = 260/338 (76%), Gaps = 9/338 (2%)

Query: 21  SSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNL 80
           S+S AD SIIS       S   R DD +M +Y+ WLA+H +  NG+   +KRF +FKDN 
Sbjct: 18  SASRADFSIIS-------SKDLREDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNF 70

Query: 81  RFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE 140
            +I EHN  NR+YK+GLN+FADL++EE++A YLG + D K+RL  S+  S+RY    G++
Sbjct: 71  LYIHEHNQGNRSYKLGLNQFADLSHEEFKATYLGAKLDTKKRL--SRPPSRRYQYSDGED 128

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           LPES+DWREKGAV  VKDQGSCGSCWAFSTVAAVEGIN+IVTG+LISLSEQELVDCD   
Sbjct: 129 LPESIDWREKGAVTSVKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSY 188

Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
           N GCNGGLMDYAF+FII NGG+DSE+DYPY   +  CD  R+NA VV+ID YEDV   DE
Sbjct: 189 NQGCNGGLMDYAFEFIINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDE 248

Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
            SLKKA A+QP+SVAIEA GR FQ Y+SGVFT  CG+ LDHGV  VGYG+E+G DYW V+
Sbjct: 249 KSLKKAAANQPISVAIEASGREFQFYDSGVFTSTCGTQLDHGVTLVGYGSESGTDYWTVK 308

Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           NSWG  WGE G+++LQRN+   +TG CGIAMEASYPVK
Sbjct: 309 NSWGKSWGEEGFIRLQRNIEVASTGMCGIAMEASYPVK 346


>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
          Length = 423

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 213/308 (69%), Positives = 253/308 (82%), Gaps = 5/308 (1%)

Query: 56  LAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLG 114
           L KH K  N +G  EKRF+IFKDNLRFIDEHN  +N+++K+GLNKFADL+NEEY++M+LG
Sbjct: 11  LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70

Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
            R    R+  +S     R+    GDELP+SVDWREKGAV PVKDQG CGSCWAFSTVAAV
Sbjct: 71  GRMVRDRKGFES----DRFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAV 126

Query: 175 EGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE 234
           EGIN+I TG+LISLSEQELVDCD+  N GCNGG MDYAF+FI++NGG+D+E DYPY G +
Sbjct: 127 EGINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVD 186

Query: 235 NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE 294
            +CD +R+NAKVV+I+G+EDV   DE SLKKAVA QPVSVAIEAGGRAFQ YESG+F G 
Sbjct: 187 GQCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGL 246

Query: 295 CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEAS 354
           CG+ LDHGVVAVGYGTE+G DYW+VRNSWG +WGENGY++L+RN+  TNTGKCGIAM+ S
Sbjct: 247 CGTDLDHGVVAVGYGTEDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQPS 306

Query: 355 YPVKNSQN 362
           YP K   N
Sbjct: 307 YPTKTGVN 314


>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 212/314 (67%), Positives = 248/314 (78%), Gaps = 34/314 (10%)

Query: 49  MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
           M +Y+ WL KHGK+ N +G  E+RF+IFKDNLRFI+EHN++NRTYKVG            
Sbjct: 1   MAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVG------------ 48

Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
                                  RY+ +AG++LPESVDWREKGAV PVKDQG+CGSCWAF
Sbjct: 49  ----------------------DRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAF 86

Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           ST+AAVEGIN+I TG+LISLSEQELVDCD+  N GCNGGLMDYAF+FII NGG+DSE+DY
Sbjct: 87  STIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEEDY 146

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY  A+  CDP+R+NA+VVSIDGYEDV   DE SLKKAVA+QPVSVAIEAGGRAFQ Y+S
Sbjct: 147 PYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQS 206

Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           GVFTG+CG+ LDHGVVAVGYGTEN VDYW+VRNSWG +WGE+GY+KL+RNL  T TGKCG
Sbjct: 207 GVFTGQCGTQLDHGVVAVGYGTENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCG 266

Query: 349 IAMEASYPVKNSQN 362
           IA+E SYP+KN QN
Sbjct: 267 IAIEPSYPIKNGQN 280


>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
          Length = 462

 Score =  448 bits (1152), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 215/347 (61%), Positives = 270/347 (77%), Gaps = 21/347 (6%)

Query: 22  SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLR 81
           ++AADMSI+ Y          R+++EV  +Y  W+A+H  T N +G  E+RF+ F++NLR
Sbjct: 20  AAAADMSIVFYGE--------RSEEEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLR 71

Query: 82  FIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRS--DAKRRLMKSKVASQRYAC 135
           +ID+HN+       ++++GLN+FADLTNEEYR+ YLG R+  D +R+L      S RY  
Sbjct: 72  YIDQHNAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKL------SARYQA 125

Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
              DELPESVDWR+KGAV  VKDQG CGSCWAFS +AAVEGIN+IVTG++I LSEQELVD
Sbjct: 126 ADNDELPESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVD 185

Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
           CD   N GCNGGLMDYAF+FII NGG+DSE+DYPY   +N+CD +++NAKVV+IDGYEDV
Sbjct: 186 CDTSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDV 245

Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
               E SL+KAVA+QP+SVAIEAGGRAFQ Y+SG+FTG CG+ALDHGV AVGYGTENG D
Sbjct: 246 PVNSEKSLQKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTENGKD 305

Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           YWLVRNSWGS WGENGY++++RN +  ++GKCGIA+E SYP K  +N
Sbjct: 306 YWLVRNSWGSVWGENGYIRMERN-IKASSGKCGIAVEPSYPTKTGEN 351


>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 456

 Score =  446 bits (1148), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 210/341 (61%), Positives = 270/341 (79%), Gaps = 17/341 (4%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSI+SY          R+++EV  +Y  W+A++G+T N +G  E+RF++F+DNLR++D+
Sbjct: 24  DMSIVSYGE--------RSEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQ 75

Query: 86  HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           HN+       ++++GLN+FADLTNEEYR  YLG R+    + ++ +  S RY     +EL
Sbjct: 76  HNAAADAGLHSFRLGLNRFADLTNEEYRDTYLGVRT----KPVRERRLSGRYQAADNEEL 131

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PESVDWREKGAV  VKDQG CGSCWAFS +AAVEGIN+IVTG++I+LSEQELVDCD   N
Sbjct: 132 PESVDWREKGAVAKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYN 191

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMDYAF+FII NGG+DSE+DYPY   +N+CD +++NAKVV+IDGYEDV    E+
Sbjct: 192 QGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEL 251

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           SLKKAVA+QP+SVAIEAGGRAFQ Y+SG+FTG CG+ALDHGV AVGYG+ENG DYW+V+N
Sbjct: 252 SLKKAVANQPISVAIEAGGRAFQLYKSGIFTGRCGTALDHGVTAVGYGSENGKDYWIVKN 311

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           SWG+ WGE+GYV+L+RN+  T +GKCGIA+E SYP+K   N
Sbjct: 312 SWGTVWGEDGYVRLERNIKAT-SGKCGIAIEPSYPLKKGAN 351


>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
          Length = 468

 Score =  445 bits (1144), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 211/337 (62%), Positives = 262/337 (77%), Gaps = 17/337 (5%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSI+SY          R+D+E   +Y  W+A HG+T N +G  E+R+Q+F+DNLR+ID 
Sbjct: 28  DMSIVSYGE--------RSDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDA 79

Query: 86  HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           HN+       ++++GLN+FADLTN+EYRA YLG R+    R  + +    RY     ++L
Sbjct: 80  HNAAADAGVHSFRLGLNRFADLTNDEYRATYLGART----RPQRERKLGARYHAADNEDL 135

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PESVDWR KGAV  VKDQGSCGSCWAFST+AAVEGIN+IVTG+LISLSEQELVDCD   N
Sbjct: 136 PESVDWRAKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYN 195

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV   DE 
Sbjct: 196 QGCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEK 255

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           SL+KAVA+QPVSVAIEA G AFQ Y SG+FTG CG+ALDHGV AVGYGTENG DYW+V+N
Sbjct: 256 SLQKAVANQPVSVAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKN 315

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           SWGS WGE+GYV+++RN +  ++GKCGIA+E SYP+K
Sbjct: 316 SWGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLK 351


>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
 gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
          Length = 461

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 209/343 (60%), Positives = 269/343 (78%), Gaps = 21/343 (6%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSI+SY          R+++EV  +Y  W+++H +T N +G  E+RF++F+DNLR+ID+
Sbjct: 23  DMSIVSYGE--------RSEEEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQ 74

Query: 86  HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRS--DAKRRLMKSKVASQRYACKAGD 139
           HN+       ++++GLN+FADLTNEEYR+ YLG R+  D +R+L      S RY     +
Sbjct: 75  HNAAADAGLHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKL------SARYQADDNE 128

Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK 199
           ELPE+VDWR+KGAV  +KDQG CGSCWAFS +AAVEGIN+IVTG++I LSEQELVDCD  
Sbjct: 129 ELPETVDWRKKGAVAAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTS 188

Query: 200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
            N GCNGGLMDYAF+FII NGG+DSE+DYPY   +N+CD +++NAKVV+IDGYEDV    
Sbjct: 189 YNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNS 248

Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
           E SL+KAVA+QP+SVAIEAGGRAFQ Y+SG+FTG CG+ALDHGV AVGYGTENG DYWLV
Sbjct: 249 EKSLQKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLV 308

Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           RNSWG+ WGE+GY++++RN +  ++GKCGIA+E SYP K  +N
Sbjct: 309 RNSWGTVWGEDGYIRMERN-IKASSGKCGIAVEPSYPTKTGEN 350


>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
 gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 452

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 213/362 (58%), Positives = 278/362 (76%), Gaps = 16/362 (4%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MAT+   + ++ L+F   + S S   ++          + + R + E   +Y+ WL ++ 
Sbjct: 1   MATSIKSITLALLIFSVLLISLSLGSVT---------ATETTRNEAEARRMYERWLVENR 51

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K  NG+G  E+RF+IFKDNL+F++EH+S+ NRTY+VGL +FADLTN+E+RA+YL  RS  
Sbjct: 52  KNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYL--RSKM 109

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           +R   +  V  ++Y  K GD LP+++DWR KGAVNPVKDQGSCGSCWAFS + AVEGIN+
Sbjct: 110 ER--TRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQ 167

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE-NKCD 238
           I TGELISLSEQELVDCD   N GC GGLMDYAF+FII+NGG+D+E+DYPY+  + N C+
Sbjct: 168 IKTGELISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCN 227

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             ++N +VV+IDGYEDV   DE SLKKA+A+QP+SVAIEAGGRAFQ Y SGVFTG CG++
Sbjct: 228 SDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTS 287

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           LDHGVVAVGYG+E G DYW+VRNSWGS+WGE+GY KL+RN+ ++ +GKCG+AM ASYP K
Sbjct: 288 LDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNIKES-SGKCGVAMMASYPTK 346

Query: 359 NS 360
           +S
Sbjct: 347 SS 348


>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 459

 Score =  443 bits (1140), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 218/350 (62%), Positives = 265/350 (75%), Gaps = 15/350 (4%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG-H 68
           I  L+F  FI+ S+A+  SII            RTDDEVM +Y  W AKHGK  N +G  
Sbjct: 9   IMALLFFLFIALSAASPSSIIPQ----------RTDDEVMALYDQWRAKHGKLHNNLGAE 58

Query: 69  NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            E RF IFKDNL+FIDE N+ N  Y++GLN FADLTNEEYR+ YLG +  +  R  ++  
Sbjct: 59  PENRFHIFKDNLKFIDEINAQNLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRT-- 116

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
            S RY  + GD+LP+S+DWR KGAV PVKDQGSCGSCWAFSTVA+VE IN+IVTG+LI+L
Sbjct: 117 -SNRYLPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIAL 175

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           SEQELVDCDR  N GCNGGLMDYAF+FII+NGG+D+E+DYPY G ++ C   ++NAKVV+
Sbjct: 176 SEQELVDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNAKVVA 235

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
           ID YEDV   +E +L+KAV+ Q VSVAIE GGR+FQ Y+SG+FTG CG+ LDHGV  VGY
Sbjct: 236 IDSYEDVPVNNEKALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGY 295

Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           G+E GVDYW+VRNSWG  WGE+GYVK+QRN+  + TG CGIAME SYP K
Sbjct: 296 GSEGGVDYWIVRNSWGGSWGESGYVKMQRNIA-SPTGLCGIAMEPSYPTK 344


>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
          Length = 469

 Score =  443 bits (1140), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 209/340 (61%), Positives = 263/340 (77%), Gaps = 17/340 (5%)

Query: 27  MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
           MSI+SY          R+++E   +Y  W+A HG+T N +G  E+RF++F+DNLR++D H
Sbjct: 29  MSIVSYGE--------RSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAH 80

Query: 87  NSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
           N+       ++++GLN+FADLTN+EYRA YLG RS    R  + +    RY     ++LP
Sbjct: 81  NAAADAGVHSFRLGLNRFADLTNDEYRATYLGVRS----RPQRERRLGDRYLAGDNEDLP 136

Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA 202
           ESVDWR KGAV  VKDQGSCGSCWAFST+AAVEGIN+IVTG++ISLSEQELVDCD   N 
Sbjct: 137 ESVDWRAKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQ 196

Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
           GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV    E S
Sbjct: 197 GCNGGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKS 256

Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
           L+KAVA+QP+SVAIEAGGRAFQ Y SG+FTG CG+ALDHGV AVGYGTENG DYW+V+NS
Sbjct: 257 LQKAVANQPISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGTENGKDYWIVKNS 316

Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           WGS WGE+GYV+++RN +  ++GKCGIA+E SYP+K   N
Sbjct: 317 WGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGAN 355


>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
 gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
          Length = 458

 Score =  443 bits (1140), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 211/341 (61%), Positives = 261/341 (76%), Gaps = 17/341 (4%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSI+SY          R+++E   +Y  W A+HGK+ N +G  E+R+  F+DNLR+IDE
Sbjct: 22  DMSIVSYGE--------RSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE 73

Query: 86  HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           HN+       ++++GLN+FADLTNEEYR  YLG R+  +R     +  S RY     + L
Sbjct: 74  HNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAADNEAL 129

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PESVDWR KGAV  +KDQG CGSCWAFS +AAVEGIN+IVTG+LISLSEQELVDCD   N
Sbjct: 130 PESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYN 189

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMDYAF FII NGG+D+E DYPY G + +CD +R+NAKVV+ID YEDV+P  E 
Sbjct: 190 EGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSET 249

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           SL+KAVA+QPVSVAIEAGGRAFQ Y SG+FTG+CG+ALDHGV AVGYGTENG DYW+VRN
Sbjct: 250 SLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRN 309

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           SWG  WGE+GYV+++RN +  ++GKCGIA+E SYP+K  +N
Sbjct: 310 SWGKSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGEN 349


>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
 gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
          Length = 469

 Score =  443 bits (1139), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 208/340 (61%), Positives = 263/340 (77%), Gaps = 17/340 (5%)

Query: 27  MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
           MSI+SY          R+++E   +Y  W+A HG+T N +G  E+RF++F+DNLR++D H
Sbjct: 29  MSIVSYGE--------RSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAH 80

Query: 87  NSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
           N+       ++++GLN+FADLTN+EYRA YLG RS    R  + +    RY     ++LP
Sbjct: 81  NAAADAGVHSFRLGLNRFADLTNDEYRATYLGVRS----RPQRERRLGDRYLAGDNEDLP 136

Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA 202
           ESVDWR KGAV  +KDQGSCGSCWAFST+AAVEGIN+IVTG++ISLSEQELVDCD   N 
Sbjct: 137 ESVDWRAKGAVAEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQ 196

Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
           GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV    E S
Sbjct: 197 GCNGGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKS 256

Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
           L+KAVA+QP+SVAIEAGGRAFQ Y SG+FTG CG+ALDHGV AVGYGTENG DYW+V+NS
Sbjct: 257 LQKAVANQPISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGTENGKDYWIVKNS 316

Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           WGS WGE+GYV+++RN +  ++GKCGIA+E SYP+K   N
Sbjct: 317 WGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGAN 355


>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
          Length = 459

 Score =  443 bits (1139), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 211/341 (61%), Positives = 261/341 (76%), Gaps = 17/341 (4%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSI+SY          R+++E   +Y  W A+HGK+ N +G  E+R+  F+DNLR+IDE
Sbjct: 23  DMSIVSYGE--------RSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE 74

Query: 86  HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           HN+       ++++GLN+FADLTNEEYR  YLG R+  +R     +  S RY     + L
Sbjct: 75  HNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAADNEAL 130

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PESVDWR KGAV  +KDQG CGSCWAFS +AAVEGIN+IVTG+LISLSEQELVDCD   N
Sbjct: 131 PESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYN 190

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMDYAF FII NGG+D+E DYPY G + +CD +R+NAKVV+ID YEDV+P  E 
Sbjct: 191 EGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSET 250

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           SL+KAVA+QPVSVAIEAGGRAFQ Y SG+FTG+CG+ALDHGV AVGYGTENG DYW+VRN
Sbjct: 251 SLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRN 310

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           SWG  WGE+GYV+++RN +  ++GKCGIA+E SYP+K  +N
Sbjct: 311 SWGKSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGEN 350


>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
          Length = 458

 Score =  442 bits (1138), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 211/341 (61%), Positives = 260/341 (76%), Gaps = 17/341 (4%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSI+SY          R+++E   +Y  W A+HGK  N +G  E+R+  F+DNLR+IDE
Sbjct: 22  DMSIVSYGE--------RSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDE 73

Query: 86  HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           HN+       ++++GLN+FADLTNEEYR  YLG R+  +R     +  S RY     + L
Sbjct: 74  HNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAADNEAL 129

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PESVDWR KGAV  +KDQG CGSCWAFS +AAVEGIN+IVTG+LISLSEQELVDCD   N
Sbjct: 130 PESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYN 189

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMDYAF FII NGG+D+E DYPY G + +CD +R+NAKVV+ID YEDV+P  E 
Sbjct: 190 EGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSET 249

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           SL+KAVA+QPVSVAIEAGGRAFQ Y SG+FTG+CG+ALDHGV AVGYGTENG DYW+VRN
Sbjct: 250 SLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRN 309

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           SWG  WGE+GYV+++RN +  ++GKCGIA+E SYP+K  +N
Sbjct: 310 SWGKSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGEN 349


>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
           cycling base population CrGC5, Peptide, 328 aa]
          Length = 328

 Score =  442 bits (1138), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 206/324 (63%), Positives = 265/324 (81%), Gaps = 8/324 (2%)

Query: 49  MTIYQTWLAKHGKT---SNGM-GHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFAD 102
           M+IY  W  +HGK+   SNG+    ++RF IFKDNLRFID HN  N+  TYK+GL  FA+
Sbjct: 1   MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD-ELPESVDWREKGAVNPVKDQGS 161
           LTN+EYR++YLG R++  RR+ K+K  + +Y+    D E+P +VDWR+KGAVN +KDQG+
Sbjct: 61  LTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGT 120

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
           CGSCWAFST AAVEGINKIVTGEL+SLSEQELVDCD+  N GCNGGLMDYAFQFI++NGG
Sbjct: 121 CGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 180

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           +++E+DYPY G   KC+   +N++VV+IDGYEDV   DE +LK+AV+ QPVSVAI+AGGR
Sbjct: 181 LNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGR 240

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           AFQHY+SG+FTG+CG+ +DH VVAVGYG+ENGVDYW+VRNSWG+ WGE+GY++++RN+  
Sbjct: 241 AFQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVA- 299

Query: 342 TNTGKCGIAMEASYPVKNSQNSAK 365
           + +GKCGIA+EASYPVK S N  +
Sbjct: 300 SKSGKCGIAIEASYPVKYSPNPVR 323


>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
          Length = 328

 Score =  442 bits (1138), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 206/324 (63%), Positives = 265/324 (81%), Gaps = 8/324 (2%)

Query: 49  MTIYQTWLAKHGKT---SNGM-GHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFAD 102
           M+IY  W  +HGK+   SNG+    ++RF IFKDNLRFID HN  N+  TYK+GL  FA+
Sbjct: 1   MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG-DELPESVDWREKGAVNPVKDQGS 161
           LTN+EYR++YLG R++  RR+ K+K  + +Y+     DE+P +VDWR+KGAVN +KDQG+
Sbjct: 61  LTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGT 120

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
           CGSCWAFST AAVEGINKIVTGEL+SLSEQELVDCD+  N GCNGGLMDYAFQFI++NGG
Sbjct: 121 CGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 180

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           +++E+DYPY G   KC+   +N++VV+IDGYEDV   DE +LK+AV+ QPVSVAI+AGGR
Sbjct: 181 LNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGR 240

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           AFQHY+SG+FTG+CG+ +DH VVAVGYG+ENGVDYW+VRNSWG+ WGE+GY++++RN+  
Sbjct: 241 AFQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVA- 299

Query: 342 TNTGKCGIAMEASYPVKNSQNSAK 365
           + +GKCGIA+EASYPVK S N  +
Sbjct: 300 SKSGKCGIAIEASYPVKYSPNPVR 323


>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
 gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
          Length = 463

 Score =  442 bits (1137), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 210/337 (62%), Positives = 261/337 (77%), Gaps = 17/337 (5%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSI+SY          R+ +E   +Y  W+A HG+T N +G  E+R+Q+F+DNLR+ID 
Sbjct: 23  DMSIVSYGE--------RSXEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDA 74

Query: 86  HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           HN+       ++++GLN+FADLTN+EYRA YLG R+    R  + +    RY     ++L
Sbjct: 75  HNAAADAGVHSFRLGLNRFADLTNDEYRATYLGART----RPQRERKLGARYHAADNEDL 130

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PESVDWR KGAV  VKDQGSCGSCWAFST+AAVEGIN+IVTG+LISLSEQELVDCD   N
Sbjct: 131 PESVDWRAKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYN 190

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV   DE 
Sbjct: 191 QGCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEK 250

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           SL+KAVA+QPVSVAIEA G AFQ Y SG+FTG CG+ALDHGV AVGYGTENG DYW+V+N
Sbjct: 251 SLQKAVANQPVSVAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKN 310

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           SWGS WGE+GYV+++RN +  ++GKCGIA+E SYP+K
Sbjct: 311 SWGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLK 346


>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
          Length = 458

 Score =  438 bits (1127), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 209/341 (61%), Positives = 259/341 (75%), Gaps = 17/341 (4%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSI+SY          R+++E   +Y  W A+HGK+ N +G  E+R+  F+DNLR+IDE
Sbjct: 22  DMSIVSYGE--------RSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE 73

Query: 86  HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           HN+       ++++GLN+FADLTNEEYR  YLG R+  +R     +  S RY     + L
Sbjct: 74  HNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAADNEAL 129

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PESVDWR KGAV  +KDQG CGSCWAFS +AAVE IN+IVTG+LISLSEQELVDCD   N
Sbjct: 130 PESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYN 189

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMDYAF FII NGG+D+E DYPY G + +CD +R+NAKVV+ID YEDV+P  E 
Sbjct: 190 EGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSET 249

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           SL+KAV +QPVSVAIEAGGRAFQ Y SG+FTG+CG+ALDHGV AVGYGTENG DYW+VRN
Sbjct: 250 SLQKAVRNQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRN 309

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           SWG  WGE+GYV+++RN +  ++GKCGIA+E SYP+K  +N
Sbjct: 310 SWGKSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGEN 349


>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
 gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
          Length = 480

 Score =  438 bits (1127), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 208/336 (61%), Positives = 259/336 (77%), Gaps = 17/336 (5%)

Query: 27  MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
           MSI+SY          RTD+E   +Y  W+A HG+T N +G  E+R+Q+F+DNLR+ID H
Sbjct: 27  MSIVSYGE--------RTDEEARRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAH 78

Query: 87  NSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
           N+       ++++GLN+FADLTN+EY A YLG R+    R  + +    RY     ++LP
Sbjct: 79  NAAADAGVHSFRLGLNRFADLTNDEYPATYLGART----RPQRDRKLGARYHAADNEDLP 134

Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA 202
           ESVDWR KGAV  VKDQGSCG+CWAFST+AAVEGIN+IVTG+LISLSEQELVDCD   N 
Sbjct: 135 ESVDWRAKGAVAEVKDQGSCGTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQ 194

Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
           GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV   DE S
Sbjct: 195 GCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKS 254

Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
           L+KAVA+QPVSVAIEA G AFQ Y SG+FTG CG+ LDHGV AVGYGTENG DYW+V+NS
Sbjct: 255 LQKAVANQPVSVAIEAAGTAFQLYSSGIFTGSCGTRLDHGVTAVGYGTENGKDYWIVKNS 314

Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           WGS WGE+GYV+++RN +  ++GKCGIA+E SYP+K
Sbjct: 315 WGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLK 349


>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
           [Zea mays]
 gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
           mays]
          Length = 465

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 209/337 (62%), Positives = 260/337 (77%), Gaps = 17/337 (5%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSI+SY          R+D+E   +Y  W+A HG+T N +G  E+R+Q+F+DNLR+ID 
Sbjct: 26  DMSIVSYGE--------RSDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDA 77

Query: 86  HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           HN+       ++++GLN+FADLTN+EYRA YLG R+    R  + +    RY     ++L
Sbjct: 78  HNAAADAGVHSFRLGLNRFADLTNDEYRATYLGART----RPQRERKLGARYHAADNEDL 133

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PESVDWR KGAV  VKDQGS GSCWAFST+AAVEGIN+IVTG+LISLSEQELVDCD   N
Sbjct: 134 PESVDWRAKGAVAEVKDQGSYGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYN 193

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV   DE 
Sbjct: 194 QGCNGGLMDYAFEFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEK 253

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           SL+KAVA+QPVSVAIEA G  FQ Y SG+FTG CG+ALDHGV AVGYGTENG DYW+V+N
Sbjct: 254 SLQKAVANQPVSVAIEAAGTQFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKN 313

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           SWGS WGE+GYV+++RN +  ++GKCGIA+E SYP+K
Sbjct: 314 SWGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLK 349


>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score =  436 bits (1122), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 210/360 (58%), Positives = 273/360 (75%), Gaps = 16/360 (4%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MAT    + ++ L+F   + S S   ++          + + R + E   +Y+ WL ++ 
Sbjct: 1   MATPIKSITLALLIFSMLLISLSLGSVT---------AADTTRNEAEARRMYEQWLVENR 51

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K  NG+G  E RF+IF DNL++I+EHNS+ N+T++VGL +FADLTN+E+RA+YL  RS  
Sbjct: 52  KNYNGLGEKETRFEIFTDNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYL--RSKM 109

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           +R   +  V  +RY  K GD LP+ +DWR KGAVNPVKDQG+CGSCWAFS + AVEGIN+
Sbjct: 110 ER--TRVPVKGERYLYKVGDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQ 167

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA-ENKCD 238
           I TGELISLSEQELVDCD   N GC GGLMDYAF+FII+NGG+D+E+DYPY    +N C+
Sbjct: 168 IKTGELISLSEQELVDCDTSYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICN 227

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             ++N++VV+IDGYEDV   DE SLKKA+A+QP+SVAIEAGGRAFQ Y+SGVFTG CG++
Sbjct: 228 SDKKNSRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTS 287

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           LDHGVVAVGYG+E G DYW+VRNSWGS+WGE+GY KL+RN+ ++ +GKCG+AM ASYP K
Sbjct: 288 LDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNIKES-SGKCGVAMMASYPTK 346


>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
          Length = 458

 Score =  436 bits (1121), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 209/341 (61%), Positives = 259/341 (75%), Gaps = 17/341 (4%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSI+SY          R+++E   +Y  W A+HGK+ N +G  E+R+  F+DNLR+IDE
Sbjct: 22  DMSIVSYGE--------RSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE 73

Query: 86  HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           HN+       ++++GLN+FADLTNEEYR  YLG R+  +R     +  S RY     + L
Sbjct: 74  HNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAADNEAL 129

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PESVDWR KGAV  +KDQ   GSCWAFS +AAVEGIN+IVTG+LISLSEQELVDCD   N
Sbjct: 130 PESVDWRTKGAVAEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYN 189

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMDYAF FII NGG+D+E DYPY G + +CD +R+NAKVV+ID YEDV+P  E 
Sbjct: 190 EGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSET 249

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           SL+KAVA+QPVSVAIEAGGRAFQ Y SG+FTG+CG+ALDHGV AVGYGTENG DYW+VRN
Sbjct: 250 SLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRN 309

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           SWG  WGE+GYV+++RN +  ++GKCGIA+E SYP+K  +N
Sbjct: 310 SWGKSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGEN 349


>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
          Length = 388

 Score =  435 bits (1118), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 207/313 (66%), Positives = 240/313 (76%), Gaps = 34/313 (10%)

Query: 49  MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
           M +Y+ WLAKHGK+ N +G  E+RFQIFKDNLRFIDEHN+ NRTYK+             
Sbjct: 1   MAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKI------------- 47

Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
                                S RYA + GD LPESVDWR+KGAV  VKDQGSCGSCWAF
Sbjct: 48  ---------------------SDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAF 86

Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           ST+AAVEGINKIVTG LISLSEQELVDCD   N GCNGGLMDYAF+FII NGG+DSE+DY
Sbjct: 87  STIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDY 146

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY  ++ +CD  R+NAKVV+IDGYEDV   DE SL+KAVA+QPVSVAIEAGGR FQ Y+S
Sbjct: 147 PYKASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQS 206

Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           G+FTG CG+ALDHGV AVGYGTENGVDYW+V+NSWG+ WGE GY++++R+L  + TGKCG
Sbjct: 207 GIFTGRCGTALDHGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCG 266

Query: 349 IAMEASYPVKNSQ 361
           IAMEASYP+K  Q
Sbjct: 267 IAMEASYPIKKGQ 279


>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
          Length = 470

 Score =  434 bits (1115), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 211/353 (59%), Positives = 260/353 (73%), Gaps = 29/353 (8%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSI+SY          R+++E   +Y  W A+HGK  N +G  E+R+  F+DNLR+IDE
Sbjct: 22  DMSIVSYGE--------RSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDE 73

Query: 86  HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           HN+       ++++GLN+FADLTNEEYR  YLG R+  +R     +  S RY     + L
Sbjct: 74  HNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAADNEAL 129

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PESVDWR KGAV  +KDQG CGSCWAFS +AAVEGIN+IVTG+LISLSEQELVDCD   N
Sbjct: 130 PESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYN 189

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR------------RNAKVVSI 249
            GCNGGLMDYAF FII NGG+D+E DYPY G + +CD +R            +NAKVV+I
Sbjct: 190 EGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRVSFVFFAPLVFQKNAKVVTI 249

Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
           D YEDV+P  E SL+KAVA+QPVSVAIEAGGRAFQ Y SG+FTG+CG+ALDHGV AVGYG
Sbjct: 250 DSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG 309

Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           TENG DYW+VRNSWG  WGE+GYV+++RN +  ++GKCGIA+E SYP+K  +N
Sbjct: 310 TENGKDYWIVRNSWGKSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGEN 361


>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
          Length = 460

 Score =  434 bits (1115), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 206/323 (63%), Positives = 250/323 (77%), Gaps = 3/323 (0%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKF 100
           RT++EV  +Y+ WL  +GK  N +G  E+RF+IF DNLR+ID+HN    N +Y +GL +F
Sbjct: 29  RTEEEVRLLYEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYTLGLTRF 88

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQ 159
           ADLTNEEYR+ YLG +    R    ++   + R     GD+LP+ VDWREKGAV P+KDQ
Sbjct: 89  ADLTNEEYRSTYLGVKPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAVAPIKDQ 148

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
           G CGSCWAFSTVAAVEGIN+IVTG+LI LSEQELVDCD   N GCNGGLMDYAFQFII N
Sbjct: 149 GGCGSCWAFSTVAAVEGINQIVTGDLIVLSEQELVDCDTAYNEGCNGGLMDYAFQFIISN 208

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
           GG+D+E+DYPY   +  CDP+R+NAKVVSID YEDV   DE +LK AVA QPVSVAIE G
Sbjct: 209 GGIDTEEDYPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPVSVAIEGG 268

Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
           GR+FQ Y+SG+F G CG  LDHGVVAVGYGTE+G DYW+VRNSWG  WGE GY++++RNL
Sbjct: 269 GRSFQLYKSGIFDGRCGIDLDHGVVAVGYGTESGKDYWIVRNSWGKSWGEAGYIRMERNL 328

Query: 340 LDTNTGKCGIAMEASYPVKNSQN 362
             +++GKCGIA+E SYP+K  QN
Sbjct: 329 PSSSSGKCGIAIEPSYPIKKGQN 351


>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
          Length = 522

 Score =  434 bits (1115), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 208/340 (61%), Positives = 260/340 (76%), Gaps = 7/340 (2%)

Query: 27  MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
           MSIISY+  H      RT+ E  T+Y+ WLA+HG+  N +G  ++RF++F DNLRF+D H
Sbjct: 84  MSIISYNEEHAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAH 143

Query: 87  N--SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG-DELPE 143
           N  +    +++G+N+FADLTN+E+RA YLG R  A RR  +     +RY    G +ELPE
Sbjct: 144 NERAAEHGFRLGMNQFADLTNDEFRAAYLGARIPASRR--RGTAVGERYRHGGGAEELPE 201

Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INA 202
           SVDWREKGAV PVK+QG CGSCWAFS V++VE +N+IVTGE+++LSEQELV+C     N+
Sbjct: 202 SVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNS 261

Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
           GCNGGLMD AF FII+NGG+D+E DYPY   + KCD +R NAKVVSIDG+EDV   DE S
Sbjct: 262 GCNGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKS 321

Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
           L+KAVA QPVSVAIEAGGR FQ Y++GVFTG C + LDHGVVAVGYGTENG DYW+VRNS
Sbjct: 322 LQKAVAHQPVSVAIEAGGREFQLYKAGVFTGTCTTNLDHGVVAVGYGTENGKDYWIVRNS 381

Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           WG+ WGE+GY++++RN ++  TGKCGIAM ASYP K   N
Sbjct: 382 WGAKWGEDGYIRMERN-VNATTGKCGIAMMASYPTKKGAN 420


>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
          Length = 465

 Score =  433 bits (1113), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 208/340 (61%), Positives = 260/340 (76%), Gaps = 7/340 (2%)

Query: 27  MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
           MSIISY+  H      RT+ E  T+Y+ WLA+HG+  N +G  ++RF++F DNLRF+D H
Sbjct: 27  MSIISYNEEHAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAH 86

Query: 87  N--SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG-DELPE 143
           N  +    +++G+N+FADLTN+E+RA YLG R  A RR  +     +RY    G +ELPE
Sbjct: 87  NERAAEHGFRLGMNQFADLTNDEFRAAYLGARIPASRR--RGTAVGERYRHGGGAEELPE 144

Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INA 202
           SVDWREKGAV PVK+QG CGSCWAFS V++VE +N+IVTGE+++LSEQELV+C     N+
Sbjct: 145 SVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNS 204

Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
           GCNGGLMD AF FII+NGG+D+E DYPY   + KCD +R NAKVVSIDG+EDV   DE S
Sbjct: 205 GCNGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKS 264

Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
           L+KAVA QPVSVAIEAGGR FQ Y++GVFTG C + LDHGVVAVGYGTENG DYW+VRNS
Sbjct: 265 LQKAVAHQPVSVAIEAGGREFQLYKAGVFTGTCTTNLDHGVVAVGYGTENGKDYWIVRNS 324

Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           WG+ WGE+GY++++RN ++  TGKCGIAM ASYP K   N
Sbjct: 325 WGAKWGEDGYIRMERN-VNATTGKCGIAMMASYPTKKGAN 363


>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
 gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
          Length = 462

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 207/347 (59%), Positives = 263/347 (75%), Gaps = 7/347 (2%)

Query: 20  SSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDN 79
           +++    MSIISY+  H      RT+ E  T+Y+ WLA+HG+  N +G  ++RF++F DN
Sbjct: 17  AAAPGGRMSIISYNEEHAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDN 76

Query: 80  LRFIDEHN--SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKA 137
           LRF+D HN  +    +++G+N+FADLTN+E+RA YLG R  A RR  +     +RY    
Sbjct: 77  LRFVDAHNERAAEHGFRLGMNQFADLTNDEFRAAYLGARIPAARR--RGTAVGERYRHGG 134

Query: 138 G-DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC 196
           G +ELPESVDWREKGAV PVK+QG CGSCWAFS V++VE +N+IVTGE+++LSEQELV+C
Sbjct: 135 GAEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVEC 194

Query: 197 DRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
                N+GCNGGLMD AF FII+NGG+D+E DYPY   + KCD +R NAKVVSIDG+EDV
Sbjct: 195 STDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDV 254

Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
              DE SL+KAVA QPVSVAIEAGGR FQ Y++GVF+G C + LDHGVVAVGYGTENG D
Sbjct: 255 PENDEKSLQKAVAHQPVSVAIEAGGREFQLYKAGVFSGTCTTNLDHGVVAVGYGTENGKD 314

Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           YW+VRNSWG+ WGE+GY++++RN ++  TGKCGIAM ASYP K   N
Sbjct: 315 YWIVRNSWGAKWGEDGYIRMERN-VNATTGKCGIAMMASYPTKKGAN 360


>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
          Length = 470

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 214/353 (60%), Positives = 261/353 (73%), Gaps = 12/353 (3%)

Query: 18  FISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGK-TSNGMGHNEKRFQIF 76
           F + ++  DMSIISY+  H      RT+ E   IY  W A+HG   SN +G  E+RF+ F
Sbjct: 18  FGACAAGPDMSIISYNAEHGARGLERTEAEARAIYGLWRAEHGSGNSNSLGEEERRFRAF 77

Query: 77  KDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
            DNLRF+D HN+        +++G+N+FADLTN+E+RA YLG +   +RR  ++ V  +R
Sbjct: 78  WDNLRFVDAHNARAAAGEEGFRLGMNRFADLTNDEFRAAYLGVKGAGQRRSARAGVG-ER 136

Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
           Y     +ELPE+VDWREKGAV PVK+QG CGSCWAFS V+AVE IN++VTGEL++LSEQE
Sbjct: 137 YRHDGVEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSAVESINQLVTGELVTLSEQE 196

Query: 193 LVDCDRKINA---GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
           LV+CD  IN    GCNGGLMD AF FII NGG+D+E DYPY   + KCD +RRNAKVVSI
Sbjct: 197 LVECD--INGQSNGCNGGLMDDAFDFIINNGGIDTEDDYPYKALDGKCDINRRNAKVVSI 254

Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
           DG+EDV   DE SL+KAVA QPVSVAIEAGGR FQ Y SGVFTG CG+ LDHGVVAVGYG
Sbjct: 255 DGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFTGRCGTELDHGVVAVGYG 314

Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           TENG DYW+VRNSWG  WGE GY++++RN ++  TGKCGIAM +SYP K   N
Sbjct: 315 TENGKDYWIVRNSWGPKWGEAGYLRMERN-INATTGKCGIAMMSSYPTKKGAN 366


>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
          Length = 367

 Score =  429 bits (1104), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 208/360 (57%), Positives = 270/360 (75%), Gaps = 19/360 (5%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           M   + +L+    I+ S + DMS            S R++ EVMT+Y+ WL KH K   G
Sbjct: 1   MASILYSLILFGLITLSLSLDMS------------SGRSNKEVMTMYEKWLVKHQKVYYG 48

Query: 66  MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
           +G   +RFQIFKDNL FIDEHN+ N +Y+VGLN+F+D+TN+EYR  YL   S+     +K
Sbjct: 49  LGEKNQRFQIFKDNLIFIDEHNAPNHSYRVGLNEFSDITNKEYRDTYLSRWSNNN---IK 105

Query: 126 SKVASQRYACKAG--DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTG 183
           +K+ S RYA KAG  ++LP SVDWR  GA+ P+K+QGSCG+CWAFS VAAVE INKIVTG
Sbjct: 106 NKITSVRYAYKAGHNNKLPVSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTG 163

Query: 184 ELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRN 243
            L+SLSEQELVDCDR  N GCNGG    A++FI++NGG+DS+ DYPYLG ++ C+ +++N
Sbjct: 164 SLVSLSEQELVDCDRTKNKGCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQSTCNQAKKN 223

Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGV 303
            KVVSI+GY++V    E +L +AVA+QPVSV IEA G+ FQ Y+SGVFTG CG++LDH V
Sbjct: 224 TKVVSINGYKNVQRNSESALMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAV 283

Query: 304 VAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNS 363
           V VGYG+ENG DYWLV+NSWG++WGE GY+K++RNL +TNTGKCGIAM+A+YP K  +NS
Sbjct: 284 VVVGYGSENGKDYWLVKNSWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYPTKLRENS 343


>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 445

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 204/316 (64%), Positives = 256/316 (81%), Gaps = 8/316 (2%)

Query: 47  EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTN 105
           E + +++ WL ++ K  NG+G  +KRF+IF DNL+F+ EHNS+ N++Y++GL +FADLTN
Sbjct: 32  EEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFADLTN 91

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
           EE+RA+YL  RS  +R   +  V S+RY    GD+LP+ VDWR KGAV PVKDQGSCGSC
Sbjct: 92  EEFRAIYL--RSKMER--TRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPVKDQGSCGSC 147

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFS + AVEGIN+I TGEL+SLSEQELVDCD   N GC GGLMDYAFQFII NGG+D+E
Sbjct: 148 WAFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFIISNGGIDTE 207

Query: 226 QDYPYLGA-ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           +DYPY    +N C+  ++N +VV+IDGYEDV P +E SLKKA+A+QP+SVAIEAGGR FQ
Sbjct: 208 EDYPYTATDDNICNTDKKNTRVVTIDGYEDV-PENENSLKKALANQPISVAIEAGGRGFQ 266

Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            Y+SGVFTG CG+ALDHGVVAVGYGT  G DYW++RNSWGS+WGE+GY+KLQRN+ D+ +
Sbjct: 267 LYKSGVFTGTCGTALDHGVVAVGYGTSEGQDYWIIRNSWGSNWGESGYIKLQRNIKDS-S 325

Query: 345 GKCGIAMEASYPVKNS 360
           GKCG+AM ASYP K+S
Sbjct: 326 GKCGVAMMASYPTKSS 341


>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
          Length = 465

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 207/340 (60%), Positives = 260/340 (76%), Gaps = 9/340 (2%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSIISY+  H      RT+ E    Y  WLA++G++ N +G +E+RF++F DNLRF D 
Sbjct: 28  DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADA 87

Query: 86  HNSL--NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPE 143
           HN+   +  +++G+N+FADLTNEE+RA +LG +      + +S+ A +RY     +ELPE
Sbjct: 88  HNARADDHGFRLGMNRFADLTNEEFRATFLGAKV-----VERSRAAGERYRHDGVEELPE 142

Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INA 202
           SVDWREKGAV PVK+QG CGSCWAFS V+ VE IN++VTGE+I+LSEQELV+C     N+
Sbjct: 143 SVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNS 202

Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
           GCNGGLMD AF FII+NGG+D+E DYPY   + KCD +R NAKVVSIDG+EDV   DE S
Sbjct: 203 GCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKS 262

Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
           L+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG++LDHGVVAVGYGT+NG DYW+VRNS
Sbjct: 263 LQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNS 322

Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           WG  WGE+GYV+++RN ++  TGKCGIAM ASYP K+  N
Sbjct: 323 WGPKWGESGYVRMERN-INVTTGKCGIAMMASYPTKSGAN 361


>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
          Length = 461

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 204/341 (59%), Positives = 260/341 (76%), Gaps = 10/341 (2%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSIISY+  H      RT+ E    Y  WLA++G++ N +G  E+RF++F DNL+F+D 
Sbjct: 23  DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDA 82

Query: 86  HNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
           HN+    +  +++G+N+FADLTN+E+R+ +LG +      + +S+ A +RY     +ELP
Sbjct: 83  HNARADEHGGFRLGMNRFADLTNDEFRSTFLGAKV-----VERSRAAGERYRHDGVEELP 137

Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-IN 201
           ESVDWREKGAV PVK+QG CGSCWAFS V+ VE IN++VTGE+I+LSEQELV+C     N
Sbjct: 138 ESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQN 197

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
           +GCNGGLMD AF FII+NGG+D+E DYPY   + KCD +R NAKVVSIDG+EDV   DE 
Sbjct: 198 SGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEK 257

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG++LDHGVVAVGYGT+NG DYW+VRN
Sbjct: 258 SLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRN 317

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           SWG  WGE+GYV+++RN ++  TGKCGIAM ASYP K+  N
Sbjct: 318 SWGPKWGESGYVRMERN-INATTGKCGIAMMASYPTKSGAN 357


>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
 gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
          Length = 467

 Score =  421 bits (1083), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 206/340 (60%), Positives = 253/340 (74%), Gaps = 8/340 (2%)

Query: 27  MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGK-TSNGMGHNEKRFQIFKDNLRFIDE 85
           MSIISY+  H      RT+ EV  +Y+ WL +HG+  SN +G ++ RF++F DNLRF+D 
Sbjct: 31  MSIISYNEEHGARGLERTEAEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDA 90

Query: 86  HNSL--NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPE 143
           HN       +++G+N+FADLTN+E+RA YLG R  A R         + Y     +ELPE
Sbjct: 91  HNERAGEHGFRLGMNQFADLTNDEFRAAYLGARIPAAR---SGNAVGEMYRHDGAEELPE 147

Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NA 202
           SVDWREKGAV PVK+QG CGSCWAFS V++VE IN+IVTGE+++LSEQELV+C     N+
Sbjct: 148 SVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNS 207

Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
           GCNGGLMD AF FII+NGG+D+E DYPY   + KCD +RRNAKVVSID +EDV   DE S
Sbjct: 208 GCNGGLMDAAFNFIIKNGGIDTEDDYPYKAVDGKCDINRRNAKVVSIDAFEDVPENDEKS 267

Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
           L+KAVA QPVSVAIEAGGR FQ Y+SGVF+G C + LDHGVVAVGYGTENG DYW+VRNS
Sbjct: 268 LQKAVAHQPVSVAIEAGGRQFQLYKSGVFSGSCTTNLDHGVVAVGYGTENGKDYWIVRNS 327

Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           WG  WGE GY++++RN ++  TGKCGIAM ASYP K   N
Sbjct: 328 WGPKWGEAGYIRMERN-INATTGKCGIAMMASYPTKKGAN 366


>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
 gi|194701798|gb|ACF84983.1| unknown [Zea mays]
 gi|194704800|gb|ACF86484.1| unknown [Zea mays]
 gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
 gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
          Length = 470

 Score =  420 bits (1080), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 207/344 (60%), Positives = 256/344 (74%), Gaps = 12/344 (3%)

Query: 27  MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE----KRFQIFKDNLRF 82
           MSII+Y+  H      RT+ EV  +Y  WLA+HG+  N +G  E    +RF +F DNLRF
Sbjct: 32  MSIITYNEEHGARGLERTEPEVRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRF 91

Query: 83  IDEHNSLN--RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK-AGD 139
           +D HN     R +++G+N+FADLTN+E+RA YLG    A RR     V  +RY    A +
Sbjct: 92  VDAHNERAGARGFRLGMNQFADLTNDEFRAAYLGAMVPAARR---GAVVGERYRHDGAAE 148

Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK 199
           ELPESVDWREKGAV PVK+QG CGSCWAFS V++VE +N+IVTGE+++LSEQELV+C   
Sbjct: 149 ELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTD 208

Query: 200 I-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
             N+GCNGGLMD AF FII+NGG+D+E DYPY   + KCD +R+NA+VVSIDG+EDV   
Sbjct: 209 GGNSGCNGGLMDAAFDFIIKNGGIDTEDDYPYRAVDGKCDMNRKNARVVSIDGFEDVPEN 268

Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
           DE SL+KAVA QPVSVAIEAGGR FQ Y+SGVF+G C + LDHGVVAVGYG ENG DYW+
Sbjct: 269 DEKSLQKAVAHQPVSVAIEAGGREFQLYKSGVFSGSCTTNLDHGVVAVGYGAENGKDYWI 328

Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           VRNSWG  WGE GY++++RN ++ +TGKCGIAM ASYP K   N
Sbjct: 329 VRNSWGPKWGEAGYIRMERN-VNASTGKCGIAMMASYPTKKGAN 371


>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 205/350 (58%), Positives = 257/350 (73%), Gaps = 15/350 (4%)

Query: 24  AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS----NGMGHNEKRFQIFKDN 79
           +ADMSII+Y+  H      RT+ E   +Y  WLA+HG  S    N +   E+RF  F DN
Sbjct: 24  SADMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDN 83

Query: 80  LRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC 135
           LRF+D HN+        +++ +N+FADLTN+E+RA YLG +  A+R     +V  +RY  
Sbjct: 84  LRFVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGAAERN-RAGRVVGERYRH 142

Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
              +ELPE+VDWREKGAV PVK+QG CGSCWAFS V+ VE IN+IVTGE+++LSEQELV+
Sbjct: 143 DGAEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVE 202

Query: 196 CDRKIN---AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
           CD  IN   +GCNGGLMD AF+FII+NGG+D+E DYPY   + +CD  R+NAKVVSIDG+
Sbjct: 203 CD--INGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGF 260

Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
           EDV   DE SL+KAVA  PVSVAIEAGGR FQ Y SGVF+G CG+ LDHGVVAVGYGTEN
Sbjct: 261 EDVPENDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTEN 320

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           G DYW+VRNSWG +WGE GY++++RN ++  +GKCGIAM +SYP K   N
Sbjct: 321 GKDYWIVRNSWGPNWGEAGYLRMERN-INVTSGKCGIAMMSSYPTKKGAN 369


>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
          Length = 473

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 205/350 (58%), Positives = 257/350 (73%), Gaps = 15/350 (4%)

Query: 24  AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS----NGMGHNEKRFQIFKDN 79
           +ADMSII+Y+  H      RT+ E   +Y  WLA+HG  S    N +   E+RF  F DN
Sbjct: 24  SADMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDN 83

Query: 80  LRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC 135
           LRF+D HN+        +++ +N+FADLTN+E+RA YLG +  A+R     +V  +RY  
Sbjct: 84  LRFVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGAAERN-RAGRVVGERYRH 142

Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
              +ELPE+VDWREKGAV PVK+QG CGSCWAFS V+ VE IN+IVTGE+++LSEQELV+
Sbjct: 143 DGAEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVE 202

Query: 196 CDRKIN---AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
           CD  IN   +GCNGGLMD AF+FII+NGG+D+E DYPY   + +CD  R+NAKVVSIDG+
Sbjct: 203 CD--INGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGF 260

Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
           EDV   DE SL+KAVA  PVSVAIEAGGR FQ Y SGVF+G CG+ LDHGVVAVGYGTEN
Sbjct: 261 EDVPENDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTEN 320

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           G DYW+VRNSWG +WGE GY++++RN ++  +GKCGIAM +SYP K   N
Sbjct: 321 GKDYWIVRNSWGPNWGEAGYLRMERN-INVTSGKCGIAMMSSYPTKKGAN 369


>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 458

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 214/353 (60%), Positives = 259/353 (73%), Gaps = 22/353 (6%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG-H 68
           I  L+F  FI+ S+A+  SII            RTDDEVM +Y  W AKHGK  N +G  
Sbjct: 9   IMALLFFLFIALSAASPSSIIPQ----------RTDDEVMALYDQWRAKHGKLHNNLGAE 58

Query: 69  NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            E RF IFKDNL+FIDE N+ N  Y++GLN FADLTNEEYR+ YLG +  +  R  ++  
Sbjct: 59  PENRFHIFKDNLKFIDEINAQNLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRT-- 116

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
            S RY  + GD+LP+S+DWR KGAV PVKDQGSCGSCWAFSTVA+VE IN+IVTG+LI+L
Sbjct: 117 -SNRYLPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIAL 175

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           SEQELVDCDR  N GCNGGLMDYAF+FII+NGG+D+E+DYPY G ++ C   ++NA    
Sbjct: 176 SEQELVDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNA---- 231

Query: 249 IDGYEDVSPFDEMSLKKA---VADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
           IDGYEDV   +E +L+KA        VSVAIE GGR+FQ Y+SG+FTG CG+ LDHGV  
Sbjct: 232 IDGYEDVPVNNEKALQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNV 291

Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           VGYG+E GVDYW+VRNSWG  WGE+GYVK+QRN+  + TG CGIAME SYP K
Sbjct: 292 VGYGSEGGVDYWIVRNSWGGSWGESGYVKMQRNIA-SPTGLCGIAMEPSYPTK 343


>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
          Length = 427

 Score =  416 bits (1070), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 201/312 (64%), Positives = 247/312 (79%), Gaps = 11/312 (3%)

Query: 53  QTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-----YKVGLNKFADLTNEE 107
           Q+WL KH K  N +G  EKRF IF+DNL FID+HN+ N       +++GLNKFADLTN+E
Sbjct: 6   QSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTNDE 65

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           +R +Y G     KR      V S RYA K GDELPESVDWR+KGAV+ VKDQG CGSCWA
Sbjct: 66  FRRIYFGV----KRPEKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWA 121

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
           FS + AVEGINKIVTG+LI+LSEQELVDCD   N+GC+GGLMDYAF+FII NGG+D+++D
Sbjct: 122 FSAIGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGIDTDKD 181

Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
           YPY   +  CD +R+NAKVV+IDG EDV   +E +L+KAVA QPV +AIEAGGR FQ Y+
Sbjct: 182 YPYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLYK 241

Query: 288 SGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
           SGVFTG CG++LDHGVVAVGYG T++G DYW+VRNSWG DWGE+GY++++RN  ++ +GK
Sbjct: 242 SGVFTGSCGTSLDHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERN-TESKSGK 300

Query: 347 CGIAMEASYPVK 358
           CGIA+E SYPVK
Sbjct: 301 CGIAIEPSYPVK 312


>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  416 bits (1070), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 205/350 (58%), Positives = 256/350 (73%), Gaps = 15/350 (4%)

Query: 24  AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS----NGMGHNEKRFQIFKDN 79
           +ADMSII+Y+  H      RT+ E   +Y  WLA+HG  S    N +   E+RF  F DN
Sbjct: 24  SADMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDN 83

Query: 80  LRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC 135
           LRF+D HN+        +++ +N+FADLTN+E+RA YLG +  A+R     +V   RY  
Sbjct: 84  LRFVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGAAERN-RAGRVVGDRYRH 142

Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
              +ELPE+VDWREKGAV PVK+QG CGSCWAFS V+ VE IN+IVTGE+++LSEQELV+
Sbjct: 143 DGAEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVE 202

Query: 196 CDRKIN---AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
           CD  IN   +GCNGGLMD AF+FII+NGG+D+E DYPY   + +CD  R+NAKVVSIDG+
Sbjct: 203 CD--INGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGF 260

Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN 312
           EDV   DE SL+KAVA  PVSVAIEAGGR FQ Y SGVF+G CG+ LDHGVVAVGYGTEN
Sbjct: 261 EDVPENDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTEN 320

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           G DYW+VRNSWG +WGE GY++++RN ++  +GKCGIAM +SYP K   N
Sbjct: 321 GKDYWIVRNSWGPNWGEAGYLRMERN-INVTSGKCGIAMMSSYPTKKGAN 369


>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
          Length = 469

 Score =  416 bits (1069), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 201/346 (58%), Positives = 255/346 (73%), Gaps = 13/346 (3%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS----NGMGHNEKRFQIFKDNLR 81
           DMSII+Y+  H      RT+ E   +Y  WLA+HG  S    N +   E+RF+ F DNLR
Sbjct: 24  DMSIIAYNAEHGARGLERTEAEARAVYDLWLAEHGGGSYPNANSIPERERRFRAFWDNLR 83

Query: 82  FIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKA 137
           F+D HN+        +++ +N+FADLTN+E+RA YLG +    +R    +V  +RY    
Sbjct: 84  FVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKG---QRARPGRVVGERYRHDG 140

Query: 138 GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD 197
            +ELPE+VDWREKGAV PVK+QG CGSCWAFS ++ VE IN+IVTGE+++LSEQELV+CD
Sbjct: 141 AEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAISTVESINQIVTGEMVTLSEQELVECD 200

Query: 198 RK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVS 256
               ++GCNGGLMD AF+FII+NGG+D+E DYPY   + +CD  R+NAKVVSIDG+EDV 
Sbjct: 201 TNGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVP 260

Query: 257 PFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDY 316
             DE SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG+ LDHGVVAVGYGTENG DY
Sbjct: 261 ENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDY 320

Query: 317 WLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           W+VRNSWG +WGE GY++++RN ++  +GKCGIAM +SYP K   N
Sbjct: 321 WIVRNSWGPNWGEAGYLRMERN-INVTSGKCGIAMMSSYPTKKGAN 365


>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
          Length = 472

 Score =  415 bits (1066), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 201/346 (58%), Positives = 257/346 (74%), Gaps = 13/346 (3%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS----NGMGHNEKRFQIFKDNLR 81
           DMSII+Y+  H      RT+ E   +Y  WLA++G  S    N +   E+RF+ F DNL 
Sbjct: 27  DMSIIAYNAEHGARGLERTEAEARAVYDLWLAENGGGSSPNANSIPERERRFRAFWDNLN 86

Query: 82  FIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKA 137
           F+D HN+        Y++G+N+FADLTN+E+RA YLG ++   +R    ++  +RY    
Sbjct: 87  FVDAHNARAAAGEEGYRLGMNRFADLTNDEFRAAYLGVKA---QRARPGRMVGERYRHDG 143

Query: 138 GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD 197
            +ELPE+VDWREKGAV PVK+QG CGSCWAFS V+ VE IN+IVTGE+++LSEQELV+CD
Sbjct: 144 AEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECD 203

Query: 198 RK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVS 256
               ++GCNGGLMD AF+FII+NGG+D+E DYPY   + +CD  R+NAKVVSIDG+EDV 
Sbjct: 204 TNGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVP 263

Query: 257 PFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDY 316
             DE SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG+ LDHGVVAVGYGTENG DY
Sbjct: 264 ENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDY 323

Query: 317 WLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           W+VRNSWG +WGE+GY++++RN ++  +GKCGIAM +SYP K   N
Sbjct: 324 WIVRNSWGPNWGESGYLRMERN-INVTSGKCGIAMMSSYPTKKGAN 368


>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
 gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
 gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
 gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
 gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
 gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
          Length = 466

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 207/351 (58%), Positives = 261/351 (74%), Gaps = 14/351 (3%)

Query: 20  SSSSAADMSIISYDNNHDHSS--SWRTDDEVMTIYQTWLAKHGKTS-NGMG-HNEKRFQI 75
           ++++A DMSIISY+  H         T+ E    Y  WLA++G  S N +G  +E+RF +
Sbjct: 18  AATAAPDMSIISYNAEHGARGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERRFLV 77

Query: 76  FKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
           F DNL+F+D HN+       +++G+N+FADLTNEE+RA +LG +        +S+ A +R
Sbjct: 78  FWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGAKV-----AERSRAAGER 132

Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
           Y     +ELPESVDWREKGAV PVK+QG CGSCWAFS V+ VE IN++VTGE+I+LSEQE
Sbjct: 133 YRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQE 192

Query: 193 LVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
           LV+C     N+GCNGGLMD AF FII+NGG+D+E DYPY   + KCD +R NAKVVSIDG
Sbjct: 193 LVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDG 252

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
           +EDV   DE SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG++LDHGVVAVGYGT+
Sbjct: 253 FEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTD 312

Query: 312 NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           NG DYW+VRNSWG  WGE+GYV+++RN ++  TGKCGIAM ASYP K+  N
Sbjct: 313 NGKDYWIVRNSWGPKWGESGYVRMERN-INVTTGKCGIAMMASYPTKSGAN 362


>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
          Length = 464

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 207/340 (60%), Positives = 259/340 (76%), Gaps = 9/340 (2%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSIISY+  H      RT+ E    Y  WLA++G++ N +G +E+RF++F DNLRF D 
Sbjct: 27  DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADA 86

Query: 86  HNSL--NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPE 143
           HN+   +  +++G+N+FADLTNEE+RA +LG +      + +S+ A +RY     +ELPE
Sbjct: 87  HNARADDHGFRLGMNRFADLTNEEFRATFLGAKV-----VERSRAAGERYRHDGVEELPE 141

Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINA 202
           SVDWREKGAV PVK+QG CGSCWAFS V+ VE IN++VTGE+I+LSEQELV+C     N 
Sbjct: 142 SVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNG 201

Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
           GCNGGLMD AF FII+NGG+D+E DYPY   + KCD +R NAKVVSIDG+EDV   DE S
Sbjct: 202 GCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKS 261

Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNS 322
           L+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG++LDHGVVAVGYGT+NG DYW+VRNS
Sbjct: 262 LQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNS 321

Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           WG  WGE+GYV+++RN ++  TGKCGIAM ASYP K+  N
Sbjct: 322 WGPKWGESGYVRMERN-INVTTGKCGIAMMASYPTKSGAN 360


>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
          Length = 493

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 205/373 (54%), Positives = 260/373 (69%), Gaps = 42/373 (11%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           DMSIISY+  H      RT+ E    Y  WLA++G++ N +G  E+RF++F DNL+F+D 
Sbjct: 23  DMSIISYNAEHGARGLERTEAEARAAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDA 82

Query: 86  HNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
           HN+    +  +++G+N+FADLTN+E+RA +LG +      + +S+ A +RY     +ELP
Sbjct: 83  HNARADEHGGFRLGMNRFADLTNDEFRATFLGAKF-----VERSRAAGERYRHDGVEELP 137

Query: 143 ESVDWREKGAVNPVKDQGSC--------------------------------GSCWAFST 170
           ESVDWREKGAV PVK+QG C                                GSCWAFS 
Sbjct: 138 ESVDWREKGAVAPVKNQGQCVDRIIVWNSMVRIYVVDAGCMLENPLMGLTVQGSCWAFSA 197

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           V+ VE IN++VTGE+I+LSEQELV+C     N+GCNGGLMD AF FII+NGG+D+E DYP
Sbjct: 198 VSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYP 257

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   + KCD +R NAKVVSIDG+EDV   DE SL+KAVA QPVSVAIEAGGR FQ Y SG
Sbjct: 258 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 317

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           VF+G CG++LDHGVVAVGYGT+NG DYW+VRNSWG  WGE+GYV+++RN ++  TGKCGI
Sbjct: 318 VFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERN-INATTGKCGI 376

Query: 350 AMEASYPVKNSQN 362
           AM ASYP K+  N
Sbjct: 377 AMMASYPTKSGAN 389


>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
          Length = 471

 Score =  407 bits (1046), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 208/347 (59%), Positives = 259/347 (74%), Gaps = 14/347 (4%)

Query: 24  AADMSIISYDNNHDHSS--SWRTDDEVMTIYQTWLAKHGKTS-NGMG-HNEKRFQIFKDN 79
           A+DMSIISY+  H         T+ E    Y  WLA++G  S N +G  +E+RF +F DN
Sbjct: 21  ASDMSIISYNAEHGARGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERRFLVFWDN 80

Query: 80  LRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
           L+F+D HN+       +++G+N+FADLTNEE+RA +LG +  A+R    S+ A +RY   
Sbjct: 81  LKFVDAHNARADEGGGFRLGMNRFADLTNEEFRATFLGAKV-AER----SRAAGERYRHD 135

Query: 137 AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC 196
             +ELPESVDWREKGAV PVK+QG CGSCWAFS V+ VE IN++VTGE+I+LSEQELV+C
Sbjct: 136 GVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVEC 195

Query: 197 DRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
                N+GCNGGLM  AF FII+NGG+D+E DYPY   + KCD +R NAKVVSIDG+EDV
Sbjct: 196 STNGQNSGCNGGLMADAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDV 255

Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
              DE SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG++LDHGVVAVGYGT+NG D
Sbjct: 256 PQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKD 315

Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           YW+VRNSWG  WGE+GYV+++RN ++  TGKCGIAM ASYP K+  N
Sbjct: 316 YWIVRNSWGPKWGESGYVRMERN-INVTTGKCGIAMMASYPTKSGAN 361


>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 343

 Score =  406 bits (1044), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 199/359 (55%), Positives = 257/359 (71%), Gaps = 31/359 (8%)

Query: 11  STLVFLFF--ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
           +T+  LFF  ++ SSA D+SIISYD +H   S WR+D+EVM+IY+  LAKHGK  N +  
Sbjct: 9   ATIFILFFTVLAVSSALDLSIISYDRSHADKSGWRSDEEVMSIYEEXLAKHGKVYNAIDE 68

Query: 69  NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            E+RFQI K+NL+F+++HN+ NRTYKVGLN+FAD                 + R+M    
Sbjct: 69  MEERFQISKENLKFVEQHNAGNRTYKVGLNRFAD-----------------RSRMMTR-- 109

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
            S RYA +  D L ESVDWR++GAV  VK Q  C SC  F+ +AAVEGINKIVTG L +L
Sbjct: 110 PSSRYAPRVSDNLSESVDWRKEGAVVRVKTQSECESCRTFTVIAAVEGINKIVTGNLTAL 169

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           S     DCDR +NAGC+GGL DYA +FII NGG+D+E+DYP+ GA   CD  + NA    
Sbjct: 170 S-----DCDRTVNAGCSGGLADYALEFIINNGGIDTEEDYPFQGAVGICDQYKINA---- 220

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVA-IEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
           +DGYE V  +DE++LKKAVA+QPVSVA IEA G+ FQ YESG+FTG+CG+++DHGV AVG
Sbjct: 221 VDGYERVPAYDELALKKAVANQPVSVAYIEAYGKEFQLYESGIFTGKCGTSIDHGVTAVG 280

Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
           YGTENG+DYW+V+NSWG +WGE GYV+++RN  +   GKCGIA+   YP+K+ QN + P
Sbjct: 281 YGTENGIDYWIVKNSWGENWGEAGYVRMERNTAEDTAGKCGIAILTLYPIKSGQNPSNP 339


>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
 gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
          Length = 471

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 196/326 (60%), Positives = 247/326 (75%), Gaps = 10/326 (3%)

Query: 43  RTDDEVMTIYQTWLAKHGKT-SNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVGLNK 99
           RT+ +V  +Y+ W+A+HGK  SN +G +++RF+ F DNLRF+D HN+    R Y++G+N+
Sbjct: 43  RTEAQVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGYRLGINR 102

Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           FADLTN E+RA YL   S   R    +    +RY     + LPE VDWR+KGAV PVK+Q
Sbjct: 103 FADLTNAEFRAAYL---SAGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVAPVKNQ 159

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQ 218
           G CGSCWAFS V AVEGIN+IVTGEL++LSEQELVDC +   N GC+GG+MD AF FI+ 
Sbjct: 160 GQCGSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVG 219

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           NGG+D+++DYPY   + KCD ++R+  VVSIDG+E V   DE SL+KAVA QPV+VAIEA
Sbjct: 220 NGGIDTDKDYPYTARDGKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVAVAIEA 279

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQ 336
           GGR FQ Y+SGVFTG CG++LDHGVVAVGYGTE   G DYWLVRNSWG+DWGE GY++++
Sbjct: 280 GGREFQLYQSGVFTGRCGTSLDHGVVAVGYGTEADGGRDYWLVRNSWGADWGEGGYIRME 339

Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQN 362
           RN +    GKCGIAMEASYPVK+  N
Sbjct: 340 RN-VGARAGKCGIAMEASYPVKSGAN 364


>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
          Length = 380

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 201/372 (54%), Positives = 267/372 (71%), Gaps = 22/372 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M +   F+++S L F  F+  S A D  I          S  RT+DEVM +Y++WL K+G
Sbjct: 1   MGSPKSFISMSLLFFSTFLIFSFAIDAKI----------SPLRTNDEVMALYESWLVKYG 50

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +G  E R +IFK+NLRFIDEHN+  NR+Y VGLN+FADLT+EEYR+ YLG +S  
Sbjct: 51  KSYNSLGEREMRIEIFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKSS- 109

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
               +KSKV S RY  + G+ LP+ VDWR  GAV  VK+QG C SCWAF+T+A VE IN+
Sbjct: 110 ----LKSKV-SNRYMPQVGEVLPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQ 164

Query: 180 IVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           I+TG+LISLSEQELVDC+R  IN GC GG MD A++FII NGG+++E++YPY+G +++CD
Sbjct: 165 IITGDLISLSEQELVDCNRTPINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQCD 224

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT-GECGS 297
             ++N   V+ID YE V P DE+++K+AVA QPVSVAI+A    F+ Y+SG+FT G CG+
Sbjct: 225 EPKKNQNYVTIDSYEQVPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGT 284

Query: 298 ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            L+H V  +GYGTENG+DYW+V+NS+G+ WGE+GY K+QRN+     G+CGIA    YPV
Sbjct: 285 TLNHAVTIIGYGTENGIDYWIVKNSYGTQWGESGYGKVQRNV--GGEGRCGIASYPFYPV 342

Query: 358 KN-SQNSAKPKP 368
           KN +   AKP P
Sbjct: 343 KNYTSKPAKPHP 354


>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
           Precursor
 gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 362

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 200/322 (62%), Positives = 251/322 (77%), Gaps = 9/322 (2%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFA 101
           R + EV  +Y+ WL ++ K  NG+G  E+RF+IFKDNL+F+DEHNS+ +RT++VGL +FA
Sbjct: 35  RNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFA 94

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           DLTNEE+RA+YL  R   +R   K  V ++RY  K GD LP+ VDWR  GAV  VKDQG+
Sbjct: 95  DLTNEEFRAIYL--RKKMER--TKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGN 150

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNG 220
           CGSCWAFS V AVEGIN+I TGELISLSEQELVDCDR  +NAGC+GG+M+YAF+FI++NG
Sbjct: 151 CGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNG 210

Query: 221 GMDSEQDYPYLGAE-NKCDPSRRN-AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           G++++QDYPY   +   C+  + N  +VV+IDGYEDV   DE SLKKAVA QPVSVAIEA
Sbjct: 211 GIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEA 270

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
             +AFQ Y+SGV TG CG +LDHGVV VGYG+ +G DYW++RNSWG +WG++GYVKLQRN
Sbjct: 271 SSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRN 330

Query: 339 LLDTNTGKCGIAMEASYPVKNS 360
            +D   GKCGIAM  SYP K+S
Sbjct: 331 -IDDPFGKCGIAMMPSYPTKSS 351


>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
          Length = 365

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 206/366 (56%), Positives = 256/366 (69%), Gaps = 17/366 (4%)

Query: 5   SMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSN 64
           S    IS L+FL   S S A D+S I Y   +D SS+WRTD+EV  IY+ WLAKH K  +
Sbjct: 2   STLFIISILLFL--ASFSYAMDISTIEY--KYDKSSAWRTDEEVKEIYELWLAKHDKVYS 57

Query: 65  GMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
           G+   EKRF+IFKDNL+FIDEHNS N TYK+GL  + DLTNEE++A+YLGTRSD   RL 
Sbjct: 58  GLVEYEKRFEIFKDNLKFIDEHNSENHTYKMGLTPYTDLTNEEFQAIYLGTRSDTIHRLK 117

Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
           ++   S+RYA +AGD LPE +DWR+KGAV PVK+QG CGSCWAFSTV+ VE IN+I TG 
Sbjct: 118 RTINISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGN 177

Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           LISLSEQ+LVDC++K N GC GG   YA+Q+II NGG+D+E +YPY   +  C   R   
Sbjct: 178 LISLSEQQLVDCNKK-NHGCKGGAFVYAYQYIIDNGGIDTEANYPYKAVQGPC---RAAK 233

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
           KVV IDGY+ V   +E +LKKAVA QP  VAI+A  + FQHY+SG+F+G CG+ L+HGVV
Sbjct: 234 KVVRIDGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQHYKSGIFSGPCGTKLNHGVV 293

Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS--QN 362
            VGY      DYW+VRNSWG  WGE GY++++R       G CGIA    YP K +  +N
Sbjct: 294 IVGYWK----DYWIVRNSWGRYWGEQGYIRMKRV---GGCGLCGIARLPYYPTKAAGDEN 346

Query: 363 SAKPKP 368
           S    P
Sbjct: 347 SKLETP 352


>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
          Length = 362

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 200/322 (62%), Positives = 251/322 (77%), Gaps = 9/322 (2%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFA 101
           R + EV  +Y+ WL ++ K  NG+G  E+RF+IFKDNL+F+DEHNS+ +RT++VGL +FA
Sbjct: 35  RNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFA 94

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           DLTNEE+RA+YL  R   +R   K  V ++RY  K GD LP+ VDWR  GAV  VKDQG+
Sbjct: 95  DLTNEEFRAIYL--RKKMERN--KDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGN 150

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNG 220
           CGSCWAFS V AVEGIN+I TGELISLSEQELVDCDR  +NAGC+GG+M+YAF+FI++NG
Sbjct: 151 CGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNG 210

Query: 221 GMDSEQDYPYLGAE-NKCDPSRRN-AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           G++++QDYPY   +   C+  + N  +VV+IDGYEDV   DE SLKKAVA QPVSVAIEA
Sbjct: 211 GIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEA 270

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
             +AFQ Y+SGV TG CG +LDHGVV VGYG+ +G DYW++RNSWG +WG++GYVKLQRN
Sbjct: 271 SSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRN 330

Query: 339 LLDTNTGKCGIAMEASYPVKNS 360
            +D   GKCGIAM  SYP K+S
Sbjct: 331 -IDDPFGKCGIAMMPSYPTKSS 351


>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
          Length = 378

 Score =  403 bits (1036), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 199/371 (53%), Positives = 260/371 (70%), Gaps = 20/371 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M +    ++ S L F   +  SSA D+           +S  RT+D+VM +Y++WL +HG
Sbjct: 1   MGSPKSIISKSLLFFSTLLILSSAIDI----------ENSVQRTNDQVMAMYESWLVEHG 50

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +   E RF+IFK+NLR ID+HN+  NR+Y +GLN+FADLT+EEYR+ YLG +   
Sbjct: 51  KSYNSLDEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKRGP 110

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           K  +      S +Y  K GD LP+ VDWR  GAV  VK+QG C SCWAFS VAAVEGINK
Sbjct: 111 KTDV------SNQYMPKVGDALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINK 164

Query: 180 IVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           IVTG LISLSEQELVDC R +I  GCN GLM  AF+FII NGG+++E +YPY   + +C+
Sbjct: 165 IVTGNLISLSEQELVDCGRTQITKGCNRGLMTDAFKFIINNGGINTENNYPYTAKDGQCN 224

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
            S +N K V+ID Y++V   +EM+LKKAVA QPVSV +E+ G  F+ Y SG+FTG CG+A
Sbjct: 225 LSLKNQKYVTIDSYKNVPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTA 284

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +DHGV  VGYGTE G+DYW+V+NSWG++WGE+GY+++QRN+     GKCGIA   SYPVK
Sbjct: 285 VDHGVTIVGYGTERGMDYWIVKNSWGTNWGESGYIRIQRNI--GGAGKCGIAKMPSYPVK 342

Query: 359 NSQNSAKPKPH 369
            + N  KP P+
Sbjct: 343 YTSNPLKPYPY 353


>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 355

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 197/359 (54%), Positives = 263/359 (73%), Gaps = 14/359 (3%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT-SNGM 66
           + I  L+ +F +S+ S+A M + +    H+     R+++EV  I+Q W++KHGKT +N +
Sbjct: 9   MTILFLLIVFVLSAPSSA-MDLPATSGGHN-----RSNEEVEFIFQMWMSKHGKTYTNAL 62

Query: 67  GHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
           G  E+RFQ FKDNLRFID+HN+ N +Y++GL +FADLT +EYR ++ G+    +R L  S
Sbjct: 63  GEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRDLFPGSPKPKQRNLKTS 122

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +    RY   AGD+LPESVDWR++GAV+ +KDQG+C SCWAFSTVAAVEG+NKIVTGELI
Sbjct: 123 R----RYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELI 178

Query: 187 SLSEQELVDCDRKINAGCNG-GLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           SLSEQELVDC+  +N GC G GLMD AFQF+I N G+DSE+DYPY G +  C+  + +  
Sbjct: 179 SLSEQELVDCNL-VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQVHLL 237

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
           V++ID YEDV   DE+SL+KAVA QPVSV ++   + F  Y S ++ G CG+ LDH +V 
Sbjct: 238 VITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVI 297

Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           VGYG+ENG DYW+VRNSWG+ WG+ GY+K+ RN  D   G CGIAM ASYP+KNS ++A
Sbjct: 298 VGYGSENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPK-GLCGIAMLASYPIKNSASNA 355


>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 356

 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 198/360 (55%), Positives = 264/360 (73%), Gaps = 15/360 (4%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT-SNGM 66
           + I  L+ +F +S+ S+A M + +    H+     R+++EV  I+Q W++KHGKT +N +
Sbjct: 9   MTILFLLIVFVLSAPSSA-MDLPATSGGHN-----RSNEEVEFIFQMWMSKHGKTYTNAL 62

Query: 67  GHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
           G  E+RFQ FKDNLRFID+HN+ N +Y++GL +FADLT +EYR ++ G+    +R L  S
Sbjct: 63  GEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRDLFPGSPKPKQRNLKTS 122

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +    RY   AGD+LPESVDWR++GAV+ +KDQG+C SCWAFSTVAAVEG+NKIVTGELI
Sbjct: 123 R----RYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELI 178

Query: 187 SLSEQELVDCDRKINAGCNG-GLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA- 244
           SLSEQELVDC+  +N GC G GLMD AFQF+I N G+DSE+DYPY G +  C+  +  + 
Sbjct: 179 SLSEQELVDCNL-VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSN 237

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
           KV++ID YEDV   DE+SL+KAVA QPVSV ++   + F  Y S ++ G CG+ LDH +V
Sbjct: 238 KVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALV 297

Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
            VGYG+ENG DYW+VRNSWG+ WG+ GY+K+ RN  D   G CGIAM ASYP+KNS ++A
Sbjct: 298 IVGYGSENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPK-GLCGIAMLASYPIKNSASNA 356


>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
          Length = 368

 Score =  400 bits (1027), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 202/375 (53%), Positives = 260/375 (69%), Gaps = 28/375 (7%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M     F+++S L F   +  S A D                RT+DEV  +Y++WL KHG
Sbjct: 1   MGLPKSFVSMSLLFFSTLLILSLALDAK--------------RTNDEVKAMYESWLIKHG 46

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLG-TRSD 118
           K+ N +G  E+RF+IFK+ LRFIDEHN+  +R+YKVGLN+FADLTNEE+R+ YLG TR  
Sbjct: 47  KSYNSLGERERRFEIFKETLRFIDEHNADTSRSYKVGLNQFADLTNEEFRSTYLGFTRGS 106

Query: 119 AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
            K ++      S RY  + G  LP+ VDWR +GAV  +K+QG CGSCWAFS +AAVEGIN
Sbjct: 107 NKTKV------SNRYEPRVGQVLPDYVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGIN 160

Query: 179 KIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
           KIVTG LISLSEQELVDC R +   GC+GG M   F+FII NGG+++E++YPY   E +C
Sbjct: 161 KIVTGNLISLSEQELVDCGRTQSTKGCDGGYMTDGFEFIINNGGINTEENYPYTAQEGQC 220

Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
           D + +N K V+ID YE+V  ++E +L+ AVA QPVSVA+E+ G AFQHY SG+FTG CG+
Sbjct: 221 DLNLQNEKYVTIDNYENVPYYNEWALQTAVAYQPVSVALESAGDAFQHYSSGIFTGPCGT 280

Query: 298 ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           A DH V  VGYGTE G+DYW+V+NSW + WGE GY+++ RN+     G CGIA   SYPV
Sbjct: 281 ATDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPV 338

Query: 358 K-NSQNSAKPKPHSS 371
           K N+QN   PKP+SS
Sbjct: 339 KYNNQN--HPKPYSS 351


>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
 gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
          Length = 358

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 198/322 (61%), Positives = 239/322 (74%), Gaps = 13/322 (4%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADL 103
           D     +Y+ W+  HG+  NG+G  E+RFQIF+DN  +I+EHN  +N+TY +GLN FAD+
Sbjct: 27  DRSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADM 86

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           T++E++A+Y GT    K  L  +  +  RY  K    LP   DWR KGAV  VK+QG+CG
Sbjct: 87  THDEFKALYFGT----KVPLSNTIKSGFRY--KDATNLPLDTDWRSKGAVATVKNQGACG 140

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
           SCWAFSTVAAVEG+N+IVTGEL+SLSEQELVDCD++ N GCNGGLMD AF+FIIQNGG+D
Sbjct: 141 SCWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLD 200

Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
           SE DYPY      CD SRRN+ VV+IDG+EDV    E  L KAVA+QPVSVAIEA GR F
Sbjct: 201 SEADYPYKAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNF 260

Query: 284 QHYESGVFTGECGSALDHGVVAVGYGTE---NGV--DYWLVRNSWGSDWGENGYVKLQRN 338
           Q Y  GV+TG CG  LDHGVVAVGYGT    +GV  DYW+VRNSWG  WGE+GY++LQRN
Sbjct: 261 QLYSGGVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRN 320

Query: 339 LLDTNTGKCGIAMEASYPVKNS 360
           +  +  GKCGIAM ASYPVKNS
Sbjct: 321 VA-SPRGKCGIAMMASYPVKNS 341


>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
 gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
          Length = 358

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 197/322 (61%), Positives = 240/322 (74%), Gaps = 13/322 (4%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADL 103
           D     +Y+ W+  HG+  NG+G  E+RFQIF+DN  +I+EHN  +N+TY +GLN FAD+
Sbjct: 27  DGSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADM 86

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           T++E++A+Y GT    K  L  +  +  RY  +    LP   DWR KGAV  VK+QG+CG
Sbjct: 87  THDEFKALYFGT----KVPLSNTIKSGFRY--EDATNLPLDTDWRSKGAVATVKNQGACG 140

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
           SCWAFSTVAAVEG+N+IVTGEL+SLSEQELVDCD++ N GCNGGLMD AF+FIIQNGG+D
Sbjct: 141 SCWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLD 200

Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
           SE DYPY      CD SRRN+ VV+IDG+EDV    E  L KAVA+QPVSVAIEA GR F
Sbjct: 201 SEADYPYKAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNF 260

Query: 284 QHYESGVFTGECGSALDHGVVAVGYGTE---NGV--DYWLVRNSWGSDWGENGYVKLQRN 338
           Q Y  GV+TG CG  LDHGVVAVGYGT    +GV  DYW+VRNSWG  WGE+GY++LQRN
Sbjct: 261 QLYSGGVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRN 320

Query: 339 LLDTNTGKCGIAMEASYPVKNS 360
           +  ++ GKCGIAM ASYPVKNS
Sbjct: 321 VA-SSRGKCGIAMMASYPVKNS 341


>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
          Length = 499

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 201/346 (58%), Positives = 257/346 (74%), Gaps = 20/346 (5%)

Query: 27  MSIISYDNNHDHSS---SWRTDDEVMTIYQTWLAKH---GKTSNGM-GHNEKRFQIFKDN 79
           MSII Y+  H         RT+ E   +Y  W+A+H   G + NG+ G  E+RF++F DN
Sbjct: 37  MSIIRYNAEHGVRGLEVVERTEAEARAVYDLWVARHRHGGGSHNGLVGEYERRFRVFWDN 96

Query: 80  LRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
           L+F+D HN+    +  +++G+N+FADLTN+E+RA YLGT    + R +      + Y   
Sbjct: 97  LKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRHV-----GEAYRHD 151

Query: 137 AGDELPESVDWREKGAV-NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
             + LP+SVDWR+KGAV  PVK+QG CGSCWAFS VAAVEGINKIVTGEL+SLSEQELV+
Sbjct: 152 GVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVE 211

Query: 196 CDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYED 254
           C R   N+GCNGG+MD AF FI +NGG+D+E+DYPY   + KC+ ++++ KVVSIDG+ED
Sbjct: 212 CARNGANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFED 271

Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--N 312
           V   DE+SL+KAVA QPVSVAI+AGGR FQ Y+SGVFTG CG++LDHGVVAVGYGT+   
Sbjct: 272 VPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAAT 331

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           G DYW VRNSWG DWGENGY++++RN+    TGKCGIAM ASYP+K
Sbjct: 332 GTDYWTVRNSWGPDWGENGYIRMERNVT-ARTGKCGIAMMASYPIK 376


>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
          Length = 499

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 201/346 (58%), Positives = 257/346 (74%), Gaps = 20/346 (5%)

Query: 27  MSIISYDNNHDHSS---SWRTDDEVMTIYQTWLAKH---GKTSNGM-GHNEKRFQIFKDN 79
           MSII Y+  H         RT+ E   +Y  W+A+H   G + NG+ G  E+RF++F DN
Sbjct: 37  MSIIRYNAEHGVRGLEVVERTEAEARAVYDLWVARHRHGGDSHNGLVGEYERRFRVFWDN 96

Query: 80  LRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
           L+F+D HN+    +  +++G+N+FADLTN+E+RA YLGT    + R +      + Y   
Sbjct: 97  LKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRHV-----GEAYRHD 151

Query: 137 AGDELPESVDWREKGAV-NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
             + LP+SVDWR+KGAV  PVK+QG CGSCWAFS VAAVEGINKIVTGEL+SLSEQELV+
Sbjct: 152 GVEVLPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVE 211

Query: 196 CDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYED 254
           C R   N+GCNGG+MD AF FI +NGG+D+E+DYPY   + KC+ ++++ KVVSIDG+ED
Sbjct: 212 CARNGANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFED 271

Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--N 312
           V   DE+SL+KAVA QPVSVAI+AGGR FQ Y+SGVFTG CG++LDHGVVAVGYGT+   
Sbjct: 272 VPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAAT 331

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           G DYW VRNSWG DWGENGY++++RN+    TGKCGIAM ASYP+K
Sbjct: 332 GTDYWTVRNSWGPDWGENGYIRMERNVT-ARTGKCGIAMMASYPIK 376


>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
          Length = 464

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 202/346 (58%), Positives = 259/346 (74%), Gaps = 20/346 (5%)

Query: 27  MSIISYDNNHDHSS---SWRTDDEVMTIYQTWLAKH---GKTSNG-MGHNEKRFQIFKDN 79
           MSII Y+  H         RT+ E   +Y  W+A+H   G + NG +G  E+RF++F DN
Sbjct: 38  MSIIRYNAEHGVRGLEVVERTEAEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDN 97

Query: 80  LRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
           L+F+D HN+    +  +++G+N+FADLTN+E+RA YLGT    + R +      + Y   
Sbjct: 98  LKFVDAHNAHADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRHV-----GEMYRHD 152

Query: 137 AGDELPESVDWREKGAV-NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
             + LP+SVDWR+KGAV +PVK+QG CGSCWAFS VAAVEGINKIVTGEL+SLSEQELV+
Sbjct: 153 GVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVE 212

Query: 196 CDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYED 254
           C R + N+GCNGG+MD AF FI +NGG+D+E+DYPY   + KCD ++++ KVVSIDG+ED
Sbjct: 213 CARNRGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFED 272

Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--N 312
           V   DE+SL+KAVA QPVSVAI+AGGR FQ Y+SGVFTG CG++LDHGVVAVGYGT+   
Sbjct: 273 VPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAAT 332

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           G DYW VRNSWG DWGENGY++++RN+    TGKCGIAM ASYP+K
Sbjct: 333 GTDYWTVRNSWGPDWGENGYIRMERNVT-ARTGKCGIAMMASYPIK 377


>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
          Length = 385

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 186/327 (56%), Positives = 249/327 (76%), Gaps = 9/327 (2%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFA 101
           RT+DEV+ ++++WL ++GK+ N +G  E+RF+IFKDNLRF+DEHN+ +NR+YKVGLN+F+
Sbjct: 39  RTNDEVIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFS 98

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           DLT+ EY ++YLGT+ +     ++    S RY  + GD+LP+SVDWR+KGAV  VK+QG+
Sbjct: 99  DLTDAEYSSIYLGTKFN-----IRMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGN 153

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNG 220
           CGSCW F+++AAVEGINKIVTG LISLSEQE+VDC RK  N GCNGG +  A+QFII NG
Sbjct: 154 CGSCWTFASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNG 213

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+++E +YPY G +  CD +++N K V+ID YE+V   +E +L+KAVA QPVSV I +  
Sbjct: 214 GINTEANYPYTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIASNS 273

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
            AF+ Y+SG+F G CG  +DHGV  VGYGTE G DYW+VRNSWG +WGE+GYV++QRN+ 
Sbjct: 274 TAFKSYKSGIFNGPCGPRIDHGVTIVGYGTEGGKDYWIVRNSWGPNWGESGYVRMQRNV- 332

Query: 341 DTNTGKCGIAMEASYPVKNSQNSAKPK 367
              +GKC IA    YPVK   N  KP+
Sbjct: 333 -GGSGKCFIARAPVYPVKYGPNPTKPR 358


>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
          Length = 378

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 196/371 (52%), Positives = 256/371 (69%), Gaps = 20/371 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M +    +++S L F   +  S A D+           +S  RT+D+VM +Y++WL + G
Sbjct: 1   MGSPKSVISMSLLFFSTLLILSLALDI----------ENSVQRTNDQVMAMYESWLVEQG 50

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +   E RF+IFK+NLR ID+HN+  NR+Y +GLN+FADLT+EEYR+ YLG +   
Sbjct: 51  KSYNSLDEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGP 110

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           K  +      S  Y  K G+ LP+ VDWR  GAV  VK+QG C SCWAFS V AVEGINK
Sbjct: 111 KTDV------SNEYMPKVGEALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINK 164

Query: 180 IVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           IVTG LISLSEQELVDC R +   GCN GLM  AFQFII NGG+++E +YPY   + +C+
Sbjct: 165 IVTGNLISLSEQELVDCGRTQRTKGCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCN 224

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
            S +N K V+ID Y++V   +EM+LKKAVA QPVSV +E+ G  F+ Y SG+FTG CG+A
Sbjct: 225 LSLKNQKYVTIDNYKNVPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTA 284

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +DHGV  VGYGTE G+DYW+V+NSWG++WGENGY+++QRN+     GKCGIA   SYPVK
Sbjct: 285 VDHGVTIVGYGTERGMDYWIVKNSWGTNWGENGYIRIQRNI--GGAGKCGIARMPSYPVK 342

Query: 359 NSQNSAKPKPH 369
            + N  KP P+
Sbjct: 343 YTTNPLKPYPY 353


>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 461

 Score =  394 bits (1011), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 195/313 (62%), Positives = 233/313 (74%), Gaps = 8/313 (2%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEE 107
           ++  +  W  KHGK  +       RF ++KDNL +I  H+  NRTY +GL KFADLTNEE
Sbjct: 50  LLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYI-RHSETNRTYSLGLTKFADLTNEE 108

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           +R MY GTR D  RR  +      RYA     E PESVDWR+ GAV  VKDQGSCGSCWA
Sbjct: 109 FRRMYTGTRIDRSRRAKRR--TGFRYA---DSEAPESVDWRKNGAVTSVKDQGSCGSCWA 163

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
           FS V +VEGIN I  GE +SLSEQELVDCD + N GCNGGLMDYAF FIIQNGG+D+E+D
Sbjct: 164 FSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQNGGIDTEKD 223

Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
           YPY G + +CD S++NA VV+IDGYEDV   DE +LKKAVA QPVSVAIEAGGR FQ Y 
Sbjct: 224 YPYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYA 283

Query: 288 SGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK- 346
            GVF+GECG+ LDHGV+AVGYGTE+GVDYW+V+NSWG  WGE+GY++++RN+ D+N G  
Sbjct: 284 QGVFSGECGTDLDHGVLAVGYGTEDGVDYWIVKNSWGEYWGESGYLRMKRNMKDSNDGPG 343

Query: 347 -CGIAMEASYPVK 358
            CGI +E SY VK
Sbjct: 344 LCGINIEPSYAVK 356


>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 457

 Score =  394 bits (1011), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 194/310 (62%), Positives = 231/310 (74%), Gaps = 6/310 (1%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           +  W  KHGK  +       RF ++KDNL +I  H+  N +Y +GL KFADLTNEE+R  
Sbjct: 45  FAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQRHSEKNLSYWLGLTKFADLTNEEFRRQ 104

Query: 112 YLGTRSDAKRRLMKSKVA--SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
           Y GTR D  RRL K + A  S RYA     E P+S+DWREKGAV  VKDQGSCGSCWAFS
Sbjct: 105 YTGTRIDRSRRLKKGRNATGSFRYA---NSEAPKSIDWREKGAVTSVKDQGSCGSCWAFS 161

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
            V +VEGIN I TG+ ISLS QELVDCD+K N GCNGGLMDYAF F+IQNGG+D+E+DYP
Sbjct: 162 AVGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGGLMDYAFDFVIQNGGIDTEKDYP 221

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G + +CD ++ NA+VV+ID YEDV   DE +LKKAVA QPVSVAIEAGGR FQ Y  G
Sbjct: 222 YQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGG 281

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT-GKCG 348
           VFTG CG+ LDHGV+AVGYG+E G+DYW+V+NSWG  WGE+GY+++QRNL D N  G CG
Sbjct: 282 VFTGRCGTDLDHGVLAVGYGSEKGLDYWIVKNSWGEYWGESGYLRMQRNLKDDNGYGLCG 341

Query: 349 IAMEASYPVK 358
           I +E SY VK
Sbjct: 342 INIEPSYAVK 351


>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 463

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 191/361 (52%), Positives = 248/361 (68%), Gaps = 15/361 (4%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMS--IISYDNNHDHSSSWRTDDEVMTIYQTWLAK 58
           M      L +S ++ +  I   + A  +  I+ Y+ N  HS     DD ++ ++  WL  
Sbjct: 1   MGWGRRALGLSLVLLVIAIGQQADAGRANAIVDYEGNQLHS-----DDAILDVFHQWLET 55

Query: 59  HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
           H +    +     RFQIFK+N  +I  HN   ++Y +GLNKF+DLT++E+RA YLGT+  
Sbjct: 56  HSRVYRSLSEKHHRFQIFKENFLYIHAHNKQQKSYWLGLNKFSDLTHQEFRAQYLGTKPV 115

Query: 119 AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
            ++R    K A+  Y      E    VDWR KGAV  VKDQG+CGSCWAFS V +VEG+N
Sbjct: 116 NRQR----KEANFMYE---DVEAEPKVDWRLKGAVTDVKDQGACGSCWAFSAVGSVEGVN 168

Query: 179 KIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
            I TGEL+SLSEQELVDCDRK N GCNGGLMDYAF+FII+NGG+D+E+DYPY   + +CD
Sbjct: 169 AIKTGELVSLSEQELVDCDRKQNQGCNGGLMDYAFEFIIKNGGIDTEKDYPYKARDGRCD 228

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             RRN+KVV ID Y+DV    E +L KA+   PVSVAIEAGGR FQHY+ GVFTG CGS 
Sbjct: 229 EGRRNSKVVVIDDYQDVPTQSESALMKALTKNPVSVAIEAGGRDFQHYQGGVFTGPCGSE 288

Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           LDHGV+AVGYGT ++GV+YW+V+NSWG  WGE GY++++R   D+  GKCGI +EAS+P+
Sbjct: 289 LDHGVLAVGYGTDDDGVNYWIVKNSWGPGWGEKGYIRMERFGSDSTDGKCGINIEASFPI 348

Query: 358 K 358
           K
Sbjct: 349 K 349


>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
          Length = 378

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 194/371 (52%), Positives = 256/371 (69%), Gaps = 20/371 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M +    +++S L F   +  SSA D+           +S+ RT+D+V  +Y++WL + G
Sbjct: 1   MGSPKSVISMSLLFFSTLLILSSALDIV----------NSAQRTNDQVRDMYESWLVEQG 50

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +   E RF+IFKDNLR ID+HN+  NR++ +GLN+FADLT+EEYR+ YLG +S  
Sbjct: 51  KSYNSLDEKEMRFEIFKDNLRIIDDHNADANRSFSLGLNRFADLTDEEYRSTYLGFKSGP 110

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           K ++      S RY  K GD LP  VDWR  GAV  VK+QG C SCWAFS VAAVEGINK
Sbjct: 111 KAKV------SNRYVPKVGDVLPNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINK 164

Query: 180 IVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           I+TG L+SLSEQELVDC R +   GCN G M  AFQFII NGG+++E +YPY   + +C+
Sbjct: 165 IMTGNLLSLSEQELVDCGRTQSTRGCNRGYMTDAFQFIINNGGINTEDNYPYTAQDGQCN 224

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
              +N K V+ID YE+V   +E +L+ AVA QPVSV +E+ G  F+ Y SG+FT  CG+A
Sbjct: 225 RYLQNQKYVTIDDYENVPSNNEWALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTA 284

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +DHGV  VGYGTE G+DYW+V+NSWG++WGENGY+++QRN+     GKCGIA  ASYPVK
Sbjct: 285 IDHGVTIVGYGTERGLDYWIVKNSWGTNWGENGYIRIQRNI--GGAGKCGIARMASYPVK 342

Query: 359 NSQNSAKPKPH 369
            + N  KP P+
Sbjct: 343 YNSNPLKPYPY 353


>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 194/346 (56%), Positives = 253/346 (73%), Gaps = 16/346 (4%)

Query: 22  SSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT-SNGMGHNEKRFQIFKDNL 80
           SSA D+   S  +N       R+++EV  I+Q W++KHGKT +N +G  E+RFQ FKDNL
Sbjct: 25  SSAIDLPATSGGHN-------RSNEEVGFIFQMWMSKHGKTYTNALGEKERRFQNFKDNL 77

Query: 81  RFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE 140
           RFID+HN+ N +Y++GL +FADLT +EYR ++ G+    +R L  S+    RY    GD+
Sbjct: 78  RFIDQHNAKNLSYQLGLTRFADLTVQEYRDLFPGSPKPKQRNLRISR----RYVPLDGDQ 133

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           LPESVDWR +GAV+ +KDQG+C SCWAFSTVAAVEGINKIVTGEL+SLSEQELVDC+  +
Sbjct: 134 LPESVDWRNEGAVSAIKDQGTCNSCWAFSTVAAVEGINKIVTGELVSLSEQELVDCNL-V 192

Query: 201 NAGCNG-GLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA-KVVSIDGYEDVSPF 258
           N GC G G MD AFQF+I NGG+DS+ DYPY G++  C+     + K+++ID YEDV   
Sbjct: 193 NNGCYGSGTMDAAFQFLINNGGLDSDTDYPYQGSQGYCNRKESTSNKIITIDSYEDVPAN 252

Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
           DE+SL+KAVA QPVSV ++   + F  Y SG++ G CG+ LDH +V VGYG+ENG DYW+
Sbjct: 253 DEISLQKAVAHQPVSVGVDKKSQEFMLYRSGIYNGPCGTDLDHALVIVGYGSENGQDYWI 312

Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           VRNSWG+ WG+ GY K+ RN  +  +G CGIAM ASYPVKNS ++A
Sbjct: 313 VRNSWGTTWGDAGYAKMARN-FEYPSGVCGIAMLASYPVKNSASNA 357


>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
          Length = 494

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 203/373 (54%), Positives = 261/373 (69%), Gaps = 26/373 (6%)

Query: 5   SMFLAISTLVFLFFISSSSAAD-------MSIISYDNNHDHSSSWRTDDEVMTIYQTWLA 57
           S+  A++   FL  +++ +          MSII Y+  H      RT+ E    Y  WLA
Sbjct: 8   SVAAALAMACFLLILAAFAPPAAAAPPDIMSIIRYNAEHGVRGLERTEAEARAAYDLWLA 67

Query: 58  KHGKTSNG------MGHNEKRFQIFKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEY 108
           +H +   G      +G +E+RF++F DNL+F+D HN+       +++G+N+FADLTN E+
Sbjct: 68  RHRRGGGGGSRNGFIGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEF 127

Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAV-NPVKDQGSCGSCWA 167
           RA YLGT    + R +      + Y     + LP+SVDWR+KGAV  PVK+QG CGSCWA
Sbjct: 128 RATYLGTTPAGRGRRV-----GEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWA 182

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS VAAVEGINKIVTGEL+SLSEQELV+C R   N+GCNGG+MD AF FI +NGG+D+E+
Sbjct: 183 FSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEE 242

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
           DYPY   + KC+ ++R+ KVVSIDG+EDV   DE+SL+KAVA QPVSVAI+AGGR FQ Y
Sbjct: 243 DYPYTAMDGKCNLAKRSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLY 302

Query: 287 ESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
           +SGVFTG CG+ LDHGVVAVGYGT+   G  YW VRNSWG DWGENGY++++RN+    T
Sbjct: 303 DSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVT-ART 361

Query: 345 GKCGIAMEASYPV 357
           GKCGIAM ASYP+
Sbjct: 362 GKCGIAMMASYPI 374


>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
 gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
           Precursor
 gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
 gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
 gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 490

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 203/373 (54%), Positives = 261/373 (69%), Gaps = 26/373 (6%)

Query: 5   SMFLAISTLVFLFFISSSSAAD-------MSIISYDNNHDHSSSWRTDDEVMTIYQTWLA 57
           S+  A++   FL  +++ +          MSII Y+  H      RT+ E    Y  WLA
Sbjct: 8   SVAAALAMACFLLILAAFAPPAAAAPPDIMSIIRYNAEHGVRGLERTEAEARAAYDLWLA 67

Query: 58  KHGKTSNG------MGHNEKRFQIFKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEY 108
           +H +   G      +G +E+RF++F DNL+F+D HN+       +++G+N+FADLTN E+
Sbjct: 68  RHRRGGGGGSRNGFIGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEF 127

Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAV-NPVKDQGSCGSCWA 167
           RA YLGT    + R +      + Y     + LP+SVDWR+KGAV  PVK+QG CGSCWA
Sbjct: 128 RATYLGTTPAGRGRRV-----GEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWA 182

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS VAAVEGINKIVTGEL+SLSEQELV+C R   N+GCNGG+MD AF FI +NGG+D+E+
Sbjct: 183 FSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEE 242

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
           DYPY   + KC+ ++R+ KVVSIDG+EDV   DE+SL+KAVA QPVSVAI+AGGR FQ Y
Sbjct: 243 DYPYTAMDGKCNLAKRSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLY 302

Query: 287 ESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
           +SGVFTG CG+ LDHGVVAVGYGT+   G  YW VRNSWG DWGENGY++++RN+    T
Sbjct: 303 DSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVT-ART 361

Query: 345 GKCGIAMEASYPV 357
           GKCGIAM ASYP+
Sbjct: 362 GKCGIAMMASYPI 374


>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
 gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
          Length = 299

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 184/280 (65%), Positives = 224/280 (80%), Gaps = 5/280 (1%)

Query: 13  LVFLFFISS---SSAADMSIISYDNNH-DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
           L+ +  ISS   S A DMSIISYD  H D S+S RT+ EV+T+Y+ WL KHGK+ NG+G 
Sbjct: 12  LMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGLGE 71

Query: 69  NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK-SK 127
            +KRF+IFKDNL+FIDEHN LN TY++GL +FADLTNEEYR+ +LGT+ D  RR+ K   
Sbjct: 72  KDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKLGG 131

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
             S RYA + GD+LPESVDWR++GAV  VKDQ SCGSCWAFS +AAVEGINKIVTG+LIS
Sbjct: 132 SKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLIS 191

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           LSEQELVDCD   N GCNGGLMDYAF+FII NGG+DSE DYPY   + +CD +R+NAKVV
Sbjct: 192 LSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVV 251

Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
           +ID YEDV  +DE++L+KAVA+QP++VA+E GGR FQ YE
Sbjct: 252 TIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYE 291


>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
          Length = 380

 Score =  387 bits (994), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 197/374 (52%), Positives = 258/374 (68%), Gaps = 22/374 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M +   FL++S L F   +  S A +   ++           RT+DE+  +Y++WL K+G
Sbjct: 1   MGSPKSFLSMSLLFFSTLLVLSLAFNAKNLTK----------RTNDELKAMYESWLTKYG 50

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +G  E+RF+IFK+ LRFIDEHN+  NR+Y+VGLN+FAD TNEE+++ YLG  S +
Sbjct: 51  KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYRVGLNQFADQTNEEFQSTYLGFTSGS 110

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
            +  MK    S RY  + G  LP+ VDWR  GAV  +K QG CGSCWAFS +A VEGINK
Sbjct: 111 NK--MK---VSNRYEPRVGQVLPDYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINK 165

Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           IVTG+LISLSEQELVDC R  N  GC+GG +   FQFII NGG+++E +YPY   + +C+
Sbjct: 166 IVTGDLISLSEQELVDCGRTQNTRGCDGGSITDGFQFIINNGGINTEANYPYTAEDGQCN 225

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
              +N K  SID YE+V   +E +L+ AVA QPVSVA+EA G AFQHY SG+FTG CG+A
Sbjct: 226 LDLQNEKYASIDTYENVPYNNEWALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTA 285

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +DH V  VGYGTE G+DYW+V+NSW + WGE GY+++ RN+     G CGIA + SYPVK
Sbjct: 286 VDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYIRILRNV--GGAGTCGIATKPSYPVK 343

Query: 359 -NSQNSAKPKPHSS 371
            N+QN   PKP+SS
Sbjct: 344 YNNQN--HPKPYSS 355


>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
          Length = 381

 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 195/374 (52%), Positives = 256/374 (68%), Gaps = 22/374 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M +    +++S L F   +  SSA D+           +S  RT+D+VM +Y++WL + G
Sbjct: 3   MGSPKSVISMSLLFFSTLLILSSALDI----------KNSVQRTNDQVMAMYESWLVEQG 52

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +   E RF+IFK+NLR ID+HN+  NR+Y +GLN+FADLT+EEYR+ YLG +S  
Sbjct: 53  KSYNSLDEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFKSGP 112

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           K ++      S RY  K G  LP  VDWR  GAV  VKDQG C SCWAFS VAAVEGINK
Sbjct: 113 KAKV------SNRYVPKVGVVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINK 166

Query: 180 IVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           IVTG LISLSEQELVDC R +   GCN G M+ AFQFII NGG+++E +YPY   + +CD
Sbjct: 167 IVTGNLISLSEQELVDCGRTQRTRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCD 226

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             R+N + V+ID YE +   +E  L+ AVA QP++V +E+ G  F+ Y SG++TG CG+A
Sbjct: 227 WYRKNQRYVTIDNYEQLPANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTA 286

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +DHGV  VGYGTE G+DYW+V+NSWG++WGENGY+++QRN+     GKCGIAM  SYPVK
Sbjct: 287 IDHGVTIVGYGTERGLDYWIVKNSWGTNWGENGYIRIQRNI--GGAGKCGIAMVPSYPVK 344

Query: 359 NSQNSAKPKPHSSA 372
            S  +  P  H S+
Sbjct: 345 YSYQN--PNKHYSS 356


>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 454

 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 186/309 (60%), Positives = 231/309 (74%), Gaps = 7/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           +  W  KHGK  + +  +  R+ ++KDNL +I  H+  NR+Y +GL KFAD+TN+E+R  
Sbjct: 46  FGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEKNRSYWLGLTKFADITNDEFRRQ 105

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           Y GTR D  +R    +    RYA     E PESVDWR+KGAV  VKDQGSCGSCWAFS +
Sbjct: 106 YTGTRIDRSKR--SKRKTGFRYA---DSEAPESVDWRKKGAVTTVKDQGSCGSCWAFSAI 160

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
            +VEGIN I TGE +SLSEQELVDCD + N GCNGGLMDYAF FI++NGG+D+E DYPY 
Sbjct: 161 GSVEGINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILENGGIDTENDYPYK 220

Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
           G + +CD +++NA VV+IDGYEDV   DE +LKKAVA QPVSVAIEAGGR FQ Y  GVF
Sbjct: 221 GLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGGVF 280

Query: 292 TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT--GKCGI 349
           TGECG+ LDHGV+AVGYG+E  +DYW+V+NSWG  WGE+GY+++QRN+ D+N   G CGI
Sbjct: 281 TGECGTDLDHGVLAVGYGSEGSLDYWIVKNSWGEYWGESGYLRMQRNIKDSNHQFGLCGI 340

Query: 350 AMEASYPVK 358
            +E SY VK
Sbjct: 341 NIEPSYAVK 349


>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 360

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 193/329 (58%), Positives = 243/329 (73%), Gaps = 14/329 (4%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYK 94
           S   R+++E   +Y  W A+HG  S      E R++ F+DNLR+IDEHN+       +++
Sbjct: 30  SGQIRSEEETRRMYAEWTAQHG--SPITNEEEGRYEAFRDNLRYIDEHNAAADAGIHSFR 87

Query: 95  VGLNKFADLTNEEYRAMYLGTR--SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA 152
           +GLN+FA LTNEEYRA YLG R  S A   L K    S RY    G+ LPESVDWREKGA
Sbjct: 88  LGLNRFAGLTNEEYRAAYLGLRLRSGAVGDLRK---PSARYEAADGEALPESVDWREKGA 144

Query: 153 VNPVKDQG-SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDY 211
           V  VKDQG SCGS WAFS +AAVE IN+IVTGELISLSEQEL+DCD   NAGC+GGLMD 
Sbjct: 145 VGKVKDQGRSCGSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNAGCDGGLMDD 204

Query: 212 AFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQP 271
           AF+FII NGG+D+++DYPY    + CD ++RN K V+ID YED+   +E SL+KAV++QP
Sbjct: 205 AFEFIISNGGIDTDEDYPYKARNDSCDANKRNRKAVTIDDYEDLR-MNEKSLQKAVSNQP 263

Query: 272 VSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENG 331
           VSVAIEAGGR FQ Y+SG+FTG CG+ LDH    VGYG+ENG DYW+V+ S+G+ WGE+G
Sbjct: 264 VSVAIEAGGRDFQLYKSGIFTGTCGTDLDHATTIVGYGSENGTDYWIVKESYGTSWGESG 323

Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           Y +++RN+ +T +GKCGIAM  SYPVKN+
Sbjct: 324 YARMERNIKET-SGKCGIAMLPSYPVKNT 351


>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
          Length = 379

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 187/341 (54%), Positives = 250/341 (73%), Gaps = 14/341 (4%)

Query: 24  AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFI 83
           A D SII+Y    +     RT+DEVM ++++WL ++GK+ N +G  E+RF+IFKDNLRF+
Sbjct: 24  AFDASIITYAKKWEQ----RTNDEVMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFV 79

Query: 84  DEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
           DEHN+ +NR+YKVGLN+F+DLT EEY ++YLGT+ D     M+    S RY  + GD+LP
Sbjct: 80  DEHNADVNRSYKVGLNQFSDLTLEEYSSIYLGTKFD-----MRMTNVSDRYEPRVGDQLP 134

Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-IN 201
            S+DWR+KGAV  VK+QG+CGSCW F+ +AAVE IN+IVTG LISLSEQ++VDC RK  N
Sbjct: 135 NSIDWRKKGAVLGVKNQGNCGSCWTFAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPN 194

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GC GG    A+QFII NGG+++E +YPY   + +CD  ++N K V+ID YE+V   +E 
Sbjct: 195 NGCKGGSRAGAYQFIIDNGGINTEANYPYKAQDGECD-EQKNQKYVTIDRYENVPRKNEK 253

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           +L+KAV++Q VSV I +    F+ Y+SG+FTG CG+ +DH V  VGYGTE G+DYW+VRN
Sbjct: 254 ALQKAVSNQLVSVGIASNSSEFKAYKSGIFTGPCGAKIDHAVTIVGYGTEGGMDYWIVRN 313

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           SWGS+WGENGYV++QRN+   N G C IA   +YPVK   N
Sbjct: 314 SWGSNWGENGYVRMQRNV--GNAGTCFIATSPNYPVKYGPN 352


>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
           Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
 gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
          Length = 380

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 197/374 (52%), Positives = 258/374 (68%), Gaps = 22/374 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M     F+++S L F         + + I+S   N  + +  RT+DEV  +Y++WL K+G
Sbjct: 1   MGLPKSFVSMSLLFF---------STLLILSLAFNAKNLTQ-RTNDEVKAMYESWLIKYG 50

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +G  E+RF+IFK+ LRFIDEHN+  NR+YKVGLN+FADLT+EE+R+ YLG  S +
Sbjct: 51  KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS 110

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
                K+KV S RY  + G  LP  VDWR  GAV  +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165

Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           IVTG LISLSEQEL+DC R  N  GCNGG +   FQFII NGG+++E++YPY   + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECN 225

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
              +N K V+ID YE+V   +E +L+ AV  QPVSVA++A G AF+HY SG+FTG CG+A
Sbjct: 226 LDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTA 285

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +DH V  VGYGTE G+DYW+V+NSW + WGE GY+++ RN+     G CGIA   SYPVK
Sbjct: 286 IDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343

Query: 359 -NSQNSAKPKPHSS 371
            N+QN   PKP+SS
Sbjct: 344 YNNQN--HPKPYSS 355


>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 190/356 (53%), Positives = 248/356 (69%), Gaps = 14/356 (3%)

Query: 3   TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
           + ++ LA S   F  F S +   D SI+ Y      S   ++ D+++ ++++W++KHGK 
Sbjct: 6   SKALVLACS---FCLFASLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSKHGKI 57

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
              +     RF+IFKDNL+ IDE N +   Y +GLN+FADL+++E++  YLG + D  RR
Sbjct: 58  YQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRR 117

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
               + + + +  K   ELP+SVDWR+KGAV PVK+QGSCGSCWAFSTVAAVEGIN+IVT
Sbjct: 118 ----RESPEEFTYK-DVELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVT 172

Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
           G L SLSEQEL+DCDR  N GCNGGLMDYAF FI++NGG+  E+DYPY+  E  C+ ++ 
Sbjct: 173 GNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKE 232

Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
             +VV+I GY DV   +E SL KA+A+QP+SVAIEA GR FQ Y  GVF G CGS LDHG
Sbjct: 233 ETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHG 292

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           V AVGYGT  GVDY +V+NSWGS WGE GY++++RN +    G CGI   ASYP K
Sbjct: 293 VAAVGYGTAKGVDYIIVKNSWGSKWGEKGYIRMRRN-IGKPEGICGIYKMASYPTK 347


>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
          Length = 380

 Score =  383 bits (984), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 197/374 (52%), Positives = 258/374 (68%), Gaps = 22/374 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M     F+++S L F         + + I+S   N  + +  RT+DEV  +Y++WL K+G
Sbjct: 1   MGLPKSFVSMSLLFF---------STLLILSLAFNAKNLTQ-RTNDEVKAMYESWLIKYG 50

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +G  E+RF+IFK+ LRFIDEHN+  NR+YKVGLN+FADLT+EE+R+ YLG  S +
Sbjct: 51  KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS 110

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
                K+KV S RY  + G  LP  VDWR  GAV  +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRFGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165

Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           IVTG LISLSEQEL+DC R  N  GCNGG +   FQFII NGG+++E++YPY   + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECN 225

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
              +N K V+ID YE+V   +E +L+ AV  QPVSVA++A G AF+HY SG+FTG CG+A
Sbjct: 226 LDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTA 285

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +DH V  VGYGTE G+DYW+V+NSW + WGE GY+++ RN+     G CGIA   SYPVK
Sbjct: 286 IDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343

Query: 359 -NSQNSAKPKPHSS 371
            N+QN   PKP+SS
Sbjct: 344 YNNQN--HPKPYSS 355


>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 471

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 185/344 (53%), Positives = 245/344 (71%), Gaps = 25/344 (7%)

Query: 28  SIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN 87
           +I+ Y+ +  HS     DD ++ ++  WL +H +  + +   ++RFQIFKDNL +I  HN
Sbjct: 33  AIMDYEAHELHS-----DDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHN 87

Query: 88  SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL------ 141
              ++Y +GLNKF+DLT++E+RA+YLG R            A + +  + GD        
Sbjct: 88  KQEKSYWLGLNKFSDLTHDEFRALYLGIRP-----------AGRAHGLRNGDRFIYEDVV 136

Query: 142 -PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
             E VDWR+KGAV+ VKDQGSCGSCWAFS + +VEG+N IVTGELISLSEQELVDCDR  
Sbjct: 137 AEEMVDWRKKGAVSDVKDQGSCGSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQ 196

Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR-NAKVVSIDGYEDVSPFD 259
           N GCNGGLMDYAF FII+NGG+D+E+DYPY   + +CD +R+  +KVV ID Y+DV    
Sbjct: 197 NQGCNGGLMDYAFDFIIKNGGIDTEEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKS 256

Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWL 318
           E SL KAV+  PVSVAIEAGGR FQHY+ GVFTG CG+ LDHGV+AVGYGT ++GV+YW+
Sbjct: 257 ESSLLKAVSKNPVSVAIEAGGRDFQHYQGGVFTGPCGTDLDHGVLAVGYGTDDDGVNYWI 316

Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           V+NSWG  WGE GY++++R   ++ +GKCGI +E S+P+K   N
Sbjct: 317 VKNSWGPSWGEKGYIRMERMGSNSTSGKCGINIEPSFPIKKGAN 360


>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 188/357 (52%), Positives = 248/357 (69%), Gaps = 14/357 (3%)

Query: 2   ATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGK 61
           ++ ++FLA S   F  F S + A D SI+ Y      S   ++ D+++ ++++W+++HGK
Sbjct: 5   SSKALFLACS---FCLFASLAVAGDFSIVGYS-----SEDLKSMDKLIELFESWMSRHGK 56

Query: 62  TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR 121
               +     RF IFKDNL+ IDE N +   Y +GLN+FADL+++E++  YLG + D  R
Sbjct: 57  IYQSIEEKLHRFDIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSR 116

Query: 122 RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
           R    + + + +  K   ELP+SVDWR+KGAV  VK+QGSCGSCWAFSTVAAVEGIN+IV
Sbjct: 117 R----RESPEEFTYKDF-ELPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIV 171

Query: 182 TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
           TG L SLSEQEL+DCDR  N GCNGGLMDYAF FI++NGG+  E+DYPY+  E  C+ ++
Sbjct: 172 TGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTK 231

Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
              +VV+I GY DV   +E SL KA+ +QP+SVAIEA GR FQ Y  GVF G CGS LDH
Sbjct: 232 EETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDH 291

Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           GV AVGYGT  GV+Y +V+NSWGS WGE GY++++RN +    G CGI   ASYP K
Sbjct: 292 GVAAVGYGTSKGVNYIIVKNSWGSKWGEKGYIRMRRN-IGKPEGICGIYKMASYPTK 347


>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 188/358 (52%), Positives = 243/358 (67%), Gaps = 11/358 (3%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA ++   A   L    FI+ + A D SI+ Y   H  S      D+ + ++++W++KH 
Sbjct: 1   MALSTFSKATLILSATLFITYAIAHDFSIVGYSPEHLASM-----DKTIELFESWMSKHS 55

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           KT   +     RF+IF DNL+ IDE N    +Y +GLN+FADL++EE+++ YLG R +  
Sbjct: 56  KTYRSIEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFP 115

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           R     K +S+ ++    ++LPESVDWR KGAV PVK+QGSCGSCWAFSTVAAVEGIN+I
Sbjct: 116 R-----KRSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQI 170

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           VTG L SLSEQEL+DCDR  N GC GGLMDYAFQ+I+ N G+  E+DYPYL  E +C   
Sbjct: 171 VTGNLTSLSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIRE 230

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           +   +VV+I GYEDV   DE SL KA++ QPVSVAIEA  R FQ Y+ G+FTG CG+ +D
Sbjct: 231 KEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMD 290

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           HGV AVGYG+  G DY +V+NSWG  WGENGY++++RN      G CGI   ASYP K
Sbjct: 291 HGVTAVGYGSSEGTDYIIVKNSWGPKWGENGYIRMKRN-TGKPEGLCGINQMASYPTK 347


>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
 gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
          Length = 479

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 187/327 (57%), Positives = 232/327 (70%), Gaps = 11/327 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKT--SNGM------GHNEKRFQIFKDNLRFIDEHNSLNRTYKV 95
           +++ +  ++ +W+ +HGK+   N +      G    R+ IFKDNLRFI   N  N+ Y +
Sbjct: 49  SEERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGENEKNQGYFL 108

Query: 96  GLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP 155
           GLN FADLTNEE+RA   G R D  R   ++     RY      +LP+S+DWREKGAV  
Sbjct: 109 GLNAFADLTNEEFRAQRHGGRFDRSRE--RTSYEEFRYGSVQLKDLPDSIDWREKGAVVG 166

Query: 156 VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQF 215
           VKDQGSCGSCWAFS VAA+EG+NK+ TGEL+SLSEQELVDCD+  + GCNGGLMDYAF F
Sbjct: 167 VKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGF 226

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
           +I+NGG+D+E DYPY G   +CD S+ NAKVV+IDGYEDV   DE +L KAVA QPVSVA
Sbjct: 227 VIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVA 286

Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
           I+AGG + Q Y SG+FTG CG+ LDHGV  VGYG E+G  YW+++NSWGS+WGE GY+K+
Sbjct: 287 IDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGKEDGKAYWIIKNSWGSNWGEKGYIKM 346

Query: 336 QRNLLDTNTGKCGIAMEASYPVKNSQN 362
            RN      G CGI MEASYP K   N
Sbjct: 347 ARN-TGLAAGLCGINMEASYPTKTGAN 372


>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
 gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
          Length = 479

 Score =  380 bits (977), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 187/327 (57%), Positives = 231/327 (70%), Gaps = 11/327 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKT--------SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKV 95
           +++ +  ++ +W+ +HGK+         +  G    R+ IFKDNLRFI   N  N+ Y +
Sbjct: 49  SEERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGENEKNQGYFL 108

Query: 96  GLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP 155
           GLN FADLTNEE+RA   G R D  R   ++     RY      +LP+S+DWREKGAV  
Sbjct: 109 GLNAFADLTNEEFRAQRHGGRFDRSRE--RTSHEEFRYGSVQLKDLPDSIDWREKGAVVG 166

Query: 156 VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQF 215
           VKDQGSCGSCWAFS VAA+EG+NK+ TGEL+SLSEQELVDCD+  + GCNGGLMDYAF F
Sbjct: 167 VKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGF 226

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
           +I+NGG+D+E DYPY G   +CD S+ NAKVV+IDGYEDV   DE +L KAVA QPVSVA
Sbjct: 227 VIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVA 286

Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
           I+AGG + Q Y SG+FTG CG+ LDHGV  VGYG E+G  YW+++NSWGS+WGE GYVK+
Sbjct: 287 IDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGKEDGKAYWIIKNSWGSNWGEKGYVKM 346

Query: 336 QRNLLDTNTGKCGIAMEASYPVKNSQN 362
            RN      G CGI MEASYP K   N
Sbjct: 347 ARN-TGLAAGLCGINMEASYPTKTGAN 372


>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
 gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  380 bits (977), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 187/358 (52%), Positives = 243/358 (67%), Gaps = 11/358 (3%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA ++   A   L    FI+ ++A D SI+ Y   H  S      D+ + ++++W++KH 
Sbjct: 1   MALSTFSKATLILSATLFITYATAHDFSIVGYSPEHLASM-----DKTIELFESWMSKHS 55

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K    +     RF+IF DNL+ IDE N    +Y +GLN+FADL++EE+++ YLG R +  
Sbjct: 56  KAYRSIEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFP 115

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           R     K +S+ ++    ++LPESVDWR KGAV PVK+QGSCGSCWAFSTVAAVEGIN+I
Sbjct: 116 R-----KRSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQI 170

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           VTG L SLSEQEL+DCDR  N GC GGLMDYAFQ+I+ N G+  E+DYPYL  E +C   
Sbjct: 171 VTGNLTSLSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIRE 230

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           +   +VV+I GYEDV   DE SL KA++ QPVSVAIEA  R FQ Y+ G+FTG CG+ +D
Sbjct: 231 KEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMD 290

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           HGV AVGYG+  G DY +V+NSWG  WGENGY++++RN      G CGI   ASYP K
Sbjct: 291 HGVTAVGYGSSEGTDYIIVKNSWGPKWGENGYIRMKRN-TGKPEGLCGINQMASYPTK 347


>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  380 bits (976), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 188/356 (52%), Positives = 244/356 (68%), Gaps = 11/356 (3%)

Query: 3   TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
           ++S  L +    F  F S +   D SI+ Y      S   ++ D+++ ++++W+++HGK 
Sbjct: 4   SSSKALVLIACSFCLFASLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSRHGKI 58

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
              +     RF+IFKDNL+ IDE N +   Y +GLN+FADL++ E+   YLG + D  RR
Sbjct: 59  YENIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHREFNNKYLGLKVDYSRR 118

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
               + + + +  K   ELP+SVDWR+KGAV PVK+QGSCGSCWAFSTVAAVEGIN+IVT
Sbjct: 119 ----RESPEEFTYK-DVELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVT 173

Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
           G L SLSEQEL+DCDR  N GCNGGLMDYAF FI++NGG+  E+DYPY+  E  C+ ++ 
Sbjct: 174 GNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKE 233

Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
             +VV+I GY DV   +E SL KA+A+QP+SVAIEA GR FQ Y  GVF G CGS LDHG
Sbjct: 234 ETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHG 293

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           V AVGYGT  GVDY  V+NSWGS WGE GY++++RN +    G CGI   ASYP K
Sbjct: 294 VAAVGYGTAKGVDYITVKNSWGSKWGEKGYIRMRRN-IGKPEGICGIYKMASYPTK 348


>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
 gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
          Length = 380

 Score =  380 bits (976), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 195/374 (52%), Positives = 257/374 (68%), Gaps = 22/374 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M     F+++S L F         + + I+S   N  + +  RT+DEV  +Y++WL K+G
Sbjct: 1   MGLPKSFVSMSLLFF---------STLLILSLAFNAKNLTQ-RTNDEVKAMYESWLIKYG 50

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +G  E+RF+IFK+ LRFIDEHN+  NR+YKVGLN+FADLT+EE+R+ YLG  S +
Sbjct: 51  KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS 110

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
                K+KV S RY  + G  LP  VDWR  GAV  +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165

Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           IVTG LISLSEQEL+DC R  N  GCNGG +   FQFII NGG+++E++YPY   + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECN 225

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
              +N K V+ID YE+V   +E +L+ AV  QPVSVA++A G AF+ Y SG+FTG CG+A
Sbjct: 226 VELQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA 285

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +DH V  VGYGTE G+DYW+V+NSW + WGE GY+++ RN+     G CGIA   SYPVK
Sbjct: 286 IDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343

Query: 359 -NSQNSAKPKPHSS 371
            N+QN   P+P+SS
Sbjct: 344 YNNQN--YPEPYSS 355


>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
          Length = 245

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 173/228 (75%), Positives = 203/228 (89%)

Query: 138 GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD 197
           G+ LPESVDWRE GAVNPVKDQ SCGSCWAFSTVAAVEGIN+IVTGELISLSEQELVDCD
Sbjct: 3   GEVLPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCD 62

Query: 198 RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSP 257
            + + GCNGGLMDYAF FII+NGG+D+E+DYPY G + +C+ S +++KVVSIDGYEDV P
Sbjct: 63  TEYDMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPP 122

Query: 258 FDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYW 317
           FDE +L+KAVA QPVSVA+EAGGRA Q Y SG+FTGECG+ALDHG+VAVGYGTENG DYW
Sbjct: 123 FDEKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGTENGTDYW 182

Query: 318 LVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
           +VRNSWGS WGENGY++++RN+ D  +GKCGIAMEASYP+KN +N +K
Sbjct: 183 IVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPIKNGENPSK 230


>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 308

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 191/314 (60%), Positives = 237/314 (75%), Gaps = 21/314 (6%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
           +Y+ WL ++ K  NG+G  E+R +IFK+NL+FIDEHNSL N+T++VGL +FADLTN+E  
Sbjct: 1   MYERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDE-- 58

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
                      +  MK+     RY  K GD LP+ +DWR KGAV PVKDQG+CGSCWAFS
Sbjct: 59  ----------PKDFMKA----DRYLYKEGDILPDEIDWRAKGAVVPVKDQGNCGSCWAFS 104

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
            V AVEGIN+I TGELISLS+QEL+DCDR  +NAGC GG+M+YAF+FII NGG++S+QDY
Sbjct: 105 AVGAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDY 164

Query: 229 PYLGAE-NKCDPSRRN-AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
           PY   +   C+  ++N  +VV IDGYE V+  DE SLKKAVA QPV VAIEA  +AF+ Y
Sbjct: 165 PYTATDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLY 224

Query: 287 ESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
           +SGVFTG CG  LDHGVV VGYGT +G DYW++RNSWG +WGENGYVKLQRN +D + GK
Sbjct: 225 KSGVFTGTCGIYLDHGVVVVGYGTSSGEDYWIIRNSWGLNWGENGYVKLQRN-IDDSFGK 283

Query: 347 CGIAMEASYPVKNS 360
           CG+AM  SYP K+S
Sbjct: 284 CGVAMMPSYPTKSS 297


>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
          Length = 464

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 197/346 (56%), Positives = 252/346 (72%), Gaps = 20/346 (5%)

Query: 27  MSIISYDNNHDHSS---SWRTDDEVMTIYQTWLAKH---GKTSNG-MGHNEKRFQIFKDN 79
           MSII Y+  H         RT+ E   +Y  W+A+H   G + NG +G  E+RF++F DN
Sbjct: 38  MSIIRYNAEHGVRGLEVVERTEAEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDN 97

Query: 80  LRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
           L+F+D HN+    +  +++G+N+FADLTN+E+RA YLGT    + R +      + Y   
Sbjct: 98  LKFVDAHNAHADGHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRHV-----GEMYRHD 152

Query: 137 AGDELPESVDWREKGAV-NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
             + LP+SVDWR+KGAV +PVK+QG CGSCWAFS VAAVEGINKIVTGEL+SLSEQELV+
Sbjct: 153 GVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVE 212

Query: 196 CDRKINAGCNGG-LMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYED 254
           C R        G +MD AF FI +NGG+D+E+DYPY   + KCD ++++ KVVSIDG+ED
Sbjct: 213 CARNGGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFED 272

Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--N 312
           V   DE+SL+KAVA QPVSVAI+AGGR FQ Y+SGVFTG CG++LDHGVVAVGYGT+   
Sbjct: 273 VPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAAT 332

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           G DYW VRNSWG DWGENGY++++RN+    TGKCGIAM ASYP+K
Sbjct: 333 GTDYWTVRNSWGPDWGENGYIRMERNVT-ARTGKCGIAMMASYPIK 377


>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
          Length = 350

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 187/356 (52%), Positives = 244/356 (68%), Gaps = 11/356 (3%)

Query: 3   TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
           ++S  L +    F  F S +   D SI+ Y      S   ++ D+++ ++++W+++HGK 
Sbjct: 4   SSSKALVLIACSFCLFASLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSRHGKI 58

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
              +     RF+IFKDNL+ IDE N +   Y +GL++FADL++ E+   YLG + D  RR
Sbjct: 59  YENIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLSEFADLSHREFNNKYLGLKVDYSRR 118

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
               + + + +  K   ELP+SVDWR+KGAV PVK+QGSCGSCWAFSTVAAVEGIN+IVT
Sbjct: 119 ----RESPEEFTYK-DVELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVT 173

Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
           G L SLSEQEL+DCDR  N GCNGGLMDYAF FI++NGG+  E+DYPY+  E  C+ ++ 
Sbjct: 174 GNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGACEMTKE 233

Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
             +VV+I GY DV   +E SL KA+A+QP+SVAIEA GR FQ Y  GVF G CGS LDHG
Sbjct: 234 ETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHG 293

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           V AVGYGT  GVDY  V+NSWGS WGE GY++++RN +    G CGI   ASYP K
Sbjct: 294 VAAVGYGTAKGVDYITVKNSWGSKWGEKGYIRMRRN-IGKPEGICGIYKMASYPTK 348


>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
           1; Flags: Precursor
 gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
          Length = 380

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 195/374 (52%), Positives = 256/374 (68%), Gaps = 22/374 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M     F+++S L F         + + I+S   N  + +  RT+DEV  +Y++WL K+G
Sbjct: 1   MGLPKSFVSMSLLFF---------STLLILSLAFNAKNLTQ-RTNDEVKAMYESWLIKYG 50

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +G  E+RF+IFK+ LRFIDEHN+  NR+YKVGLN+FADLT+EE+R+ YL   S +
Sbjct: 51  KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFTSGS 110

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
                K+KV S RY  + G  LP  VDWR  GAV  +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165

Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           IVTG LISLSEQEL+DC R  N  GCNGG +   FQFII NGG+++E++YPY   + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECN 225

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
              +N K V+ID YE+V   +E +L+ AV  QPVSVA++A G AF+ Y SG+FTG CG+A
Sbjct: 226 VDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA 285

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +DH V  VGYGTE G+DYW+V+NSW + WGE GY+++ RN+     G CGIA   SYPVK
Sbjct: 286 VDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343

Query: 359 -NSQNSAKPKPHSS 371
            N+QN   PKP+SS
Sbjct: 344 YNNQN--HPKPYSS 355


>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
          Length = 362

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 186/336 (55%), Positives = 235/336 (69%), Gaps = 6/336 (1%)

Query: 38  HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
           H     T+D +  +Y+ W  +H K +   G   +RF +FK N+  + E N +++ YK+ L
Sbjct: 26  HEKELETEDNLWDMYERW--RH-KVATNHGEKLRRFNVFKSNVLHVHETNKMDKPYKLKL 82

Query: 98  NKFADLTNEEYRAMYLGTR-SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
           NKFAD+TN E+R++Y G++     R L   +  S+ +     + +P SVDWR+KGAV PV
Sbjct: 83  NKFADMTNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVESVPTSVDWRKKGAVAPV 142

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFI 216
           KDQG CGSCWAFSTVAAVEGINKI T EL+SLSEQELVDCD   N GCNGGLMD AF FI
Sbjct: 143 KDQGQCGSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQGCNGGLMDLAFDFI 202

Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
            + GG+  E  YPY   + KCD ++ N+ VVSIDG+EDV   DE SL KAVA+QPV+VAI
Sbjct: 203 KKTGGLTREDAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAI 262

Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKL 335
           +AG   FQ Y  GVFTG+CG+ LDHGV AVGYGT  +G  YW+VRNSWGS+WGE GY+++
Sbjct: 263 DAGSSDFQFYSEGVFTGKCGTQLDHGVAAVGYGTTLDGTKYWIVRNSWGSEWGEKGYIRM 322

Query: 336 QRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
           +R + D   G CGIAMEASYP+KNS N+ K  P SS
Sbjct: 323 ERGISDKR-GLCGIAMEASYPIKNSSNNPKSSPTSS 357


>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
          Length = 380

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 195/374 (52%), Positives = 256/374 (68%), Gaps = 22/374 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M     F+++S L F         + + I+S   N  + +  RT+DEV  +Y++WL K+G
Sbjct: 1   MGLPKSFVSMSLLFF---------STLLILSLAFNTKNLTQ-RTNDEVKAMYESWLIKYG 50

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +G  E+RF+IFK+ LRFIDEHN+  NR+YKVGLN+FADLT+EE+R+ YLG  S +
Sbjct: 51  KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS 110

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
                K+KV S RY  + G  LP  VDWR  GAV  +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165

Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           IVTG LISLSEQEL+DC R  N  GCNGG +   FQFII NGG+++E++YPY   + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECN 225

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
              +N K V+ID YE+V   +E +L+ AV  QPVSVA++A G AF+ Y SG+FTG CG+A
Sbjct: 226 VDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA 285

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +DH V  VGYGTE G+DYW+V+NSW + WGE GY+++ RN+     G CGIA   SYPVK
Sbjct: 286 IDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343

Query: 359 -NSQNSAKPKPHSS 371
            N+QN   PK +SS
Sbjct: 344 YNNQN--HPKSYSS 355


>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  376 bits (966), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 186/356 (52%), Positives = 243/356 (68%), Gaps = 11/356 (3%)

Query: 3   TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
           + S  L +    F  F S +   D SI+ Y      S   ++ D+++ ++++W+++HGK 
Sbjct: 4   STSKALRVLACSFCLFASFTFGRDFSIVGYS-----SEDLKSMDKLIELFESWISRHGKI 58

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
              +     RF+IFKDNL+ IDE N +   Y +GLN+FADL+++E++  YLG + D  RR
Sbjct: 59  YQSIEEKLHRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRR 118

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
               + + + +  K   ELP+SVDWR+KGAV  VK+QGSCGSCWAFSTVAAVEGIN+IVT
Sbjct: 119 ----RESPEEFTYK-DVELPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVT 173

Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
           G L SLSEQEL+DCDR  N GCNGGLMDYAF FI++N G+  E+DYPY+  E  C+ ++ 
Sbjct: 174 GNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENDGLHKEEDYPYIMEEGTCEMAKE 233

Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
             +VV+I GY DV   +E SL KA+A+QP+SVAIEA GR FQ Y  GVF G CGS LDHG
Sbjct: 234 ETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHG 293

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           V AVGYGT  GVDY  V+NSWGS WGE GY++++RN +    G CGI   ASYP K
Sbjct: 294 VAAVGYGTAKGVDYITVKNSWGSKWGEKGYIRMRRN-IGKPEGICGIYKMASYPTK 348


>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
 gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
          Length = 372

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 186/330 (56%), Positives = 244/330 (73%), Gaps = 17/330 (5%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEK-RFQIFKDNLRFIDEHNSLN----RTYKVGL 97
           R DDEV  +Y+ W ++HG   +G G +++ R ++F+DNLR+ID HN+       T+++GL
Sbjct: 43  RADDEVRRMYEAWKSEHG---HGHGSDDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGL 99

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKV---ASQRYACKAGDELPESVDWREKGAVN 154
             FADLT EEYR   LG R+   RR   S+V   +S R   + GD LP+++DWRE GAV 
Sbjct: 100 TPFADLTLEEYRGRALGFRA---RRGGASRVGSGSSYRPRPRGGD-LPDAIDWRELGAVT 155

Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQ 214
            VK+Q  CG CWAFS VAA+EGIN+IVTG L+SLSEQE++DCD + + GCNGG M  AFQ
Sbjct: 156 GVKNQEQCGGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCDTQ-DGGCNGGEMQNAFQ 214

Query: 215 FIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSV 274
           F+I NGG+D+E DYPYLG +  CD +R N +VV+IDG+  V+  +E +L++AVA+QPVSV
Sbjct: 215 FVINNGGIDTEADYPYLGTDAACDANRVNERVVTIDGFVSVATENETALQEAVANQPVSV 274

Query: 275 AIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
           AI+A GR FQHY SG+F G CG+ LDHGV AVGYG+ENG DYW+V+NSW S WGE GY++
Sbjct: 275 AIDASGRKFQHYTSGIFNGPCGTQLDHGVTAVGYGSENGKDYWIVKNSWSSSWGEAGYIR 334

Query: 335 LQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           ++RN+    TGKCGIAM+ASYPVK+S N A
Sbjct: 335 IRRNVA-AATGKCGIAMDASYPVKSSSNPA 363


>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
 gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
          Length = 349

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 181/358 (50%), Positives = 244/358 (68%), Gaps = 11/358 (3%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA + +  +  T     F+ S  A D SI+ Y   H  S      D+++ ++++W++ HG
Sbjct: 1   MALSVLKTSFLTFFASLFVCSVLAHDFSIVGYSPEHLTSV-----DKLVELFESWISGHG 55

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K  N +     RF++FK+NL+ ID+ N    +Y +GLN+FADL++EE+++ +LG   +  
Sbjct: 56  KAYNSLEEKLHRFEVFKENLKHIDQRNKEVTSYWLGLNEFADLSHEEFKSKFLGLYPEFP 115

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           R     K +S+ ++ +   +LP+S+DWR+KGAV PVK+QGSCGSCWAFSTVAAVEGIN+I
Sbjct: 116 R-----KKSSEDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQI 170

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           V G L SLSEQ+L+DCD   N GCNGGLMDYAF+FI+ NGG+  E+DYPYL  E  CD  
Sbjct: 171 VAGNLTSLSEQQLIDCDTSFNNGCNGGLMDYAFEFIVNNGGLHKEEDYPYLMEEGTCDEK 230

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           R   +VV+I GY DV   DE SL KA+A QP+SVAI+A GR FQ Y  GVF+G CG+ LD
Sbjct: 231 REEMEVVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFYSGGVFSGPCGTDLD 290

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           HGV AVGYG+ +G+DY +V+NSWG  WGE GY++++RN      G CGI   ASYP K
Sbjct: 291 HGVAAVGYGSSSGIDYIIVKNSWGPKWGERGYLRMKRN-TGKPEGLCGINKMASYPTK 347


>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
          Length = 380

 Score =  373 bits (957), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 193/374 (51%), Positives = 254/374 (67%), Gaps = 22/374 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M     F+++S L F         + + I+S   N  + +  RT+DEV  +Y++WL K+G
Sbjct: 1   MGLPKSFVSMSLLFF---------STLLILSLAFNAKNLTQ-RTNDEVKAMYESWLIKYG 50

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K+ N +G  E+RF+IFK+ LRFIDEHN+  NR+YKVGLN+FADLT+EE+R+ YLG  S +
Sbjct: 51  KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS 110

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
                K+KV S RY  + G  LP  VDWR  GAV  +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165

Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           IVTG LISLSEQEL+DC R  N  GCNG  +   F FII NGG+++E++YPY   + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQDGECN 225

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
              +N K V+ID YE+V   +E +L+ AV  QPVSVA++A G AF+ Y SG+FTG CG+A
Sbjct: 226 VDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA 285

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +DH V  VGYGTE G+DYW+V+NSW + WGE GY+++ RN+     G CGIA   SYPVK
Sbjct: 286 IDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343

Query: 359 -NSQNSAKPKPHSS 371
            N+QN   PK +SS
Sbjct: 344 YNNQN--HPKSYSS 355


>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
 gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
 gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
          Length = 350

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 179/358 (50%), Positives = 247/358 (68%), Gaps = 10/358 (2%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA  S    + T     F+S +   D SI+ Y      S   ++ D+++ ++++W+++HG
Sbjct: 1   MAFFSPKTLVLTCSLCLFLSLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSRHG 55

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K    +     RF++FKDNL+ ID+ N +   Y +GLN+FADL+++E++  YLG + D  
Sbjct: 56  KIYETIEEKLLRFEVFKDNLKHIDDRNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDLS 115

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           +R    + +S+        +LP+SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGIN+I
Sbjct: 116 QR----RESSEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQI 171

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           VTG L SLSEQEL+DCD   N GCNGGLMDYAF FI++NGG+  E+DYPY+  E+ C+  
Sbjct: 172 VTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVKNGGLHKEEDYPYIMEESTCEMK 231

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           +  ++VV+I+GY DV   +E SL KA+A+QP+SVAIEA GR FQ Y  GVF G CGS LD
Sbjct: 232 KEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSELD 291

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           HGV AVGYGT  G+DY +V+NSWG+ WGE G+++++RN +  + G CG+   ASYP K
Sbjct: 292 HGVSAVGYGTSKGLDYIIVKNSWGAKWGEKGFIRMKRN-IGKSEGICGLYKMASYPTK 348


>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
          Length = 360

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 183/368 (49%), Positives = 240/368 (65%), Gaps = 20/368 (5%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           +FL + TL  +  +  S         +D    H     T+++   +Y+ W + H   S  
Sbjct: 4   LFLVLFTLALVLRLGES---------FDF---HEKELETEEKFWELYERWRSHH-TVSRS 50

Query: 66  MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
           +    KRF +FK N+ ++   N  ++ YK+ LNKFAD+TN E+R  Y G++    R L+ 
Sbjct: 51  LDEKHKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRQHYAGSKIKHHRTLLG 110

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
           +  A+  +     D +P S+DWR+KGAV PVKDQG CGSCWAFSTV AVEGIN+I T +L
Sbjct: 111 ASRANGTFMYANEDNVPPSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGINQIKTKKL 170

Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           +SLSEQELVDCD   N GCNGGLMD AF FI + GG+ +E+ YPY   ++KCD  +RN  
Sbjct: 171 VSLSEQELVDCDTTENQGCNGGLMDPAFDFIKKRGGITTEERYPYKAEDDKCDIQKRNTP 230

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
           VVSIDG+EDV P DE +L KAVA+QP+SVAI+A G  FQ Y  GVFTGECG+ LDHGV  
Sbjct: 231 VVSIDGHEDVPPNDEDALLKAVANQPISVAIDASGSQFQFYSEGVFTGECGTELDHGVAI 290

Query: 306 VGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN-- 362
           VGYGT  +G  YW+V+NSWG+ WGE GY+++QR  +D   G CGIAM+ SYP+K S N  
Sbjct: 291 VGYGTTVDGTKYWIVKNSWGAGWGEKGYIRMQRK-VDAEEGLCGIAMQPSYPIKTSSNPT 349

Query: 363 ---SAKPK 367
              +A PK
Sbjct: 350 GSPAATPK 357


>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
          Length = 360

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 175/334 (52%), Positives = 229/334 (68%), Gaps = 3/334 (0%)

Query: 38  HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
           H     T++ +  +Y+ W + H   S  +    KRF +FK+N+ F+ E N  +  YK+ L
Sbjct: 24  HQKELETEESLWNLYERWRSHH-TVSRSLDEKHKRFNVFKENVNFVHEFNKKDEPYKLKL 82

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           NKFAD+TN E+R+ Y G++ +  R    S+ A+  +  +    +P SVDWR+KGAV P+K
Sbjct: 83  NKFADMTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIK 142

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQG CGSCWAFSTV AVEGIN I T +L+SLSEQELVDCD   N GCNGGLM YAF+FI 
Sbjct: 143 DQGQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIK 202

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           + GG+ +EQ YPY   +  CD S+ N+ VVSIDG+E V P +E +L KA A+QP+SVAI+
Sbjct: 203 EKGGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAID 262

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
           AGG AFQ Y  GVF G CG+ LDHGV  VGYGT  +G  YW+V+NSWG+DWGENGY++++
Sbjct: 263 AGGSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMK 322

Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
           R  +    G CGIA+EASYP+KNS  +    P S
Sbjct: 323 RG-ISAKEGLCGIAVEASYPIKNSSTNPVGAPSS 355


>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
          Length = 361

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 174/322 (54%), Positives = 228/322 (70%), Gaps = 3/322 (0%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
           +Y+ W + H   S  +   +KRF +FK N+ ++   N  ++ YK+ LNKFAD+TN E+R 
Sbjct: 37  LYERWRSHH-TVSRSLDEKDKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRH 95

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
            Y G++    R  + +  A+  +     +++P SVDWR+KGAV PVKDQG CGSCWAFST
Sbjct: 96  HYAGSKIKHHRSFLGASRANGTFMYANVEDVPPSVDWRKKGAVTPVKDQGKCGSCWAFST 155

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           V AVEGIN+I T EL+SLSEQELVDCD   N GCNGGLMD AF+FI + GG+++E++YPY
Sbjct: 156 VVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTEENYPY 215

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
           +    +CD  +RN+ VVSIDGYEDV P DE SL KAVA+QPVSVAI+A G  FQ Y  GV
Sbjct: 216 MAEGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQFYSEGV 275

Query: 291 FTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           FTG+CG+ LDHGV  VGYGT  +G  YW+VRNSWG +WGE GY+++QR  +D   G CGI
Sbjct: 276 FTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRE-IDAEEGLCGI 334

Query: 350 AMEASYPVKNSQNSAKPKPHSS 371
           AM+ SYP+K S ++    P ++
Sbjct: 335 AMQPSYPIKTSSSNPTGSPATA 356


>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
          Length = 379

 Score =  369 bits (948), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 188/381 (49%), Positives = 251/381 (65%), Gaps = 22/381 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSS-------------SWRTDDE 47
           M +A   L I  L+ +   S ++A DMS+++YD+NH  ++             +   D E
Sbjct: 1   MGSAKSALLI-LLLAMVIASCATAMDMSVVTYDDNHHVTAGPGHHVTAGPGRRNGVFDVE 59

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEE 107
              I+++W+ KHGK  + +   E+R  IFKDNLRFI   NS N  Y++GLN+FADL+  E
Sbjct: 60  ASLIFESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSENLGYRLGLNRFADLSLHE 119

Query: 108 YRAMYLGTRSDAKRR--LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
           Y+ +  G      R    M S   S RY   AGD LP+SVDWR +GAV  VKDQG C SC
Sbjct: 120 YKEICHGADPKPPRNHVFMSS---SDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSC 176

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFSTV AVEG+NKIVTGEL++LSEQ+L++C+++ N GC GG ++ A++FI+ NGG+ ++
Sbjct: 177 WAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIVSNGGLGTD 235

Query: 226 QDYPYLGAENKCDPS-RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
            DYPY      CD   + N K V IDGYE++   DE++L KAVA QPV+  I++  R FQ
Sbjct: 236 NDYPYKAVNGACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQ 295

Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            YESGVF G CG+ L+HGVV VGYGTENG +YW+VRNSWG+ WGE GY+K+ RN+ +   
Sbjct: 296 LYESGVFDGRCGTNLNHGVVVVGYGTENGRNYWIVRNSWGNTWGEAGYMKMARNIANPR- 354

Query: 345 GKCGIAMEASYPVKNSQNSAK 365
           G CGIAM  SYP+KNS  + K
Sbjct: 355 GLCGIAMRVSYPLKNSFTTGK 375


>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 348

 Score =  368 bits (945), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 184/361 (50%), Positives = 244/361 (67%), Gaps = 23/361 (6%)

Query: 8   LAISTLVFL-----FFISSSSAADMSIISYDNNHDHSSSWRTD----DEVMTIYQTWLAK 58
           +A+S L+ L     FF+ +S   D SI+ Y         W  D    D ++ +++ W++ 
Sbjct: 1   MALSKLLPLAMCMSFFVVTSFGKDFSIVGY---------WPEDLTSMDRLIELFEEWISN 51

Query: 59  HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
           HGK    +     RF++FKDNL+ IDE N    +Y +G+N+FADLT++E++ MYLG + +
Sbjct: 52  HGKIYETIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVE 111

Query: 119 AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
           + R    ++ + + +  K   +LP+SVDWR+KGAV  VK+QGSCGSCWAFSTVAAVEGIN
Sbjct: 112 SSR----TRQSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGIN 167

Query: 179 KIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           KIV G L SLSEQEL+DCDR  N GC+GGLMDYAF FI+ +GG+  E+DYPYL  E+ CD
Sbjct: 168 KIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYLEVESTCD 227

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             +   +VV+I GY+DV   +E SL KA+A QP+SVAIEA GR FQ Y  GVF G CG+ 
Sbjct: 228 NKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYSGGVFDGPCGTQ 287

Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           LDHGV AVGYG+  GVDY +V+NSWG  WGE GY++++RN      G CGI   ASYP K
Sbjct: 288 LDHGVTAVGYGSSKGVDYIIVKNSWGPKWGEKGYIRMKRN-TGKPAGLCGINKMASYPTK 346

Query: 359 N 359
           +
Sbjct: 347 S 347


>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
          Length = 351

 Score =  367 bits (943), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 184/360 (51%), Positives = 245/360 (68%), Gaps = 16/360 (4%)

Query: 6   MFLAISTLVFLFFIS------SSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKH 59
           M L+  +  FL FIS      S+ A D SI+ Y  + D +S     D++  ++++W++KH
Sbjct: 1   MALSPFSNFFLLFISMAVFAYSAFARDFSIVGYSPD-DLTSM----DKLTDLFESWMSKH 55

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           GK+         RF++F+DNL+ IDE N    +Y +GLN+FADL++EE++  YLG + + 
Sbjct: 56  GKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGLKIEL 115

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
            +R    + + + ++ K   +LP+SVDWR+KGAV  VK+QG+CGSCWAFSTVAAVEGIN+
Sbjct: 116 PKR----RDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQ 171

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           IVTG L +LSEQEL+DCD+  N GCNGGLMDYAF FII NGG+  E+DYPY+  E  C  
Sbjct: 172 IVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEGTCGE 231

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
            +   +VV+I GY DV   +E S  KA+A+QP+SVAIEA  R FQ Y  G+F G CG+ L
Sbjct: 232 KKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHCGTEL 291

Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           DHGV AVGYGT  GVDY  V+NSWGS WGE GY++++RN +    G CGI   ASYP KN
Sbjct: 292 DHGVAAVGYGTSKGVDYITVKNSWGSKWGEKGYIRMKRN-VGKPEGICGIYKMASYPTKN 350


>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  367 bits (943), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 179/358 (50%), Positives = 242/358 (67%), Gaps = 9/358 (2%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA  S    + T     F+S +   D SI+ Y      S   ++ D+++ ++++W+++HG
Sbjct: 1   MAFFSSKTLVLTCSLCLFLSLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSRHG 55

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K    +     RF++FKDNL+ IDE N +   Y +GLN+FADL+++E++  YLG + +  
Sbjct: 56  KIYETIEEKLLRFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVNLS 115

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           +R   S    + +  +  D LP+SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGIN+I
Sbjct: 116 QRRESSN--EEEFTYRDVD-LPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQI 172

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           VTG L SLSEQEL+DCD   N GCNGGLMDYAF FI+QNGG+  E DYPY+  E+ C+  
Sbjct: 173 VTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQNGGLHKEDDYPYIMEESTCEMK 232

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           +   +VV+I+GY DV   +E SL KA+A+QP+SVAIEA  R FQ Y  GVF G CGS LD
Sbjct: 233 KEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGGVFDGHCGSDLD 292

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           HGV AVGYGT   +DY +V+NSWG+ WGE G+++++RN +    G CG+   ASYP K
Sbjct: 293 HGVSAVGYGTSKNLDYIIVKNSWGAKWGEKGFIRMKRN-IGKPEGICGLYKMASYPTK 349


>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 351

 Score =  367 bits (942), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 181/357 (50%), Positives = 239/357 (66%), Gaps = 18/357 (5%)

Query: 7   FLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTD----DEVMTIYQTWLAKHGKT 62
           F     +   FF+ +S   D SI+ Y         W  D    D ++ +++ W++ HGK 
Sbjct: 8   FYFFLAMCMSFFVVTSFGKDFSIVGY---------WPEDLTSMDRLIELFEEWISNHGKI 58

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
              +     RF++FKDNL+ IDE N    +Y +G+N+FADLT++E++ MYLG + ++ R 
Sbjct: 59  YETIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVESSR- 117

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
              ++ + + +  K   +LP+SVDWR+KGAV  VK+QGSCGSCWAFSTVAAVEGINKIV 
Sbjct: 118 ---TRQSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGINKIVG 174

Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
           G L SLSEQEL+DCDR  N GC+GGLMDYAF FI+ +GG+  E+DYPYL  E+ CD  + 
Sbjct: 175 GNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYLEVESTCDNKKG 234

Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
             +VV+I GY+DV   +E SL KA+A QP+SVAIEA GR FQ Y  GVF G CG+ LDHG
Sbjct: 235 ELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYSGGVFDGPCGTQLDHG 294

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           V AVGYG+  GVDY +V+NSWG  WGE GY++++RN      G CGI   ASYP K+
Sbjct: 295 VTAVGYGSSKGVDYIIVKNSWGPKWGEKGYIRMKRN-TGKPAGLCGINKMASYPTKS 350


>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
          Length = 345

 Score =  367 bits (941), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 185/355 (52%), Positives = 244/355 (68%), Gaps = 15/355 (4%)

Query: 3   TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
           + ++ LA S   F  F S +   D SI+ Y      S   ++ D+++ ++++W++KHGK 
Sbjct: 6   SKALVLACS---FCLFASLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSKHGKI 57

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
              +     RF+IFKDNL+ IDE N +   Y +GLN+FADL+++E++  YLG + D  RR
Sbjct: 58  YQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRR 117

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
               + + + +  K   ELP+SVDWR+KGAV PVK+QGSCGSCWAFSTVAAVEGIN+IVT
Sbjct: 118 ----RESPEEFTYK-DVELPKSVDWRKKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVT 172

Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
           G L SLSEQEL+DCDR  + GCNGGLMDYAF FI++NGG+  E+DYPY+  E  C+ ++ 
Sbjct: 173 GNLTSLSEQELIDCDRTYSNGCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKE 232

Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
             +VV+I GY DV   +E SL KA+A+Q +SVAIEA GR FQ Y  GVF G CGS LDHG
Sbjct: 233 ETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYSGGVFDGHCGSDLDHG 292

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           V AVGYGT  GVDY +V+NSWGS WGE GY+++ R  L+T  G       ASYP+
Sbjct: 293 VAAVGYGTAKGVDYIIVKNSWGSKWGEKGYIRM-RGTLETR-GNLRYLQMASYPL 345


>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
          Length = 381

 Score =  367 bits (941), Expect = 7e-99,   Method: Compositional matrix adjust.
 Identities = 180/326 (55%), Positives = 233/326 (71%), Gaps = 9/326 (2%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLN 98
           R+D+EV  +Y  W  K+      +  NE R ++FK+NL+F+DEHN+       T+ +G+N
Sbjct: 44  RSDEEVRMLYLEWRVKNHPAEKYLDLNEYRLEVFKENLQFVDEHNAAADRGEHTFLLGMN 103

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           +FADLTNEEYR  +L  R  ++ R   S   S RY  + GD+LP+S+DWRE GAV PVK+
Sbjct: 104 RFADLTNEEYRTRFL--RDFSRLRRSASGKISSRYRLREGDDLPDSIDWRENGAVVPVKN 161

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QG CGSCWAFSTVAAVEGIN+IVTG+LISLSEQ+LVDC    N GC GG M+ AFQFI+ 
Sbjct: 162 QGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCT-TANHGCRGGWMNPAFQFIVN 220

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           NGG++SE+ YPY G    C+ S  NA VVSID YE+V   +E SL+KAVA+QPVSV ++A
Sbjct: 221 NGGINSEETYPYRGQNGICN-STVNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDA 279

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
            GR FQ Y SG+FTG C  + +H +  VGYGTEN  D+W+V+NSWG +WGE+GY++ +RN
Sbjct: 280 AGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIVKNSWGKNWGESGYIRAERN 339

Query: 339 LLDTNTGKCGIAMEASYPVKNSQNSA 364
           + + N GKCGI   ASYPVK   N+A
Sbjct: 340 IENPN-GKCGITRFASYPVKKGANTA 364


>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
           Precursor
 gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
 gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  366 bits (940), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 184/360 (51%), Positives = 239/360 (66%), Gaps = 9/360 (2%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +  L+ +F  S           YD+    S     ++ + T+Y  W + H      +   
Sbjct: 1   MKKLLLIFLFSLVILQTACGFDYDDKEIES-----EEGLSTLYDRWRSHHS-VPRSLNER 54

Query: 70  EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
           EKRF +F+ N+  +   N  NR+YK+ LNKFADLT  E++  Y G+     R L   K  
Sbjct: 55  EKRFNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRG 114

Query: 130 SQR--YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
           S++  Y  +   +LP SVDWR+KGAV  +K+QG CGSCWAFSTVAAVEGINKI T +L+S
Sbjct: 115 SKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVS 174

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           LSEQELVDCD K N GCNGGLM+ AF+FI +NGG+ +E  YPY G + KCD S+ N  +V
Sbjct: 175 LSEQELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLV 234

Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
           +IDG+EDV   DE +L KAVA+QPVSVAI+AG   FQ Y  GVFTG CG+ L+HGV AVG
Sbjct: 235 TIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVG 294

Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPK 367
           YG+E G  YW+VRNSWG++WGE GY+K++R  +D   G+CGIAMEASYP+K S ++  PK
Sbjct: 295 YGSERGKKYWIVRNSWGAEWGEGGYIKIERE-IDEPEGRCGIAMEASYPIKLSSSNPTPK 353


>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
          Length = 379

 Score =  366 bits (940), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 181/326 (55%), Positives = 235/326 (72%), Gaps = 9/326 (2%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLN 98
           R+D+EV  +Y  W AK+      +  NE R ++FK+NL+F+D+HN+       T+++G+N
Sbjct: 42  RSDEEVRMLYLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHNAAADRGEHTFRLGMN 101

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           +FADLTNEEYR  +L  R  ++ R   S   S RY  + GD+LP+S+DWREKGAV PVK+
Sbjct: 102 RFADLTNEEYRTRFL--RDFSRLRRSASGKISSRYRLREGDDLPDSIDWREKGAVVPVKN 159

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QG CGSCWAFSTVAAVEGIN+IVTG+LISLSEQ+LVDC    N GC GG M+ AFQFI+ 
Sbjct: 160 QGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCT-TANHGCRGGWMNPAFQFIVN 218

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           NGG++SE+ YPY G    C+ S  NA VVSID YE+V   +E SL+KAVA+QPVSV ++A
Sbjct: 219 NGGINSEETYPYRGQNGICN-STVNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDA 277

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
            GR FQ Y SG+FTG C  + +H +  VGYGTEN  DY  V+NSWG +WGE+GY++++RN
Sbjct: 278 AGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDYRTVKNSWGKNWGESGYIRVERN 337

Query: 339 LLDTNTGKCGIAMEASYPVKNSQNSA 364
           + + N GKCGI   ASYPVK   N+A
Sbjct: 338 IGNPN-GKCGITRFASYPVKKGTNTA 362


>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
          Length = 707

 Score =  366 bits (939), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 173/314 (55%), Positives = 224/314 (71%), Gaps = 5/314 (1%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
           D+++  +++W++KHGK    M     RF++F++NL  IDE N    +Y +GLN+FADL++
Sbjct: 398 DKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSSYWLGLNEFADLSH 457

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
           EE+++ YLG R++  R    S+  S  +  +   +LPESVDWR+KGAV  VK+QG+CGSC
Sbjct: 458 EEFKSKYLGLRAEFPR----SRDYSGEFRYRDVADLPESVDWRKKGAVTHVKNQGACGSC 513

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFSTVAAVEGIN+IVTG L +LSEQEL+DCD   N+GCNGGLMDYAF FI  NGG+  E
Sbjct: 514 WAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKE 573

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
            DYPYL  E  C+  + +  +V+I GYEDV   DE SL KA+A QP+SVAIEA GR FQ 
Sbjct: 574 DDYPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQF 633

Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           Y  GVF G CG+ LDHGV AVGYG+  G+DY +V+NSWG  WGE GY++++RN   T  G
Sbjct: 634 YSGGVFNGPCGTELDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGYIRMKRNTGKTE-G 692

Query: 346 KCGIAMEASYPVKN 359
            CGI   ASYP K+
Sbjct: 693 LCGINKMASYPTKD 706


>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
 gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  365 bits (937), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 185/342 (54%), Positives = 234/342 (68%), Gaps = 10/342 (2%)

Query: 17  FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIF 76
           FF +S  A D SI+ Y    D +S     D+++ ++++W++KHGK    +     RF+IF
Sbjct: 3   FFANSGLARDFSIVGY-TPEDLTSG----DKIIDLFESWISKHGKIYESIEEKWLRFEIF 57

Query: 77  KDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
           KDNL  IDE N     Y +GLN+F+DL++EE++  YLG + D   R    +  SQ +  K
Sbjct: 58  KDNLFHIDETNKKVVNYWLGLNEFSDLSHEEFKNKYLGLKVDMSER----RECSQEFNYK 113

Query: 137 AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC 196
               +P+SVDWR+KGAV  VK+QGSCGSCWAFSTVAAVEGIN+IVTG L SLSEQELVDC
Sbjct: 114 DVMSIPKSVDWRKKGAVTDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDC 173

Query: 197 DRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVS 256
           D   N GCNGGLMDYAF +II NGG+  E DYPY+  E  C+  +  ++VV+I GY DV 
Sbjct: 174 DTTNNYGCNGGLMDYAFSYIISNGGLHKEVDYPYIMEEGTCEMRKEESEVVTISGYHDVP 233

Query: 257 PFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDY 316
              E SL KA+A+QP+SVAIEA GR FQ Y  GVF G CG+ LDHGV AVGYG+ NG+DY
Sbjct: 234 QNSEESLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGTQLDHGVAAVGYGSTNGLDY 293

Query: 317 WLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
            +V+NSWGS WGE GY++++RN      G CGI   ASYP K
Sbjct: 294 IIVKNSWGSKWGEKGYIRMKRN-TGKPAGLCGINKMASYPTK 334


>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
          Length = 361

 Score =  365 bits (936), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 171/319 (53%), Positives = 224/319 (70%), Gaps = 3/319 (0%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
           +Y+ W + H   S  +   +KRF +FK N+ ++   N  ++ YK+ LNKFAD+TN E+R 
Sbjct: 37  LYERWRSHH-TVSRSLDEKDKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRH 95

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
            Y G++    R  + +  A+  +     D +P +VDWR+KGAV PVKDQG CGSCWAFST
Sbjct: 96  HYAGSKIKHHRTFLGASRANGTFMYAHEDSVPPTVDWRKKGAVTPVKDQGKCGSCWAFST 155

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           V AVEGIN+I T EL+SLSEQELVDCD   N GCNGGLMD AF+FI + GG+++E++YPY
Sbjct: 156 VVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTEENYPY 215

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
           +    +CD  +RN+ VVSIDG+EDV P DE SL KAVA+QPVSVAI+A G  FQ Y  GV
Sbjct: 216 MAEGGECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQFYSEGV 275

Query: 291 FTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           FTG+CG+ LDHGV  VGYGT  +   YW+V+NSWG +WGE GY+++QR  +D   G CGI
Sbjct: 276 FTGDCGTELDHGVAIVGYGTTLDRTKYWIVKNSWGPEWGEKGYIRMQRE-IDAEEGLCGI 334

Query: 350 AMEASYPVKNSQNSAKPKP 368
           AM+ SYP+K S ++    P
Sbjct: 335 AMQPSYPIKTSSSNPTGSP 353


>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
          Length = 355

 Score =  365 bits (936), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 184/353 (52%), Positives = 242/353 (68%), Gaps = 11/353 (3%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           M +AI  L  +F +SS  A DMSIIS+DN H   ++ RTDDEVM++++ WL KH K  N 
Sbjct: 1   MNMAIVLLFMVFAVSS--ALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNA 58

Query: 66  MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
           +G  EKRFQIFK+NLRFIDE NSLNRTYK+GLN FADLTN EYRAMYL T  D  R  + 
Sbjct: 59  LGEKEKRFQIFKNNLRFIDERNSLNRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLD 118

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFSTVAAVEGINKIVTGE 184
           +     RY  + GD +P+SVDWR++GAV PVK+QG +C SCWAF+ V AVE + KI TG+
Sbjct: 119 TP-PRNRYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGD 177

Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           LISLSEQE+VDC    + GC GG + + + +I +N G+  E+DYPY G E KCD +++NA
Sbjct: 178 LISLSEQEVVDCTTSSSRGCGGGDIQHGYIYIRKN-GISLEKDYPYRGDEGKCDSNKKNA 236

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
            +V+IDG+  V    E +LK+ +A+QPV+V I A    FQ+Y SGVF G+CG+ L+H ++
Sbjct: 237 -IVTIDGHGWVPTQLEEALKQGIANQPVAVPIPADDYEFQYYTSGVFKGKCGTELNHALL 295

Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            VGYG E   DYW+ +NS+   WGENGY+++QR L       C       YP+
Sbjct: 296 LVGYGAEKDGDYWIAKNSYSDKWGENGYIRIQRKL-----STCKFGNGGYYPI 343


>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
          Length = 359

 Score =  365 bits (936), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 182/326 (55%), Positives = 235/326 (72%), Gaps = 7/326 (2%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           +++ + ++Y+ W A H   S  +   +KRF +FK+N++FI E N   + TYK+ LNKF D
Sbjct: 33  SEESLWSLYEKWRAHHA-VSRDLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGD 91

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           +TN+E+R+ Y G++ D    L   K A + ++ +   +LP SVDWREKGAV  VKDQG C
Sbjct: 92  MTNQEFRSTYAGSKIDHHMTLRGVKDAGE-FSYEKFHDLPTSVDWREKGAVTGVKDQGQC 150

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           GSCWAFSTV AVEGIN+I T EL+SLSEQ+LVDCD K N+GCNGGLMDYAF FI  NGG+
Sbjct: 151 GSCWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDTK-NSGCNGGLMDYAFDFIKNNGGL 209

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            SE  YPYL  +  C  S  N+ VV+IDGY+DV   +E +L KAVA+QPVSVAIEA G A
Sbjct: 210 SSEDSYPYLAEQKSCG-SEANSAVVTIDGYQDVPRNNEAALMKAVANQPVSVAIEASGYA 268

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y  GVF+G CG+ LDHGV AVGYG  ++G  YW+V+NSWG  WGE+GY++++R + D
Sbjct: 269 FQFYSQGVFSGHCGTELDHGVAAVGYGVDDDGKKYWIVKNSWGEGWGESGYIRMERGIKD 328

Query: 342 TNTGKCGIAMEASYPVKNSQNSAKPK 367
              GKCGIAMEASYP+K+S N  K +
Sbjct: 329 KR-GKCGIAMEASYPIKSSPNPKKAE 353


>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  364 bits (934), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 178/358 (49%), Positives = 242/358 (67%), Gaps = 9/358 (2%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA  S    + T     F+S +   D SI+ Y      S   ++ D+++ ++++W+++HG
Sbjct: 1   MAFFSSKTLVLTCSLCLFLSLAFGRDFSIVGYS-----SEDLKSMDKLIELFESWMSRHG 55

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K    +     RF++FKDNL+ ID+ N +   Y +GLN+FADL+++E++  YLG + D  
Sbjct: 56  KIYETIEEKLLRFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVDLS 115

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           +R   S    + +  +  D LP+SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGIN+I
Sbjct: 116 QRRESSN--EEEFTYRDVD-LPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQI 172

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           VTG L SLSEQEL+DCD   N GCNGGLMDYAF FI QNGG+  E+DYPY+  E+ C+  
Sbjct: 173 VTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQNGGLHKEEDYPYIMEESTCEMK 232

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           +   +VV+I+GY DV   +E SL KA+A+QP+SVAIEA  R FQ Y  GVF G CGS LD
Sbjct: 233 KEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGGVFDGHCGSDLD 292

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           HGV AVGYGT   +DY +V+NSWG+ WGE G+++++R+ +    G CG+   ASYP K
Sbjct: 293 HGVSAVGYGTSKNLDYIIVKNSWGAKWGEKGFIRMKRD-IGKPEGICGLYKMASYPTK 349


>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
          Length = 328

 Score =  363 bits (931), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 169/243 (69%), Positives = 199/243 (81%), Gaps = 1/243 (0%)

Query: 121 RRLMK-SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           RR+ K     S RYA + GD+LPESVDWR++GAV  VKDQ SCGSCWAFS +AAVEGINK
Sbjct: 3   RRMKKFGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINK 62

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           IVTG+LISLSEQELVDCD   N GCNGGLMDYAF+FII NGG+DSE DYPY   + +CD 
Sbjct: 63  IVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQ 122

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
           +R+NAKVV+ID YEDV  +DE++L+KAVA+QP++VA+E GGR FQ YE GV TG CG+AL
Sbjct: 123 NRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVLTGRCGTAL 182

Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           DHGV AVGYGTENG DYW+VRNSWG  WGE GY++L+RNL  +  GKCGIA+E SYP+KN
Sbjct: 183 DHGVAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIKN 242

Query: 360 SQN 362
            QN
Sbjct: 243 GQN 245


>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
 gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
          Length = 484

 Score =  363 bits (931), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 182/331 (54%), Positives = 233/331 (70%), Gaps = 18/331 (5%)

Query: 43  RTDDEVMTIYQTWLAKH------GKTSNGMGHNE----KRFQIFKDNLRFIDEHNSLN-- 90
           RTD+EV  +Y+ W ++H      G T   +G  E    +R ++F+ NLR+ID HN+    
Sbjct: 44  RTDEEVRRLYEEWRSEHDAGPRRGATGGSLGPGEDDDARRLEVFRYNLRYIDAHNAEADA 103

Query: 91  --RTYKVGLNKFADLTNEEYRA-MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDW 147
               +++GL +FADLT EEYRA + LG+R   +       V S+RY   AG++LP++VDW
Sbjct: 104 GLHGFRLGLTRFADLTLEEYRARLLLGSR--GRNGTAVGVVGSRRYLPLAGEQLPDAVDW 161

Query: 148 REKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGG 207
           RE+GAV  VKDQG CG+CWAFS VAAVEGINKIVTG LISLSEQEL+DCD+  + GC+GG
Sbjct: 162 RERGAVAEVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQELIDCDKFQDQGCDGG 221

Query: 208 LMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAV 267
           LMD AF F+I+NGG+D+E DYP+ G +  CD   +N +VVSID +E V    E +L+KAV
Sbjct: 222 LMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVVSIDSFERVPINYERALQKAV 281

Query: 268 ADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDW 327
           A QPVS +IEA  RAFQ Y SG+F G CG+ LDHGV  VGYG+E G DYW+V+NSWG+ W
Sbjct: 282 AHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYGSEGGKDYWIVKNSWGTQW 341

Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           GE GYV++ RN +    GKCGIAME  YPVK
Sbjct: 342 GEAGYVRMARN-VRVRAGKCGIAMEPLYPVK 371


>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
          Length = 365

 Score =  362 bits (930), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 176/324 (54%), Positives = 228/324 (70%), Gaps = 5/324 (1%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNE--KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
           +++ +  +Y+TW + H  +  G+G     +RF +FK+N+R+I E N  +R +++ LNKFA
Sbjct: 32  SEESLRGLYETWRSHHTVSRRGLGAEAEARRFNVFKENVRYIHEANKKDRPFRLALNKFA 91

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE-LPESVDWREKGAVNPVKDQG 160
           D+T +E+R  Y G+R    R L   +         A  E LP +VDWR+KGAV P+KDQG
Sbjct: 92  DMTTDEFRRTYAGSRVRHHRSLSGGRRQGGGSFMYADAENLPAAVDWRQKGAVTPIKDQG 151

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
            CGSCWAFST+ AVEGINKI TG L+SLSEQEL+DC+   N GCNGGLMD AFQFI QNG
Sbjct: 152 QCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGENDGCNGGLMDVAFQFIQQNG 211

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+ +E  YPY G +N CD S+ N+  VSIDGYEDV   DE +L+KAVA+QPVSVAI+A G
Sbjct: 212 GITTEASYPYQGEQNSCDQSKENSHDVSIDGYEDVPANDESALQKAVANQPVSVAIDASG 271

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y  GVFT + G+ LDHGV AVGYG T +G  YW+V+NSWG DWGE GY+++QR +
Sbjct: 272 NDFQFYSEGVFTTDGGTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGV 331

Query: 340 LDTNTGKCGIAMEASYPVKNSQNS 363
                G CGIAMEASYP K++ ++
Sbjct: 332 KQAE-GLCGIAMEASYPTKSAPHA 354


>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
           sativus]
          Length = 235

 Score =  362 bits (930), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 163/234 (69%), Positives = 200/234 (85%)

Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
           + LPE+VDWR+KGAVN +K+QG+CGSCWAFST A VEGINKIVTGELISLSEQELVDCD+
Sbjct: 2   EALPETVDWRQKGAVNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVDCDK 61

Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
             N GCNGGLMDYAFQFI++NGG+++EQDYPY G++ KC+   +N+KVV+IDGYEDV   
Sbjct: 62  SYNQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRGSDGKCNSLLKNSKVVTIDGYEDVPTN 121

Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
           DE +LK+AV+ QPVSVAI+AGGR FQHY+SG+FTGECG+ +DH VVAVGYG+ENGVDYW+
Sbjct: 122 DETALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGECGTKMDHAVVAVGYGSENGVDYWI 181

Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSSA 372
           VRNSWG  WGE+GY++++RNL  + +GKCGIA+EASYPVK S N  +    SS 
Sbjct: 182 VRNSWGQKWGEDGYIRIERNLASSKSGKCGIAIEASYPVKYSPNPIRGNTISSV 235


>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
 gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
 gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 378

 Score =  362 bits (929), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 174/334 (52%), Positives = 232/334 (69%), Gaps = 13/334 (3%)

Query: 44  TDDEVMTIYQTWLAKH--------GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYK 94
           +++ +  +Y+ W +++        G   N  G   +RF +F +N R+I E N    R ++
Sbjct: 34  SEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFR 93

Query: 95  VGLNKFADLTNEEYRAMYLGTRSDAKRRL---MKSKVASQRYACKAGDELPESVDWREKG 151
           + LNKFAD+T +E+R  Y G+R+   R L      +  S RY     D LP +VDWRE+G
Sbjct: 94  LALNKFADMTTDEFRRTYAGSRARHHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWRERG 153

Query: 152 AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDY 211
           AV  +KDQG CGSCWAFSTVAAVEG+NKI TG L++LSEQELVDCD   N GC+GGLMDY
Sbjct: 154 AVTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGLMDY 213

Query: 212 AFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQP 271
           AFQFI +NGG+ +E +YPY   + +C+ ++ ++  V+IDGYEDV   DE +L+KAVA+QP
Sbjct: 214 AFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQP 273

Query: 272 VSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGEN 330
           V+VA+EA G+ FQ Y  GVFTGECG+ LDHGV AVGYG T +G  YW+V+NSWG DWGE 
Sbjct: 274 VAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDWGER 333

Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           GY+++QR +   + G CGIAMEASYPVK+   +A
Sbjct: 334 GYIRMQRGVSSDSNGLCGIAMEASYPVKSGARNA 367


>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
          Length = 369

 Score =  362 bits (929), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 181/350 (51%), Positives = 250/350 (71%), Gaps = 17/350 (4%)

Query: 18  FISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE--KRFQI 75
           ++ S+SA+D +    D + +   S R+      +Y  W  +H ++S  +   E  +RF+I
Sbjct: 18  WVLSASASDFTPGFTDEDLESEKSLRS------LYDNWALQH-RSSRSLDSEEHAERFEI 70

Query: 76  FKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC 135
           FK+N+++ID  N  +  YK+GLNKFADL+NEE++A+Y+GT+ D +      +V S  +  
Sbjct: 71  FKENVKYIDSVNKKDSPYKLGLNKFADLSNEEFKAIYMGTKMDLRG---DREVQSGSFMY 127

Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
           +  + LP S+DWR+KGAV  VK+QG CGSCWAFSTVA+VEGIN I TG L+SLSEQ+LVD
Sbjct: 128 QNSEPLPASIDWRQKGAVAAVKNQGHCGSCWAFSTVASVEGINYITTGNLVSLSEQQLVD 187

Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV--VSIDGYE 253
           C  + N+GCNGGLMD AFQ+II NGG+ +E +YPY     +C  ++ N++   V IDG+E
Sbjct: 188 CSTE-NSGCNGGLMDTAFQYIINNGGIVTEDNYPYTAEATECSSTKINSQTTRVVIDGFE 246

Query: 254 DVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-N 312
           DV   +E +LK+AVA QPVSVAIEA G+ FQ Y +GVFTG+CG+ALDHGVVAVGYGT   
Sbjct: 247 DVPANNEQALKEAVAHQPVSVAIEASGQDFQFYSTGVFTGKCGTALDHGVVAVGYGTSPE 306

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           G++YW+VRNSWG  WGE GY+++Q+  ++   GKCGIAM+ASYP K +Q+
Sbjct: 307 GINYWIVRNSWGPKWGEEGYIRMQQG-IEAAEGKCGIAMQASYPTKKTQD 355


>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
           Precursor
 gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
 gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
 gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 355

 Score =  362 bits (928), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 179/359 (49%), Positives = 240/359 (66%), Gaps = 12/359 (3%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           ++  S+ +AIS    L     + A D SI+ Y   H  ++     D+++ ++++W+++H 
Sbjct: 8   LSKFSLLVAISASALL---CCAFARDFSIVGYTPEHLTNT-----DKLLELFESWMSEHS 59

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K    +     RF++F++NL  ID+ N+   +Y +GLN+FADLT+EE++  YLG    AK
Sbjct: 60  KAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGL---AK 116

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
            +  + +  S  +  +   +LP+SVDWR+KGAV PVKDQG CGSCWAFSTVAAVEGIN+I
Sbjct: 117 PQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQI 176

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
            TG L SLSEQEL+DCD   N+GCNGGLMDYAFQ+II  GG+  E DYPYL  E  C   
Sbjct: 177 TTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQ 236

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           + + + V+I GYEDV   D+ SL KA+A QPVSVAIEA GR FQ Y+ GVF G+CG+ LD
Sbjct: 237 KEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGKCGTDLD 296

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           HGV AVGYG+  G DY +V+NSWG  WGE G+++++RN      G CGI   ASYP K 
Sbjct: 297 HGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRN-TGKPEGLCGINKMASYPTKT 354


>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 355

 Score =  361 bits (927), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 179/359 (49%), Positives = 239/359 (66%), Gaps = 12/359 (3%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           +   S+ +AIS    L    S+ A D SI+ Y      S+     ++++ ++++W+++H 
Sbjct: 8   LTKFSLLVAISASALL---CSALARDFSIVGYTPEQLTST-----EKLLELFESWMSEHS 59

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K    +     RF++F++NL  ID+ N+   +Y +GLN+FADLT+EE++  YLG    AK
Sbjct: 60  KVYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGL---AK 116

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
            +  + +  S  +  +   +LP+SVDWR+KGAV PVKDQG CGSCWAFSTVAAVEGIN+I
Sbjct: 117 PQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQI 176

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
            TG L SLSEQEL+DCD   N+GCNGGLMDYAFQ+II  GG+  E DYPYL  E  C   
Sbjct: 177 TTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQ 236

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           + + + V+I GYEDV   D+ SL KA+A QPVSVAIEA GR FQ Y+ GVF G+CG+ LD
Sbjct: 237 KEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGQCGTDLD 296

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           HGV AVGYG+  G DY +V+NSWG  WGE G+++++RN      G CGI   ASYP K 
Sbjct: 297 HGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRN-TGKPEGLCGINKMASYPTKT 354


>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
           Precursor
 gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 364

 Score =  360 bits (925), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 181/354 (51%), Positives = 243/354 (68%), Gaps = 6/354 (1%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           LV +   S ++A DMS++SYD+N+   S +  D E   I+++W+ KHGK    +   E+R
Sbjct: 12  LVAMVIASCATAIDMSVVSYDDNNRLHSVF--DAEASLIFESWMVKHGKVYGSVAEKERR 69

Query: 73  FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
             IF+DNLRFI+  N+ N +Y++GL  FADL+  EY+ +  G      R  +    +S R
Sbjct: 70  LTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHV-FMTSSDR 128

Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
           Y   A D LP+SVDWR +GAV  VKDQG C SCWAFSTV AVEG+NKIVTGEL++LSEQ+
Sbjct: 129 YKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQD 188

Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS-RRNAKVVSIDG 251
           L++C+++ N GC GG ++ A++FI++NGG+ ++ DYPY      CD   + N K V IDG
Sbjct: 189 LINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMIDG 247

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
           YE++   DE +L KAVA QPV+  I++  R FQ YESGVF G CG+ L+HGVV VGYGTE
Sbjct: 248 YENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYGTE 307

Query: 312 NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
           NG DYWLV+NS G  WGE GY+K+ RN+ +   G CGIAM ASYP+KNS ++ K
Sbjct: 308 NGRDYWLVKNSRGITWGEAGYMKMARNIANPR-GLCGIAMRASYPLKNSFSTDK 360


>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
 gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score =  360 bits (925), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 173/331 (52%), Positives = 231/331 (69%), Gaps = 12/331 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHN--EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
           +++ +  +Y+ W + +  +  G+G +  E+RF +FK+N R++ E N  +R +++ LNKFA
Sbjct: 33  SEESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYVHEGNKRDRPFRLALNKFA 92

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           D+T +E+R  Y G+R      L   +     +     D LP +VDWR+KGAV  +KDQG 
Sbjct: 93  DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKDQGQ 152

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
           CGSCWAFST+ AVEGINKI TG+L+SLSEQEL+DCD   N GC GGLMDYAFQFI +N G
Sbjct: 153 CGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCEGGLMDYAFQFIQKN-G 211

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E +YPY G +  CD ++ NA+ V+IDGYEDV   DE +L+KAVA QPVSVAI+A G+
Sbjct: 212 ITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQ 271

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
            FQ Y  GVFTGEC + LDHGV AVGYG T +G  YW+V+NSWG DWGE GY+++QR + 
Sbjct: 272 DFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVS 331

Query: 341 DTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
            T  G CGIAM+ASYP K++       PH+S
Sbjct: 332 QTE-GLCGIAMQASYPTKSA-------PHAS 354


>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
          Length = 357

 Score =  360 bits (925), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 181/354 (51%), Positives = 243/354 (68%), Gaps = 6/354 (1%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           LV +   S ++A DMS++SYD+N+   S +  D E   I+++W+ KHGK    +   E+R
Sbjct: 5   LVAMVIASCATAIDMSVVSYDDNNRLHSVF--DAEASLIFESWMVKHGKVYGSVAEKERR 62

Query: 73  FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
             IF+DNLRFI+  N+ N +Y++GL  FADL+  EY+ +  G      R  +    +S R
Sbjct: 63  LTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHV-FMTSSDR 121

Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
           Y   A D LP+SVDWR +GAV  VKDQG C SCWAFSTV AVEG+NKIVTGEL++LSEQ+
Sbjct: 122 YKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQD 181

Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS-RRNAKVVSIDG 251
           L++C+++ N GC GG ++ A++FI++NGG+ ++ DYPY      CD   + N K V IDG
Sbjct: 182 LINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMIDG 240

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
           YE++   DE +L KAVA QPV+  I++  R FQ YESGVF G CG+ L+HGVV VGYGTE
Sbjct: 241 YENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYGTE 300

Query: 312 NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
           NG DYWLV+NS G  WGE GY+K+ RN+ +   G CGIAM ASYP+KNS ++ K
Sbjct: 301 NGRDYWLVKNSRGITWGEAGYMKMARNIANPR-GLCGIAMRASYPLKNSFSTDK 353


>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
          Length = 359

 Score =  360 bits (924), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 187/371 (50%), Positives = 241/371 (64%), Gaps = 19/371 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MAT SM LA+  +V L F+  +           N  D +S    ++ +  +Y+ W + H 
Sbjct: 1   MATKSMLLAL--VVALAFVGVARTIPF------NEKDLAS----EESLWGLYERWRSHH- 47

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
             S  +    KRF +FK+N +FI E N  +  YK+GLNKFAD+TN+E+R+ Y G++    
Sbjct: 48  TVSRDLSEKNKRFNVFKENAKFIHEFNKKDAPYKLGLNKFADMTNQEFRSTYAGSKIHHH 107

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           R    +  A+  +  +    +P SVDWR +GAV PVKDQG CGSCWAFST+A+VEGINKI
Sbjct: 108 RTQRGTPRATGSFMYENVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKI 167

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
            T +L+ LS Q+LVDCD   N GCNGGLMDYAF+FI  NGG+ SE  YPY   +  C  S
Sbjct: 168 KTNQLVPLSGQQLVDCDTDQNEGCNGGLMDYAFEFIKSNGGITSESAYPYTAEQGSC-AS 226

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
             +A VV+IDGYEDV   +E +L KAVA+Q VSVAIEA G AFQ Y  GVFTG CG+ LD
Sbjct: 227 ESSAPVVTIDGYEDVPANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCGNELD 286

Query: 301 HGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK- 358
           HGV  VGYG T +G  YW+VRNSWG++WGE GY+++QR  +    G CGIAME SYP+K 
Sbjct: 287 HGVAVVGYGATRDGTKYWIVRNSWGAEWGEKGYIRMQRG-IRARHGLCGIAMEPSYPLKT 345

Query: 359 --NSQNSAKPK 367
             N +N+  PK
Sbjct: 346 SPNPKNNISPK 356


>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 370

 Score =  360 bits (924), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 168/254 (66%), Positives = 203/254 (79%), Gaps = 4/254 (1%)

Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
           R  Y G R   +R      +AS RY  +AGD LP+SVDWREKGAV P+KDQG CGSCWAF
Sbjct: 12  RTTYFGVRGAGRR---TPGLASDRYRYRAGDALPDSVDWREKGAVVPIKDQGGCGSCWAF 68

Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           ST+A+VEGINKIVTG+LISLSEQELVDCD+  N GCNGGLMDYAFQFII NGG+D+E+DY
Sbjct: 69  STIASVEGINKIVTGDLISLSEQELVDCDKTYNDGCNGGLMDYAFQFIIDNGGIDTEKDY 128

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY   + +CD  R+NAKVVSI+ YEDV   DE +LKKA A QP++VAI+ GGR+FQ Y S
Sbjct: 129 PYTEQDGRCDSYRKNAKVVSINSYEDVPVNDEQALKKAAASQPIAVAIDGGGRSFQLYNS 188

Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           G+FTG+CG++LDHGV  VGYG+E+G DYW+VRNSWG  WGE GY+++ RN +D+ +G CG
Sbjct: 189 GIFTGKCGTSLDHGVTVVGYGSESGKDYWIVRNSWGESWGEKGYIRMARN-IDSPSGICG 247

Query: 349 IAMEASYPVKNSQN 362
           IAMEASYP+K  QN
Sbjct: 248 IAMEASYPIKKGQN 261


>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
          Length = 378

 Score =  360 bits (924), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 173/334 (51%), Positives = 231/334 (69%), Gaps = 13/334 (3%)

Query: 44  TDDEVMTIYQTWLAKH--------GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYK 94
           +++ +  +Y+ W +++        G   N  G   +RF +F +N R+I E N    R ++
Sbjct: 34  SEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFR 93

Query: 95  VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA---SQRYACKAGDELPESVDWREKG 151
           + LNKFAD+T +E+R  Y G+R+   R L   +     S RY     D LP +VDWRE+G
Sbjct: 94  LALNKFADMTTDEFRRTYAGSRARHHRSLRGGRGGEGGSFRYGGDDEDNLPPAVDWRERG 153

Query: 152 AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDY 211
           AV  +KDQG CGSCWAFS VAAVEG+NKI TG L++LSEQELVDCD   N GC+GGLMDY
Sbjct: 154 AVTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGLMDY 213

Query: 212 AFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQP 271
           AFQFI +NGG+ +E +YPY   + +C+ ++ ++  V+IDGYEDV   DE +L+KAVA+QP
Sbjct: 214 AFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQP 273

Query: 272 VSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGEN 330
           V+VA+EA G+ FQ Y  GVFTGECG+ LDHGV AVGYG T +G  YW+V+NSWG DWGE 
Sbjct: 274 VAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDWGER 333

Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           GY+++QR +   + G CGIAMEASYPVK+   +A
Sbjct: 334 GYIRMQRGVSSDSNGLCGIAMEASYPVKSGARNA 367


>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 371

 Score =  360 bits (923), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 181/359 (50%), Positives = 243/359 (67%), Gaps = 9/359 (2%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRT-----DDEVMTIYQTWLAKHGKTSNGMG 67
           LV +   S ++A DMS++S +NNH  ++S        D E   I+ +W+ KHGK    + 
Sbjct: 12  LVAMVITSCATAMDMSVVSSNNNHHLTTSPGRLHSGFDAEASLIFDSWMVKHGKVYGSVA 71

Query: 68  HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
             E+R  IF+DNLRFI   N+ N +Y++GL +FADL+  EY  +  G      R  +   
Sbjct: 72  EKERRLTIFEDNLRFISNRNAENLSYRLGLTQFADLSLHEYGEVCHGADPRPPRNHV-FM 130

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
            +S RY   AGD LP+SVDWR +GAV  VKDQG C SCWAFSTV AVEG+NKIVTGEL++
Sbjct: 131 TSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVT 190

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS-RRNAKV 246
           LSEQ+L++C+++ N GC GG ++ A++FI++NGG+ ++ DYPY      CD   + N K 
Sbjct: 191 LSEQDLINCNKE-NNGCGGGKVETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKN 249

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V IDG+E++   DE +L KAVA QPV+  I++  R FQ YESGVF G CG+ L+HGVV V
Sbjct: 250 VMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVV 309

Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
           GYGTENG DYWLV+NS G+ WGE GY+K+ RN+ +   G CGIAM ASYP+KNS ++ K
Sbjct: 310 GYGTENGRDYWLVKNSRGNTWGEAGYMKMARNIANPR-GLCGIAMRASYPLKNSFSTDK 367


>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
          Length = 475

 Score =  359 bits (921), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 177/320 (55%), Positives = 230/320 (71%), Gaps = 8/320 (2%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLN 98
           R+D+EV  IYQ W  KH    N     + R ++FK+NLRF+DEHN+        Y++G+N
Sbjct: 43  RSDEEVRIIYQEWRVKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMN 102

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           +FADLTNEEYRA +L   S   R    S   S +Y  + GD LP+S+DWREKGAV  VK+
Sbjct: 103 RFADLTNEEYRARFLRDLSRLGRS--TSGEISNQYRLREGDVLPDSIDWREKGAVVAVKN 160

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QG CGSCWAF+ +AAVEGIN+IVTG+LISLSEQ+LVDC  + N GC GG    AFQ+II 
Sbjct: 161 QGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQLVDCSTR-NYGCEGGWPYRAFQYIIN 219

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           NGG++SE+ YPY G    C+ ++ NA VVSID Y +V   DE SL+KA A+QP+SV I+A
Sbjct: 220 NGGVNSEEHYPYTGTNGTCNTTKENAHVVSIDSYRNVPSNDEKSLQKAAANQPISVGIDA 279

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
            GR FQ Y SG+FTG C ++L+HGV  VGYGTENG DYW+V+NSWG +WG +GY+ ++RN
Sbjct: 280 SGRNFQLYHSGIFTGSCNTSLNHGVTVVGYGTENGNDYWIVKNSWGENWGNSGYILMERN 339

Query: 339 LLDTNTGKCGIAMEASYPVK 358
           + ++ +GKCGIA+  SYP+K
Sbjct: 340 IAES-SGKCGIAISPSYPIK 358


>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  358 bits (919), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 173/326 (53%), Positives = 229/326 (70%), Gaps = 4/326 (1%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           +++ +  +Y  W + H      +   EKRF +F+ N+  +   N  NR+YK+ LNKFADL
Sbjct: 30  SEEGLSKLYDRWRSHH-SVPRSLHEREKRFNVFRHNVMHVHNSNKKNRSYKLKLNKFADL 88

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQR--YACKAGDELPESVDWREKGAVNPVKDQGS 161
           T  E++  Y G++    R L   K  S++  Y  +   +LP SVDWR+KGAV  +K+QG 
Sbjct: 89  TIHEFKNAYTGSKIKHHRMLQGPKRGSKQFMYDHENVSKLPSSVDWRKKGAVTEIKNQGK 148

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
           CGSCWAFSTVAAVEGINKI T +L+SLSEQELVDCD   N GCNGGLM+ AF+FI +NGG
Sbjct: 149 CGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTNQNEGCNGGLMEIAFEFIKKNGG 208

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E  YPY G + KCD S+ N  +V+IDG+E+V   DE +L KAVA+QPVSVAI+AG  
Sbjct: 209 ITTEDSYPYEGIDGKCDASKDNGVLVTIDGHENVPENDENALLKAVANQPVSVAIDAGSS 268

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
            FQ Y  GVFTG+CG+ L+HGV  VGYG++ G  YW+VRNSWG++WGE GY+K++R  +D
Sbjct: 269 DFQFYSEGVFTGDCGTELNHGVATVGYGSQGGKKYWIVRNSWGTEWGEGGYIKIERG-ID 327

Query: 342 TNTGKCGIAMEASYPVKNSQNSAKPK 367
              G+CGIAMEASYP+K S ++  PK
Sbjct: 328 EPEGRCGIAMEASYPIKLSSSNPTPK 353


>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
 gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  358 bits (919), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 179/342 (52%), Positives = 229/342 (66%), Gaps = 10/342 (2%)

Query: 17  FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIF 76
           FF SS  A D SI+ Y           + D ++ ++++W++KH K    +     RF+IF
Sbjct: 3   FFASSCLARDFSIVGY-----APEDLTSRDRIIDLFESWISKHQKIYESIEEKWHRFEIF 57

Query: 77  KDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACK 136
           KDNL  IDE N     Y +GLN+FADL++EE++  YLG   D   R    +  S+ +  K
Sbjct: 58  KDNLFHIDETNKKVVNYWLGLNEFADLSHEEFKNKYLGLNVDLSNR----RECSEEFTYK 113

Query: 137 AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC 196
               +P+SVDWR+KGAV  VK+QGSCGSCWAFSTVAAVEGIN+IVTG L SLSEQELVDC
Sbjct: 114 DVSSIPKSVDWRKKGAVTDVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDC 173

Query: 197 DRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVS 256
           D   N GCNGGLMDYAF +II NGG+  E+DYPY+  E  C+  +  ++VV+I GY DV 
Sbjct: 174 DTTYNNGCNGGLMDYAFAYIISNGGLHKEEDYPYIMEEGTCEMRKAESEVVTISGYHDVP 233

Query: 257 PFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDY 316
              E SL KA+A+QP+SVAI+A GR FQ Y  GVF G CG+ LDHGV AVGYG+  G+D+
Sbjct: 234 QNSEESLLKALANQPLSVAIDASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGSAKGLDF 293

Query: 317 WLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
            +V+NSWGS WGE G+++++RN      G CGI   ASYP K
Sbjct: 294 IVVKNSWGSKWGEKGFIRMKRN-TGKPAGLCGINKMASYPTK 334


>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
 gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
          Length = 362

 Score =  358 bits (919), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 177/368 (48%), Positives = 240/368 (65%), Gaps = 16/368 (4%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M T  + L + ++  +  +S S           + HD   S  +D+ +  +Y+ W + H 
Sbjct: 1   MTTKKLLLIVLSIALVLVVSESF----------DFHDKDVS--SDESLWDLYERWRSHHT 48

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
            + N +   +KRF +FK N+  +   N +++ YK+ LNKFAD+TN E++  Y G++ +  
Sbjct: 49  VSRN-LNEKQKRFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSKVNHH 107

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           R    +   S  +  +   + P SVDWR+KGAV  VKDQG CGSCWAFSTV AVEGIN+I
Sbjct: 108 RMFRGTPRVSGTFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQI 167

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
            T  L+ LSEQEL+DCD + N GCNGGLM+YAF++I Q GG+ +E  YPY   +  CD +
Sbjct: 168 KTNRLVPLSEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDAT 227

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           + N   VSIDG+E V   DE +L KAVA+QPVSVAI+AGG  FQ Y  GVFTG+CG  L+
Sbjct: 228 KENVPAVSIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELN 287

Query: 301 HGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           HGV  VGYGT  +G +YW+VRNSWG++WGE GY++++RN +    G CGIAMEASYPVKN
Sbjct: 288 HGVAIVGYGTTVDGTNYWIVRNSWGAEWGEQGYIRMKRN-VSNKEGLCGIAMEASYPVKN 346

Query: 360 -SQNSAKP 366
            S+N A P
Sbjct: 347 SSKNPAGP 354


>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
          Length = 365

 Score =  357 bits (917), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 170/330 (51%), Positives = 231/330 (70%), Gaps = 5/330 (1%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHN--EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
           +++ +  +Y+ W + +  +  G+G +  E+RF +FK N R++ E N  +  +++ LNKFA
Sbjct: 33  SEESLRGLYERWRSHYTVSRRGLGADAGERRFNVFKQNARYVHEGNKRDMPFRLALNKFA 92

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           D+T +E+R  Y G+R      L   +     +     D LP +VDWR+KGAV  +KDQG 
Sbjct: 93  DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQGQ 152

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
           CGSCWAFST+ AVEGINKI TG+L+SLSEQEL+DCD   N GC+GGLMDYAFQFI +N G
Sbjct: 153 CGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN-G 211

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E +YPY G +  CD ++ NA+ V+IDGYEDV   DE +L+KAVA QPVSVAI+A G+
Sbjct: 212 ITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQ 271

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
            FQ Y  GVFTGEC + LDHGV AVGYG T +G  YW+V+NSWG DWGE GY+++QR + 
Sbjct: 272 DFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVS 331

Query: 341 DTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
            T  G CGIAM+ASYP K++ +++  +  S
Sbjct: 332 QTE-GLCGIAMQASYPTKSAPHASTVREES 360


>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
          Length = 372

 Score =  357 bits (917), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 170/326 (52%), Positives = 239/326 (73%), Gaps = 9/326 (2%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNE--KRFQIFKDNLRFIDEHNSLNRTYKVGLNKF 100
            +D+ +  +Y  W  +H +++  +  +E  +RF+IFK+N++ ID  N  +  YK+GLNKF
Sbjct: 36  ESDESLRGLYDKWALQH-RSTRSLDSDEHARRFEIFKENVKHIDSVNKKDGPYKLGLNKF 94

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSK-VASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           ADL+NEE++AM++ T+ +  + L   + V S  +  +    LP S+DWR+KGAV PVK+Q
Sbjct: 95  ADLSNEEFKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAVTPVKNQ 154

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
           G CGSCWAFST+A+VEGIN I TG+L+SLSEQ+LVDC ++ NAGCNGGLMD AFQ+II N
Sbjct: 155 GQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDCSKE-NAGCNGGLMDNAFQYIIDN 213

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVS--IDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           GG+ +E +YPY     +C  ++  +K ++  IDG+EDV   +E +LKKAVA QPVS+AIE
Sbjct: 214 GGIVTEDEYPYTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAIE 273

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQ 336
           A G  FQ Y +GVFTG+CG+ LDHGVV VGYG +  G++YW+VRNSWG +WGE GY+++Q
Sbjct: 274 ASGHDFQFYSTGVFTGKCGTELDHGVVVVGYGKSPEGINYWIVRNSWGPEWGEQGYIRMQ 333

Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQN 362
           R  ++   GKCGI+M+ASYP K +Q+
Sbjct: 334 RG-IEATEGKCGISMQASYPTKKTQD 358


>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score =  357 bits (916), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 170/330 (51%), Positives = 231/330 (70%), Gaps = 5/330 (1%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHN--EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
           +++ +  +Y+ W + +  +  G+G +  E+RF +FK N R++ E N  +  +++ LNKFA
Sbjct: 33  SEESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKQNARYVHEGNKRDMPFRLALNKFA 92

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           D+T +E+R  Y G+R      L   +     +     D LP +VDWR+KGAV  +KDQG 
Sbjct: 93  DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQGQ 152

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
           CGSCWAFST+ AVEGINKI TG+L+SLSEQEL+DCD   N GC+GGLMDYAFQFI +N G
Sbjct: 153 CGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN-G 211

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E +YPY G +  CD ++ NA+ V+IDGYEDV   DE +L+KAVA QPVSVAI+A G+
Sbjct: 212 ITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQ 271

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
            FQ Y  GVFTGEC + LDHGV AVGYG T +G  YW+V+NSWG DWGE GY+++QR + 
Sbjct: 272 DFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVS 331

Query: 341 DTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
            T  G CGIAM+ASYP K++ +++  +  S
Sbjct: 332 QTE-GLCGIAMQASYPTKSAPHASTVREES 360


>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
          Length = 484

 Score =  357 bits (915), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 177/333 (53%), Positives = 225/333 (67%), Gaps = 14/333 (4%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           +++ +  +Y+ W  +H   +  +G   +RF +FK N+R I E N  +  YK+ LN+F D+
Sbjct: 148 SEEALWALYERWRGRHA-LARDLGDKARRFNVFKANVRLIHEFNRRDEPYKLRLNRFGDM 206

Query: 104 TNEEYRAMYLGTRSDAKRRLMK-----SKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           T +E+R  Y G+R  A  R+ +     S  ++  +      ++P SVDWR+KGAV  VKD
Sbjct: 207 TADEFRRHYAGSRV-AHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKD 265

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QG CGSCWAFST+AAVEGIN I T  L SLSEQ+LVDCD K NAGCNGGLMDYAFQ+I +
Sbjct: 266 QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAK 325

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           +GG+ +E  YPY   +  C  S   A VV+IDGYEDV   DE +LKKAVA QPVSVAIEA
Sbjct: 326 HGGVAAEDAYPYRARQASCKKS--PAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEA 383

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQR 337
            G  FQ Y  GVF+G CG+ LDHGV AVGYG T +G  YWLV+NSWG +WGE GY+++ R
Sbjct: 384 SGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMAR 443

Query: 338 NLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
           ++     G CGIAMEASYPVK S N   PK H+
Sbjct: 444 DVA-AKEGHCGIAMEASYPVKTSPN---PKVHA 472


>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
 gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  357 bits (915), Expect = 7e-96,   Method: Compositional matrix adjust.
 Identities = 177/353 (50%), Positives = 234/353 (66%), Gaps = 15/353 (4%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
            FLA+S    L F++ S  A  SI+ Y           ++D+++ ++++W+++ G+    
Sbjct: 10  FFLAVS----LSFLAYSGFARDSIVGY-----APEDLTSNDKLIDLFESWISRFGRVYES 60

Query: 66  MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
                +RF+IFKDNL  ID+ N   R Y +GLN+FADL++EE++  YLG + D  +R   
Sbjct: 61  AEEKLERFEIFKDNLFHIDDTNKKVRNYWLGLNEFADLSHEEFKNKYLGLKPDLSKRAQC 120

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
            +  + +        +P+SVDWR+KGAV PVK+QGSCGSCWAFSTVAAVEGIN+IVTG L
Sbjct: 121 PEEFTYKDVA-----IPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNL 175

Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
            SLSEQEL+DCD   N GCNGGLMDYAF +I+ NGG+  E+DYPY+  E  CD  +  + 
Sbjct: 176 TSLSEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGLHKEEDYPYIMEEGTCDMRKEESD 235

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
            V+I GY DV    E SL KA+A+QP+S+AIEA GR FQ Y  GVF G CG+ LDHGV A
Sbjct: 236 AVTISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQFYSGGVFDGHCGTELDHGVAA 295

Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           VGYGT  G+DY +V+NSWG  WGE GY++++R       G CGI   ASYP K
Sbjct: 296 VGYGTSKGLDYIIVKNSWGPKWGEKGYIRMKRK-TSKPEGICGIYKMASYPTK 347


>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase; AltName:
           Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
           RecName: Full=Vignain-1; Contains: RecName:
           Full=Vignain-2; Flags: Precursor
 gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
 gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
          Length = 362

 Score =  356 bits (914), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 169/327 (51%), Positives = 227/327 (69%), Gaps = 3/327 (0%)

Query: 38  HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
           H     +++ +  +Y+ W + H   S  +G   KRF +FK N+  +   N +++ YK+ L
Sbjct: 26  HEKDLESEESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANVMHVHNTNKMDKPYKLKL 84

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           NKFAD+TN E+R+ Y G++ +  +    S+  S  +  +    +P SVDWR+KGAV  VK
Sbjct: 85  NKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVK 144

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQG CGSCWAFST+ AVEGIN+I T +L+SLSEQELVDCD++ N GCNGGLM+ AF+FI 
Sbjct: 145 DQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIK 204

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           Q GG+ +E +YPY   E  CD S+ N   VSIDG+E+V   DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAID 264

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
           AGG  FQ Y  GVFTG+C + L+HGV  VGYGT  +G +YW+VRNSWG +WGE GY+++Q
Sbjct: 265 AGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQ 324

Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
           RN +    G CGIAM ASYP+KNS ++
Sbjct: 325 RN-ISKKEGLCGIAMMASYPIKNSSDN 350


>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
          Length = 362

 Score =  356 bits (913), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 177/368 (48%), Positives = 239/368 (64%), Gaps = 16/368 (4%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M T  + L + ++  +  +S S           + HD   S  +D+ +  +Y+ W + H 
Sbjct: 1   MTTKKLLLIVLSIALVLVVSESF----------DFHDKDVS--SDESLWDLYERWRSHHT 48

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
            + N +   +KRF +FK N+  +   N +++ YK+ LNKFAD+TN E++  Y GT+ +  
Sbjct: 49  VSRN-LNEKQKRFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGTKVNHH 107

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           R    +   S  +  +   + P SVDWR+KGAV  VKDQG CGSCWAFSTV AVEGIN+I
Sbjct: 108 RMFRGTPRVSGTFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQI 167

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
            T  L+ LSEQEL+DCD + N GCNGGLM+YAF++I Q GG+ +E  YPY   +  CD +
Sbjct: 168 KTNRLVPLSEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDAT 227

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           + N   VSIDG+E V   DE +L KAVA+QPVSVAI+AGG  FQ Y  GVFTG+CG  L+
Sbjct: 228 KENVPTVSIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELN 287

Query: 301 HGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           HGV  VGYGT  +G +YW+VRNSWG++WGE G ++++RN +    G CGIAMEASYPVKN
Sbjct: 288 HGVAIVGYGTTVDGTNYWIVRNSWGAEWGEQGCIRMKRN-VSNKEGLCGIAMEASYPVKN 346

Query: 360 -SQNSAKP 366
            S+N A P
Sbjct: 347 SSKNPAGP 354


>gi|445927|prf||1910332A Cys endopeptidase
          Length = 362

 Score =  355 bits (912), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 169/327 (51%), Positives = 227/327 (69%), Gaps = 3/327 (0%)

Query: 38  HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
           H     +++ +  +Y+ W + H   S  +G   KRF +FK N+  +   N +++ YK+ L
Sbjct: 26  HEKDLESEESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANVMHVHNTNKMDKPYKLKL 84

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           NKFAD+TN E+R+ Y G++ +  +    S+  S  +  +    +P SVDWR+KGAV  VK
Sbjct: 85  NKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVK 144

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQG CGSCWAFST+ AVEGIN+I T +L+SLSEQELVDCD++ N GCNGGLM+ AF+FI 
Sbjct: 145 DQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIK 204

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           Q GG+ +E +YPY   E  CD S+ N   VSIDG+E+V   DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYKAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAID 264

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
           AGG  FQ Y  GVFTG+C + L+HGV  VGYGT  +G +YW+VRNSWG +WGE GY+++Q
Sbjct: 265 AGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQ 324

Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
           RN +    G CGIAM ASYP+KNS ++
Sbjct: 325 RN-ISKKEGLCGIAMMASYPIKNSSDN 350


>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
          Length = 466

 Score =  355 bits (912), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 177/320 (55%), Positives = 228/320 (71%), Gaps = 8/320 (2%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLN 98
           R+D+EV  IYQ W AKH    N     + R ++FK+NLRF+DEHN+        Y++G+N
Sbjct: 34  RSDEEVRIIYQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMN 93

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           +FADLTNEEYRA +L   S   R    S   S +Y  + GD LP+S+DWREKGAV  VK 
Sbjct: 94  RFADLTNEEYRARFLRDLSRLGRS--TSGEISNQYRLREGDVLPDSIDWREKGAVVAVKS 151

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QG CGSCWAF+ +A VEGIN+IVTG+LISLSEQ+LVDC  + N GC GG    AFQ+II 
Sbjct: 152 QGRCGSCWAFAAIATVEGINQIVTGDLISLSEQQLVDCSTR-NHGCEGGWPYRAFQYIIN 210

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           NGG++SE+ YPY G    C+ ++ NA VVSID Y +V   DE SL+KAVA+QP+SV I A
Sbjct: 211 NGGVNSEEHYPYTGTNGTCNTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISVGINA 270

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
            GR FQ Y SG+FTG C ++L+HGV  VGYGT NG DYW+V+NSWG  WG++GY+ ++RN
Sbjct: 271 SGRNFQLYHSGIFTGSCNTSLNHGVTVVGYGTVNGNDYWIVKNSWGESWGDSGYILMERN 330

Query: 339 LLDTNTGKCGIAMEASYPVK 358
           + ++ +GKCGIA+  SYP+K
Sbjct: 331 IAES-SGKCGIAISPSYPIK 349


>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
          Length = 364

 Score =  355 bits (911), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 169/320 (52%), Positives = 225/320 (70%), Gaps = 5/320 (1%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHN--EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
           +++ +  +Y+ W + +  +  G+G +  E+RF +FK+N R+I E N  +R +++ LNKFA
Sbjct: 32  SEENLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYIHEGNKKDRPFRLALNKFA 91

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           D+T +E+R  Y G+R      L   +     +     D LP +VDWR+KGAV  +KDQG 
Sbjct: 92  DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGSFRYGDADNLPPAVDWRQKGAVTAIKDQGQ 151

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
           CGSCWAFST+ AVEGINKI TG+L+SLSEQEL+DCD   N GC+GGLMDYAFQFI +N G
Sbjct: 152 CGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIHKN-G 210

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E +YPY G +  CD ++  A  V+IDGYEDV   DE +L+KAVA QPVSVAI+A G 
Sbjct: 211 ITTESNYPYQGEQGSCDLAKEKAHAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGN 270

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
            FQ Y  GVFTGEC + LDHGV AVGYG T +G  YW+V+NSWG DWGE GY+++QR + 
Sbjct: 271 DFQFYSEGVFTGECSTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGVS 330

Query: 341 DTNTGKCGIAMEASYPVKNS 360
               G+CGIAM+ASYP K++
Sbjct: 331 QAE-GQCGIAMQASYPTKSA 349


>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 368

 Score =  355 bits (910), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 171/335 (51%), Positives = 232/335 (69%), Gaps = 12/335 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMG---------HNE-KRFQIFKDNLRFIDEHNSLNRTY 93
           +++ +  +Y+ W +++  + +  G         H+  +RF +FK+N+++I E N  +R +
Sbjct: 30  SEESLRGLYERWRSRYTVSPSTPGSGLRGKLADHDPARRFNVFKENVKYIHEANKKDRPF 89

Query: 94  KVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAV 153
           ++ LNKFAD+T +E R  Y G+R    R L   + A   +     + LP +VDWREKGAV
Sbjct: 90  RLALNKFADMTTDELRHSYAGSRVRHHRALSGGRRAQGNFTYSDAENLPPAVDWREKGAV 149

Query: 154 NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAF 213
             +KDQG CGSCWAFST+AAVE INKI TG+L+SLSEQEL+DCD   + GC+GGLMDYAF
Sbjct: 150 TGIKDQGQCGSCWAFSTIAAVESINKIRTGKLVSLSEQELMDCDNVNDQGCDGGLMDYAF 209

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
           QFI +NGG+ SE +YPY G +N CD ++ N   V+IDGYEDV   DE +L+KAVA QPVS
Sbjct: 210 QFIQKNGGVTSEANYPYQGQQNTCDQAKENTHDVAIDGYEDVPANDESALQKAVAYQPVS 269

Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGY 332
           VAIEA G+ FQ Y  GVFTG+C + LDHGV AVGYGT  +G  YW+V+NSWG DWGE GY
Sbjct: 270 VAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAVGYGTARDGTKYWIVKNSWGLDWGEKGY 329

Query: 333 VKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPK 367
           +++QR +     G CGIAM+ASYP+K + ++   +
Sbjct: 330 IRMQRGVSQAE-GLCGIAMQASYPIKAAPHATTAR 363


>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
 gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
          Length = 362

 Score =  355 bits (910), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 168/324 (51%), Positives = 219/324 (67%), Gaps = 3/324 (0%)

Query: 38  HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
           H     +++ +  +Y+ W + H   S  +    KRF +FK+N+  + + N + + YK+ L
Sbjct: 26  HEKDLESEESLWDLYERWRSHH-TVSTSLDEKHKRFNVFKENVMHVHKTNKMGKPYKLKL 84

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           NKFAD+TN E+R++Y G++    R    +   +  +     +++P SVDWR+KGAV  VK
Sbjct: 85  NKFADMTNHEFRSVYAGSKVKHHRMFRGTTRGNGSFMYGKVEKVPTSVDWRKKGAVTAVK 144

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQG CGSCWAFST+ AVEGIN I T EL+SLSEQELVDCD   N GCNGGLM+YAF+FI 
Sbjct: 145 DQGQCGSCWAFSTIVAVEGINYIKTNELVSLSEQELVDCDTTENQGCNGGLMEYAFEFIK 204

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           +  G+ +E  YPY   +  CD ++ N   VSIDGYE V   DE +L KA A+QPVSVAI+
Sbjct: 205 KKRGITTESTYPYKAEDGHCDAAKENNPAVSIDGYEKVPENDEDALLKAAANQPVSVAID 264

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
           AGG  FQ Y  GVF GECG+ LDHGV  VGYGT  +G  YW+VRNSWG +WGE GY+++Q
Sbjct: 265 AGGSDFQFYSEGVFIGECGTELDHGVAVVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQ 324

Query: 337 RNLLDTNTGKCGIAMEASYPVKNS 360
           R + D   G CGIAMEASYP+KNS
Sbjct: 325 RGISDKE-GLCGIAMEASYPIKNS 347


>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
          Length = 289

 Score =  355 bits (910), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 165/224 (73%), Positives = 193/224 (86%), Gaps = 1/224 (0%)

Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
           D +PESVDWR++GAV  VKDQGSCGSCWAFST+ AVEGINKIVTG+LISLSEQELVDCD 
Sbjct: 1   DAIPESVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDT 60

Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
             N GCNGGLMDYAF+FII+NGG+D+E+DYPY  A+ +CD +R+NAKVV+ID YEDV   
Sbjct: 61  SYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQNRKNAKVVTIDAYEDVPEN 120

Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
           +E +LKKA+A+QP+SVAIEAGGRAFQ Y SGVF G CG+ LDHGVVAVGYGTENG DYW+
Sbjct: 121 NEAALKKALANQPISVAIEAGGRAFQLYSSGVFDGTCGTELDHGVVAVGYGTENGKDYWI 180

Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           VRNSWG  WGE+GY+K+ RN+ +  TGKCGIAMEASYP+K  QN
Sbjct: 181 VRNSWGGSWGESGYIKMARNIAEA-TGKCGIAMEASYPIKKGQN 223


>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  354 bits (909), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 175/355 (49%), Positives = 228/355 (64%), Gaps = 14/355 (3%)

Query: 7   FLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGM 66
           F+ ++ L  L  + ++ + D           H     ++D +  +Y+ W + H   +  +
Sbjct: 4   FIVLA-LCMLMVLETTKSLDF----------HEKDVESEDSLWELYERWKSHH-TIARSL 51

Query: 67  GHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
               KRF +FK N++ I E N    +YK+ LNKF D+T+EE+R  Y G+     R     
Sbjct: 52  EEKAKRFNVFKHNVKHIHETNKKENSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGE 111

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +  ++ +     D LP SVDWR+ GAV PVK+QG CGSCWAFSTV AVEGIN+I T +L 
Sbjct: 112 RQTTKSFMYANVDTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLT 171

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           SLSEQELVDCD   N GCNGGLMD AF+FI + GG+ SE  YPY  ++  CD ++ NA V
Sbjct: 172 SLSEQELVDCDTNKNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPV 231

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           VSIDG+EDV    E+ L KAVA QPVSVAI+AGG  FQ Y  GVFTG CG+ L+HGV  V
Sbjct: 232 VSIDGHEDVPKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVV 291

Query: 307 GYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           GYGT  +G  YW+V+NSWG +WGE GY+++QR +     G CGIAMEASYP+KNS
Sbjct: 292 GYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKE-GLCGIAMEASYPLKNS 345


>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
           Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
           Precursor
 gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
 gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
 gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  354 bits (909), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 171/324 (52%), Positives = 221/324 (68%), Gaps = 3/324 (0%)

Query: 38  HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
           H+    +++ +  +Y+ W + H   +  +    KRF +FK N++ I E N  +++YK+ L
Sbjct: 24  HNKDVESENSLWELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKLKL 82

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           NKF D+T+EE+R  Y G+     R     K A++ +     + LP SVDWR+ GAV PVK
Sbjct: 83  NKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVK 142

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           +QG CGSCWAFSTV AVEGIN+I T +L SLSEQELVDCD   N GCNGGLMD AF+FI 
Sbjct: 143 NQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIK 202

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           + GG+ SE  YPY  ++  CD ++ NA VVSIDG+EDV    E  L KAVA+QPVSVAI+
Sbjct: 203 EKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAID 262

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
           AGG  FQ Y  GVFTG CG+ L+HGV  VGYGT  +G  YW+V+NSWG +WGE GY+++Q
Sbjct: 263 AGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQ 322

Query: 337 RNLLDTNTGKCGIAMEASYPVKNS 360
           R +     G CGIAMEASYP+KNS
Sbjct: 323 RGIRHKE-GLCGIAMEASYPLKNS 345


>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
 gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
          Length = 362

 Score =  354 bits (909), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 176/368 (47%), Positives = 239/368 (64%), Gaps = 16/368 (4%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M T  + L + ++  +  +S S           + HD   S  +D+ +  +Y+ W + H 
Sbjct: 1   MTTKKLLLIVLSIALVLVVSESF----------DFHDKDVS--SDESLWDLYERWRSHHT 48

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
            + N +   +KRF +FK N+  +   N +++ YK+ LNKFAD+TN E++  Y G++ +  
Sbjct: 49  VSRN-LNEKQKRFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSKVNHH 107

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           R    +   S  +  +   + P SVDWR+KGAV  VKDQG CGSCWAFSTV AVEGIN+I
Sbjct: 108 RMFRGTPRVSGTFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQI 167

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
            T  L+ LSEQEL+DCD + N GCNGGLM+YAF++I Q GG+ +E  YPY   +  CD +
Sbjct: 168 KTNRLVPLSEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDAT 227

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           + N   VSIDG+E V   DE +L KAVA+QPVSVAI+AGG  FQ Y  GVFTG+CG  L+
Sbjct: 228 KENVPTVSIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELN 287

Query: 301 HGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           HGV  VGYGT  +G +YW+VRNSWG++WGE G ++++RN +    G CGIAMEASYPVKN
Sbjct: 288 HGVAIVGYGTTVDGTNYWIVRNSWGAEWGEQGCIRMKRN-VSNKEGLCGIAMEASYPVKN 346

Query: 360 -SQNSAKP 366
            S+N A P
Sbjct: 347 SSKNPAGP 354


>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
 gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
          Length = 376

 Score =  353 bits (907), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 178/332 (53%), Positives = 224/332 (67%), Gaps = 13/332 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           +++ +  +Y+ W  +H    + +G   +RF +FK N+R I E N  +  YK+ LN+F D+
Sbjct: 41  SEEALWALYERWRGRHALARD-LGDKARRFNVFKANVRLIHEFNRRDEPYKLRLNRFGDM 99

Query: 104 TNEEYRAMYLGTRSDAKRRLMKS----KVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           T +E+R  Y G+R  A  R+ +       AS  +      ++P SVDWR+KGAV  VKDQ
Sbjct: 100 TADEFRRHYAGSRV-AHHRMFRGDRQGSSASASFMYADARDVPASVDWRQKGAVTDVKDQ 158

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
           G CGSCWAFST+AAVEGIN I T  L SLSEQ+LVDCD K NAGCNGGLMDYAFQ+I ++
Sbjct: 159 GQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKH 218

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
           GG+ +E  YPY   +  C  S   A VV+IDGYEDV   DE +LKKAVA QPVSVAIEA 
Sbjct: 219 GGVAAEDAYPYRARQASCKKS--PAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEAS 276

Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRN 338
           G  FQ Y  GVF+G CG+ LDHGV AVGYG T +G  YWLV+NSWG +WGE GY+++ R+
Sbjct: 277 GSHFQFYSEGVFSGRCGTELDHGVTAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARD 336

Query: 339 LLDTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
           +     G CGIAMEASYPVK S N   PK H+
Sbjct: 337 VA-AKEGHCGIAMEASYPVKTSPN---PKVHA 364


>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
 gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
           Precursor
 gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
 gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
          Length = 360

 Score =  353 bits (907), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 170/314 (54%), Positives = 216/314 (68%), Gaps = 3/314 (0%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
           +Y+ W + H   S  +   +KRF +FK N   +   N +++ YK+ LNKFAD+TN E+R 
Sbjct: 37  LYERWRSHH-TVSRSLHEKQKRFNVFKHNAMHVHNANKMDKPYKLKLNKFADMTNHEFRN 95

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
            Y G++    R        +  +  +  D +P SVDWR+KGAV  VKDQG CGSCWAFST
Sbjct: 96  TYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFST 155

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           + AVEGIN+I T +L+SLSEQELVDCD   N GCNGGLMDYAF+FI Q GG+ +E +YPY
Sbjct: 156 IVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPY 215

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
              +  CD S+ NA  VSIDG+E+V   DE +L KAVA+QPVSVAI+AGG  FQ Y  GV
Sbjct: 216 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 275

Query: 291 FTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           FTG CG+ LDHGV  VGYGT  +G  YW V+NSWG +WGE GY++++R + D   G CGI
Sbjct: 276 FTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKE-GLCGI 334

Query: 350 AMEASYPVKNSQNS 363
           AMEASYP+K S N+
Sbjct: 335 AMEASYPIKKSSNN 348


>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
 gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
 gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
          Length = 362

 Score =  353 bits (906), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 180/360 (50%), Positives = 237/360 (65%), Gaps = 13/360 (3%)

Query: 15  FLFFISSSSAADMSIISYDNNHD-HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRF 73
           FL+ + S S     ++   N+ D H     +++ +  +Y+ W + H   S  +G   KRF
Sbjct: 6   FLWVVLSLSL----VLGVANSFDFHDKDLESEESLWDLYERWRSHH-TVSRSLGDKHKRF 60

Query: 74  QIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY 133
            +FK N+  +   N +++ YK+ LNKFAD+TN E+R+ Y G++ +  R        +  +
Sbjct: 61  NVFKANMMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRDMPRGNGTF 120

Query: 134 ACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQEL 193
             +    +P SVDWR+KGAV  VKDQG CGSCWAFSTV AVEGIN+I T +L+SLSEQEL
Sbjct: 121 MYEKVGSVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSEQEL 180

Query: 194 VDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYE 253
           VDCD + NAGCNGGLM+ AFQFI Q GG+ +E  YPY   +  CD S+ N   VSIDG+E
Sbjct: 181 VDCDTEENAGCNGGLMESAFQFIKQKGGITTESYYPYTAQDGTCDASKANDLAVSIDGHE 240

Query: 254 DVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TEN 312
           +V   DE +L KAVA+QPVSVAI+AGG  FQ Y  GVFTG+C + L+HGV  VGYG T +
Sbjct: 241 NVPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGATVD 300

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN-----SAKPK 367
           G  YW+VRNSWG +WGE GY+++QRN +    G CGIAM ASYP+KNS N     S+ PK
Sbjct: 301 GTSYWIVRNSWGPEWGELGYIRMQRN-ISKKEGLCGIAMLASYPIKNSSNNPTGPSSSPK 359


>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 371

 Score =  353 bits (905), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 180/364 (49%), Positives = 244/364 (67%), Gaps = 10/364 (2%)

Query: 9   AISTLVFLFFISS-SSAADMSIISYDNNHDHSS-----SWRTDDEVMTIYQTWLAKHGKT 62
           A+  L+    ISS ++A DMSI+S ++NH  ++         D E   ++++W+ KHGK 
Sbjct: 7   AMLVLLLAMVISSCATAMDMSIVSSNDNHHVTNGPGRRQGVFDAEATLMFESWMVKHGKV 66

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
              +   E+R  IF+DNLRFI   N+ N +Y++GLN+FADL+  EY  +  G      R 
Sbjct: 67  YESVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYAQICHGADPRPPRN 126

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
            +    +S RY    GD LP+SVDWR +GAV  VKDQG C SCWAFSTV AVEG+NKIVT
Sbjct: 127 HV-FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRSCWAFSTVGAVEGLNKIVT 185

Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC-DPSR 241
           GEL++LSEQ+L++C+++ N GC GG ++ A++FI+ NGG+ ++ DYPY      C D  +
Sbjct: 186 GELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCNDRLK 244

Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
            N K V IDGYE++   DE +L KAVA QPV+  +++  R FQ Y SGVF G CG+ L+H
Sbjct: 245 ENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASGVFDGTCGTNLNH 304

Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           GVV VGYGTENG DYW+VRNS G+ WGE GY+K+ RN+ +   G CGIAM ASYP+KNS 
Sbjct: 305 GVVVVGYGTENGRDYWIVRNSRGNTWGEAGYMKMARNIANPR-GLCGIAMRASYPLKNSF 363

Query: 362 NSAK 365
           ++ K
Sbjct: 364 STDK 367


>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
 gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
 gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
          Length = 362

 Score =  353 bits (905), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 172/336 (51%), Positives = 226/336 (67%), Gaps = 8/336 (2%)

Query: 38  HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
           H     +++    +Y+ W + H   S  +G   KRF +FK N+  +   N +++ YK+ L
Sbjct: 26  HDKDLASEESFWDLYERWRSHH-TVSRSLGDKHKRFNVFKANVMHVHNTNKMDKPYKLKL 84

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           NKFAD+TN E+R+ Y G++ +  R    +   +  +  +    +P SVDWR+ GAV  VK
Sbjct: 85  NKFADMTNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSVDWRKNGAVTGVK 144

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQG CGSCWAFSTV AVEGIN+I T +L+SLSEQELVDCD K NAGCNGGLM+ AF+FI 
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIK 204

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           Q GG+ +E +YPY   +  CD S+ N   VSIDG+E+V   DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAID 264

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
           AGG  FQ Y  GVFTG+C + L+HGV  VGYGT  +G +YW VRNSWG +WGE GY+++Q
Sbjct: 265 AGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQ 324

Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQN-----SAKPK 367
           R+ +    G CGIAM ASYP+KNS N     S+ PK
Sbjct: 325 RS-ISKKEGLCGIAMMASYPIKNSSNNPTGPSSSPK 359


>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
 gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
          Length = 446

 Score =  352 bits (903), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 170/324 (52%), Positives = 233/324 (71%), Gaps = 8/324 (2%)

Query: 44  TDDEVMTIYQTWLAKHGK---TSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNK 99
           +D ++   Y +W AK GK   +SN +G  ++RF+ FK+N R+I+EHN   + +Y++GLN+
Sbjct: 5   SDSDLSGEYASWCAKFGKECASSNSLG--DRRFETFKENFRYIEEHNRAGKHSYRLGLNQ 62

Query: 100 FADLTNEEYRAMYLGTRSD-AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           F+DLT+EE+R  +LG R D     ++K    S         +LP SVDWR+ GAV   KD
Sbjct: 63  FSDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRKHGAVTAPKD 122

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QGSCG CWAF+T  A+EGIN+IVTG+L+SLSEQEL+DCD+K + GC+GGLM+ A+QFI++
Sbjct: 123 QGSCGGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKKADKGCDGGLMENAYQFIVE 182

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           NGG+D+E DYPY  +E+ C+  + N++VV+IDGYE +   DE +L +AVA QPVSVAIE 
Sbjct: 183 NGGLDTETDYPYHASESHCNMKKLNSRVVAIDGYEAIPDGDEQALLRAVAKQPVSVAIEG 242

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
             + FQHY SGVFTG CG  ++HGV+ VGYGTE+G+DYW+V+NSW + WG+ G+VK+QRN
Sbjct: 243 ASKDFQHYASGVFTGHCGEEINHGVLIVGYGTEDGLDYWIVKNSWAATWGDGGFVKMQRN 302

Query: 339 LLDTNTGKCGIAMEASYPVKNSQN 362
                 G C I   ASYPVK+  N
Sbjct: 303 -TGKRGGLCSINTLASYPVKSGGN 325


>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase EP-C1; Flags: Precursor
 gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
          Length = 362

 Score =  351 bits (901), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 168/327 (51%), Positives = 226/327 (69%), Gaps = 3/327 (0%)

Query: 38  HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
           H     +++ +  +Y+ W + H   S  +G   KRF +FK NL  +   N +++ YK+ L
Sbjct: 26  HDKDLASEESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLKL 84

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           NKFAD+TN E+R+ Y G++ +  R    +   +  +  +    +P SVDWR+KGAV  VK
Sbjct: 85  NKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVK 144

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQG CGSCWAFSTV AVEGIN+I T +L++LSEQELVDCD++ N GCNGGLM+ AF+FI 
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIK 204

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           Q GG+ +E +YPY   E  CD S+ N   VSIDG+E+V   DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAID 264

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
           AGG  FQ Y  GVFTG+C + L+HGV  VGYGT  +G +YW+VRNSWG +WGE+GY+++Q
Sbjct: 265 AGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQ 324

Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
           RN +    G CGIAM  SYP+KNS ++
Sbjct: 325 RN-ISKKEGLCGIAMLPSYPIKNSSDN 350


>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
          Length = 361

 Score =  351 bits (901), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 168/327 (51%), Positives = 226/327 (69%), Gaps = 3/327 (0%)

Query: 38  HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
           H     +++ +  +Y+ W + H   S  +G   KRF +FK NL  +   N +++ YK+ L
Sbjct: 25  HDKDLASEESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLKL 83

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           NKFAD+TN E+R+ Y G++ +  R    +   +  +  +    +P SVDWR+KGAV  VK
Sbjct: 84  NKFADMTNHEFRSTYAGSKVNHHRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVK 143

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQG CGSCWAFSTV AVEGIN+I T +L++LSEQELVDCD++ N GCNGGLM+ AF+FI 
Sbjct: 144 DQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIK 203

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           Q GG+ +E +YPY   E  CD S+ N   VSIDG+E+V   DE +L KAVA+QPVSVAI+
Sbjct: 204 QKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAID 263

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
           AGG  FQ Y  GVFTG+C + L+HGV  VGYGT  +G +YW+VRNSWG +WGE+GY+++Q
Sbjct: 264 AGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQ 323

Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
           RN +    G CGIAM  SYP+KNS ++
Sbjct: 324 RN-ISKKEGLCGIAMLPSYPIKNSSDN 349


>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
 gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
           Precursor
 gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
 gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
 gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
          Length = 356

 Score =  350 bits (899), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 171/333 (51%), Positives = 229/333 (68%), Gaps = 8/333 (2%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           D SI+ Y           + D+++ +++ W++   K    +     RF++FKDNL+ IDE
Sbjct: 30  DYSIVGYS-----PEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDE 84

Query: 86  HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
            N   ++Y +GLN+FADL++EE++ MYLG ++D  RR  +   A   +A +  + +P+SV
Sbjct: 85  TNKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYA--EFAYRDVEAVPKSV 142

Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
           DWR+KGAV  VK+QGSCGSCWAFSTVAAVEGINKIVTG L +LSEQEL+DCD   N GCN
Sbjct: 143 DWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCN 202

Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
           GGLMDYAF++I++NGG+  E+DYPY   E  C+  +  ++ V+I+G++DV   DE SL K
Sbjct: 203 GGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLK 262

Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGS 325
           A+A QP+SVAI+A GR FQ Y  GVF G CG  LDHGV AVGYG+  G DY +V+NSWG 
Sbjct: 263 ALAHQPLSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGP 322

Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
            WGE GY++L+RN      G CGI   AS+P K
Sbjct: 323 KWGEKGYIRLKRN-TGKPEGLCGINKMASFPTK 354


>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 357

 Score =  350 bits (898), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 173/328 (52%), Positives = 228/328 (69%), Gaps = 19/328 (5%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFAD 102
           ++D + ++Y+ W + H   S  +   +KRF +FK+N++FI E N + + T+K+ LNKF D
Sbjct: 30  SEDSLWSLYERWRSHHA-VSRDLDQKQKRFNVFKENVKFIHEFNKNKDVTFKLALNKFGD 88

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL-------PESVDWREKGAVNP 155
           +TN+E+RA Y G++    R +  S     R+   +G +        P S+DWRE+GAV  
Sbjct: 89  MTNQEFRAKYAGSKVHHHRTMKGS-----RHGSGSGAKFMYENAVAPPSIDWRERGAVAA 143

Query: 156 VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQF 215
           VK+QG CGSCWAFS +AAVEGIN+IVT EL+ LSEQEL+DCD   N GC+GGLMDYAF+F
Sbjct: 144 VKNQGQCGSCWAFSAIAAVEGINQIVTKELVPLSEQELIDCDTDQNQGCSGGLMDYAFEF 203

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
           I  NGG+ +E  YPY   +  C   ++N+  V IDGYEDV   DE +L KAVA+QPV+VA
Sbjct: 204 IKNNGGITTEDVYPYQAEDATC---KKNSPAVVIDGYEDVPTNDEDALMKAVANQPVAVA 260

Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVK 334
           IEA G  FQ Y  GVFTG CG+ LDHGV  VGYG T++G  YW VRNSWG+DWGE+GYV+
Sbjct: 261 IEASGYVFQFYSEGVFTGRCGTELDHGVAVVGYGTTQDGTKYWTVRNSWGADWGESGYVR 320

Query: 335 LQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           +QR +  T+ G CGIAM+ASYP+K S N
Sbjct: 321 MQRGIKATH-GLCGIAMQASYPIKTSLN 347


>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
          Length = 352

 Score =  350 bits (897), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 176/360 (48%), Positives = 240/360 (66%), Gaps = 14/360 (3%)

Query: 4   ASMFLAISTLVFLFFIS----SSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKH 59
           A +F +  T +FL F+S    S+ A + SI+ Y           +  +V+ ++++WLAKH
Sbjct: 2   AFIFSSKKTSLFLVFVSVLACSALANEFSILGYA-----PEDLTSIHKVIHLFESWLAKH 56

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
            K    +     RF+IF DNL+ ID+ N     Y +GLN+FADLT+EE++  +LG + + 
Sbjct: 57  SKIYESLDEKLHRFEIFMDNLKHIDDTNKKVSNYWLGLNEFADLTHEEFKNKFLGLKGEL 116

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
             R  +S    + ++ +   +LP+SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGIN+
Sbjct: 117 PERKDES---IEEFSYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQ 173

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           IVTG L  LSEQEL+DCD   N GCNGGLMDYAF +++++G +  E++YPY+ +E  CD 
Sbjct: 174 IVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSG-LHKEEEYPYIMSEGTCDE 232

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
            +  ++ V+I GY DV   +E S  KA+A+QP+SVAIEA GR FQ Y  GVF G CG+ L
Sbjct: 233 KKDVSETVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTEL 292

Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           DHGV AVGYGT  G+DY +VRNSWG  WGE GY++++R     + G CG+ M ASYP K 
Sbjct: 293 DHGVAAVGYGTTKGLDYVIVRNSWGPKWGEKGYIRMKRKTGKPH-GMCGLYMMASYPTKQ 351


>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
          Length = 362

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 166/327 (50%), Positives = 226/327 (69%), Gaps = 3/327 (0%)

Query: 38  HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
           H     +++ +  +Y+ W + H   S  +    KRF +FK+N+  +   N +++ YK+ L
Sbjct: 26  HEKDLASEESLWDLYERWRSHH-TVSRSLTEKHKRFNVFKENVMHVHNTNKMDKPYKLKL 84

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           NKFAD+TN E+R+ Y G++ +  +    ++  +  +  +    +P SVDWR+KGAV  VK
Sbjct: 85  NKFADMTNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVGSVPASVDWRKKGAVTDVK 144

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQG CGSCWAFSTV AVEGIN+I T +L+SLSEQELVDCD++ N GCNGGLM+ AF+FI 
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIK 204

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           Q GG+ +E +YPY   E  CD S+ N   VSIDG+E+V   DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYTAQEGTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAID 264

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
           AGG  FQ Y  GV TG+C + L+HGV  VGYGT  +G +YW+VRNSWG +WGE GY+++Q
Sbjct: 265 AGGSDFQFYSEGVLTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQ 324

Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
           RN +    G CGIAM ASYP+KNS ++
Sbjct: 325 RN-ISKKEGLCGIAMMASYPIKNSSDN 350


>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 175/352 (49%), Positives = 235/352 (66%), Gaps = 13/352 (3%)

Query: 11  STLVFLF---FISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           ++L+FLF      S+ A + SI+ Y           +  +V+ ++++WL KH K    + 
Sbjct: 10  TSLLFLFVSILACSALAHEFSILGYA-----PEDLTSIHKVIHLFESWLVKHSKFYESLD 64

Query: 68  HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
               RF+IF DNL+ IDE N     Y +GLN+FADLT+EE++  +LG + +   R  +S 
Sbjct: 65  EKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAERKDES- 123

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
             S+ +  +   +LP+SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGIN+IVTG L  
Sbjct: 124 --SKEFGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTM 181

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           LSEQEL+DCD   N GCNGGLMDYAF +++++G +  E++YPY+ +E  CD  +  ++ V
Sbjct: 182 LSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSG-LHKEEEYPYIMSEGTCDEKKDVSEKV 240

Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
           +I GY DV   DE S  KA+A+QP+SVAIEA GR FQ Y  GVF G CG+ LDHGV AVG
Sbjct: 241 TISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVG 300

Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           YGT  G+DY +VRNSWG  WGE GY++++R     + G CG+ M ASYP K 
Sbjct: 301 YGTTKGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPH-GMCGLYMMASYPTKQ 351


>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
           Precursor
 gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 371

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 175/359 (48%), Positives = 242/359 (67%), Gaps = 9/359 (2%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRT-----DDEVMTIYQTWLAKHGKTSNGMG 67
           L+ L   S ++A DMS++S ++NH  ++         D E   ++++W+ KHGK  + + 
Sbjct: 12  LLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDSVA 71

Query: 68  HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
             E+R  IF+DNLRFI   N+ N +Y++GLN+FADL+  EY  +  G      R  +   
Sbjct: 72  EKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV-FM 130

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
            +S RY    GD LP+SVDWR +GAV  VKDQG C SCWAFSTV AVEG+NKIVTGEL++
Sbjct: 131 TSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGELVT 190

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS-RRNAKV 246
           LSEQ+L++C+++ N GC GG ++ A++FI+ NGG+ ++ DYPY      C+   + + K 
Sbjct: 191 LSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDNKN 249

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V IDGYE++   DE +L KAVA QPV+  +++  R FQ YESGVF G CG+ L+HGVV V
Sbjct: 250 VMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVVVV 309

Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
           GYGTENG DYW+V+NS G  WGE GY+K+ RN+ +   G CGIAM ASYP+KNS ++ K
Sbjct: 310 GYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPR-GLCGIAMRASYPLKNSFSTDK 367


>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
          Length = 344

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 165/277 (59%), Positives = 210/277 (75%), Gaps = 16/277 (5%)

Query: 27  MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH 86
           MSI+SY          R+++E   +Y  W+A HG+T N +G  E+RF++F+DNLR++D H
Sbjct: 29  MSIVSYGE--------RSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAH 80

Query: 87  NSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
           N+       ++++GLN+FADLTN+EYRA YLG RS    R  + +    RY     ++LP
Sbjct: 81  NAAADAGVHSFRLGLNRFADLTNDEYRATYLGVRS----RPQRERRLGDRYLAGDNEDLP 136

Query: 143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA 202
           ESVDWR KGAV  VKDQGSCGSCWAFST+AAVEGIN+IVTG++ISLSEQELVDCD   N 
Sbjct: 137 ESVDWRAKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQ 196

Query: 203 GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMS 262
           GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NAKVV+ID YEDV    E S
Sbjct: 197 GCNGGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKS 256

Query: 263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
           L+KAVA+QP+SVAIEAGGRAFQ Y SG+FTG CG+++
Sbjct: 257 LQKAVANQPISVAIEAGGRAFQLYNSGIFTGTCGNSV 293


>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
          Length = 300

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 168/304 (55%), Positives = 216/304 (71%), Gaps = 5/304 (1%)

Query: 56  LAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGT 115
           ++KHGK+         RF++F+DNL+ IDE N    +Y +GLN+FADL++EE++  YLG 
Sbjct: 1   MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGL 60

Query: 116 RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVE 175
           + +    L K + + + ++ K   +LP+SVDWR+KGAV  VK+QG+CGSCWAFSTVAAVE
Sbjct: 61  KIE----LPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVE 116

Query: 176 GINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
           GIN+IVTG L +LSEQEL+DCD+  N GCNGGLMDYAF FII NGG+  E+DYPY+  E 
Sbjct: 117 GINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEG 176

Query: 236 KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGEC 295
            C   +   +VV+I GY DV   +E S  KA+A+QP+SVAIEA  R FQ Y  G+F G C
Sbjct: 177 TCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHC 236

Query: 296 GSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
           G+ LDHGV AVGYGT  GVDY  V+NSWGS WGE GY++++RN +    G CGI   ASY
Sbjct: 237 GTELDHGVAAVGYGTSKGVDYITVKNSWGSKWGEKGYIRMKRN-VGKPEGICGIYKMASY 295

Query: 356 PVKN 359
           P KN
Sbjct: 296 PTKN 299


>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  349 bits (895), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 177/352 (50%), Positives = 238/352 (67%), Gaps = 11/352 (3%)

Query: 15  FLFFISSSSAADMSIISYDNNHDHS------SSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
            L F  + SAA +S+ S   +HD+S          + D+++ +++ W++   K    +  
Sbjct: 9   ILCFPLALSAATLSL-SVAASHDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEE 67

Query: 69  NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
              RF++FKDNL+ IDE N   ++Y +GLN+FADL++EE++ MYLG ++D  RR  +   
Sbjct: 68  KLLRFEVFKDNLKHIDETNKKVKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSY 127

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
           A   +A +  + +P+SVDWR+KGAV  VK+QGSCGSCWAFSTVAAVEGINKIVTG L +L
Sbjct: 128 A--EFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTL 185

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           SEQEL+DCD   N GCNGGLMDYAF++I++NGG+  E+DYPY   E  C+  +  ++ V+
Sbjct: 186 SEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVT 245

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES-GVFTGECGSALDHGVVAVG 307
           IDG++DV   DE SL KA+A QP+SVAI+A GR FQ Y    VF G CG  LDHGV AVG
Sbjct: 246 IDGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYSGVSVFDGRCGVDLDHGVAAVG 305

Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           YG+  G DY +V+NSWG  WGE GY++L+RN      G CGI   AS+P K 
Sbjct: 306 YGSSKGSDYIIVKNSWGPKWGEKGYIRLKRN-TGKPEGLCGINKMASFPTKT 356


>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
 gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
          Length = 425

 Score =  349 bits (895), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 169/324 (52%), Positives = 231/324 (71%), Gaps = 8/324 (2%)

Query: 44  TDDEVMTIYQTWLAKHGK---TSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNK 99
           +D ++   Y +W AK GK   +SN +G  + RF+ FK+N R+I+EHN   + +Y++GLN+
Sbjct: 5   SDSDLSGEYASWCAKFGKECASSNSLG--DHRFETFKENFRYIEEHNRAGKHSYRLGLNQ 62

Query: 100 FADLTNEEYRAMYLGTRSD-AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           F+DLT+EE+R  +LG R D     ++K    S         +LP SVDWR+ GAV   KD
Sbjct: 63  FSDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRQHGAVTAPKD 122

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QGSCG CWAF+T  A+EGIN+IVTG+L+SLSEQEL+DCD+K + GC+GGLM+ A+QFI++
Sbjct: 123 QGSCGGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKKADKGCDGGLMENAYQFIVE 182

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           NGG+D+E DYPY  +E+ C+  + N++VV+IDGY+ +   DE +L  AVA QPVSVAIE 
Sbjct: 183 NGGLDTETDYPYHASESHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVAKQPVSVAIEG 242

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
             + FQHY SGVFTG CG  ++HGV+ VGYGTE+G+DYW+V+NSW + WG+ G+VK+QRN
Sbjct: 243 ASKDFQHYASGVFTGHCGEEINHGVLIVGYGTEDGLDYWIVKNSWAATWGDGGFVKMQRN 302

Query: 339 LLDTNTGKCGIAMEASYPVKNSQN 362
                 G C I   ASYPVK+  N
Sbjct: 303 -TGKRGGLCSINTLASYPVKSGGN 325


>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
 gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
 gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
 gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
          Length = 437

 Score =  348 bits (894), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 168/315 (53%), Positives = 221/315 (70%), Gaps = 6/315 (1%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLT 104
           D++  ++  W  KHGKT       ++R QIFKDN  F+ +HN + N TY + LN FADLT
Sbjct: 26  DDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLT 85

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
           + E++A  LG    A   +M SK  S   + K    +P+SVDWR+KGAV  VKDQGSCG+
Sbjct: 86  HHEFKASRLGLSVSAPSVIMASKGQSLGGSVK----VPDSVDWRKKGAVTNVKDQGSCGA 141

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
           CW+FS   A+EGIN+IVTG+LISLSEQEL+DCD+  NAGCNGGLMDYAF+F+I+N G+D+
Sbjct: 142 CWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDT 201

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           E+DYPY   +  C   +   KVV+ID Y  V   DE +L +AVA QPVSV I    RAFQ
Sbjct: 202 EKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQ 261

Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            Y SG+F+G C ++LDH V+ VGYG++NGVDYW+V+NSWG  WG +G++ +QRN  +++ 
Sbjct: 262 LYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSD- 320

Query: 345 GKCGIAMEASYPVKN 359
           G CGI M ASYP+K 
Sbjct: 321 GVCGINMLASYPIKT 335


>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score =  348 bits (894), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 172/366 (46%), Positives = 239/366 (65%), Gaps = 15/366 (4%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M    + L   +LV +F ++ S   D   ++            +++ +  +Y+ W + H 
Sbjct: 1   MKMEKVILVALSLVLVFGLAESFDFDEKDLA------------SEESLWDLYERWRSYH- 47

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
             S  +    KRF +FK+N + + + N +++ YK+ LNKFAD+TN E+R+ Y G++    
Sbjct: 48  TVSRDLEEKNKRFNVFKENTKHVHKVNQMDKPYKLKLNKFADMTNHEFRSSYGGSKVKHY 107

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           R L   +  +  +  +    LP SVDWR+KGAV  +KDQG CGSCWAFSTV  VEGIN+I
Sbjct: 108 RMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQI 167

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
            T EL+SLSEQ+L+DCDR  + GCNGGLM+ AF+FI +NGG+ +E +YPY   + +CD  
Sbjct: 168 KTKELLSLSEQQLIDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDML 227

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           + NA VV+IDG+E V   DE +L KAVA QPVSVAI+AGG   Q Y  GVF GECG+ LD
Sbjct: 228 KMNAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELD 287

Query: 301 HGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           HGV  VGYGT  +G  YW+V+NSWG++WGE GY+++ R  +    G+CGIAMEASYPVK+
Sbjct: 288 HGVAIVGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMARG-IQAAEGQCGIAMEASYPVKS 346

Query: 360 SQNSAK 365
           S N+ +
Sbjct: 347 SNNTRR 352


>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
 gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
          Length = 397

 Score =  348 bits (894), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 179/356 (50%), Positives = 241/356 (67%), Gaps = 25/356 (7%)

Query: 31  SYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG--MGHNEKRF--QIFKDNLRFIDEH 86
           SY      + + R D+EV  +Y+ W +KHG+      M  +E R   ++F+DNLR+ID H
Sbjct: 33  SYTTTIVPAPAERADEEVRRMYEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAH 92

Query: 87  NSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAK----RRLMKSKVAS-------- 130
           N+       T+++GL  FADLT EEYR   LG R+  +     R   S+V S        
Sbjct: 93  NAEADAGLHTFRLGLTPFADLTLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHR 152

Query: 131 -QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
             R   + GD LP+++DWR+ GAV  VK+Q  CG CWAFS VAA+EGIN IVTG L+SLS
Sbjct: 153 RPRPRPRCGD-LPDAIDWRQLGAVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLS 211

Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRN-AKVVS 248
           EQE++DCD + ++GCNGG M+ AFQF+I NGG+DSE DYP++  +  CD ++ N  KV +
Sbjct: 212 EQEIIDCDTQ-DSGCNGGQMENAFQFVIDNGGIDSEADYPFIATDGTCDANKANDEKVAA 270

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
           IDG+ +V+  +E +L++AVA QPVSVAI+AGGRAFQHY SG+F G CG+ LDHGV  VGY
Sbjct: 271 IDGFVEVASNNETALQEAVAIQPVSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGY 330

Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           G+ENG  YW+V+NSW   WGE GY++++RN+     GKCGIAM+ASYPVK++   A
Sbjct: 331 GSENGKAYWIVKNSWSDSWGEAGYIRIRRNVF-LPVGKCGIAMDASYPVKDTYGPA 385


>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
          Length = 344

 Score =  348 bits (893), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 174/316 (55%), Positives = 221/316 (69%), Gaps = 7/316 (2%)

Query: 46  DEVMTIYQTWLAKHGKT--SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           DE    ++ W+++HG+        H  KRF +FK+N+  I+E N   +T+K+ +N+FADL
Sbjct: 31  DEDSMRHEEWMSQHGRVYADEQEDHKNKRFNVFKENVERIEEFND-GKTFKLAINQFADL 89

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           TNEE+RA Y G +         +K    RY       LP SVDWR+KGAV PVK+QG CG
Sbjct: 90  TNEEFRASYNGFKGPMVLSSQITKPTPFRYE-NVSSALPVSVDWRKKGAVTPVKNQGQCG 148

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
            CWAFS VAA+EGI +I TG+LISLSEQELVDCD K I+ GC GGLMD AF+FII NGG+
Sbjct: 149 CCWAFSAVAAIEGITQISTGKLISLSEQELVDCDTKGIDHGCEGGLMDTAFEFIINNGGL 208

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E +YPY G +  C+ ++ N   VSI GYEDV   DE +L KAVA QPVSVAIEAGG  
Sbjct: 209 TTESNYPYKGEDGTCNFNKTNPIAVSITGYEDVPANDEQALMKAVAHQPVSVAIEAGGSD 268

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y SGVFTGECG+ LDH V AVGYG +E+G  YW+V+NSWG+ WGE+GY+++Q++ + 
Sbjct: 269 FQFYSSGVFTGECGTELDHAVTAVGYGESEDGSKYWIVKNSWGTKWGESGYIEMQKD-IK 327

Query: 342 TNTGKCGIAMEASYPV 357
              G CGIAM+ASYP 
Sbjct: 328 VKQGLCGIAMQASYPT 343


>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
 gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
          Length = 371

 Score =  348 bits (892), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 172/326 (52%), Positives = 228/326 (69%), Gaps = 15/326 (4%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNE-----KRFQIFKDNLRFIDEHNSLNRTYKVGLN 98
           +++ +  +Y+ W + H   S   G  E     + F +FK+N+R+I E N   R++++ LN
Sbjct: 34  SEESLRALYEQWRS-HYMVSRPAGLQEQDDKARWFNVFKENVRYIHEANKKGRSFRLALN 92

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKV-----ASQRYACKAGDELPESVDWREKGAV 153
           KFAD+T +E+R  Y         R + S +      S  YA +AG+ LP +VDWR++GAV
Sbjct: 93  KFADMTTDEFRRAYAAGSRTRHHRALSSGIRRHGDGSFMYA-QAGN-LPLAVDWRQRGAV 150

Query: 154 NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAF 213
             +KDQG CGSCWAFST+AAVEGINKI TG+L+SLSEQELVDCD   N GCNGGLMDYAF
Sbjct: 151 TGIKDQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDVDNQGCNGGLMDYAF 210

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
           Q+I +NGG+ +E +YPYL  +  C+ ++  +  V+IDGYEDV   +E +L+KAVA+QPVS
Sbjct: 211 QYIKRNGGITTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVANQPVS 270

Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGY 332
           +AIEA G+ FQ Y  GVFTG CG+ LDHGV AVGYG T +G  YW+V+NSWG DWGE GY
Sbjct: 271 IAIEASGQDFQFYSEGVFTGSCGTELDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERGY 330

Query: 333 VKLQRNLLDTNTGKCGIAMEASYPVK 358
           +++QR + D+  G CGIAME SYP K
Sbjct: 331 IRMQRGISDSQ-GLCGIAMEPSYPTK 355


>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
 gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
          Length = 360

 Score =  348 bits (892), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 166/324 (51%), Positives = 218/324 (67%), Gaps = 3/324 (0%)

Query: 38  HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
           H     +++ +  +Y+ W + H   S  +    KRF +F+ N+  +   N +++ YK+ L
Sbjct: 24  HEKDLESEESLWDLYEKWRSHH-TVSTSLDEKRKRFNVFRANVLHVHNTNKMDKPYKLKL 82

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           NKFAD+TN E+R  Y  ++         + + +  +     D++P S+DWR+KGAV PVK
Sbjct: 83  NKFADMTNHEFRTAYASSKVKHHTMFRGAPLGNGSFMYGNIDKVPASIDWRKKGAVTPVK 142

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQG CGSCWAFST+ AVEGIN I T +LISLSEQELVDC+   N GCNGGLMDYAF+FI 
Sbjct: 143 DQGKCGSCWAFSTIVAVEGINFIKTNKLISLSEQELVDCNTGENHGCNGGLMDYAFEFIT 202

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           +  G+ +E +YPY   +  CD ++ N   VSIDG+EDV   +E +L KAVA+QPVSVAI+
Sbjct: 203 KQKGITTEANYPYRAQDGHCDANKANQPAVSIDGHEDVLHNNENALLKAVANQPVSVAID 262

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
           AGG  FQ Y  GVFTGECG  LDHGV  VGYGT  +G  YW+VRNSWG +WGE GY+++Q
Sbjct: 263 AGGSDFQFYSEGVFTGECGKELDHGVAIVGYGTTVDGTKYWIVRNSWGPEWGERGYIRMQ 322

Query: 337 RNLLDTNTGKCGIAMEASYPVKNS 360
           R + D   G CGIAMEASYP+K S
Sbjct: 323 RGISD-RRGLCGIAMEASYPIKKS 345


>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
          Length = 435

 Score =  348 bits (892), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 172/337 (51%), Positives = 237/337 (70%), Gaps = 21/337 (6%)

Query: 43  RTDDEVMTIYQTWLAKHGK--TSN-----GMGHNEK------RFQIFKDNLRFIDEHNSL 89
           R D+EV  +Y+ W +KHG+  +SN       G +E+      R ++F+DNLR+ID+HN+ 
Sbjct: 75  RADEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAE 134

Query: 90  N----RTYKVGLNKFADLTNEEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPES 144
                 T+++GL  FADLT +EYR   LG      +           R   + GD LP++
Sbjct: 135 ADAGLHTFRLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDA 194

Query: 145 VDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGC 204
           +DWR+ GAV  VKDQ  CG CWAFS VAA+EGIN I TG L+SLSEQE++DCD + ++GC
Sbjct: 195 IDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ-DSGC 253

Query: 205 NGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRN-AKVVSIDGYEDVSPFDEMSL 263
           +GG M+ AF+F+I NGG+D+E DYP++G +  CD S+ N  KV +IDG  +V+  +E +L
Sbjct: 254 DGGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKENNEKVATIDGLVEVASNNETAL 313

Query: 264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSW 323
           ++AVA QPVSVAI+A GRAFQHY SG+F G CG++LDHGV AVGYG+E+G DYW+V+NSW
Sbjct: 314 QEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGSESGKDYWIVKNSW 373

Query: 324 GSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
            + WGE GY++++RN +   TGKCGIAM+ASYPVK++
Sbjct: 374 SASWGEAGYIRMRRN-VPRPTGKCGIAMDASYPVKDT 409


>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
 gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
          Length = 398

 Score =  348 bits (892), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 173/337 (51%), Positives = 240/337 (71%), Gaps = 25/337 (7%)

Query: 43  RTDDEVMTIYQTWLAKHGK--TSN-----GMGHNEK-------RFQIFKDNLRFIDEHNS 88
           R D+EV  +Y+ W +KHG+  +SN       G +E+       R ++F+DNLR+ID HN+
Sbjct: 45  RADEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFRDNLRYIDAHNA 104

Query: 89  LN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPES 144
                  T+++GL  FADLT EEYR   LG R+  +            Y+ + GD LP++
Sbjct: 105 EADAGLHTFRLGLTPFADLTLEEYRGRVLGFRARGR---RSGARYGSGYSVRGGD-LPDA 160

Query: 145 VDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGC 204
           +DWR+ GAV  VKDQ  CG CWAFS VAA+EG+N I TG L+SLSEQE++DCD + ++GC
Sbjct: 161 IDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGC 219

Query: 205 NGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR-RNAKVVSIDGYEDVSPFDEMSL 263
           +GG M+ AF+F+I NGG+D+E DYP++G +  CD S+ +N KV +IDG  +V+  +E +L
Sbjct: 220 DGGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKEKNEKVATIDGLVEVASNNETAL 279

Query: 264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSW 323
           ++AVA QPVSVAI+A GRAFQHY SG+F G CG++LDHGV AVGYG+E+G DYW+V+NSW
Sbjct: 280 QEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGSESGKDYWIVKNSW 339

Query: 324 GSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
            + WGE GY++++RN +   TGKCGIAM+ASYPVK++
Sbjct: 340 SASWGEAGYIRMRRN-VPRPTGKCGIAMDASYPVKDT 375


>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 356

 Score =  347 bits (891), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 183/364 (50%), Positives = 239/364 (65%), Gaps = 14/364 (3%)

Query: 1   MATASMFLAISTLVFLFFISSSSA--ADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAK 58
           MA+    + +S  + L  + +  A  +D SI+ Y +  D SS+ R    ++ +++ WLAK
Sbjct: 1   MASPQHLMKLSGALLLLCVGACVARNSDFSIVGY-SEEDLSSNER----LVELFEKWLAK 55

Query: 59  HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
           H K          RF++FKDNL+ ID+ N    +Y +GLN+FADLT++E++A YLG  + 
Sbjct: 56  HQKAYASFEEKLHRFEVFKDNLKHIDKINREVTSYWLGLNEFADLTHDEFKAAYLGLDAA 115

Query: 119 AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
             RR       S RY   +  +LP+SVDWR+KGAV  VK+QG CGSCWAFSTVAAVEGIN
Sbjct: 116 PARR---GSSRSFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGIN 172

Query: 179 KIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC- 237
            IVTG L +LSEQEL+DC    N+GCNGGLMDYAF +I  +GG+ +E+ YPYL  E  C 
Sbjct: 173 AIVTGNLTALSEQELIDCSVDGNSGCNGGLMDYAFSYIASSGGLHTEEAYPYLMEEGSCG 232

Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
           D  +  ++ V+I GYEDV   DE +L KA+A QPVSVAIEA GR FQ Y  GVF G CG+
Sbjct: 233 DGKKAESEAVTISGYEDVPANDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGA 292

Query: 298 ALDHGVVAVGYGTENGV--DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
            LDHGV AVGYG++ G   DY +VRNSWG+ WGE GY++++R       G CGI   ASY
Sbjct: 293 QLDHGVAAVGYGSDKGKGHDYIIVRNSWGAQWGEKGYIRMKRG-TSNGEGLCGINKMASY 351

Query: 356 PVKN 359
           P K+
Sbjct: 352 PTKD 355


>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  347 bits (891), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 174/352 (49%), Positives = 234/352 (66%), Gaps = 13/352 (3%)

Query: 11  STLVFLF---FISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           ++L+FLF      S  A + SI+ Y           +  +V+ ++++WL KH K    + 
Sbjct: 10  TSLLFLFVSILACSPLAHEFSILGYA-----PEDLTSIHKVIHLFESWLVKHSKFYESLD 64

Query: 68  HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
               RF+IF DNL+ IDE N     Y +GLN+FADLT+EE++  +LG + +   R  +S 
Sbjct: 65  EKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAERKDES- 123

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
             S+ +  +   +LP+SVDWR+KGAV PVK+QG CG+CWAFSTVAAVEGIN+IVTG L  
Sbjct: 124 --SKEFGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGNCWAFSTVAAVEGINQIVTGNLTM 181

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           LSEQEL+DCD   N GCNGGLMDYAF +++++G +  E++YPY+ +E  CD  +  ++ V
Sbjct: 182 LSEQELIDCDTTFNNGCNGGLMDYAFAYVMRSG-LHKEEEYPYIMSEGTCDEKKDVSEKV 240

Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
           +I GY DV   DE S  KA+A+QP+SVAIEA GR FQ Y  GVF G CG+ LDHGV AVG
Sbjct: 241 TISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVG 300

Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           YGT  G+DY +VRNSWG  WGE GY++++R     + G CG+ M ASYP K 
Sbjct: 301 YGTTKGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPH-GMCGLYMMASYPTKQ 351


>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
          Length = 357

 Score =  347 bits (890), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 171/361 (47%), Positives = 238/361 (65%), Gaps = 15/361 (4%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           + L   +LV +F ++ S   D   ++            +++ +  +Y+ W + H   S  
Sbjct: 4   VILVALSLVLVFGLAESFDFDEKDLA------------SEESLWDLYERWRSYH-TVSRD 50

Query: 66  MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
           +    KRF +FK+N + + + N +++ YK+ LNKFAD+TN E+R+ Y G++    R L  
Sbjct: 51  LEEKNKRFNVFKENTKHVHKVNQMDKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRG 110

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
            +  +  +  +    LP SVDWR+KGAV  +KDQG CGSCWAFSTV  VEGIN+I T EL
Sbjct: 111 DRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKEL 170

Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           +SLSEQ+L+DCDR  + GCNGGLM+ AF+FI +NGG+ +E +YPY   + +CD  + NA 
Sbjct: 171 LSLSEQQLIDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAP 230

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
           VV+IDG+E V   DE +L KAVA QPVSVAI+AGG   Q Y  GVF GECG+ LDHGV  
Sbjct: 231 VVTIDGHESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAI 290

Query: 306 VGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           VGYGT  +G  YW+V+NSWG++WGE GY+++ R  +    G+CGIAMEASYPVK+S N+ 
Sbjct: 291 VGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMARG-IQAAEGQCGIAMEASYPVKSSNNTR 349

Query: 365 K 365
           +
Sbjct: 350 R 350


>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
 gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
          Length = 493

 Score =  347 bits (889), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 178/340 (52%), Positives = 229/340 (67%), Gaps = 27/340 (7%)

Query: 43  RTDDEVMTIYQTWLAKH------GKTSNGMGHNE-------------KRFQIFKDNLRFI 83
           RTD+EV  +Y+ W ++H      G T   +G  +             +R ++F+DNLR+I
Sbjct: 44  RTDEEVRRLYEEWRSEHDAGPRRGATGGSLGPGDADAGAGAGEDDDARRLEVFRDNLRYI 103

Query: 84  DEHNSLN----RTYKVGLNKFADLTNEEYRA-MYLGTRSDAKRRLMKSKVASQRYACKAG 138
           D HN+        +++GL +FADLT EEYRA + LG+R   +       V  +RY   AG
Sbjct: 104 DAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSR--GRNGTAVGVVGRRRYLPLAG 161

Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
           ++LP++VDWRE+GAV  VKDQG CG CWAFS VAAVEGINKIVTG LISLSEQEL+DCD+
Sbjct: 162 EQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSLISLSEQELIDCDK 221

Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
             + GC+GGLMD AF F+I+NGG+D+E DYP+ G +  CD   +N +VVSID +E V   
Sbjct: 222 FQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVVSIDSFERVPIN 281

Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
            E +L+KAVA QPVS +IEA  RAFQ Y SG+F G CG+ LDHGV  VGYG+E G DYW+
Sbjct: 282 YERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYGSEGGKDYWI 341

Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           V+NSWG+ WGE GYV++ RN +       GIAME  YPVK
Sbjct: 342 VKNSWGTQWGEAGYVRMARN-VRVRPPSAGIAMEPLYPVK 380


>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
 gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
          Length = 358

 Score =  347 bits (889), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 178/368 (48%), Positives = 242/368 (65%), Gaps = 18/368 (4%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M T  + LA+ ++V +F ++ S         +D   +  +S   ++ +  +Y+ W + H 
Sbjct: 1   MDTRKVILAVFSVVLVFRLADS---------FDYTEEDLAS---EERLRDLYERWRSHH- 47

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
             S  +   ++RF +FK+NL+ I + N  +R YK+ LN FAD+TN E+   Y G++  + 
Sbjct: 48  TVSRSLAEKQERFNVFKENLKHIHKVNHKDRPYKLKLNSFADMTNHEFLQHYGGSKV-SH 106

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
            R+++ +        +   +LP SVDWR+ GAV  +KDQG CGSCWAFSTVAAVEGINKI
Sbjct: 107 YRVLRGQRQGTGSMHEDTSKLPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGINKI 166

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
            TGELISLSEQELVDCD   N GCNGGLM+ AF FI Q GG+ SE  YPY   E  CD +
Sbjct: 167 KTGELISLSEQELVDCDSD-NHGCNGGLMEDAFNFIKQIGGLTSENTYPYRAKEEPCDSN 225

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           + N+ VV+IDGYE V   DE +L KAVA+QPV++A++AGG+  Q Y   +FTG+CG+ L+
Sbjct: 226 KMNSPVVNIDGYEMVPENDENALMKAVANQPVAIAMDAGGKDLQFYSEAIFTGDCGTELN 285

Query: 301 HGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK- 358
           HGV  VGYG T++G  YW+V+NSWG+DWGE GY+++QR  +D   G CGI MEASYPVK 
Sbjct: 286 HGVALVGYGTTQDGTKYWIVKNSWGTDWGEKGYIRMQRG-IDAEEGLCGITMEASYPVKL 344

Query: 359 NSQNSAKP 366
            S N   P
Sbjct: 345 RSDNKKAP 352


>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  347 bits (889), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 167/315 (53%), Positives = 220/315 (69%), Gaps = 6/315 (1%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLT 104
           D++  ++  W  KHGKT       ++R QIFKDN  F+ +HN + N TY + LN FADLT
Sbjct: 26  DDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLT 85

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
           + E++A  LG    A   +M SK  S   + K    +P+SVDWR+KGAV  VKDQGSCG+
Sbjct: 86  HHEFKASRLGLSVSAPSVIMASKGQSLGGSVK----VPDSVDWRKKGAVTNVKDQGSCGA 141

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
           CW+FS   A+EGIN+IVTG+LISLSEQEL+DCD+  NAGCNGGLMDYAF+F+I+N G+D+
Sbjct: 142 CWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDT 201

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           E+DYPY   +  C   +   KVV+ID Y  V   DE +L +AVA QPVSV I    RAFQ
Sbjct: 202 EKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQ 261

Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            Y  G+F+G C ++LDH V+ VGYG++NGVDYW+V+NSWG  WG +G++ +QRN  +++ 
Sbjct: 262 LYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSD- 320

Query: 345 GKCGIAMEASYPVKN 359
           G CGI M ASYP+K 
Sbjct: 321 GVCGINMLASYPIKT 335


>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
          Length = 359

 Score =  347 bits (889), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 181/361 (50%), Positives = 235/361 (65%), Gaps = 18/361 (4%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
            S ++   F++S +A  + I   D          T+D +  +Y+ W + H   S  +   
Sbjct: 4   FSLILVASFLASVAATAIDIADKD--------LETEDSLWNLYERWRSHH-TVSRDLDEK 54

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK- 127
           +KRF +FK+N R+I + N   +  YK+ LNKFADLTN E+R+ Y G+R +  R L  S+ 
Sbjct: 55  QKRFNVFKENPRYIHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAGSRINHHRSLRGSRR 114

Query: 128 ---VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
                S  Y       LP S+DWR+KGAV  VKDQG CGSCWAFSTVAAVEGIN+I T +
Sbjct: 115 GGATNSFMYQSLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKK 174

Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           L+SLSEQEL+DCD   N GCNGGLMDYAF FI +NGG+ SE +YPY   ++ C  + + +
Sbjct: 175 LLSLSEQELIDCDTDENNGCNGGLMDYAFDFIKKNGGISSEAEYPYAAEDSYC-ATEKKS 233

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
            VVSIDG+EDV   DE SL KAVA+QPVS+AIEA G  FQ Y  GVFTG  G+ LDHGV 
Sbjct: 234 HVVSIDGHEDVPANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGVFTGRSGTELDHGVA 293

Query: 305 AVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNS 363
            VGYG T+ G  YW+VRNSWG++WGE GY+++  +    +   CG+AMEASYP+K S N 
Sbjct: 294 IVGYGKTQQGTKYWIVRNSWGAEWGEKGYIRI--SAASDSKRLCGLAMEASYPIKTSPNP 351

Query: 364 A 364
           +
Sbjct: 352 S 352


>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
          Length = 361

 Score =  346 bits (888), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 175/358 (48%), Positives = 239/358 (66%), Gaps = 10/358 (2%)

Query: 16  LFFISSSSAADMSII-SYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQ 74
           +FF++ S A  + +  S++ N     S   ++ +  +Y+ W + H   S  +     RF 
Sbjct: 6   VFFVALSFALVLRVAESFEFNEKDLES---EEGLWDLYERWRSHH-TVSRSLDEKHNRFN 61

Query: 75  IFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYA 134
           +FK N+  +   N +++ YK+ LN+FAD+TN E+R++Y G++ +  R    +   +  + 
Sbjct: 62  VFKGNVMHVHSSNKMDKPYKLKLNRFADMTNHEFRSIYAGSKVNHHRMFRGTPRGNGTFM 121

Query: 135 CKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELV 194
            +  D +P SVDWR+KGAV  VKDQG CGSCWAFST+ AVEGIN+I T +L+ LSEQELV
Sbjct: 122 YQNVDRVPSSVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTHKLVPLSEQELV 181

Query: 195 DCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYED 254
           DCD   N GCNGGLM+ AF+FI Q  G+ +  +YPY   +  CD S+ N   VSIDG+E+
Sbjct: 182 DCDTTQNQGCNGGLMESAFEFIKQY-GITTASNYPYEAKDGTCDASKVNEPAVSIDGHEN 240

Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENG 313
           V   +E +L KAVA QPVSVAIEAGG  FQ Y  GVFTG CG+ALDHGV  VGYG T++G
Sbjct: 241 VPVNNEAALLKAVAHQPVSVAIEAGGIDFQFYSEGVFTGNCGTALDHGVAIVGYGTTQDG 300

Query: 314 VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
             YW V+NSWGS+WGE GY++++R+ +    G CGIAMEASYP+K S  S+KP+ HSS
Sbjct: 301 TKYWTVKNSWGSEWGEKGYIRMKRS-ISVKKGLCGIAMEASYPIKKS--SSKPREHSS 355


>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
          Length = 362

 Score =  346 bits (888), Expect = 9e-93,   Method: Compositional matrix adjust.
 Identities = 167/327 (51%), Positives = 222/327 (67%), Gaps = 3/327 (0%)

Query: 38  HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
           H     +++    +Y+ W + +   S  +G   KRF +FK N+  +   N +++ YK+ L
Sbjct: 26  HDKDLASEESFWDLYERWRS-YRTVSRSLGDKHKRFNVFKANVMHVHNTNKMDKPYKLKL 84

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           NKFAD+TN E+R+ Y G++ +  R    +   +  +  +    +P S DWR+ GAV  VK
Sbjct: 85  NKFADMTNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSADWRKNGAVTGVK 144

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQG CGSCWAFSTV AVEGIN+I T +L+SLSEQELVDCD K NAGCNGGLM+ AF+FI 
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIK 204

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           Q GG+ +E +YPY   +  CD S+ N   VSIDG+E+V   DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAID 264

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
           AGG  FQ Y  GVFTG+C + L+HGV  VGYGT  +G +YW VRNSWG +WGE GY+++Q
Sbjct: 265 AGGFDFQFYFEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQ 324

Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
           R++     G CGIAM ASYP+KNS N+
Sbjct: 325 RSIFKKE-GLCGIAMMASYPIKNSSNN 350


>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
          Length = 890

 Score =  346 bits (888), Expect = 9e-93,   Method: Compositional matrix adjust.
 Identities = 163/308 (52%), Positives = 224/308 (72%), Gaps = 8/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++GK        EKRF+IFK+N+ +I+  +N+ N+ YK+ +N+FADLTNEE+  
Sbjct: 586 HEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEF-- 643

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  R+  K  +  S + +  +  +    +P +VDWR+KGAV P+KDQG CG CWAFS 
Sbjct: 644 --IAPRNRFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 701

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI+ + +G+LISLSEQELVDCD K ++ GC GGLMD AF+F+IQN G+++E +YP
Sbjct: 702 VAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYP 761

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G + KC+ +     VV+I GYEDV   +E +L+KAVA+QPVSVAI+A G  FQ Y+SG
Sbjct: 762 YKGVDGKCNANEAANDVVTITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSG 821

Query: 290 VFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG CG+ LDHGV AVGYG  N G +YWLV+NSWG++WGE GY+++QR  +D+  G CG
Sbjct: 822 VFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRG-VDSEEGLCG 880

Query: 349 IAMEASYP 356
           IAM+ASYP
Sbjct: 881 IAMQASYP 888


>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
 gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
          Length = 371

 Score =  346 bits (888), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 173/315 (54%), Positives = 215/315 (68%), Gaps = 5/315 (1%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
           D ++ +++ W+AK+ K          RF++FKDNL  IDE N    TY +GLN FADLT+
Sbjct: 60  DRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVTTYWLGLNAFADLTH 119

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
           +E++A YLG R    ++   S+    RY   A D++P SVDWR+KGAV  VK+QG CGSC
Sbjct: 120 DEFKATYLGLRQPETKKTTDSRF---RYGGVADDDVPASVDWRKKGAVTDVKNQGQCGSC 176

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFSTVAAVEGIN+IVTG L SLSEQELVDC    N GCNGG+MD AF +I  +GG+ +E
Sbjct: 177 WAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAFSYIASSGGLRTE 236

Query: 226 QDYPYLGAENKCDPSRRNA-KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           + YPYL  E  CD   R+  +VV+I GYEDV   DE +L KA+A QP+SVAIEA GR FQ
Sbjct: 237 EAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPLSVAIEASGRHFQ 296

Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            Y  GVF G CGS LDHGV AVGYG+  G DY +V+NSWGS WGE GY++++R       
Sbjct: 297 FYSGGVFNGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGSHWGEKGYIRMKRG-TGKPE 355

Query: 345 GKCGIAMEASYPVKN 359
           G CGI   ASYP K+
Sbjct: 356 GLCGINKMASYPTKD 370


>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
 gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
          Length = 372

 Score =  346 bits (887), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 171/321 (53%), Positives = 216/321 (67%), Gaps = 6/321 (1%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           +++ +  +Y+ W  +H   +  +G   +RF +FK+N+R I + N  +  YK+ LN+F D+
Sbjct: 39  SEEALWALYERWRGRHA-VARDLGDKARRFNVFKENVRLIHDFNQRDEPYKLRLNRFGDM 97

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG-DELPESVDWREKGAVNPVKDQGSC 162
           T +E+R  Y G+R    R     +  S      AG  +LP SVDWR+KGAV  VKDQG C
Sbjct: 98  TADEFRRHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKGAVTDVKDQGQC 157

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           GSCWAFST+AAVEGIN I T  L SLSEQ+LVDCD K NAGC+GGLMDYAFQ+I ++GG+
Sbjct: 158 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDYAFQYIAKHGGV 217

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E  YPY   +  C  S   A  V+IDGYEDV   DE +LKKAVA QPVSVAIEA G  
Sbjct: 218 AAEDAYPYKARQASCKKS--PAPAVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSH 275

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y  GVF G CG+ LDHGV AVGYG   +G  YW+V+NSWG +WGE GY+++ R++  
Sbjct: 276 FQFYSEGVFAGRCGTELDHGVTAVGYGVAADGTKYWVVKNSWGPEWGEKGYIRMARDVA- 334

Query: 342 TNTGKCGIAMEASYPVKNSQN 362
              G CGIAMEASYPVK S N
Sbjct: 335 AKEGHCGIAMEASYPVKTSPN 355


>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
          Length = 324

 Score =  345 bits (886), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 180/350 (51%), Positives = 235/350 (67%), Gaps = 43/350 (12%)

Query: 12  TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT-SNGMGHNE 70
           +L+ +F +  SSA D+S+ S           R+++EV  I+QTW++KHGKT +N +G  E
Sbjct: 13  SLLIIFLLPPSSAMDLSVTS--------GGLRSNEEVGFIFQTWMSKHGKTYTNALGDKE 64

Query: 71  KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
           +RFQ FKDNLRFID+HN+ N +Y++GL +FADLT +EY+ ++ G R   K++ ++    +
Sbjct: 65  QRFQNFKDNLRFIDQHNAKNLSYRLGLTQFADLTVQEYQDLFSG-RPIQKQKALR---VT 120

Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
            RY   A D+LP+SVDWR+KGAV+ +KDQG C           VE INKIVTGELISLSE
Sbjct: 121 HRYVPLAEDQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTGELISLSE 170

Query: 191 QELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK-VVSI 249
           QELVDC    N GCNGGLMD AFQF+I N G++ + DYPY   +  C+ ++  +K V+ I
Sbjct: 171 QELVDCSID-NHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNTSKKVIKI 229

Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
           DGYEDV   +E SL+KAVA QP                 G++TG CG+ LDH VV VGYG
Sbjct: 230 DGYEDVPANNENSLQKAVAHQP-----------------GIYTGPCGTDLDHAVVIVGYG 272

Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           TENG DYW+VRNSWG+ WGE GY K+ RN  +  TG CGIAM ASYP+KN
Sbjct: 273 TENGQDYWIVRNSWGTVWGEAGYAKIARN-FENPTGVCGIAMVASYPIKN 321


>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 457

 Score =  345 bits (886), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 178/338 (52%), Positives = 227/338 (67%), Gaps = 12/338 (3%)

Query: 25  ADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID 84
           +D SI+ Y +  D SS    +D ++ +++ WLAKH K          RF++FKDNL+ ID
Sbjct: 128 SDFSIVGY-SEEDLSS----NDRIIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHID 182

Query: 85  EHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPES 144
           + N    +Y +GLN+FADLT+EE++A YLG    A  R  +    S +Y   + D+LP+S
Sbjct: 183 KVNREVTSYWLGLNEFADLTHEEFKATYLGLAPPAPARESR---GSFKYEDVSADDLPKS 239

Query: 145 VDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGC 204
           VDWR KGAV  VK+QG CGSCWAFSTVAAVEGIN IVTG L +LSEQEL+DC    N GC
Sbjct: 240 VDWRTKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGC 299

Query: 205 NGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC-DPSRRNAKVVSIDGYEDVSPFDEMSL 263
           NGGLMDYAF +I  +GG+ +E+ YPYL  E  C D  +  ++ V+I GYEDV   +E +L
Sbjct: 300 NGGLMDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQAL 359

Query: 264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGV--DYWLVRN 321
            KA+A QPVSVAIEA GR FQ Y  GVF G CG+ LDHGV AVGYG++ G   DY +VRN
Sbjct: 360 IKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGTQLDHGVAAVGYGSDKGKGHDYIIVRN 419

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           SWG+ WGE GY++++R       G CGI   ASYP K+
Sbjct: 420 SWGAKWGEKGYIRMKRG-TGKGEGLCGINKMASYPTKD 456


>gi|326520659|dbj|BAJ92693.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 289

 Score =  345 bits (884), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 167/276 (60%), Positives = 211/276 (76%), Gaps = 20/276 (7%)

Query: 19  ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKD 78
           +S ++AADMSI+SY          R+++EV  +Y  W+A+HG T N +G  E+RF+ F+D
Sbjct: 18  VSLAAAADMSIVSYGE--------RSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRD 69

Query: 79  NLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRS--DAKRRLMKSKVASQR 132
           NLR+ID+HN+       ++++GLN+FADLTNEEYR+ YLG R+  D +R+L      S R
Sbjct: 70  NLRYIDQHNAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKL------SAR 123

Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
           Y     DELPESVDWR+KGAV  VKDQG CGSCWAFS +AAVEGIN+IVTG++I LSEQE
Sbjct: 124 YQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQE 183

Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGY 252
           LVDCD   N GCNGGLMDYAF+FII NGG+DSE+DYPY   +N+CD +++NAKVV+IDGY
Sbjct: 184 LVDCDTSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGY 243

Query: 253 EDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           EDV    E SL+KAVA+QP+SVAIEAGGRAFQ Y+S
Sbjct: 244 EDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLYKS 279


>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
          Length = 331

 Score =  345 bits (884), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 167/315 (53%), Positives = 214/315 (67%), Gaps = 26/315 (8%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
           D+++  +++W++KHGK    M     RF++F++NL  IDE N    +Y +GLN+FADL++
Sbjct: 43  DKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSSYWLGLNEFADLSH 102

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
           EE+++                         K   +LPESVDWR+KGAV  VK+QG+CGSC
Sbjct: 103 EEFKS-------------------------KDVADLPESVDWRKKGAVTHVKNQGACGSC 137

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFSTVAAVEGIN+IVTG L +LSEQEL+DCD   N+GCNGGLMDYAF FI  NGG+  E
Sbjct: 138 WAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKE 197

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
            DYPYL  E  C+  + +  +V+I GYEDV   DE SL KA+A QP+SVAIEA GR FQ 
Sbjct: 198 DDYPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQF 257

Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           Y  GVF G CG+ LDHGV AVGYG+  G+DY +V+NSWG  WGE GY++++RN   T  G
Sbjct: 258 YSGGVFNGPCGTELDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGYIRMKRNTGKTE-G 316

Query: 346 KCGIAMEASYPVKNS 360
            CGI   ASYP K++
Sbjct: 317 LCGINKMASYPTKDN 331


>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 361

 Score =  345 bits (884), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 163/309 (52%), Positives = 224/309 (72%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++GK        EKRF+IFK+N+ +I+  +N+ N+ YK+ +N+FADLTNEE+  
Sbjct: 57  HEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEF-- 114

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  R+  K  +  S + +  +  +    +P +VDWR+KGAV P+KDQG CG CWAFS 
Sbjct: 115 --IAPRNRFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 172

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI+ + +G+LISLSEQELVDCD K ++ GC GGLMD AF+F+IQN G+++E +YP
Sbjct: 173 VAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYP 232

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G + KC+ +     VV+I GYEDV   +E +L+KAVA+QPVSVAI+A G  FQ Y+SG
Sbjct: 233 YKGVDGKCNANEAANDVVTITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSG 292

Query: 290 VFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG CG+ LDHGV AVGYG  N G +YWLV+NSWG++WGE GY+++QR  +D+  G CG
Sbjct: 293 VFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRG-VDSEEGLCG 351

Query: 349 IAMEASYPV 357
           IAM+ASYP 
Sbjct: 352 IAMQASYPT 360


>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
 gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
          Length = 345

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 168/325 (51%), Positives = 219/325 (67%), Gaps = 11/325 (3%)

Query: 41  SWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNK 99
           SW   D +  +YQ W+ +HGK  N     +KRFQIFK+N+ +I+ HN+  N ++ +GLNK
Sbjct: 27  SWHFIDPLWQVYQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNK 86

Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           FADLTN E+R +Y+G       RL +     +        +   SVDWR+KG V  +KDQ
Sbjct: 87  FADLTNSEFRGLYVG-------RLQRPAPFHEVGDIALVADTATSVDWRKKGGVTEIKDQ 139

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
           G CGSCWAFS VAAVEG+  + TG L+SLSEQELVDCD  +N GC+GG+MDYAFQ++I+N
Sbjct: 140 GDCGSCWAFSAVAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQYMIRN 199

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
           GG+ S+ +YPY      CD  +      +I+G++ + P  E  L +AVA+QPVSVAIEAG
Sbjct: 200 GGITSQSNYPYRALRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAG 259

Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRN 338
           G+ FQ Y SGVFTGECGS LDHGV  VGYGT+  G  YWLV+NSWGS WGE+GYV+++R 
Sbjct: 260 GQDFQLYSSGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNSWGSGWGESGYVRMERQ 319

Query: 339 LLDTNTGKCGIAMEASYPVKNSQNS 363
                 G CGI ++ASYP K  Q +
Sbjct: 320 --GPGAGVCGINLDASYPTKIQQRT 342


>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
 gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
          Length = 360

 Score =  343 bits (880), Expect = 7e-92,   Method: Compositional matrix adjust.
 Identities = 172/332 (51%), Positives = 225/332 (67%), Gaps = 11/332 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           ++D +  +Y+ W   H   +  +    +RF +FK+N++FI E N   +  YK+ LNKF D
Sbjct: 32  SEDSLWNLYEKWRTHH-TVARDLDEKNRRFNVFKENVKFIHEFNQKKDAPYKLALNKFGD 90

Query: 103 LTNEEYRAMYLGTRSDAKR--RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
           +TN+E+R+ Y G++    R  R ++    S  Y    G     S+DWR KGAV  VKDQG
Sbjct: 91  MTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYE-NVGSLPAASIDWRAKGAVTGVKDQG 149

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
            CGSCWAFST+A+VEGIN+I TGEL+SLSEQELVDCD   N GCNGGLMDYAF+F IQ  
Sbjct: 150 QCGSCWAFSTIASVEGINQIKTGELVSLSEQELVDCDTSYNEGCNGGLMDYAFEF-IQKN 208

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+ +E  YPY   +  C  +  N+ VVSIDG++DV   +E +L +AVA+QP+SV+IEA G
Sbjct: 209 GITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQDVPANNENALMQAVANQPISVSIEASG 268

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y  GVFTG CG+ LDHGV  VGYG T +G  YW+V+NSWG +WGE+GY+++QR +
Sbjct: 269 YGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGTKYWIVKNSWGEEWGESGYIRMQRGI 328

Query: 340 LDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
            D   GKCGIAMEASYP+K S N   PK  S+
Sbjct: 329 SDKR-GKCGIAMEASYPIKTSAN---PKNSST 356


>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
           [Arabidopsis thaliana]
          Length = 416

 Score =  343 bits (879), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 168/322 (52%), Positives = 221/322 (68%), Gaps = 13/322 (4%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLT 104
           D++  ++  W  KHGKT       ++R QIFKDN  F+ +HN + N TY + LN FADLT
Sbjct: 24  DDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLT 83

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
           + E++A  LG    A   +M SK  S   + K    +P+SVDWR+KGAV  VKDQGSCG+
Sbjct: 84  HHEFKASRLGLSVSAPSVIMASKGQSLGGSVK----VPDSVDWRKKGAVTNVKDQGSCGA 139

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
           CW+FS   A+EGIN+IVTG+LISLSEQEL+DCD+  NAGCNGGLMDYAF+F+I+N G+D+
Sbjct: 140 CWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDT 199

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           E+DYPY   +  C   +   KVV+ID Y  V   DE +L +AVA QPVSV I    RAFQ
Sbjct: 200 EKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQ 259

Query: 285 HYES-------GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
            Y S       G+F+G C ++LDH V+ VGYG++NGVDYW+V+NSWG  WG +G++ +QR
Sbjct: 260 LYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQR 319

Query: 338 NLLDTNTGKCGIAMEASYPVKN 359
           N  +++ G CGI M ASYP+K 
Sbjct: 320 NTENSD-GVCGINMLASYPIKT 340


>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
 gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
          Length = 344

 Score =  342 bits (878), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 164/309 (53%), Positives = 217/309 (70%), Gaps = 6/309 (1%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
           ++TW+ ++G+   G    EKRF+IFK+N+ FI+  +N+ N+ YK+G+N F DLTNEE+RA
Sbjct: 38  HKTWMTQYGRVYKGNVEKEKRFKIFKENVEFIESFNNNGNKPYKLGINAFTDLTNEEFRA 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
            + G            +  S RY       +P S+DWR KGAV  +KDQG CG CWAFS 
Sbjct: 98  SHNGYTMSMSSHQSSYRTKSFRYENVTA--VPPSLDWRTKGAVTHIKDQGQCGCCWAFSA 155

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA+EGI K+ TG LISLSEQELVDCD   ++ GC GGLMD AF+FII+N G+ +E +YP
Sbjct: 156 VAAMEGITKLSTGTLISLSEQELVDCDTSGMDQGCEGGLMDDAFEFIIENNGLTTEANYP 215

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+  +       I GYE+V  +DE +L+KAVA+QPVSVAI+AG  AFQHY SG
Sbjct: 216 YEGVDGSCNTRKAANHAAKITGYENVPAYDEEALRKAVANQPVSVAIDAGESAFQHYSSG 275

Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           +FTG+CG+ LDHGV  VGYGT ++G  YWLV+NSWG+ WGE+GY++++R+ +D   G CG
Sbjct: 276 IFTGDCGTELDHGVTVVGYGTSDDGTKYWLVKNSWGTSWGEDGYIRMERD-IDAKEGLCG 334

Query: 349 IAMEASYPV 357
           IAME SYP 
Sbjct: 335 IAMEPSYPT 343


>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
          Length = 384

 Score =  342 bits (877), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 169/336 (50%), Positives = 232/336 (69%), Gaps = 17/336 (5%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNE----KRFQIFKDNLRFIDEHNSLN-RTYKVGLN 98
           +++ +  +Y+ W + + + S   G ++    +RF +FK+N R++ E N  + R +++ LN
Sbjct: 33  SEESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKDGRPFRLALN 92

Query: 99  KFADLTNEEYRAMYLGTRSDAKR-RLMKSKVASQRYACKAGD---ELPESVDWREKGAVN 154
           KFAD+T +E+R  Y G+R+   R +L +++  +     + G     LP +VDWR +GAV 
Sbjct: 93  KFADMTTDEFRRTYAGSRTRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAVDWRLRGAVT 152

Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQ 214
            VKDQG CGSCWAFS +AAVEG+NKI+TG+L+SLSEQELVDCD   N GC+GGLMDYAFQ
Sbjct: 153 GVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAFQ 212

Query: 215 FIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSV 274
           +I +NGG+ +E +YPYL  +  C+ ++  +  V+IDGYEDV   +E +L+KAVA QPV+V
Sbjct: 213 YIQRNGGVTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAV 272

Query: 275 AIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYV 333
           AIEA G+ FQ Y  GVFTG CG+ LDHGV AVGYGT  +G  YW V+NSWG DWGE GY+
Sbjct: 273 AIEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGYI 332

Query: 334 KLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPH 369
           ++QR + D+  G CGIAME SYP K      KP  H
Sbjct: 333 RMQRGVPDSR-GLCGIAMEPSYPTK------KPAGH 361


>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
 gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
 gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
 gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 385

 Score =  342 bits (877), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 169/326 (51%), Positives = 220/326 (67%), Gaps = 4/326 (1%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           +++ +  +Y+ W  +H + +  +G   +RF +FKDN+R I E N  +  YK+ LN+F D+
Sbjct: 40  SEEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRRDEPYKLRLNRFGDM 98

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           T +E+R  Y  +R    R           +      +LP +VDWREKGAV  VKDQG CG
Sbjct: 99  TADEFRRAYASSRVSHHRMFRGRGERRSGFMYAGARDLPAAVDWREKGAVGAVKDQGQCG 158

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
           SCWAFST+AAVEGIN I T  L +LSEQ+LVDCD K  NAGC+GGLMD AFQ+I ++GG+
Sbjct: 159 SCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGV 218

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +   YPY   ++ C  S  ++  V+IDGYEDV    E +LKKAVA+QPVSVAIEAGG  
Sbjct: 219 AASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSH 278

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y  GVF G+CG+ LDHGV AVGYGT  +G  YW+VRNSWG+DWGE GY++++R+ + 
Sbjct: 279 FQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRD-VS 337

Query: 342 TNTGKCGIAMEASYPVKNSQNSAKPK 367
              G CGIAMEASYP+K S N A  K
Sbjct: 338 AKEGLCGIAMEASYPIKTSPNPAPKK 363


>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 168/315 (53%), Positives = 215/315 (68%), Gaps = 4/315 (1%)

Query: 47  EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTN 105
           E+  +++TW  +HGKT         R ++F+DN  F+ EHNS  N +Y + LN FADLT+
Sbjct: 25  EIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTH 84

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
            E++A  LG  S A   L   +  S R       ++P SVDWR+ GAV  VKDQG+CG+C
Sbjct: 85  HEFKASRLGLSSAASASLNVDR--SNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGAC 142

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           W+FS   A+EGINKIVTG L+SLSEQELVDCD+  N GC GG+MDYAFQF+I N G+D+E
Sbjct: 143 WSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTE 202

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
           +DYPY G +  C+  +    VV+IDGY DV   +E  L KAVA+QPVSV I    RAFQ 
Sbjct: 203 EDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQL 262

Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           Y  G+FTG C ++LDH V+ VGYG+ENGVDYW+V+NSWGS WG +GY+ +QRN   ++ G
Sbjct: 263 YSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRN-SGSSRG 321

Query: 346 KCGIAMEASYPVKNS 360
            CGI M ASYP K S
Sbjct: 322 LCGINMLASYPKKTS 336


>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
 gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
 gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
 gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 166/323 (51%), Positives = 224/323 (69%), Gaps = 10/323 (3%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVG 96
           S + + D  +   ++ W+  +GK    +   E R +IFK+N+ +I+  N+   N+ YK+G
Sbjct: 28  SRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLG 87

Query: 97  LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
           +N+FADLTNEE+    + +R+  K  +  S   +  +  +    +P +VDWR+KGAV PV
Sbjct: 88  INQFADLTNEEF----IASRNKFKGHMCSSITKTSTFKYENAS-VPSTVDWRKKGAVTPV 142

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQF 215
           K+QG CG CWAFS VAA EGI+K+ TG+L+SLSEQELVDCD K ++ GC GGLMD AF+F
Sbjct: 143 KNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKF 202

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
           IIQN G+++E  YPY G +  C  ++ +   V+I GYEDV   +E +L+KAVA+QP+SVA
Sbjct: 203 IIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISVA 262

Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVK 334
           I+A G  FQ Y+SGVFTG CG+ LDHGV AVGYG  N G  YWLV+NSWG+DWGE GY+K
Sbjct: 263 IDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIK 322

Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
           +QR  +D   G CGIAMEASYP 
Sbjct: 323 MQRG-VDAAEGLCGIAMEASYPT 344


>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
 gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
          Length = 359

 Score =  341 bits (875), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 164/324 (50%), Positives = 227/324 (70%), Gaps = 5/324 (1%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           +++ +  +Y+ W + H   S  +    +RF +FK+NL+ I + N  +R YK+ LNKFAD+
Sbjct: 32  SEESLWNLYERWRSHH-TVSRSLTEKNQRFNVFKENLKHIHKVNQKDRPYKLRLNKFADM 90

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           TN E+   Y G++    R    S+  +  +A +    LP S+DWR++GAV  VKDQG CG
Sbjct: 91  TNHEFLQHYGGSKVSHYRMFHGSRRQTG-FAHENTSNLPSSIDWRKQGAVTGVKDQGKCG 149

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
           SCWAFS+VAAVEGINKI TGELISLSEQELVDC+  +N GC+GGLM+ AF FI + GG+ 
Sbjct: 150 SCWAFSSVAAVEGINKIKTGELISLSEQELVDCN-SVNHGCDGGLMEQAFSFIEKTGGLT 208

Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
           +E +YPY   +  CD ++ N  +V+IDGYE V   DE +L +AVA+QPVS+AI+AGG+ F
Sbjct: 209 TENNYPYRAKDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQAVANQPVSIAIDAGGQDF 268

Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           Q Y  GV+TG+CG+ L+HGV  VGYG T++G  YW+V+NSWGS+WGENG++++QR   D 
Sbjct: 269 QFYSEGVYTGDCGTELNHGVALVGYGATQDGTKYWIVKNSWGSEWGENGFIRMQRE-NDV 327

Query: 343 NTGKCGIAMEASYPVKNSQNSAKP 366
             G CGI +EASYP+K   +  +P
Sbjct: 328 EEGLCGITLEASYPIKQRSDIKQP 351


>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
 gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
 gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
          Length = 345

 Score =  341 bits (875), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 166/323 (51%), Positives = 224/323 (69%), Gaps = 10/323 (3%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVG 96
           S + + D  +   ++ W+  +GK    +   E R +IFK+N+ +I+  N+   N+ YK+G
Sbjct: 28  SRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLG 87

Query: 97  LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
           +N+FADLTNEE+    + +R+  K  +  S   +  +  +    +P +VDWR+KGAV PV
Sbjct: 88  INQFADLTNEEF----IASRNKFKGHMCSSITKTSTFKYENAS-VPSTVDWRKKGAVTPV 142

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQF 215
           K+QG CG CWAFS VAA EGI+K+ TG+L+SLSEQELVDCD K ++ GC GGLMD AF+F
Sbjct: 143 KNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKF 202

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
           IIQN G+++E  YPY G +  C  ++ +   V+I GYEDV   +E +L+KAVA+QP+SVA
Sbjct: 203 IIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISVA 262

Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVK 334
           I+A G  FQ Y+SGVFTG CG+ LDHGV AVGYG  N G  YWLV+NSWG+DWGE GY+K
Sbjct: 263 IDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIK 322

Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
           +QR  +D   G CGIAMEASYP 
Sbjct: 323 MQRG-VDAAEGLCGIAMEASYPT 344


>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
          Length = 377

 Score =  341 bits (875), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 172/302 (56%), Positives = 207/302 (68%), Gaps = 10/302 (3%)

Query: 73  FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM---KSKVA 129
           F +FK N+R I E N  +  YK+ LN+F D+T +E+R  Y G+R    R      +   A
Sbjct: 70  FNVFKANVRLIHEFNRRDEPYKLRLNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSSA 129

Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
           S  +      ++P SVDWR+KGAV  VKDQG CGSCWAFST+AAVEGIN I T  L SLS
Sbjct: 130 SASFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLS 189

Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
           EQ+LVDCD K NAGCNGGLMDYAFQ+I ++GG+ +E  YPY   +  C  S   A VV+I
Sbjct: 190 EQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASCKKS--PAPVVTI 247

Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
           DGYEDV   DE +LKKAVA QPVSVAIEA G  FQ Y  GVF+G CG+ LDHGV AVGYG
Sbjct: 248 DGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAAVGYG 307

Query: 310 -TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
            T +G  YWLV+NSWG +WGE GY+++ R++     G CGIAMEASYPVK S N   PK 
Sbjct: 308 VTADGTKYWLVKNSWGPEWGEKGYIRMARDVA-AKEGHCGIAMEASYPVKTSPN---PKV 363

Query: 369 HS 370
           H+
Sbjct: 364 HA 365


>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
          Length = 344

 Score =  341 bits (874), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 164/317 (51%), Positives = 224/317 (70%), Gaps = 9/317 (2%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVGLNKFAD 102
           DD +   +  W++++GK        E RF+IFK+N+ +I+  N+ +  ++YK+G+N+FAD
Sbjct: 32  DDSMYERHGQWMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKLGINQFAD 91

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           LTNEE+    + +R+  K  +  S + +  +  +    +P +VDWR+KGAV PVK+QG C
Sbjct: 92  LTNEEF----IASRNKFKGHMCSSIMRTTSFKYENVSGIPSTVDWRKKGAVTPVKNQGQC 147

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGG 221
           G CWAFS VAA EGI+K+ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FIIQN G
Sbjct: 148 GCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHG 207

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E  YPY G +  C+ ++ + + V+I GYEDV    E +L+KAVA+QP+SVAI+A G 
Sbjct: 208 LSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAIDASGS 267

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLL 340
            FQ Y+SGVFTG CG+ LDHGV AVGYG  N G  YWLV+NSWG+DWGE GY+ +QR  +
Sbjct: 268 DFQFYKSGVFTGACGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRG-I 326

Query: 341 DTNTGKCGIAMEASYPV 357
           +   G CGIAM+ASYP 
Sbjct: 327 EAAEGICGIAMQASYPT 343


>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
 gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  341 bits (874), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 171/312 (54%), Positives = 221/312 (70%), Gaps = 11/312 (3%)

Query: 49  MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEE 107
           M  ++TW+A++G+   G    E+R  IFK+N+ FI+  N +  + YK+ +N+FADLTNEE
Sbjct: 1   MERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEE 60

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           ++A   G +  A   L  S     RY   +   +P ++DWR+KGAV P+KDQG CG CWA
Sbjct: 61  FQASRNGYKMSA--HLSSSSTKPFRYENVSA--VPSTMDWRKKGAVTPIKDQGQCGCCWA 116

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS VAA EGI ++ TG+LISLSEQELVDCD    + GCNGGLMD AF FIIQN G+ +E 
Sbjct: 117 FSAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEA 176

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
           +YPY GA+  C+  +  AK   I GYEDV    E +L KAVA+QPVSVAI+AGG AFQ Y
Sbjct: 177 NYPYQGADGACNSGKAAAK---ITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFY 233

Query: 287 ESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
            SGVFTG+CG+ LDHGV AVGYG +++G  YWLV+NSWG+ WGENGY++++R+ +D   G
Sbjct: 234 SSGVFTGDCGTDLDHGVTAVGYGMSDDGTKYWLVKNSWGTSWGENGYIRMERD-IDAQEG 292

Query: 346 KCGIAMEASYPV 357
            CGIAMEASYP 
Sbjct: 293 LCGIAMEASYPT 304


>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  340 bits (873), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 161/309 (52%), Positives = 225/309 (72%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++GK        EKRF++FK+N+ +I+  +N+ N++YK+G+N+FADLTN+E+  
Sbjct: 39  HEQWMTRYGKVYKDPQEREKRFRVFKENVNYIEAFNNAANKSYKLGINQFADLTNKEF-- 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  R+  K  +  S + +  +  +     P +VDWR+KGAV P+KDQG CG CWAFS 
Sbjct: 97  --IAPRNGFKGHMCSSIIRTTTFKFENVTATPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI+ +  G+LISLSEQELVDCD K ++ GC GGLMD AF+FIIQN G+++E +YP
Sbjct: 155 VAATEGIHALSAGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEANYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G + KC+ +       +I GYEDV   +EM+L+KAVA+QPVSVAI+A G  FQ Y+SG
Sbjct: 215 YKGVDGKCNANEAAKNAATITGYEDVPANNEMALQKAVANQPVSVAIDASGSDFQFYKSG 274

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG CG+ LDHGV AVGYG +++G +YWLV+NSWG++WGE GY+++QR  +D+  G CG
Sbjct: 275 VFTGSCGTELDHGVTAVGYGVSDDGTEYWLVKNSWGTEWGEEGYIRMQRG-VDSEEGLCG 333

Query: 349 IAMEASYPV 357
           IAM+ASYP 
Sbjct: 334 IAMQASYPT 342


>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
 gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
          Length = 341

 Score =  340 bits (873), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 170/324 (52%), Positives = 224/324 (69%), Gaps = 29/324 (8%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEK-RFQIFKDNLRFIDEHNSLN----RTYKVGL 97
           R D+EV  +Y+TW ++HG+  +G+   +  R ++F+DNLR+ID HN+       T+++GL
Sbjct: 42  RADEEVRQLYKTWKSEHGRPRDGISVADGLRLKVFRDNLRYIDAHNAEADAGLHTFRLGL 101

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
             F DLT EE+RA  LG  +    R     VAS RY  +AGD+LP++VDWR++GAV  VK
Sbjct: 102 TPFTDLTLEEFRAHALGFLNSTLPR-----VASDRYLPRAGDDLPDAVDWRQQGAVTGVK 156

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           +Q  CG CWAFS VAA+EGINKIVT  LISLSEQEL+DCD + + GC GG M  AFQF+I
Sbjct: 157 NQLDCGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDTE-DYGCQGGEMQKAFQFVI 215

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
            NGG+D+E DYP++G    CD  R   KVVSID YE+V   DE +L+KAVA+QP      
Sbjct: 216 DNGGIDTEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQP------ 269

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
                      G+F G CG  LDHGV AVGYG++NG D+W+V+NSWG++WGE+GY++++R
Sbjct: 270 -----------GIFNGPCGFILDHGVTAVGYGSDNGEDFWIVKNSWGAEWGESGYIRMKR 318

Query: 338 NLLDTNTGKCGIAMEASYPVKNSQ 361
           N+L    GKCGIAM ASYPVKN +
Sbjct: 319 NVL-LPMGKCGIAMYASYPVKNGR 341


>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 351

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 178/356 (50%), Positives = 232/356 (65%), Gaps = 12/356 (3%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           L+++ L+       +  +D SI+ Y +  D SS     D ++ +++ WLAKH K      
Sbjct: 5   LSVAVLLLCVGACVARNSDFSIVGY-SEEDLSSH----DRLVELFEKWLAKHQKAYASFE 59

Query: 68  HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
               RF++FKDNL+ IDE N    +Y +GLN+FADLT++E++  YLG      RR     
Sbjct: 60  EKLHRFEVFKDNLKLIDEINREVTSYWLGLNEFADLTHDEFKTTYLGLSPPPARRSSSRS 119

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
               RY   A  +LP++VDWR+KGAV  VK+QG CGSCWAFSTVAAVEGIN IVTG L +
Sbjct: 120 F---RYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTA 176

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC-DPSRRNAKV 246
           LSEQEL+DC    N+GCNGG+MDYAF +I  +GG+ +E+ YPYL  E  C D  +  ++ 
Sbjct: 177 LSEQELIDCSVDGNSGCNGGMMDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEA 236

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           VSI GYEDV   DE +L KA+A QPVSVAIEA GR FQ Y  GVF G CG+ LDHGV AV
Sbjct: 237 VSISGYEDVPTKDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVAAV 296

Query: 307 GYGTENGV--DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           GYG++ G   DY +V+NSWG  WGE GY++++R     + G CGI   ASYP K++
Sbjct: 297 GYGSDKGKGHDYIIVKNSWGGKWGEKGYIRMKRG-TGKSEGLCGINKMASYPTKDN 351


>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
          Length = 343

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 163/317 (51%), Positives = 220/317 (69%), Gaps = 10/317 (3%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFAD 102
           DD +   ++ W+  +GK        EKR +IF +NL++I+  N+   N+ YK+G+N+FAD
Sbjct: 32  DDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNNKPYKLGINQFAD 91

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           LTNEE+    + +R+  K  +  S + +  +  +    +P +VDWR+KGAV PVK+QG C
Sbjct: 92  LTNEEF----IASRNKFKGHMCSSIIRTTTFKYE-NTSVPSTVDWRKKGAVTPVKNQGQC 146

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGG 221
           G CWAFS +AA EGI+KI TG+L+SLSEQELVDCD   ++ GC GGLMD AF+FIIQN G
Sbjct: 147 GCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNNG 206

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E  YPY G +  C  +  +    +I GYEDV   +E +L+KAVA+QP+SVAI+A G 
Sbjct: 207 ISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNENALQKAVANQPISVAIDASGS 266

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLL 340
            FQ Y+SGVFTG CG+ LDHGV AVGYG  N G  YWLV+NSWG+DWGE GY+++QR+ +
Sbjct: 267 DFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGYIRMQRS-I 325

Query: 341 DTNTGKCGIAMEASYPV 357
           D   G CGIAM+ASYP 
Sbjct: 326 DAAEGLCGIAMQASYPT 342


>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 165/311 (53%), Positives = 217/311 (69%), Gaps = 8/311 (2%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
           ++  W  +HGKT       ++R QIFKDN  F+ +HN + N TY + LN FADLT+ E++
Sbjct: 31  LFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFK 90

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
           A  LG    A   +M SK  S         ++P+SVDWR+KGAV  VKDQGSCG+CW+FS
Sbjct: 91  ASRLGLSVSASSLIMASKGQSL----GGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
              A+EGIN+IVTG+LISLSEQEL+DCD+  NAGCNGGLMDYAF+F+I+N G+D+E+DYP
Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE-- 287
           Y   +  C   +   KVV+ID Y  V   DE +L++AVA QPVSV I    RAFQ Y   
Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRV 266

Query: 288 SGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           SG+F+G C ++LDH V+ VGYG++NGVDYW+V+NSWG  WG +G++ +QRN  ++  G C
Sbjct: 267 SGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSE-GIC 325

Query: 348 GIAMEASYPVK 358
           GI M ASYP+K
Sbjct: 326 GINMLASYPIK 336


>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
 gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
          Length = 343

 Score =  339 bits (869), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 163/316 (51%), Positives = 223/316 (70%), Gaps = 9/316 (2%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS--LNRTYKVGLNKFADL 103
           D++   ++ W++++GK        EKRF+IF +N+ +I+  N    N+ Y +G+N+FADL
Sbjct: 32  DDMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQFADL 91

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           TN+E+ +    +R+  K  +  S   +  +  +    +P SVDWR+KGAV PVK+QG CG
Sbjct: 92  TNDEFTS----SRNKFKGHMCSSITRTSTFKYENASAIPSSVDWRKKGAVTPVKNQGQCG 147

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
            CWAFS VAA EGI+K+ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FIIQN G+
Sbjct: 148 CCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGL 207

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
           ++E +YPY G +  C+ ++ +   V+I GYEDV   +E +L+KAVA+QP+SVAI+A G  
Sbjct: 208 NTEANYPYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPISVAIDASGSD 267

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y+SGVFTG CG+ LDHGV AVGYG  N G  YWLV+NSWG++WGE GY+ +QR  +D
Sbjct: 268 FQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTEWGEEGYIMMQRG-VD 326

Query: 342 TNTGKCGIAMEASYPV 357
              G CGIAM+ASYP 
Sbjct: 327 AAEGLCGIAMQASYPT 342


>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
 gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 162/310 (52%), Positives = 221/310 (71%), Gaps = 9/310 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS--LNRTYKVGLNKFADLTNEEYR 109
           ++ W+  +GK        EKRF+IF +N+++I+  N+   N +YK+G+N+FADLTNEE+ 
Sbjct: 39  HERWMNHYGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLGINQFADLTNEEFV 98

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
           A    +R+  K  +  S + +  +  +    +P +VDWR+KGAV PVK+QG CG CWAFS
Sbjct: 99  A----SRNKFKGHMCSSIIRTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFS 154

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
            VAA EGI+K+ TG+L+SLSEQELVDCD K ++ GC GGLMD AF+FIIQN G+++E  Y
Sbjct: 155 AVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQY 214

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY G +  C+ ++ + +  +I GYEDV   +E +L+KAVA+QP+SVAI+A G  FQ Y+S
Sbjct: 215 PYQGVDGTCNANKASIQATTITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKS 274

Query: 289 GVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           GVFTG CG+ LDHGV AVGYG  N G  YWLV+NSWG+DWGE GY+ +QR  ++   G C
Sbjct: 275 GVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRG-VEAAEGLC 333

Query: 348 GIAMEASYPV 357
           GIAM+ASYP 
Sbjct: 334 GIAMQASYPT 343


>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
          Length = 343

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 163/316 (51%), Positives = 222/316 (70%), Gaps = 8/316 (2%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADL 103
           DD +   +  W++++GK        E RF+IF +N+ +++  N+ + ++YK+G+N+FADL
Sbjct: 32  DDSMYERHGQWMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLGINQFADL 91

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           TNEE+ A    +R+  K  +  S   +  +  +    +P +VDWR+KGAV PVK+QG CG
Sbjct: 92  TNEEFVA----SRNKFKGHMCSSITRTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCG 147

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
            CWAFS VAA EGI+K+ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FIIQN G+
Sbjct: 148 CCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGL 207

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E  YPY G +  C+ ++ + + V+I GYEDV    E +L+KAVA+QP+SVAI+A G  
Sbjct: 208 STEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAIDASGSD 267

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y+SGVFTG CG+ LDHGV AVGYG  N G  YWLV+NSWG+DWGE GY+ +QR  ++
Sbjct: 268 FQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYIMMQRG-VE 326

Query: 342 TNTGKCGIAMEASYPV 357
              G CGIAM+ASYP 
Sbjct: 327 AAEGLCGIAMQASYPT 342


>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 160/309 (51%), Positives = 222/309 (71%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++GK        EKRF+IFK+N+ +I+  +N+ N+ YK+ +N+FADLTNEE+  
Sbjct: 39  HEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEF-- 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  R+  K  +  S + +  +  +    +P +VDWR+KGAV P+KDQG CG CWAFS 
Sbjct: 97  --IAPRNRFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI+ + +G+LISLSEQELVDCD K ++ GC GGLMD AF+F+IQN G+++E +YP
Sbjct: 155 VAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G + KC+ +       +I GYEDV   +E +L+KAVA+QPVSVAI+A G  FQ Y+SG
Sbjct: 215 YKGVDGKCNVNEAANDAATITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSG 274

Query: 290 VFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG CG+ LDHGV AVGYG  N G +YWLV+NSWG++WGE GY+++QR  +++  G CG
Sbjct: 275 VFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRG-VNSEEGLCG 333

Query: 349 IAMEASYPV 357
           IAM+ASYP 
Sbjct: 334 IAMQASYPT 342


>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
          Length = 341

 Score =  337 bits (865), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 176/360 (48%), Positives = 243/360 (67%), Gaps = 23/360 (6%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA+ + +  I  L  LF +    AA  S     N H+ S   R +D        W+A++G
Sbjct: 1   MASVNQYRYI-CLALLFVL----AAWASHAKARNLHEASMYERHED--------WMAQYG 47

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           +     G   KR++IFKDN+  I+  N ++N++YK+ +N+FADLTNEE+RA    +R+  
Sbjct: 48  RVYKDAGEKSKRYKIFKDNVARIESFNKAMNKSYKLSINEFADLTNEEFRA----SRNRF 103

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           K  +  ++  S +Y  +    +P +VDWR+KGAV P+KDQG CGSCWAFS VAA+EGI +
Sbjct: 104 KAHICSTEATSFKY--EHVXAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161

Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           + TG+LISLSEQELVDCD    + GC+GGLMD AF+FI QN G+ +E +YPY G +  C+
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCN 221

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             +       I+GYEDV   +E +L+KAVA QP++VAI+AGG  FQ Y SGVFTG+CG+ 
Sbjct: 222 RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTE 281

Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           LDHGV AVGYGT ++G+ YWLV+NSWG+ WGE GY+++QR++ +   G CGIAM+ASYP 
Sbjct: 282 LDHGVSAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTEKE-GLCGIAMQASYPT 340


>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
          Length = 343

 Score =  337 bits (865), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 164/309 (53%), Positives = 217/309 (70%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRA 110
           +Q W+ ++ K  N     EKRFQIFK+N+ +I+  N    R YK+G+N+F DLTNEE+  
Sbjct: 39  HQQWMGQYAKIYNDHQEWEKRFQIFKENVNYIETSNKEGGRFYKLGVNQFVDLTNEEF-- 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  R+  K  +  S + +  Y  +    +P +VDWR+KGAV PVKDQG CG CWAFS 
Sbjct: 97  --IAPRNRFKGHMCSSIIRTNTYKYENVTTVPSNVDWRQKGAVTPVKDQGQCGCCWAFSA 154

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI+++ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FIIQN G+D+E  YP
Sbjct: 155 VAATEGIHQLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLDTEAKYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+ +  +    +I  YEDV   +E +L+KAVA+QP+SVAI+A G  FQ Y SG
Sbjct: 215 YQGVDGTCNANEASINAATITSYEDVPTNNEQALQKAVANQPISVAIDASGSDFQFYTSG 274

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG CG+ LDHGV AVGYG +++G  YWLV+NSWG+ WGE GY+++QR  +D   G CG
Sbjct: 275 VFTGSCGTELDHGVTAVGYGVSDDGTKYWLVKNSWGTSWGEEGYIRMQRG-VDAVEGLCG 333

Query: 349 IAMEASYPV 357
           IAM+ASYP+
Sbjct: 334 IAMQASYPI 342


>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
          Length = 343

 Score =  337 bits (864), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 162/317 (51%), Positives = 219/317 (69%), Gaps = 10/317 (3%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFAD 102
           DD +   ++ W+  +GK        EKR +IF +NL++I+  N+    + YK+G+N+FAD
Sbjct: 32  DDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNKKPYKLGINQFAD 91

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           LTNEE+    + +R+  K  +  S + +  +  +    +P +VDWR+KGAV PVK+QG C
Sbjct: 92  LTNEEF----IASRNKFKGHMCSSIIRTTTFKYE-NTSVPSTVDWRKKGAVTPVKNQGQC 146

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGG 221
           G CWAFS +AA EGI+KI TG+L+SLSEQELVDCD   ++ GC GGLMD AF+FIIQN G
Sbjct: 147 GCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNNG 206

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E  YPY G +  C  +  +    +I GYEDV   +E +L+KAVA+QP+SVAI+A G 
Sbjct: 207 ISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNENALQKAVANQPISVAIDASGS 266

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLL 340
            FQ Y+SGVFTG CG+ LDHGV AVGYG  N G  YWLV+NSWG+DWGE GY+++QR+ +
Sbjct: 267 DFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGYIRMQRS-I 325

Query: 341 DTNTGKCGIAMEASYPV 357
           D   G CGIAM+ASYP 
Sbjct: 326 DAAEGLCGIAMQASYPT 342


>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
 gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  337 bits (864), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 164/312 (52%), Positives = 221/312 (70%), Gaps = 10/312 (3%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNE 106
           ++  ++ W+A+HG+    M   EKR+ IFK+N+  I+  +N  +R YK+G+NKFADLTNE
Sbjct: 1   MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60

Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           E+RAMY G +  +      SK+ S  +  +   ++P S+DWR  GAV PVKDQG+CG CW
Sbjct: 61  EFRAMYHGYKRQS------SKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCW 114

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           AFSTVAA+EGI K+ TG LISLSEQ+LVDC    N GC GGLMD AFQ+II+NGG+ SE 
Sbjct: 115 AFSTVAAIEGIIKLQTGNLISLSEQQLVDCTAG-NKGCQGGLMDTAFQYIIRNGGLTSED 173

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
           +YPY G +  C   +  +    I GYEDV   +E +L +AVA QPVSVA++ GG  F+ Y
Sbjct: 174 NYPYQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFY 233

Query: 287 ESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           +SGVF G+CG+ L+HGV A+GYGT+ +G DYWLV+NSWG+ WGE+GY ++QR  +  + G
Sbjct: 234 KSGVFEGDCGTNLNHGVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRG-IGASEG 292

Query: 346 KCGIAMEASYPV 357
            CG+AM+ASYP 
Sbjct: 293 LCGVAMDASYPT 304


>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
 gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
          Length = 324

 Score =  337 bits (863), Expect = 8e-90,   Method: Compositional matrix adjust.
 Identities = 177/358 (49%), Positives = 221/358 (61%), Gaps = 36/358 (10%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA +   + + T+     I S  A D SI+ Y   H  S    T+     ++++W++KHG
Sbjct: 1   MAPSVSSIFLFTIFTSLVICSVVAHDFSIVGYSPEHLTSMHKLTE-----LFESWMSKHG 55

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           KT   +     R ++FKDNL  ID  N    TY + LN+FADL++EE+            
Sbjct: 56  KTYESIEEKLHRLEVFKDNLMHIDRRNRDVTTYWLALNEFADLSHEEF------------ 103

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
               KSK+A  R                EKGAV PVK+QGSCGSCWAFSTVAAVEGIN+I
Sbjct: 104 ----KSKLAQIRRL--------------EKGAVAPVKNQGSCGSCWAFSTVAAVEGINQI 145

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           VTG L SLSEQEL+DCD   N+GCNGGLMDYAF +I+ NGG+  E+DYPYL  E  CD  
Sbjct: 146 VTGNLTSLSEQELIDCDTSFNSGCNGGLMDYAFDYIVNNGGLHKEEDYPYLMEEGTCDEK 205

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           R   +VV+I GY DV   +E SL KA+A QP+S+AIEA GR FQ Y  GVF G CG+ LD
Sbjct: 206 REEMEVVTISGYHDVPENNEESLLKALAHQPLSIAIEASGRDFQFYGRGVFNGPCGTDLD 265

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           HGV AVGYG+  G+DY +V+NSWG  WGE GY++++RN      G CGI   ASYP K
Sbjct: 266 HGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGYIRMKRN-TGKPEGLCGINKMASYPTK 322


>gi|42563538|gb|AAS20467.1| cysteine protease-like protein [Pelargonium x hortorum]
          Length = 234

 Score =  336 bits (862), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 154/197 (78%), Positives = 172/197 (87%)

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
           CG CWAFST+AAVEGIN IVTGELISLSEQELVDCDR  N GCNGGLMDYAF+FII+NGG
Sbjct: 1   CGRCWAFSTIAAVEGINHIVTGELISLSEQELVDCDRSYNQGCNGGLMDYAFEFIIKNGG 60

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           +DSE+DYPY   +  CDP R+NAKVV+IDGYEDV   DE SLKKAVA QPVSVAIEAGGR
Sbjct: 61  IDSEEDYPYKAVDGTCDPIRKNAKVVTIDGYEDVPENDENSLKKAVAYQPVSVAIEAGGR 120

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
            FQ Y+SG+FTG CG+ALDHGV AVGYGTENG+DYW+VRNSWGS WGENGY++++RN+  
Sbjct: 121 EFQLYQSGIFTGRCGTALDHGVAAVGYGTENGIDYWIVRNSWGSSWGENGYIRMERNVKT 180

Query: 342 TNTGKCGIAMEASYPVK 358
           T TGKCGIAMEASYP K
Sbjct: 181 TKTGKCGIAMEASYPTK 197


>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
 gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
 gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
 gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
          Length = 307

 Score =  336 bits (862), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 164/313 (52%), Positives = 221/313 (70%), Gaps = 10/313 (3%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNE 106
           ++  ++ W+A+HG+    M   EKR+ IFK+N+  I+  +N  +R YK+G+NKFADLTNE
Sbjct: 1   MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60

Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           E+RAM+ G +  +      SK+ S  +  +    +P S+DWR+ GAV PVKDQG+CG CW
Sbjct: 61  EFRAMHHGYKRQS------SKLMSSSFRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCCW 114

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSE 225
           AFS VAA+EGI K+ TG+LISLSEQ+LVDCD K ++ GC GGLMD AFQFI++NGG+ SE
Sbjct: 115 AFSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSE 174

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
             YPY G +  C   +  +    I GYEDV   +E +L +AVA QPVSVA+E GG  FQ 
Sbjct: 175 ATYPYQGVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQF 234

Query: 286 YESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
           Y+SGVF G+CG+ LDH V A+GYGT  +G +YWLV+NSWG+ WGE+GY+++QR  +    
Sbjct: 235 YKSGVFKGDCGTYLDHAVTAIGYGTNSDGTNYWLVKNSWGTSWGESGYMRMQRG-IGARE 293

Query: 345 GKCGIAMEASYPV 357
           G CG+AM+ASYP 
Sbjct: 294 GLCGVAMDASYPT 306


>gi|118145|sp|P20721.1|CYSPL_SOLLC RecName: Full=Low-temperature-induced cysteine proteinase; Flags:
           Precursor
 gi|806314|gb|AAA66308.1| thiol protease, partial [Solanum lycopersicum]
          Length = 346

 Score =  336 bits (861), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 158/233 (67%), Positives = 192/233 (82%), Gaps = 1/233 (0%)

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
           SK  S RY  K GD LPES+DWREKG +  VKDQGSCGSCWAFS VAA+E IN IVTG L
Sbjct: 3   SKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNL 62

Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           ISLSEQELVDCDR  N GC+GGLMDYAF+F+I+NGG+D+E+DYPY      CD  R+NAK
Sbjct: 63  ISLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAK 122

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
           VV ID YEDV   +E +L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVV 
Sbjct: 123 VVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVI 182

Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
            GYGTENG+DYW+VRNSWG++  ENGY+++QRN + +++G CG+A+E SYPVK
Sbjct: 183 AGYGTENGMDYWIVRNSWGANCRENGYLRVQRN-VSSSSGLCGLAIEPSYPVK 234


>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
 gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
          Length = 422

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 164/312 (52%), Positives = 215/312 (68%), Gaps = 7/312 (2%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
           ++++W  +HGKT         RF+IF++N  F+ +HNS  N +Y + LN FADLT+ E++
Sbjct: 31  LFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFK 90

Query: 110 AMYLGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
           A  LG  +     +L +       +    GD +P S+DWR+KGAV+ VKDQG+CG+CW+F
Sbjct: 91  ASRLGLSAFSTSGKLSRRNFPLHDFV---GD-VPISIDWRKKGAVSQVKDQGNCGACWSF 146

Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           S   A+EGINKIVTG L+SLSEQELVDCDR  N GC GGLMDYA+QF+I+N G+D+E+DY
Sbjct: 147 SATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDY 206

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY   E  C+  +    VV+IDGY DV   +E  L KAVA QPVSV I    RAFQ Y  
Sbjct: 207 PYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSK 266

Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           G+FTG C ++LDH V+ VGYG+ENGVDYW+V+NSWG+ WG NGY+ + RN  ++  G CG
Sbjct: 267 GIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQ-GLCG 325

Query: 349 IAMEASYPVKNS 360
           I M AS+PVK S
Sbjct: 326 INMLASFPVKTS 337


>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
 gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
          Length = 341

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 163/308 (52%), Positives = 216/308 (70%), Gaps = 7/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+ K+G+        E+RF+IF++N+ FI+  N   NR YK+ +N+FADLTNEE++A
Sbjct: 38  HEMWMVKYGRVYKDNSEKERRFEIFRNNVEFIESFNKPGNRPYKLDINEFADLTNEEFKA 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
              G +  +   L  S+ +S RY       +P S+DWR+KGAV P+KDQG CG CWAFS 
Sbjct: 98  SRNGYKRSSNVGL--SEKSSFRYGNVTA--VPTSMDWRQKGAVTPIKDQGQCGCCWAFSA 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA+EGI K+ TG+LISLSEQELVDCD    + GC GGLMD AF+FI QNGG+ +E +YP
Sbjct: 154 VAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTEANYP 213

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+ ++       I GYEDV    E +L KAVA QPVSVAI+A G AFQ Y  G
Sbjct: 214 YQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQFYSGG 273

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           VFTG+CG+ LDHGV AVGYGT +G  YWLV+NSWG+ WGE+GY++++R+ ++   G CGI
Sbjct: 274 VFTGDCGTELDHGVTAVGYGTSDGTKYWLVKNSWGTSWGEDGYIRMERD-IEAKEGLCGI 332

Query: 350 AMEASYPV 357
           AM++SYP 
Sbjct: 333 AMQSSYPT 340


>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
 gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
 gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
 gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
          Length = 358

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 173/339 (51%), Positives = 220/339 (64%), Gaps = 8/339 (2%)

Query: 23  SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRF 82
           S  + SI+ Y +  D +S     D ++ +++ W+AK+ K         +RF++FKDNL  
Sbjct: 27  SGGEFSIVGY-SEEDLASH----DRLIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNH 81

Query: 83  IDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDEL 141
           ID+ N    +Y +GLN+FADLT++E++A YLG      R   K   + + RY   +  E+
Sbjct: 82  IDDINKKVTSYWLGLNEFADLTHDEFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEV 141

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           P+ +DWR+K AV  VK+QG CGSCWAFSTVAAVEGIN IVTG L SLSEQEL+DC    N
Sbjct: 142 PKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGN 201

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGGLMDYAF +I   GG+ +E+ YPY   E  CD   + A VV+I GYEDV   DE 
Sbjct: 202 NGCNGGLMDYAFSYIASTGGLRTEEAYPYAMEEGDCDEG-KGAAVVTISGYEDVPANDEQ 260

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           +L KA+A QPVSVAIEA GR FQ Y  GVF G CG  LDHGV AVGYGT  G DY +V+N
Sbjct: 261 ALVKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGEQLDHGVTAVGYGTSKGQDYIIVKN 320

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           SWG  WGE GY++++R       G CGI   ASYP K++
Sbjct: 321 SWGPHWGEKGYIRMKRG-TGKGEGLCGINKMASYPTKDN 358


>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
 gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  335 bits (858), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 171/322 (53%), Positives = 223/322 (69%), Gaps = 11/322 (3%)

Query: 47  EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTN 105
           E+   ++ W+AKHGK         +RFQIFK N+ FI+  N+  N++Y +G+NKFADLTN
Sbjct: 34  EMTGRHEKWMAKHGKVYKDDKEKLRRFQIFKSNVVFIESFNTAGNKSYMLGINKFADLTN 93

Query: 106 EEYRAMYLGTRSDAKRRLMKS-KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
           EE+RA + G     KR L  S K+   +Y  +    LP S+DWR KGAV P+KDQG CGS
Sbjct: 94  EEFRAFWNGY----KRPLGASRKITPFKY--ENVTALPSSIDWRSKGAVTPIKDQGVCGS 147

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMD 223
           CWAFS VAA EGI+K+ TG+L+SLSEQELVDCD K  + GC GGLM  AF+FI ++GGM 
Sbjct: 148 CWAFSAVAATEGIHKLRTGKLVSLSEQELVDCDVKGQDKGCQGGLMVDAFKFIKRHGGMT 207

Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
           SE +YPY G + KCD  +  ++ V I GY+ V    E +L KAVA+QPVSVAI+AG  +F
Sbjct: 208 SEANYPYQGRDGKCDTKKEASRAVKITGYQAVPKNSEAALLKAVANQPVSVAIDAGSLSF 267

Query: 284 QHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           Q Y SG+FTG CG  ++HGV AVGYG  N G  YW+V+NSWG++WGE GY++++R+ + +
Sbjct: 268 QFYRSGIFTGICGKDINHGVAAVGYGRSNSGSKYWIVKNSWGTEWGEKGYIRMKRD-VRS 326

Query: 343 NTGKCGIAMEASYPVKNSQNSA 364
             G CGIAME SYP    Q S+
Sbjct: 327 KEGLCGIAMECSYPTAQVQASS 348


>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  335 bits (858), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 176/360 (48%), Positives = 242/360 (67%), Gaps = 23/360 (6%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA+ + +  I  L  LFF+    AA  S  +  N  + S   R +D        W+A++G
Sbjct: 1   MASVNQYQYI-CLALLFFL----AAWASQATARNLLEASMYERHED--------WMAQYG 47

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           +         KR++IFKDN+  I+  N +++++YK+ +N+FADLTNEE+RA    +R+  
Sbjct: 48  RVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA----SRNRF 103

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           K  +  ++  S +Y   A   +P +VDWR+KGAV P+KDQG CGSCWAFS VAA+EGI +
Sbjct: 104 KAHICSTEATSFKYEHVAA--VPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161

Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           + TG+LISLSEQELVDCD    + GCNGGLMD AF+FI QN G+ +E +YPY G +  C+
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIEQNHGLATEANYPYAGTDGTCN 221

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             +       I+GYEDV   +E +L+KAVA QP++VAI+AGG  FQ Y SGVFTG+CG+ 
Sbjct: 222 RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTE 281

Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           LDHGV AVGYGT ++G+ YWLV+NSWG+ WGE GY+++QR++     G CGIAM+ASYP 
Sbjct: 282 LDHGVAAVGYGTSDDGMKYWLVKNSWGTGWGEVGYIRMQRDVT-AKEGLCGIAMQASYPT 340


>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
 gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  334 bits (857), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 162/312 (51%), Positives = 220/312 (70%), Gaps = 10/312 (3%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNE 106
           ++  ++ W+A+HG+    M   EKR+ IFK+N+  I+  +N  +R YK+G+NKFADLTNE
Sbjct: 36  MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 95

Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           E+RAMY G +  +      SK+ S  +  +   ++P S+DWR  GAV PVKDQG+CG CW
Sbjct: 96  EFRAMYHGYKRQS------SKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCW 149

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           AFSTVAA+EGI K+ TG LISLSEQ+LVDC    N GC GGLMD AFQ+II+NGG+ SE 
Sbjct: 150 AFSTVAAIEGIIKLQTGNLISLSEQQLVDCTAG-NKGCQGGLMDTAFQYIIRNGGLTSED 208

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
           +YPY G +  C   +  +    I GYEDV   +E +L +AVA QPVSV ++ GG  FQ Y
Sbjct: 209 NYPYQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVGVDGGGNDFQFY 268

Query: 287 ESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           +SGVF G+CG+  +H V A+GYGT+ +G DYWLV+NSWG+ WGENGY++++R  + ++ G
Sbjct: 269 KSGVFNGDCGTQQNHAVTAIGYGTDIDGTDYWLVKNSWGTSWGENGYMRMRRG-IGSSEG 327

Query: 346 KCGIAMEASYPV 357
            CG+AM+ASYP 
Sbjct: 328 LCGVAMDASYPT 339


>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  334 bits (856), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 160/322 (49%), Positives = 226/322 (70%), Gaps = 9/322 (2%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID-EHNSLNRTYKVGL 97
           SS    D  +   ++ W+A++G+    +   EKRF IFK+N+ +I+  +N+ ++ YK+G+
Sbjct: 26  SSRTLQDASMQERHEQWMARYGRVYKDLQEKEKRFSIFKENVNYIEASNNAGDKPYKLGV 85

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           N+FADLTNEE+    + TR+  K  +  S   +  +  +     P +VDWR++GAV PVK
Sbjct: 86  NQFADLTNEEF----IATRNKFKGHMSSSITRTTTFKYE-NVTAPSTVDWRQEGAVTPVK 140

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFI 216
           +QG+CG CWAFS VAA EGI+K+ TG L+SLSEQELVDCD    + GC GGLMD AF+FI
Sbjct: 141 NQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQGCQGGLMDDAFKFI 200

Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
           IQNGG+++E  YPY G +  C+ +     V +I GYEDV   +E +L++AVA+QP+S+AI
Sbjct: 201 IQNGGLNTEAQYPYQGVDGTCNTNEEATHVATITGYEDVPSNNEQALQQAVANQPISIAI 260

Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKL 335
           +A G  FQ+Y+SGVFTG CG+ LDHGV  VGYG +++G  YWLV+NSWG+DWGE GY+++
Sbjct: 261 DASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLVKNSWGADWGEEGYIRM 320

Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
           QR+ +D   G CG+AM+ SYP 
Sbjct: 321 QRD-VDAPEGLCGLAMQPSYPT 341


>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  334 bits (856), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 170/366 (46%), Positives = 240/366 (65%), Gaps = 34/366 (9%)

Query: 1   MATASMFLAISTLVFL------FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQT 54
           MAT + F  IS  + L      F +SS +  D S+      H+              ++ 
Sbjct: 1   MATKNQFYQISFALVLCLGLWAFQVSSRTLQDASM------HER-------------HEQ 41

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFID-EHNSLNRTYKVGLNKFADLTNEEYRAMYL 113
           W+A++GK    +   EKRF IF++N+++I+  +N+ N+ YK+G+N+F DLTN+E+    +
Sbjct: 42  WMARYGKVYKDLQEKEKRFNIFQENVKYIEASNNAGNKPYKLGVNQFTDLTNKEF----I 97

Query: 114 GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
            TR+  K  +  S   +  +  +     P +VDWR++GAV PVK+QG+CG CWAFS VAA
Sbjct: 98  ATRNKFKGHMSSSITRTTTFKYE-NVTAPSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAA 156

Query: 174 VEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
            EGI+K+ TG L+SLSEQELVDCD    + GC GGLMD AF+FIIQNGG+++E  YPY G
Sbjct: 157 TEGIHKLSTGNLVSLSEQELVDCDTSGADQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQG 216

Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
            +  C+ +     V +I GYEDV   +E +L++AVA+QP+SVAI+A G  FQ+Y+SGVFT
Sbjct: 217 VDGTCNTNEEVTHVATITGYEDVPSNNEQALQQAVANQPISVAIDASGSDFQNYQSGVFT 276

Query: 293 GECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
           G CG+ LDHGV  VGYG +++G  YWLV+NSWG DWGE GY+++QR+ ++   G CGIAM
Sbjct: 277 GSCGTQLDHGVAVVGYGVSDDGTKYWLVKNSWGEDWGEEGYIRMQRD-VEAPEGLCGIAM 335

Query: 352 EASYPV 357
           + SYP 
Sbjct: 336 QPSYPT 341


>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
 gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  333 bits (855), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 161/309 (52%), Positives = 215/309 (69%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+   GK        E+RF+IFKDN+ +I+  N+  N+ YK+ +NKFADLTNEE + 
Sbjct: 38  HEQWMETFGKVYADAAEKERRFEIFKDNVEYIESFNTAGNKPYKLSVNKFADLTNEELKV 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
              G R   + R MK  V S +Y       +P ++DWR+KGAV P+KDQG CGSCWAFST
Sbjct: 98  ARNGYRRPLQTRPMK--VTSFKYENVTA--VPATMDWRKKGAVTPIKDQGQCGSCWAFST 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGIN++ TG+L+SLSEQELVDCD +  + GC GGLM+  F+FII+N G+ +E +YP
Sbjct: 154 VAATEGINQLTTGKLVSLSEQELVDCDTQGEDQGCEGGLMEDGFEFIIKNHGITTEANYP 213

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y  A+  C+  +  +++  I GYE V    E +L KAVA QP+SV+I+AGG  FQ Y SG
Sbjct: 214 YQAADGTCNSKKEASRIAKITGYESVPANSEAALLKAVASQPISVSIDAGGSDFQFYSSG 273

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG+CG+ LDHGV AVGYG T +G  YWLV+NSWG+ WGE GY+++QR+  +   G CG
Sbjct: 274 VFTGQCGTELDHGVTAVGYGETSDGTKYWLVKNSWGTSWGEEGYIRMQRD-TEAEEGLCG 332

Query: 349 IAMEASYPV 357
           IAM++SYP 
Sbjct: 333 IAMDSSYPT 341


>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
 gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
 gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
          Length = 371

 Score =  333 bits (854), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 165/327 (50%), Positives = 220/327 (67%), Gaps = 5/327 (1%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFAD 102
           +D+ +  +Y+ W  +H       G   +RF  FKDN+R+I EHN    R Y++ LN+F D
Sbjct: 38  SDEALWDLYERW-QEHHHVPRHHGEKHRRFGAFKDNVRYIHEHNKRGGRGYRLRLNRFGD 96

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           +  EE+RA + G+ ++  RR   +      +  +   +LP +VDWR KGAV  VKDQG C
Sbjct: 97  MGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKC 156

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           GSCWAFSTV +VEGIN I TG L+SLSEQEL+DCD   N+GC GGLM+ AF++I  +GG+
Sbjct: 157 GSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYIKHSGGI 216

Query: 223 DSEQDYPYLGAENKCDPSR-RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
            +E  YPY  A   CD  R R A +V IDG+++V    E +L KAVA+QPVSVAI+AG +
Sbjct: 217 TTESAYPYRAANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQ 276

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
           +FQ Y  GVF G+CG+ LDHGV  VGYG T +G +YW+V+NSWG+ WGE GY+++QR+  
Sbjct: 277 SFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRD-S 335

Query: 341 DTNTGKCGIAMEASYPVKNSQNSAKPK 367
             + G CGIAMEASYPVK S N   P+
Sbjct: 336 GYDGGLCGIAMEASYPVKFSPNRVTPR 362


>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
 gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|223946183|gb|ACN27175.1| unknown [Zea mays]
 gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 385

 Score =  333 bits (854), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 177/356 (49%), Positives = 222/356 (62%), Gaps = 26/356 (7%)

Query: 24  AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFI 83
           + D SI+ Y +  D SS     + +  +++ WL++H +    +    +RFQ+FKDNL  I
Sbjct: 36  SGDFSIVGY-SEEDLSS----HESLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHI 90

Query: 84  DEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD---- 139
           DE N    +Y +GLN+FADLT++E++A YLG RS             +    +  +    
Sbjct: 91  DETNRKVSSYWLGLNEFADLTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDG 150

Query: 140 -ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
             LP+SVDWR KGAV  VK+QG CGSCWAFSTVAAVEGIN+IVTG L +LSEQEL+DCD 
Sbjct: 151 ASLPKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDT 210

Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR--------------NA 244
             N GCNGGLMDYAF +I  NGG+ +E+ YPYL  E  C  S                +A
Sbjct: 211 DGNNGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDDA 270

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
            VV+I GYEDV   +E +L KA+A QPVSVAIEA GR FQ Y  GVF G CG+ LDHGV 
Sbjct: 271 AVVTISGYEDVPRNNEQALLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVA 330

Query: 305 AVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           AVGYGT   G DY +V+NSWG  WGE GY++++R       G CGI   ASYP KN
Sbjct: 331 AVGYGTAAKGHDYIIVKNSWGPSWGEKGYIRMRRG-TGKRQGLCGINKMASYPTKN 385


>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
 gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  333 bits (854), Expect = 9e-89,   Method: Compositional matrix adjust.
 Identities = 162/308 (52%), Positives = 219/308 (71%), Gaps = 10/308 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W++++GK        EKRF IFKDN+ FI+  N+  N+ YK+ +N  ADLT +E++A
Sbjct: 40  HEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLDEFKA 99

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
               +R+  K+  +  + A+  +  +    +PE+VDWR KGAV P+KDQG CGSCWAFST
Sbjct: 100 ----SRNGYKK--IDREFATTSFKYENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWAFST 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA+EGIN+I TG+LISLSEQELVDCD K  + GC GGLM+  F+FII+NGG+ SE +YP
Sbjct: 154 VAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSETNYP 213

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y  A+  C+ +   A V  I GYE V    E+SL KAVA+QP+SV+I+A   +F  Y SG
Sbjct: 214 YKAADGSCN-TATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFYSSG 272

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           ++TGECG+ LDHGV AVGYG+ NG DYW+V+NSWG+ WGE GY+++QR + D   G CGI
Sbjct: 273 IYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKE-GLCGI 331

Query: 350 AMEASYPV 357
           AM++SYP 
Sbjct: 332 AMDSSYPT 339


>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
 gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score =  333 bits (853), Expect = 9e-89,   Method: Compositional matrix adjust.
 Identities = 162/323 (50%), Positives = 221/323 (68%), Gaps = 10/323 (3%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVG 96
           S + + D  +   ++ W+  +GK    +   E R +IFK+N+ +I+  N+   N+ YK+G
Sbjct: 28  SRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLG 87

Query: 97  LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
           +N+FAD+TNEE+    + +R+  K  +  S   +  +  +    +P +VDWR+KGAV PV
Sbjct: 88  INQFADITNEEF----IASRNKFKGHMCSSITKTSTFKYENA-SVPSTVDWRKKGAVTPV 142

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQF 215
           K+QG CG CWAFS VAA EGI+K+ TG+L+SLSEQELVDCD K ++ GC GGLMD AF+F
Sbjct: 143 KNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKF 202

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
           IIQN G+ +E  YPY G +  C  +  +    +I GYEDV   +E +L+KAVA+QP+SVA
Sbjct: 203 IIQNHGLHTEAQYPYQGVDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQPISVA 262

Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVK 334
           I+A G  FQ Y+SGVFTG CG+ LDHGV AVGYG  N G  YWLV+NSWG+DWGE GY++
Sbjct: 263 IDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGNDWGEEGYIR 322

Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
           +QR+ +D   G CGIAM ASYP 
Sbjct: 323 MQRS-VDAAQGLCGIAMMASYPT 344


>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
          Length = 340

 Score =  333 bits (853), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 162/308 (52%), Positives = 218/308 (70%), Gaps = 10/308 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W++++GK        EKRF IFKDN+ FI+  N+  N+ YK+ +N  ADLT +E++A
Sbjct: 40  HEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLDEFKA 99

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
               +R+  K+  +  + A+  +  +    +PE+VDWR KGAV P+KDQG CGSCWAFST
Sbjct: 100 ----SRNGYKK--IDREFATTSFKYENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWAFST 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA+EGIN+I TG+LISLSEQELVDCD K  + GC GGLM+  F+FII+NGG+ SE +YP
Sbjct: 154 VAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSETNYP 213

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y  A+  C  +   A V  I GYE V    E+SL KAVA+QP+SV+I+A   +F  Y SG
Sbjct: 214 YKAADGSCSAAT-TAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFYSSG 272

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           ++TGECG+ LDHGV AVGYG+ NG DYW+V+NSWG+ WGE GY+++QR + D   G CGI
Sbjct: 273 IYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKE-GLCGI 331

Query: 350 AMEASYPV 357
           AM++SYP 
Sbjct: 332 AMDSSYPT 339


>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 336

 Score =  332 bits (852), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 158/308 (51%), Positives = 217/308 (70%), Gaps = 12/308 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++GK        +KRFQIFKDN+ FI+  N+  N+ YK+G+N  ADLT EE++A
Sbjct: 38  HEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLGVNHLADLTVEEFKA 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
               +R+  KR     + ++  +  +    +P ++DWR KGAV P+KDQG CGSCWAFST
Sbjct: 98  ----SRNGFKR---PHEFSTTTFKYENVTAIPAAIDWRTKGAVTPIKDQGQCGSCWAFST 150

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           +AA EGI++I TG+L+SLSEQELVDCD K ++ GC GG M+  F+FII+NGG+ SE +YP
Sbjct: 151 IAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGGITSETNYP 210

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   + KC+  +  + V  I GYE V P  E +L+KAVA+QPVSV+I+A G  F  Y SG
Sbjct: 211 YKAVDGKCN--KATSPVAQIKGYEKVPPNSETALQKAVANQPVSVSIDADGAGFMFYSSG 268

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           ++ GECG+ LDHGV AVGYGT NG DYW+V+NSWG+ WGE GYV++QR +   + G CGI
Sbjct: 269 IYNGECGTELDHGVTAVGYGTANGTDYWIVKNSWGTQWGEKGYVRMQRGIAAKH-GLCGI 327

Query: 350 AMEASYPV 357
           A+++SYP 
Sbjct: 328 ALDSSYPT 335


>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 338

 Score =  332 bits (852), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 158/307 (51%), Positives = 217/307 (70%), Gaps = 5/307 (1%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++ W+A++G+  + +    +R ++FK N+ FI+  N+ N  + +  N+FAD+T +E+RAM
Sbjct: 33  HEQWMARYGRVYSDVAEKARRLEVFKANVGFIESVNAGNHKFWLEANQFADITKDEFRAM 92

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           + G +        K++    RYA  + D+LP SVDWR  GAV PVKDQG CG CWAFSTV
Sbjct: 93  HKGYKMQVIGS--KARATGFRYANVSIDDLPASVDWRANGAVTPVKDQGQCGCCWAFSTV 150

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           A++EGI K+ TG+LISLSEQELVDCD  + N GC GGLMD AF+FI+ NGG+D+E DYPY
Sbjct: 151 ASMEGIVKVSTGKLISLSEQELVDCDVGMQNKGCGGGLMDNAFEFIVNNGGLDTEADYPY 210

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
            GA+  C+ ++ +    SI GYEDV   DE SL+KAVA QPVS+A++ G   F+ Y+ GV
Sbjct: 211 TGADGTCNSNKESNIAASIKGYEDVPANDEASLQKAVAAQPVSIAVDGGDDLFRFYKGGV 270

Query: 291 FTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
            TG CG+ LDHGV AVGYG   +G  YWLV+NSWG+ WGE+G+++L+R++ D   G CG+
Sbjct: 271 LTGACGTELDHGVAAVGYGVAGDGTKYWLVKNSWGTSWGEDGFIRLERDVAD-EAGMCGL 329

Query: 350 AMEASYP 356
           AM+ SYP
Sbjct: 330 AMKPSYP 336


>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
 gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
 gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
 gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
 gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
          Length = 365

 Score =  332 bits (851), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 176/350 (50%), Positives = 228/350 (65%), Gaps = 16/350 (4%)

Query: 19  ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKD 78
           ++ +  +++SI+ Y +  D +S  R    +M +++ ++AK+ K  + +    +RF++FKD
Sbjct: 24  VAVAMPSELSIVGY-SEEDLASHER----LMELFEKFMAKYRKAYSSLEEKLRRFEVFKD 78

Query: 79  NLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG 138
           NL  IDE N     Y +GLN+FADLT++E++A YLG      RR    ++   RY     
Sbjct: 79  NLNHIDEENKKITGYWLGLNEFADLTHDEFKAAYLGLTLTPARRNSNDQLF--RYEEVEA 136

Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
             LP+ VDWR+KGAV  VK+QG CGSCWAFSTVAAVEGIN IVTG L  LSEQEL+DCD 
Sbjct: 137 ASLPKEVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDT 196

Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC-------DPSRRNAKVVSIDG 251
             N GC+GGLMDYAF +I  NGG+ +E+ YPYL  E  C       D     A  V+I G
Sbjct: 197 DGNNGCSGGLMDYAFSYIAANGGLHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISG 256

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
           YEDV   +E +L KA+A QPVSVAIEA GR FQ Y  GVF G CG+ LDHGV AVGYGT 
Sbjct: 257 YEDVPRNNEQALLKALAHQPVSVAIEASGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTA 316

Query: 312 N-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           + G DY +V+NSWGS WGE GY++++R     + G CGI   ASYP KN+
Sbjct: 317 SKGHDYIIVKNSWGSHWGEKGYIRMRRG-TGKHDGLCGINKMASYPTKNA 365


>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
 gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 163/309 (52%), Positives = 217/309 (70%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+AK+G+        E+RF+IF++N+ FI+  N L NR YK+ +N+FADLTNEE++ 
Sbjct: 38  HEMWMAKYGRVYKDNSEKERRFEIFRNNVEFIESFNKLGNRPYKLDINEFADLTNEEFKV 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
              G +  +   L  ++ +S RYA      +P S+DWR+ GAV P+KDQG CG CWAFS 
Sbjct: 98  SKNGYKRSSGVGL--TEKSSFRYANVTA--VPTSMDWRQNGAVTPIKDQGQCGCCWAFSA 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA+EGI K+ TG+LISLSEQELVDCD    + GC GGLMD AF+FI QNGG+ +E +YP
Sbjct: 154 VAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFIKQNGGLTTEANYP 213

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+ ++       I GYEDV    E +L KAVA QPVSVAI+A G AFQ Y  G
Sbjct: 214 YQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALLKAVASQPVSVAIDASGSAFQFYSGG 273

Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG+CG+ LDHGV AVGYGT ++G  YWLV+NSWG+ WGE+GY++++R+ ++   G CG
Sbjct: 274 VFTGDCGTELDHGVTAVGYGTSDDGTKYWLVKNSWGTSWGEDGYIRMERD-IEAKEGLCG 332

Query: 349 IAMEASYPV 357
           IAM+ SYP 
Sbjct: 333 IAMQPSYPT 341


>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
          Length = 341

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 173/360 (48%), Positives = 241/360 (66%), Gaps = 23/360 (6%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA+ + +  I  L  LF +    AA  S  +  N H+ S   R +D        W+ ++G
Sbjct: 1   MASVNQYQYI-CLALLFVL----AAWASQATARNLHEASMYERHED--------WMVQYG 47

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           +         KR++IFKDN+  I+  N +++++YK+ +N+FADLTNEE+RA    +R+  
Sbjct: 48  REYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA----SRNRF 103

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           K  +  ++  S +Y  +    +P +VDWR+KGAV P+KDQG CGSCWAFS VAA+EGI +
Sbjct: 104 KAHICSTEATSFKY--ENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161

Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           + TG+LISLSEQELVDCD    + GC+GGLMD AF+FI QN G+ +E +YPY G +  C+
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCN 221

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             +       I+GYEDV   +E +L+KAVA QP++VAI+AGG  FQ Y SGVFTG+CG+ 
Sbjct: 222 RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTE 281

Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           LDHGV AVGYGT ++G+ YWLV+NSWG+ WGE GY+++QR++     G CGIAM+ASYP 
Sbjct: 282 LDHGVSAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIAMQASYPT 340


>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
          Length = 368

 Score =  332 bits (850), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 168/328 (51%), Positives = 220/328 (67%), Gaps = 8/328 (2%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           +D+ +  +Y+ W   H +     G   +RF  FK+N RFI  HN   +R Y++ LN+F D
Sbjct: 34  SDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGD 92

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           +  EE+R+ +  +R +  RR   +  A   +      +LP SVDWR+KGAV  VK+QG C
Sbjct: 93  MGREEFRSGFADSRINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAVTAVKNQGRC 152

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           GSCWAFSTV AVEGIN I TG L+SLSEQEL+DCD   N GC GGLM+ AF+FI  +GG+
Sbjct: 153 GSCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCDTDEN-GCQGGLMENAFEFIKSHGGI 211

Query: 223 DSEQDYPYLGAENKCDPSR-RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
            +E  YPY  +   CD +R R  +VV+IDG++ V    E +L KAVA QPVSVAI+AGG+
Sbjct: 212 TTESAYPYHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGGQ 271

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
           A Q Y  GVFTG+CG+ LDHGV AVGYG +++G  YW+V+NSWG  WGE GY+++QR   
Sbjct: 272 ALQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTPYWIVKNSWGPSWGEGGYIRMQRGT- 330

Query: 341 DTNTGKCGIAMEASYPVKNSQN-SAKPK 367
             N G CGIAMEAS+P+K S N S KP+
Sbjct: 331 -GNGGLCGIAMEASFPIKTSPNPSRKPR 357


>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
 gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 168/359 (46%), Positives = 229/359 (63%), Gaps = 26/359 (7%)

Query: 1   MATASMFLAISTLVFL-FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKH 59
           +   S F+ ++ L  L  + S S+A  +  +S    H+                 W+A++
Sbjct: 3   LTKQSQFICLALLFVLGAWPSKSAARTLQDVSMYERHEQ----------------WMAQY 46

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
           G+        E R+ IFK+N+  ID  NS   ++YK+G+N+FADL+NEE++A    +R+ 
Sbjct: 47  GRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFKA----SRNR 102

Query: 119 AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
            K  +   +    RY  +    +P ++DWR+KGAV PVKDQG CG CWAFS VAA+EGIN
Sbjct: 103 FKGHMCSPQAGPFRY--ENVSAVPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGIN 160

Query: 179 KIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
           ++ TG+LISLSEQE+VDCD K  + GCNGGLMD AF+FI QN G+ +E +YPY G +  C
Sbjct: 161 QLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTGTDGTC 220

Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
           +  +       I G+EDV    E +L KAVA QPVSVAI+AGG  FQ Y SG+FTG CG+
Sbjct: 221 NTQKEATHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGT 280

Query: 298 ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
            LDHGV AVGYG  +G  YWLV+NSWG+ WGE GY+++Q++ +    G CGIAM+ASYP
Sbjct: 281 QLDHGVTAVGYGISDGTKYWLVKNSWGAQWGEEGYIRMQKD-ISAKEGLCGIAMQASYP 338


>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
          Length = 340

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 175/360 (48%), Positives = 235/360 (65%), Gaps = 24/360 (6%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA  S  + I+ L+   + S + +  +        H+ S S R +D        W+  +G
Sbjct: 1   MALESKIICITLLIMGVWASQALSRTL--------HEVSMSERHED--------WMGLYG 44

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           +T   +   E+RF+IFK+N+ +I+  NS  NR YK+ +N+FAD TNEE++A   G    +
Sbjct: 45  RTYKDIAEKERRFKIFKENVEYIESVNSAGNRRYKLSINEFADQTNEEFKASRNGYNMSS 104

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           + R   S++ S RY   A   +P S+DWR+KGAV P+KDQG CG CWAFS VAA+EG+ +
Sbjct: 105 RPR--SSEITSFRYENVAA--VPSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQ 160

Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           + TGELISLSEQELVDCD    + GC GGLMD AF+FII NGG+ +E +YPY G +  C+
Sbjct: 161 LKTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCN 220

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             +  +    I  YEDV    E +L KAVA  PVSVAI+AGG  FQ Y SGVFTG+CG+ 
Sbjct: 221 KKKAASSAAKIKNYEDVPANSEAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTE 280

Query: 299 LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           LDHGV AVGYG T++G  YWLV+NSWG+ WGE+GY+ ++R+ +  + G CGIAMEASYP 
Sbjct: 281 LDHGVTAVGYGKTDDGTKYWLVKNSWGTGWGEDGYIWMERD-IGADEGLCGIAMEASYPT 339


>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  331 bits (848), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 158/308 (51%), Positives = 213/308 (69%), Gaps = 8/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++GK        E R +IFK+N++ I+  N+  N++YK+G+N+FADLTNEE++A
Sbjct: 39  HEQWMAQYGKVYKDSYEKELRSKIFKENVQRIEAFNNAGNKSYKLGINQFADLTNEEFKA 98

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
                R+  K  +  +   +  +  +    +P S+DWR+KGAV P+KDQG CG CWAFS 
Sbjct: 99  -----RNRFKGHMCSNSTRTPTFKYEHVTSVPASLDWRQKGAVTPIKDQGQCGCCWAFSA 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI K+ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FI+QN G+++E  YP
Sbjct: 154 VAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYP 213

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+ +       SI G+EDV    E +L KAVA+QP+SVAI+A G  FQ Y SG
Sbjct: 214 YQGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSG 273

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           VFTG CG+ LDHGV AVGYG++ G  YWLV+NSWG  WGE GY+++QR++     G CG 
Sbjct: 274 VFTGSCGTELDHGVTAVGYGSDGGTKYWLVKNSWGEQWGEQGYIRMQRDVA-AEEGLCGF 332

Query: 350 AMEASYPV 357
           AM+ASYP 
Sbjct: 333 AMQASYPT 340


>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
          Length = 292

 Score =  331 bits (848), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 158/292 (54%), Positives = 210/292 (71%), Gaps = 9/292 (3%)

Query: 70  EKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           EKR +IF  N+ +I+  NS   N+ YK+ +NKFADLTNEE+    + +R+  K  +  S 
Sbjct: 5   EKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEF----IASRNKFKGHMCSSI 60

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
           + +  +  +    +P +VDWR+KGAV PVK+QG CGSCWAFS VAA EGI+++ TG+L+S
Sbjct: 61  IRTTTFKYENASAIPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVS 120

Query: 188 LSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           LSEQEL+DCD K ++ GC GGLMD AF+FIIQN G+ +E  YPY G +  C+ ++ +   
Sbjct: 121 LSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNANKASIHA 180

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V+I GYEDV   +E++L+KAVA+QP+SVAI+A G  FQ Y SGVFTG CG+ LDHGV AV
Sbjct: 181 VTITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAV 240

Query: 307 GYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GYG  N G  YWLV+NSWG+DWGE GY+++QR +     G CGIAM+ASYP 
Sbjct: 241 GYGVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAE-GLCGIAMQASYPT 291


>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 363

 Score =  331 bits (848), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 175/365 (47%), Positives = 227/365 (62%), Gaps = 16/365 (4%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           M L    L FL  + +S   D                 T++ V  +Y+ W   H  T   
Sbjct: 1   MKLFFIVLSFLCLLQASKGFDFD----------EKELETEENVWKLYERWRDHHSVTR-- 48

Query: 66  MGHNE-KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
             H   KRF +F+ N+  +   N  N+ YK+ +N+FAD+T+ E+R+ Y G+     R L 
Sbjct: 49  ASHEALKRFNVFRHNVLHVHRTNKKNKPYKLKVNRFADITHHEFRSSYAGSNVKHHRMLR 108

Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
             K  S  +  +    +P SVDWREKGAV  VK+Q  CGSCWAFSTVAAVEGINKI T +
Sbjct: 109 GPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNK 168

Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK-CDPSRRN 243
           L+SLSEQELVDCD + N GC GGLM+ AF+FI  NGG+ +E+ YPY   + + C     +
Sbjct: 169 LVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSNDVQFCRAKSID 228

Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGV 303
            + V+IDG+E V   DE +L KAVA QPVSVAI+AG   FQ Y  GVF GECG+ L+HGV
Sbjct: 229 GETVTIDGHEHVPENDEEALLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGV 288

Query: 304 VAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           V VGYG T+NG  YW+VRNSWG +WGE GYV+++R + + N G+CGIAMEASYP K S  
Sbjct: 289 VIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISE-NEGRCGIAMEASYPTKVSST 347

Query: 363 SAKPK 367
            + P+
Sbjct: 348 PSTPE 352


>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
           C-169]
          Length = 481

 Score =  331 bits (848), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 159/289 (55%), Positives = 208/289 (71%), Gaps = 5/289 (1%)

Query: 70  EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR-LMKSKV 128
           E++F ++ DNL F+  HN  + T+K+GL  FADLT++EYR   LG R + K   L   K 
Sbjct: 67  ERKFSVWLDNLEFVHSHNEKDSTFKLGLTNFADLTHDEYRQHALGYRPELKGTGLGTGKS 126

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
              +YA     E P S+DWR+KGAV  VK+Q  CGSCWAFST  +VEG N I +GEL+SL
Sbjct: 127 TGFQYA---DYEAPPSIDWRKKGAVTDVKNQQQCGSCWAFSTTGSVEGANAIYSGELVSL 183

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           SEQELVDCD   + GC+GGLMD+AF FII+NGG+D+E+DY Y   +  C+ ++    VV+
Sbjct: 184 SEQELVDCDVTQDHGCHGGLMDFAFSFIIRNGGIDTEKDYKYKAQDGVCNIAKEKRHVVT 243

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
           ID YEDV P DE +LKKA A+QP+SVAIEA  R FQ Y  GVF   CG+ALDHGV+ VGY
Sbjct: 244 IDSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGVFDAPCGTALDHGVLVVGY 303

Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           G++NG DYW+V+NSWG  WG++GY++L R + ++  G+CGIAM+ASYP+
Sbjct: 304 GSDNGTDYWIVKNSWGDFWGDSGYIRLARGISNS-AGQCGIAMQASYPI 351


>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
           Precursor
 gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
           thaliana]
 gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
 gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 364

 Score =  331 bits (848), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 176/359 (49%), Positives = 227/359 (63%), Gaps = 8/359 (2%)

Query: 16  LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQI 75
           LFFI   S   +   S   + D      T++ V  +Y+ W   H   S       KRF +
Sbjct: 3   LFFIVLISFLSLLQASKGFDFD-EKELETEENVWKLYERWRGHH-SVSRASHEAIKRFNV 60

Query: 76  FKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC 135
           F+ N+  +   N  N+ YK+ +N+FAD+T+ E+R+ Y G+     R L   K  S  +  
Sbjct: 61  FRHNVLHVHRTNKKNKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGFMY 120

Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
           +    +P SVDWREKGAV  VK+Q  CGSCWAFSTVAAVEGINKI T +L+SLSEQELVD
Sbjct: 121 ENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVD 180

Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK-CDPSRRNAKVVSIDGYED 254
           CD + N GC GGLM+ AF+FI  NGG+ +E+ YPY  ++ + C  +    + V+IDG+E 
Sbjct: 181 CDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEH 240

Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENG 313
           V   DE  L KAVA QPVSVAI+AG   FQ Y  GVF GECG+ L+HGVV VGYG T+NG
Sbjct: 241 VPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNG 300

Query: 314 VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSSA 372
             YW+VRNSWG +WGE GYV+++R + + N G+CGIAMEASYP K    S+ P  H S 
Sbjct: 301 TKYWIVRNSWGPEWGEGGYVRIERGISE-NEGRCGIAMEASYPTK---LSSTPSTHESV 355


>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 415

 Score =  330 bits (847), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 163/356 (45%), Positives = 234/356 (65%), Gaps = 14/356 (3%)

Query: 3   TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
           T++  +A  +L FL  I + + A  ++ + D   D S   R        ++ W+AK+G+ 
Sbjct: 70  TSTPTMASRSLGFLIAILACTCAVSALAARDLTDDLSMVAR--------HEQWMAKYGRV 121

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
            N +    +R ++FK N+ FI+  N+ N  + +  N+FAD+T +E+RA + G +      
Sbjct: 122 YNDVAEKAQRLEVFKANVAFIELVNAGNDKFSLEANQFADMTVDEFRAAHTGYKPVPAN- 180

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
             K +    +YA  + D LP S+DWR KGAV P+KDQG CG CWAFSTVA+VEGI K+ T
Sbjct: 181 --KGRTTQFKYANVSLDALPASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLST 238

Query: 183 GELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
           G+LISLSEQELVDCD   ++ GC GGLMD AF+FII NGG+ +E +YPY G ++ C+ ++
Sbjct: 239 GKLISLSEQELVDCDVDGMDQGCEGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDSCNSNK 298

Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
            +  V SI GYEDV   DE SL KAVA QPVS+A++ G   F+ Y+ GV +G CG+ LDH
Sbjct: 299 ESNDVASIKGYEDVPSNDETSLLKAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDH 358

Query: 302 GVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           G+ AVGYG T +G  +WL++NSWG+ WGE G+++++R++ D   G CG+AM+ SYP
Sbjct: 359 GIAAVGYGITSDGTKFWLMKNSWGTSWGEKGFIRMERDIAD-EEGLCGLAMQPSYP 413


>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 380

 Score =  330 bits (846), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 180/340 (52%), Positives = 232/340 (68%), Gaps = 19/340 (5%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFA 101
           +++ +  +Y+ W A+H   S  +    +RF +F++N R + E N L R   YK+ LN+FA
Sbjct: 41  SEESLWALYERWRARH-TVSRDLAEKSRRFNVFRENARLVHEFN-LRRDAPYKLRLNRFA 98

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKA-------GDELPESVDWREKGAVN 154
           DLT++E+R  Y  +R  +  R+ K + A+              G  LP SVDWREKGAV 
Sbjct: 99  DLTSDEFRRSYASSRV-SHHRMFKPRAANNNDDDDDKGSSFTHGGALPTSVDWREKGAVT 157

Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQ 214
            VKDQG CGSCWAFST+AAVEGIN I T  L SLSEQ+LVDCD K NAGC+GGLMD AF 
Sbjct: 158 GVKDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNAGCDGGLMDDAFS 217

Query: 215 FIIQNGGMDSEQDYPYLGAE-NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
           +I ++GG+ +E+ YPY   + + C+  +  A VVSIDGYEDV   DE +LKKAVA QPV+
Sbjct: 218 YIAKHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALKKAVAAQPVA 277

Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGY 332
           VAIEAGG  FQ Y  GVF G+CG+ LDHGV AVGYG T +G  YW+V+NSWG +WGE GY
Sbjct: 278 VAIEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVTVDGTKYWIVKNSWGEEWGEKGY 337

Query: 333 VKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSSA 372
           ++++R++ D   G CGIAMEASYPVK S N   PK H++A
Sbjct: 338 IRMKRDVADKE-GLCGIAMEASYPVKTSPN---PK-HAAA 372


>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
          Length = 366

 Score =  330 bits (846), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 177/368 (48%), Positives = 233/368 (63%), Gaps = 17/368 (4%)

Query: 1   MATASMFLAI-STLVFLF----FISSSSAA---DMSIISYDNNHDHSSSWRTDDEVMTIY 52
           MA  +  LA+ S L  LF    F++ S+ A   D S++ Y             ++++ ++
Sbjct: 1   MAGNNSLLAMDSKLSMLFLLLGFVACSATASHHDPSVVGYSQE-----DLALPNKLVGLF 55

Query: 53  QTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMY 112
            +W  KH K         KR++IFK NLR I E N  N +Y +GLN FAD+ +EE++A Y
Sbjct: 56  TSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRRNGSYWLGLNHFADIAHEEFKASY 115

Query: 113 LGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           LG +    RR  +   ++  RYA      LP +VDWR+KGAV PVK+QG CGSCWAFSTV
Sbjct: 116 LGLKPGLARRDAQPHGSTTFRYANAV--NLPWAVDWRKKGAVTPVKNQGECGSCWAFSTV 173

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
           AAVEGIN+IVTG+L+SLSEQEL+DCD   N GC GGLMD+AF +I+ N G+ +E+DYPYL
Sbjct: 174 AAVEGINQIVTGKLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYL 233

Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
             E  C   + ++KV++I GYEDV    E SL KA+A QPVSV I AG R FQ Y+ G+F
Sbjct: 234 MEEGYCREKQPHSKVITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIF 293

Query: 292 TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
            GECG   DH + AVGYG+  G DY +++NSWG +WGE GY +++R       G C I  
Sbjct: 294 DGECGIQPDHALTAVGYGSYYGQDYIIMKNSWGKNWGEQGYFRIRRG-TGKPEGVCDIYK 352

Query: 352 EASYPVKN 359
            ASYP KN
Sbjct: 353 IASYPTKN 360


>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
 gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  330 bits (845), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 159/308 (51%), Positives = 217/308 (70%), Gaps = 9/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRA 110
           ++ W+++ G+  N     E R++IFK+N++ I+  N  + ++YK+G+N+FADLTNEE++ 
Sbjct: 39  HEEWMSRFGRVYNDGNEKEIRYKIFKENVQRIESFNKASGKSYKLGINQFADLTNEEFKT 98

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
               +R+  K  +  S+    RY        P S+DWR+KGAV  +KDQG CGSCWAFS 
Sbjct: 99  ----SRNRFKGHMCSSQAGPFRYENLTA--APSSMDWRKKGAVTAIKDQGQCGSCWAFSA 152

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAAVEGI ++ T +LISLSEQELVDCD K  + GC GGLMD AF+FI QN G+ +E +YP
Sbjct: 153 VAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQGLTTEANYP 212

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G++  C+  +       I+G+EDV   +E +L KAVA QPVSVAI+AGG  FQ Y SG
Sbjct: 213 YEGSDGTCNTKQEANHAAKINGFEDVPANNEGALMKAVAKQPVSVAIDAGGFGFQFYSSG 272

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           +FTG+CG+ LDHGV AVGYG  NG++YWLV+NSWG+ WGE GY+++Q++ +D   G CGI
Sbjct: 273 IFTGDCGTELDHGVAAVGYGESNGMNYWLVKNSWGTQWGEEGYIRMQKD-IDAKEGLCGI 331

Query: 350 AMEASYPV 357
           AM+ASYP 
Sbjct: 332 AMQASYPT 339


>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
 gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score =  330 bits (845), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 159/316 (50%), Positives = 218/316 (68%), Gaps = 9/316 (2%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN--SLNRTYKVGLNKFADL 103
           D +   ++ W++++ K        E+R +IF  N+ +I+  N  + N+ YK+G+N+FADL
Sbjct: 34  DSMYERHEQWMSQYSKVYKDPQEREERHKIFTANVNYIEVFNNDANNKLYKLGINQFADL 93

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           TNEE+    + +R+  K  +  S   +  +  +    +P +VDWR+KGAV PVK+QG CG
Sbjct: 94  TNEEF----IASRNKFKGHMCSSIAKTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCG 149

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
            CWAFS VAA EGI K+ TG+L+SLSEQELVDCD K ++ GC GGLMD AF+FIIQN G+
Sbjct: 150 CCWAFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGL 209

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E  YPY G +  C+ ++ +    +I GYEDV   +E +L+KAVA+QP+SVAI+A G  
Sbjct: 210 STEAAYPYQGVDGTCNANKASIHAATITGYEDVPANNEQALQKAVANQPISVAIDASGSD 269

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y+SGVF+G CG+ LDHGV AVGYG  N G  YWLV+NSWG+DWGE GY+++QR  +D
Sbjct: 270 FQFYKSGVFSGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIRMQRG-VD 328

Query: 342 TNTGKCGIAMEASYPV 357
              G CGIAM+ASYP 
Sbjct: 329 AAEGLCGIAMQASYPT 344


>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
 gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  329 bits (844), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 160/315 (50%), Positives = 209/315 (66%), Gaps = 15/315 (4%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
           +++TW  +HGK+         R ++F+DN  F+ +HNS  N +Y + LN FADLT+ E++
Sbjct: 28  LFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFK 87

Query: 110 AMYLGTRSD----AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
              LG  +     A R L  + V           ++P S+DWR KG V  VKDQGSCG+C
Sbjct: 88  TSRLGLSAAPLNLAHRNLEITGVVG---------DIPASIDWRNKGVVTNVKDQGSCGAC 138

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           W+FS   A+EGINKIVTG L+SLSEQEL++CD+  N GC GGLMDYAFQF+I N G+D+E
Sbjct: 139 WSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTE 198

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
           +DYPY   +  C+  R   +VV+ID Y DV   +E  L +AVA QPVSV I    RAFQ 
Sbjct: 199 EDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQM 258

Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           Y  G+FTG C ++LDH V+ VGYG+ENGVDYW+V+NSWG+ WG  GY+ +QRN  ++  G
Sbjct: 259 YSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQ-G 317

Query: 346 KCGIAMEASYPVKNS 360
            CGI M ASYPVK S
Sbjct: 318 VCGINMLASYPVKTS 332


>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  329 bits (843), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 163/316 (51%), Positives = 215/316 (68%), Gaps = 10/316 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           T+  ++  ++ W+AK+ K        EKRF IFKDN+ FI+  N+  N+ YK+G+N  AD
Sbjct: 33  TETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVEFIESFNAAGNKPYKLGVNHLAD 92

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           LT EE++A    +R+  KR     +V +  +  +    +P SVDWR+KGAV P+KDQG C
Sbjct: 93  LTIEEFKA----SRNGLKRSY-DYEVGTTSFKYENVTAIPASVDWRKKGAVTPIKDQGQC 147

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGG 221
           GSCWAFSTVAA EGI+KI TG+L+SLSEQELVDCDRK  + GC GG M+  F+FII+NGG
Sbjct: 148 GSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGTDQGCEGGYMEDGFEFIIKNGG 207

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E +YPY   +  C  +   A    I GYE V    E +L KAVA+QPVSV+I+A   
Sbjct: 208 ITTEANYPYKAVDGSCKNA--TAPAAQIKGYEKVPVNSEKALLKAVANQPVSVSIDAADG 265

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           +F  Y SG+FTGECG+ LDHGV AVGYG  NG DYW+V+NSWG+ WGE GY+++QR +  
Sbjct: 266 SFMFYSSGIFTGECGTELDHGVTAVGYGRANGTDYWIVKNSWGTVWGEQGYIRMQRGIA- 324

Query: 342 TNTGKCGIAMEASYPV 357
              G CGIAM++SYP 
Sbjct: 325 AKEGLCGIAMDSSYPT 340


>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
          Length = 357

 Score =  329 bits (843), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 170/351 (48%), Positives = 225/351 (64%), Gaps = 12/351 (3%)

Query: 13  LVFLFFISSSSAA---DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
            + L F++ S+ A   D S++ Y             ++++ ++ +W  KH K        
Sbjct: 9   FLLLGFVACSATASHHDPSVVGYSQE-----DLALPNKLVGLFTSWSVKHSKIYASPKEK 63

Query: 70  EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
            KR++IFK NLR I E N  N +Y +GLN FAD+ +EE++A YLG +    RR  +   +
Sbjct: 64  VKRYEIFKRNLRHIVETNRRNGSYWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGS 123

Query: 130 SQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
           +  RYA      LP +VDWR+KGAV PVK+QG CGSCWAFSTVAAVEGIN+IVTG+L+SL
Sbjct: 124 TTFRYANAV--NLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSL 181

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           SEQEL+DCD   N GC GGLMD+AF +I+ N G+ +E+DYPYL  E  C   + ++KV++
Sbjct: 182 SEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSKVIT 241

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
           I GYEDV    E SL KA+A QPVSV I AG R FQ Y+ G+F GECG   DH + AVGY
Sbjct: 242 ITGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHALTAVGY 301

Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           G+  G DY +++NSWG +WGE GY +++R       G C I   ASYP KN
Sbjct: 302 GSYYGQDYIIMKNSWGKNWGEQGYFRIRRG-TGKPEGVCDIYKIASYPTKN 351


>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 439

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 156/308 (50%), Positives = 218/308 (70%), Gaps = 8/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ +HGK        EKRF+IF +N+ +++  +N+ N+ YK+G+N+F DLTN+E+  
Sbjct: 135 HEQWMTRHGKVYKDPREREKRFRIFNENVNYVEAFNNAANKPYKLGINQFXDLTNQEF-- 192

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  R+  K  +  S + +  +  +    +P +VDWR+ GAV PVKDQG CG CWAFS 
Sbjct: 193 --IAPRNRFKGHMCSSIIRTTTFKYENVTTVPSTVDWRQNGAVTPVKDQGQCGCCWAFSA 250

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI+ +  G+LISLSEQELVDCD K ++ GC GGLMD A++FIIQN G+++E +YP
Sbjct: 251 VAATEGIHALSGGKLISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNHGLNTEANYP 310

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G + KC+ +       +I GYEDV   +E +L+KAVA+QPVSVAI+A    FQ Y+SG
Sbjct: 311 YKGVDGKCNANEAANHAATITGYEDVPANNEKALQKAVANQPVSVAIDASSSDFQFYKSG 370

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
            FTG CG+ LDHGV AVGYG +++G  YWLV+NSWG++WGE GY+++QR  +D+  G CG
Sbjct: 371 AFTGSCGTELDHGVTAVGYGVSDHGTKYWLVKNSWGTEWGEEGYIRMQRG-VDSEEGVCG 429

Query: 349 IAMEASYP 356
           IAM+ASYP
Sbjct: 430 IAMQASYP 437


>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1 [Vitis vinifera]
          Length = 341

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 171/360 (47%), Positives = 239/360 (66%), Gaps = 23/360 (6%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA+ + +  I  L  LF +    AA  S  +  N H+ S   R +D        W+A++G
Sbjct: 1   MASVNQYQYI-CLALLFVL----AAWASQATARNLHEASMYERHED--------WMAQYG 47

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           +         KR++IFKDN+  I+  N +++++YK+ +N+FADLTNEE+      +R+  
Sbjct: 48  RVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFGT----SRNRF 103

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           K  +  ++  S +Y  +    +P ++DWR+KGAV P+KDQG CGSCWAFS VAA+EGI +
Sbjct: 104 KAHICSTEATSFKY--ENVTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161

Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           + TG+LISLSEQELVDCD    + GCNGGLMD AF+FI QN G+ +E +YPY G +  C+
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCN 221

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             +       I+GYEDV   +E +L+KAV  QP++VAI+AGG  FQ Y SGVFTG+CG+ 
Sbjct: 222 RKKAAHPAAKINGYEDVPANNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTE 281

Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           LDHGV AVGYGT ++G+ YWLV+NSWG+ WGE GY+++QR++     G CGIAM+ASYP 
Sbjct: 282 LDHGVAAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIAMQASYPT 340


>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
 gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
          Length = 369

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 174/364 (47%), Positives = 234/364 (64%), Gaps = 14/364 (3%)

Query: 12  TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEK 71
           TL+ +  ++ S+      I +D     S     D+ +  +Y+ W   H    +  G   +
Sbjct: 7   TLLLVALVAMSAVELCRAIEFDERDLAS-----DEALWDLYERWQTHHHVHRH-HGEKGR 60

Query: 72  RFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTR-SDAKRRLMKSKVA 129
           RF  FK+N+RFI  HN   +R Y++ LN+F D+  EE+R+ +  +R +D +R    +  A
Sbjct: 61  RFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDMGREEFRSTFADSRINDLRRAESPAAPA 120

Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
              +      +LP SVDWR++GAV  VKDQG CGSCWAFSTV +VEGIN I TG L+SLS
Sbjct: 121 VPGFMYDGVTDLPPSVDWRKEGAVTAVKDQGHCGSCWAFSTVVSVEGINAIRTGSLVSLS 180

Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR-RNAKVVS 248
           EQEL+DCD   N GC GGLM+ AF+FI   GG+ +E  YPY  +   CD  R R  ++VS
Sbjct: 181 EQELIDCDTDEN-GCQGGLMENAFEFIKSYGGVTTESAYPYRASNGTCDSVRSRRGQIVS 239

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
           IDG++ V    E +L KAVA+QPVSVAI+AGG+AFQ Y  GVFTG+CG+ LDHGV AVGY
Sbjct: 240 IDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGY 299

Query: 309 G-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPK 367
           G +++G  YW+V+NSWG  WGE GY+++QR     N G CGIAMEAS+P+K S N A+ K
Sbjct: 300 GVSDDGTAYWIVKNSWGPSWGEGGYIRMQRGA--GNGGLCGIAMEASFPIKTSPNPAR-K 356

Query: 368 PHSS 371
           P  +
Sbjct: 357 PRRA 360


>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
 gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 155/309 (50%), Positives = 204/309 (66%), Gaps = 8/309 (2%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
           +++ W  ++GKT +       R ++F++N  F+ +HNS+ N +Y + LN FADLT+ E++
Sbjct: 28  LFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFADLTHHEFK 87

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
           A  LG      + +       Q         +P +VDWR+ GAV  VKDQG+CG CW+FS
Sbjct: 88  ASRLGFSPGRAQSIRSVGTPVQEL------HVPPAVDWRKSGAVTGVKDQGNCGGCWSFS 141

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           T  A+EGINKIVTG L+SLSEQELVDCDR  N+GC GGLMDYA+QF+I+N G+DSE DYP
Sbjct: 142 TTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKNQGIDSEADYP 201

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y+G +  C+  +    +V+IDGY D+ P DE  L + VA QPVSV I    + FQ Y  G
Sbjct: 202 YVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGSEKTFQLYSKG 261

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           V+TG C S LDH V+ VGYGTE+GVD+W+V+NSWG  WG  GY+ + RN   T  G CGI
Sbjct: 262 VYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGMRGYIHMLRN-NGTAEGICGI 320

Query: 350 AMEASYPVK 358
            M ASYP K
Sbjct: 321 NMLASYPAK 329


>gi|449532567|ref|XP_004173252.1| PREDICTED: oryzain alpha chain-like [Cucumis sativus]
          Length = 321

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 151/200 (75%), Positives = 175/200 (87%)

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           GSCWAFS+VAAVEGIN+IVTGELI LSEQELVDCD+  N GCNGGLMDYAFQFII NGG+
Sbjct: 13  GSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGI 72

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
           D+E+DYPY G +  CDP+R+NAKVV+IDGYEDV   DE SLKKAVA+QPVSVAIEAGGRA
Sbjct: 73  DTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRA 132

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           FQ Y+SGVFTG CG+ LDHGVVAVGYGT+NG DYW+VRNSWG DWGE+GY++L+RN+ + 
Sbjct: 133 FQLYQSGVFTGRCGTDLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIRLERNVANI 192

Query: 343 NTGKCGIAMEASYPVKNSQN 362
            TGKCGIA++ SYP K+  N
Sbjct: 193 TTGKCGIAVQPSYPTKSGAN 212


>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
 gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  328 bits (841), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 158/309 (51%), Positives = 216/309 (69%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A +GK        E+RF+IFK+N+ +I+  N+  N+ YK+ +NKFAD TNE+++ 
Sbjct: 38  HEQWMATYGKVYVDAAEKERRFKIFKNNVEYIESFNTAGNKPYKLSVNKFADQTNEKFKG 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
              G R   + R MK  V S +Y       +P ++DWR+KGAV P+KDQG CGSCWAFST
Sbjct: 98  ARNGYRRPFQTRPMK--VTSFKYENVTA--VPATMDWRKKGAVTPIKDQGQCGSCWAFST 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGIN++ TG+L+SLSEQELVDCD +  + GC GGLM+  F+FII+N G+ +E +YP
Sbjct: 154 VAATEGINQLTTGKLVSLSEQELVDCDNQGEDQGCEGGLMEDGFEFIIKNHGITTEANYP 213

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y  A+  C+  ++ + +  I GYE V    E  L K VA+QP+SV+I+AGG  FQ Y SG
Sbjct: 214 YQAADGTCNSKKQASHIAKITGYESVPANSEAELLKVVANQPISVSIDAGGSDFQFYSSG 273

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG+CG+ LDHGV AVGYG T +G  YWLV+NSW + WGE GY+++QR+ +D   G CG
Sbjct: 274 VFTGKCGTELDHGVTAVGYGETSDGTKYWLVKNSWXTSWGEEGYIRMQRD-IDAEEGLCG 332

Query: 349 IAMEASYPV 357
           IAM++SYP 
Sbjct: 333 IAMDSSYPT 341


>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
          Length = 368

 Score =  328 bits (841), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 162/325 (49%), Positives = 215/325 (66%), Gaps = 4/325 (1%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           +D+ +  +Y+ W  +H       G   +RF  FKDN+R+I EHN     Y   LN+F D+
Sbjct: 38  SDEALWDLYERW-QEHHHVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPGYPP-LNRFGDM 95

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
             EE+RA + G+ ++  RR   +      +  +   +LP +VDWR KGAV  VKDQG CG
Sbjct: 96  GREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCG 155

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
           SCWAFSTV +VEGIN I TG L+SLSEQEL+DCD   N+GC GGLM+ AF++I  +GG+ 
Sbjct: 156 SCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYIKHSGGIT 215

Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
           +E  YPY  A   CD  R    +V IDG+++V    E +L KAVA+QPVSVAI+AG ++F
Sbjct: 216 TESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSF 275

Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           Q Y  GVF G+CG+ LDHGV  VGYG T +G +YW+V+NSWG+ WGE GY+++QR+    
Sbjct: 276 QFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRD-SGY 334

Query: 343 NTGKCGIAMEASYPVKNSQNSAKPK 367
           + G CGIAMEASYPVK S N   P+
Sbjct: 335 DGGLCGIAMEASYPVKFSPNRVTPR 359


>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
          Length = 368

 Score =  328 bits (841), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 162/325 (49%), Positives = 215/325 (66%), Gaps = 4/325 (1%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           +D+ +  +Y+ W  +H       G   +RF  FKDN+R+I EHN     Y   LN+F D+
Sbjct: 38  SDEALWDLYERW-QEHHHVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPGYAP-LNRFGDM 95

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
             EE+RA + G+ ++  RR   +      +  +   +LP +VDWR KGAV  VKDQG CG
Sbjct: 96  GREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCG 155

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
           SCWAFSTV +VEGIN I TG L+SLSEQEL+DCD   N+GC GGLM+ AF++I  +GG+ 
Sbjct: 156 SCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYIKHSGGIT 215

Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
           +E  YPY  A   CD  R    +V IDG+++V    E +L KAVA+QPVSVAI+AG ++F
Sbjct: 216 TESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSF 275

Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           Q Y  GVF G+CG+ LDHGV  VGYG T +G +YW+V+NSWG+ WGE GY+++QR+    
Sbjct: 276 QFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRD-SGY 334

Query: 343 NTGKCGIAMEASYPVKNSQNSAKPK 367
           + G CGIAMEASYPVK S N   P+
Sbjct: 335 DGGLCGIAMEASYPVKFSPNRVTPR 359


>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
 gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  328 bits (841), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 159/309 (51%), Positives = 217/309 (70%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A +GK        E+RF+IFK+N+ +I+  N+  N+ YK+ +NKFAD TNE+++ 
Sbjct: 38  HEQWMATYGKVYVDAAEKERRFKIFKNNVEYIESFNTAGNKPYKLSVNKFADQTNEKFKG 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
              G R   + R MK  V S +Y       +P ++DWR+KGAV  +KDQG CGSCWAFST
Sbjct: 98  ARNGYRRPFQTRPMK--VTSFKYENVTA--VPATMDWRKKGAVTLIKDQGQCGSCWAFST 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGIN++ TG+L+SLSEQELVDCD +  + GC GGLM+  F+FII+N G+ +E +YP
Sbjct: 154 VAATEGINQLTTGKLVSLSEQELVDCDIQGEDQGCEGGLMEDGFEFIIKNHGITTEANYP 213

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y  A+  C+  ++ + +  I GYE V    E  L K VA+QP+SV+I+AGG  FQ Y SG
Sbjct: 214 YQAADGTCNSKKQASHIAKITGYESVPANSEAELLKVVANQPISVSIDAGGSDFQFYSSG 273

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG+CG+ LDHGV AVGYG T +G  YWLV+NSWG+ WGE GY+++QR+ +DT  G CG
Sbjct: 274 VFTGKCGTELDHGVTAVGYGETSDGTKYWLVKNSWGTSWGEEGYIRMQRD-IDTEEGLCG 332

Query: 349 IAMEASYPV 357
           IAM++SYP 
Sbjct: 333 IAMDSSYPT 341


>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
          Length = 348

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 160/321 (49%), Positives = 219/321 (68%), Gaps = 12/321 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           TD  +  +Y+ W ++H   S      +KRF +FK N+  I+  N L + YK+ LN+FAD+
Sbjct: 32  TDKSLWDLYERWGSQH-MVSRAPDEKKKRFNVFKYNVNHINRVNQLGKPYKLKLNEFADM 90

Query: 104 TNEEYRAMYLGTRSDAK---RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
           TN E++A +     D+K    R++K K     +      + P S+DWR  GAVNP+K+QG
Sbjct: 91  TNHEFKAGF-----DSKILHFRMLKGKRRQTPFTHAKTTDPPPSIDWRTNGAVNPIKNQG 145

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
            CGSCWAFST+  VEGINKI T +L+SLSEQELVDC+     GCNGGLM+  ++FI + G
Sbjct: 146 RCGSCWAFSTIVGVEGINKIKTNQLVSLSEQELVDCETDCE-GCNGGLMENGYEFIKETG 204

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+ +EQ YPY     +CD S+RN+ VV IDG+E+V   DE ++ +AVA+QPVS+AI+AGG
Sbjct: 205 GVTTEQIYPYFARNGRCDISKRNSPVVKIDGFENVPANDESAMLRAVANQPVSIAIDAGG 264

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y  GVF G CG+ L+HGV  VGYG T++G +YW+VRNSWG+ WGE GYV++QR  
Sbjct: 265 LNFQFYSQGVFNGACGTELNHGVAIVGYGTTQDGTNYWIVRNSWGTGWGEQGYVRMQRG- 323

Query: 340 LDTNTGKCGIAMEASYPVKNS 360
           ++   G CG+AM+ASYP+K S
Sbjct: 324 VNVPEGLCGLAMDASYPIKAS 344


>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
          Length = 344

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 162/310 (52%), Positives = 217/310 (70%), Gaps = 8/310 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++GK        EKRFQIFKDN+ FI+  N+  N+ YK+G+N  ADLT EE++ 
Sbjct: 38  HENWMAEYGKIYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKD 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFS 169
              G +   +      K+   +Y  +   ++PE++DWR KGAV P+KDQG  CGSCWAFS
Sbjct: 98  SRNGLKRTYEFSTTTFKLNGFKY--ENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFS 155

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           TVAA EGI +I TG L+SLSEQELVDCD  ++ GC+GGLM+  F+FII+NGG+ SE +YP
Sbjct: 156 TVAATEGIYQISTGMLMSLSEQELVDCD-SVDHGCDGGLMEDGFEFIIKNGGISSEANYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   +  CD S+  +    I GYE V    E +L++AVA+QPVSV+I+AGG  FQ Y SG
Sbjct: 215 YTAVDGTCDASKEASPAAQIKGYETVPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSG 274

Query: 290 VFTGECGSALDHGVVAVGYG-TENGV-DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           VFTG+CG+ LDHGV  VGYG T++G  +YW+V+NSWG+ WGE GY+++QR  +D   G C
Sbjct: 275 VFTGQCGTQLDHGVTVVGYGTTDDGTHEYWIVKNSWGTQWGEEGYIRMQRG-IDALEGLC 333

Query: 348 GIAMEASYPV 357
           GIAM+ASYP 
Sbjct: 334 GIAMDASYPT 343


>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
          Length = 342

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 159/292 (54%), Positives = 207/292 (70%), Gaps = 4/292 (1%)

Query: 47  EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNE 106
           +V+ ++++ L KH K          RF+IF DNL+ IDE N     Y +GLN+FADLT+E
Sbjct: 44  KVIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHE 103

Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           E++  +LG + +   R  +S +   RY  +   +LP+SVDWR+KGAV+PVK+QG CGSCW
Sbjct: 104 EFKNKFLGFKGELAERKDES-IEQFRY--RDFVDLPKSVDWRKKGAVSPVKNQGQCGSCW 160

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           AFSTVAAVEGIN+IVTG L  LSEQEL+DCD   N GCNGGLMDYAF ++ +NG +  E+
Sbjct: 161 AFSTVAAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGGLMDYAFAYVTRNG-LHKEE 219

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
           +YPY+ +E  CD  R  ++ V+I GY DV   +E S  KA+A+QP+SVAIEA GR FQ Y
Sbjct: 220 EYPYIMSEGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFY 279

Query: 287 ESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
             GVF G CG+ LDHGV AVGYGT  G+DY +VRNSWG  WGE GY++++RN
Sbjct: 280 SGGVFDGHCGTELDHGVAAVGYGTSKGLDYVIVRNSWGPKWGEKGYIRMKRN 331


>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 337

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 174/359 (48%), Positives = 223/359 (62%), Gaps = 25/359 (6%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA+      I  LV L  I +S      +      H+ S S R        ++ W+ K+G
Sbjct: 1   MASIGKKQHILALVLLLSICTSQVMSRYL------HEASMSER--------HEQWMKKYG 46

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K        +KR  IFKDN+ FI+  N+  N+ YK+G+N  AD TNEE+ A + G +  A
Sbjct: 47  KVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLGINHLADQTNEEFVASHNGYKHKA 106

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
                  K     Y    G  +P +VDWRE GAV  VKDQG CGSCWAFSTVAA EGI +
Sbjct: 107 SHSQTPFK-----YENVTG--VPNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQ 159

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           I T  L+SLSEQELVDCD  ++ GC+GG M+  F+FII+NGG+ SE +YPY   +  CD 
Sbjct: 160 ITTSMLMSLSEQELVDCD-SVDHGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDA 218

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
           ++  +    I GYE V    E +L+KAVA+QPVSV I+AGG AFQ Y SGVFTG+CG+ L
Sbjct: 219 NKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQL 278

Query: 300 DHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           DHGV AVGYG T++G  YW+V+NSWG+ WGE GY+++QR   D   G CGIAM+ASYP 
Sbjct: 279 DHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRG-TDAQEGLCGIAMDASYPT 336


>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
          Length = 358

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 160/308 (51%), Positives = 210/308 (68%), Gaps = 7/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           Y+ WL +HG+        ++ F I++ N+RFI+  N+ N ++ +  N+FAD+TNEEY+A+
Sbjct: 45  YERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQNFSFTLTDNQFADMTNEEYKAL 104

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           Y+G  +    R  +S    +R        LP SVDWR+ GAV PV++QG CGSCWAFSTV
Sbjct: 105 YMGLGTSETSRKNQSSFKRERSKV-----LPISVDWRKMGAVTPVRNQGECGSCWAFSTV 159

Query: 172 AAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           AAVEGINKI TG+L+SLSEQEL+DCD    N GCNGG M  AF+FI QNGG+ + ++YPY
Sbjct: 160 AAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPY 219

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
           +G +  C+  +    VV I GYE V P +E  L+ AVA QPVSVAI+AGG  FQ Y  G+
Sbjct: 220 IGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGI 279

Query: 291 FTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
           F G CG  L+H V  +GYG +NG  YWLV+NSWG+ WGE GY ++ R+  D + G CGIA
Sbjct: 280 FNGFCGKQLNHAVTVIGYGEDNGKKYWLVKNSWGTGWGEAGYARMIRDSRD-DEGICGIA 338

Query: 351 MEASYPVK 358
           MEASYP+K
Sbjct: 339 MEASYPIK 346


>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
          Length = 359

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 171/371 (46%), Positives = 233/371 (62%), Gaps = 22/371 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M  A +F  +  ++ +      +A  M I   D          +++ +  +Y+ W + H 
Sbjct: 3   MGKAFLFAVVLAVILV------AAMSMEITERD--------LASEESLWDLYERWRSHH- 47

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
             S  +    KRF +FK N+  I + N  ++ YK+ LN FAD+TN E+R  Y    S  K
Sbjct: 48  TVSRDLSEKRKRFNVFKANVHHIHKVNQKDKPYKLKLNSFADMTNHEFREFY---SSKVK 104

Query: 121 R-RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
             R++    A+  +     + LP SVDWR++GAV  VK+QG CGSCWAFSTV  VEGINK
Sbjct: 105 HYRMLHGSRANTGFMHGKTESLPASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINK 164

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           I TG+L+SLSEQELVDC+   N GCNGGLM+ A++FI ++GG+ +E+ YPY   +  CD 
Sbjct: 165 IKTGQLVSLSEQELVDCETD-NEGCNGGLMENAYEFIKKSGGITTERLYPYKARDGSCDS 223

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGSA 298
           S+ NA  V+IDG+E V   DE +L KAVA+QPVSVAI+A G   Q Y  GV+ G+ CG+ 
Sbjct: 224 SKMNAPAVTIDGHEMVPANDENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNE 283

Query: 299 LDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           LDHGV  VGYGT  +G  YW+V+NSWG+ WGE GY+++QR +     G CGIAMEASYP+
Sbjct: 284 LDHGVAVVGYGTALDGTKYWIVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPL 343

Query: 358 KNSQNSAKPKP 368
           K S ++ KP P
Sbjct: 344 KLSSHNPKPSP 354


>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
 gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
          Length = 341

 Score =  327 bits (839), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 167/315 (53%), Positives = 216/315 (68%), Gaps = 11/315 (3%)

Query: 46  DEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
           D +M + ++ W+A++G+         KRF IFK+N+ +I+  N    + YK+G+N FADL
Sbjct: 32  DSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAGTKPYKLGINAFADL 91

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           TN+E++A    +R+  K     S     RY  +    +P +VDWR KGAV PVKDQG CG
Sbjct: 92  TNQEFKA----SRNGYKLPHDCSSNTPFRY--ENVSSVPTTVDWRTKGAVTPVKDQGQCG 145

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
            CWAFS VAA+EGI K+ TG LISLSEQELVDCD K I+ GC GGLMD AF FII N G+
Sbjct: 146 CCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFSFIINNKGL 205

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E +YPY G +  C  S+ +     I GYEDV    E +L+KAVA+QPVSVAI+AGG  
Sbjct: 206 TTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGSD 265

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y SGVFTGECG+ LDHGV AVGYG  E+G  YWLV+NSWG+ WGE GY+++Q++ ++
Sbjct: 266 FQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKD-IE 324

Query: 342 TNTGKCGIAMEASYP 356
              G CGIAM++SYP
Sbjct: 325 AKEGLCGIAMQSSYP 339


>gi|356552228|ref|XP_003544471.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 351

 Score =  327 bits (838), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 174/353 (49%), Positives = 229/353 (64%), Gaps = 15/353 (4%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           M +AI  L  +F +SS  A DMSIIS+DN H   ++ RTDDEVM++++ WL KH K  N 
Sbjct: 1   MNMAIVLLFMVFAVSS--ALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNA 58

Query: 66  MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
           +G  EKRFQIFK+NLRFIDE NSLNRTYK+GLN FADLTN EYRAMYL T  D  R  + 
Sbjct: 59  LGEKEKRFQIFKNNLRFIDERNSLNRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLD 118

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFSTVAAVEGINKIVTGE 184
           +      Y  + GD +P+SVDWR++GAV PVK+QG +C SCWAF+ V AVE + KI TG+
Sbjct: 119 TP-PRNHYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGD 177

Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           LISLSEQE+VDC    + GC GG + + + +I +N G+  E+DYPY G E KCD +++NA
Sbjct: 178 LISLSEQEVVDCTTSSSRGCGGGDIQHGYIYIRKN-GISLEKDYPYRGDEGKCDSNKKNA 236

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
            +V+IDG+  V    E +L +A+                Q    GVF G+CG+ L+H ++
Sbjct: 237 -IVTIDGHGWVPTQLEEALNRALFCYCAYFLYVDKFFLCQ----GVFKGKCGTELNHALL 291

Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            VGYGTE   DYW+ +NS+   WGENGY+++QR L       C       YP+
Sbjct: 292 LVGYGTEKDGDYWIAKNSYSDKWGENGYIRIQRKL-----STCKFGNGGYYPI 339


>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
           Precursor
 gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  327 bits (838), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 177/337 (52%), Positives = 233/337 (69%), Gaps = 14/337 (4%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGL 97
           + S R + EV+T+Y+ WL ++GK  NG+G  E+RF+IFKDNL+ I+EHNS  NR+Y+ GL
Sbjct: 28  TESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGL 87

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP-V 156
           NKF+DLT +E++A YLG + + K     S VA +RY  K GD LP+ VDWRE+GAV P V
Sbjct: 88  NKFSDLTADEFQASYLGGKMEKKSL---SDVA-ERYQYKEGDVLPDEVDWRERGAVVPRV 143

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
           K QG CGSCWAF+   AVEGIN+I TGEL+SLSEQEL+DCDR   N GC GG   +AF+F
Sbjct: 144 KRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEF 203

Query: 216 IIQNGGMDSEQDYPYLGAEN-KCDP-SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
           I +NGG+ S++ Y Y G +   C     +  +VV+I+G+E V   DEMSLKKAVA QP+S
Sbjct: 204 IKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPIS 263

Query: 274 VAIEAGGRAFQHYESGVFTGECGSAL-DHGVVAVGYGTENGV-DYWLVRNSWGSDWGENG 331
           V I A       Y+SGV+ G C +   DH V+ VGYGT +   DYWL+RNSWG +WGE G
Sbjct: 264 VMISAAN--MSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGG 321

Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
           Y++LQRN  +  TGKC +A+   YP+K++ +S    P
Sbjct: 322 YLRLQRNFHEP-TGKCAVAVAPVYPIKSNSSSHLLSP 357


>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 343

 Score =  327 bits (838), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 157/308 (50%), Positives = 210/308 (68%), Gaps = 8/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++ WL  H K   G      RF I++ N++ ID  NSL+  +K+  N+FAD+TN E++A 
Sbjct: 43  FEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAH 102

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           +LG  + + R   K     QR  C     +P++VDWR +GAV P+++QG CG CWAFS V
Sbjct: 103 FLGLNTSSLRLHKK-----QRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAV 157

Query: 172 AAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           AA+EGINKI TG L+SLSEQ+L+DCD    N GC+GGLM+ AF+FI  NGG+ +E DYPY
Sbjct: 158 AAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKSNGGLTTETDYPY 217

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
            G E  CD  +   KVV+I GY+ V+  +E SL+ A A QPVSV I+AGG  FQ Y SGV
Sbjct: 218 TGIEGTCDQEKAKNKVVTIQGYQKVAQ-NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGV 276

Query: 291 FTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
           FT  CG+ L+HGV  VGYG E    YW+V+NSWG+ WGE GY++++R + + +TGKCGIA
Sbjct: 277 FTSYCGTNLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGISE-DTGKCGIA 335

Query: 351 MEASYPVK 358
           M ASYP++
Sbjct: 336 MLASYPLQ 343


>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
          Length = 354

 Score =  327 bits (838), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 160/308 (51%), Positives = 210/308 (68%), Gaps = 7/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           Y+ WL +HG+        ++ F I++ N+RFI+  N+ N ++ +  N+FAD+TNEEY+A+
Sbjct: 41  YERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQNFSFTLTDNQFADMTNEEYKAL 100

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           Y+G  +    R  +S    +R        LP SVDWR+ GAV PV++QG CGSCWAFSTV
Sbjct: 101 YMGLGTSETSRKNQSSFKRERSKV-----LPISVDWRKMGAVTPVRNQGECGSCWAFSTV 155

Query: 172 AAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           AAVEGINKI TG+L+SLSEQEL+DCD    N GCNGG M  AF+FI QNGG+ + ++YPY
Sbjct: 156 AAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPY 215

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
           +G +  C+  +    VV I GYE V P +E  L+ AVA QPVSVAI+AGG  FQ Y  G+
Sbjct: 216 IGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGI 275

Query: 291 FTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
           F G CG  L+H V  +GYG +NG  YWLV+NSWG+ WGE GY ++ R+  D + G CGIA
Sbjct: 276 FNGFCGKQLNHAVTVIGYGEDNGKKYWLVKNSWGTGWGEAGYARMIRDSRD-DEGICGIA 334

Query: 351 MEASYPVK 358
           MEASYP+K
Sbjct: 335 MEASYPIK 342


>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
 gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
          Length = 346

 Score =  327 bits (837), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 157/307 (51%), Positives = 210/307 (68%), Gaps = 6/307 (1%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFADLTNEEYRAMY 112
           W+ KHG+    +     R+ +FK N+  I+  N++   RT+K+ +N+FADLTN+E+R+MY
Sbjct: 41  WMTKHGRVYADVKEKSNRYVVFKSNVERIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMY 100

Query: 113 LGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
            G +   +     ++K  S RY   +   LP SVDWR KGAV P+K+QGSCG CWAFS V
Sbjct: 101 TGFKGVSSLSSQSQTKTTSFRYQNVSSGALPISVDWRTKGAVTPIKNQGSCGCCWAFSAV 160

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
           AA+EG  +I  G+LISLSEQ+LVDCD   + GC GGLMD AF+ I+  GG+ +E +YPY 
Sbjct: 161 AAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIMATGGLTTESNYPYK 219

Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
           G +  C+  + N K  SI GYEDV   DE +L KAVA QPVSV IE GG  FQ Y SGVF
Sbjct: 220 GEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVF 279

Query: 292 TGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
           TGEC + LDH V A+GYG + NG  YW+++NSWG+ WGE+GY+++Q+++ D   G CG+A
Sbjct: 280 TGECTTYLDHAVTAIGYGQSTNGSKYWIIKNSWGTKWGESGYMRIQKDIKDKQ-GLCGLA 338

Query: 351 MEASYPV 357
           M+ASYP 
Sbjct: 339 MKASYPT 345


>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
 gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
 gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
          Length = 346

 Score =  327 bits (837), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 158/307 (51%), Positives = 212/307 (69%), Gaps = 6/307 (1%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFADLTNEEYRAMY 112
           W+ KHG+    +     R+ +FK+N+  I+  NS+   RT+K+ +N+FADLTN+E+R+MY
Sbjct: 41  WMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMY 100

Query: 113 LGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
            G +   A     ++K++  RY   +   LP SVDWR+KGAV P+K+QGSCG CWAFS V
Sbjct: 101 TGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAV 160

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
           AA+EG  +I  G+LISLSEQ+LVDCD   + GC GGLMD AF+ I   GG+ +E +YPY 
Sbjct: 161 AAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIKATGGLTTESNYPYK 219

Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
           G +  C+  + N K  SI GYEDV   DE +L KAVA QPVSV IE GG  FQ Y SGVF
Sbjct: 220 GEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVF 279

Query: 292 TGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
           TGEC + LDH V A+GYG + NG  YW+++NSWG+ WGE+GY+++Q+++ D   G CG+A
Sbjct: 280 TGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQ-GLCGLA 338

Query: 351 MEASYPV 357
           M+ASYP 
Sbjct: 339 MKASYPT 345


>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
           vulgaris gb|U52970 and is a member of the papain
           cysteine protease family PF|00112 [Arabidopsis thaliana]
 gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 343

 Score =  327 bits (837), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 157/308 (50%), Positives = 210/308 (68%), Gaps = 8/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++ WL  H K   G      RF I++ N++ ID  NSL+  +K+  N+FAD+TN E++A 
Sbjct: 43  FEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAH 102

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           +LG  + + R   K     QR  C     +P++VDWR +GAV P+++QG CG CWAFS V
Sbjct: 103 FLGLNTSSLRLHKK-----QRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAV 157

Query: 172 AAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           AA+EGINKI TG L+SLSEQ+L+DCD    N GC+GGLM+ AF+FI  NGG+ +E DYPY
Sbjct: 158 AAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPY 217

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
            G E  CD  +   KVV+I GY+ V+  +E SL+ A A QPVSV I+AGG  FQ Y SGV
Sbjct: 218 TGIEGTCDQEKSKNKVVTIQGYQKVAQ-NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGV 276

Query: 291 FTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
           FT  CG+ L+HGV  VGYG E    YW+V+NSWG+ WGE GY++++R + + +TGKCGIA
Sbjct: 277 FTNYCGTNLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGVSE-DTGKCGIA 335

Query: 351 MEASYPVK 358
           M ASYP++
Sbjct: 336 MMASYPLQ 343


>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
          Length = 340

 Score =  327 bits (837), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 159/309 (51%), Positives = 212/309 (68%), Gaps = 10/309 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRA 110
           ++ W+A +G+    +   +KR++IF++N+  I+  N   N+ YK+ +N+FADLTNEE++A
Sbjct: 38  HEEWMASYGRVYKDINEKQKRYKIFEENVALIESSNKDANKPYKLSVNQFADLTNEEFKA 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
               +R+  K  +  +K  S +Y   +   +P ++DWR KGAV PVKDQG CG CWAFS 
Sbjct: 98  ----SRNRFKGHICSTKSTSFKYGNVSA--VPSAMDWRMKGAVTPVKDQGQCGCCWAFSA 151

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI K+ TGELISLSEQELVDCD   ++ GC GGLMD AF FI  N G+ SE +YP
Sbjct: 152 VAATEGITKLTTGELISLSEQELVDCDTSGVDQGCEGGLMDNAFTFIQHNHGLASEANYP 211

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+ +++      I+G+EDV    E +L  AVA QPVSVAI+AGG  FQ Y  G
Sbjct: 212 YKGVDGTCNTNKQAIHAAEINGFEDVPANSEEALLNAVAHQPVSVAIDAGGSGFQFYSKG 271

Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VF G CG+ LDHGV AVGYGT ++G  YWLV+NSWG+ WGE GY+++QR+ +D   G CG
Sbjct: 272 VFIGACGTQLDHGVTAVGYGTSDDGTKYWLVKNSWGTQWGEEGYIRMQRD-VDAKEGLCG 330

Query: 349 IAMEASYPV 357
           IAM+ASYP 
Sbjct: 331 IAMKASYPT 339


>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
          Length = 339

 Score =  327 bits (837), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 166/315 (52%), Positives = 215/315 (68%), Gaps = 11/315 (3%)

Query: 46  DEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
           D +M + ++ W+A++G+         KRF IFK+N+ +I+  N    + YK+G+N FADL
Sbjct: 30  DSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAGTKPYKLGINAFADL 89

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           TN+E++A    +R+  K     S     RY  +    +P +VDWR KGAV PVKDQG CG
Sbjct: 90  TNQEFKA----SRNGYKLPHDCSSNTPFRY--ENVSSVPTTVDWRTKGAVTPVKDQGQCG 143

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
            CWAFS VAA+EGI K+ TG LISLSEQELVDCD K  + GC GGLMD AF FII N G+
Sbjct: 144 CCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSFIINNKGL 203

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E +YPY G +  C  S+ +     I GYEDV    E +L+KAVA+QPVSVAI+AGG  
Sbjct: 204 TTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGSD 263

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y SGVFTGECG+ LDHGV AVGYG  E+G  YWLV+NSWG+ WGE GY+++Q++ ++
Sbjct: 264 FQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKD-IE 322

Query: 342 TNTGKCGIAMEASYP 356
              G CGIAM++SYP
Sbjct: 323 AKEGLCGIAMQSSYP 337


>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score =  327 bits (837), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 156/308 (50%), Positives = 213/308 (69%), Gaps = 6/308 (1%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++GK        EKRFQIFKDN+ FI+  N+  N+ YK+G+N  ADLT EE++ 
Sbjct: 38  HENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKD 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFS 169
              G +   +      K+   +Y  +   ++PE++DWR KGAV P+KDQG  CGSCWAFS
Sbjct: 98  SRNGLKRTYEFSTTTFKLNGFKY--ENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFS 155

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           T+AA EGI++I TG L+SLSEQELVDCD  ++ GC GG M+  F+FII+NGG+ SE +YP
Sbjct: 156 TIAATEGIHQISTGNLVSLSEQELVDCD-SVDDGCEGGFMEDGFEFIIKNGGITSETNYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+ +   + V  I GYE V  + E +L+KAVA+QPVSV+I A    F  Y SG
Sbjct: 215 YKGVDGTCNTTIAASPVAQIKGYEIVPSYSEEALQKAVANQPVSVSIHATNATFMFYSSG 274

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           ++ GECG+ LDHGV AVGYGTENG DYW+V+NSWG+ WGE GY+++ R +   + G CGI
Sbjct: 275 IYNGECGTDLDHGVTAVGYGTENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKH-GICGI 333

Query: 350 AMEASYPV 357
           A+++SYP 
Sbjct: 334 ALDSSYPT 341


>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  326 bits (836), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 157/309 (50%), Positives = 220/309 (71%), Gaps = 10/309 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++G+         KR++IFKDN+  I+  N +++++YK+ +N+FADLTNEE+RA
Sbjct: 39  HEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA 98

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
               +R+  K  +  ++  S +Y  +    +P +VDWR+KGAV P+KDQG CGSCWAFS 
Sbjct: 99  ----SRNRFKAHICSTEATSFKY--ENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSA 152

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA+EGI ++ TG+LISLSEQELVDCD    + GC+GGLMD AF+FI QN G+ +E +YP
Sbjct: 153 VAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYP 212

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+  +       I+GYEDV   +E +L+KAVA QP++VAI+A G  FQ Y SG
Sbjct: 213 YAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSSG 272

Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG+CG+ LDHGV AVGYGT ++G+ YWLV+NSW + WGE GY+++QR++     G CG
Sbjct: 273 VFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVT-AKEGLCG 331

Query: 349 IAMEASYPV 357
           IAM+ASYP 
Sbjct: 332 IAMQASYPT 340


>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
          Length = 340

 Score =  326 bits (836), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 156/309 (50%), Positives = 214/309 (69%), Gaps = 13/309 (4%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++ W+A+HG+          RF+IF+ N+  I+  N+ N  +K+G+N+FADLTNEE++  
Sbjct: 41  HEQWMAQHGRVYKNAAEKAHRFEIFRANVERIESFNAENHKFKLGVNQFADLTNEEFKT- 99

Query: 112 YLGTRSDAKRRLMKSKVASQR-YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
                   +  L  SK+AS + +  +    +P ++DWR KGAV P+KDQG CGSCWAFS 
Sbjct: 100 --------RNTLKPSKMASTKSFKYENVTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSA 151

Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI K+ TG+LISLSEQE+VDCD    + GCNGG MD AF++II+N G+ +E +YP
Sbjct: 152 VAATEGITKLSTGKLISLSEQEVVDCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYP 211

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y  A+  C+  +  +   SI GYEDV+   E +L KA A+QP++VAI+AG  AFQ Y SG
Sbjct: 212 YKAADGTCNTKKAASHAASITGYEDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSG 271

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG+CG+ LDHGV  VGYG T +G  YWLV+NSWG+ WGE+GY++++R+ +D   G CG
Sbjct: 272 VFTGDCGTDLDHGVTLVGYGATSDGTKYWLVKNSWGTSWGEDGYIRMERD-VDAKEGLCG 330

Query: 349 IAMEASYPV 357
           IAM+ASYP 
Sbjct: 331 IAMDASYPT 339


>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
          Length = 341

 Score =  326 bits (836), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 157/309 (50%), Positives = 220/309 (71%), Gaps = 10/309 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++G+         KR++IFKDN+  I+  N +++++YK+ +N+FADLTNEE+RA
Sbjct: 39  HEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA 98

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
               +R+  K  +  ++  S +Y  +    +P +VDWR+KGAV P+KDQG CGSCWAFS 
Sbjct: 99  ----SRNRFKAHICSTEATSFKY--ENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSA 152

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA+EGI ++ TG+LISLSEQELVDCD    + GC+GGLMD AF+FI QN G+ +E +YP
Sbjct: 153 VAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYP 212

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+  +       I+GYEDV   +E +L+KAVA QP++VAI+A G  FQ Y SG
Sbjct: 213 YAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSSG 272

Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG+CG+ LDHGV AVGYGT ++G+ YWLV+NSW + WGE GY+++QR++     G CG
Sbjct: 273 VFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVT-VKEGLCG 331

Query: 349 IAMEASYPV 357
           IAM+ASYP 
Sbjct: 332 IAMQASYPT 340


>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 376

 Score =  326 bits (836), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 178/337 (52%), Positives = 230/337 (68%), Gaps = 14/337 (4%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGL 97
           + S R + EV TIY+ WL +HGK  NG+G  E+RF+IFKDNL+ I+EHNS  NR+Y  GL
Sbjct: 28  TESHRNEAEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSYDRGL 87

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP-V 156
           N+F+DLT +E++A YLG + + K     S VA +RY  K GD LP+ VDWRE+GAV P V
Sbjct: 88  NQFSDLTVDEFQASYLGGKIEKKSL---SDVA-ERYQYKEGDILPDEVDWRERGAVVPRV 143

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
           K QG CGSCWAF+   AVEGIN+I TGEL+SLSEQEL+DCDR K N GC GG   +AF+F
Sbjct: 144 KRQGDCGSCWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWAFEF 203

Query: 216 IIQNGGMDSEQDYPYLGAEN-KCDP-SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
           I +NGG+ +++DY Y G +   C     +  +VV+I+G+E V   DEMSLKKAV+ QP+S
Sbjct: 204 IKENGGIVTDEDYGYTGDDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVSYQPIS 263

Query: 274 VAIEAGGRAFQHYESGVFTGECGSAL-DHGVVAVGYGTENGV-DYWLVRNSWGSDWGENG 331
           V I A       Y+SGV+ G C +   DH V+ VGYGT +   DYWL+RNSWG  WGE G
Sbjct: 264 VMISAAN--MSDYKSGVYKGPCSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPGWGEGG 321

Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
           Y++LQRN  +  TGKC +A+   YP+K +  S    P
Sbjct: 322 YLRLQRN-FNEPTGKCAVAVAPVYPIKTNSASNLLSP 357


>gi|313118764|gb|ADR32294.1| C14 cysteine protease [Solanum stoloniferum]
          Length = 217

 Score =  326 bits (836), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 151/217 (69%), Positives = 183/217 (84%), Gaps = 1/217 (0%)

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           P SVDWR+KG +  VKDQGSCGSCWAFS VAA+E IN IVTG LISLSEQELVDCD+  N
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GC+GGLMDYAF+F+I NGG+DSE+DYPY      CD  R+NAKVV ID YEDV   +E 
Sbjct: 62  QGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNGVCDQYRKNAKVVVIDSYEDVPVNNEK 121

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           +L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVVA GYGTENG+DYW+VRN
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGLDYWIVRN 181

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           SWG+DWGE GY+++QRN+  +++G CG+A+E SYPVK
Sbjct: 182 SWGADWGEKGYLRVQRNVA-SSSGLCGLAIEPSYPVK 217


>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
 gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
          Length = 337

 Score =  326 bits (836), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 174/359 (48%), Positives = 222/359 (61%), Gaps = 25/359 (6%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA+      I  LV L  I +S       +   N H+ S S R        ++ W+ K+G
Sbjct: 1   MASIGKKQHILALVLLLSICTSQ------VMSRNLHEASMSER--------HEQWMKKYG 46

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K        +KR  IFKDN+ FI+  N+  NR YK+ +N  AD TNEE+ A + G +   
Sbjct: 47  KVYKDAAEKQKRLLIFKDNVEFIESFNAAGNRPYKLSINHLADQTNEEFVASHNGYKHKG 106

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
                  K     Y    G  +P +VDWRE GAV  VKDQG CGSCWAFSTVAA EGI +
Sbjct: 107 SHSQTPFK-----YENVTG--VPNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQ 159

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           I T  L+SLSEQELVDCD  ++ GC+GG M+  F+FII+NGG+ SE +YPY   +  CD 
Sbjct: 160 ITTSMLMSLSEQELVDCD-SVDHGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDA 218

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
           ++  +    I GYE V    E +L+KAVA+QPVSV I+AGG AFQ Y SGVFTG+CG+ L
Sbjct: 219 NKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQL 278

Query: 300 DHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           DHGV AVGYG T++G  YW+V+NSWG+ WGE GY+++QR   D   G CGIAM+ASYP 
Sbjct: 279 DHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRG-TDAQEGLCGIAMDASYPT 336


>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 164/348 (47%), Positives = 222/348 (63%), Gaps = 25/348 (7%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           LVF F    ++A  +  +S    H+                 W+ ++GK        E R
Sbjct: 16  LVFGFLAFEANARTLEDVSLKERHEQ----------------WMTQYGKVYTDSYEKELR 59

Query: 73  FQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
             IFK+N++ I+  N+  N+ YK+G+N+FADLTNEE++A     R+  K  +  +   + 
Sbjct: 60  SNIFKENVQRIEAFNNAGNKPYKLGINQFADLTNEEFKA-----RNRFKGHMCSNSTRTP 114

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
            +  +    +P S+DWR+KGAV P+KDQG CG CWAFS VAA EGI K+ TG+LISLSEQ
Sbjct: 115 TFKYEDVSSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQ 174

Query: 192 ELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
           ELVDCD K ++ GC GGLMD AF+FI+QN G+++E  YPY G +  C+ +       SI 
Sbjct: 175 ELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIK 234

Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG- 309
           G+EDV    E +L KAVA+QP+SVAI+A G  FQ Y SG+FTG CG+ LDHGV AVGYG 
Sbjct: 235 GFEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGV 294

Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           +++G  YWLV+NSWG  WGE GY+++QR++     G CGIAM+ASYP 
Sbjct: 295 SDDGTKYWLVKNSWGEQWGEEGYIRMQRDVA-AEEGLCGIAMQASYPT 341


>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
          Length = 350

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 173/359 (48%), Positives = 230/359 (64%), Gaps = 30/359 (8%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           I  LV L  I +S       +   N H+ S S R        ++ W+ K+GK        
Sbjct: 10  ILALVLLLSICTSQ------VMSRNLHEASMSER--------HEQWMKKYGKVYKDAAEK 55

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
           +KR  IFKDN+ FI+  N+  N+ YK+ +N  AD TNEE+ A + G          K K 
Sbjct: 56  QKRLLIFKDNVEFIESFNAAGNKPYKLSINHLADQTNEEFVASHNG---------YKYKG 106

Query: 129 ASQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +  +   K G+  ++P +VDWR+ GAV  VKDQG CGSCWAFSTVAA EGI +I TG L+
Sbjct: 107 SHSQTPFKYGNVTDIPTAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLM 166

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           SLSEQELVDCD  ++ GC+GGLM+  F+FII+NGG+ SE +YPY   +  CD S+  +  
Sbjct: 167 SLSEQELVDCD-SVDHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPA 225

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
             I GYE V    E +L++AVA+QPVSV+I+AGG  FQ Y SGVFTG+CG+ LDHGV  V
Sbjct: 226 AQIKGYETVPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVV 285

Query: 307 GYG-TENGV-DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNS 363
           GYG T++G  +YW+V+NSWG+ WGE GY+++QR  +D   G CGIAM+ASYP+  S +S
Sbjct: 286 GYGTTDDGTHEYWIVKNSWGTQWGEEGYIRMQRG-IDAQEGLCGIAMDASYPMGKSSDS 343


>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 367

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 175/345 (50%), Positives = 226/345 (65%), Gaps = 24/345 (6%)

Query: 35  NHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYK 94
           +HD +S    +D +  +Y+ W  +H   +  +G   +RF +F++N+R I E N  +  YK
Sbjct: 34  DHDLAS----EDSLWALYERWREQH-TVARDLGEKARRFNVFRENVRLIHEFNRGDAPYK 88

Query: 95  VGLNKFADLTNEEYRAMYLGTRSDAKRRL---------MKSKVASQRYACKAGDELPESV 145
           + LN+F D+T +E+R  Y  +R    R           M    AS R       ++P SV
Sbjct: 89  LRLNRFGDMTADEFRRAYASSRVSHHRMFSLKEGGGGFMHGSAASVR-------DVPPSV 141

Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
           DWR+KGAV  VKDQG CGSCWAFST+AAVEGIN I +  L SLSEQ+LVDCD K NAGCN
Sbjct: 142 DWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNAGCN 201

Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
           GGLMDYAFQ+I ++GG+ +E  YPY  A      +++ + VV+IDGYEDV   DE +LKK
Sbjct: 202 GGLMDYAFQYIAKHGGVAAEDAYPYK-ARQASSCNKKPSAVVTIDGYEDVPANDETALKK 260

Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWG 324
           AVA QPV+VAIEA G  FQ Y  GVF G+CG+ LDHGV AVGYGT  +G  YW+V+NSWG
Sbjct: 261 AVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVKNSWG 320

Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPH 369
            +WGE GY++++R++ D   G CGIAMEASYPVK S N      H
Sbjct: 321 PEWGEKGYIRMKRDVKDKE-GLCGIAMEASYPVKTSANPKHAGAH 364


>gi|313118760|gb|ADR32292.1| C14 cysteine protease [Solanum stoloniferum]
          Length = 217

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 150/217 (69%), Positives = 183/217 (84%), Gaps = 1/217 (0%)

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           P SVDWR+KG +  VKDQGSCGSCWAFS VAA+E IN IVTG LISLSEQELVDCD+  N
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GC+GGLMDYAF+F+I NGG+DSE+DYPY    + CD  R+NAKVV ID YEDV   +E 
Sbjct: 62  EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           +L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVVA GYGTENG+DYW+VRN
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           SWG++WGE GY+++QRN+  +++G CG+A E SYPVK
Sbjct: 182 SWGANWGEKGYLRVQRNIA-SSSGLCGLATEPSYPVK 217


>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
 gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
          Length = 356

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 170/351 (48%), Positives = 225/351 (64%), Gaps = 11/351 (3%)

Query: 13  LVFLFF--ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
           ++FL F   S+S   D S++ Y             + ++ ++++W  KH K         
Sbjct: 9   VLFLAFAACSASHHRDPSVVGYSQE-----DLALPNRLVNLFKSWSVKHRKIYVSPKEKL 63

Query: 71  KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
           KR+ IFK NL  I E N  N +Y +GLN+FAD+T+EE++A +LG +    R   +++  +
Sbjct: 64  KRYGIFKQNLMHIAETNRKNGSYWLGLNQFADITHEEFKANHLGLKQGLSRMGAQTRTPT 123

Query: 131 Q-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
             RYA  A   LP SVDWR KGAV PVK+QG CGSCWAFS+VAAVEGIN+IVTG+L+SLS
Sbjct: 124 TFRYAAAA--NLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLS 181

Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
           EQEL+DCD  ++ GC GGLMD+AF +I+ + G+ +E DYPYL  E  C   +  A VV+I
Sbjct: 182 EQELMDCDTMLDHGCEGGLMDFAFAYIMGSQGIHAEDDYPYLMEEGYCKEKQPYANVVTI 241

Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
            GYEDV    E+SL KA+A QPVSV I AG R FQ Y+ GVF G C   LDH + AVGYG
Sbjct: 242 TGYEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYKGGVFDGSCSDELDHALTAVGYG 301

Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           +  G +Y  ++NSWG +WGE GYV+++        G CGI   ASYPVKN+
Sbjct: 302 SSYGQNYITMKNSWGKNWGEQGYVRIKMG-TGKPEGVCGIYTMASYPVKNA 351


>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
 gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
 gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
 gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 160/308 (51%), Positives = 211/308 (68%), Gaps = 10/308 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+ +HGK        EKRF IFKDN+ FI+  N+  N+ YK+ +N  ADLT +E++A
Sbjct: 40  HEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKLSVNHLADLTLDEFKA 99

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
               +R+  K+   +    S +Y       +P +VDWR KGAV P+KDQG CGSCWAFST
Sbjct: 100 ----SRNGYKKIDREFTTTSFKYENVTA--IPAAVDWRVKGAVTPIKDQGQCGSCWAFST 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGIN+I TG+L+SLSEQELVDCD K  + GC GGLM+  F+FII+NGG+ SE +YP
Sbjct: 154 VAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSETNYP 213

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y  A+  C+ +     V  I GYE V    E SL KAVA+QP+SV+I+A   +F  Y SG
Sbjct: 214 YKAADGSCNTAT-TTPVAKITGYEKVPVNSEKSLLKAVANQPISVSIDASDSSFMFYSSG 272

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           ++TGECG+ LDHGV AVGYG+ NG DYW+V+NSWG+ WGE GY+++QR +     G CGI
Sbjct: 273 IYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIA-AKEGLCGI 331

Query: 350 AMEASYPV 357
           AM++SYP 
Sbjct: 332 AMDSSYPT 339


>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
          Length = 346

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 158/307 (51%), Positives = 211/307 (68%), Gaps = 6/307 (1%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFADLTNEEYRAMY 112
           W+ KHG+    +     R+ +FK+N+  I+  NS+   RT+K+ +N+FADLTN+E+ +MY
Sbjct: 41  WMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMY 100

Query: 113 LGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
            G +   A     ++K++  RY   +   LP SVDWR+KGAV P+K+QGSCG CWAFS V
Sbjct: 101 TGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAV 160

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
           AA+EG  +I  G+LISLSEQ+LVDCD   + GC GGLMD AF+ I   GG+ +E DYPY 
Sbjct: 161 AAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIKATGGLTTESDYPYK 219

Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
           G +  C+  + N K  SI GYEDV   DE +L KAVA QPVSV IE GG  FQ Y SGVF
Sbjct: 220 GEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVF 279

Query: 292 TGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
           TGEC + LDH V A+GYG + NG  YW+++NSWG+ WGE+GY+++Q+++ D   G CG+A
Sbjct: 280 TGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQ-GLCGLA 338

Query: 351 MEASYPV 357
           M+ASYP 
Sbjct: 339 MKASYPT 345


>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
          Length = 433

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 167/362 (46%), Positives = 236/362 (65%), Gaps = 30/362 (8%)

Query: 4   ASMFLAISTLV-FLFFISSSSAA-DMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHG 60
           A++  +IS ++ F FF  ++ AA D+S                DD VM   ++ W+A++ 
Sbjct: 95  ATLKASISAIIGFAFFCGAAMAARDLS----------------DDSVMVARHEQWMAQYS 138

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           +         +RF++FK N++FI+  N+  N  + +G+N+FADLTN+E+R+    T+++ 
Sbjct: 139 RVYKDASEKARRFEVFKANVQFIESFNAGGNNKFWLGVNQFADLTNDEFRS----TKTNK 194

Query: 120 KRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
             +    K+ +  RY   + D LP ++DWR KGAV P+KDQG CG CWAFS VAA EGI 
Sbjct: 195 GLKSSNMKIPTGFRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIV 254

Query: 179 KIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
           KI TG+L+SL+EQELVDCD    + GC GGLMD AF+FII+NGG+ +E  YPY  A+ KC
Sbjct: 255 KISTGKLVSLAEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKC 314

Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
                +A   +I GYEDV   DE +L KAVA+QPVSVA++ G   FQ Y  GV TG CG+
Sbjct: 315 KSGSNSA--ATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGT 372

Query: 298 ALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
            LDHG+ A+GYG T +G  YWL++NSWG+ WGENGY+++++++ D   G CG+AME SYP
Sbjct: 373 DLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKR-GMCGLAMEPSYP 431

Query: 357 VK 358
            +
Sbjct: 432 TE 433


>gi|313118766|gb|ADR32295.1| C14 cysteine protease [Solanum demissum]
 gi|313118774|gb|ADR32299.1| C14 cysteine protease [Solanum verrucosum]
 gi|313118776|gb|ADR32300.1| C14 cysteine protease [Solanum verrucosum]
 gi|313118778|gb|ADR32301.1| C14 cysteine protease [Solanum verrucosum]
          Length = 217

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 150/217 (69%), Positives = 182/217 (83%), Gaps = 1/217 (0%)

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           P SVDWR+KG +  VKDQGSCGSCWAFS VAA+E IN IVTG LISLSEQELVDCD+  N
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GC+GGLMDYAF+F+I NGG+DSE+DYPY    + CD  R+NAKVV ID YEDV   +E 
Sbjct: 62  EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           +L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVVA GYGTENG+DYW+VRN
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           SWG+ WGE GY+++QRN+  +++G CG+A E SYPVK
Sbjct: 182 SWGAKWGEKGYLRVQRNIA-SSSGLCGLATEPSYPVK 217


>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 337

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 158/309 (51%), Positives = 215/309 (69%), Gaps = 13/309 (4%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++GK        EKRF IFK N+ FI+  N+  N+ YK+G+N  ADLT EE++A
Sbjct: 38  HEQWMAEYGKVYKDAAEKEKRFLIFKHNVEFIESFNAAANKPYKLGVNHLADLTVEEFKA 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC-GSCWAFS 169
               +R+  KR     ++++  +  +    +P ++DWR KGAV  +KDQG C GSCWAFS
Sbjct: 98  ----SRNGLKRPY---ELSTTPFKYENVTAIPAAIDWRTKGAVTSIKDQGQCAGSCWAFS 150

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           TVAA EGI++I TG+L+SLSEQELVDCD K ++ GC GG M+  F+FII+NGG+ SE +Y
Sbjct: 151 TVAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGGITSEANY 210

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY   + KC+  +  + V  I GYE V P  E +L+KAVA+QPVSV+I+A G  F  Y S
Sbjct: 211 PYKAVDGKCN--KATSPVAQIKGYEKVPPNSEKTLQKAVANQPVSVSIDANGEGFMFYSS 268

Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           G++ GECG+ LDHGV AVGYG  NG DYWLV+NSWG+ WGE GYV++QR +   + G CG
Sbjct: 269 GIYNGECGTELDHGVTAVGYGIANGTDYWLVKNSWGTQWGEKGYVRMQRGVAAKH-GLCG 327

Query: 349 IAMEASYPV 357
           IA+++SYP 
Sbjct: 328 IALDSSYPT 336


>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
 gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 157/315 (49%), Positives = 218/315 (69%), Gaps = 9/315 (2%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADL 103
           D  +   ++ W+ +  +  +     E R++IFK+N++ I+  N  + ++YK+G+N+FADL
Sbjct: 32  DASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRIESFNKASEKSYKLGINQFADL 91

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           TNEE++     +R+  K  +  S+    RY       +P S+DWR++GAV  +KDQG CG
Sbjct: 92  TNEEFKT----SRNRFKGHMCSSQAGPFRYENITA--VPSSMDWRKEGAVTAIKDQGQCG 145

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
           SCWAFS VAAVEGI ++ T +LISLSEQELVDCD K  + GC GGLMD AF+FI QN G+
Sbjct: 146 SCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQGL 205

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E +YPY G++  C+  +       I+G+EDV   +E +L KAVA QPVSVAI+AGG  
Sbjct: 206 TTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGALMKAVAKQPVSVAIDAGGFE 265

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           FQ Y SG+FTG+CG+ LDHGV AVGYG  NG++YWLV+NSWG+ WGE GY+++Q++ +D 
Sbjct: 266 FQFYSSGIFTGDCGTELDHGVAAVGYGESNGMNYWLVKNSWGTQWGEEGYIRMQKD-IDA 324

Query: 343 NTGKCGIAMEASYPV 357
             G CGIAM+ASYP 
Sbjct: 325 KEGLCGIAMQASYPT 339


>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
 gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
 gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
 gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  324 bits (831), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 158/308 (51%), Positives = 211/308 (68%), Gaps = 9/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++G+          R+ IFK+N+  ID  NS   ++YK+G+N+FADLTNEE++A
Sbjct: 39  HEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFKA 98

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
               +R+  K  +   +    RY  +    +P +VDWR++GAV PVKDQG CG CWAFS 
Sbjct: 99  ----SRNRFKGHMCSPQAGPFRY--ENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSA 152

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA+EGINK+ TG+LISLSEQE+VDCD K  + GCNGGLMD AF+FI QN G+ +E +YP
Sbjct: 153 VAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYP 212

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+ ++       I G+EDV    E +L KAVA QPVSVAI+AGG  FQ Y SG
Sbjct: 213 YKGTDGTCNTNKAAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFYSSG 272

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           +FTG C + LDHGV AVGYG  +G  YWLV+NSWG+ WGE GY+++Q++ +    G CGI
Sbjct: 273 IFTGSCDTQLDHGVTAVGYGVSDGSKYWLVKNSWGAQWGEEGYIRMQKD-ISAKEGLCGI 331

Query: 350 AMEASYPV 357
           AM+ASYP 
Sbjct: 332 AMQASYPT 339


>gi|313118772|gb|ADR32298.1| C14 cysteine protease [Solanum demissum]
          Length = 217

 Score =  324 bits (831), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 149/217 (68%), Positives = 183/217 (84%), Gaps = 1/217 (0%)

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           P SVDWR+KG +  VKDQGSCGSCWAFS VAA+E IN IVTG+LISLSEQELVDCD+  N
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGDLISLSEQELVDCDKSYN 61

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GC+GGLMDYAF+F+I NGG+D+E+DYPY    + CD  R+NAKVV ID YEDV   +E 
Sbjct: 62  QGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           +L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVVA GYGTENG+DYW+VRN
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           SWG+ WGE GY+++QRN+  +++G CG+A E SYPVK
Sbjct: 182 SWGAKWGEKGYLRVQRNIA-SSSGLCGLATEPSYPVK 217


>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
 gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score =  324 bits (831), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 159/309 (51%), Positives = 215/309 (69%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++GK        EKRF++FK+N+ +I+  +N+ N+ YK+G+N+FADLT+EE+  
Sbjct: 39  HEQWMARYGKVYKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLGINQFADLTSEEF-- 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  R+        S   +  +  +    LP+S+DWR+KGAV P+K+QGSCG CWAFS 
Sbjct: 97  --IVPRNRFNGHTRSSNTRTTTFKYENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSA 154

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           +AA EGI+KI TG+L+SLSEQE+VDCD K  + GC GG MD AF+FIIQN G+++E  YP
Sbjct: 155 IAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G + KC+         +I GYEDV   +E +L+KAVA+QPVSVAI+A G  FQ Y+SG
Sbjct: 215 YKGVDGKCNIKEEAVHAATITGYEDVPINNEKALQKAVANQPVSVAIDASGADFQFYKSG 274

Query: 290 VFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           +FTG CG+ LDHGV AVGYG  N G  YWLV+NSWG++WGE GY+ +QR +     G CG
Sbjct: 275 IFTGSCGTELDHGVTAVGYGENNEGTKYWLVKNSWGTEWGEEGYIMMQRGVKAVE-GICG 333

Query: 349 IAMEASYPV 357
           IAM ASYP 
Sbjct: 334 IAMMASYPT 342


>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  324 bits (830), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 176/337 (52%), Positives = 232/337 (68%), Gaps = 14/337 (4%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGL 97
           + S R +  V+T+Y+ WL ++GK  NG+G  E+RF+IFKDNL+ I+EHNS  NR+Y+ GL
Sbjct: 28  TESQRNEGGVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGL 87

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP-V 156
           NKF+DLT +E++A YLG + + K     S VA +RY  K GD LP+ VDWRE+GAV P V
Sbjct: 88  NKFSDLTADEFQASYLGGKMEKKSL---SDVA-ERYQYKEGDVLPDEVDWRERGAVVPRV 143

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
           K QG CGSCWAF+   AVEGIN+I TGEL+SLSEQEL+DCDR   N GC GG   +AF+F
Sbjct: 144 KRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEF 203

Query: 216 IIQNGGMDSEQDYPYLGAEN-KCDP-SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
           I +NGG+ S++ Y Y G +   C     +  +VV+I+G+E V   DEMSLKKAVA QP+S
Sbjct: 204 IKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPIS 263

Query: 274 VAIEAGGRAFQHYESGVFTGECGSAL-DHGVVAVGYGTENGV-DYWLVRNSWGSDWGENG 331
           V I A       Y+SGV+ G C +   DH V+ VGYGT +   DYWL+RNSWG +WGE G
Sbjct: 264 VMISAAN--MSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGG 321

Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
           Y++LQRN  +  TGKC +A+   YP+K++ +S    P
Sbjct: 322 YLRLQRNFHEP-TGKCAVAVAPVYPIKSNSSSHLLSP 357


>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
 gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
          Length = 306

 Score =  324 bits (830), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 158/308 (51%), Positives = 210/308 (68%), Gaps = 9/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++G+          R+ IFK+N+  ID  NS   ++YK+G+N+FADLTNEE++A
Sbjct: 5   HEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFKA 64

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
               +R+  K  +   +    RY   +   +P +VDWR++GAV PVKDQG CG CWAFS 
Sbjct: 65  ----SRNRFKGHMCSPQAGPFRYENVSA--VPSTVDWRKEGAVTPVKDQGQCGCCWAFSA 118

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA+EGINK+ TG+LISLSEQE+VDCD K  + GCNGGLMD AF+FI QN G+ +E +YP
Sbjct: 119 VAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYP 178

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+  +       I G+EDV    E +L KAVA QPVSVAI+AGG  FQ Y SG
Sbjct: 179 YKGTDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFYSSG 238

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           +FTG C + LDHGV AVGYG  +G  YWLV+NSWG+ WGE GY+++Q++ +    G CGI
Sbjct: 239 IFTGSCDTQLDHGVTAVGYGVSDGSKYWLVKNSWGAQWGEEGYIRMQKD-ISAKEGLCGI 297

Query: 350 AMEASYPV 357
           AM+ASYP 
Sbjct: 298 AMQASYPT 305


>gi|313118762|gb|ADR32293.1| C14 cysteine protease [Solanum stoloniferum]
          Length = 217

 Score =  323 bits (829), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 150/217 (69%), Positives = 181/217 (83%), Gaps = 1/217 (0%)

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           P SVDWR+KG +  VKDQGSCGSCWAFS VAA+E IN IVTG LISLSEQELVDCD+  N
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GC+GGLMDYAF+F+I NGG+DSE+DYPY    + CD  R+NAKVV ID YEDV   +E 
Sbjct: 62  EGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           +L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVVA GYGTENG+DYW+VRN
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVRN 181

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           SWG+ WGE GY+++QRN+  + +G CG+A E SYPVK
Sbjct: 182 SWGAKWGEKGYLRVQRNIARS-SGLCGLATEPSYPVK 217


>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score =  323 bits (829), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 159/306 (51%), Positives = 213/306 (69%), Gaps = 8/306 (2%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYL 113
           W+A++ K        EKRF+IFK+N+ +I+  NS  N++YK+ +N+FADLTNEE+    +
Sbjct: 42  WMARYAKVYKDPQEREKRFRIFKENVNYIETFNSADNKSYKLDINQFADLTNEEF----I 97

Query: 114 GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
             R+  K  +  S   +  +  +    +P +VDWR+KGAV P+KDQG CG CWAFS VAA
Sbjct: 98  APRNRFKGHMCSSITRTTTFKYENVTVIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAA 157

Query: 174 VEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
            EGI+ +  G+LISLSEQE+VDCD K  + GC GG MD AF+FIIQN G+++E +YPY  
Sbjct: 158 TEGIHALNAGKLISLSEQEVVDCDTKGQDQGCAGGFMDGAFKFIIQNHGLNTEPNYPYKA 217

Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
           A+ KC+         +I GYEDV   +E +L+KAVA+QPVSVAI+A G  FQ Y+SGVFT
Sbjct: 218 ADGKCNAKAAANHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFT 277

Query: 293 GECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
           G CG+ LDHGV AVGYG + +G +YWLV+NSWG++WGE GY+++QR  +    G CGIAM
Sbjct: 278 GSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRG-VKAEEGLCGIAM 336

Query: 352 EASYPV 357
            ASYP 
Sbjct: 337 MASYPT 342


>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  323 bits (829), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 156/309 (50%), Positives = 214/309 (69%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++ K        E+RF+IFK+N+ +I+  +N+ N+ Y +G+N+FADLTNEE+  
Sbjct: 39  HEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEF-- 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  R+  K  +  S   +  +  +    +P +VDWR+KGAV P+KDQG CG CWAFS 
Sbjct: 97  --IAPRNRFKGHMCSSITRTTTFKYENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI+ +  G+LISLSEQE+VDCD K  + GC GG MD AF+FIIQN G+++E +YP
Sbjct: 155 VAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   + KC+       V +I GYEDV   +E +L+KAVA+QPVSVAI+A G  FQ Y+SG
Sbjct: 215 YKAVDGKCNAKAAANHVATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSG 274

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG CG+ LDHGV AVGYG + +G +YWLV+NSWG++WGE GY+++QR  +    G CG
Sbjct: 275 VFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRG-VKAEEGLCG 333

Query: 349 IAMEASYPV 357
           IAM ASYP 
Sbjct: 334 IAMMASYPT 342


>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
           sativus]
          Length = 317

 Score =  323 bits (829), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 164/326 (50%), Positives = 218/326 (66%), Gaps = 15/326 (4%)

Query: 37  DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVG 96
           D+S       ++   YQ W+ K+G+        E+RF I++ N+++ID  NS+N ++ + 
Sbjct: 4   DYSLGSSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLA 63

Query: 97  LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVN 154
            N FADLTNEE++A YLG ++          V+      + G+   LP +VDWR++GAV 
Sbjct: 64  ENNFADLTNEEFKATYLGYKT----------VSIPDTCFRYGNMVNLPTNVDWRQEGAVT 113

Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAF 213
           P+K+QG CGSCWAFS VAAVEGINKI  G+LISLSEQELVDCD    N GCNGG M  AF
Sbjct: 114 PIKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAF 173

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
           +FI +  G+ +E +YPY GAE+ C+  +   + VSI GYE V   DE SLK AVA+QPVS
Sbjct: 174 EFI-KRTGLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVS 232

Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYV 333
           VAI+A G  FQ Y  G+F+G CG+ L+HGV  VGYG  +   YWLV+NSWG+DWGE+GY+
Sbjct: 233 VAIDAEGNNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGTDWGESGYI 292

Query: 334 KLQRNLLDTNTGKCGIAMEASYPVKN 359
           +++R+  D   G CGIAM ASYP K+
Sbjct: 293 RMKRDSTD-RQGTCGIAMMASYPTKD 317


>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  323 bits (829), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 158/309 (51%), Positives = 215/309 (69%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++ K        EKRF+IFK+N+ +I+  +N+ N+ YK+G+N+FADLTNEE+  
Sbjct: 39  HEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLGINQFADLTNEEF-- 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  R+  K  +  S   +  +  +    LP +VDWR+KGAV P+KDQG CG CWAFS 
Sbjct: 97  --IAPRNRFKGHMCSSITRTTTFKYENVTALPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI+ + +G+LISLSEQE+VDCD K  + GC GG MD AF+FIIQN G+++E +YP
Sbjct: 155 VAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNTEANYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   + KC+ +       +I GYEDV   +E +L+KAVA+QPVSVAI+A G  FQ Y++G
Sbjct: 215 YKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYKTG 274

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG CG+ LDHGV AVGYG + +G  YWLV+NSWG++WGE GY+ +QR  +    G CG
Sbjct: 275 VFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYIMMQRG-VKAQEGLCG 333

Query: 349 IAMEASYPV 357
           IAM ASYP 
Sbjct: 334 IAMMASYPT 342


>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
 gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
          Length = 347

 Score =  323 bits (828), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 165/311 (53%), Positives = 208/311 (66%), Gaps = 12/311 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           Y  WL ++G+  +       RF I+  N++FI+  NS N ++K+  NKFADLTN+E+ ++
Sbjct: 46  YDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQNLSFKLTDNKFADLTNDEFNSI 105

Query: 112 YLG--TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
           YLG   RS  +R L      S         +LP++VDWRE GAV P+KDQG CGSCWAFS
Sbjct: 106 YLGYQIRSYKRRNLSHMHENST--------DLPDAVDWRENGAVTPIKDQGQCGSCWAFS 157

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
            VAAVEGINKI TG L+SLSEQELVDCD    N GCNGG M+ AF FI   GG+ +E DY
Sbjct: 158 AVAAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDY 217

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY G +  C+ ++ +   V I GYE V   +E SLK AV+ QPVSVAI+A G  FQ Y  
Sbjct: 218 PYKGTDGSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSE 277

Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           GVF+G CG  L+HGV  VGYG  NG  YWLV+NSWG  WGE+GY++++R+  DT  G CG
Sbjct: 278 GVFSGYCGIQLNHGVTIVGYGDNNGQKYWLVKNSWGKGWGESGYIRMKRDSSDTK-GMCG 336

Query: 349 IAMEASYPVKN 359
           IAME SYP+K+
Sbjct: 337 IAMEPSYPIKD 347


>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  323 bits (828), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 156/309 (50%), Positives = 214/309 (69%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++ K        E+RF+IFK+N+ +I+  +N+ N+ Y +G+N+FADLTNEE+  
Sbjct: 39  HEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEF-- 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  R+  K  +  S   +  +  +    +P +VDWR+KGAV P+KDQG CG CWAFS 
Sbjct: 97  --IAPRNRFKGHMCSSITRTTTFKYENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI+ +  G+LISLSEQE+VDCD K  + GC GG MD AF+FIIQN G+++E +YP
Sbjct: 155 VAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   + KC+       V +I GYEDV   +E +L+KAVA+QPVSVAI+A G  FQ Y+SG
Sbjct: 215 YKAVDGKCNAKAAANHVATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSG 274

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG CG+ LDHGV AVGYG + +G +YWLV+NSWG++WGE GY+++QR  +    G CG
Sbjct: 275 VFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRG-VKAEEGLCG 333

Query: 349 IAMEASYPV 357
           IAM ASYP 
Sbjct: 334 IAMMASYPT 342


>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
 gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|219884977|gb|ACL52863.1| unknown [Zea mays]
          Length = 377

 Score =  323 bits (828), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 171/320 (53%), Positives = 213/320 (66%), Gaps = 15/320 (4%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFADLT 104
           D ++ +++ W+AK+ K         +RF++FKDNL  IDE N    T Y +GLN FADLT
Sbjct: 66  DRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLT 125

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRY----ACKAGDELPESVDWREKGAVNPVKDQG 160
           ++E++A YLG        L+  + +  R+        GDE+P SVDWR+KGAV  VK+QG
Sbjct: 126 HDEFKATYLG--------LLPKRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQG 177

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
            CGSCWAFSTVAAVEGIN+IVTG L SLSEQ+LVDC    N GC+GG+MD AF FI    
Sbjct: 178 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGA 237

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKV-VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
           G+ SE+ YPYL  E  CD   R+ +V V+I GYEDV   DE +L KA+A QPVSVAIEA 
Sbjct: 238 GLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 297

Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
           GR FQ Y  GVF G CGS LDHGV AVGYG+  G DY +V+NSWG+ WGE GY++++R  
Sbjct: 298 GRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGTHWGEKGYIRMKRG- 356

Query: 340 LDTNTGKCGIAMEASYPVKN 359
                G CGI   ASYP K+
Sbjct: 357 TGKPEGLCGINKMASYPTKD 376


>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
          Length = 336

 Score =  323 bits (828), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 162/291 (55%), Positives = 198/291 (68%), Gaps = 3/291 (1%)

Query: 71  KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
           +RF++FKDNL  ID+ N    +Y +GLN+FADLT++E++A YLG      R   K   + 
Sbjct: 48  RRFEVFKDNLNHIDDINKKVTSYWLGLNEFADLTHDEFKATYLGLTPPPTRSNSKHYSSE 107

Query: 131 Q-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
           + RY   +  E+P+ +DWR+K AV  VK+QG CGSCWAFSTVAAVEGIN IVTG L SLS
Sbjct: 108 EFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLS 167

Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
           EQEL+DC    N GCNGGLMDYAF +I   GG+ +E+ YPY   E  CD   + A VV+I
Sbjct: 168 EQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTEEAYPYAMEEGDCDEG-KGAAVVTI 226

Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
            GYEDV   DE +L KA+A QPVSVAIEA GR FQ Y  GVF G CG  LDHGV AVGYG
Sbjct: 227 SGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGEQLDHGVTAVGYG 286

Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           T  G DY +V+NSWG  WGE GY++++R       G CGI   ASYP K++
Sbjct: 287 TSKGQDYIIVKNSWGPHWGEKGYIRMKRG-TGKGEGLCGINKMASYPTKDN 336


>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
          Length = 423

 Score =  323 bits (828), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 173/362 (47%), Positives = 225/362 (62%), Gaps = 17/362 (4%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           LV L F+SS++      I +D          +D+ +  +Y+ W   H +     G   +R
Sbjct: 54  LVALVFVSSAAVELCRAIDFDER-----DLASDEALWDLYERWQTHH-RVHRHHGEKGRR 107

Query: 73  FQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV--- 128
           F  FK+N+RFI  HN   +R Y++ LN+F D+  EE+R+ +  +R +  RR         
Sbjct: 108 FGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAARAG 167

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
           A   +   +  + P SVDWR++GAV  VKDQG CGSCWAFSTV AVEGIN I TG L SL
Sbjct: 168 AVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLASL 227

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD---PSRRNAK 245
           SEQEL+DCD   N GC GGLM+ AF+FI   GG+ +E  YPY  +   CD     R    
Sbjct: 228 SEQELIDCDTDEN-GCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGV 286

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
           VV IDG++ V    E +L KAVA QPVSVA++AGG+AFQ Y  GVFTG+CG+ LDHGV A
Sbjct: 287 VVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAA 346

Query: 306 VGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           VGYG  ++G  YW+V+NSWG+ WGE GY+++QR     N G CGIAMEAS+P+K S N A
Sbjct: 347 VGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGA--GNGGLCGIAMEASFPIKTSPNPA 404

Query: 365 KP 366
            P
Sbjct: 405 DP 406


>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 391

 Score =  323 bits (827), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 171/320 (53%), Positives = 213/320 (66%), Gaps = 15/320 (4%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFADLT 104
           D ++ +++ W+AK+ K         +RF++FKDNL  IDE N    T Y +GLN FADLT
Sbjct: 80  DRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLT 139

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRY----ACKAGDELPESVDWREKGAVNPVKDQG 160
           ++E++A YLG        L+  + +  R+        GDE+P SVDWR+KGAV  VK+QG
Sbjct: 140 HDEFKATYLG--------LLPKRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQG 191

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
            CGSCWAFSTVAAVEGIN+IVTG L SLSEQ+LVDC    N GC+GG+MD AF FI    
Sbjct: 192 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGA 251

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKV-VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
           G+ SE+ YPYL  E  CD   R+ +V V+I GYEDV   DE +L KA+A QPVSVAIEA 
Sbjct: 252 GLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 311

Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
           GR FQ Y  GVF G CGS LDHGV AVGYG+  G DY +V+NSWG+ WGE GY++++R  
Sbjct: 312 GRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGTHWGEKGYIRMKRG- 370

Query: 340 LDTNTGKCGIAMEASYPVKN 359
                G CGI   ASYP K+
Sbjct: 371 TGKPEGLCGINKMASYPTKD 390


>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 339

 Score =  323 bits (827), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 173/360 (48%), Positives = 232/360 (64%), Gaps = 27/360 (7%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKH 59
           MA+ S+ L I+ L  +F  S+  A   +++               D +M + ++ W+A++
Sbjct: 1   MASNSLKLLIA-LALVFATSAYLATSRTLL---------------DSLMAVRHEQWMAQY 44

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
           G+         KR+ IFK+N+ +I+  N    + YK+G+N FADLTN+E+    + +R+ 
Sbjct: 45  GRVYKNEVEKTKRYNIFKENVEYIESFNKAGTKPYKLGINAFADLTNKEF----IASRNG 100

Query: 119 AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
                  S     RY  +    +P +VDWR+KGAV PVKDQG CG CWAFS VAA+EGI 
Sbjct: 101 YILPHECSSNTPFRY--ENVSAVPTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGIT 158

Query: 179 KIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
           K+ TG LISLSEQELVDCD K I+ GC GGLMD AF FII N G+ +E +YPY G +  C
Sbjct: 159 KLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFTFIINNKGLTTESNYPYQGTDGSC 218

Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
             S+ +     I GYEDV    E +L+KAVA+QPVSVAI+AGG  FQ Y SGVFTGECG+
Sbjct: 219 KKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGSDFQFYSSGVFTGECGT 278

Query: 298 ALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
            LDHGV AVGYG  E+G  YWLV+NSWG+ WGE GY+++Q++ ++   G CGIAM++SYP
Sbjct: 279 ELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKD-IEAKEGLCGIAMQSSYP 337


>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
 gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
          Length = 338

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 167/354 (47%), Positives = 236/354 (66%), Gaps = 22/354 (6%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHGKTSNGM 66
           + +S ++   +I +S+  ++        H  +S   T+  VM   Y+TWL ++G+     
Sbjct: 5   ITLSIVILNLWIIASACPEI--------HTKNS---TNPAVMKKRYETWLKRYGRHYRDR 53

Query: 67  GHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
              E RF I++ N+++I+ +NS N +YK+  N+FAD+TNEE+++ YLG        L + 
Sbjct: 54  EEWEVRFDIYQSNVQYIEFYNSQNYSYKLIDNRFADITNEEFKSTYLGY-------LPRF 106

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +V ++    K G ELP+S+DWR+KGAV  VKDQG CGSCWAFS VAAVEGINKI T  L+
Sbjct: 107 RVQTEFRYHKHG-ELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLV 165

Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           SLSEQ+L+DCD K  N GC GG M  AF +I ++GG+ + ++YPY G +  C+ S+    
Sbjct: 166 SLSEQQLIDCDIKSGNEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNN 225

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
            V+I GYE V   +E  LK AVA QPVS+A +AGG AFQ Y  G+F+G CG  L+HG+  
Sbjct: 226 AVTISGYESVPARNEKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTI 285

Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           VGYG ENG  YW+V+NSW +DWGE+GYV+++R+  D + G CGIAM+A+YPVK+
Sbjct: 286 VGYGEENGDKYWIVKNSWANDWGESGYVRMKRDTKDKD-GTCGIAMDATYPVKH 338


>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
          Length = 379

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 175/369 (47%), Positives = 228/369 (61%), Gaps = 19/369 (5%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           LV L F+SS++      I +D          +D+ +  +Y+ W   H +     G   +R
Sbjct: 10  LVALVFVSSAAVELCRAIDFDER-----DLASDEALWDLYERWQTHH-RVHRHHGEKGRR 63

Query: 73  FQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV--- 128
           F  FK+N+RFI  HN   +R Y++ LN+F D+  EE+R+ +  +R +  RR         
Sbjct: 64  FGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAARAG 123

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
           A   +   +  + P SVDWR++GAV  VKDQG CGSCWAFSTV AVEGIN I TG L SL
Sbjct: 124 AVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLASL 183

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD---PSRRNAK 245
           SEQEL+DCD   N GC GGLM+ AF+FI   GG+ +E  YPY  +   CD     R    
Sbjct: 184 SEQELIDCDTDEN-GCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGV 242

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
           VV IDG++ V    E +L KAVA QPVSVA++AGG+AFQ Y  GVFTG+CG+ LDHGV A
Sbjct: 243 VVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAA 302

Query: 306 VGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           VGYG  ++G  YW+V+NSWG+ WGE GY+++QR     N G CGIAMEAS+P+K S N A
Sbjct: 303 VGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGA--GNGGLCGIAMEASFPIKTSPNPA 360

Query: 365 KP--KPHSS 371
            P  KP  +
Sbjct: 361 DPPRKPRRA 369


>gi|313118768|gb|ADR32296.1| C14 cysteine protease [Solanum demissum]
 gi|313118770|gb|ADR32297.1| C14 cysteine protease [Solanum demissum]
          Length = 217

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 148/217 (68%), Positives = 182/217 (83%), Gaps = 1/217 (0%)

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           P SVDWR+KG +  VKDQGSCGSCWAFS VAA+E IN IVTG LISLSEQELVDCD+  N
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GC+GGLMDYAF+F+I NGG+D+E+DYPY      CD  R+NAKVV+ID YEDV   +E 
Sbjct: 62  EGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNGVCDQYRKNAKVVTIDSYEDVPVNNEK 121

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           +L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVV  GYGTENG+DYW+VRN
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVVAGYGTENGMDYWIVRN 181

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           SWG+ WGE GY+++QRN+  +++G CG+A+E SYPVK
Sbjct: 182 SWGAKWGEKGYLRVQRNVA-SSSGLCGLAIEPSYPVK 217


>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
          Length = 359

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 173/358 (48%), Positives = 233/358 (65%), Gaps = 13/358 (3%)

Query: 16  LFFISSSSAADMSII-SYD-NNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRF 73
           L FIS S A   ++  ++D N HD  S    +  +  +Y+ W + H  T N +     RF
Sbjct: 6   LLFISLSLALIFTVANTFDFNEHDLES----EKSLWNLYERWRSHHTVTRN-LDEKHNRF 60

Query: 74  QIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY 133
            +FK N+  +   N L++ YK+ LNKF D+TN E+R +Y  ++    R        +  +
Sbjct: 61  NVFKANVMHVHNTNKLDKPYKLKLNKFGDMTNYEFRRIYADSKISHHRMFRGMSHENGTF 120

Query: 134 ACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQEL 193
             +   ++P S+DWR KGAV  VKDQG CGSCWAFST+AAVEGIN+I T +L+SLSEQ+L
Sbjct: 121 MYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQL 180

Query: 194 VDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYE 253
           VDCD + N GCNGGLM+YAF+FI QN G+ +E +YPY   +  CD  + + K VSIDG+E
Sbjct: 181 VDCDTEENEGCNGGLMEYAFEFIKQN-GITTESNYPYAAKDGTCDVEKED-KAVSIDGHE 238

Query: 254 DVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TEN 312
           +V   +E +L KA A QPVSVAI+AGG  FQ Y  GVFTG C + L+HGV  VGYG T++
Sbjct: 239 NVPINNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFTGHCDTDLNHGVAIVGYGVTQD 298

Query: 313 GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
              YW+++NSWGS+WGE GY+++QR  + +  G CGIAMEASYP+K S  S KP   S
Sbjct: 299 RTKYWIMKNSWGSEWGEQGYIRMQRG-ISSREGLCGIAMEASYPIKKS--STKPTESS 353


>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
          Length = 272

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 153/273 (56%), Positives = 200/273 (73%), Gaps = 7/273 (2%)

Query: 87  NSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVD 146
           N  N+ YK+G+NKFADLTNEE++A    +R+  K  +  S + +  +  +    +P +VD
Sbjct: 4   NVNNKLYKLGINKFADLTNEEFKA----SRNKFKGHMCSSIIRTTTFKYENASAIPSTVD 59

Query: 147 WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCN 205
           WR+KGAV PVK+QG CGSCWAFS VAA EGI+++ TG+L+SLSEQEL+DCD K ++ GC 
Sbjct: 60  WRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCE 119

Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
           GGLMD AF+FIIQN G+ +E  YPY G +  C+ +  +   V+I GYEDV   +E++L+K
Sbjct: 120 GGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGYEDVPANNELALQK 179

Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWG 324
           AVA+QP+SVAI+A G  FQ Y SGVFTG CG+ LDHGV AVGYG  N G  YWLV+NSWG
Sbjct: 180 AVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWG 239

Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           +DWGE GY+++QR  +D   G CGIAM+ASYP 
Sbjct: 240 ADWGEEGYIRMQRG-IDAAEGLCGIAMQASYPT 271


>gi|125592009|gb|EAZ32359.1| hypothetical protein OsJ_16569 [Oryza sativa Japonica Group]
          Length = 480

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 180/361 (49%), Positives = 234/361 (64%), Gaps = 20/361 (5%)

Query: 20  SSSSAADMSIISYDNNHDHSS--SWRTDDEVMTIYQTWLAKHGKTS-NGMG-HNEKRFQI 75
           ++++A DMSIISY+  H         T+ E    Y  WLA++G  S N +G  +E+RF +
Sbjct: 18  AATAAPDMSIISYNAEHGARGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERRFLV 77

Query: 76  FKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
           F DNL+F+D HN+       +++G+N+                +   +    + +V  +R
Sbjct: 78  FWDNLKFVDAHNARADERGGFRLGMNRLRRSHQRGVPRDLPRRQGRREEPRRRGEVPPRR 137

Query: 133 YACKAG----DELPESVDWREKGAVNP------VKDQGSCGSCWAFSTVAAVEGINKIVT 182
               AG    +        +E G +        VK  G  GSCWAFS V+ VE IN++VT
Sbjct: 138 GGGAAGVRRLEGEGRRRPRQEPGPMRSFSVHLSVKYFGQ-GSCWAFSAVSTVESINQLVT 196

Query: 183 GELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
           GE+I+LSEQELV+C     N+GCNGGLMD AF FII+NGG+D+E DYPY   + KCD +R
Sbjct: 197 GEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINR 256

Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
            NAKVVSIDG+EDV   DE SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG++LDH
Sbjct: 257 ENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDH 316

Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           GVVAVGYGT+NG DYW+VRNSWG  WGE+GYV+++RN ++  TGKCGIAM ASYP K+  
Sbjct: 317 GVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERN-INVTTGKCGIAMMASYPTKSGA 375

Query: 362 N 362
           N
Sbjct: 376 N 376


>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 157/309 (50%), Positives = 215/309 (69%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++ K        EKRF+IFK+N+ +I+  +N+ ++ YK+G+N+FADLTNEE+  
Sbjct: 39  HEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLGINQFADLTNEEF-- 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  R+  K  +  S   +  +  +    LP +VDWR+KGAV P+KDQG CG CWAFS 
Sbjct: 97  --IAPRNKFKGHMCSSITRTTTFKYENVTALPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI+ + +G+LISLSEQE+VDCD K  + GC GG MD AF+FIIQN G+++E +YP
Sbjct: 155 VAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNTEANYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   + KC+ +       +I GYEDV   +E +L+KAVA+QPVSVAI+A G  FQ Y++G
Sbjct: 215 YKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYKTG 274

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG CG+ LDHGV AVGYG + +G  YWLV+NSWG++WGE GY+ +QR  +    G CG
Sbjct: 275 VFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYIMMQRG-VKAQEGLCG 333

Query: 349 IAMEASYPV 357
           IAM ASYP 
Sbjct: 334 IAMMASYPT 342


>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
          Length = 340

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 165/353 (46%), Positives = 230/353 (65%), Gaps = 25/353 (7%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L F FF  ++ AA       D N D +   R        ++ W+A++ +        
Sbjct: 9   LAVLSFAFFCGAALAA------RDLNEDSAMVAR--------HEQWMAQYSRVYKDAAEK 54

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF++FK N++FI+  N+  NR + +G+N+FADLTN+E+R     T+++   +    KV
Sbjct: 55  ARRFEVFKANVKFIESFNTGGNRKFWLGINQFADLTNDEFRT----TKTNKGFKPSLDKV 110

Query: 129 ASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
           ++  RY   + D +P ++DWR  GAV P+KDQG CG CWAFS VAA EGI KI TG+LIS
Sbjct: 111 STGFRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLIS 170

Query: 188 LSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           LSEQELVDCD    + GC GGLMD AF+FII+NGG+ +E +YPY  A+ KC     +A  
Sbjct: 171 LSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSNSA-- 228

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
            +I GYEDV   DE +L KAVA+QPVSVA++ G   FQ Y  GV TG CG+ LDHG+ A+
Sbjct: 229 ANIKGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAI 288

Query: 307 GYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           GYG T +G  YWL++NSWG+ WGENGY+++++++ D   G CG+AME SYP +
Sbjct: 289 GYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISD-KKGMCGLAMEPSYPTE 340


>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
          Length = 339

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 172/363 (47%), Positives = 235/363 (64%), Gaps = 31/363 (8%)

Query: 1   MATASMFLAIS-TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKH 59
           MA+ + +  +S  L+F+    +S A   S+      H+ S   R +D        W+A++
Sbjct: 1   MASTNQYQYVSMALLFILAAWASQATSRSL------HEASMYERHED--------WMARY 46

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
           G+        EKRF+IFKDN+  I+  N ++++TYK+ +N+FADLTNEE+R++       
Sbjct: 47  GRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSL------- 99

Query: 119 AKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
             R   K+ + S+    K  +   +P ++DWR+KGAV P+KDQ  CG CWAFS VAA EG
Sbjct: 100 --RNRFKAHICSEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEG 157

Query: 177 INKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
           I +I TG+LISLSEQELVDCD    N GC+GGLMD AF+FI +  G+ SE  YPY G + 
Sbjct: 158 ITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFI-KIHGLASEATYPYEGDDG 216

Query: 236 KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGEC 295
            C+  +       I GYEDV   +E +L+KAVA QPV+VAI+AGG  FQ Y SGVFTG+C
Sbjct: 217 TCNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQC 276

Query: 296 GSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEAS 354
           G+ LDHGV AVGYG  ++G+ YWLV+NSWG+ WGE GY+++QR++     G CGIAM+AS
Sbjct: 277 GTELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIAMQAS 335

Query: 355 YPV 357
           YP 
Sbjct: 336 YPT 338


>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
 gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
 gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
 gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 166/350 (47%), Positives = 225/350 (64%), Gaps = 13/350 (3%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHGKTSNGMGH 68
           ++T +F+  +  ++    S       H   SS   D E M   +  W+ +HG+       
Sbjct: 6   LTTTIFILLMLCNTCVIASESECPPTHKQKSS---DVEAMKKRFDGWVKRHGRKYKHNDE 62

Query: 69  NEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            E RF I++ N+++I   N+   +Y +  NKFADLTNEE+++ Y+G  +      ++S  
Sbjct: 63  REVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQSTYMGLSTR-----LRSHN 117

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
              RY  + GD LPES DWR++GAV  + DQG CG CWAF+ VAAVEGINKI +G+LISL
Sbjct: 118 TGFRYD-EHGD-LPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLISL 175

Query: 189 SEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           SEQEL+DCD K  N GC GGLM+ A+ FII+NGG+ +EQDYPY G +  C   +      
Sbjct: 176 SEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYAA 235

Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
           SI GYE+V   +E  LK A A QPVSVAI+AGG +FQ Y  GVF+G CG  L+HGV  VG
Sbjct: 236 SISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVVG 295

Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           YG E    YW+V+NSWG+DWGE+GY++++R+ L +  G CGIAM+ASYP+
Sbjct: 296 YGKETINKYWIVKNSWGADWGESGYIRMKRDTL-SKEGMCGIAMQASYPL 344


>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
          Length = 373

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 158/309 (51%), Positives = 209/309 (67%), Gaps = 8/309 (2%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLG 114
           W+A+HG+T       E+R  IFK N+ +I+  N+  R Y++  N+FADLT+EE++AM+ G
Sbjct: 38  WMARHGRTYKDAAEKEQRLGIFKSNVEYIESFNAGKRKYQLAANQFADLTHEEFKAMHTG 97

Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
            +         +K A   +   +   +P+SVDWR KGAV PVKDQG CGSCWAF+ VAAV
Sbjct: 98  FKPSGT----GAKKAGNGFRHGSLSSVPDSVDWRSKGAVTPVKDQGLCGSCWAFTVVAAV 153

Query: 175 EGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
           EGI KIVTG+LISLSEQ+LVDCD    + GC GG MD AF+FI+ NGG+ SE +YPY   
Sbjct: 154 EGITKIVTGKLISLSEQQLVDCDVHGKDQGCQGGDMDAAFEFIVNNGGITSEANYPYEEV 213

Query: 234 ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA-FQHYESGVFT 292
           +  C+    +  V +I+ +EDV   DE +L+KAVA+QPVSV I+AG    FQ Y  GVF+
Sbjct: 214 QRLCNAHNASFVVATIESHEDVPTNDEKALRKAVANQPVSVGIDAGSSLDFQLYSGGVFS 273

Query: 293 GECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
           GECG+ LDH V  VGYG T +G  YWL +NSWG  WGENGY++++R++     G CGIAM
Sbjct: 274 GECGTDLDHAVTVVGYGTTSDGTKYWLAKNSWGETWGENGYIRMERDVA-AKEGLCGIAM 332

Query: 352 EASYPVKNS 360
           +ASYP   +
Sbjct: 333 QASYPTAGT 341


>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 155/308 (50%), Positives = 211/308 (68%), Gaps = 6/308 (1%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++GK        EKRFQIFKDN+ FI+  N+  N+ YK+G+N  ADLT EE++ 
Sbjct: 38  HENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKD 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFS 169
              G +   +      K+   +Y  +   ++PE++DWR KGAV P+KDQG  CG  WAFS
Sbjct: 98  SRNGLKRTYEFSTTTFKLNGFKY--ENVTDIPEAIDWRVKGAVTPIKDQGDQCGRFWAFS 155

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           T+AA EGI++I TG L+SLSEQELVDCD  ++ GC GG M+  F+FII+NGG+ SE +YP
Sbjct: 156 TIAATEGIHQISTGNLVSLSEQELVDCD-SVDDGCEGGFMEDGFEFIIKNGGITSETNYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+ +   + V  I GYE V  + E +LKKAVA+QPVSV+I A    F  Y SG
Sbjct: 215 YKGVDGTCNTTIAASPVAQIKGYEIVPSYSEEALKKAVANQPVSVSIHATNATFMFYSSG 274

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           ++ GECG+ LDHGV AVGYGTENG DYW+V+NSWG+ WGE GY+++ R +   + G CGI
Sbjct: 275 IYNGECGTDLDHGVTAVGYGTENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKH-GICGI 333

Query: 350 AMEASYPV 357
           A+++SYP 
Sbjct: 334 ALDSSYPT 341


>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
           [Cucumis sativus]
          Length = 314

 Score =  322 bits (824), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 163/323 (50%), Positives = 216/323 (66%), Gaps = 15/323 (4%)

Query: 37  DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVG 96
           D+S       ++   YQ W+ K+G+        E+RF I++ N+++ID  NS+N ++ + 
Sbjct: 4   DYSLGSSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLA 63

Query: 97  LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVN 154
            N FADLTNEE++A YLG ++          V+      + G+   LP +VDWR++GAV 
Sbjct: 64  ENNFADLTNEEFKATYLGYKT----------VSIPDTCFRYGNMVNLPTNVDWRQEGAVT 113

Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAF 213
           P+K+QG CGSCWAFS VAAVEGINKI  G+LISLSEQELVDCD    N GCNGG M  AF
Sbjct: 114 PIKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAF 173

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
           +FI +  G+ +E +YPY GAE+ C+  +   + VSI GYE V   DE SLK AVA+QPVS
Sbjct: 174 EFI-KRTGLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVS 232

Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYV 333
           VAI+A G  FQ Y  G+F+G CG+ L+HGV  VGYG  +   YWLV+NSWG+DWGE+GY+
Sbjct: 233 VAIDAEGNNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGTDWGESGYI 292

Query: 334 KLQRNLLDTNTGKCGIAMEASYP 356
           +++R+  D   G CGIAM ASYP
Sbjct: 293 RMKRDSTDKQ-GTCGIAMMASYP 314


>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
          Length = 273

 Score =  321 bits (823), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 152/269 (56%), Positives = 193/269 (71%), Gaps = 2/269 (0%)

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           +TN E+R+ Y G++ +  R    S+ A+  +  +    +P SVDWR+KGAV P+KDQG C
Sbjct: 1   MTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQC 60

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           GSCWAFSTV AVEGIN I T +L+SLSEQELVDCD   N GCNGGLM YAF+FI + GG+
Sbjct: 61  GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEKGGI 120

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +EQ YPY   +  CD S+ N+ VVSIDG+E V P +E +L KA A+QP+SVAI+AGG A
Sbjct: 121 TTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSA 180

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y  GVF G CG+ LDHGV  VGYGT  +G  YW+V+NSWG+DWGENGY++++R  + 
Sbjct: 181 FQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRG-IS 239

Query: 342 TNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
              G CGIA+EASYP+KNS  +    P S
Sbjct: 240 AKEGLCGIAVEASYPIKNSSTNPVGAPSS 268


>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 341

 Score =  321 bits (822), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 152/317 (47%), Positives = 217/317 (68%), Gaps = 8/317 (2%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
           D  ++  ++ W+AK  +         +RF++FK N+ FI+  N+ NR + +G+N+F DLT
Sbjct: 30  DTAMVERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAFIESFNAENRKFWLGVNQFTDLT 89

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           N+E+RA    T+++   ++   +  +  +Y+  + D LP +VDWR KG V P+KDQG CG
Sbjct: 90  NDEFRA----TKTNKGLKMSGGRAPTGFKYSNVSIDALPTAVDWRTKGVVTPIKDQGQCG 145

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGM 222
            CWAFS V A EGI K+ TG+LISLSEQELVDCD   ++ GC GG MD AF+FII+NGG+
Sbjct: 146 CCWAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHGVDQGCEGGEMDDAFKFIIKNGGL 205

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E +YPY   + +C  S  +  V +I GYEDV   DE SL KAVA+QPVSVA++ G   
Sbjct: 206 TTEANYPYTAQDGQCKTSIASNSVATIKGYEDVPANDESSLMKAVANQPVSVAVDGGDVI 265

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQHY  GV TG CG+ LDHG+ A+GYG T +G  YWL++NSWG+ WGE+GY+++++++ D
Sbjct: 266 FQHYSGGVMTGSCGTDLDHGIAAIGYGMTSDGTKYWLLKNSWGTTWGESGYLRMEKDISD 325

Query: 342 TNTGKCGIAMEASYPVK 358
             +G CG+AM+ SYP +
Sbjct: 326 -KSGMCGLAMQPSYPTE 341


>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
           (fragment)
 gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
 gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
 gi|226542|prf||1601514A actinidin
          Length = 302

 Score =  320 bits (820), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 164/295 (55%), Positives = 205/295 (69%), Gaps = 12/295 (4%)

Query: 80  LRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG 138
           LRFIDEHN+  NR+YKVGLN+FADLT EE+R+ YLG    +     K+KV S RY  +  
Sbjct: 1   LRFIDEHNADTNRSYKVGLNQFADLTGEEFRSTYLGFTGGSN----KTKV-SNRYEPRVS 55

Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
             LP  VDWR  GAV  +K QG CG CWAFS +A VEGINKIVTG LISLSEQEL+ C  
Sbjct: 56  QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGG 115

Query: 199 KINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSP 257
             N  GCNGG +   FQFII NGG+++ ++YPY   + +C+   +N K V+ID Y +V  
Sbjct: 116 TQNTRGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYVTIDTYGNVPY 175

Query: 258 FDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYW 317
            +E +L+ AV  QPVSVA++A G AF+HY SG+FTG CG+A+DH V  VGYGTE G+DYW
Sbjct: 176 NNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYW 235

Query: 318 LVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK-NSQNSAKPKPHSS 371
           +V NSW + WGE GY+++ RN+     G CGIA   SYPVK N+QN   PKP+SS
Sbjct: 236 IVENSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVKYNNQN--YPKPYSS 286


>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 473

 Score =  320 bits (819), Expect = 9e-85,   Method: Compositional matrix adjust.
 Identities = 168/362 (46%), Positives = 228/362 (62%), Gaps = 15/362 (4%)

Query: 1   MATAS-MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKH 59
           MA  S + L   +L F+ + SS+S  D S++ Y +  D +  ++  D    ++ +W  KH
Sbjct: 1   MAMGSKLSLFFLSLGFVAYSSSASHNDPSVVGY-SQEDLALPYKLVD----LFSSWSVKH 55

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS-- 117
            K         KR+++FK NL+ I E N  N +Y +GLN+FAD+ +EE+++ YLG ++  
Sbjct: 56  SKIYVSPEEKVKRYEVFKQNLKHIVETNRRNGSYWLGLNQFADVAHEEFKSTYLGLKTGM 115

Query: 118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
           D   R      A   +  +    LP SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGI
Sbjct: 116 DGPAR------APTAFRYENSVNLPWSVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGI 169

Query: 178 NKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
           N+I TG+L SLSEQEL+DCD   + GC GG MD+AF +I+ N G+ ++ DYPYL  E  C
Sbjct: 170 NQIATGKLESLSEQELMDCDTTFDHGCGGGFMDFAFAYIMGNLGIHTDDDYPYLMEEGYC 229

Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
              +  +KVV+I GYEDV    E+SL KA+A QP+SV I AG + FQ Y+ GVF G CG+
Sbjct: 230 KEKQPQSKVVTISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQFYKRGVFEGSCGT 289

Query: 298 ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            LDH + AVGYG+ +G DY +++NSWG  WGE GY +++R       G C I   ASYP 
Sbjct: 290 ELDHALTAVGYGSSDGQDYIIMKNSWGKSWGEQGYFRIKRG-TGKPEGVCSIYSMASYPT 348

Query: 358 KN 359
           K 
Sbjct: 349 KT 350


>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
          Length = 340

 Score =  320 bits (819), Expect = 9e-85,   Method: Compositional matrix adjust.
 Identities = 167/354 (47%), Positives = 227/354 (64%), Gaps = 27/354 (7%)

Query: 10  ISTLVFLFFISSSSAA-DMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHGKTSNGMG 67
           ++ L F FF  ++ AA D+S                DD  M   ++ W+A++ +      
Sbjct: 9   LAILGFAFFCGAALAARDLS----------------DDSAMVARHEQWMAQYSRVYKDAS 52

Query: 68  HNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
              +RF++FK N++FI+  N+  N  + +G+N+FADLTN+E+R+  + T    K   MK 
Sbjct: 53  EKARRFEVFKANVKFIESFNAGGNNKFWLGVNQFADLTNDEFRS--IKTNKGFKSSNMKI 110

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
                RY   + D LP ++DWR KGAV P+KDQG CG CWAFS VAA EGI KI TG+L+
Sbjct: 111 PTGF-RYENVSVDALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLV 169

Query: 187 SLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           SL+EQELVDCD    + GC GGLMD AF+FII NGG+ +E  YPY  A+ KC     +A 
Sbjct: 170 SLAEQELVDCDVHGEDQGCEGGLMDDAFKFIINNGGLTTESSYPYTAADGKCKSGSNSA- 228

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
             +I GYEDV   DE +L KAVA+QPVSVA++ G   FQ Y SGV TG CG+ LDHG+ A
Sbjct: 229 -ATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAA 287

Query: 306 VGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +GYG T +G  YWL++NSWG+ WGENGY+++++++ D   G CG+AME SYP +
Sbjct: 288 IGYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKR-GMCGLAMEPSYPTE 340


>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
          Length = 377

 Score =  320 bits (819), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 163/333 (48%), Positives = 213/333 (63%), Gaps = 13/333 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           +++ +  +Y+ W   H +         +RF  FK N+ FI  HN   +R Y++ LN+F D
Sbjct: 38  SEEALWDLYERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNRFGD 96

Query: 103 LTNEEYRAMYLGTRSDAKRR---LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           ++  E+RA + G+R   +RR        V    YA     +LP SVDWR+KGAV  VK+Q
Sbjct: 97  MSQAEFRATFAGSRVSDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGVKNQ 156

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
           G CGSCWAFSTV +VEGIN I TG+L+SLSEQEL+DCD   N GC GGLMD AF++I +N
Sbjct: 157 GKCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDNAFEYIKKN 216

Query: 220 GGMDSEQDYPYLGAENKCDP---SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
           GG+ +E  YPY  A   C     ++ +  VV IDG++DV    E +L KAVA+QPVSV I
Sbjct: 217 GGLTTEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQPVSVGI 276

Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKL 335
           +A G+AF  Y  GVFTGECG+ LDHGV  VGYG  E+G  YW V+NSWG  WGE GY+++
Sbjct: 277 DASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEKGYIRV 336

Query: 336 QRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
           +++      G CGIAMEASY VK     +KPKP
Sbjct: 337 EKD-SGAEGGLCGIAMEASYAVK---TDSKPKP 365


>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
          Length = 347

 Score =  320 bits (819), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 165/357 (46%), Positives = 224/357 (62%), Gaps = 23/357 (6%)

Query: 6   MFLAISTLV-FLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSN 64
           +FL +S +  F   I+ S   D + +     HD                 W+AKHG+   
Sbjct: 8   IFLIVSLISSFCLSITLSRPLDDNELIMQKRHDE----------------WMAKHGRVYA 51

Query: 65  GMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
            M     R+ +FK N+  I+  N++   RT+K+ +N+FADLTN+E+R+MY G +  +   
Sbjct: 52  DMKEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMYTGYKGGSVLS 111

Query: 123 LMK-SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
               +K +S RY   +   LP SVDWR+KGAV P+K+QG+CG CWAFS VAA+EG  KI 
Sbjct: 112 SQSGTKTSSFRYQNVSSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIK 171

Query: 182 TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
            G+LISLSEQ+LVDCD   + GC+GGLMD AF+ I+  GG+ +E +YPY G +  C    
Sbjct: 172 KGKLISLSEQQLVDCDTN-DFGCSGGLMDTAFEHIMATGGLTTESNYPYKGKDATCKIKN 230

Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
                 SI GYEDV   DE +L KAVA QPVS+ IE GG  FQ Y SGVFTGEC + LDH
Sbjct: 231 TKPTATSITGYEDVPVNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDH 290

Query: 302 GVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            V AVGYG + NG  YW+++NSWG+ WGE+GY+++++++ D   G CG+AM+ASYP 
Sbjct: 291 AVTAVGYGQSSNGSKYWIIKNSWGTKWGESGYMRIKKDVKD-KKGLCGLAMKASYPT 346


>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
 gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
          Length = 339

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 161/348 (46%), Positives = 226/348 (64%), Gaps = 21/348 (6%)

Query: 15  FLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQ 74
            LF I S      ++++     DH++       ++  ++ W+ ++G+         +RF+
Sbjct: 7   LLFAILSCLCLCSAVLAAREQSDHAA-------MVARHERWMEQYGRVYKDATEKARRFE 59

Query: 75  IFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV---ASQ 131
           IFK N+ FI+  N+ N  + +G+N+FADLTN E+RA      +   +  + S V    + 
Sbjct: 60  IFKANVAFIESFNAGNHKFWLGVNQFADLTNYEFRA------TKTNKGFIPSTVRVPTTF 113

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
           RY   + D LP +VDWR KGAV P+KDQG CG CWAFS VAA+EGI K+ TG+LISLSEQ
Sbjct: 114 RYENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQ 173

Query: 192 ELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
           ELVDCD    + GC GGLMD AF+FII+NGG+ +E  YPY  A+ KC+    +A   +I 
Sbjct: 174 ELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSA--ATIK 231

Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
           GYEDV   +E +L KAVA+QPVSVA++ G   FQ Y  GV TG CG+ LDHG+VA+GYG 
Sbjct: 232 GYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGK 291

Query: 311 E-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + +G  YWL++NSWG+ WGENG++++++++ D   G CG+AME SYP 
Sbjct: 292 DGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKR-GMCGLAMEPSYPT 338


>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
          Length = 365

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 165/348 (47%), Positives = 224/348 (64%), Gaps = 24/348 (6%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           L+F   + +S AA  S+       + +S   T D+       W+A++G+         +R
Sbjct: 14  LLFTIGVLASLAAARSL-------NEASMTETHDQ-------WMARYGRVYKTANEKNRR 59

Query: 73  FQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
             IF++NL++I   N  N + YK+G+N+FADLTNEE+      +R+  K  +  +     
Sbjct: 60  STIFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFTT----SRNKFKSHVCATVTNVF 115

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
           RY  +    +P ++DWR+KGAV P+K+QG CG CWAFS VAA+EGI ++ TG+LISLSEQ
Sbjct: 116 RY--ENVTAVPATMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQ 173

Query: 192 ELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
           ELVDCD    + GC GGLMDYAF FI QN G+ +E +YPY G +  C+ ++      +I 
Sbjct: 174 ELVDCDTNGEDQGCEGGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNANKEANHAATIT 233

Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
           G+EDV    E +L KAVA+QP+SVAI+A G  FQ Y SGVFTGECG+ LDHGV AVGYGT
Sbjct: 234 GHEDVPANSESALLKAVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYGT 293

Query: 311 -ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             +G  YWLV+NSWG+ WGE GY+++QR +     G CGIAM+ASYP 
Sbjct: 294 AADGTKYWLVKNSWGTSWGEEGYIQMQRGVAAAE-GLCGIAMQASYPT 340


>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
 gi|194701540|gb|ACF84854.1| unknown [Zea mays]
          Length = 379

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 174/369 (47%), Positives = 227/369 (61%), Gaps = 19/369 (5%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           LV L F+SS++      I +D          +D+ +  +Y+ W   H +     G   +R
Sbjct: 10  LVALVFVSSAAVELCRAIDFDER-----DLASDEALWDLYERWQTHH-RVHRHHGEKGRR 63

Query: 73  FQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV--- 128
           F  FK+N+RFI  HN   +R Y++ LN+F D+  EE+R+ +  +R +  RR         
Sbjct: 64  FGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAARAG 123

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
           A   +   +  + P SVDWR++GAV  VK QG CGSCWAFSTV AVEGIN I TG L SL
Sbjct: 124 AVPGFMYDSAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGSLASL 183

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD---PSRRNAK 245
           SEQEL+DCD   N GC GGLM+ AF+FI   GG+ +E  YPY  +   CD     R    
Sbjct: 184 SEQELIDCDTDEN-GCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGV 242

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
           VV IDG++ V    E +L KAVA QPVSVA++AGG+AFQ Y  GVFTG+CG+ LDHGV A
Sbjct: 243 VVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAA 302

Query: 306 VGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSA 364
           VGYG  ++G  YW+V+NSWG+ WGE GY+++QR     N G CGIAMEAS+P+K S N A
Sbjct: 303 VGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGA--GNGGLCGIAMEASFPIKTSPNPA 360

Query: 365 KP--KPHSS 371
            P  KP  +
Sbjct: 361 DPPRKPRRA 369


>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
 gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 343

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 155/309 (50%), Positives = 213/309 (68%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++ K        E+RF+IFK+N+ +I+  +N+ N+ Y +G+N+FADLTNEE+  
Sbjct: 39  HEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEF-- 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  R+  K  +  S   +  +  +    +P +VDWR+KGAV P+KDQG CG CWAFS 
Sbjct: 97  --IAPRNRFKGHMCSSITRTTTFKYENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSA 154

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI+ +  G+LISLSEQE+VDCD K  + GC GG MD AF+FIIQN G+++E +YP
Sbjct: 155 VAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   + KC+       V +I GYEDV   +E +L+KAVA+QPVSVAI+A G  FQ Y+SG
Sbjct: 215 YKAVDGKCNAKAAANHVATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSG 274

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG CG+ LDHGV AVGYG + +G +YWLV+NSWG++WGE GY+++QR  +    G  G
Sbjct: 275 VFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRG-VKAEEGLXG 333

Query: 349 IAMEASYPV 357
           IAM ASYP 
Sbjct: 334 IAMMASYPT 342


>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
          Length = 361

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 159/309 (51%), Positives = 208/309 (67%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRA 110
           ++ W+ ++G+          RFQIF DN++FI+E N   R +YK+ +N+FAD TNEE++A
Sbjct: 57  HEQWMIQYGRVYKDEAEKSVRFQIFMDNVKFIEEFNKDGRQSYKLAVNEFADQTNEEFQA 116

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
              G +     R   S+    RY  +    +P S+DWR+KGAV PVKDQG CGSCWAFST
Sbjct: 117 SRNGYKMAVSSR--PSQTTLFRY--ENVTAVPSSMDWRKKGAVTPVKDQGQCGSCWAFST 172

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           +AA EGI K+ TG+LISLSEQELVDCD+   + GC GG M+  F+FI++N G+  E  YP
Sbjct: 173 IAATEGITKLKTGKLISLSEQELVDCDKTGEDQGCEGGYMEDGFEFIVKNKGIALEASYP 232

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y  A+  C+     ++   I GYE V    E +L KAVA+QPVSV+I+A G AFQ Y SG
Sbjct: 233 YTAADGTCNSKEEASRAAKISGYEKVPANSETALLKAVANQPVSVSIDASGVAFQFYSSG 292

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTGECG+ LDHGV AVGYG T +G  YWLV+NSWG+ WG++GY+ +QR +     G CG
Sbjct: 293 VFTGECGTDLDHGVTAVGYGKTSDGTKYWLVKNSWGASWGDSGYIMMQRGVA-AKGGLCG 351

Query: 349 IAMEASYPV 357
           IAM+ASYP 
Sbjct: 352 IAMDASYPT 360


>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 342

 Score =  319 bits (817), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 155/308 (50%), Positives = 210/308 (68%), Gaps = 6/308 (1%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+ K+GK        EKRF IF++N+ FI+  N+  N+ YK+ +N  AD TNEE+ A
Sbjct: 38  HEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNKPYKLSINHLADQTNEEFMA 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
            + G +    + L  +     +Y  +   ++P +VDWR+KG    +KDQG CG CWAFS 
Sbjct: 98  SHKGYKGSHWQGLRITTQTPFKY--ENVTDIPWAVDWRQKGDATSIKDQGQCGICWAFSA 155

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           VAA EGI +I TG L+SLSEQELVDCD  ++ GC+GGLM++ F+FII+NGG+ SE +YPY
Sbjct: 156 VAATEGIYQITTGNLVSLSEQELVDCD-SVDHGCDGGLMEHGFEFIIKNGGISSEANYPY 214

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
                 CD ++  +    I GYE V    E  L+KAVA+QPVSV+I+AGG AFQ Y SGV
Sbjct: 215 TAVNGTCDTNKEASPGAQIKGYETVPVNCEEELQKAVANQPVSVSIDAGGSAFQFYSSGV 274

Query: 291 FTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           FTG+CG+ LDHGV AVGYG T++G+ YW+V+NSWG+ WGE GY+++ R  +D   G CGI
Sbjct: 275 FTGQCGTQLDHGVTAVGYGSTDDGIQYWIVKNSWGTQWGEEGYIRMLRG-IDAQEGLCGI 333

Query: 350 AMEASYPV 357
           AM+ASYP 
Sbjct: 334 AMDASYPT 341


>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
          Length = 360

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 166/339 (48%), Positives = 220/339 (64%), Gaps = 10/339 (2%)

Query: 34  NNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTY 93
           N HD  S    +  +  +Y+ W + H  T + +     RF +FK N+  +   N L++ Y
Sbjct: 26  NEHDLDS----EKSLWDLYERWRSHHTVTRS-LDEKHNRFNVFKANVMHVHNTNKLDKPY 80

Query: 94  KVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAV 153
           K+ LNKFAD+TN E+R +Y  ++    R        +  +  +    +P S+DWR+KGAV
Sbjct: 81  KLKLNKFADMTNYEFRRIYADSKVSHHRMFRGMSNENGTFMYENVKNVPSSIDWRKKGAV 140

Query: 154 NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAF 213
             VKDQG CGSCWAFST+ AVEGIN+I T +L+SLSEQELVDCD   N GCNGGLM+YAF
Sbjct: 141 TDVKDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTGGNEGCNGGLMEYAF 200

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
           +FI QN G+ +E +YPY   +  CD  + +   VSIDGYE+V   +E +L KA A QPVS
Sbjct: 201 EFIKQN-GITTESNYPYAAKDGTCDLKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVS 259

Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGY 332
           VAI+AGG  FQ Y  GVF+G CG+ L+HGV  VGYG T++   YW+V+NSWGS+WGE GY
Sbjct: 260 VAIDAGGYNFQFYSEGVFSGHCGTDLNHGVAVVGYGVTQDRTKYWIVKNSWGSEWGEQGY 319

Query: 333 VKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
           +++QR  +    G CGIAMEASYP+K S  S  P   S+
Sbjct: 320 IRMQRG-ISHKEGLCGIAMEASYPIKKS--STNPTESST 355


>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
 gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
          Length = 339

 Score =  318 bits (815), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 158/318 (49%), Positives = 219/318 (68%), Gaps = 11/318 (3%)

Query: 44  TDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
           +DD  M   ++ W+A++G+         +RF++FK N+ FI+  N+ N  + +G+N+FAD
Sbjct: 28  SDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNHNFWLGVNQFAD 87

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
           LTN+E+R M    +++       ++V +  RY     D LP +VDWR KGAV P+KDQG 
Sbjct: 88  LTNDEFRWM----KTNKGFIPSTTRVPTGFRYENVNIDALPATVDWRTKGAVTPIKDQGQ 143

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
           CG CWAFS VAA+EGI K+ TG+LISLSEQELVDCD    + GC GGLMD AF+FII+NG
Sbjct: 144 CGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 203

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+ +E +YPY  A++KC     +  V SI GYEDV   +E +L KAVA+QPVSVA++ G 
Sbjct: 204 GLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD 261

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y+ GV TG CG+ LDHG+VA+GYG   +G  YWL++NSWG+ WGENG++++++++
Sbjct: 262 MTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDI 321

Query: 340 LDTNTGKCGIAMEASYPV 357
            D   G CG+AME SYP 
Sbjct: 322 SDKR-GMCGLAMEPSYPT 338


>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
 gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
          Length = 381

 Score =  318 bits (815), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 167/334 (50%), Positives = 215/334 (64%), Gaps = 14/334 (4%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFA 101
           +D+ +  +Y+ W   H +     G   +RF  FK+N+RFI  HN      +Y++ LN+F 
Sbjct: 38  SDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFG 96

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQR---YACKAGDELPESVDWREKGAVNPVKD 158
           D+  EE+R+ +  +R +  RR  +S  A+     +      ++P SVDWR+ GAV  VK+
Sbjct: 97  DMGPEEFRSTFADSRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQHGAVTAVKN 156

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QG CGSCWAFSTV AVEGIN I TG L+SLSEQELVDCD   N GC GGLM+ AF FI  
Sbjct: 157 QGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDTAEN-GCQGGLMENAFDFIKS 215

Query: 219 NGGMDSEQDYPYLGAENKCD--PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
            GG+ +E  YPY  +   CD   +RR    VSIDG++ V    E +L KAVA QPVSVAI
Sbjct: 216 YGGITTESAYPYRASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVSVAI 275

Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVK 334
           +AGG+AFQ Y  GVFTG+CG+ LDHGV  VGYG    +G  YW+V+NSWG  WGE GY++
Sbjct: 276 DAGGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVDGTPYWIVKNSWGPSWGEGGYIR 335

Query: 335 LQRNLLDTNTGKCGIAMEASYPVKNSQNSA-KPK 367
           +QR     N G CGIAMEAS+P+K S N A KP+
Sbjct: 336 MQRGA--GNGGLCGIAMEASFPIKTSHNPARKPR 367


>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
          Length = 339

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 158/318 (49%), Positives = 219/318 (68%), Gaps = 11/318 (3%)

Query: 44  TDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
           +DD  M   ++ W+A++G+         +RF++FK N+ FI+  N+ N  + +G+N+FAD
Sbjct: 28  SDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNHNFWLGVNQFAD 87

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
           LTN+E+R     T+++       ++V +  RY     D LP +VDWR KGAV P+KDQG 
Sbjct: 88  LTNDEFR----WTKTNKGFIPSTTRVPTGFRYENVNIDALPATVDWRTKGAVTPIKDQGQ 143

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
           CG CWAFS VAA+EGI K+ TG+LISLSEQELVDCD    + GC GGLMD AF+FII+NG
Sbjct: 144 CGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 203

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+ +E +YPY  A++KC     +  V SI GYEDV   +E +L KAVA+QPVSVA++ G 
Sbjct: 204 GLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD 261

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y+ GV TG CG+ LDHG+VA+GYG   +G  YWL++NSWG+ WGENG++++++++
Sbjct: 262 MTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDI 321

Query: 340 LDTNTGKCGIAMEASYPV 357
            D   G CG+AME SYP 
Sbjct: 322 SDKR-GMCGLAMEPSYPT 338


>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
 gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  318 bits (814), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 167/349 (47%), Positives = 229/349 (65%), Gaps = 22/349 (6%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMT-IYQTWLAKHGKTSNGMGHNEK 71
           L+ LFF+ +  A            D +S+    +  M   ++ W+AKHGK         +
Sbjct: 11  LIALFFVLAMWA------------DQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLR 58

Query: 72  RFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
           RFQIFK+N+ FI+  N+  N +Y +G+N+FADLTNEE+RA + G     KR L  S++ +
Sbjct: 59  RFQIFKNNVEFIESSNAAGNNSYMLGINRFADLTNEEFRASWNGY----KRPLDASRIVT 114

Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
             +  +    LP S+DWR KGAV  +KDQ  CGSCWAFS VAA EG++K+ TG+L+SLSE
Sbjct: 115 P-FKYENVTALPYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSE 173

Query: 191 QELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
           QELVDCD K  + GC GGLM+ AF+FI +NGG+ +E +Y Y G + KCD  +  + V  I
Sbjct: 174 QELVDCDVKGEDKGCQGGLMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEASHVAKI 233

Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
            GY+ V    E +L KAVA QPVSV+I+AG  +FQ Y+SG++ G CGS L+HGV AVGYG
Sbjct: 234 TGYQVVPENSEAALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYG 293

Query: 310 T-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           T  +G  YW+V+NSWG +WGE GYV+++R+ + +  G CGIAM+ SYP 
Sbjct: 294 TSSSGSKYWIVKNSWGPEWGERGYVRMKRD-ITSRKGLCGIAMDCSYPT 341


>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
          Length = 339

 Score =  318 bits (814), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 160/348 (45%), Positives = 226/348 (64%), Gaps = 21/348 (6%)

Query: 15  FLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQ 74
            LF I S      ++++     DH++       ++  ++ W+ ++G+         +RF+
Sbjct: 7   LLFAILSCLCLCSAVLAAREQSDHAA-------MVARHERWMEQYGRVYKDATEKARRFE 59

Query: 75  IFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV---ASQ 131
           IFK N+ FI+  N+ N  + +G+N+FADLTN E+RA      +   +  + S V    + 
Sbjct: 60  IFKANVAFIESFNAGNHKFWLGVNQFADLTNYEFRA------TKTNKGFIPSTVRVPTTF 113

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
           RY   + D LP +VDWR KGAV P+KDQG CG CWAFS VAA+EGI K+ TG+LISLSEQ
Sbjct: 114 RYENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQ 173

Query: 192 ELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
           ELVDCD    + GC GGLMD AF+FII+NGG+ +E  YPY  A+ KC+    +A   +I 
Sbjct: 174 ELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSA--ATIK 231

Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
           GYE+V   +E +L KAVA+QPVSVA++ G   FQ Y  GV TG CG+ LDHG+VA+GYG 
Sbjct: 232 GYEEVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGK 291

Query: 311 E-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + +G  YWL++NSWG+ WGENG++++++++ D   G CG+AME SYP 
Sbjct: 292 DGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKR-GMCGLAMEPSYPT 338


>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 159/308 (51%), Positives = 208/308 (67%), Gaps = 9/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A HGK        E+++Q FK+N++ I+  N   N+ YK+G+N FADLTNEE++A
Sbjct: 40  HEQWMAIHGKVYTHSYEKEQKYQTFKENVQRIEAFNHAGNKPYKLGINHFADLTNEEFKA 99

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
           +    R         ++  + RY  +    +P ++DWR++GAV P+KDQG CG CWAFS 
Sbjct: 100 I---NRFKGHVCSKITRTPTFRY--ENMTAVPATLDWRQEGAVTPIKDQGQCGCCWAFSA 154

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI K+ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FI+QN G+ +E  YP
Sbjct: 155 VAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKGLAAEAIYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+         SI GYEDV    E +L KAVA+QPVSVAIEA G  FQ Y  G
Sbjct: 215 YEGVDGTCNAKAEGNHATSIKGYEDVPANSESALLKAVANQPVSVAIEASGFEFQFYSGG 274

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG CG+ LDHGV AVGYG +++G  YWLV+NSWG  WG+ GY+++QR++     G CG
Sbjct: 275 VFTGSCGTNLDHGVTAVGYGVSDDGTKYWLVKNSWGVKWGDKGYIRMQRDVA-AKEGLCG 333

Query: 349 IAMEASYP 356
           IAM ASYP
Sbjct: 334 IAMLASYP 341


>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 345

 Score =  317 bits (811), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 166/316 (52%), Positives = 223/316 (70%), Gaps = 5/316 (1%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           T++ +  +Y+ W  KH   S  +    KRF +FK+N+  +   N +++ YK+ LNKFAD+
Sbjct: 33  TEESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQMDKPYKLKLNKFADM 91

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           +N E+   Y  +     R+L + +  +  +  +   +LP SVDWRE+GAVN VK+QG CG
Sbjct: 92  SNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDWRERGAVNAVKEQGRCG 151

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
           SCWAFS+VAAVEGINKI T +L+SLSEQEL+DC+ + N GCNGG M+ AF FI +NGG+ 
Sbjct: 152 SCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCNGGFMEIAFDFIKRNGGIA 210

Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
           +E  YPY G+   C  SR ++ +V IDGYE V P +E +L +AVA+QPVSVAI+A GR F
Sbjct: 211 TENSYPYHGSRGLCRSSRISSPIVKIDGYESV-PENEDALMQAVANQPVSVAIDAAGRDF 269

Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           Q Y  GVF G CG+ L+HGVVA+GYG TE+G DYWLVRNSWG  WGE+GYV+++R  ++ 
Sbjct: 270 QFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKRG-VEQ 328

Query: 343 NTGKCGIAMEASYPVK 358
             G CGIAMEASYP+K
Sbjct: 329 AEGLCGIAMEASYPIK 344


>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
          Length = 339

 Score =  317 bits (811), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 160/348 (45%), Positives = 225/348 (64%), Gaps = 21/348 (6%)

Query: 15  FLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQ 74
            LF I S      ++++     DH++       ++  ++ W+ ++G+         +RF+
Sbjct: 7   LLFAILSCLCLCSAVLAAREQSDHAA-------MVARHERWMEQYGRVYKDATEKARRFE 59

Query: 75  IFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV---ASQ 131
           IFK N+ FI+  N+ N  + + +N+FADLTN E+RA      +   +  + S V    + 
Sbjct: 60  IFKANVAFIESFNAGNHKFWLSVNQFADLTNYEFRA------TKTNKGFIPSTVRVPTTF 113

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
           RY   + D LP +VDWR KGAV P+KDQG CG CWAFS VAA+EGI K+ TG+LISLSEQ
Sbjct: 114 RYENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQ 173

Query: 192 ELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
           ELVDCD    + GC GGLMD AF+FII+NGG+ +E  YPY  A+ KC+    +A   +I 
Sbjct: 174 ELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNSA--ATIK 231

Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
           GYEDV   +E +L KAVA+QPVSVA++ G   FQ Y  GV TG CG+ LDHG+VA+GYG 
Sbjct: 232 GYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGK 291

Query: 311 E-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + +G  YWL++NSWG+ WGENG++++++++ D   G CG+AME SYP 
Sbjct: 292 DGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKR-GMCGLAMEPSYPT 338


>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
          Length = 466

 Score =  317 bits (811), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 158/299 (52%), Positives = 210/299 (70%), Gaps = 14/299 (4%)

Query: 70  EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA--KRRLMKSK 127
           E+RF ++ DNLRF+ E+N+ + ++ + +  +ADL+ +EYR+  LG  +D   +R L  + 
Sbjct: 58  ERRFDVWLDNLRFVHEYNAGHTSHWLSMGVYADLSQDEYRSKALGYNADLHEERPLRAAP 117

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
              +      G   P+ VDW  KGAV PVK+Q  CGSCWAFST  AVEG + I TG+L S
Sbjct: 118 FLYE------GTVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVEGASAIATGKLAS 171

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           LSEQ LVDCDR+ + GC+GGLMD+AF+FI++NGG+D+E DYPY   E  C  ++    VV
Sbjct: 172 LSEQMLVDCDRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTAEEGMCQDNKMRRHVV 231

Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
           +ID Y+DV P DE +L KAVA+QPVSVAIEA  RAFQ Y  GVF  ECG+ALDHGV+ VG
Sbjct: 232 TIDDYQDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVFDAECGTALDHGVLVVG 291

Query: 308 YGT-ENG---VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           YGT  NG   + YWLV+NSWG++WG+ GY++L RNL     G+CG+AM+AS+P+K   N
Sbjct: 292 YGTASNGTHHLPYWLVKNSWGAEWGDKGYIRLLRNL--GEEGQCGVAMQASFPIKKGAN 348


>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 341

 Score =  317 bits (811), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 160/308 (51%), Positives = 208/308 (67%), Gaps = 9/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRA 110
           ++ W+A HGK        E+++QIF +N++ I+  N+   + YK+G+N FADLTNEE++A
Sbjct: 38  HEQWMATHGKVYKHSYEKEQKYQIFMENVQRIEAFNNAGXKPYKLGINHFADLTNEEFKA 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
           +    R        +++  + RY  +    +P S+DWR+KGAV P+KDQG CG CWAFS 
Sbjct: 98  I---NRFKGHVCSKRTRTTTFRY--ENVTAVPASLDWRQKGAVTPIKDQGQCGCCWAFSA 152

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI K+ TG+LISLSEQELVDCD K ++ GC GGLMD AF+FI+QN G+ +E  YP
Sbjct: 153 VAATEGITKLRTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKGLATEAIYP 212

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+         SI GYEDV    E +L KAVA+QPVSVAIEA G  FQ Y  G
Sbjct: 213 YEGFDGTCNAKADGNHAGSIKGYEDVPANSESALLKAVANQPVSVAIEASGFKFQFYSGG 272

Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG CG+ LDHGV +VGYG  ++G  YWLV+NSWG  WGE GY+++QR++     G CG
Sbjct: 273 VFTGSCGTNLDHGVTSVGYGVGDDGTKYWLVKNSWGVKWGEKGYIRMQRDVA-AKEGLCG 331

Query: 349 IAMEASYP 356
           IAM ASYP
Sbjct: 332 IAMLASYP 339


>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
          Length = 339

 Score =  317 bits (811), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 156/318 (49%), Positives = 219/318 (68%), Gaps = 11/318 (3%)

Query: 44  TDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
           +DD  M   ++ W+A++G+         +RF++FK N+ FI+  N+ N  + +G+N+FAD
Sbjct: 28  SDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGVNQFAD 87

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
           LTN+E+R+    T+++       ++V +  RY     D LP ++DWR KG V P+KDQG 
Sbjct: 88  LTNDEFRS----TKTNKGFIPSTTRVPTGFRYENVNIDALPATMDWRTKGVVTPIKDQGQ 143

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
           CG CWAFS VAA+EGI K+ TG+LISLSEQELVDCD    + GC GGLMD AF+FII+NG
Sbjct: 144 CGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 203

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+ +E +YPY  A++KC     +  V SI GYEDV   +E +L KAVA+QPVSVA++ G 
Sbjct: 204 GLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD 261

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y+ GV TG CG+ LDHG+VA+GYG   +G  YWL++NSWG+ WGENG++++++++
Sbjct: 262 MTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDI 321

Query: 340 LDTNTGKCGIAMEASYPV 357
            D   G CG+AME SYP 
Sbjct: 322 SDKR-GMCGLAMEPSYPT 338


>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
 gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
          Length = 340

 Score =  317 bits (811), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 159/319 (49%), Positives = 216/319 (67%), Gaps = 12/319 (3%)

Query: 45  DDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           DD  M   ++ W+A++ +         +RF++FK N++FI+  N+  NR + +G+N+FAD
Sbjct: 29  DDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKFIESFNAGGNRKFWLGVNQFAD 88

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
           LTN+E+RA    T+++   +    KV +  RY   + D LP S+DWR KGAV P+KDQG 
Sbjct: 89  LTNDEFRA----TKTNKGFKPSPVKVPTGFRYENVSVDALPASIDWRTKGAVTPIKDQGQ 144

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
           CG CWAFS VAA EGI KI T +LISLSEQELVDCD    + GC GGLMD AF+FII+NG
Sbjct: 145 CGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 204

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+ +E  YPY   + KC     +A   +I G+EDV   DE +L KAVA+QPVSVA++ G 
Sbjct: 205 GLTTESSYPYTATDGKCKSGTNSA--ANIKGFEDVPANDEAALMKAVANQPVSVAVDGGD 262

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y  GV TG CG+ LDHG+ A+GYG T +G  YWL++NSWG+ WGENGY+++++++
Sbjct: 263 MTFQLYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDI 322

Query: 340 LDTNTGKCGIAMEASYPVK 358
            D   G CG+AME SYP +
Sbjct: 323 SDKR-GMCGLAMEPSYPTE 340


>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
 gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
          Length = 373

 Score =  316 bits (810), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 164/331 (49%), Positives = 211/331 (63%), Gaps = 13/331 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           +++ +  +Y+ W + H +         +RF  FK N  FI  HN   +  Y++ LN+F D
Sbjct: 38  SEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGD 96

Query: 103 LTNEEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           +   E+RA ++G  R D   +     V    YA     +LP SVDWR+KGAV  VKDQG 
Sbjct: 97  MDQAEFRATFVGDLRRDTPSK--PPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
           CGSCWAFSTV +VEGIN I TG L+SLSEQEL+DCD   N GC GGLMD AF++I  NGG
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214

Query: 222 MDSEQDYPYLGAENKCDPSR---RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           + +E  YPY  A   C+ +R    +  VV IDG++DV    E  L +AVA+QPVSVA+EA
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQR 337
            G+AF  Y  GVFTGECG+ LDHGV  VGYG  E+G  YW V+NSWG  WGE GY+++++
Sbjct: 275 SGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEK 334

Query: 338 NLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
           +    + G CGIAMEASYPVK     +KPKP
Sbjct: 335 D-SGASGGLCGIAMEASYPVK---TYSKPKP 361


>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
 gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
          Length = 341

 Score =  316 bits (810), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 155/316 (49%), Positives = 215/316 (68%), Gaps = 10/316 (3%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
           +D +  +++ W+ +HGK        +KRF IFK+N+ +I+  N++ N++YK+GLN FADL
Sbjct: 32  NDPMYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENVNYIEAFNNVGNKSYKLGLNHFADL 91

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           TN E+    +  R+     L  S + + +Y  K   ++P +VDWR++GAV PVK+QG CG
Sbjct: 92  TNHEF----IAARNKFNGYLHGSIITTFKY--KNVSDVPSAVDWRQEGAVTPVKNQGQCG 145

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGM 222
            CWAFS VA+ EGI+K+ TG L+SLSEQELVDCD    + GC GGLMD AF+FIIQN G+
Sbjct: 146 CCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTNGEDQGCEGGLMDDAFEFIIQNNGL 205

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E +YPY G +  C+ +   +   +I GYE+V   DE +L+KAVA+QPVSVAI+A G  
Sbjct: 206 STEAEYPYQGVDGTCNKTEVGSSAATISGYENVPVNDEQALQKAVANQPVSVAIDASGSD 265

Query: 283 FQHYESGVFTGECGSALDH-GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y+SGVFTG CG+ LDH   V      E+  +YWLV+NSWG+ WGE GY+++QR  +D
Sbjct: 266 FQFYKSGVFTGSCGTELDHGVAVVGYGVGEDETEYWLVKNSWGTQWGEEGYIRMQRG-VD 324

Query: 342 TNTGKCGIAMEASYPV 357
            + G CGIAM+ SYP 
Sbjct: 325 ASEGLCGIAMQPSYPT 340


>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
 gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 152/311 (48%), Positives = 206/311 (66%), Gaps = 11/311 (3%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNE 106
           V  +++ W  +HGK+ +       R  +F DN  F+  HN+L N +Y + LN +ADLT+ 
Sbjct: 25  VSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTHH 84

Query: 107 EYRAMYLGTRSDAK--RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
           E++   LG     +  R ++  + +  R       ++P+S+DWR+KGAV  VKDQGSCG+
Sbjct: 85  EFKVSRLGFSPALRNFRPVLPQEPSLPR-------DVPDSLDWRKKGAVTAVKDQGSCGA 137

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
           CW+FS   A+EGIN+I+TG LISLSEQEL+DCDR  N+GC GGLMDYA+QF+I N G+D+
Sbjct: 138 CWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDT 197

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           E DYPY   +  C   +    VV+IDGY D+   DE  L +AVA QPVSV I    RAFQ
Sbjct: 198 ENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQ 257

Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            Y  G+F+G C ++LDH V+ VGYG+ENGVDYW+V+NSWG  WG +GY+ +QRN  ++  
Sbjct: 258 LYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSE- 316

Query: 345 GKCGIAMEASY 355
           G CGI   ASY
Sbjct: 317 GVCGINKLASY 327


>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
          Length = 364

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 160/299 (53%), Positives = 203/299 (67%), Gaps = 11/299 (3%)

Query: 70  EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA-KRRLMKSKV 128
           E+RF I+ DNLRF  E+N+ + ++ + +  +ADL+ +EYR+  LG  +   K+R +++  
Sbjct: 69  ERRFNIWLDNLRFAHEYNARHTSHWLSMGVYADLSQDEYRSKALGYNAHLHKKRPLRAAP 128

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
              +     G   PE VDW   GAV PVKDQ  CGSCWAFST  AVEG N I TG+L+SL
Sbjct: 129 FLYK-----GTVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANAIATGKLVSL 183

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           SEQ LVDCDR+ + GC GG MD AF FI+ NGG+D+E DYPY   +  C  +R    VV+
Sbjct: 184 SEQMLVDCDREYDTGCRGGFMDSAFDFIVNNGGIDTEDDYPYRAEDGICQDNRTRRHVVT 243

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
           IDGY+DV P DE +L KAVA QPVSVAIEA   AFQ Y  GVF  ECG+ALDH V+ VGY
Sbjct: 244 IDGYQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGVFDAECGTALDHAVLVVGY 303

Query: 309 GT-ENG---VDYWLVRNSWGSDWGENGYVKLQRNL-LDTNTGKCGIAMEASYPVKNSQN 362
           GT  NG   + YWLV+NSWG++WGE GY++L RNL  D   G+CG+AM AS+P+K   N
Sbjct: 304 GTASNGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQCGLAMYASFPIKKGAN 362


>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
          Length = 369

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 162/326 (49%), Positives = 210/326 (64%), Gaps = 20/326 (6%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           +++ +  +Y+ W  +H + +  +G   +RF +FKDN+R I E N  +  YK+ LN+F D+
Sbjct: 40  SEEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRRDEPYKLRLNRFGDM 98

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           T +E    Y  +R    R        +QR                  GAV  VKDQG CG
Sbjct: 99  TADESAGAYASSRVSHHRMFRGRGEKAQRL----------------HGAVGAVKDQGQCG 142

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
           SCWAFST+AAVEGIN I T  L +LSEQ+LVDCD K  NAGC+GGLMD AFQ+I ++GG+
Sbjct: 143 SCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGV 202

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +   YPY   ++ C  S  ++  V+IDGYEDV    E +LKKAVA+QPVSVAIEAGG  
Sbjct: 203 AASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSH 262

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y  GVF G+CG+ LDHGV AVGYGT  +G  YW+VRNSWG+DWGE GY++++R+ + 
Sbjct: 263 FQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRD-VS 321

Query: 342 TNTGKCGIAMEASYPVKNSQNSAKPK 367
              G CGIAMEASYP+K S N A  K
Sbjct: 322 AKEGLCGIAMEASYPIKTSPNPAPKK 347


>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 153/309 (49%), Positives = 212/309 (68%), Gaps = 10/309 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++TW+A++G+         ++F++FK N RFID  N+ N  + +G+N+FADLTNEE++A 
Sbjct: 37  HETWMAQYGRVYKDAAEKAQKFEVFKANARFIDSFNAENHKFWLGINQFADLTNEEFKA- 95

Query: 112 YLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
              T+++      K++V++  +Y     + LP S+DWR KGAV PVKDQG CG CWAFS 
Sbjct: 96  ---TKTNKGFISNKARVSTGFKYENLKIEALPTSIDWRTKGAVTPVKDQGQCGCCWAFSA 152

Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA EGI K+ TG+L+SLSEQELVDCD    + GC GGLMD AF+FII NGG+  E  YP
Sbjct: 153 VAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIITNGGLTQESSYP 212

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   + KC    ++A   +I  YEDV   +E +L KAVA+QPVSVA++ G   FQ Y  G
Sbjct: 213 YDAEDGKCKSGSKSAG--TIKSYEDVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGG 270

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           V TG CG+ LDHG+ A+GYG T +G  +WL++NSWG+ WGENG++++++++ D   G CG
Sbjct: 271 VMTGSCGTDLDHGIAAIGYGVTSDGTKFWLMKNSWGTTWGENGFLRMEKDIAD-KKGMCG 329

Query: 349 IAMEASYPV 357
           +AME SYP 
Sbjct: 330 LAMEPSYPT 338


>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
          Length = 347

 Score =  314 bits (804), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 158/322 (49%), Positives = 214/322 (66%), Gaps = 16/322 (4%)

Query: 45  DDEVMTI--YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLN 98
           DDE+  +  ++ W+ +HG+          RF +FK N++FI+  N+     NR + +G+N
Sbjct: 32  DDELAMVARHEQWMVQHGRVYKDETDKAHRFLVFKANVKFIESFNAAAAAGNRKFWLGVN 91

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVK 157
           +FADLTN+E+RA    T+++        KV +  RY   + D LP++VDWR KGAV P+K
Sbjct: 92  QFADLTNDEFRA----TKTNKGFNPNVVKVPTGFRYQNLSIDALPQTVDWRTKGAVTPIK 147

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFI 216
           DQG CG CWAFS VAA EGI KI TG+L SLSEQELVDCD    + GCNGG MD AF+FI
Sbjct: 148 DQGQCGCCWAFSAVAATEGIVKISTGKLTSLSEQELVDCDVHGEDQGCNGGEMDDAFKFI 207

Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
           I+NGG+ +E +YPY   + +C      A   +I GYEDV   DE +L KAVA QPVSVA+
Sbjct: 208 IKNGGLTTESNYPYTAQDGQCKSGSNGA--ATIKGYEDVPANDEAALMKAVASQPVSVAV 265

Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKL 335
           + G   FQ Y  GV TG CG+ LDHG+ A+GYG T +G  YWL++NSWG+ WGENG++++
Sbjct: 266 DGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGFLRM 325

Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
           ++++ D   G CG+AM+ SYP 
Sbjct: 326 EKDIAD-KKGMCGLAMQPSYPT 346


>gi|414875906|tpg|DAA53037.1| TPA: hypothetical protein ZEAMMB73_586844 [Zea mays]
          Length = 1039

 Score =  314 bits (804), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 143/206 (69%), Positives = 171/206 (83%), Gaps = 1/206 (0%)

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           GSCWAFST+AAVEGIN+IVTG+LISLSEQELVDCD   N GCNGGLMDYAF+FII NGG+
Sbjct: 713 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGI 772

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
           D+E+DYPY G + +CD +R+NAKVV+ID YEDV   DE SL+KAVA+QPVSVAIEA G  
Sbjct: 773 DTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTT 832

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           FQ Y SG+FTG CG+ALDHGV  VGYGTENG DYW+++NSWGS WGE+GYV+++RN +  
Sbjct: 833 FQLYSSGIFTGSCGTALDHGVTVVGYGTENGKDYWIMKNSWGSSWGESGYVRMERN-IKA 891

Query: 343 NTGKCGIAMEASYPVKNSQNSAKPKP 368
           ++GKCGIA+E SYP+K   N   P P
Sbjct: 892 SSGKCGIAVEPSYPLKEGANPPNPGP 917


>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
          Length = 349

 Score =  314 bits (804), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 150/321 (46%), Positives = 213/321 (66%), Gaps = 8/321 (2%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVGLNKFAD 102
           D  ++  ++ W+A+HG+         +RF+ F++N+ FI+  N+    R + +G+N+F D
Sbjct: 30  DAAMVERHEQWMAQHGRVYKDGAEKARRFEAFRNNVVFIESFNAAGNRRKFWLGVNQFTD 89

Query: 103 LTNEEYRAMYLG---TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           LTN+E+RA        + +A      S   + RY+  + D LP +VDWR KGAV P+K+Q
Sbjct: 90  LTNDEFRATKTNKGFIKRNAAAVNKASPTGTFRYSNVSADALPAAVDWRAKGAVTPIKNQ 149

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQ 218
           G CG CWAFS VAA EGI ++ TG+L+ LSEQELVDCD    + GC GG MD AF+FII+
Sbjct: 150 GQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQELVDCDANGADHGCEGGEMDDAFEFIIK 209

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           NGG+ SE +YPY   + +C        V +I GYEDV   DE SL KAVA QPVSVA++ 
Sbjct: 210 NGGLTSETNYPYTAQDGQCKAKNTINSVATIKGYEDVPANDEASLMKAVAAQPVSVAVDG 269

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQR 337
           G   FQHY  GV +G CG++LDHG+VAVGYG  ++G  +WL++NSWG+ WGE+GY+++++
Sbjct: 270 GDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGAADDGTKFWLMKNSWGTTWGEDGYIRMEK 329

Query: 338 NLLDTNTGKCGIAMEASYPVK 358
           ++ D   G CG+AM+ SYP +
Sbjct: 330 DVADAG-GMCGLAMQPSYPTE 349


>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  314 bits (804), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 159/307 (51%), Positives = 212/307 (69%), Gaps = 16/307 (5%)

Query: 56  LAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLG 114
           +A++G+        EKRF+IFKDN+  I+  N ++++TYK+ +N+FADLTNEE+R++   
Sbjct: 1   MARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSL--- 57

Query: 115 TRSDAKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
                 R   K+ + S+    K  +   +P ++DWR+KGAV P+KDQ  CG CWAFS VA
Sbjct: 58  ------RNRFKAHICSEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVA 111

Query: 173 AVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
           A EGI +I TG+LISLSEQELVDCD    N GC+GGLMD AF+FI +  G+ SE  YPY 
Sbjct: 112 ATEGITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFI-KIHGLASEATYPYE 170

Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
           G +  C+  +       I GYEDV   +E +L+KAVA QPV+VAI+AGG  FQ Y SGVF
Sbjct: 171 GDDGTCNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVF 230

Query: 292 TGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
           TG+CG+ LDHGV AVGYG  ++G+ YWLV+NSWG+ WGE GY+++QR++     G CGIA
Sbjct: 231 TGQCGTELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIA 289

Query: 351 MEASYPV 357
           M+ASYP 
Sbjct: 290 MQASYPT 296


>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
          Length = 363

 Score =  313 bits (803), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 156/316 (49%), Positives = 212/316 (67%), Gaps = 7/316 (2%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
           D  ++  ++ W+A HG+        + RFQIFK+N+ +ID HN+  +++Y + +NKFADL
Sbjct: 48  DPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLEVNKFADL 107

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           TN+E+RA   G +   K+    S V S  +       +P+ VDWR++GAV PVKDQG CG
Sbjct: 108 TNDEFRASRNGYK---KQPDSDSHVVSGLFRYANVSAVPDEVDWRKEGAVTPVKDQGDCG 164

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGM 222
            CWAFS VAA+EGINK+  G+L+SLSEQELVDCD   I+ GC GGLM+ AFQFI +  G+
Sbjct: 165 CCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFIEKRKGL 224

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E  YPY G +  C+  +       I G+E V   +E +L +AVA+QPVS+AI+A G  
Sbjct: 225 AAESVYPYTGEDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAIDASGYE 284

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y  GVFTG CG+ LDH + AVGYG T +G  YWL++NSWG+ WGENGY++++R+ L 
Sbjct: 285 FQFYSGGVFTGSCGTELDHAITAVGYGATMDGTKYWLMKNSWGASWGENGYIRIKRDSL- 343

Query: 342 TNTGKCGIAMEASYPV 357
              G CGIAM+ SYPV
Sbjct: 344 AKEGLCGIAMDPSYPV 359


>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 349

 Score =  313 bits (802), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 160/312 (51%), Positives = 208/312 (66%), Gaps = 9/312 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL--NKFADLTNEEYR 109
           ++ W+AKHG+         +R ++F+DN+ FI+  N+    +K  L  N+FADLTN E+R
Sbjct: 40  HERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTNAEFR 99

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
           A   G R  + R        S RYA  +  +LP SVDWR KGAVNPVKDQG CG CWAFS
Sbjct: 100 ATRTGLRPSSSRG--NRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWAFS 157

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
            VAA+EG  K+ TG+L+SLSEQ+LV CD K  + GC GGLMD AF FII+NGG+ +E DY
Sbjct: 158 AVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAESDY 217

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY  +++KC  +   A   +I GYEDV   DE +L KAVA+QPVSVAI+ G R FQ Y+ 
Sbjct: 218 PYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFYKG 277

Query: 289 GVFTGE--CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           GV +G   C + LDH + AVGYG   +G  YWL++NSWG+ WGE+GYV+++R + D   G
Sbjct: 278 GVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKE-G 336

Query: 346 KCGIAMEASYPV 357
            CG+AM ASYP 
Sbjct: 337 VCGLAMMASYPT 348


>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 337

 Score =  313 bits (802), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 150/321 (46%), Positives = 213/321 (66%), Gaps = 17/321 (5%)

Query: 38  HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVG 96
           H +S R + E       W+A++G+    +   ++ FQIFK+N+ FI+  N+  N+ YK+G
Sbjct: 30  HETSLREEHE------NWIARYGQVYK-VAAEKETFQIFKENVEFIESFNAAANKPYKLG 82

Query: 97  LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
           +N FADLT EE++    G +   +  +   K  +         ++PE++DWREKGAV P+
Sbjct: 83  VNLFADLTLEEFKDFRFGLKKTHEFSITPFKYENVT-------DIPEALDWREKGAVTPI 135

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQF 215
           KDQG CGSCWAFSTVAA EGI++I TG L+SL EQELV CD K ++ GC GG M+  F+F
Sbjct: 136 KDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKGVDQGCEGGYMEDGFEF 195

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
           II+NGG+ ++ +YPY G    C+ +   + V  I GYE V  + E +L+KAVA+QPVSV+
Sbjct: 196 IIKNGGITTKANYPYKGVNGTCNTTIAASTVAQIKGYETVPSYSEEALQKAVANQPVSVS 255

Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
           I+A    F  Y  G++TGECG+ LDHGV AVGYGT N  DYW+V+NSWG+ W E G++++
Sbjct: 256 IDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNETDYWIVKNSWGTGWDEKGFIRM 315

Query: 336 QRNLLDTNTGKCGIAMEASYP 356
           QR  +    G CG+A+++SYP
Sbjct: 316 QRG-ITVKHGLCGVALDSSYP 335


>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
          Length = 346

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 158/319 (49%), Positives = 211/319 (66%), Gaps = 8/319 (2%)

Query: 45  DDEVMT--IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKF 100
           DDE++    +  W+A+HG+T   M     R+ +FK N+  I+  N++   RT+K+ +N+F
Sbjct: 29  DDELIMQKKHDEWMAEHGRTYADMNEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQF 88

Query: 101 ADLTNEEYRAMYLGTRSD-AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           ADLTN+E+R MY G + D       ++K  S RY       LP +VDWR+KGAV P+K+Q
Sbjct: 89  ADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRYQNVFFGALPIAVDWRKKGAVTPIKNQ 148

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
           GSCG CWAFS VAA+EG  +I  G+LISLSEQ+LVDCD   + GC+GGLMD AF+ I+  
Sbjct: 149 GSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCSGGLMDTAFEHIMAT 207

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
           GG+ +E +YPY G +  C          SI GYEDV   DE +L KAVA QPVSV IE G
Sbjct: 208 GGLTTESNYPYKGEDANCKIKSTKPSAASITGYEDVPVNDENALMKAVAHQPVSVGIEGG 267

Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRN 338
           G  FQ Y SGVFTGEC + LDH V AVGY   + G  YW+++NSWG+ WGE GY++++++
Sbjct: 268 GFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRIKKD 327

Query: 339 LLDTNTGKCGIAMEASYPV 357
           + D   G CG+AM+ASYP 
Sbjct: 328 IKDKE-GLCGLAMKASYPT 345


>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
          Length = 565

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 157/319 (49%), Positives = 204/319 (63%), Gaps = 14/319 (4%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR---------TYKVGLNKFA 101
           +++ W A+HGK     G    R   F DN  F+  HN+            +Y + LN FA
Sbjct: 41  LFEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFA 100

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG-DELPESVDWREKGAVNPVKDQG 160
           DLT+ E+RA  LG  +    R   S+     +A   G   +PE++DWR+ GAV  VKDQG
Sbjct: 101 DLTHAEFRAARLGRLAVGGARAPPSEGG---FAGSVGVGAVPEALDWRQSGAVTKVKDQG 157

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
           SCG+CW+FS   A+EGINKI TG LISLSEQEL+DCDR  NAGC GGLMDYA++F+I+NG
Sbjct: 158 SCGACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAGCGGGLMDYAYRFVIKNG 217

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+D+E DYPY  A+  C+ ++    VV+IDGY DV    E SL +AVA QP+SV I    
Sbjct: 218 GIDTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSA 277

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
           RAFQ Y  G+F G C ++LDH V+ VGYG+E G DYW+V+NSWG  WG  GY+ + RN  
Sbjct: 278 RAFQLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRN-T 336

Query: 341 DTNTGKCGIAMEASYPVKN 359
            +++G CGI M AS+P K 
Sbjct: 337 GSSSGICGINMMASFPTKT 355


>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
 gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
 gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
 gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 152/308 (49%), Positives = 210/308 (68%), Gaps = 5/308 (1%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+ + GK+       EKRFQIFK+N+ FI+  N++ N+ + + +N FADLTNEE++A
Sbjct: 37  HEKWMTQFGKSYKDAAEKEKRFQIFKNNVEFIELFNAVGNKPFNLSINHFADLTNEEFKA 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
              G +    +  + ++  S RY       +P S+DWR++GAV P+K+QGSCGSCWAFST
Sbjct: 97  SLNGNKKLHDKFDILNETTSFRYHNVT--SVPASMDWRKRGAVTPIKNQGSCGSCWAFST 154

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           VA++EGI++I TGEL+SLSEQEL+DC R  ++GC+GG ++ AF+FI + GGM SE +YPY
Sbjct: 155 VASIEGIHQITTGELVSLSEQELIDCVRGNSSGCSGGYLEDAFKFIAKKGGMASETNYPY 214

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
              + KC   + +  V  I GYE V    E  L KAVA+QPVSV ++AG   FQ Y  G+
Sbjct: 215 KETDEKCKFKKESKHVAEIKGYEKVPSNSENDLLKAVANQPVSVYVDAGDYVFQFYSGGI 274

Query: 291 FTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           FTG+CG+  DH V  VGYG   +  +YWLV+NSWG+ WGE GY+KL+RN +D+  G CGI
Sbjct: 275 FTGKCGTDTDHVVTIVGYGVSLDYTEYWLVKNSWGTGWGEKGYMKLKRN-VDSKKGLCGI 333

Query: 350 AMEASYPV 357
           A   SYPV
Sbjct: 334 ATNPSYPV 341


>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
 gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
          Length = 314

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 160/312 (51%), Positives = 208/312 (66%), Gaps = 9/312 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL--NKFADLTNEEYR 109
           ++ W+AKHG+         +R ++F+DN+ FI+  N+    +K  L  N+FADLTN E+R
Sbjct: 5   HERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTNAEFR 64

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
           A   G R  + R        S RYA  +  +LP SVDWR KGAVNPVKDQG CG CWAFS
Sbjct: 65  ATRTGLRPSSSRG--NRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWAFS 122

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
            VAA+EG  K+ TG+L+SLSEQ+LV CD K  + GC GGLMD AF FII+NGG+ +E DY
Sbjct: 123 AVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAESDY 182

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY  +++KC  +   A   +I GYEDV   DE +L KAVA+QPVSVAI+ G R FQ Y+ 
Sbjct: 183 PYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFYKG 242

Query: 289 GVFTGE--CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           GV +G   C + LDH + AVGYG   +G  YWL++NSWG+ WGE+GYV+++R + D   G
Sbjct: 243 GVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKE-G 301

Query: 346 KCGIAMEASYPV 357
            CG+AM ASYP 
Sbjct: 302 VCGLAMMASYPT 313


>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
 gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
          Length = 378

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 177/360 (49%), Positives = 219/360 (60%), Gaps = 32/360 (8%)

Query: 25  ADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG-MGHNEKRFQIFKDNLRFI 83
            D SI+ Y +  D SS     + +  +++ WL++H K +   +    +RF++FKDNL  I
Sbjct: 26  GDFSIVGY-SEEDLSS----HESLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHI 80

Query: 84  DEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR------------------LMK 125
           DE N    +Y +GLN+FADLT++E++A YLG                             
Sbjct: 81  DETNRKVSSYWLGLNEFADLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSS 140

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
           S     RY       LP+SVDWR KGAV  VK+QG CGSCWAFSTVAAVEGIN+IVTG L
Sbjct: 141 SSSFRFRYEGVDAARLPKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNL 200

Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
            +LSEQELVDCD   N GCNGGLMDYAF +I  NGG+ +E+ YPYL  E  C     +A 
Sbjct: 201 TALSEQELVDCDTDGNNGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCSRG-SSAA 259

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
           VV+I GYEDV   +E +L KA+A QPVSVAIEA GR  Q Y  GVF G CG+ LDHGV A
Sbjct: 260 VVTISGYEDVPRNNEQALLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVAA 319

Query: 306 VGYGT---ENG---VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           VGYGT   +NG    DY +V+NSWG  WGE GY++++R       G CGI    SYP KN
Sbjct: 320 VGYGTAGKDNGHVVADYIIVKNSWGPSWGEKGYIRMRRG-TGKRQGLCGINKMPSYPTKN 378


>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
          Length = 314

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 160/312 (51%), Positives = 208/312 (66%), Gaps = 9/312 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL--NKFADLTNEEYR 109
           ++ W+AKHG+         +R ++F+DN+ FI+  N+    +K  L  N+FADLTN E+R
Sbjct: 5   HERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTNAEFR 64

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
           A   G R  + R        S RYA  +  +LP SVDWR KGAVNPVKDQG CG CWAFS
Sbjct: 65  ATRTGLRPSSSRG--NRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWAFS 122

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
            VAA+EG  K+ TG+L+SLSEQ+LV CD K  + GC GGLMD AF FII+NGG+ +E DY
Sbjct: 123 AVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAESDY 182

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY  +++KC  +   A   +I GYEDV   DE +L KAVA+QPVSVAI+ G R FQ Y+ 
Sbjct: 183 PYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFYKG 242

Query: 289 GVFTGE--CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           GV +G   C + LDH + AVGYG   +G  YWL++NSWG+ WGE+GYV+++R + D   G
Sbjct: 243 GVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKE-G 301

Query: 346 KCGIAMEASYPV 357
            CG+AM ASYP 
Sbjct: 302 VCGLAMMASYPT 313


>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
 gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
          Length = 371

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 162/331 (48%), Positives = 209/331 (63%), Gaps = 13/331 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           +++ +  +Y+ W + H +         +RF  FK N  FI  HN   +  Y++ LN+F D
Sbjct: 38  SEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGD 96

Query: 103 LTNEEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           +   E+RA ++G  R D   +     V    YA     +LP SVDWR+KGAV  VKDQG 
Sbjct: 97  MDQAEFRATFVGDLRRDTPAK--PPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
           CGSCWAFSTV +VEGIN I TG L+SLSEQEL+DCD   N GC GGLMD AF++I  NGG
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214

Query: 222 MDSEQDYPYLGAENKCDPSR---RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           + +E  YPY  A   C+ +R    +  VV IDG++DV    E  L +AVA+QPVSVA+EA
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQR 337
            G+AF  Y  GVFTG+CG+ LDHGV  VGYG  E+G  YW V+NSWG  WGE GY+++++
Sbjct: 275 SGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEK 334

Query: 338 NLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
           +    + G CGIAMEASYPVK      KP P
Sbjct: 335 D-SGASGGLCGIAMEASYPVKTYN---KPMP 361


>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 345

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 165/316 (52%), Positives = 222/316 (70%), Gaps = 5/316 (1%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           T++ +  +Y+ W  KH   S  +    KRF +FK+N+  +   N +++ YK+ LNKFAD+
Sbjct: 33  TEESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQMDKPYKLKLNKFADM 91

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           +N E+   Y  +     R+L + +  +  +  +   +LP SVD RE+GAVN VK+QG CG
Sbjct: 92  SNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDGRERGAVNAVKEQGRCG 151

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
           SCWAFS+VAAVEGINKI T +L+SLSEQEL+DC+ + N GCNGG M+ AF FI +NGG+ 
Sbjct: 152 SCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCNGGFMEIAFDFIKRNGGIA 210

Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
           +E  YPY G+   C  SR ++ +V IDGYE V P +E +L +AVA+QPVSVAI+A GR F
Sbjct: 211 TENSYPYHGSRGLCRSSRISSPIVKIDGYESV-PENEDALMQAVANQPVSVAIDAAGRDF 269

Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           Q Y  GVF G CG+ L+HGVVA+GYG TE+G DYWLVRNSWG  WGE+GYV+++R  ++ 
Sbjct: 270 QFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKRG-VEQ 328

Query: 343 NTGKCGIAMEASYPVK 358
             G CGIAMEASYP+K
Sbjct: 329 AEGLCGIAMEASYPIK 344


>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
          Length = 344

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 151/306 (49%), Positives = 204/306 (66%), Gaps = 6/306 (1%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMY 112
           W+ +HG+          R+ +FK N+  I+  N +    T+K+ +N+FADLTNEE+R+MY
Sbjct: 41  WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 100

Query: 113 LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
            G + ++     ++K  S RY   + D LP SVDWR+KGAV P+KDQG CGSCWAFS VA
Sbjct: 101 TGFKGNSVLS-SRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVA 159

Query: 173 AVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
           A+EG+ +I  G+LISLSEQELVDCD   + GC GGLMD AF + I  GG+ SE +YPY  
Sbjct: 160 AIEGVAQIKKGKLISLSEQELVDCDTN-DGGCMGGLMDTAFNYTITIGGLTSESNYPYKS 218

Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
               C+ ++      SI G+EDV   DE +L KAVA  PVS+ I  G   FQ Y SGVF+
Sbjct: 219 TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFS 278

Query: 293 GECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
           GEC + LDHGV AVGYG ++NG+ YW+++NSWG  WGE GY++++++ +    G+CG+AM
Sbjct: 279 GECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKD-IKPKHGQCGLAM 337

Query: 352 EASYPV 357
            ASYP 
Sbjct: 338 NASYPT 343


>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
           distachyon]
          Length = 377

 Score =  311 bits (796), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 160/337 (47%), Positives = 216/337 (64%), Gaps = 18/337 (5%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRT--------YK 94
           +++ +  +Y  W + H           +RF  FK N+ FI  HN+ LN T        Y+
Sbjct: 34  SEEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYR 93

Query: 95  VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
           + LN+F D+   E+R+ + G       R  +   +   +      ++P++VDWR+KGAV 
Sbjct: 94  LRLNRFGDMDQAEFRSTFAGPL----HRHTRPAQSIPGFIYDTVKDIPQAVDWRQKGAVT 149

Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAF 213
            VKDQG CGSCWAFS VA+VEG+N I TG L+SLSEQEL+DCD    + GC GGLM+ AF
Sbjct: 150 GVKDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAF 209

Query: 214 QFIIQN-GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPV 272
           +FI  + GG+ +E  YPY  +   C+ +R ++  V IDG++ V   +E +L KAVA QPV
Sbjct: 210 EFIAHSAGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQPV 269

Query: 273 SVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT--ENGVDYWLVRNSWGSDWGEN 330
           SVAI+AGG+AFQ Y  GVFTG+CGS LDHGV  VGYG   E+G +YW+V+NSWG  WGE+
Sbjct: 270 SVAIDAGGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEEDGKEYWIVKNSWGPGWGEH 329

Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPK 367
           GYV++QR+    + G CGIAMEASYPVKN Q   KP+
Sbjct: 330 GYVRMQRD-SGVDGGLCGIAMEASYPVKNEQTKKKPR 365


>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
 gi|194706024|gb|ACF87096.1| unknown [Zea mays]
 gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
          Length = 460

 Score =  310 bits (795), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 155/322 (48%), Positives = 204/322 (63%), Gaps = 17/322 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--------------TYKVGL 97
           +  W A+HGK          R  +F DN  F+  HN+                 +Y + L
Sbjct: 36  FDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTLAL 95

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           N FADLT+EE+RA  LG    A    ++S+ A   +    G  +P+++DWR+ GAV  VK
Sbjct: 96  NAFADLTHEEFRAARLGRI--APGAALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTKVK 153

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQGSCG+CW+FS   A+EGINKI TG L+SLSEQEL+DCDR  N+GC GGLMDYA++F+I
Sbjct: 154 DQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVI 213

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           +NGG+D+E+DYPY  A+  C+ ++   +VV+IDGY DV    E  L +AVA QPVSV I 
Sbjct: 214 KNGGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVGIC 273

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
              RAFQ Y  G+F G C ++LDH V+ VGYG+E G DYW+V+NSWG  WG  GY+ + R
Sbjct: 274 GSARAFQLYYQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHR 333

Query: 338 NLLDTNTGKCGIAMEASYPVKN 359
           N  D+  G CGI M AS+P K 
Sbjct: 334 NTGDSK-GVCGINMMASFPTKT 354


>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
          Length = 279

 Score =  310 bits (794), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 158/274 (57%), Positives = 191/274 (69%), Gaps = 13/274 (4%)

Query: 103 LTNEEYRAMYLGTRSDAKRRLMK-----SKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           +T +E+R  Y G+R  A  R+ +     S  ++  +      ++P SVDWR+KGAV  VK
Sbjct: 1   MTADEFRRHYAGSRV-AHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVK 59

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQG CGSCWAFST+AAVEGIN I T  L SLSEQ+LVDCD K NAGCNGGLMDYAFQ+I 
Sbjct: 60  DQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIA 119

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           ++GG+ +E  YPY   +  C  S   A VV+IDGYEDV   DE +LKKAVA QPVSVAIE
Sbjct: 120 KHGGVAAEDAYPYRARQASCKKS--PAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIE 177

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQ 336
           A G  FQ Y  GVF+G CG+ LDHGV AVGYG T +G  YWLV+NSWG +WGE GY+++ 
Sbjct: 178 ASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMA 237

Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
           R++     G CGIAMEASYPVK S N   PK H+
Sbjct: 238 RDVA-AKEGHCGIAMEASYPVKTSPN---PKVHA 267


>gi|255626679|gb|ACU13684.1| unknown [Glycine max]
          Length = 229

 Score =  310 bits (794), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 148/234 (63%), Positives = 174/234 (74%), Gaps = 12/234 (5%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           L  STL+FL F  S +    +I +Y           TD+EVMT+Y+ WL KH K  NG+ 
Sbjct: 7   LVTSTLLFLSFTLSCAIDTSTITNY-----------TDNEVMTMYEEWLVKHQKVYNGLR 55

Query: 68  HNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
             +KRFQ+FKDNL FI EHN+  N TYK+GLN+FAD+TNEEYR MY GT+SDAKRRLMK+
Sbjct: 56  EKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKT 115

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           K    RYA  AGD LP  VDWR KGAV P+KDQGSCGSCWAFSTVA VE  NKIVTG+ +
Sbjct: 116 KSTGHRYAYSAGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEATNKIVTGKFV 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           SLSEQELVDCDR  N  CNGGLMDYAF+FIIQNGG+D+++DYPY G +  CDP+
Sbjct: 176 SLSEQELVDCDRAYNERCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPT 229


>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 346

 Score =  310 bits (794), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 149/316 (47%), Positives = 208/316 (65%), Gaps = 5/316 (1%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
           D+ +   ++ W+A+ G+          R ++FK N+ FI+  N+ N  + +G N+FADLT
Sbjct: 34  DNAMAARHEQWMAQFGRVYKDPAEKAHRLEVFKANVAFIESFNAENHEFWLGANQFADLT 93

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
           N+E+RA    T    K+  ++      +Y+  + D LP SVDWR KGAV P+K+QG CGS
Sbjct: 94  NDEFRASK--TNKGIKQGGVRDAPTGFKYSDVSIDALPASVDWRTKGAVTPIKNQGQCGS 151

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMD 223
           CWAFS VAA EG+ K+ TG+L+SLSEQELVDCD   ++ GC GG MD AF+FII+NGG+ 
Sbjct: 152 CWAFSAVAATEGVVKLSTGKLVSLSEQELVDCDVHGVDQGCMGGWMDDAFKFIIKNGGLT 211

Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
           +E +YPY G ++KC  +       +I GYEDV   DE +L KAVA QPVSV ++ G   F
Sbjct: 212 TEANYPYTGEDDKCKSNETVNVAATIKGYEDVPANDESALMKAVAHQPVSVVVDGGDMTF 271

Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           Q Y  GV TG CG  +DHG+ A+GYG T NG  YWL++NSWG+ WGE G++++ +++ D 
Sbjct: 272 QLYAGGVMTGSCGVEMDHGIAAIGYGATSNGTKYWLMKNSWGTTWGEKGFLRMAKDIPDK 331

Query: 343 NTGKCGIAMEASYPVK 358
             G CG+AM+ SYP +
Sbjct: 332 R-GMCGLAMKPSYPTE 346


>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  310 bits (793), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 149/318 (46%), Positives = 212/318 (66%), Gaps = 14/318 (4%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
           D  ++  +++W++++G++       +++F++FK N  FID  N+ N  + +G+N+FAD+T
Sbjct: 30  DLSMVARHESWMSQYGRSYKDAAEKDRKFEVFKANAAFIDSFNAKNHKFWLGINQFADIT 89

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQ---RYACKAGDELPESVDWREKGAVNPVKDQGS 161
           NEE++       +   +  + +KV +     Y   + D LP ++DWR KGAV PVKDQG 
Sbjct: 90  NEEFKV------TKTNKGFISNKVRASTGFSYENVSIDALPATIDWRTKGAVTPVKDQGQ 143

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
           CG CWAFS VAA EGI K+ TG+L+SLSEQELVDCD    + GC GGLMD AF+FII NG
Sbjct: 144 CGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIITNG 203

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+  E  YPY   + KC    ++A   +I  YEDV   +E +L KAVA+QPVSVA++ G 
Sbjct: 204 GLTQESSYPYDAEDGKCKSGSKSAG--TIKSYEDVPANNEGALMKAVANQPVSVAVDGGD 261

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y  GV TG CG+ LDHG+ A+GYG T +G  YWL++NSWG+ WGENG++++++++
Sbjct: 262 MTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYWLMKNSWGTSWGENGFLRMEKDI 321

Query: 340 LDTNTGKCGIAMEASYPV 357
            D   G CG+AME SYP 
Sbjct: 322 AD-KKGMCGLAMEPSYPT 338


>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
          Length = 315

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 150/293 (51%), Positives = 203/293 (69%), Gaps = 7/293 (2%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           D SI+ Y           + D+++ +++ W++   K    +     RF++FKDNL+ IDE
Sbjct: 30  DYSIVGYS-----PEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDE 84

Query: 86  HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
            N   ++Y +GLN+FADL++EE++ MYLG ++D  RR  +   A   +A +  + +P+SV
Sbjct: 85  TNKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYA--EFAYRDVEAVPKSV 142

Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
           DWR+KGAV  VK+QGSCGSCWAFSTVAAVEGINKIVTG L +LSEQEL+DCD   N GCN
Sbjct: 143 DWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCN 202

Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
           GGLMDYAF++I++NGG+  E+DYPY   E  C+  +  ++ V+I+G++DV   DE SL K
Sbjct: 203 GGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLK 262

Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
           A+A QP+SVAI+A GR FQ Y  GVF G CG  LDHGV AVGYG+  G DY +
Sbjct: 263 ALAHQPLSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYII 315


>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
 gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
          Length = 343

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 153/320 (47%), Positives = 207/320 (64%), Gaps = 8/320 (2%)

Query: 43  RTDDEVMTI--YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLN 98
           R  DEV     +  W+ +HG+          R+ +FK N+  I+  N +    T+K+ +N
Sbjct: 26  RPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVN 85

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           +FADLTNEE+R+MY G + ++     ++K  S RY   + D LP SVDWR+KGAV P+KD
Sbjct: 86  QFADLTNEEFRSMYTGYKGNSVLS-SRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKD 144

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QGSCGSCWAFS VAA+EG+ +I  G+LISLSEQELVDCD   + GC GG M+ AF + + 
Sbjct: 145 QGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN-DDGCMGGYMNSAFNYTMT 203

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
            GG+ SE +YPY   +  C+ ++      SI G+EDV   DE +L KAVA  PVS+ I  
Sbjct: 204 TGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAG 263

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQR 337
           GG  FQ Y SGVF+GEC + LDHGV  VGYG + NG  YW+++NSWG  WGE GY+++++
Sbjct: 264 GGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKK 323

Query: 338 NLLDTNTGKCGIAMEASYPV 357
           +      G+CG+AM ASYP 
Sbjct: 324 D-TKAKHGQCGLAMNASYPT 342


>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
 gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
          Length = 463

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 153/317 (48%), Positives = 203/317 (64%), Gaps = 13/317 (4%)

Query: 50  TIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS---------LNRTYKVGLNKF 100
            ++  W A+HGK          R  +F DN  F+  HN+            +Y + LN F
Sbjct: 39  ALFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAF 98

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG-DELPESVDWREKGAVNPVKDQ 159
           ADLT+EE+RA  LG R  A    ++S  A        G   +P+++DWRE GAV  VKDQ
Sbjct: 99  ADLTHEEFRAARLG-RIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQ 157

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
           GSCG+CW+FS   A+EGINKI TG L+SLSEQEL+DCDR  N+GC GGLMDYA++F+++N
Sbjct: 158 GSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKN 217

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
           GG+D+E+DYPY  A+  C+ ++   ++V+IDGY DV    E  L +AVA QPVSV I   
Sbjct: 218 GGIDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGS 277

Query: 280 GRAFQHY-ESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
            RAFQ Y + G+F G C ++LDH V+ VGYG+E G DYW+V+NSWG  WG  GY+ + RN
Sbjct: 278 ARAFQLYSQQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRN 337

Query: 339 LLDTNTGKCGIAMEASY 355
             D+  G CGI M AS+
Sbjct: 338 TGDSK-GVCGINMMASF 353


>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 147/315 (46%), Positives = 200/315 (63%), Gaps = 9/315 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-------NRTYKVGLNKFADLT 104
           ++ W A+HGK     G    R   F +N  F+  HN           +Y + LN FADLT
Sbjct: 39  FEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLT 98

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
           ++E+RA  LG  +     L     +   +  + G  +P+++DWR+ GAV  VKDQGSCG+
Sbjct: 99  HDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVG-AVPDALDWRQSGAVTKVKDQGSCGA 157

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
           CW+FS   A+EGINKI TG L+SLSEQEL+DCDR  N GC GGLM YA++F+I+NGG+D+
Sbjct: 158 CWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDT 217

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           E DYP+  A+  C+ ++    VV+IDGY++V    E  L +AVA QP+SV I    RAFQ
Sbjct: 218 EDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQ 277

Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            Y  G+F G C ++LDH V+ VGYG+E G DYW+V+NSWG  WG  GY+ + RN   +++
Sbjct: 278 LYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRN-TGSSS 336

Query: 345 GKCGIAMEASYPVKN 359
           G CGI M AS+P K 
Sbjct: 337 GICGINMMASFPTKT 351


>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1-like [Glycine max]
          Length = 343

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 152/309 (49%), Positives = 209/309 (67%), Gaps = 7/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+ K+GK        +KRF IF++N+ FI+  N+  N+ YK+ +N  AD TNEE+ A
Sbjct: 38  HEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNKPYKLSINHLADQTNEEFMA 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
            + G +    + L  +     +Y  +   ++P +VDWR+KG V  +KDQ  CG+CWAFS 
Sbjct: 98  SHKGYKGSHWQGLRITTQTPFKY--ENVTDIPWAVDWRQKGDVTSIKDQAQCGNCWAFSA 155

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           VAA EGI +I TG L+SLSE+ELVDCD  ++ GC+GGLM++ F+FII+NGG+ SE +YPY
Sbjct: 156 VAATEGIYQITTGNLVSLSEKELVDCD-SVDHGCDGGLMEHGFEFIIKNGGISSEANYPY 214

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESG 289
                 CD ++  + V  I GYE V    E  L+KAVA+Q  +SV+I+AGG AFQ Y SG
Sbjct: 215 TAVNGTCDTNKEASPVAQITGYETVPVNCEEELQKAVANQLTMSVSIDAGGSAFQFYPSG 274

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG+CG+ LDHGV AVGYG T+ G  YW+V+NSWG+ WGE GY+++ R  +D   G CG
Sbjct: 275 VFTGQCGTQLDHGVTAVGYGSTDYGTQYWIVKNSWGTQWGEEGYIRMLRG-IDAQEGLCG 333

Query: 349 IAMEASYPV 357
           IAM+ASYP 
Sbjct: 334 IAMDASYPT 342


>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
          Length = 361

 Score =  306 bits (785), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 166/361 (45%), Positives = 216/361 (59%), Gaps = 17/361 (4%)

Query: 8   LAISTLVFLFFISSSSA---ADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSN 64
           LA++  V     ++ SA    D S++ Y        S        +++++W  KHGK   
Sbjct: 5   LAVAVFVLFLAFAACSANHHRDPSVVGYSQEDLALPS--------SLFRSWSVKHGKLYA 56

Query: 65  GMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR-- 122
                 +R++IFK NL  I E N  N +Y +GLN+FAD+ +EE++A YLG +    R   
Sbjct: 57  SPTEKLERYEIFKQNLMHIAETNRKNGSYWLGLNQFADVAHEEFKASYLGLKRALPRAGA 116

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
                  + RYA  A   LP SVDWR KGAV PVK+QG CGSCWAFS+VAAVEGIN+IVT
Sbjct: 117 PQTRTPTAFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVT 176

Query: 183 GELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC---DP 239
           G+L+SLSEQELVDCD  ++ GC GG MD AF +++ + G+ +E DYPYL  E  C    P
Sbjct: 177 GKLVSLSEQELVDCDTTLDHGCEGGTMDLAFAYMMGSQGIHAEDDYPYLMEEGYCKEKQP 236

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
                    + G+EDV    E+SL KA+A QPVSV I AG R FQ Y  GVF G C   L
Sbjct: 237 CVLGITEQDLTGFEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYRGGVFDGACSVEL 296

Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           DH + AVGYG+  G +Y  ++NSWG +WGE GYV+++        G CGI   ASYPVKN
Sbjct: 297 DHALTAVGYGSSYGQNYITMKNSWGKNWGEQGYVRIKMG-TGKPEGVCGIYTMASYPVKN 355

Query: 360 S 360
           +
Sbjct: 356 A 356


>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP2-like [Glycine max]
          Length = 342

 Score =  306 bits (785), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 169/365 (46%), Positives = 232/365 (63%), Gaps = 29/365 (7%)

Query: 1   MATASMFLAISTLVFL--FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLA 57
           M T    +AI  L+ L   +I++S+            H+ +SS   D EVM + Y++WL 
Sbjct: 1   MKTTITLVAIINLLVLCNLWITASACPA--------KHNDNSS---DSEVMRMRYESWLK 49

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYL--GT 115
           K+G+        E RF+I++ N++FI+ +NS N +YK+  NKF DLTNEE+R MYL    
Sbjct: 50  KYGQKYRNKDEWEFRFEIYRANVQFIEVYNSQNYSYKLMDNKFVDLTNEEFRRMYLVYQP 109

Query: 116 RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVE 175
           RS  + R M  K          GD LP+ +DWR +GAV  +KDQG CGSCW+FS VA VE
Sbjct: 110 RSHLQTRFMYQK---------HGD-LPKRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVE 159

Query: 176 GINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE 234
            INKI TG+L+SLSEQ+L+DCD R  N GCNGG M+  F FI + GG+ ++++YPY G++
Sbjct: 160 DINKIKTGKLVSLSEQQLIDCDNRNGNEGCNGGHME-TFTFITKRGGLTTDKNYPYQGSD 218

Query: 235 NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE 294
              + ++     V+I GYE++   +E  LK AVA QP SVA +AGG AFQ Y  G F+G 
Sbjct: 219 GDXNKAKVRNHAVAICGYENLPAHNENMLKAAVAHQPASVATDAGGYAFQLYSKGTFSGS 278

Query: 295 CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEAS 354
           CG  L+H +  VGYG ENG  YWLV+NSW +D G +GY++++R+  D + G CG AMEAS
Sbjct: 279 CGKDLNHRMTIVGYGEENGEKYWLVKNSWANDXGVSGYIRMKRDPKDKD-GTCGTAMEAS 337

Query: 355 YPVKN 359
           YP K+
Sbjct: 338 YPDKH 342


>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  306 bits (785), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 148/316 (46%), Positives = 213/316 (67%), Gaps = 10/316 (3%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
           D  ++  ++ W+ ++G+         ++F++FK N  FI+  N+ N  + +G+N+FAD+T
Sbjct: 30  DLSMVARHENWMLQYGRVYKDAAEKAQKFEVFKANAEFINSFNAGNHKFWLGINQFADIT 89

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           NEE++A    T+++      K +V +   Y   + D LP ++DWR KGAV P+KDQG CG
Sbjct: 90  NEEFKA----TKTNKGFISNKVRVPTGFMYENMSFDALPATIDWRTKGAVTPIKDQGQCG 145

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGM 222
            CWAFS VAA+EGI K+ TG+L+SLSEQELVDCD    + GC GGLMD AF+FII+NGG+
Sbjct: 146 CCWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGL 205

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
             E +YPY  A+ KC     +A   +I  YEDV   +E +L KAVA+QPVSVA++ G   
Sbjct: 206 TQESNYPYDAADGKCKSGSSSA--ATIKSYEDVPANNEGALMKAVANQPVSVAVDGGDMT 263

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y  GV TG CG+ LDHG+ A+GYG T +G  +W+++NSWG+ WGENG++++++++ D
Sbjct: 264 FQFYSGGVMTGSCGTDLDHGIAAIGYGTTSDGTKFWIMKNSWGTSWGENGFLRMEKDIAD 323

Query: 342 TNTGKCGIAMEASYPV 357
              G CG+AME SYP 
Sbjct: 324 -KKGMCGLAMEPSYPT 338


>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  306 bits (785), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 150/318 (47%), Positives = 209/318 (65%), Gaps = 14/318 (4%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
           D  ++  +++W+ ++G+          +F++FK N  FID  N+ N  + +G+N+FAD+T
Sbjct: 30  DLSMVARHESWMLQYGRVYKDAAEKASKFEVFKANAGFIDSFNAGNHKFWLGINQFADIT 89

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQR---YACKAGDELPESVDWREKGAVNPVKDQGS 161
           N+E++A      +   +  + +KV +     Y   + D LP S+DWR KGAV PVKDQG 
Sbjct: 90  NKEFKA------TKTNKGFISNKVRAPTGFSYENVSFDALPASIDWRTKGAVTPVKDQGQ 143

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
           CG CWAFS VAA EGI K+ TG+L+SLSEQELVDCD    + GC GGLMD AF+FII NG
Sbjct: 144 CGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIISNG 203

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+  E  YPY   + KC    ++A   +I  YEDV   +E +L KAVA+QPVSVA++ G 
Sbjct: 204 GLTQESSYPYDAEDGKCKSGSKSAG--TIKSYEDVPANNEGALMKAVANQPVSVAVDGGD 261

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y  GV TG CG+ LDHG+ A+GYG T +G  YWL++NSWG+ WGENG++++++++
Sbjct: 262 MTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYWLMKNSWGTSWGENGFLRMEKDI 321

Query: 340 LDTNTGKCGIAMEASYPV 357
            D   G CG+AME SYP 
Sbjct: 322 AD-KKGMCGLAMEPSYPT 338


>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
          Length = 284

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 151/284 (53%), Positives = 201/284 (70%), Gaps = 8/284 (2%)

Query: 77  KDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC 135
           K+N+ +I+  +N+ N+ YK+G+N+FADLT+EE+  +    R +   R   ++  + +Y  
Sbjct: 5   KENVNYIEAFNNAANKPYKLGINQFADLTSEEF--IVPRNRFNGHMRFSNTRTTTFKY-- 60

Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
           +    LP+S+DWR+KGAV P+K+QGSCG CWAFS +AA EGI+KI TG+L+SLSEQE+VD
Sbjct: 61  ENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVD 120

Query: 196 CDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYED 254
           CD K  + GC GG MD AF+FIIQN G+++E  YPY G + KC+         +I GYED
Sbjct: 121 CDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGYED 180

Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-G 313
           V   +E +L+KAVA+QPVSVAI+A G  FQ Y+SG+FTG CG+ LDHGV AVGYG  N G
Sbjct: 181 VPINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEG 240

Query: 314 VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             YWLV+NSWG++WGE GY  +QR +     G CGIAM ASYP 
Sbjct: 241 TKYWLVKNSWGTEWGEEGYTMMQRGVKAVE-GICGIAMLASYPT 283


>gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332642714|gb|AEE76235.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 290

 Score =  305 bits (781), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 155/256 (60%), Positives = 195/256 (76%), Gaps = 14/256 (5%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFA 101
           R + EV  +Y+ WL ++ K  NG+G  E+RF+IFKDNL+F+DEHNS+ +RT++VGL +FA
Sbjct: 35  RNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFA 94

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           DLTNEE+RA+YL  R   +R   K  V ++RY  K GD LP+ VDWR  GAV  VKDQG+
Sbjct: 95  DLTNEEFRAIYL--RKKMER--TKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGN 150

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNG 220
           CGSCWAFS V AVEGIN+I TGELISLSEQELVDCDR  +NAGC+GG+M+YAF+FI++NG
Sbjct: 151 CGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNG 210

Query: 221 GMDSEQDYPY----LGAENKCDPSR-RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
           G++++QDYPY    LG    C+  +  N +VV+IDGYEDV   DE SLKKAVA QPVSVA
Sbjct: 211 GIETDQDYPYNANDLGL---CNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVA 267

Query: 276 IEAGGRAFQHYESGVF 291
           IEA  +AFQ Y+S  F
Sbjct: 268 IEASSQAFQLYKSVNF 283


>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
 gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
          Length = 298

 Score =  305 bits (780), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 151/308 (49%), Positives = 207/308 (67%), Gaps = 17/308 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++G+        E R+ IFK+N+  ID  NS   ++Y +G+N+FADL+NEE++A
Sbjct: 5   HEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNEEFKA 64

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
               +R+  K  +   +    RY   +   +P ++DWR+KGAV PVKDQG C        
Sbjct: 65  ----SRNRFKGHMCSPQAGPFRYENVSA--VPATMDWRKKGAVTPVKDQGQC-------- 110

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA+EGIN++ TG+LISLSEQE+VDCD K  + GCNGGLMD AF+FI QN G+ +E +YP
Sbjct: 111 VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYP 170

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+  +  +    I G++DV    E +L KAVA QPVSVAI+AGG  FQ Y SG
Sbjct: 171 YTGTDGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSG 230

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           +FTG CG+ LDHGV AVGYG  +G  YWLV+NSWG+ WGE GY+++Q++ +    G CGI
Sbjct: 231 IFTGSCGTELDHGVTAVGYGGSDGTKYWLVKNSWGAQWGEEGYIRMQKD-ISAKEGLCGI 289

Query: 350 AMEASYPV 357
           AM+ASYP 
Sbjct: 290 AMQASYPT 297


>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
          Length = 348

 Score =  304 bits (778), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 158/358 (44%), Positives = 229/358 (63%), Gaps = 19/358 (5%)

Query: 5   SMFLAISTLVF----LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           +   +IS L+F    L      S+AD SI+ Y  + D +S+ R    ++ ++++W+ KH 
Sbjct: 2   TTICSISKLIFVATCLIVHVGLSSADFSIVGYSQD-DLTSTER----LIRLFESWMLKHD 56

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           +  N +     RF+IFKDNL +IDE N  N +Y +GLN+F DLT++E++  Y+G+  +  
Sbjct: 57  RVYNNIEEKIHRFEIFKDNLMYIDETNKKNNSYWLGLNEFVDLTHDEFKEKYVGSIGEDF 116

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
             + +S    + +  K   + PES+DWR+KGAV PVK    CGSCWAFSTVA VEGINKI
Sbjct: 117 VTIEQSN--DEEFPYKHVVDYPESIDWRDKGAVTPVKPN-PCGSCWAFSTVATVEGINKI 173

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           VTG+LISLSEQEL+DCDR+ + GC GG    + Q+++ NG + +E++YPY   + KC   
Sbjct: 174 VTGKLISLSEQELLDCDRR-SHGCKGGYQTTSLQYVVDNG-VHTEKEYPYEKKQGKCRAK 231

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
            +    V I GY+ V   DE+SL +A+A+QPVSV +E+ GRAFQ Y+ G+F G CG+ LD
Sbjct: 232 EKKGTKVQITGYKRVPANDEISLIQAIANQPVSVLLESKGRAFQLYKGGIFNGPCGTKLD 291

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           H V A+GYG      Y L++NSWG +WGE GY+K++R     + G CG+   + +P K
Sbjct: 292 HAVTAIGYGK----TYILIKNSWGPNWGEKGYLKIKR-ASGKSEGTCGVYKSSYFPTK 344


>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  303 bits (777), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 156/360 (43%), Positives = 225/360 (62%), Gaps = 17/360 (4%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
            A  S +L ++    LFFI     +    +S   N++ +   R D         W+  H 
Sbjct: 3   FANLSQYLCLA----LFFICLGLWSSQVALSRPINYEATMRARHDQ--------WIVHHE 50

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K    +   E RFQIFK+N+  I+  N+  ++ YK+G NKF+DLTNEE+R ++ G +   
Sbjct: 51  KVYKDLNEKEVRFQIFKENVERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSH 110

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
            + +  SK  +        D +P ++DWR+KGAV P+KDQ  CG CWAFS VAA+EG+++
Sbjct: 111 PKVMTSSKGKTHFRYTNVTD-IPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQ 169

Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           + TGELI LSEQELVDCD +  + GC+GGL+D AF FI++N G+ +E +YPY G +  C+
Sbjct: 170 LKTGELIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCN 229

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             +       I GYEDV    E +L +AVA+QPVSVAI+     FQ Y SGVF+G C + 
Sbjct: 230 KKKSALSAAKITGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTW 289

Query: 299 LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           L+H V AVGYG T +G  YW+++NSWGS WG++GY++++R++ +   G CG+AM+ASYP 
Sbjct: 290 LNHAVTAVGYGATTDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKE-GLCGLAMDASYPT 348


>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 419

 Score =  303 bits (777), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 143/300 (47%), Positives = 205/300 (68%), Gaps = 7/300 (2%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
           D  ++  ++ W+AK  +         +RF+ FK N+ FI+  N+ N  + +G+N+F DLT
Sbjct: 30  DAAMVEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAFIESFNTGNHKFWLGVNQFTDLT 89

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           N+E+RA    T+++   +   ++  ++ +Y   + D LP +VDWR KG V P+KDQG CG
Sbjct: 90  NDEFRA----TKTNKGLKRNGARAPTRFKYNNVSTDALPAAVDWRTKGVVTPIKDQGQCG 145

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGM 222
            CWAFS VAA EGI K+ TG+L+SLSEQELVDCD   ++ GC GG MD AF+FII+NGG+
Sbjct: 146 CCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGVDQGCEGGEMDNAFKFIIKNGGL 205

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E +YPY   + +C  S  +  V +I GYEDV   DE SL KAVA+QPVSVA++ G   
Sbjct: 206 TTEANYPYTAQDGQCKTSTTSNSVATIKGYEDVPANDESSLMKAVANQPVSVAVDGGDVI 265

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQHY  GV TG CG+ LDHG+VA+GYG T +G  +WL++NSWG+ WGE+GY+++++++ D
Sbjct: 266 FQHYSGGVMTGSCGTDLDHGIVAIGYGMTSDGTKFWLLKNSWGTTWGESGYLRMEKDISD 325


>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 427

 Score =  303 bits (776), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 154/310 (49%), Positives = 203/310 (65%), Gaps = 6/310 (1%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++ W+ KHG+     G  ++RF+++K+NL  I+E NS    Y +  NKFADLTNEE+RA 
Sbjct: 119 FEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNSGGHGYTLTDNKFADLTNEEFRAK 178

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGD---ELPESVDWREKGAVNPVKDQGSCGSCWAF 168
            LG       R  +++ AS        D   +LP+ VDWR+KGAV  VK+QGSCGSCWAF
Sbjct: 179 MLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSCGSCWAF 238

Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           S VAA+EG+N+I  G+L+SLSEQELVDCD +   GC GG M +AF+F++ N G+ +E  Y
Sbjct: 239 SAVAAMEGLNQIKNGKLVSLSEQELVDCDAEA-VGCAGGFMSWAFEFVMANHGLTTEASY 297

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY G    C  ++ N   VSI GY +V+   E  L K  A QPVSVA++AGG  FQ Y  
Sbjct: 298 PYKGINGACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGFLFQLYAG 357

Query: 289 GVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           GVF+G C + ++HGV  VGYG T+    YW+V+NSWG +WGE GY+ +QR+     TG C
Sbjct: 358 GVFSGPCTAQINHGVTVVGYGETDKAEKYWIVKNSWGPEWGEAGYMLMQRD-AGVPTGLC 416

Query: 348 GIAMEASYPV 357
           GIAM ASYPV
Sbjct: 417 GIAMLASYPV 426


>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 349

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 150/326 (46%), Positives = 209/326 (64%), Gaps = 16/326 (4%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
           D ++  ++ W+ +HG+     G  ++RF++++ N+  ++  NS++  YK+  NKFADLTN
Sbjct: 25  DLMLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTN 84

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACK---AGDELPESVDWREKGAVNPVKDQGSC 162
           EE+RA  LG R       + S   S   A     + D LP+SVDWR+KGAV  VK+QG C
Sbjct: 85  EEFRAKMLGFRPHVTIPQI-SNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDC 143

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           GSCWAFS VAA+EGIN+I  GEL+SLSEQELVDCD +   GC GG M +AF+F++ N G+
Sbjct: 144 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGNHGL 202

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E  YPY  A   C  ++ N   V+I GY +V+P  E  L +A A QPVSVA++ G   
Sbjct: 203 TTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFM 262

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVD----------YWLVRNSWGSDWGENG 331
           FQ Y SGV+TG C + ++HGV  VGYG +E   D          YW+V+NSWG++WG+ G
Sbjct: 263 FQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAG 322

Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPV 357
           Y+ +QR++    +G CGIA+  SYPV
Sbjct: 323 YILMQRDVAGLASGLCGIALLPSYPV 348


>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 333

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 148/302 (49%), Positives = 201/302 (66%), Gaps = 6/302 (1%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMY 112
           W+ +HG+          R+ +FK N+  I+  N +    T+K+ +N+FADLTNEE+R+MY
Sbjct: 35  WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 94

Query: 113 LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
            G + ++     ++K  S RY   + D LP SVDWR+KGAV P+KDQG CGSCWAFS VA
Sbjct: 95  TGFKGNSVLS-SRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVA 153

Query: 173 AVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
           A+EG+ +I  G+LISLSEQELVDCD   + GC GGLMD AF + I  GG+ SE +YPY  
Sbjct: 154 AIEGVAQIKKGKLISLSEQELVDCDTN-DGGCMGGLMDTAFNYTITIGGLTSESNYPYKS 212

Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
               C+ ++      SI G+EDV   DE +L KAVA  PVS+ I  G   FQ Y SGVF+
Sbjct: 213 TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFS 272

Query: 293 GECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
           GEC + LDHGV AVGYG ++NG+ YW+++NSWG  WGE GY++++++ +    G+CG+AM
Sbjct: 273 GECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKD-IKPKHGQCGLAM 331

Query: 352 EA 353
            A
Sbjct: 332 NA 333


>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
          Length = 350

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 150/326 (46%), Positives = 209/326 (64%), Gaps = 16/326 (4%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
           D ++  ++ W+ +HG+     G  ++RF++++ N+  ++  NS++  YK+  NKFADLTN
Sbjct: 26  DLMLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTN 85

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACK---AGDELPESVDWREKGAVNPVKDQGSC 162
           EE+RA  LG R       + S   S   A     + D LP+SVDWR+KGAV  VK+QG C
Sbjct: 86  EEFRAKMLGFRPHVTIPQI-SNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDC 144

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           GSCWAFS VAA+EGIN+I  GEL+SLSEQELVDCD +   GC GG M +AF+F++ N G+
Sbjct: 145 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGNHGL 203

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E  YPY  A   C  ++ N   V+I GY +V+P  E  L +A A QPVSVA++ G   
Sbjct: 204 TTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFM 263

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVD----------YWLVRNSWGSDWGENG 331
           FQ Y SGV+TG C + ++HGV  VGYG +E   D          YW+V+NSWG++WG+ G
Sbjct: 264 FQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAG 323

Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPV 357
           Y+ +QR++    +G CGIA+  SYPV
Sbjct: 324 YILMQRDVAGLASGLCGIALLPSYPV 349


>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
 gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
          Length = 489

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 150/313 (47%), Positives = 214/313 (68%), Gaps = 5/313 (1%)

Query: 49  MTIYQTWLAKHGKT-SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEE 107
           M  +Q W+ ++ K  +N +   E RF ++ +NL +I  +N+   ++ + LN FADLT +E
Sbjct: 42  MAAFQQWMMQYTKAYANDIKELETRFSVWLENLNYILAYNARTTSHWLHLNAFADLTTDE 101

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           +R   LG    A++   + + +   Y     ++LP  +DWR+KGAV  VK+QG CGSCWA
Sbjct: 102 FRNR-LGYDFKARQASNRLQSSPFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWA 160

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
           F+T  +VEGIN IVTGEL SLSEQELVDCD   + GC+GGLMDYA+Q+II+NGG+D+E D
Sbjct: 161 FATTGSVEGINAIVTGELASLSEQELVDCDTDEDRGCSGGLMDYAYQWIIKNGGLDTEDD 220

Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
           YPY   +  C  +++N +VV+IDGY D+   DE++LKKA A QP++VAIEA  ++FQ Y 
Sbjct: 221 YPYTAEDGVCVAAKKNRRVVTIDGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYG 280

Query: 288 SGVFTGE-CGSALDHGVVAVGYGTENGV-DYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
            GV+    CG++L+HGV+ VGYG +    +YW+V+NSWG +WG+NGY++L+    D   G
Sbjct: 281 GGVYDDPTCGTSLNHGVLVVGYGKDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQ-G 339

Query: 346 KCGIAMEASYPVK 358
            CGIAM  S+P K
Sbjct: 340 MCGIAMAPSFPTK 352


>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 339

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 152/308 (49%), Positives = 198/308 (64%), Gaps = 11/308 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W  K+GK        +KR  IFKDN+ FI+  N+  N+ YK+ +N   D TNEE+ A
Sbjct: 40  HEQWTKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLSINHLTDQTNEEFVA 99

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
            + G +          K     Y    G  +P +VDWRE GAV  +KDQG CG+CWAFST
Sbjct: 100 SHNGYKHKGSHSQTPFK-----YENITG--VPNAVDWRENGAVXAMKDQGQCGNCWAFST 152

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           VA  EGI +I T  L+SLSEQELVDCD  ++ GC+GG M+  F+FI +NGG+ SE +YPY
Sbjct: 153 VATTEGIYQITTSMLMSLSEQELVDCD-SVDHGCDGGYMEGGFEFIXKNGGISSEANYPY 211

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
              +   D ++  +    I GYE V    E +L+KAVA+QPVSV I+ GG AFQ   SGV
Sbjct: 212 TAVDGTYDANKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDVGGSAFQFNSSGV 271

Query: 291 FTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           FTG+CG+ LDHGV AVGYG T++G  YW+V+NSWG+ WGE GY+++QR   D   G CGI
Sbjct: 272 FTGQCGTQLDHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRG-TDAQEGLCGI 330

Query: 350 AMEASYPV 357
           AM+ASYP 
Sbjct: 331 AMDASYPT 338


>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 345

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 141/308 (45%), Positives = 201/308 (65%), Gaps = 3/308 (0%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++GK        +KRFQIFK+N+ FI+  N+  ++ + + +N+FADL +EE++A
Sbjct: 38  HENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLSINQFADLHDEEFKA 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
           +        +  +  +      +      +L  ++DWR++GAV P+KDQ  CGSCWAFS 
Sbjct: 98  LLTNGNKKVRSVVGTATETETSFKYNRVTKLLATMDWRKRGAVTPIKDQRRCGSCWAFSA 157

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           VAA+EGI++I T +L+SLSEQELVDC +  + GCNGG M+ AF+F+ + GG+ SE  YPY
Sbjct: 158 VAAIEGIHQITTSKLVSLSEQELVDCVKGESEGCNGGYMEDAFEFVAKKGGIASESYYPY 217

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
            G +  C   +    V  I GYE V    E +L+KAVA QPVSV +EAGG AFQ Y SG+
Sbjct: 218 KGKDKSCKVKKETHGVSQIKGYEKVPSNSEKALQKAVAHQPVSVYVEAGGNAFQFYSSGI 277

Query: 291 FTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           FTG+CG+  DH +  VGYG +  G  YWLV+NSWG+ WGE GY++++R+ +    G CGI
Sbjct: 278 FTGKCGTNTDHAITVVGYGKSRGGTKYWLVKNSWGAGWGEKGYIRMKRD-IRAKEGLCGI 336

Query: 350 AMEASYPV 357
           AM A YP 
Sbjct: 337 AMNAFYPT 344


>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 145/311 (46%), Positives = 198/311 (63%), Gaps = 9/311 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-------NRTYKVGLNKFADLT 104
           ++ W A+HGK     G    R   F +N  F+  HN           +Y + LN FADLT
Sbjct: 39  FEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLT 98

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
           ++E+RA  LG  +     L     +   +  + G  +P+++DWR+ GAV  VKDQGSCG+
Sbjct: 99  HDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVG-AVPDALDWRQSGAVTKVKDQGSCGA 157

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
           CW+FS   A+EGINKI TG L+SLSEQEL+DCDR  N GC GGLM YA++F+I+NGG+D+
Sbjct: 158 CWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDT 217

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           E DYP+  A+  C+ ++    VV+IDGY++V    E  L +AVA QP+SV I    RAFQ
Sbjct: 218 EDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQ 277

Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            Y  G+F G C ++LDH V+ VGYG+E G DYW+V+NSWG  WG  GY+ + RN   +++
Sbjct: 278 LYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRN-TGSSS 336

Query: 345 GKCGIAMEASY 355
           G CGI M AS+
Sbjct: 337 GICGINMMASF 347


>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
          Length = 350

 Score =  301 bits (772), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 158/360 (43%), Positives = 219/360 (60%), Gaps = 14/360 (3%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M   S  L ++ L  +  + SSS   +   + +   D + + R        ++ W+A+HG
Sbjct: 1   MGAISKPLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAAR--------HERWMAQHG 52

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFADLTNEEYRAMYLGTRSDA 119
           +         +R ++FK N+ FI+  N+  +  Y +G+N+FADLT+EE++A    ++  +
Sbjct: 53  RVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFS 112

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
                       +Y   + D LP SVDWR KGAV  +KDQG CG CWAFS VAA+EGI K
Sbjct: 113 TPNNGVRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGIVK 172

Query: 180 IVTGELISLSEQELVDCDRKIN-AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           + TG+LISLSEQELVDCD   N  GC GG +D AFQFI+ NGG+ +E +YPY   + +C 
Sbjct: 173 LSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCK 232

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
            +       SI GYEDV   DE SL KAVA QPVSVA++A    FQ Y  GV  GECG++
Sbjct: 233 TTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDA--SKFQFYGGGVMAGECGTS 290

Query: 299 LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           LDHGV  +GYG   +G  YWLV+NSWG+ WGE GY++++++ +D   G CG+AM+ SYP 
Sbjct: 291 LDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEKD-IDDKRGMCGLAMQPSYPT 349


>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
          Length = 348

 Score =  301 bits (771), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 151/313 (48%), Positives = 211/313 (67%), Gaps = 11/313 (3%)

Query: 44  TDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
           +DD  M   ++ W+A++G+         +RF++FK N  FI+  N+ N  + +G+N+FAD
Sbjct: 28  SDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANAAFIESFNAGNHKFWLGVNQFAD 87

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
           LTN+E+R     T+++       ++V +  RY     D LP ++DWR KG V P+KDQG 
Sbjct: 88  LTNDEFRL----TKTNKGFIPSTTRVPTGFRYENVNIDALPATMDWRTKGVVTPIKDQGQ 143

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
           CG CWAFS VAA+EGI K+ TG+LISLSEQELVDCD    + GC GGLMD AF+FII+NG
Sbjct: 144 CGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 203

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+ +E +YPY  A++KC     +  V SI GYEDV   +E +L KAVA+QPVSVA++   
Sbjct: 204 GLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANNEAALMKAVANQPVSVAVDGDD 261

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y+ GV  G CG+ LDHG+VA+GYG   +G  YWL++NSWG  WGENG++++++++
Sbjct: 262 MTFQFYKGGVMIGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKDI 321

Query: 340 LDTNTGKCGIAME 352
            D   G CG+AME
Sbjct: 322 SDKR-GMCGLAME 333


>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
          Length = 332

 Score =  301 bits (770), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 150/316 (47%), Positives = 204/316 (64%), Gaps = 8/316 (2%)

Query: 43  RTDDEVMTI--YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLN 98
           R  DEV     +  W+ +HG+          R+ +FK N+  I+  N +    T+K+ +N
Sbjct: 20  RPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVN 79

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           +FADLTNEE+R+MY G + ++     ++K  S RY   + D LP SVDWR+KGAV P+KD
Sbjct: 80  QFADLTNEEFRSMYTGYKGNSVLS-SRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKD 138

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QGSCGSCWAFS VAA+EG+ +I  G+LISLSEQELVDCD   + GC GG M+ AF + + 
Sbjct: 139 QGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN-DDGCMGGYMNSAFNYTMT 197

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
            GG+ SE +YPY   +  C+ ++      SI G+EDV   DE +L KAVA  PVS+ I  
Sbjct: 198 TGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAG 257

Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQR 337
           GG  FQ Y SGVF+GEC + LDHGV  VGYG + NG  YW+++NSWG  WGE GY+++++
Sbjct: 258 GGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKK 317

Query: 338 NLLDTNTGKCGIAMEA 353
           +      G+CG+AM A
Sbjct: 318 D-TKAKHGQCGLAMNA 332


>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  300 bits (769), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 152/361 (42%), Positives = 226/361 (62%), Gaps = 19/361 (5%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
            A  S +L ++       +  S  A    I+Y+ +            +   +  W+A H 
Sbjct: 3   FANLSQYLCLALFFIFLGVWRSQVASSRPINYEAS------------MRARHDQWIAHHD 50

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K    +   E RF+IFK+N+  I+  N+  ++ YK+G+NKF+DLTNE++R ++ G +   
Sbjct: 51  KVYKDLNEKEMRFKIFKENVERIEAFNAGEDKGYKLGVNKFSDLTNEKFRVLHTGYKRSH 110

Query: 120 KRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
            + +  SK  +  RYA     ++P ++DWR+KGAV P+KDQ  CG CWAFS VAA EG++
Sbjct: 111 PKVMSSSKPKTHFRYANVT--DIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAATEGLH 168

Query: 179 KIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
           ++ TG+LI LSEQELVDCD +  + GC+GGL+D AF FI++N G+ +E +YPY G +  C
Sbjct: 169 QLKTGKLIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEANYPYKGEDGVC 228

Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
           +  +       I GYEDV    E +L +AVA+QPVSVAI+     FQ Y SGVF+G C +
Sbjct: 229 NKKKSALSAAKIAGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCST 288

Query: 298 ALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
            L+H V AVGYG T +G  YW+++NSWGS WG++GY++++R++ +   G CG+AM+ASYP
Sbjct: 289 WLNHAVTAVGYGATTDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKE-GLCGLAMDASYP 347

Query: 357 V 357
            
Sbjct: 348 T 348


>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  300 bits (769), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 151/315 (47%), Positives = 198/315 (62%), Gaps = 13/315 (4%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR------TYKVGLNKFADLT 104
           +++ W  +H KT +       R ++F+DN  F+ +HN          +Y + LN FADLT
Sbjct: 32  LFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADLT 91

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
           + E++   LG        L++ K   Q    +    +P  +DWR+ GAV PVKDQ SCG+
Sbjct: 92  HHEFKTTRLG----LPLTLLRFK-RPQNQQSRDLLHIPSQIDWRQSGAVTPVKDQASCGA 146

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
           CWAFS   A+EGINKIVTG L+SLSEQEL+DCD   N+GC GGLMD+A+QF+I N G+D+
Sbjct: 147 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKGIDT 206

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           E DYPY   +  C   +   + V+I+ Y DV P +E  L KAVA QPVSV I    R FQ
Sbjct: 207 EDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEIL-KAVASQPVSVGICGSEREFQ 265

Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            Y  G+FTG C + LDH V+ VGYG+ENGVDYW+V+NSWG  WG NGY+ + RN  ++  
Sbjct: 266 LYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSGNSK- 324

Query: 345 GKCGIAMEASYPVKN 359
           G CGI   ASYPVK 
Sbjct: 325 GICGINTLASYPVKT 339


>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
 gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
          Length = 469

 Score =  300 bits (769), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 148/313 (47%), Positives = 217/313 (69%), Gaps = 9/313 (2%)

Query: 52  YQTWLAKHGKT-SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
           ++ W   H ++  N +   E RF+++ +NL ++  +N+   ++ + LN  ADL+  EY++
Sbjct: 13  FKEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNARTTSHWLTLNHLADLSTPEYKS 72

Query: 111 MYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
             LG   D + R+ ++K+ +  RY     + LP ++DWR+K AV  VK+QG CGSCWAF+
Sbjct: 73  KLLGF--DNQARVARNKLKTGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSCWAFA 130

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           T  +VEGIN IVTG L+SLSEQELVDCD + + GC+GGLMDYA+ +II+N G+++E+DYP
Sbjct: 131 TTGSVEGINAIVTGSLVSLSEQELVDCDTEQDKGCSGGLMDYAYAWIIKNKGINTEEDYP 190

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   + +CD ++   +VV+ID YEDV   DE++LKKA A QPV+VAIEA  ++FQ Y  G
Sbjct: 191 YTAMDGQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSFQLYGGG 250

Query: 290 VFTGE-CGSALDHGVVAVGYG---TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           V+    CG++L+HGV+ VGYG   T +G +YW+V+NSWG++WG+ GY++L+    D   G
Sbjct: 251 VYDDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGSTDAE-G 309

Query: 346 KCGIAMEASYPVK 358
            CGIAM  SYPVK
Sbjct: 310 LCGIAMAPSYPVK 322


>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 339

 Score =  300 bits (768), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 146/307 (47%), Positives = 208/307 (67%), Gaps = 8/307 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++GK        EKRFQIFK+N++FI+  N+  ++ + + +N+FADL NEE++A
Sbjct: 37  HEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKA 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  +   +  +  +   S RY  ++  ++P ++DWR++GAV P+KDQG+CGSCWAFST
Sbjct: 97  SLINVQKK-ESGVETATETSFRY--ESITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFST 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           VAA+EGI++I TG+L+SLSEQELVDC +  + GCN G  + AF+F+ +NGG+ SE  YPY
Sbjct: 154 VAAIEGIHQITTGKLVSLSEQELVDCVKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPY 213

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
                 C   +    V  I GYE+V    E +L KAVA+QPVSV I+AG  A Q Y SG+
Sbjct: 214 KANNKTCMVKKETQGVAQIKGYENVPSNSEKALLKAVANQPVSVYIDAG--ALQFYSSGI 271

Query: 291 FTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           FTG+CG+A +H V  +GYG    G  YWLV+NSWG+ WGE GY+K++R+ +    G CGI
Sbjct: 272 FTGKCGTAPNHAVTVIGYGKARGGAKYWLVKNSWGTKWGEKGYIKMKRD-IRAKEGLCGI 330

Query: 350 AMEASYP 356
           A  ASYP
Sbjct: 331 ATNASYP 337


>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
          Length = 377

 Score =  300 bits (767), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 149/331 (45%), Positives = 201/331 (60%), Gaps = 21/331 (6%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
           D ++  ++ W+ +HG+     G  ++R ++++ N+  ++  NS+   Y++  NKFADLTN
Sbjct: 48  DPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGYRLADNKFADLTN 107

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYAC--------KAGDELPESVDWREKGAVNPVK 157
           EE+RA  LG              A    AC        +   +LP+SVDWREKGAV PVK
Sbjct: 108 EEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVK 167

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
            QG CGSCWAFS VAA+EGIN+I  G+L+SLSEQELVDCD K   GC GG M +AF+F++
Sbjct: 168 SQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IGCAGGYMSWAFEFVM 226

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           +N G+ +E++YPY G    C   +     VSI GY +V+P  E  L +A A QPVSVA++
Sbjct: 227 KNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVD 286

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-----------GVDYWLVRNSWGSD 326
           AG   +Q Y  GVFTG C + L+HGV  VGYG              G  YW+V+NSWG +
Sbjct: 287 AGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPE 346

Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           WG+ GY+ +QR      +G CGIAM  SYPV
Sbjct: 347 WGDAGYILMQRE-ASVASGLCGIAMLPSYPV 376


>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
 gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
          Length = 350

 Score =  300 bits (767), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 157/361 (43%), Positives = 219/361 (60%), Gaps = 14/361 (3%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           M   S  L ++ L  +  + SSS   +   + +   D + + R        ++ W+A+HG
Sbjct: 1   MGAISKPLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAAR--------HERWMAQHG 52

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFADLTNEEYRAMYLGTRSDA 119
           +         +R ++FK N+ FI+  N+  +  Y +G+N+FADLT+EE++A    ++  +
Sbjct: 53  RVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFS 112

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
                       +Y   + D LP SVDWR KGAV  +KDQG CG CWAFS VAA+EG  K
Sbjct: 113 TPNNGVRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGFVK 172

Query: 180 IVTGELISLSEQELVDCDRKIN-AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           + TG+LISLSEQELVDCD   N  GC GG +D AFQFI+ NGG+ +E +YPY   + +C 
Sbjct: 173 LSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCK 232

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
            +       SI GYEDV   DE SL KAVA QPVSVA++A    FQ Y  GV  GECG++
Sbjct: 233 TTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDA--SKFQFYGGGVMAGECGTS 290

Query: 299 LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           LDHGV  +GYG   +G  YWLV+NSWG+ WGE GY++++++ +D   G CG+AM+ SYP 
Sbjct: 291 LDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEKD-IDDKRGMCGLAMQPSYPT 349

Query: 358 K 358
           +
Sbjct: 350 E 350


>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
 gi|194703250|gb|ACF85709.1| unknown [Zea mays]
 gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
          Length = 356

 Score =  300 bits (767), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 149/331 (45%), Positives = 201/331 (60%), Gaps = 21/331 (6%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
           D ++  ++ W+ +HG+     G  ++R ++++ N+  ++  NS+   Y++  NKFADLTN
Sbjct: 27  DPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGYRLADNKFADLTN 86

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYAC--------KAGDELPESVDWREKGAVNPVK 157
           EE+RA  LG              A    AC        +   +LP+SVDWREKGAV PVK
Sbjct: 87  EEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVK 146

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
            QG CGSCWAFS VAA+EGIN+I  G+L+SLSEQELVDCD K   GC GG M +AF+F++
Sbjct: 147 SQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IGCAGGYMSWAFEFVM 205

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           +N G+ +E++YPY G    C   +     VSI GY +V+P  E  L +A A QPVSVA++
Sbjct: 206 KNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVD 265

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-----------GVDYWLVRNSWGSD 326
           AG   +Q Y  GVFTG C + L+HGV  VGYG              G  YW+V+NSWG +
Sbjct: 266 AGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPE 325

Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           WG+ GY+ +QR      +G CGIAM  SYPV
Sbjct: 326 WGDAGYILMQRE-ASVASGLCGIAMLPSYPV 355


>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
 gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
          Length = 337

 Score =  300 bits (767), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 139/312 (44%), Positives = 214/312 (68%), Gaps = 9/312 (2%)

Query: 47  EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTN 105
           E+  +++ W AKHGK+ +      +R  IF D L +I++HN+  N T+ +GLNKF+DLTN
Sbjct: 32  EIKNMFEDWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTN 91

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
            E+RAM++G     KR   + ++ ++         LP S+DWR+KGAV P+KDQG CGSC
Sbjct: 92  AEFRAMHVGKF---KRPRYQDRLPAEDEDVDV-SSLPTSLDWRQKGAVTPIKDQGDCGSC 147

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFS +A++E  + + T EL+SLSEQ+L+DCD  ++AGC+GGLM+ AF+F+++NGG+ +E
Sbjct: 148 WAFSAIASIESAHFLATKELVSLSEQQLMDCD-TVDAGCDGGLMETAFKFVVKNGGVTTE 206

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
             YPY G+   C+ ++   KV  I G++ V+     +L KAV+  PV+V+I      FQ+
Sbjct: 207 AAYPYTGSVGSCNANKAKNKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDENFQN 266

Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           Y+SG+ +G+C  +LDHGV+ +GYGTE G+ YW+++NSWG+ WGE+G++K++R   D   G
Sbjct: 267 YKSGILSGKCDDSLDHGVLLIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIERKDGD---G 323

Query: 346 KCGIAMEASYPV 357
            CG+  ++SYP 
Sbjct: 324 MCGMNGDSSYPT 335


>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
          Length = 389

 Score =  300 bits (767), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 158/323 (48%), Positives = 206/323 (63%), Gaps = 14/323 (4%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYK----VGLNK 99
           +++ V+ I+Q W  KH K        EKRF+ FK NL++I E N+  +  K    VGLNK
Sbjct: 41  SEERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNK 100

Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD-ELPESVDWREKGAVNPVKD 158
           FAD++NEE+R  YL   S  K+ + K    S+    K    + P S+DWR  G V  VKD
Sbjct: 101 FADMSNEEFRKAYL---SKVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYGVVTAVKD 157

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QGSCGSCWAFS+  A+EGIN +VTG+LISLSEQELV+CD   N GC GG MDYAF+++I 
Sbjct: 158 QGSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTS-NYGCEGGYMDYAFEWVIN 216

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           NGG+DSE DYPY G +  C+ ++   KVVSIDGY+DV   D  +L  AVA QPVSV I+ 
Sbjct: 217 NGGIDSESDYPYTGVDGTCNTTKEETKVVSIDGYQDVEQSDS-ALLCAVAQQPVSVGIDG 275

Query: 279 GGRAFQHYESGVFTGECG---SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
               FQ Y  G++ G C      +DH V+ VGYG+E+  +YW+V+NSWG+ WG +GY  L
Sbjct: 276 SAIDFQLYTGGIYDGSCSDDPDDIDHAVLIVGYGSEDSEEYWIVKNSWGTSWGIDGYFYL 335

Query: 336 QRNLLDTNTGKCGIAMEASYPVK 358
           +R+  D   G C +   ASYP K
Sbjct: 336 KRD-TDLPYGVCAVNAMASYPTK 357


>gi|110739710|dbj|BAF01762.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
          Length = 300

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 142/195 (72%), Positives = 164/195 (84%), Gaps = 1/195 (0%)

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           AFST+ AVEGINKIVTG+LISLSEQELVDCD   N GCNGGLMDYAF+FII+NGG+D+E 
Sbjct: 1   AFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEA 60

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
           DYPY  A+ +CD +R+NAKVV+ID YEDV    E SLKKA+A QP+SVAIEAGGRAFQ Y
Sbjct: 61  DYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLY 120

Query: 287 ESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
            SGVF G CG+ LDHGVVAVGYGTENG  YW+VRNSWG+ WGE+GY+K+ RN ++  TGK
Sbjct: 121 SSGVFDGLCGTELDHGVVAVGYGTENGKGYWIVRNSWGNRWGESGYIKMARN-IEAPTGK 179

Query: 347 CGIAMEASYPVKNSQ 361
           CGIAMEASYP+K  Q
Sbjct: 180 CGIAMEASYPIKKGQ 194


>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
           Endopeptidase Functioning In Programmed Cell Death Of
           Ricinus Communis Endosperm
 gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
           Endopeptidase Functioning In Programmed Cell Death Of
           Ricinus Communis Endosperm
          Length = 229

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 143/224 (63%), Positives = 171/224 (76%), Gaps = 2/224 (0%)

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           +P SVDWR+KGAV  VKDQG CGSCWAFST+ AVEGIN+I T +L+SLSEQELVDCD   
Sbjct: 2   VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61

Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
           N GCNGGLMDYAF+FI Q GG+ +E +YPY   +  CD S+ NA  VSIDG+E+V   DE
Sbjct: 62  NQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDE 121

Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLV 319
            +L KAVA+QPVSVAI+AGG  FQ Y  GVFTG CG+ LDHGV  VGYGT  +G  YW V
Sbjct: 122 NALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTV 181

Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNS 363
           +NSWG +WGE GY++++R + D   G CGIAMEASYP+K S N+
Sbjct: 182 KNSWGPEWGEKGYIRMERGISDKE-GLCGIAMEASYPIKKSSNN 224


>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
          Length = 352

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 158/362 (43%), Positives = 226/362 (62%), Gaps = 17/362 (4%)

Query: 1   MATASMFLAISTLVFL----FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
           MAT S   +IS ++FL        S S+AD   + Y  + D +S  R    ++ ++ +W+
Sbjct: 1   MATMS---SISKIIFLATCLIIHMSLSSADFYTVGYSQD-DLTSIER----LIQLFDSWM 52

Query: 57  AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
            KH K    +     RF+IF+DNL +IDE N  N +Y +GLN FADL+N+E++  Y+G+ 
Sbjct: 53  LKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGSV 112

Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
           ++    L      ++ +  K     P+S+DWR KGAV PVK+QGSCGSCWAFST+A VEG
Sbjct: 113 AEDFTGL--EHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGSCGSCWAFSTIATVEG 170

Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
           +NKIVTG L+ LSEQELVDCD+  + GC GG    + Q++  NG + + + YPY     +
Sbjct: 171 VNKIVTGNLLELSEQELVDCDKN-SHGCKGGYQTTSLQYVADNG-VHTSKVYPYQAKAMQ 228

Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
           C  + +    V I GY+ V    E S   A+A+QP+SV +EAGG+ FQ Y+SGVF G CG
Sbjct: 229 CRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCG 288

Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           + LDH V AVGYGT +G +Y +++NSWG +WGE GY++L+R   ++  G CG+   + YP
Sbjct: 289 TKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ-GTCGVYKSSYYP 347

Query: 357 VK 358
            K
Sbjct: 348 FK 349


>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
 gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 153/324 (47%), Positives = 205/324 (63%), Gaps = 16/324 (4%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH---NSLNRTYKVGLNKF 100
           +++ ++ I+Q W  +H K       +EKR++ FK NL++I E     +    + VGLNKF
Sbjct: 42  SEESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKF 101

Query: 101 ADLTNEEYRAMYLGTRS---DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           ADL+NEE++ +YL       + KR   +         C A    P S+DWR+KG V  VK
Sbjct: 102 ADLSNEEFKELYLSKVKKPINIKRSTARDWRQRNLQTCDA----PSSLDWRKKGVVTAVK 157

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQG CGSCW+FST  A+EGIN IVTG+LISLSEQELVDCD   N GC GG MDYAF+++I
Sbjct: 158 DQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCD-TTNYGCEGGYMDYAFEWVI 216

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
            NGG+D+E +YPY G +  C+ ++   KVVSIDGY DV   D  +L  A   QP+SV ++
Sbjct: 217 NNGGIDTEANYPYTGVDGTCNTTKEEIKVVSIDGYTDVDETDS-ALLCATVQQPISVGMD 275

Query: 278 AGGRAFQHYESGVFTGECG---SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
                FQ Y  G++ G+C    + +DH V+ VGYG+ENG DYW+V+NSWG++WG  GY  
Sbjct: 276 GSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYGSENGEDYWIVKNSWGTEWGMEGYFY 335

Query: 335 LQRNLLDTNTGKCGIAMEASYPVK 358
           ++RN  D   G C I  EASYP K
Sbjct: 336 IKRN-TDLPYGVCAINAEASYPTK 358


>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
 gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
          Length = 338

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 144/320 (45%), Positives = 212/320 (66%), Gaps = 14/320 (4%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFAD 102
           +D  ++  ++ W+ ++G+         +RF++FKDN+ F++  N+  N  + +G+N+FAD
Sbjct: 28  SDAAMVERHENWMVEYGRVYKDAAEKARRFEVFKDNVAFVESFNTNKNNKFWLGINQFAD 87

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQG 160
           LT EE++A      +   + +   KV +   +Y   +   LP +VDWR KGAV P+K+QG
Sbjct: 88  LTIEEFKA------NKGFKPISAEKVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQG 141

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQN 219
            CG CWAFS VAA+EGI K+ TG LISLSEQELVDCD   ++ GC GG MD AF+F+I+N
Sbjct: 142 QCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKN 201

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
           GG+ +   YPY   + KC    ++A   +I G+EDV   DE +L KAVA+QPVSVA++A 
Sbjct: 202 GGLATVSSYPYKAVDGKCKGGSKSA--ATIKGHEDVPVNDEAALMKAVANQPVSVAVDAS 259

Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRN 338
            R F  Y  GV TG CG+ LDHG+ A+GYG E +G  YW+++NSWG+ WGE G+++++++
Sbjct: 260 DRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYWILKNSWGTTWGEKGFLRMEKD 319

Query: 339 LLDTNTGKCGIAMEASYPVK 358
           + D   G CG+AM+ SYP +
Sbjct: 320 ISDKQ-GMCGLAMKPSYPTE 338


>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 163/359 (45%), Positives = 221/359 (61%), Gaps = 42/359 (11%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA  S  + I+ L+   + S + +  +        H+ S S R +D        W+  +G
Sbjct: 1   MALESKIICITLLIMGVWASQALSRTL--------HEVSMSERHED--------WMGLYG 44

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           +T   +   E+RF+IFK+N+ +I+   S+N+ +K   N +                 +  
Sbjct: 45  RTYKDIAEKERRFKIFKENVEYIE---SVNK-FKASRNGY-----------------NMS 83

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
            R   S++ S RY   A   +P S+DWR+KGAV P+KDQG CG CWAFS VAA+EG+ ++
Sbjct: 84  SRPRSSEITSFRYENVAA--VPSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQL 141

Query: 181 VTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
            TGELISLSEQELVDCD    + GC GGLMD AF+FII NGG+ +E +YPY G +  C+ 
Sbjct: 142 KTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNK 201

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
            +  +    I  YEDV    E +L KAVA  PVSVAI+AGG  FQ Y SGVFTG+CG+ L
Sbjct: 202 KKAASSAAKIKNYEDVPANSEAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTEL 261

Query: 300 DHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           DHGV AVGYG T++G  YWLV+NSWG+ WGE+GY+ ++R+ +  + G CGIAMEASYP 
Sbjct: 262 DHGVTAVGYGKTDDGTKYWLVKNSWGTGWGEDGYIWMERD-IGADEGLCGIAMEASYPT 319


>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
           Short=PPII; Flags: Precursor
 gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
          Length = 352

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 158/362 (43%), Positives = 225/362 (62%), Gaps = 17/362 (4%)

Query: 1   MATASMFLAISTLVFL----FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
           MAT S   +IS ++FL          S+AD   + Y  + D +S  R    ++ ++ +W+
Sbjct: 1   MATMS---SISKIIFLATCLIIHMGLSSADFYTVGYSQD-DLTSIER----LIQLFDSWM 52

Query: 57  AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
            KH K    +     RF+IF+DNL +IDE N  N +Y +GLN FADL+N+E++  Y+G  
Sbjct: 53  LKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFV 112

Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
           ++    L      ++ +  K     P+S+DWR KGAV PVK+QG+CGSCWAFST+A VEG
Sbjct: 113 AEDFTGL--EHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEG 170

Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
           INKIVTG L+ LSEQELVDCD+  + GC GG    + Q++  NG + + + YPY   + K
Sbjct: 171 INKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQYVANNG-VHTSKVYPYQAKQYK 228

Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
           C  + +    V I GY+ V    E S   A+A+QP+SV +EAGG+ FQ Y+SGVF G CG
Sbjct: 229 CRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCG 288

Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           + LDH V AVGYGT +G +Y +++NSWG +WGE GY++L+R   ++  G CG+   + YP
Sbjct: 289 TKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ-GTCGVYKSSYYP 347

Query: 357 VK 358
            K
Sbjct: 348 FK 349


>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
          Length = 342

 Score =  298 bits (763), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 145/308 (47%), Positives = 205/308 (66%), Gaps = 8/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++GK        EKRFQIFK+N+ FI+  H + ++ + + +N+FADL   +++A
Sbjct: 38  HEKWMAQYGKVYKDAAEKEKRFQIFKNNVHFIESFHAAGDKPFNLSINQFADL--HKFKA 95

Query: 111 MYL-GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
           + + G + +   R   +  AS +Y   +   +P S+DWR++GAV P+KDQG+C SCWAFS
Sbjct: 96  LLINGQKKEHNVRTATATEASFKY--DSVTRIPSSLDWRKRGAVTPIKDQGTCRSCWAFS 153

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           TVA +EG+++I  GEL+SLSEQELVDC +  + GC GG ++ AF+FI + GG+ SE  YP
Sbjct: 154 TVATIEGLHQITKGELVSLSEQELVDCVKGDSEGCYGGYVEDAFEFIAKKGGVASETHYP 213

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G    C   +    VV I GYE V    E +L KAVA QPVS  +EAGG AFQ Y SG
Sbjct: 214 YKGVNKTCKVKKETHGVVQIKGYEQVPSNSEKALLKAVAHQPVSAYVEAGGYAFQFYSSG 273

Query: 290 VFTGECGSALDHGVVAVGYGTENGVD-YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           +FTG+CG+ +DH V  VGYG   G + YWLV+NSWG++WGE GY++++R+ +    G CG
Sbjct: 274 IFTGKCGTDIDHSVTVVGYGKARGGNKYWLVKNSWGTEWGEKGYIRMKRD-IRAKEGLCG 332

Query: 349 IAMEASYP 356
           IA  A YP
Sbjct: 333 IATGALYP 340


>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
 gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
          Length = 343

 Score =  298 bits (763), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 140/314 (44%), Positives = 215/314 (68%), Gaps = 11/314 (3%)

Query: 47  EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTN 105
           E+  +++ W AKHGK+ +      +R  IF D L +I++HN+  N T+ +GLNKF+DLTN
Sbjct: 36  EIKNMFEDWAAKHGKSYSSDLEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTN 95

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
            E+RAM++G     KR   + ++ ++         LP S+DWR+KGAV P+KDQG CGSC
Sbjct: 96  AEFRAMHVGKF---KRPRYQDRLPAEDEDVDV-SSLPTSLDWRQKGAVTPIKDQGDCGSC 151

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFS +A++E  + + T EL+SLSEQ+L+DCD  ++AGC+GGLM+ AF+F+++NGG+ +E
Sbjct: 152 WAFSAIASIESAHFLATKELVSLSEQQLMDCD-TVDAGCDGGLMETAFKFVVKNGGVTTE 210

Query: 226 QDYPYLGAENKCDPSRRNA--KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
             YPY G+   C+ ++     KV  I G++ V+     +L KAV+  PV+V+I      F
Sbjct: 211 ASYPYTGSVGSCNANKVAIINKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDENF 270

Query: 284 QHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           Q+Y+SG+ +G+CG +LDHGV+ +GYGTE G+ YW+++NSWG+ WGE+G++K++R   D  
Sbjct: 271 QNYKSGILSGQCGDSLDHGVLLIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIERKDGD-- 328

Query: 344 TGKCGIAMEASYPV 357
            G CG+  ++SYP 
Sbjct: 329 -GICGMNGDSSYPT 341


>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
          Length = 448

 Score =  297 bits (760), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 151/312 (48%), Positives = 200/312 (64%), Gaps = 18/312 (5%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNE 106
           ++  +  K  K         +RF +F  N+ FI+ HN+       T+ V +N+FADLTNE
Sbjct: 29  LFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHTVDVNQFADLTNE 88

Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           EYR +YL  R      L + +         AG     SVDWR+KGAV P+K+QG CGSCW
Sbjct: 89  EYRQLYL--RPYPTELLGRERQEVWLDGPNAG-----SVDWRQKGAVTPIKNQGQCGSCW 141

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
           +FST  +VEG + I TG L+SLSEQ+LVDC     N GCNGGLMD AF++II NGG+D+E
Sbjct: 142 SFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTE 201

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
           QDYPY   +  CD S+ +   VSI GY+DV   +E  L  AV   PVSVAIEA  ++FQ 
Sbjct: 202 QDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQM 261

Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           Y SGVF+G CG+ LDHGV+ VGY +    DYW+V+NSWG+ WG+ GY+ ++R +  ++ G
Sbjct: 262 YSSGVFSGPCGTNLDHGVLVVGYTS----DYWIVKNSWGASWGDQGYIMMKRGV--SSAG 315

Query: 346 KCGIAMEASYPV 357
            CGIAM+ SYP+
Sbjct: 316 ICGIAMQPSYPI 327


>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
          Length = 352

 Score =  296 bits (759), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 157/362 (43%), Positives = 224/362 (61%), Gaps = 17/362 (4%)

Query: 1   MATASMFLAISTLVFL----FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
           MAT S   +IS ++FL          S+AD   + Y  + D +S  R    ++ ++ +W+
Sbjct: 1   MATMS---SISKIIFLATCLIIHMGLSSADFYTVGYSQD-DLTSIER----LIQLFDSWM 52

Query: 57  AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
            KH K    +     RF+IF+DNL +IDE N  N +Y +GLN FADL+N+E++  Y+G  
Sbjct: 53  LKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFV 112

Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
           ++    L      ++ +  K     P+S+DWR KGAV PVK+QG+CGSCWAFST+A VEG
Sbjct: 113 AEDFTGL--EHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEG 170

Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
           INKIVTG L+ LSEQELVDCD+  + GC GG    + Q++  NG + + + YPY   + K
Sbjct: 171 INKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQYVANNG-VHTSKVYPYQAKQYK 228

Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
           C  + +    V I GY+ V    E S   A+A+QP+S  +EAGG+ FQ Y+SGVF G CG
Sbjct: 229 CRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCG 288

Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           + LDH V AVGYGT +G +Y +++NSWG +WGE GY++L+R   ++  G CG+   + YP
Sbjct: 289 TKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ-GTCGVYKSSYYP 347

Query: 357 VK 358
            K
Sbjct: 348 FK 349


>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
 gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
          Length = 450

 Score =  296 bits (759), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 145/306 (47%), Positives = 195/306 (63%), Gaps = 6/306 (1%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++ W A+HG++    G    R   F DN  F+  HN    +Y + LN FADLT++E+RA 
Sbjct: 38  FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYALALNAFADLTHDEFRAA 97

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
                          +     Y    G    +P++VDWR+ GAV  VKDQGSCG+CW+FS
Sbjct: 98  ---RLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFS 154

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
              A+EGINKI TG LISLSEQEL+DCDR  N+GC GGLMDYA++F+++NGG+D+E DYP
Sbjct: 155 ATGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   +  C+ ++   +VV+IDGY+DV   +E  L +AVA QPVSV I    RAFQ Y  G
Sbjct: 215 YRETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKG 274

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           +F G C ++LDH ++ VGYG+E G DYW+V+NSWG  WG  GY+ + RN  ++N G CGI
Sbjct: 275 IFDGPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSN-GVCGI 333

Query: 350 AMEASY 355
               S+
Sbjct: 334 NQMPSF 339


>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
          Length = 449

 Score =  296 bits (759), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 146/306 (47%), Positives = 196/306 (64%), Gaps = 7/306 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++ W A+HG++    G    R   F DN  F+  HN    +Y + LN FADLT++E+RA 
Sbjct: 38  FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYALALNAFADLTHDEFRA- 96

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
               R          +     Y    G    +P++VDWR+ GAV  VKDQGSCG+CW+FS
Sbjct: 97  ---ARLGRLAAAGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFS 153

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
              A+EGINKI TG LISLSEQEL+DCDR  N+GC GGLMDYA++F+++NGG+D+E DYP
Sbjct: 154 ATGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYP 213

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   +  C+ ++   +VV+IDGY+DV   +E  L +AVA QPVSV I    RAFQ Y  G
Sbjct: 214 YRETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKG 273

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           +F G C ++LDH ++ VGYG+E G DYW+V+NSWG  WG  GY+ + RN  ++N G CGI
Sbjct: 274 IFDGPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSN-GVCGI 332

Query: 350 AMEASY 355
               S+
Sbjct: 333 NQMPSF 338


>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
          Length = 343

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 144/319 (45%), Positives = 210/319 (65%), Gaps = 11/319 (3%)

Query: 44  TDDEVMT-IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFA 101
           +DD  M   ++ W+A +G+         +RF++FKDNL F++  N+  +  + +G+N+FA
Sbjct: 32  SDDAAMAERHERWMAVYGRVYKDAAEKARRFEVFKDNLAFVESFNADKKNKFWLGVNQFA 91

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           DLT EE++A   G +  +   +  +    +  +  A   LP +VDWR KGAV P+K+QG 
Sbjct: 92  DLTTEEFKANK-GFKPISAEEVPTTGFKYENLSVSA---LPTAVDWRTKGAVTPIKNQGQ 147

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
           CG CWAFS VAA+EGI K+ T  L+SLSEQELVDCD   ++ GC GG MD AF+F+I+NG
Sbjct: 148 CGCCWAFSAVAAMEGIVKLSTDNLVSLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNG 207

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+ +E  YPY   + KC    ++A   +I G+EDV P +E +L KAVA QPVSVA++A  
Sbjct: 208 GLATESSYPYKAVDGKCKGGSKSA--ATIKGHEDVPPNNEAALMKAVASQPVSVAVDASD 265

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNL 339
           R F  Y  GV TG CG+ LDHG+ A+GYG E +G  YW+++NSWG+ WGE  ++++++++
Sbjct: 266 RTFMLYSGGVMTGSCGTQLDHGIAAIGYGVESDGTKYWILKNSWGTTWGEKRFLRMEKDI 325

Query: 340 LDTNTGKCGIAMEASYPVK 358
            D   G CG+AM+ SYP +
Sbjct: 326 SDKQ-GMCGLAMKPSYPTE 343


>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
 gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
          Length = 337

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 142/318 (44%), Positives = 208/318 (65%), Gaps = 11/318 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFAD 102
           +D  ++  ++ W+ ++G+         +RF+ FK N+ F++  N+  +  + +G+N+FAD
Sbjct: 28  SDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFAD 87

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           LT EE++A         K    K      +Y   +   LP +VDWR KGAV P+K+QG C
Sbjct: 88  LTTEEFKA-----NKGFKPTAEKVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQC 142

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGG 221
           G CWAFS VAA+EGI K+ TG LISLSEQELVDCD   ++ GC GG MD AF+F+I+NGG
Sbjct: 143 GCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 202

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E +YPY   + KC    ++A   +I G+EDV   +E +L KAVA+QPVSVA++A  R
Sbjct: 203 LATESNYPYKAVDGKCKGGSKSA--ATIKGHEDVPVNNEAALMKAVANQPVSVAVDASDR 260

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
            F  Y  GV TG CG+ LDHG+ A+GYG E +G  YW+++NSWG+ WGE G++++++++ 
Sbjct: 261 TFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWILKNSWGTTWGEKGFLRMEKDIT 320

Query: 341 DTNTGKCGIAMEASYPVK 358
           D   G CG+AM+ SYP +
Sbjct: 321 DKR-GMCGLAMKPSYPTE 337


>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
 gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
 gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
          Length = 498

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 153/323 (47%), Positives = 203/323 (62%), Gaps = 17/323 (5%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT---YKVGLNKF 100
           T++ +  +++ W  KH K        E+R   FK NL++I E N   ++   +KVGLNKF
Sbjct: 42  TEEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKF 101

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY--ACKAGDELPESVDWREKGAVNPVKD 158
           ADL+NEE+R MYL   S  K+ +   +    R+   C A    P S+DWR KG V  VKD
Sbjct: 102 ADLSNEEFREMYL---SKVKKPITIEEKRKHRHLQTCDA----PSSLDWRNKGVVTAVKD 154

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QG CGSCW+FST  A+E IN IVTG+LISLSEQELVDCD   N GC GG MD AFQ++I 
Sbjct: 155 QGDCGSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIG 214

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           NGG+D+E DYPY G +  C+ ++   KVVSI+GY DV P D  +L  A   QP+SV ++ 
Sbjct: 215 NGGIDTEADYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSDS-ALLCATVQQPISVGMDG 273

Query: 279 GGRAFQHYESGVFTGECG---SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
               FQ Y  G++ G+C    + +DH ++ VGYG+EN  DYW+V+NSWG++WG  GY  +
Sbjct: 274 SALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYGSENDEDYWIVKNSWGTEWGMEGYFYI 333

Query: 336 QRNLLDTNTGKCGIAMEASYPVK 358
           +RN      G C I  +ASYP K
Sbjct: 334 RRN-TSKPYGVCAINADASYPTK 355


>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
 gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  296 bits (757), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 147/325 (45%), Positives = 211/325 (64%), Gaps = 11/325 (3%)

Query: 40  SSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLN 98
           +S      ++  ++ W+ +HGK        E+RFQIFK+NL FI+  N+  +  + + +N
Sbjct: 23  TSLVISSRLLEKHEQWMEEHGKFYKDAAEKEQRFQIFKENLEFIESFNAAGDNGFNLSIN 82

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR----YACKAGDELPESVDWREKGAVN 154
           +F D TN+E++A YL  +   K+ L+   +A+      +  +   E+P ++DWRE+GAV 
Sbjct: 83  QFGDQTNDEFKANYLNGK---KKPLIGVGIAAIEEESVFRYENVTEVPATMDWRERGAVT 139

Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
           P+K Q  CGSCWAF+TVAA+EGI++I TG L+SLSEQELVDC +     GCNGG ++ A 
Sbjct: 140 PIKHQHLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELVDCVKTNTTDGCNGGYVEDAC 199

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
            FI++ GG+ SE +YPY   + KC+  +    V  I GYE V   +E +L KAVA+QP++
Sbjct: 200 DFIVKKGGITSETNYPYTRVDGKCNVRKGTYNVAKIKGYEHVPANNEKALLKAVANQPIA 259

Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGY 332
           V I A  RAFQ Y SG+  G+CG  LDH V  VGYGT ++GV YWLV+NSWG+ WGE GY
Sbjct: 260 VYIAATKRAFQFYSSGILKGKCGIDLDHTVTIVGYGTSDDGVKYWLVKNSWGTKWGEKGY 319

Query: 333 VKLQRNLLDTNTGKCGIAMEASYPV 357
           +K++R+ +    G CGIAM  +YP+
Sbjct: 320 IKIKRD-VHAKEGSCGIAMVPTYPI 343


>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
          Length = 443

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 139/303 (45%), Positives = 206/303 (67%), Gaps = 6/303 (1%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
           D  ++  ++ W+AK+ +  +      +RF++FK N+  I+  N+ N  + +  N+FADLT
Sbjct: 34  DQAMVARHEEWMAKYDRVYSDAAEKARRFEVFKANMALIESVNAGNHKFWLEANRFADLT 93

Query: 105 NEEYRAMYLGTR--SDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQG 160
           ++E+RA + G R  + A     +S+ A+   +YA  + D++P SVDWR KGAV P+K+QG
Sbjct: 94  DDEFRATWTGYRPKTAAASSKGRSRTATTGFKYANVSLDDVPASVDWRTKGAVTPIKNQG 153

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQN 219
            CG CWAFS VA++EG+ K+ TG+L+SLSEQELVDCD   ++ GC GG MD AF FI+ N
Sbjct: 154 ECGCCWAFSAVASMEGVVKLSTGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFDFIVGN 213

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
           GG+ +E  YPY  ++  C+ +  +    SI GYEDV   DE SL+KAVA+QPVSVA++ G
Sbjct: 214 GGLTTESRYPYTASDGTCNSNEASGDAASIKGYEDVPANDEASLRKAVANQPVSVAVDGG 273

Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRN 338
              F+ Y+ GV +G CG+ LDHG+ AVGYG   +G  YW+++NSWG+ WGE GY++++R+
Sbjct: 274 DSHFRFYKGGVLSGACGTELDHGIAAVGYGVASDGTKYWVMKNSWGTSWGEAGYIRMERD 333

Query: 339 LLD 341
           + D
Sbjct: 334 IAD 336


>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
          Length = 286

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 139/254 (54%), Positives = 186/254 (73%), Gaps = 5/254 (1%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
           D+++ ++++W+++HGK    +     RF+IFKDNL+ IDE N +   Y +GLN+FADL++
Sbjct: 2   DKLIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVSNYWLGLNEFADLSH 61

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
            E++  YLG + D   R    + +S+ +  +  D LP+SVDWR+KGAV  +K+QGSCGSC
Sbjct: 62  HEFKKQYLGLKVDFSTR----RESSEEFTYRDVD-LPKSVDWRKKGAVTNIKNQGSCGSC 116

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFSTVAAVEGIN+IVTG L SLSEQEL+DCDR  N+GCNGGLMDYAF FI++NGG+  E
Sbjct: 117 WAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGGLMDYAFSFIVENGGLHKE 176

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
            DYPY+  E  C+ S+  ++VV+I GY DV   +E SL KA+A+QP+SVAIEA GR FQ 
Sbjct: 177 DDYPYIMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 236

Query: 286 YESGVFTGECGSAL 299
           Y  GVF G CG+ L
Sbjct: 237 YSGGVFDGHCGTQL 250


>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
          Length = 501

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 156/356 (43%), Positives = 216/356 (60%), Gaps = 20/356 (5%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           L  + FI +S A   S +  +  +     + +++ V  ++  W  +H +         KR
Sbjct: 8   LALVLFIWASLACLSSSLPTEF-YITGEEFASEERVRELFHLWKERHKRVYKHAEETAKR 66

Query: 73  FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK-------RRLMK 125
           F+IFK+NL+++ E NS    + +G+NKFAD++NEE++  YL              RR M+
Sbjct: 67  FEIFKENLKYVIERNSKGHRHTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLRRSMQ 126

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
            K  +      A  E P S+DWR+KG V  +KDQG CGSCWAFS+  A+EGIN IVTG+L
Sbjct: 127 QKKGT------ASCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDL 180

Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           ISLSEQELVDCD   N GC GG MDYAF+++I NGG+DSE DYPY G +  C+ ++ + K
Sbjct: 181 ISLSEQELVDCD-TTNYGCEGGYMDYAFEWVISNGGIDSESDYPYTGTDGTCNTTKEDTK 239

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTG---ECGSALDHG 302
           VVSIDGY+DV   D   L  AV +QP+SV ++     FQ Y SG++ G   +    +DH 
Sbjct: 240 VVSIDGYKDVDESDSALLCAAV-NQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHA 298

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           V+ VGYG+E+  DYW+ +NSWG+ WG  GY  ++RN  D   G+C I   ASYP K
Sbjct: 299 VLIVGYGSEDSEDYWICKNSWGTSWGMEGYFYIKRN-TDLPYGECAINAMASYPTK 353


>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 339

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 143/308 (46%), Positives = 206/308 (66%), Gaps = 8/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++GK        EKRFQIFK+N++FI+  N+  ++ + + +N+FADL NEE++A
Sbjct: 37  HEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKA 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             +  +   +  +  +   S RY  ++  ++P ++DWR++GAV P+KDQG+CGSCWAFS 
Sbjct: 97  SLINVQKK-ESGVETATETSFRY--ESITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSI 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           VAA+EGI++I TG+L+SLSEQELVDC +  + GCN G  + AF+F+ +NGG+ SE  YPY
Sbjct: 154 VAAIEGIHQITTGKLVSLSEQELVDCVKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPY 213

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
                 C   +    V  I GYE+V    E +L KAVA+QPVSV I+AG  A Q Y SG+
Sbjct: 214 KANNKTCMVKKETQGVAQIKGYENVPSNSEKALLKAVANQPVSVYIDAG--ALQFYSSGI 271

Query: 291 FTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           FTG+CG+A +H    +GYG    G  YWLV+NSWG+ WGE GY++++R+ +    G CGI
Sbjct: 272 FTGKCGTAPNHAATVIGYGKARGGAKYWLVKNSWGTKWGEKGYIRMKRD-IRAKEGLCGI 330

Query: 350 AMEASYPV 357
           A  ASYP 
Sbjct: 331 ATNASYPT 338


>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
 gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
          Length = 338

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 142/318 (44%), Positives = 210/318 (66%), Gaps = 10/318 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFAD 102
           +D  ++  ++ W+ ++G+         +RF+ FK N+ F++  N+  +  + +G+N+FAD
Sbjct: 28  SDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFAD 87

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           LT EE++A   G +  +   +  +    +  +  A   LP +VDWR KGAV P+K+QG C
Sbjct: 88  LTTEEFKANK-GFKPISAEMVPTTGFKYENLSVSA---LPTAVDWRTKGAVTPIKNQGQC 143

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGG 221
           G CWAFS VAA+EGI K+ TG LISLSEQELVDCD   ++ GC GG MD AF+F+I+NGG
Sbjct: 144 GCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 203

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E  YPY   + KC    ++A   +I G+EDV   DE +L KAVA+QPVSVA++A  R
Sbjct: 204 LATESSYPYKAVDGKCKGGSKSA--ATIKGHEDVPVNDEAALMKAVANQPVSVAVDASDR 261

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
            F  Y  GV TG CG+ LDHG+ A+GYG E +G  YW+++NSWG+ WGE G++++++++ 
Sbjct: 262 TFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYWILKNSWGTTWGEKGFLRMEKDIS 321

Query: 341 DTNTGKCGIAMEASYPVK 358
           D   G CG+AM+ SYP +
Sbjct: 322 DKQ-GMCGLAMKPSYPTE 338


>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
          Length = 323

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 149/333 (44%), Positives = 218/333 (65%), Gaps = 14/333 (4%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           D +I+ Y  + D +S  R    ++ ++++W  ++ K    +     RF+IFKDNL +IDE
Sbjct: 1   DFAIVGYSQD-DLTSIER----LVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDE 55

Query: 86  HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
            N  N +Y +GLN+FADLT++E++A Y+G+  +    + +S    + +  K   + PES+
Sbjct: 56  TNKKNSSYWLGLNEFADLTHDEFKAKYVGSLGEDSTIIEQSD--DEEFPYKHVVDYPESI 113

Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
           DWR+KGAV PVK+Q  CGSCWAFSTVA VEGINKIVTG+LISLSEQEL+DCDR+ + GC 
Sbjct: 114 DWRQKGAVTPVKNQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCK 172

Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
           GG    + Q++  N G+ +E++YPY   + KC    +    V I GY+ V   +E+SL +
Sbjct: 173 GGYQTTSLQYVADN-GVHTEKEYPYEKKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQ 231

Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGS 325
           A+A+QPVSV +E+ GRAFQ Y+ G+F G CG+ +DH V AVGYG     +Y L++NSWG 
Sbjct: 232 AIANQPVSVVVESKGRAFQFYKGGIFEGPCGTKVDHAVTAVGYGK----NYILIKNSWGP 287

Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
            WGE GY++++R     + G CG+   + +P K
Sbjct: 288 KWGEKGYIRIKR-ASGKSKGTCGVYSSSYFPTK 319


>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
          Length = 367

 Score =  295 bits (754), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 165/380 (43%), Positives = 227/380 (59%), Gaps = 26/380 (6%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA     L IS  + +  +S++ A   S    D N        + + ++ ++  WL +HG
Sbjct: 1   MANPLHLLLISATI-ICLVSAAKAVQHSYEVGDIN--------SGNGLVRLFDRWLGRHG 51

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K         +R QIF+ NL++I  HN + N ++++GLNKFADLTNEE++  Y G  S  
Sbjct: 52  KLYGSHEEKARRLQIFRTNLQYIHAHNKNSNSSFRLGLNKFADLTNEEFKTRYFGKNSKQ 111

Query: 120 --KRRLMKSKVASQRYACK-------AGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
              RR  + + A  R   K       +   +  S+DWR+KGAV  VKDQ  CGSCWAFST
Sbjct: 112 WRDRRRTELEGAELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFST 171

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
             A+EG+N I TG+L+SLSEQELV CD   N GC GG MDYAF ++IQNGG+D+E+DY Y
Sbjct: 172 TGAIEGVNFISTGKLVSLSEQELVACD-ATNYGCEGGDMDYAFTWVIQNGGIDTEKDYSY 230

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
            G ++ C+ ++   K+VSIDGY DVSP D+ +L  A   QPVSV I+     FQ Y  G+
Sbjct: 231 TGVDSTCNTNKEAKKIVSIDGYTDVSP-DDSALLCAAGSQPVSVGIDGSAIDFQLYTGGI 289

Query: 291 FTGECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           + G+C      +DH V+ VGY  +NG DYW+V+NSWG+DWG  GY  + RN  +   G C
Sbjct: 290 YDGDCSGNPDDIDHAVLVVGYSAKNGKDYWIVKNSWGTDWGLEGYFYILRN-TELPYGVC 348

Query: 348 GIAMEASYPVKNSQNSAKPK 367
            I   ASYP K +++S + K
Sbjct: 349 AINAMASYPTK-TESSVQSK 367


>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
 gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 162/350 (46%), Positives = 219/350 (62%), Gaps = 20/350 (5%)

Query: 19  ISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKD 78
           +SSS  ++ SI+  D      S    D+ ++ I+Q W  +H K        EKRF  FK 
Sbjct: 15  VSSSLPSEYSIVGND-----FSELPPDESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKR 69

Query: 79  NLRFIDEHNSLNRT--YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ---RY 133
           NL++I E      T  ++VGLNKFADL+NEE++ +YL   S  K+ + K+++ ++   R 
Sbjct: 70  NLKYIIEKTGKETTLRHRVGLNKFADLSNEEFKQLYL---SKVKKPINKTRIDAEDRSRR 126

Query: 134 ACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQEL 193
             ++ D  P S+DWR+KG V  VKDQG CGSCW+FST  A+EGIN IVT +LISLSEQEL
Sbjct: 127 NLQSCDA-PSSLDWRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTSDLISLSEQEL 185

Query: 194 VDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYE 253
           VDCD   N GC GG MDYAF+++I NGG+D+E +YPY G +  C+ ++   KVVSIDGY+
Sbjct: 186 VDCD-TTNYGCEGGYMDYAFEWVINNGGIDTEANYPYTGVDGTCNTAKEEIKVVSIDGYK 244

Query: 254 DVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF---TGECGSALDHGVVAVGYGT 310
           DV   D  +L  A A QP+SV I+     FQ Y  G++     +    +DH V+ VGYG+
Sbjct: 245 DVDETDS-ALLCAAAQQPISVGIDGSAIDFQLYTGGIYDGDCSDDPDDIDHAVLIVGYGS 303

Query: 311 ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           ENG DYW+V+NSWG+ WG  GY  ++RN  D   G C I   ASYP K +
Sbjct: 304 ENGEDYWIVKNSWGTSWGIEGYFYIKRN-TDLPYGVCAINAMASYPTKEA 352


>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
 gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
          Length = 328

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 150/319 (47%), Positives = 209/319 (65%), Gaps = 24/319 (7%)

Query: 45  DDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           DD  M   ++ W+ ++ +         +RF++FK N++FI+  N+  NR + +G+N+FAD
Sbjct: 29  DDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKFIESFNAGGNRKFWLGVNQFAD 88

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
           LTN+E+RA    T+++   +    KV +  RY   + D LP ++DWR KGAV P+KDQG 
Sbjct: 89  LTNDEFRA----TKTNKGFKPSPVKVPTGFRYENVSVDALPATIDWRTKGAVTPIKDQGQ 144

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
           C            EGI KI TG+LISLSEQELVDCD    + GC GGLMD AFQFII+NG
Sbjct: 145 C------------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFQFIIKNG 192

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+ +E  YPY  A+ KC     +A   ++ G+EDV   DE +L KAVA+QPVSVA++ G 
Sbjct: 193 GLTTESSYPYTAADGKCKSGSNSA--ATVKGFEDVPANDEAALMKAVANQPVSVAVDGGD 250

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y  GV TG CG+ LDHG+ A+GYG T +G  YWL++NSWG+ WGENGY+++++++
Sbjct: 251 MTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDI 310

Query: 340 LDTNTGKCGIAMEASYPVK 358
            D   G CG+AME SYP++
Sbjct: 311 SDKR-GMCGLAMEPSYPIE 328


>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
          Length = 322

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 159/360 (44%), Positives = 224/360 (62%), Gaps = 42/360 (11%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA+ + +  I  L  LF +    AA  S  +  N H+ S   R +D        W+A++G
Sbjct: 1   MASVNQYQYI-CLALLFVL----AAWASQATARNLHEASMYERHED--------WMAQYG 47

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           +         KR++IFKDN+  I+  N +++++YK+ +N+FADLTNEE+      +R+  
Sbjct: 48  RVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFGT----SRNRF 103

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           K  +  ++  S +Y  +    +P ++DWR+KGAV P+KDQG CGSCWAFS VAA+EGI +
Sbjct: 104 KAHICSTEATSFKY--ENVTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161

Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           + TG+LISLSEQELVDCD    + GCNG                    +YPY G +  C+
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCNGA-------------------NYPYAGTDGTCN 202

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             +       I+GYEDV   +E +L+KAV  QP++VAI+AGG  FQ Y SGVFTG+CG+ 
Sbjct: 203 RKKAAHPAAKINGYEDVPANNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTE 262

Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           LDHGV AVGYGT ++G+ YWLV+NSWG+ WGE GY+++QR++     G CGIAM+ASYP 
Sbjct: 263 LDHGVAAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIAMQASYPT 321


>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 160/360 (44%), Positives = 224/360 (62%), Gaps = 44/360 (12%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA+ + +  I  L  LF +    AA  S  +  N H+ S   R +D        W+ ++G
Sbjct: 1   MASVNQYQYI-CLALLFVL----AAWASQATARNLHEASMYERHED--------WMVQYG 47

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           +         KR++IFKDN+  I+  N +++++YK+ +N+FADLTNEE+RA    +R+  
Sbjct: 48  REYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA----SRNRF 103

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
           K  +  ++  S +Y  +    +P +VDWR+KGAV P+KDQG CGSCWAFS VAA+EGI +
Sbjct: 104 KAHICSTEATSFKY--ENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161

Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           + TG+LISLSEQELVDCD    + GC                      +YPY G +  C+
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCT---------------------NYPYAGTDGTCN 200

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             +       I+GYEDV   +E +L+KAVA QP++VAI+AGG  FQ Y SGVFTG+CG+ 
Sbjct: 201 RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTE 260

Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           LDHGV AVGYGT ++G+ YWLV+NSWG+ WGE GY+++QR++     G CGIAM+ASYP 
Sbjct: 261 LDHGVSAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIAMQASYPT 319


>gi|125592011|gb|EAZ32361.1| hypothetical protein OsJ_16571 [Oryza sativa Japonica Group]
          Length = 416

 Score =  294 bits (752), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 164/343 (47%), Positives = 215/343 (62%), Gaps = 46/343 (13%)

Query: 27  MSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG------MGHNEKRFQIFKDNL 80
           MSII  +  H      RT+ +    Y  WLA+H +   G      +G +E+RF++F DNL
Sbjct: 1   MSIIRNNAEHGVRGLERTEAQARAAYDLWLARHRRGGGGGSRNGFIGEHERRFRVFWDNL 60

Query: 81  RFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKA 137
           +F+D HN+       +++G+N+FADLTN E+RA YLGT    + R +      + Y    
Sbjct: 61  KFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAGRGRRV-----GEAYRHDG 115

Query: 138 GDELPESVDWREKGAV-NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC 196
            + LP+SVDWR+KGAV  PVK+QG CG+                  G     +EQ L   
Sbjct: 116 VEALPDSVDWRDKGAVVAPVKNQGQCGA-----------------GGVREERAEQRL--- 155

Query: 197 DRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVS 256
                      +MD AF FI +NGG+D+E+DYPY   + KC+ ++R+ KVVSIDG+EDV 
Sbjct: 156 --------QRWIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSRKVVSIDGFEDVP 207

Query: 257 PFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGV 314
             DE+SL+KAVA QPVSVAI+AGGR FQ Y+SGVFTG CG+ LDHGVVAVGYGT+   G 
Sbjct: 208 ENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVVAVGYGTDAATGA 267

Query: 315 DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            YW VRNSWG DWGENGY++++RN+    TGKCGIAM ASYP+
Sbjct: 268 AYWTVRNSWGPDWGENGYIRMERNVT-ARTGKCGIAMMASYPI 309


>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
          Length = 340

 Score =  293 bits (751), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 159/365 (43%), Positives = 228/365 (62%), Gaps = 46/365 (12%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +  L FL F+S+ +               S++WR+DDEV+ +Y+ WL KH K  + +G  
Sbjct: 5   VLILSFLLFVSAITCI-------------STNWRSDDEVIALYEEWLVKHQKLYSSLGEK 51

Query: 70  EKRFQIFKDNLRFIDEHNSLNRT----YKVGLNKFADLTNEEYRAMYLGT---------- 115
            KRF+IFKDNLR+ID+ N  N+     + +GLN+FADLT +E+ ++YLGT          
Sbjct: 52  IKRFEIFKDNLRYIDQQNHYNKVNHMNFTLGLNQFADLTLDEFSSIYLGTSVDYEQIISS 111

Query: 116 ---RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
                D +  ++K  V           ELP+SVDWREKG V P+++QG CGSCW FS VA
Sbjct: 112 NPNHDDVEEDILKEDVV----------ELPDSVDWREKGVVFPIRNQGKCGSCWTFSAVA 161

Query: 173 AVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
           ++E +N I  G +I+LSEQEL+DC+  I+ GC GG  + AF ++ +NG + SE+ YPY+ 
Sbjct: 162 SIETLNGIKKGHMIALSEQELLDCE-TISQGCKGGHYNNAFAYVAKNG-ITSEEKYPYIF 219

Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
            + +C    +  KVV I GY+ V   +   L+ AVA Q VSVA++   + FQ Y+ G+F+
Sbjct: 220 RQGQC---YQKEKVVKISGYKRVPRNNGGQLQSAVAQQVVSVAVKCESKDFQFYDRGIFS 276

Query: 293 GECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
           G CG  LDH V  VGYG++ G +YW++RNSWG++WGENGY+++Q+N      G CGIAM+
Sbjct: 277 GACGPILDHAVNIVGYGSKGGANYWIMRNSWGTNWGENGYMRIQKN-SKHYEGHCGIAMQ 335

Query: 353 ASYPV 357
            SYPV
Sbjct: 336 PSYPV 340


>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  293 bits (750), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 149/362 (41%), Positives = 226/362 (62%), Gaps = 25/362 (6%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA+ ++F   S  + L F   + +A+   +   + H+              ++ W+A+HG
Sbjct: 1   MASENLFHCTSLALLLLFGFWAFSANTRTLEDASMHER-------------HEQWMAQHG 47

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           K        E R++IF+ N++ I+  N+  N+++K+G+N+FADLT EE++A+      + 
Sbjct: 48  KVYKDHHEKELRYKIFQQNVKGIEGFNNAGNKSHKLGVNQFADLTEEEFKAI------NK 101

Query: 120 KRRLMKSKVA-SQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFSTVAAVEGI 177
            +  M SK++ +  +  +   ++P ++DWR+KGAV P+K QG  CGSCWAF+ VAA EGI
Sbjct: 102 LKGYMWSKISRTSTFKYEHVTKVPATLDWRQKGAVTPIKSQGLKCGSCWAFAAVAATEGI 161

Query: 178 NKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
            K+ TGELISLSEQEL+DCD    N GC  G++  AF+FI+QN G+ +E  YPY   +  
Sbjct: 162 TKLTTGELISLSEQELIDCDTNGDNGGCKWGIIQEAFKFIVQNKGLATEASYPYQAVDGT 221

Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
           C+    +  V SI GYEDV   +E +L  AVA+QPVSV +++    F+ Y SGV +G CG
Sbjct: 222 CNAKVESKHVASIKGYEDVPANNETALLNAVANQPVSVLVDSSDYDFRFYSSGVLSGSCG 281

Query: 297 SALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
           +  DH V  VGYG +++G  YWL++NSWG  WGE GY++++R++     G CGIAM+ASY
Sbjct: 282 TTFDHAVTVVGYGVSDDGTKYWLIKNSWGVYWGEQGYIRIKRDVA-AKEGMCGIAMQASY 340

Query: 356 PV 357
           P+
Sbjct: 341 PI 342


>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
          Length = 341

 Score =  293 bits (750), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 154/341 (45%), Positives = 217/341 (63%), Gaps = 20/341 (5%)

Query: 1   MATASMFLAISTLVFLFFISSS----SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
           MAT     +IS L+F+    S     S+AD SI+ Y  + D  +S  +    + ++++W+
Sbjct: 1   MAT---IFSISKLIFVVTCLSLHLGLSSADFSIVGY--SQDDLTSIESS---IRLFESWM 52

Query: 57  AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
            KH K    +     RF+ FKDNL +IDE N  N +Y +GLN+FADLT++E++  Y+G+ 
Sbjct: 53  LKHDKVYKTIDEKIYRFETFKDNLMYIDETNKKNNSYWLGLNEFADLTHDEFKEKYVGSI 112

Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
            +    + +S      +  K   + PES+DWR+KGAV PVK+Q  CGSCWAFSTVA VEG
Sbjct: 113 PEDSMIIEQSD--DVEFPNKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWAFSTVATVEG 170

Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
           INKIVTG LISLSEQEL+DCDR+ + GC GG    + ++++ N G+ +E++YPY   +  
Sbjct: 171 INKIVTGNLISLSEQELLDCDRR-SHGCKGGYQTTSLKYVVDN-GVHTEKEYPYEKKQGN 228

Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
           C    +    V I+GY+ V   DE+SL K ++ QPVSV +E+ GR FQ Y+ GVF G CG
Sbjct: 229 CRAKNKKGLKVYINGYKRVPSNDEISLIKTISIQPVSVLVESKGRPFQFYKGGVFGGPCG 288

Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
           + LDH V AVGYG     DY L++NSWG  WG+ GY+K++R
Sbjct: 289 TKLDHAVTAVGYGK----DYILIKNSWGPKWGDKGYIKIKR 325


>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
          Length = 361

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 156/362 (43%), Positives = 223/362 (61%), Gaps = 17/362 (4%)

Query: 1   MATASMFLAISTLVFL----FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
           MAT S   +IS ++FL          S+AD   + Y  + D +S  R    ++ ++ +W+
Sbjct: 1   MATMS---SISKIIFLATCLIIHMGLSSADFYTVGYSQD-DLTSIER----LIQLFDSWM 52

Query: 57  AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
            KH K    +     RF+IF+DNL +IDE N  N +Y +GLN FADL+N+E++  Y+G  
Sbjct: 53  LKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFV 112

Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
           ++    L      ++ +  K     P+S+DWR KGAV PVK+QG+CGSCWAFST+A VEG
Sbjct: 113 AEDFTGL--EHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEG 170

Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
           INKIVTG L+ LSEQELVDCD+  + GC GG    + Q++  NG + + + YP    + K
Sbjct: 171 INKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQYVANNG-VHTSKVYPCQAKQYK 228

Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
           C  + +    V I GY+ V    E S   A+A+QP+S  +EAGG+ FQ Y+SGVF G CG
Sbjct: 229 CRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCG 288

Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           + LDH V AVGYGT +G +Y +++NSWG +WGE GY++L+R   ++  G CG+   + YP
Sbjct: 289 TKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ-GTCGVYKSSYYP 347

Query: 357 VK 358
            K
Sbjct: 348 FK 349


>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
 gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
          Length = 328

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 149/319 (46%), Positives = 209/319 (65%), Gaps = 24/319 (7%)

Query: 45  DDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           DD  M   ++ W+ ++ +         +RF++FK N++FI+  N+  NR + +G+N+FAD
Sbjct: 29  DDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKFIESFNAGGNRKFWLGVNQFAD 88

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
           LTN+E+RA    T+++   +    KV++  RY   + D LP ++DWR KGAV P+KDQG 
Sbjct: 89  LTNDEFRA----TKTNKGFKPSPVKVSTGFRYENVSVDALPATIDWRTKGAVTPIKDQGQ 144

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
           C            EGI KI TG+LISLSEQELVDCD    + GC GGLMD AF+FII+NG
Sbjct: 145 C------------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 192

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+ +E  YPY  A+ KC     +A   ++ G+EDV   DE +L KAVA+QPVSVA++ G 
Sbjct: 193 GLTTESSYPYTAADGKCKSGSNSA--ATVKGFEDVPANDEAALMKAVANQPVSVAVDGGD 250

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y  GV TG CG+ LDHG+ A+GYG T +G  YWL++NSWG+ WGENGY+++++++
Sbjct: 251 MTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDI 310

Query: 340 LDTNTGKCGIAMEASYPVK 358
            D   G CG+AME SYP +
Sbjct: 311 SDKR-GMCGLAMEPSYPTE 328


>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 348

 Score =  291 bits (746), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 155/355 (43%), Positives = 217/355 (61%), Gaps = 18/355 (5%)

Query: 11  STLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
           ST++F+  I          +SY  +   S     +   +  ++ W+A+  +  +      
Sbjct: 3   STIIFILTI---------FLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKR 53

Query: 71  KRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRAMYLG-TRSDAKRRLMKSKV 128
            RF IFK NL F+   N  N+ TYKV +N+F+DLT+EE+RA + G    +A  R+     
Sbjct: 54  NRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSS 113

Query: 129 ASQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
                  + G+  +  ES+DWR++GAV PVK QG CG CWAFS VAAVEGI KI  GEL+
Sbjct: 114 GKNTVPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELV 173

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA-- 244
           SLSEQ+L+DCDR  N GC GG+M  AF++II+N G+ +E +YPY  ++  C  S   +  
Sbjct: 174 SLSEQQLLDCDRDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSS 233

Query: 245 -KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGV 303
            +  +I GYE V   +E +L +AV+ QPVSV IE  G AF+HY  GVF GECG+ L H V
Sbjct: 234 FRAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAV 293

Query: 304 VAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             VGYG +E G  YW+V+NSWG  WGENGY++++R+ +D   G CG+A+ A YP+
Sbjct: 294 TIVGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRD-VDAPQGMCGLAILAFYPL 347


>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
          Length = 367

 Score =  291 bits (744), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 160/366 (43%), Positives = 223/366 (60%), Gaps = 24/366 (6%)

Query: 5   SMFLAISTLVF----LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           +M  +IS L+F    LF   S S  D SI+ Y  + D +S+ R    ++ ++ +W+  H 
Sbjct: 2   AMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQD-DLTSTER----LIQLFNSWMLNHN 56

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K    +     RF+IFKDNL +IDE N  N +Y++GLN+FADL+N+E+   Y+G+  DA 
Sbjct: 57  KFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYRLGLNEFADLSNDEFNEKYVGSLIDA- 115

Query: 121 RRLMKSKVASQRYACKAGDE----LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
                     Q Y  +  +E    LPE+VDWR+KGAV PV+ QGSCGSCWAFS VA VEG
Sbjct: 116 -------TIEQSYDEEFINEDIVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEG 168

Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
           INKI TG+L+ LSEQELVDC+R+ + GC GG   YA +++ +NG +     YPY   +  
Sbjct: 169 INKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKNG-IHLRSKYPYKAKQGT 226

Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
           C   +    +V   G   V P +E +L  A+A QPVSV +E+ GR FQ Y+ G+F G CG
Sbjct: 227 CRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCG 286

Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           + +DH V AVGYG   G  Y L++NSWG+ WGE GY++++R     + G CG+   + YP
Sbjct: 287 TKVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKR-APGNSPGVCGLYKSSYYP 345

Query: 357 VKNSQN 362
           +KN  N
Sbjct: 346 IKNRDN 351


>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
           AltName: Allergen=Car p 1; Flags: Precursor
 gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
 gi|387885|gb|AAA72774.1| papain [synthetic construct]
 gi|225437|prf||1303270A papain
          Length = 345

 Score =  290 bits (743), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 152/359 (42%), Positives = 217/359 (60%), Gaps = 19/359 (5%)

Query: 5   SMFLAISTLVF----LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           +M  +IS L+F    LF     S  D SI+ Y  N D +S+ R    ++ ++++W+ KH 
Sbjct: 2   AMIPSISKLLFVAICLFVYMGLSFGDFSIVGYSQN-DLTSTER----LIQLFESWMLKHN 56

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K    +     RF+IFKDNL++IDE N  N +Y +GLN FAD++N+E++  Y G+ +   
Sbjct: 57  KIYKNIDEKIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAG-- 114

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
                ++++ +         +PE VDWR+KGAV PVK+QGSCGSCWAFS V  +EGI KI
Sbjct: 115 -NYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKI 173

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
            TG L   SEQEL+DCDR+ + GCNGG    A Q + Q  G+     YPY G +  C   
Sbjct: 174 RTGNLNEYSEQELLDCDRR-SYGCNGGYPWSALQLVAQY-GIHYRNTYPYEGVQRYCRSR 231

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
            +       DG   V P++E +L  ++A+QPVSV +EA G+ FQ Y  G+F G CG+ +D
Sbjct: 232 EKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVD 291

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           H V AVGYG     +Y L++NSWG+ WGENGY++++R   ++  G CG+   + YPVKN
Sbjct: 292 HAVAAVGYGP----NYILIKNSWGTGWGENGYIRIKRGTGNS-YGVCGLYTSSFYPVKN 345


>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
          Length = 329

 Score =  290 bits (743), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 151/314 (48%), Positives = 201/314 (64%), Gaps = 10/314 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++ + AK G++ NG     +R  +F  N++ I+E NS   TY +G+N+FADLT EE+   
Sbjct: 19  WEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHTYTLGVNQFADLTVEEFSKT 78

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           Y+G +  A++    + +    Y    G+ LP SVDW  +GAV PVK+QG CGSCW+FST 
Sbjct: 79  YMGFKKPAQKYGDAAYLGRHVYN---GEALPTSVDWSSQGAVTPVKNQGQCGSCWSFSTT 135

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
            ++EG N+I TG+L+SLSEQ+ VDC     N GCNGGLMD AF++   N  + +EQ YPY
Sbjct: 136 GSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKYAEANA-LCTEQSYPY 194

Query: 231 LGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
            G +  C  S  +  +   S+ GY+DVS   E  +  AVA QPVS+AIEA    FQ Y  
Sbjct: 195 KGTDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKSVFQLYSG 254

Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           GV TG CG++LDHGV+AVGYGT +G DYW V+NSWGS WG +GYV LQR      +G+CG
Sbjct: 255 GVLTGACGASLDHGVLAVGYGTLSGTDYWKVKNSWGSTWGMSGYVLLQRG--KGGSGECG 312

Query: 349 IAMEASYP-VKNSQ 361
           +  E SYP V  SQ
Sbjct: 313 LLSEPSYPQVTGSQ 326


>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
 gi|255636729|gb|ACU18700.1| unknown [Glycine max]
          Length = 341

 Score =  290 bits (742), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 144/310 (46%), Positives = 205/310 (66%), Gaps = 8/310 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++GK        EKRFQ+FK+N++FI+  N+  ++ + + +N+FADL +EE++A
Sbjct: 35  HEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEFKA 94

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFS 169
           +    +  A R +  +   S RY  +   ++P ++DWR++GAV P+KDQG +CGSCWAF+
Sbjct: 95  LLNNVQKKASR-VETATETSFRY--ENVTKIPSTMDWRKRGAVTPIKDQGYTCGSCWAFA 151

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           TVA VE +++I TGEL+SLSEQELVDC R  + GC GG ++ AF+FI   GG+ SE  YP
Sbjct: 152 TVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEFIANKGGITSEAYYP 211

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C   +    V  I GYE V    E +L KAVA+QPVSV I+AG  AF+ Y SG
Sbjct: 212 YKGKDRSCKVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSVYIDAGAIAFKFYSSG 271

Query: 290 VFTGE-CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           +F    CG+ LDH V  VGYG   +G  YWLV+NSW + WGE GY++++R+ +    G C
Sbjct: 272 IFEARNCGTHLDHAVAVVGYGKLRDGTKYWLVKNSWSTAWGEKGYMRIKRD-IRAKKGLC 330

Query: 348 GIAMEASYPV 357
           GIA  ASYP+
Sbjct: 331 GIASNASYPI 340


>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
          Length = 352

 Score =  290 bits (742), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 148/312 (47%), Positives = 203/312 (65%), Gaps = 16/312 (5%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYL 113
           W+A+HG+T        +RF++FK N+  ID  N+  N+ Y++  N+F DLT+ E+ AMY 
Sbjct: 45  WMAEHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYT 104

Query: 114 GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
           G   +    +  +  A+ R + +  D+ P  VDWR++GAV  VK+Q SCG CWAFSTVAA
Sbjct: 105 GY--NPANTMYAAANATTRLSSE-DDQQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAA 161

Query: 174 VEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
           VEGI++I TGEL+SLSEQ+L+DC    N GC GG +D AFQ++  +GG+ +E  Y Y GA
Sbjct: 162 VEGIHQITTGELVSLSEQQLLDC--ADNGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGA 219

Query: 234 ENKCD---PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
           +  C     S  +    +I GY+ V+P DE SL  AVA QPVSVAIE  G  F+HY SGV
Sbjct: 220 QGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGV 279

Query: 291 FTGE-CGSALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           FT + CG+ LDH V  VGYG E     G  YW+++NSWG+ WG+ GY+KL++++   + G
Sbjct: 280 FTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDV--GSQG 337

Query: 346 KCGIAMEASYPV 357
            CG+AM  SYPV
Sbjct: 338 ACGVAMAPSYPV 349


>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
          Length = 342

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 148/312 (47%), Positives = 203/312 (65%), Gaps = 16/312 (5%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYL 113
           W+A+HG+T        +RF++FK N+  ID  N+  N+ Y++  N+F DLT+ E+ AMY 
Sbjct: 35  WMAEHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYT 94

Query: 114 GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
           G   +    +  +  A+ R + +  D+ P  VDWR++GAV  VK+Q SCG CWAFSTVAA
Sbjct: 95  GY--NPANTMYAAANATTRLSSE-DDQQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAA 151

Query: 174 VEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
           VEGI++I TGEL+SLSEQ+L+DC    N GC GG +D AFQ++  +GG+ +E  Y Y GA
Sbjct: 152 VEGIHQITTGELVSLSEQQLLDC--ADNGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGA 209

Query: 234 ENKCD---PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
           +  C     S  +    +I GY+ V+P DE SL  AVA QPVSVAIE  G  F+HY SGV
Sbjct: 210 QGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGV 269

Query: 291 FTGE-CGSALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           FT + CG+ LDH V  VGYG E     G  YW+++NSWG+ WG+ GY+KL++++   + G
Sbjct: 270 FTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDV--GSQG 327

Query: 346 KCGIAMEASYPV 357
            CG+AM  SYPV
Sbjct: 328 ACGVAMAPSYPV 339


>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 144/309 (46%), Positives = 203/309 (65%), Gaps = 31/309 (10%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++G+         KR++IFKDN+  I+  N +++++YK+ +N+FADLTNEE+RA
Sbjct: 39  HEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA 98

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
               +R+  K  +  ++  S +Y  +    +P +VDWR+KGAV P+KDQG CGSCWAFS 
Sbjct: 99  ----SRNRFKAHICSTEATSFKY--ENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSA 152

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA+EGI ++ TG+LISLSEQELVDCD    + GC                      +YP
Sbjct: 153 VAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCT---------------------NYP 191

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C+  +       I+GYEDV   +E +L+KAVA QP++VAI+A G  FQ Y SG
Sbjct: 192 YAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSSG 251

Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           VFTG+CG+ LDHGV AVGYGT ++G+ YWLV+NSW + WGE GY+++QR++     G CG
Sbjct: 252 VFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKNSWSTGWGEEGYIRMQRDVT-AKEGLCG 310

Query: 349 IAMEASYPV 357
           IAM+ASYP 
Sbjct: 311 IAMQASYPT 319


>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 203/308 (65%), Gaps = 7/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++G+        EKRFQ+FK+N+ FI+  N+  ++ + + +N+FADL +EE++A
Sbjct: 37  HEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKA 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
           + +  +  A   +  S   S RY  ++  ++P ++DWR++GAV P+KDQG CGSCWAFS 
Sbjct: 97  LLINVQKKASW-VETSTETSFRY--ESVTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSA 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           VAA EGI++I TG+L+ LSEQELVDC +  + GC GG +D AF+FI + GG+ SE  YPY
Sbjct: 154 VAATEGIHQITTGKLVPLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYPY 213

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
            G    C   +    V  I GYE V   +E +L KAVA+QPVSV I+AG  AF++Y SG+
Sbjct: 214 KGVNKTCKVKKETHGVAEIKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGI 273

Query: 291 FTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           F    CG+  +H V  VGYG   +G  YWLV+NSWG++WGE GY++++R+ +    G CG
Sbjct: 274 FNARNCGTDPNHAVAVVGYGKALDGSKYWLVKNSWGTEWGERGYIRIKRD-IRAKEGLCG 332

Query: 349 IAMEASYP 356
           IA    YP
Sbjct: 333 IAKYPYYP 340


>gi|310942960|pdb|3P5W|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
          Length = 220

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 137/219 (62%), Positives = 167/219 (76%), Gaps = 3/219 (1%)

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           LP+ VDWR  GAV  +KDQG CGSCWAFST+AAVEGINKI TG+LISLSEQELVDC R  
Sbjct: 1   LPDYVDWRSSGAVVDIKDQGQCGSCWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60

Query: 201 NA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
           N  GC+GG M   FQFII NGG+++E +YPY   E +C+   +  K VSID YE+V   +
Sbjct: 61  NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120

Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
           E +L+ AVA QPVSVA+EA G  FQHY SG+FTG CG+A+DH V  VGYGTE G+DYW+V
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIV 180

Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +NSWG+ WGE GY+++QRN+     G+CGIA +ASYPVK
Sbjct: 181 KNSWGTTWGEEGYMRIQRNV--GGVGQCGIAKKASYPVK 217


>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 203/308 (65%), Gaps = 7/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++G+        EKRFQ+FK+N+ FI+  N+  ++ + + +N+FADL +EE++A
Sbjct: 37  HEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKA 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
           + +  +  A   +  S   S RY  ++  ++P ++DWR++GAV P+KDQG CGSCWAFS 
Sbjct: 97  LLINVQKKASW-VETSTQTSFRY--ESVTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSA 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           VAA EGI++I TG+L+ LSEQELVDC +  + GC GG +D AF+FI + GG+ SE  YPY
Sbjct: 154 VAATEGIHQITTGKLVPLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYPY 213

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
            G    C   +    V  I GYE V   +E +L KAVA+QPVSV I+AG  AF++Y SG+
Sbjct: 214 KGVNKTCKVKKETHGVAEIKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGI 273

Query: 291 F-TGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           F    CG+  +H V  VGYG   +G  YWLV+NSWG++WGE GY++++R+ +    G CG
Sbjct: 274 FNVRNCGTDPNHAVAVVGYGKALDGSKYWLVKNSWGTEWGERGYIRIKRD-IRAKEGLCG 332

Query: 349 IAMEASYP 356
           IA    YP
Sbjct: 333 IAKYPYYP 340


>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
 gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
          Length = 296

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 146/311 (46%), Positives = 204/311 (65%), Gaps = 23/311 (7%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+ ++ +         +RF++FK N++FI+  N+  NR + +G+N+FADLTN+E+RA
Sbjct: 5   HEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTNDEFRA 64

Query: 111 MYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
               T+++   +    KV +  RY   + D LP ++DWR KGAV P+KDQG C       
Sbjct: 65  ----TKTNKGFKPSPVKVPTGFRYENISVDALPATIDWRTKGAVTPIKDQGQC------- 113

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
                EGI KI TG+LISLSEQELVDCD    + GC GGLMD AF+FII+ GG+ +E  Y
Sbjct: 114 -----EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLTTESSY 168

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY  A+ KC     +  V ++ G+EDV   DE SL KAVA+QPVSVA++ G   FQ Y  
Sbjct: 169 PYTAADGKCKSGSNS--VATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTFQFYSG 226

Query: 289 GVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           GV TG CG+ LDHG+ A+GYG T +G  YWL++NSWG+ WGENGY+++++++ D   G C
Sbjct: 227 GVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKR-GMC 285

Query: 348 GIAMEASYPVK 358
           G+AME SYP +
Sbjct: 286 GLAMEPSYPTE 296


>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
 gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
          Length = 299

 Score =  287 bits (735), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 137/312 (43%), Positives = 204/312 (65%), Gaps = 19/312 (6%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
           +++ W AKHGK+ +      +R  IF D L +I++HN+  N T+ +GLNKF+DLTN E+R
Sbjct: 1   MFEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGD----ELPESVDWREKGAVNPVKDQGSCGSC 165
           A Y+G          KS     R   K  D     LP S+DWR++GAV P+KDQG CGSC
Sbjct: 61  ANYVGK--------FKSPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSC 112

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFS +A++E  + + T EL+SLSEQ+L+DCD  ++ GC GG  + AF+F+++NGG+ +E
Sbjct: 113 WAFSAIASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPEDAFKFVVENGGVTTE 171

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
           + YPY G    C+ ++   KVV I GY+DV+     +L KAV+  PV+V I    + FQ+
Sbjct: 172 EAYPYTGFAGSCNANKN--KVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQN 229

Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           Y SG+ +G+C ++ DH V+ +GYGTE G+ YW+++NSWG+ WGENG++K+++       G
Sbjct: 230 YRSGILSGQCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGENGFMKIKKK---DGEG 286

Query: 346 KCGIAMEASYPV 357
            CG+  ++SYP 
Sbjct: 287 MCGMNGQSSYPT 298


>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
          Length = 502

 Score =  287 bits (734), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 153/326 (46%), Positives = 203/326 (62%), Gaps = 27/326 (8%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYK-----VGLNKFADLTN 105
           +++ W+ KH K     G   +R+  F  NL F+ + N+  R        VG+N FADL+N
Sbjct: 50  LFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSSGQGVGMNVFADLSN 109

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACK--------AGDELPESVDWREKGAVNPVK 157
           EE+R +Y         R+++ K A  R A +        AG + P S+DWR++GAV  VK
Sbjct: 110 EEFREVY-------SSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASLDWRKRGAVTAVK 162

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           +QG CGSCWAFS+  A+EGIN I TGELISLSEQELVDCD   N GC+GG MDYAF+++I
Sbjct: 163 NQGDCGSCWAFSSTGAMEGINAITTGELISLSEQELVDCD-TTNEGCDGGYMDYAFEWVI 221

Query: 218 QNGGMDSEQDYPYLG-AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
            NGG+DSE +YPY G A++ C+ ++   KVVSIDGYEDV+   E +L  A   QPVSV I
Sbjct: 222 NNGGIDSEANYPYTGQADSVCNTTKEEIKVVSIDGYEDVAT-SESALLCAAVQQPVSVGI 280

Query: 277 EAGGRAFQHYESGVFTGECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYV 333
           +     FQ Y  G++ G+C      +DH V+ VGYG + G DYW+V+NSWG+DWG  GY+
Sbjct: 281 DGSSLDFQLYAGGIYDGDCSGNPDDIDHAVLVVGYGQQGGTDYWIVKNSWGTDWGMQGYI 340

Query: 334 KLQRNLLDTNTGKCGIAMEASYPVKN 359
            ++RN      G C I   ASYP K 
Sbjct: 341 YIRRN-TGLPYGVCAIDAMASYPTKQ 365


>gi|357437721|ref|XP_003589136.1| Cysteine proteinase [Medicago truncatula]
 gi|355478184|gb|AES59387.1| Cysteine proteinase [Medicago truncatula]
          Length = 295

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 129/183 (70%), Positives = 153/183 (83%)

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           IVTG+LISLSEQELVDCD   N GCNGGLMDYAF+FII NGG+DSE DYPY   + +CD 
Sbjct: 5   IVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQ 64

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
           +R+NAKVV+ID YEDV  +DE++L+KAVA+QP++VA+E GGR FQ YE GVFTG CG+AL
Sbjct: 65  NRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTAL 124

Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           DHGV AVGYGTENG DYW+VRNSWG  WGE GY++L+RNL  +  GKCGIA+E SYP+KN
Sbjct: 125 DHGVAAVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIKN 184

Query: 360 SQN 362
            QN
Sbjct: 185 GQN 187


>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase GP-II
 gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
 gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
 gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
 gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
          Length = 221

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 3/224 (1%)

Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
           D+LP+S+DWRE GAV PVK+QG CGSCWAFSTVAAVEGIN+IVTG+LISLSEQ+LVDC  
Sbjct: 1   DDLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC-T 59

Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
             N GC GG M+ AFQFI+ NGG++SE+ YPY G +  C+ S  NA VVSID YE+V   
Sbjct: 60  TANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICN-STVNAPVVSIDSYENVPSH 118

Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
           +E SL+KAVA+QPVSV ++A GR FQ Y SG+FTG C  + +H +  VGYGTEN  D+W+
Sbjct: 119 NEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWI 178

Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           V+NSWG +WGE+GY++ +RN+ + + GKCGI   ASYPVK   N
Sbjct: 179 VKNSWGKNWGESGYIRAERNIENPD-GKCGITRFASYPVKKGTN 221


>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 341

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 145/312 (46%), Positives = 203/312 (65%), Gaps = 12/312 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+++  +  +       RF+IFK NL+F++  N + N+TY + +N+F+DLT+EE++A
Sbjct: 35  HEQWMSRFHRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNKTYTLDVNEFSDLTDEEFKA 94

Query: 111 MYLGTRSDAKRRLMKS----KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
            Y G         M +    +  S RY  +   E  ES+DWRE+GAV  VK Q  CG CW
Sbjct: 95  RYTGLVVPEGMTRMSTTDSHETVSFRY--ENVGETGESMDWREEGAVTSVKHQQQCGCCW 152

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           AFS VAAVEG+ KI  GEL+SLSEQ+L+DC  + N GC+GG+M  AF +I++N G+ +E 
Sbjct: 153 AFSAVAAVEGMTKIAKGELVSLSEQQLLDCSTE-NDGCDGGIMWKAFDYIVENQGITAED 211

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
           +YPY GA+  C+ +   A  +S  GYE V   DE +L KAV+ QPVSVAIE  G  F HY
Sbjct: 212 NYPYQGAQQTCESNHVAAATIS--GYETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHY 269

Query: 287 ESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
             G+F GECG+ L+H V  VGYG +E G+ YWL++NSWG  WGE+GY+++ R+ +D   G
Sbjct: 270 SGGIFNGECGTHLNHAVTIVGYGVSEEGIKYWLLKNSWGESWGEDGYMRIMRD-VDAPQG 328

Query: 346 KCGIAMEASYPV 357
            CG+A  A YPV
Sbjct: 329 MCGLASLAYYPV 340


>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
          Length = 350

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 147/327 (44%), Positives = 204/327 (62%), Gaps = 17/327 (5%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
           D ++  ++ W+ +HG+     G  ++RF++++ N+  ++  NS++  YK+  NKFADLTN
Sbjct: 25  DLMLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTN 84

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACK---AGDELPESVDWREKGAV-NPVKDQGS 161
           EE+RA  LG R       + S   S   A     + D LP+SVDWR KGAV N  K    
Sbjct: 85  EEFRAKMLGFRPHVTIPQI-SNTCSADIAMPGESSDDILPKSVDWRNKGAVINRWKICVD 143

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
            GSCWAFS VAA+EGIN+I  GEL+SLSEQELVDCD +   GC GG M +AF+F++ N G
Sbjct: 144 AGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGNHG 202

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E  YPY  A   C  ++ N   V+I GY +V+P  E  L +A A QPVSVA++ G  
Sbjct: 203 LTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSF 262

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYG-TENGVD----------YWLVRNSWGSDWGEN 330
            FQ Y SGV+TG C + ++HGV  VGYG +E   D          YW+V+NSWG++WG+ 
Sbjct: 263 MFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDA 322

Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GY+ +QR++    +G CGIA+  SYPV
Sbjct: 323 GYILMQRDVAGLASGLCGIALLPSYPV 349


>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
 gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
          Length = 336

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 145/318 (45%), Positives = 210/318 (66%), Gaps = 14/318 (4%)

Query: 44  TDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
           +DD  M   ++ W+A++G+         +RF++FK N+ FI+  N+ N  + +G+N+FAD
Sbjct: 28  SDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGVNQFAD 87

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
           LTN+E+R+    T+++       ++V +  R      D LP ++DWR KG V P+KDQG 
Sbjct: 88  LTNDEFRS----TKTNKGFIPSTTRVPTGFRNENVNIDALPATMDWRTKGVVTPIKDQGQ 143

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLS-EQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
           CG CWAFS VAA+EGI K+ TG+LIS S  + L+     ++ GC GGLMD AF+FII+NG
Sbjct: 144 CGCCWAFSAVAAMEGIVKLSTGKLISHSLNKSLLTV---MSMGCEGGLMDDAFKFIIKNG 200

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+ +E +YPY   ++K      +  V SI GYEDV   +E +L KAVA+QPVSVA++ G 
Sbjct: 201 GLTTESNYPYAAVDDKFKSVSNS--VASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD 258

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y+ GV TG CG+ LDHG+VA+GYG   +G  YWL++NSWG  WGENG++++++++
Sbjct: 259 MTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKDI 318

Query: 340 LDTNTGKCGIAMEASYPV 357
            D   G CG+AME SYP 
Sbjct: 319 SDKR-GMCGLAMEPSYPT 335


>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 324

 Score =  286 bits (731), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 151/310 (48%), Positives = 197/310 (63%), Gaps = 12/310 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           + +W A HG +   +G    R  I++ NL FI++HNS   +YK+ +NKFADLT  E+ A 
Sbjct: 22  FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHSYKLAVNKFADLTYPEFAAK 81

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           YLG R DA     KS  AS          LP+SVDWR  G V P+KDQG CGSCW+FST 
Sbjct: 82  YLGLRFDATNA-TKSFAASTYLPRMV--SLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTT 138

Query: 172 AAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
            +VEG +   TG+L+SLSEQ LVDC   + NAGCNGGLMD AFQ+II N G+D+E  YPY
Sbjct: 139 GSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPY 198

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESG 289
              +  C  +  N    ++  Y+D++   E  L+ AVA   P+SVAI+A   +FQ Y SG
Sbjct: 199 TAQDGTCQFNSANVG-ATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSG 257

Query: 290 VFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           V+    C S+ LDHGV+AVGYGT    DYWLV+NSWG+ WG++GY+ + RN    +  +C
Sbjct: 258 VYNEPACSSSQLDHGVLAVGYGTSGSSDYWLVKNSWGTSWGQSGYIWMTRN----SNNQC 313

Query: 348 GIAMEASYPV 357
           GIA  ASYP+
Sbjct: 314 GIATAASYPL 323


>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
           Full=Papaya proteinase III; Short=PPIII; AltName:
           Full=Papaya proteinase omega; Flags: Precursor
 gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
          Length = 348

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 156/359 (43%), Positives = 219/359 (61%), Gaps = 16/359 (4%)

Query: 5   SMFLAISTLVF----LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           +M  +IS L+F    LF   S S  D SI+ Y  + D +S+ R    ++ ++ +W+  H 
Sbjct: 2   AMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQD-DLTSTER----LIQLFNSWMLNHN 56

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K    +     RF+IFKDNL +IDE N  N +Y +GLN+FADL+N+E+   Y+G+  DA 
Sbjct: 57  KFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFNEKYVGSLIDA- 115

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
                 +   + +  +    LPE+VDWR+KGAV PV+ QGSCGSCWAFS VA VEGINKI
Sbjct: 116 ---TIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKI 172

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
            TG+L+ LSEQELVDC+R+ + GC GG   YA +++ +NG +     YPY   +  C   
Sbjct: 173 RTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKNG-IHLRSKYPYKAKQGTCRAK 230

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           +    +V   G   V P +E +L  A+A QPVSV +E+ GR FQ Y+ G+F G CG+ +D
Sbjct: 231 QVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVD 290

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           H V AVGYG   G  Y L++NSWG+ WGE GY++++R     + G CG+   + YP KN
Sbjct: 291 HAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKR-APGNSPGVCGLYKSSYYPTKN 348


>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
          Length = 367

 Score =  285 bits (729), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 149/336 (44%), Positives = 214/336 (63%), Gaps = 28/336 (8%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           +D+ +  +Y+ W + +  ++   G  + RF +FK+N+++I+E N +++ YK+ LN+F DL
Sbjct: 36  SDETLWDLYERWRSVY-TSARSFGEKQNRFHVFKENVKYINEVNKMDKPYKLRLNQFGDL 94

Query: 104 TNEEYRAMYL------GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           T  E+   Y       GTR+++   + ++             E+P S+DWR KGAV PVK
Sbjct: 95  TPSEFARTYANSKIIEGTRNESGGFMYENV------------EVPRSIDWRVKGAVTPVK 142

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           +QG CG CWAFS  AAVEGIN+I TG+LISLSEQ+L+DCD + N+GC GG M  AF++I 
Sbjct: 143 NQGRCGGCWAFSAAAAVEGINQITTGQLISLSEQQLIDCDTQ-NSGCRGGTMGRAFEYIK 201

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           Q GG+ SE +YPY      C  +      VSIDGY ++   ++  L K +A QPVSVA++
Sbjct: 202 QRGGITSEANYPYKAQAGMCKNNLIQRPTVSIDGYYNIRRSEDAVL-KILAHQPVSVAVD 260

Query: 278 AGGRA---FQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYV 333
           A   +   +  Y  GVFTG CG+ L+HGV AVGYGT N G DYW+++NSWG  WGE GY+
Sbjct: 261 ATTWSSLDWMFYFQGVFTGPCGTKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYM 320

Query: 334 KLQRNLLDTNTGKCGIAMEASYPVKN-SQNSAKPKP 368
           ++ R +  +  G CGIAM+AS+P+K  S   AK +P
Sbjct: 321 RMLRGV--SPYGLCGIAMQASFPIKRVSAGKAKFEP 354


>gi|310942958|pdb|3P5U|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
 gi|310942959|pdb|3P5V|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
 gi|310942961|pdb|3P5X|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
          Length = 220

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 136/219 (62%), Positives = 166/219 (75%), Gaps = 3/219 (1%)

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           LP+ VDWR  GAV  +KDQG CGS WAFST+AAVEGINKI TG+LISLSEQELVDC R  
Sbjct: 1   LPDYVDWRSSGAVVDIKDQGQCGSXWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60

Query: 201 NA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
           N  GC+GG M   FQFII NGG+++E +YPY   E +C+   +  K VSID YE+V   +
Sbjct: 61  NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120

Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
           E +L+ AVA QPVSVA+EA G  FQHY SG+FTG CG+A+DH V  VGYGTE G+DYW+V
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIV 180

Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +NSWG+ WGE GY+++QRN+     G+CGIA +ASYPVK
Sbjct: 181 KNSWGTTWGEEGYMRIQRNV--GGVGQCGIAKKASYPVK 217


>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 306

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 151/311 (48%), Positives = 201/311 (64%), Gaps = 12/311 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++ WL ++ +        E RF I++ NL +I+  NS   +Y +  NKFADLTNEE+ + 
Sbjct: 5   FERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQEXSYNLTDNKFADLTNEEFVSP 64

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           YLG      R L  +      +     ++LPES DWR++GAV+ +KDQG+CGSCWAFS V
Sbjct: 65  YLGF---GTRFLPHTGFMYHEH-----EDLPESKDWRKEGAVSDIKDQGNCGSCWAFSAV 116

Query: 172 AAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           AAVEGINKI +G+L+SLSEQE  DCD    N GC GGLMD AF FI +NGG+ + +DYPY
Sbjct: 117 AAVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKDYPY 176

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA--DQPVSVAIEAGGRAFQHYES 288
            G +  C+  +      +I G+  V   DE  LK   A  +Q  SVAI+AGG AFQ Y  
Sbjct: 177 EGVDGTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAFQLYLK 236

Query: 289 GVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           GVF+G CG  L+HGV  VGYG      YW+V+NSWG+DWGE+GY++++R+  D   G CG
Sbjct: 237 GVFSGICGKQLNHGVTIVGYGKGTSDKYWIVKNSWGADWGESGYIRMKRDAFD-KAGTCG 295

Query: 349 IAMEASYPVKN 359
           IAM+ASYP+K+
Sbjct: 296 IAMQASYPLKD 306


>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
          Length = 307

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 136/310 (43%), Positives = 203/310 (65%), Gaps = 10/310 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFADLTNEEYRA 110
           ++ W+A++ +         +RF++FKDN  F++  N+  +  + +G+N+FADLT EE++A
Sbjct: 5   HERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTEEFKA 64

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
              G +  +   +  +    +  +  A   LP +VDWR KGAV P+K+QG CG CWAFS 
Sbjct: 65  NK-GFKPISAEEVPTTGFKYENLSVSA---LPTAVDWRTKGAVTPIKNQGQCGCCWAFSA 120

Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           +AA+EGI K+ TG L+SLSEQE VDCD   ++ GC GG MD AF+F+I+NGG+ +E  YP
Sbjct: 121 IAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLATESSYP 180

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   + KC    ++A   +I G+EDV P +E +L K VA QPVSVA++A  R F  Y  G
Sbjct: 181 YKVVDGKCKGGSKSA--ATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTFMLYSGG 238

Query: 290 VFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           V TG CG+ LDHG+ A+GYG E +   YW+++NSWG+ WGE G++++++++ D   G C 
Sbjct: 239 VMTGSCGTQLDHGIAAIGYGVESDDTKYWILKNSWGTTWGEKGFLRMEKDISDKR-GMCD 297

Query: 349 IAMEASYPVK 358
           +AM+ SYP +
Sbjct: 298 LAMKPSYPTE 307


>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 368

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 144/334 (43%), Positives = 206/334 (61%), Gaps = 24/334 (7%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS---LNRTYKVGLNKFAD 102
           D +   ++ W A+H +T         R +++  N+R+I+  N       TY++G   + D
Sbjct: 36  DPMAQRFRRWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGAGLTYELGETAYTD 95

Query: 103 LTNEEYRAMYLG---TRSDAKRRLMKSKVAS--------------QRYACKAGDELPESV 145
           LT++E+ AMY       SD    L  + + +              Q Y  ++    P SV
Sbjct: 96  LTSDEFTAMYTSRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQVYVNESAGA-PASV 154

Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
           DWRE+GAV  VK+QG CGSCWAFSTVA +EGI++I TG+L SLSEQELVDCD K++ GCN
Sbjct: 155 DWRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQELVDCD-KLDHGCN 213

Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
           GG+   A Q+I  NGG+ S+ DYPY   ++ CD  + +    SI G++ V+   E+SL  
Sbjct: 214 GGVSYRALQWITSNGGITSQDDYPYTAKDDTCDTKKLSHHAASISGFQRVATRSELSLTN 273

Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSW 323
           AVA QPV+V+IEAGG  FQHY +GV+ G CG+ L+HGV  VGYG +   G  YW+V+NSW
Sbjct: 274 AVAMQPVAVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYGEDEVTGESYWIVKNSW 333

Query: 324 GSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           G  WG+NGY+++++ ++D   G CGIA+  S+P+
Sbjct: 334 GEKWGDNGYLRMKKGIIDKPEGICGIAIRPSFPL 367


>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
 gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
          Length = 300

 Score =  284 bits (726), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 135/312 (43%), Positives = 204/312 (65%), Gaps = 19/312 (6%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
           +++ W AKHGK+ +      +R  IF D L +I++HN+L N T+ +GLNKF+DLTN E+R
Sbjct: 1   MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGD----ELPESVDWREKGAVNPVKDQGSCGSC 165
           A Y+G          K      R   K  D     LP S+DWR++GAV P+KDQG CGSC
Sbjct: 61  ANYVGK--------FKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSC 112

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFS +A++E  + + T EL+SLSEQ+L+DCD  ++ GC GG  + AF+F+++NGG+ +E
Sbjct: 113 WAFSAIASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPEDAFKFVVENGGVTTE 171

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
           + YPY G    C+ ++   KVV I GY+DV+     +L KAV+  PV+V I    + FQ+
Sbjct: 172 EAYPYTGFAGSCNANKN--KVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQN 229

Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           Y SG+ +G C ++ DH V+ +GYGTE G+ YW+++NSWG+ WGE+G++++++   +   G
Sbjct: 230 YRSGILSGHCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKK---EDGEG 286

Query: 346 KCGIAMEASYPV 357
            CG+  ++SYP 
Sbjct: 287 MCGMNGQSSYPT 298


>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
 gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
          Length = 384

 Score =  283 bits (725), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 151/360 (41%), Positives = 203/360 (56%), Gaps = 50/360 (13%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLT 104
           D ++  ++ W+ +HG+     G  ++R ++++ N+  ++  NS+ N  Y++  NKFADLT
Sbjct: 26  DPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFADLT 85

Query: 105 NEEYRAMYLG-TRSDAKRRLMKSKVASQRYAC-------KAGDELPESVDWREKGAVNPV 156
           NEE+RA  LG  R     R           AC       +  DELP+SVDWREKGAV PV
Sbjct: 86  NEEFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSDELPKSVDWREKGAVAPV 145

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFI 216
           K+QG CGSCWAFS VAA+EGIN+I  G+L+SLSEQELVDCD K   GC GG M +AF+F+
Sbjct: 146 KNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IGCAGGYMSWAFEFV 204

Query: 217 IQNGGMDSEQDYPYLGA----------------------------ENKCDPSRRNAKVVS 248
           + N G+ +E++YPY G                                C   +     VS
Sbjct: 205 MNNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESAVS 264

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
           I GY +V+   E  L +A A QPVSVA++AG   +Q Y  GVFTG C + L+HGV  VGY
Sbjct: 265 ISGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVVGY 324

Query: 309 GTEN-----------GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           G              G  YW+V+NSWG +WG+ GY+ +QR      +G CGIA+  SYPV
Sbjct: 325 GETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQRE-ASVASGLCGIALLPSYPV 383


>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 341

 Score =  283 bits (725), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 149/358 (41%), Positives = 220/358 (61%), Gaps = 25/358 (6%)

Query: 5   SMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSN 64
           S+ + ++    LF   S S A    +++   H+ SS        +  ++ W+A+  +   
Sbjct: 3   SIMVLVTIFTILFTTFSISQATSRTVTF---HEPSS--------LEKHEQWMARFSRVYR 51

Query: 65  GMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRL 123
                + R  +FK NL+FI+  N   N++YK+G+N+FAD TNEE+ A++ G +       
Sbjct: 52  DELEKQMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKG------ 105

Query: 124 MKSKVASQRYACKA---GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
           + SKV  +  + ++    D +  S DWR +GAV PVK QG CG CWAFS VAAVEG+ KI
Sbjct: 106 LSSKVVDETISSRSWNISDMVGVSKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKI 165

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
             G L+SLSEQ+L+DCDR+ + GC+GG+M  AF +IIQN G+ SE DY Y G++ +C  S
Sbjct: 166 AGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYIIQNRGIASENDYSYQGSDGRCRSS 225

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
            R A  +S  G++ V   +E +L +AV+ QPVSV+++A G  F HY  GV+ G CG++ +
Sbjct: 226 ARPAARIS--GFQTVPSNNEQALLEAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSN 283

Query: 301 HGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           H V  VGYGT ++G  YWL +NSWG  WGE GY++++R++     G CG+A  A YPV
Sbjct: 284 HAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQ-GMCGVAQYAFYPV 340


>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
 gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
          Length = 300

 Score =  283 bits (724), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 135/312 (43%), Positives = 203/312 (65%), Gaps = 19/312 (6%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
           +++ W AKHGK+ +      +R  IF D L +I++HN+L N T+ +GLNKF+DLTN E+R
Sbjct: 1   MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGD----ELPESVDWREKGAVNPVKDQGSCGSC 165
           A Y+G          K      R   K  D     LP S+DWR++GAV P+KDQG CGSC
Sbjct: 61  ANYVGK--------FKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSC 112

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFS +A++E  + + T EL+SLSEQ+L+DCD  ++ GC GG  + AF+F+++NGG+ +E
Sbjct: 113 WAFSAIASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPEDAFKFVVENGGVTTE 171

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
           + YPY G    C+ ++   KVV I GY+DV+     +L KAV+  PV+V I    + FQ+
Sbjct: 172 EAYPYTGFAGSCNANKN--KVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQN 229

Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           Y SG+ +G C ++ DH V+ +GYGTE G+ YW+++NSWG+ WGE+G++++++       G
Sbjct: 230 YRSGILSGHCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKKK---DGEG 286

Query: 346 KCGIAMEASYPV 357
            CG+  ++SYP 
Sbjct: 287 MCGMNGQSSYPT 298


>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 347

 Score =  283 bits (724), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 148/354 (41%), Positives = 217/354 (61%), Gaps = 17/354 (4%)

Query: 11  STLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
           ST++F+  I          +SY  +   S     +   +  ++ W+A+  +  +      
Sbjct: 3   STIIFILTI---------FLSYRTSLATSRGGLFEASPIEKHEQWMARFNRVYSDESEKR 53

Query: 71  KRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
            RF IFK NL F+   N + N TYK+ +N+F+DLT+EE+RA + G     +   + +  +
Sbjct: 54  NRFNIFKKNLEFVQSFNMNKNITYKLDVNEFSDLTDEEFRATHTGLVVPEEITGISTLSS 113

Query: 130 SQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
            +    + G+  +  ES+DWR++GAV PVK QG CG CWAFS VAAVEGI KI  GEL+S
Sbjct: 114 DKTVPFRYGNVSDTGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVS 173

Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA--- 244
           LSEQ+L+DCD   N GC+GG+M  AF++II+N G+ +E +YPY  ++  C  S   +   
Sbjct: 174 LSEQQLLDCDTDYNQGCHGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSF 233

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
           +  +I GYE V   +E +L +AV+ QPVSV IE  G  F+HY  G+F GECG+ L H V 
Sbjct: 234 RAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAGFRHYSGGIFNGECGTDLHHAVT 293

Query: 305 AVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            VGYG +E G  YW+V+NSWG  WGE+G+++++R+ +D   G CG+AM A YP+
Sbjct: 294 IVGYGMSEEGTKYWVVKNSWGETWGEDGFMRIKRD-VDAPQGMCGLAMLAFYPL 346


>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
 gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
          Length = 414

 Score =  282 bits (722), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 145/317 (45%), Positives = 198/317 (62%), Gaps = 14/317 (4%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNE 106
           ++  W  KHGKT +     E R +IF DN  F+ +HN+       T+ VGLN  ADLT +
Sbjct: 67  LFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLADLTKD 126

Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           E++ M LG  +  +        ++  YA       PE +DW   GAV PVK+Q  CGSCW
Sbjct: 127 EFKKM-LGYNAALRASRAPVDASTWEYADVT---PPEEIDWVASGAVTPVKNQKQCGSCW 182

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           AFST  AVEG+N I TG+LISLSE+EL+ C    N GCNGGLMD  F++I+ N G+D+E 
Sbjct: 183 AFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEWIVNNRGIDTED 242

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
            + Y+  E KC   RR+ + V+IDG++DV   DE SL KAV+ QPVSVAIEA  ++FQ Y
Sbjct: 243 GWEYVAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQSFQLY 302

Query: 287 ESGVFTG-ECGSALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
             GV++  +CG+ LDHGV+ VGYG +        +W ++NSWG  WGE+GY+++ +    
Sbjct: 303 AGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYIRIAKGGSG 362

Query: 342 TNTGKCGIAMEASYPVK 358
              G+CG+AM+ SYP K
Sbjct: 363 VE-GQCGVAMQPSYPTK 378


>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
          Length = 342

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 138/309 (44%), Positives = 202/309 (65%), Gaps = 7/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++G+        EKRFQ+FK+N+ FI+  N+  ++ + + +N+FADL +EE++A
Sbjct: 37  HEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKA 96

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
           + +  +  A   +  S   S RY  ++  ++P ++D R++GAV P+KDQG CGSCWAFS 
Sbjct: 97  LLINVQKKASW-VETSTETSFRY--ESVTKIPATIDRRKRGAVTPIKDQGRCGSCWAFSA 153

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           VAA EGI++I TG+L+ LSEQELVDC +  + GC GG +D AF+FI + GG+ SE  YPY
Sbjct: 154 VAATEGIHQITTGKLVPLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYPY 213

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
            G    C   +    V  I GYE V   +E +L KAVA+QPVSV I+AG  AF++Y SG+
Sbjct: 214 KGVNKTCKVKKETHGVAEIKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGI 273

Query: 291 FTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           F    CG+  +H V  VGYG   +   YWLV+NSWG++WGE GY++++R+ +    G CG
Sbjct: 274 FNARNCGTDPNHAVAVVGYGKALDDSKYWLVKNSWGTEWGERGYIRIKRD-IRAKEGLCG 332

Query: 349 IAMEASYPV 357
           IA    YP+
Sbjct: 333 IAKYPYYPI 341


>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
 gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
          Length = 350

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 160/362 (44%), Positives = 225/362 (62%), Gaps = 34/362 (9%)

Query: 2   ATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHG 60
           A A M LA+ T+V         A D+S          +S+    +E M + +Q W+A+HG
Sbjct: 15  AAALMILAVMTMVV-------EARDLS----------TSTGGYGEEAMKVRHQQWMAEHG 57

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           +T        +RFQ+FK N  F+D  N+   ++Y++ +N+FAD+TN+E+ AMY G +   
Sbjct: 58  RTYKDEAEKARRFQVFKANADFVDRSNAAGGKSYELAINEFADMTNDEFVAMYTGLKPVP 117

Query: 120 KRRLMKSKVASQRYA-CKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
                  K+A  +Y      D   ++VDWR+KGAV  +K+QG CG CWAF+ VAAVE I+
Sbjct: 118 AG---PKKMAGFKYENLTLSDVDQQAVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVESIH 174

Query: 179 KIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           +I TG L+SLSEQ+++DCD   N GCNGG +D AFQ+II NGG+ +E  YPY  A+  C 
Sbjct: 175 QITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIISNGGLATEDAYPYAAAQGTCQ 234

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGS 297
            S + A  V+I  Y+DV   DE +L  AVA+QPV+VAI+A    FQ Y SGV T + CG+
Sbjct: 235 SSVQPA--VTISSYQDVPSGDEAALAAAVANQPVAVAIDAHNN-FQFYSSGVLTADTCGT 291

Query: 298 -ALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
            +L+H V AVGY T E+G  YWL++N WG +WGE GY++++R      T  CG+A +ASY
Sbjct: 292 PSLNHAVTAVGYSTAEDGTPYWLLKNQWGQNWGEGGYLRVER-----GTNACGVAQQASY 346

Query: 356 PV 357
           PV
Sbjct: 347 PV 348


>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
          Length = 314

 Score =  281 bits (718), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 154/353 (43%), Positives = 208/353 (58%), Gaps = 51/353 (14%)

Query: 10  ISTLVFLFFISSSSAA-DMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHGKTSNGMG 67
           ++ L F FF  ++ AA D+S                DD  M   ++ W+A++ +      
Sbjct: 9   LAILGFAFFCGAALAARDLS----------------DDSAMVARHEQWMAQYSRVYKDAS 52

Query: 68  HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
              +RF                         KFADLTN E+R+  + T    K   MK  
Sbjct: 53  EKARRF-------------------------KFADLTNHEFRS--VKTNKGFKSSNMKI- 84

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
           +   RY   + D LP ++DWR KG V P+KDQG CG C AFS VAA EGI KI TG+L+S
Sbjct: 85  LTGFRYENVSADALPTTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVS 144

Query: 188 LSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           L++QELVDCD    + GC GGLMD AF+FII+NGG+ +E  YPY  A+ KC+    +A  
Sbjct: 145 LADQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCNSGSNSA-- 202

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
            +I GYEDV   DE +L KA+A+QPVSVA++ G   F+ Y  GV TG CG+ LDHG+ A+
Sbjct: 203 ATIKGYEDVPANDEAALMKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAI 262

Query: 307 GYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           GYG T +G  YWL++NSWG+ WGENGY+++++++ D   G CG+AME SYP K
Sbjct: 263 GYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKR-GMCGLAMEPSYPTK 314


>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 288

 Score =  281 bits (718), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 142/291 (48%), Positives = 192/291 (65%), Gaps = 12/291 (4%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           ++  S+ +AIS    L     + A D SI+ Y   H         D+++ ++++W+++H 
Sbjct: 8   LSKFSLLVAISASALL---CCAFARDFSIVGYTPEH-----LTNTDKLLELFESWMSEHS 59

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K    +     RF++F++NL  ID+ N+   +Y +GLN+FADLT+EE++  YLG    AK
Sbjct: 60  KAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGL---AK 116

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
            +  + +  S  +  +   +LP+SVDWR+KGAV PVKDQG CGSCWAFSTVAAVEGIN+I
Sbjct: 117 PQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQI 176

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
            TG L SLSEQEL+DCD   N+GCNGGLMDYAFQ+II  GG+  E DYPYL  E  C   
Sbjct: 177 TTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQ 236

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
           + + + V+I GYEDV   D+ SL KA+A QPVSVAIEA GR FQ Y+ GV+
Sbjct: 237 KEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK-GVY 286


>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
          Length = 343

 Score =  280 bits (717), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 156/320 (48%), Positives = 206/320 (64%), Gaps = 11/320 (3%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKF 100
            T   V   +Q W+ ++G++       EKRF+IF +NL +I++ N+   N++YK+ LN+F
Sbjct: 29  ETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFMENLEYIEKFNNAPGNKSYKLDLNQF 88

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
           +DLTNEE+ A + G   D  +    SK AS      +  + P S+DWRE+GAV  VK+QG
Sbjct: 89  SDLTNEEFIASHTGLMIDPSKPSSSSKRASPASLDLS--DTPTSLDWREQGAVTDVKNQG 146

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQN 219
           +CGSCWAFS VAAVEGI KI  G LISLSEQ+LVDC   + N GC GG MD AF +I +N
Sbjct: 147 NCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCASNEQNQGCGGGFMDNAFSYITEN 206

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
            G+ SE DY Y G    C  +        I GYEDV P  E  L  AV+ QPVSVAI A 
Sbjct: 207 -GIASENDYQYRGGAGTCQNNEMITPAARISGYEDV-PAGEDQLLLAVSQQPVSVAI-AV 263

Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGT--ENGVDYWLVRNSWGSDWGENGYVKLQR 337
           G++F  Y+ G+++G CGS+L+HGV  VGYGT  E+G  YWL++NSWG  WGENGY++L R
Sbjct: 264 GQSFHLYKEGIYSGPCGSSLNHGVTLVGYGTSEEDGTKYWLIKNSWGESWGENGYMRLLR 323

Query: 338 NLLDTNTGKCGIAMEASYPV 357
                + G CGIA++AS+P 
Sbjct: 324 E-SGQSEGHCGIAVKASHPT 342


>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
 gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
          Length = 300

 Score =  280 bits (717), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 134/312 (42%), Positives = 202/312 (64%), Gaps = 19/312 (6%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYR 109
           +++ W AKH K+ +      +R  +F D L +I++HN+  N T+ +GLNKF+DLTN E+R
Sbjct: 1   MFEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGD----ELPESVDWREKGAVNPVKDQGSCGSC 165
           A Y+G          K      R   K  D     LP S+DWR++GAV P+KDQG CGSC
Sbjct: 61  ANYVGK--------FKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSC 112

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFS +A++E  + + T EL+SLSEQ+L+DCD  ++ GC GG  D AF+F+++NGG+ +E
Sbjct: 113 WAFSAIASIESAHFLATKELVSLSEQQLIDCD-TVDQGCQGGFPDDAFKFVVENGGVTTE 171

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
           + YPY G    C+ ++   KVV I GY+DV+     +L KAV+  PV+V I    + FQ+
Sbjct: 172 EAYPYTGFAGSCNTNKN--KVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQN 229

Query: 286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           Y SG+ +G+C ++ DH V+ +GYGTE G+ YW+++NSWG+ WGE+G++K+++       G
Sbjct: 230 YRSGILSGQCCNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIKKK---DGEG 286

Query: 346 KCGIAMEASYPV 357
            CG+  ++SYP 
Sbjct: 287 MCGMNGQSSYPT 298


>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
 gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
          Length = 343

 Score =  280 bits (717), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 156/315 (49%), Positives = 204/315 (64%), Gaps = 22/315 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+A+HG+T +     E+RFQIFK+NL +I+  N + N+TYK+GLNKF+DL+ EE+  
Sbjct: 40  HEQWMARHGRTYHDNAEKERRFQIFKNNLDYIENFNKAFNKTYKLGLNKFSDLSEEEFVT 99

Query: 111 MYLG----TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
            Y G    T        +K    S  Y     DE+PES+DWRE G V  VK+QG CG CW
Sbjct: 100 TYNGYEMPTTLPTANTTVKPTFFSNYYN---QDEVPESIDWRENGVVTSVKNQGECGCCW 156

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           AFS VAAVEGI     G   SLS Q+L+DC    N+GC GG M  AF++I+QN G+ S+ 
Sbjct: 157 AFSAVAAVEGI----AGNGASLSAQQLLDCVGD-NSGCGGGTMIKAFEYIVQNQGIVSDT 211

Query: 227 DYPYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQPVSVAIEA-GGRAFQ 284
           DYPY   +  C   R  + V + I GYE V   +E +LK+AVA QP+SVAI+A  G  F+
Sbjct: 212 DYPYEQTQEMC---RSGSNVAARITGYESVIQSEE-ALKRAVAKQPISVAIDASSGPNFK 267

Query: 285 HYESGVFTGE-CGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
            Y SGVF+ E CG+ L H V  VGYG TE+G  YWLV+NSWG +WGE+GY++LQR+ +  
Sbjct: 268 SYISGVFSAEDCGTHLTHAVTLVGYGTTEDGTKYWLVKNSWGEEWGESGYMRLQRD-VGA 326

Query: 343 NTGKCGIAMEASYPV 357
             G CGIAM+ASYP 
Sbjct: 327 MEGPCGIAMQASYPT 341


>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
          Length = 509

 Score =  280 bits (716), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 149/324 (45%), Positives = 207/324 (63%), Gaps = 13/324 (4%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS---LNRTYKVGLNKFA 101
           ++ V+ +++ W  KHGK        EK+FQ F+DNLR++ E N     +  + VGLNKFA
Sbjct: 44  EERVVELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFA 103

Query: 102 DLTNEEYRAMYLGT--RSDAKRRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVK 157
           D++NEE+R +Y+    +  +KR  ++ +   +  A KA      P S+DWR+ G V  VK
Sbjct: 104 DMSNEEFREVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVK 163

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQG CGSCWAFS+  A+EGIN +  G+LISLSEQELVDCD   N GC GG MDYAF++++
Sbjct: 164 DQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCD-STNDGCEGGYMDYAFEWVM 222

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
            NGG+D+E DYPY G +  C+ ++   K VSIDGYEDV+  +E +L  AV  QP+SV I+
Sbjct: 223 SNGGIDTETDYPYTGEDGTCNTTKEETKAVSIDGYEDVAE-EESALFCAVLKQPISVGID 281

Query: 278 AGGRAFQHYESGVF---TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
            G   FQ Y  G++     +    +DH V+ VGYG E+G +YW+++NSWG+DWG  GY  
Sbjct: 282 GGAIDFQLYTGGIYDGDCSDDPDDIDHAVLVVGYGAESGEEYWIIKNSWGTDWGMKGYAY 341

Query: 335 LQRNLLDTNTGKCGIAMEASYPVK 358
           ++RN    + G C I   ASYP K
Sbjct: 342 IKRN-TSKDYGVCAINAMASYPTK 364


>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
 gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
          Length = 430

 Score =  280 bits (716), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 150/335 (44%), Positives = 205/335 (61%), Gaps = 28/335 (8%)

Query: 52  YQTWLAKHGKTSNGMGHNE--KRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTN 105
           ++ W ++HG         E  KR   F +N  ++ EHN+L      ++ VGLN  A  T 
Sbjct: 98  FERWCSEHGLERYLRDTEEYAKRLATFAENAAYVVEHNALYAIGEVSHWVGLNSLAATTR 157

Query: 106 EEYRAMY-----LGTRSDAKRRLMKSKVASQRYACK---AGDELPESVDWREKGAVNPVK 157
           EEYRA+      L +  DA+     S    ++Y      A  + PE++DW E GAV P K
Sbjct: 158 EEYRALLGYKPELRSSGDAEMLEATSTDKVEQYKASWEYASVDPPEAIDWVELGAVTPPK 217

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           +QG CGSCWAFST  AVEGI KI TG L+SLSEQE+V C ++ N GCNGGLMDYAF++I+
Sbjct: 218 NQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSKQ-NMGCNGGLMDYAFRWIV 276

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           +NGG+DSE  YPY      C+  +    V +IDG++DV P DE  L+KAV+ QPVS+AIE
Sbjct: 277 KNGGIDSEFQYPYSAEALACNRWKLQLHVATIDGFKDVPPGDEKELEKAVSQQPVSIAIE 336

Query: 278 AGGRAFQHYESGVF-TGECGSALDHGVVAVGYGTENG-----------VDYWLVRNSWGS 325
           A  ++FQ Y+ GV+ + ECGS +DHGV+ VGYG ++              +W V+NSWG 
Sbjct: 337 ADTKSFQLYDGGVYDSKECGSQVDHGVLVVGYGFDDTHHNATKHHKRHRHFWKVKNSWGG 396

Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
            WGE G++++ R + D  TG+CGI    SYP K++
Sbjct: 397 TWGEGGFIRMARRISD-ETGQCGITTAPSYPTKSA 430


>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 422

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 161/329 (48%), Positives = 207/329 (62%), Gaps = 17/329 (5%)

Query: 46  DEVMTI---YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLN 98
           D V TI   +  WLA HGK         KR  IF DN  F+  HN  +    +++ + LN
Sbjct: 61  DPVATIEARFDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLN 120

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
             ADLT EE++ M LG  +  KR    S          A    PE++DW  +GAV PVK+
Sbjct: 121 HLADLTREEFKHM-LGYDASKKRVESSSPPVDAANWEYADVTPPETMDWVSRGAVTPVKN 179

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI--NAGCNGGLMDYAFQFI 216
           QG CGSCWAFSTV AVEG+  + TG+LISLSEQELV C  KI  N GC GGLMD  F++I
Sbjct: 180 QGQCGSCWAFSTVGAVEGVVAVKTGDLISLSEQELVSC-AKIGGNNGCKGGLMDNGFEWI 238

Query: 217 IQNGGMDSEQDYPYLGAENKCDP-SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
           ++N G+D E+D+ YL  + +C+   +R AK  SIDG++DV   DE +LKKAV+ QPV+VA
Sbjct: 239 VENRGVDDEEDWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVA 298

Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG----TENGVDYWLVRNSWGSDWGENG 331
           IEA  R FQ Y  GVF GECG+ LDHGV+ VGYG    +     YW V+NSWG+ WGE G
Sbjct: 299 IEADHREFQLYSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEG 358

Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
           Y+++ R  +    G+CG+AM+ASYP K+S
Sbjct: 359 YIRIARGGMGP-AGQCGVAMQASYPTKSS 386


>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 340

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 148/321 (46%), Positives = 203/321 (63%), Gaps = 10/321 (3%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGL 97
           SS   ++  + T ++ W+A H +        ++R QIFK+NL FI++HN+  +  Y + L
Sbjct: 25  SSRTLSESSIATQHEEWMAMHDRVYADSAEKDRRQQIFKENLEFIEKHNNEGKKRYNLSL 84

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR--YACKAGDELPESVDWREKGAVNP 155
           N FADLTNEE+ A + G       +L   K+      +    GD +  S+DWR++GAVN 
Sbjct: 85  NSFADLTNEEFVASHTGALYKPPTQLGSFKINHSLGFHKMSVGD-IEASLDWRKRGAVND 143

Query: 156 VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQF 215
           +K+QG CGSCWAFS VAAVEGIN+I  G+L+SLSEQ LVDC    N GC+G  ++ AF +
Sbjct: 144 IKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSLSEQNLVDCAS--NDGCHGQYVEKAFDY 201

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVA 275
           I ++ G+ +E++YPY+     C  +   A  + I GY+ V+P +E  L  AVA QPVSV 
Sbjct: 202 I-RDYGLANEEEYPYVETVGTCSGNSNPA--IQIRGYQSVTPQNEEQLLTAVASQPVSVL 258

Query: 276 IEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
           +EA G+ FQ Y  GVF+GECG+ L+H V  VGYG E    YWL+RNSWG  WGE GY+KL
Sbjct: 259 LEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYGEEAEGKYWLIRNSWGKSWGEGGYMKL 318

Query: 336 QRNLLDTNTGKCGIAMEASYP 356
            R+  +   G CGI M+ASYP
Sbjct: 319 MRDTGNPQ-GLCGINMQASYP 338


>gi|413956349|gb|AFW88998.1| hypothetical protein ZEAMMB73_678859 [Zea mays]
          Length = 1140

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 128/177 (72%), Positives = 149/177 (84%)

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           GSCWAFST+AAVEGIN+IVTG+LISLSEQELVDCD   N GCNGGLMDYAF+FII NGG+
Sbjct: 780 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGI 839

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
           D+E+DYPY G + +CD +R+NAKVV+ID YEDV   DE SL+KAVA+QPVSVAIEA G  
Sbjct: 840 DTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTT 899

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
           FQ Y SG+FTG CG+ALDHGV AVGYGTENG DYW+++NSWGS WGE+G    +R L
Sbjct: 900 FQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIMKNSWGSSWGESGRAPTRRTL 956


>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
 gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
          Length = 380

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 209/339 (61%), Gaps = 26/339 (7%)

Query: 44  TDDEVMTI--YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGL 97
           TDD    I  +Q W A + K+   +  + +RF ++  N+ +I+  N+       TY++G 
Sbjct: 42  TDDNSPMIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIEATNAEAEAAGLTYELGE 101

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSK-------VASQRYACKAGDELP-------- 142
             + DLTN+E+ AMY    S A+    + +       + ++     A  +LP        
Sbjct: 102 TAYTDLTNQEFMAMYTAAPSPAQLPADEDEDDAAEAVITTRAGPVDAVGQLPVYVNLSTA 161

Query: 143 --ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
              SVDWR  GAV PVK+QG CGSCWAFSTVA VEGI +I TG+L+SLSEQELVDCD  +
Sbjct: 162 APASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD-TL 220

Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
           +AGC+GG+   A ++I  NGG+ +E+DYPY G  + C+ ++      SI G   V+   E
Sbjct: 221 DAGCDGGISYRALRWITSNGGLTTEEDYPYTGTTDACNRAKLAHNAASIAGLRRVATRSE 280

Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT--ENGVDYWL 318
            SL  AVA QPV+V+IEAGG  FQHY+ GV+ G CG++L+HGV  VGYG   E+G  YW+
Sbjct: 281 ASLANAVAGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNHGVTVVGYGQEEEDGDKYWI 340

Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           ++NSWG+ WG+ GY+K+++++     G CGIA+  S+P+
Sbjct: 341 IKNSWGASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFPL 379


>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
 gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
 gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 341

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 143/312 (45%), Positives = 202/312 (64%), Gaps = 12/312 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+++  +  +       RF+IF +NL+F++  N + N+TY + +N+F+DLT+EE++A
Sbjct: 35  HEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTDEEFKA 94

Query: 111 MYLG-TRSDAKRRLMKS---KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
            Y G    +   R+  +   +  S RY  +   E  ES+DW ++GAV  VK Q  CG CW
Sbjct: 95  RYTGLVVPEGMTRISTTDSHETVSFRY--ENVGETGESMDWIQEGAVTSVKHQQQCGCCW 152

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           AFS VAAVEG+ KI  GEL+SLSEQ+L+DC  + N GC GG+M  AF +I +N G+ +E 
Sbjct: 153 AFSAVAAVEGMTKIANGELVSLSEQQLLDCSTE-NNGCGGGIMWKAFDYIKENQGITTED 211

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
           +YPY GA+  C+ +   A  +S  GYE V   DE +L KAV+ QPVSVAIE  G  F HY
Sbjct: 212 NYPYQGAQQTCESNHLAAATIS--GYETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHY 269

Query: 287 ESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
             G+F GECG+ L H V  VGYG +E G+ YWL++NSWG  WGENGY+++ R+ +D+  G
Sbjct: 270 SGGIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGENGYMRIMRD-VDSPQG 328

Query: 346 KCGIAMEASYPV 357
            CG+A  A YPV
Sbjct: 329 MCGLASLAYYPV 340


>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
 gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
          Length = 328

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 138/318 (43%), Positives = 203/318 (63%), Gaps = 20/318 (6%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFAD 102
           +D  ++  ++ W+ ++G+         +RFQ+FKDN+ F++  N+  N  + +G+N+FAD
Sbjct: 28  SDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKDNVAFVESFNTNKNNKFWLGVNQFAD 87

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           LT EE++A         K    K      +Y   +   LP +VDWR KGAV P+K+QG C
Sbjct: 88  LTTEEFKA-----NKGFKPTAEKVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQC 142

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGG 221
                    AA+EGI K+ TG LISLSEQELVDCD   ++ GC GG MD AF+F+I+NGG
Sbjct: 143 ---------AAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGG 193

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E +YPY   + KC    ++A   +I G+EDV   +E +L KAVA+QPVSVA++A  R
Sbjct: 194 LATESNYPYKAVDGKCKGGSKSA--ATIKGHEDVPVNNEAALMKAVANQPVSVAVDASDR 251

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
            F  Y  GV TG CG+ LDHG+ A+GYG E +G  YW+++NSWG+ WGE G++++++++ 
Sbjct: 252 TFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWILKNSWGTTWGEKGFLRMEKDIT 311

Query: 341 DTNTGKCGIAMEASYPVK 358
           D   G CG+AM+ SYP +
Sbjct: 312 DKR-GMCGLAMKPSYPTE 328


>gi|46576360|sp|P60994.1|ERVB_TABDI RecName: Full=Ervatamin-B; Short=ERV-B
 gi|30749291|pdb|1IWD|A Chain A, Proposed Amino Acid Sequence And The 1.63 Angstrom X-ray
           Crystal Structure Of A Plant Cysteine Protease Ervatamin
           B: Insight Into The Structural Basis Of Its Stability
           And Substrate Specificity
          Length = 215

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 131/218 (60%), Positives = 164/218 (75%), Gaps = 4/218 (1%)

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           LP  VDWR KGAVN +K+Q  CGSCWAFS VAAVE INKI TG+LISLSEQELVDCD   
Sbjct: 1   LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTA- 59

Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
           + GCNGG M+ AFQ+II NGG+D++Q+YPY   +  C P R   +VVSI+G++ V+  +E
Sbjct: 60  SHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYR--LRVVSINGFQRVTRNNE 117

Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
            +L+ AVA QPVSV +EA G  FQHY SG+FTG CG+A +HGVV VGYGT++G +YW+VR
Sbjct: 118 SALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWIVR 177

Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           NSWG +WG  GY+ ++RN+  +  G CGIA   SYP K
Sbjct: 178 NSWGQNWGNQGYIWMERNVASS-AGLCGIAQLPSYPTK 214


>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
          Length = 396

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 160/384 (41%), Positives = 223/384 (58%), Gaps = 32/384 (8%)

Query: 6   MFLAISTLVFL---FFISSSSAADMSIISYDNNHDHSSSW--RTDDEVMTI--------Y 52
            FL ++TL  L   F  ++++A  +   +  N          R DD+   +        +
Sbjct: 13  FFLLLTTLAILSLSFLPTATTAIRLEPENTINEKTDEVELVLRNDDDKRVLRESKIEDAF 72

Query: 53  QTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEY 108
             WL K+ K         KR +IF +N  F+ EHN+       ++ V +NKFA  T EEY
Sbjct: 73  DAWLVKYDKEIANAEERLKRLKIFGENYLFVLEHNAKYVAGKVSHYVEMNKFAAHTREEY 132

Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACK-AGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           R M LG +   +R+    + A      +  G E PES+DW ++G +   K+QGSCGSCWA
Sbjct: 133 RKM-LGFKKSLRRKKDSGEAAKDVSLWEYEGVEAPESIDWVDEGVITTPKNQGSCGSCWA 191

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS + AVEGIN I TG+L+SLSEQELV C R+  N GCNGGLMD AF++I++NGG+DSE+
Sbjct: 192 FSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAFEWIVENGGVDSEK 251

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
            Y Y  + + C   +    + SIDG+ DV   DE +LKKAV+ QPVSVAIEA  R+FQ Y
Sbjct: 252 QYQYKASFDDCKTRKTLLHIASIDGFNDVPSNDETALKKAVSQQPVSVAIEADQRSFQLY 311

Query: 287 ESGVFTGE-CGSALDHGVVAVGYGTENGVD----------YWLVRNSWGSDWGENGYVKL 335
             GV+  E CG+ LDHGV+ VGYG ++             YW ++NSW   WGE GY+++
Sbjct: 312 GGGVYHAEDCGTQLDHGVLVVGYGIDHNSSNVIIPGATKKYWKIKNSWSEQWGEGGYIRI 371

Query: 336 QRNLLDTNTGKCGIAMEASYPVKN 359
            R+ +++ +G CG+A  ASYP K 
Sbjct: 372 ARD-VESPSGMCGVAEMASYPEKT 394


>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
 gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
          Length = 535

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 150/312 (48%), Positives = 195/312 (62%), Gaps = 18/312 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL--NKFADLTNEEYR 109
           +  W+  H  + +      KR + +  N  +I EHN  N    V L  N+F+ ++ EE++
Sbjct: 29  FSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFK 88

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGD-----ELPESVDWREKGAVNPVKDQGSCGS 164
               G        +M      QR A +  +     ++P+SVDW++KG V PVK+QG CGS
Sbjct: 89  FKMTG-------YVMPEGYLEQRLASRVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGS 141

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
           CWAFST  AVEG   + +G+L+SLSEQELVDCD   + GCNGGLMD+AF +I  NGG+ S
Sbjct: 142 CWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICS 201

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           E DY Y      C   R   KVV I G++DV+P DE +LK AVA QPVSVAIEA  +AFQ
Sbjct: 202 EDDYEYKAKAQVC---RDCEKVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQ 258

Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            Y+SGVF   CG+ LDHGV+AVGYG+ENG  +W V+NSWGS WGE GY++L R   +   
Sbjct: 259 FYKSGVFNLTCGTRLDHGVLAVGYGSENGQKFWKVKNSWGSSWGEKGYIRLARE-ENGPA 317

Query: 345 GKCGIAMEASYP 356
           G+CGIA   SYP
Sbjct: 318 GQCGIASVPSYP 329


>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
 gi|194689328|gb|ACF78748.1| unknown [Zea mays]
 gi|219886279|gb|ACL53514.1| unknown [Zea mays]
 gi|238010470|gb|ACR36270.1| unknown [Zea mays]
 gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
          Length = 354

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 157/366 (42%), Positives = 218/366 (59%), Gaps = 33/366 (9%)

Query: 3   TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHGK 61
           T   F A++  +       + A D+S          +S+    +E M + +Q W+A+HG+
Sbjct: 10  TVITFTAVALTILAVTTMMAEARDLS---------STSTGGYGEEAMKVRHQQWMAEHGR 60

Query: 62  TSNGMGHNEKRFQIFKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
           T         RFQ+FK N  F+D  N+     ++Y++ LN+FAD+TN+E+ AMY G R  
Sbjct: 61  TYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAMYTGLRPV 120

Query: 119 AKRRLMKSKVASQRYA---CKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVE 175
                   K+A  +Y        D+  ++VDWR+KGAV  +K+QG CG CWAF+ VAAVE
Sbjct: 121 PAG---AKKMAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVE 177

Query: 176 GINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
           GI++I TG L+SLSEQ+++DCD   N GCNGG +D AFQ+I+ NGG+ +E  YPY  A+ 
Sbjct: 178 GIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIVGNGGLGTEDAYPYTAAQA 237

Query: 236 KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGEC 295
            C   +    V +I GY+DV   DE +L  AVA+QPVSVAI+A    FQ Y  GV T   
Sbjct: 238 MCQSVQ---PVAAISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTAAS 292

Query: 296 GSA---LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
            S    L+H V AVGYGT E+G  YWL++N WG +WGE GY++L+R         CG+A 
Sbjct: 293 CSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLER-----GANACGVAQ 347

Query: 352 EASYPV 357
           +ASYPV
Sbjct: 348 QASYPV 353


>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 291

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 144/231 (62%), Positives = 173/231 (74%), Gaps = 3/231 (1%)

Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK 199
           ++P SVDWR+KGAV  VKDQG CGSCWAFST+AAVEGIN I T  L SLSEQ+LVDCD K
Sbjct: 60  DVPSSVDWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCDTK 119

Query: 200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
            NAGCNGGLMDYAFQ+I ++GG+ +E  YPY  A      +++ + VV+IDGYEDV   D
Sbjct: 120 SNAGCNGGLMDYAFQYIAKHGGVAAEDAYPYK-ARQASSCNKKPSAVVTIDGYEDVPAND 178

Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWL 318
           E +LKKAVA QPV+VAIEA G  FQ Y  GVF G+CG+ LDHGV AVGYGT  +G  YW+
Sbjct: 179 ETALKKAVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWI 238

Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPH 369
           V+NSWG +WGE GY++++R++ D   G CGIAMEASYPVK S N      H
Sbjct: 239 VKNSWGPEWGEKGYIRMKRDVEDKE-GLCGIAMEASYPVKTSTNPKHAGAH 288


>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
          Length = 510

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 150/312 (48%), Positives = 195/312 (62%), Gaps = 18/312 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL--NKFADLTNEEYR 109
           +  W+  H  + +      KR + +  N  +I EHN  N    V L  N+F+ ++ EE++
Sbjct: 29  FSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFK 88

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGD-----ELPESVDWREKGAVNPVKDQGSCGS 164
               G        +M      QR A +  +     ++P+SVDW++KG V PVK+QG CGS
Sbjct: 89  FKMTG-------YVMPEGYLEQRLASRVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGS 141

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
           CWAFST  AVEG   + +G+L+SLSEQELVDCD   + GCNGGLMD+AF +I  NGG+ S
Sbjct: 142 CWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICS 201

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           E DY Y      C   R   KVV I G++DV+P DE +LK AVA QPVSVAIEA  +AFQ
Sbjct: 202 EDDYEYKAKAQVC---RDCEKVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQ 258

Query: 285 HYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            Y+SGVF   CG+ LDHGV+AVGYG+ENG  +W V+NSWGS WGE GY++L R   +   
Sbjct: 259 FYKSGVFNLTCGTRLDHGVLAVGYGSENGQKFWKVKNSWGSSWGEKGYIRLARE-ENGPA 317

Query: 345 GKCGIAMEASYP 356
           G+CGIA   SYP
Sbjct: 318 GQCGIASVPSYP 329


>gi|5853329|gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
          Length = 501

 Score =  278 bits (712), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 159/376 (42%), Positives = 229/376 (60%), Gaps = 33/376 (8%)

Query: 1   MATASMFLAISTLVFLFFISSSSA-----ADMSIISYDNNHDHSSSWRTDDEVMTIYQTW 55
           MAT++  + I  L+FL ++S S +     ++ SI+    N   SS+     +V  ++  W
Sbjct: 1   MATSNSMITI--LIFLTYVSYSISTKTLPSEFSILEGQENDILSSA-----KVSDLFGKW 53

Query: 56  LAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT---YKVGLNKFADLTNEEYRAMY 112
              HGKT         R + FK +++F+ E NS  ++   + VGLNKFADL+NEE++ MY
Sbjct: 54  KELHGKTYQHEEEENLRLENFKKSVKFVMEKNSERKSELDHTVGLNKFADLSNEEFKEMY 113

Query: 113 L----GTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           +    G+RS + K   +K  ++     C A    P S+DWR+KG V P+KDQG CGSCWA
Sbjct: 114 MSKVKGSRSNELKMGGVKRNMSVSSRTCDA----PTSLDWRDKGVVTPMKDQGQCGSCWA 169

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
           FS   ++E  N I TG+LI LSEQELVDCD   + GC+GG MD A+++II+NGG+DSE D
Sbjct: 170 FSVSGSIESANAIATGDLIRLSEQELVDCDT-YDYGCDGGNMDTAYRWIIKNGGLDSEDD 228

Query: 228 YPYL---GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           YPY    G + KCD ++    VVS+D Y +V   +E ++  AVA  PV++ I      FQ
Sbjct: 229 YPYTSSNGRDGKCDKTKSAKSVVSLDSYVEVES-NEDAVLCAVATTPVTIGIVGSAYDFQ 287

Query: 285 HYESGVFTGECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
            Y  GV+ G+C S    +DH V+ VGYG+++G DYW+V+NSWG+ WG  GY+ ++RN  D
Sbjct: 288 LYTGGVYNGQCSSKPYDIDHAVLIVGYGSQDGKDYWIVKNSWGTYWGLEGYILMERN-TD 346

Query: 342 TNTGKCGIAMEASYPV 357
              G CG+ +E  YP+
Sbjct: 347 IKNGVCGMYLEPVYPI 362


>gi|404312774|pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 2.6 Angstroem Resolution
 gi|404312775|pdb|3TNX|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 2.6 Angstroem Resolution
 gi|428698029|pdb|3USV|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
 gi|428698030|pdb|3USV|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
          Length = 363

 Score =  278 bits (712), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 143/334 (42%), Positives = 205/334 (61%), Gaps = 15/334 (4%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           D SI+ Y  N D +S+ R    ++ ++++W+ KH K    +     RF+IFKDNL++IDE
Sbjct: 45  DFSIVGYSQN-DLTSTER----LIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDE 99

Query: 86  HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
            N  N +Y +GLN FAD++N+E++  Y G+ +        ++++ +         +PE V
Sbjct: 100 TNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAG---NYTTTELSYEEVLNDGDVNIPEYV 156

Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
           DWR+KGAV PVK+QGSCGS WAFS V+ +E I KI TG L   SEQEL+DCDR+ + GCN
Sbjct: 157 DWRQKGAVTPVKNQGSCGSAWAFSAVSTIESIIKIRTGNLNEYSEQELLDCDRR-SYGCN 215

Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
           GG    A Q + Q  G+     YPY G +  C    +       DG   V P++E +L  
Sbjct: 216 GGYPWSALQLVAQY-GIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLY 274

Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGS 325
           ++A+QPVSV +EA G+ FQ Y  G+F G CG+ +DH V AVGYG     +Y L+RNSWG+
Sbjct: 275 SIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGP----NYILIRNSWGT 330

Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
            WGENGY++++R   ++  G CG+   + YPVKN
Sbjct: 331 GWGENGYIRIKRGTGNS-YGVCGLYTSSFYPVKN 363


>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
           endopeptidase; AltName: Full=Papaya peptidase B;
           AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
           Precursor
 gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
          Length = 348

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 153/359 (42%), Positives = 220/359 (61%), Gaps = 16/359 (4%)

Query: 5   SMFLAISTLVF----LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           ++  + S L+F    LF   S S  D SI+ Y  + D +S+ R    ++ ++ +W+ KH 
Sbjct: 2   AIICSFSKLLFVAICLFGHMSLSYCDFSIVGYSQD-DLTSTER----LIQLFNSWMLKHN 56

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K    +     RF+IFKDNL++IDE N +   Y +GLN+F+DL+N+E++  Y+G+  +  
Sbjct: 57  KNYKNVDEKLYRFEIFKDNLKYIDERNKMINGYWLGLNEFSDLSNDEFKEKYVGSLPED- 115

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
                ++   + +  +   +LPESVDWR KGAV PVK QG C SCWAFSTVA VEGINKI
Sbjct: 116 ---YTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKI 172

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
            TG L+ LSEQELVDCD++ + GCN G    + Q++ QNG +     YPY+  +  C  +
Sbjct: 173 KTGNLVELSEQELVDCDKQ-SYGCNRGYQSTSLQYVAQNG-IHLRAKYPYIAKQQTCRAN 230

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
           +     V  +G   V   +E SL  A+A QPVSV +E+ GR FQ+Y+ G+F G CG+ +D
Sbjct: 231 QVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVD 290

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           H V AVGYG   G  Y L++NSWG  WGENGY++++R     + G CG+   + YP+KN
Sbjct: 291 HAVTAVGYGKSGGKGYILIKNSWGPGWGENGYIRIRR-ASGNSPGVCGVYRSSYYPIKN 348


>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
 gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
          Length = 374

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 147/341 (43%), Positives = 206/341 (60%), Gaps = 21/341 (6%)

Query: 37  DHSSSWRTDDEVMT-IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR---- 91
           D   S  TDD  M   +Q W A + K+   +    +RF++   N+ +I+  N+       
Sbjct: 34  DMERSMSTDDSSMIERFQRWKAAYNKSYATVAEERRRFRVCARNMAYIEATNAEAEAAGL 93

Query: 92  TYKVGLNKFADLTNEEYRAMYLG---TRSDAKRRLMKSKVASQRYACKAGDELP------ 142
           TY++G   + DLTN+E+ AMY      +  A   ++ ++         A  +LP      
Sbjct: 94  TYELGETAYTDLTNQEFMAMYTAPAPAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLS 153

Query: 143 ----ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
                SVDWR  GAV PVK+QG CGSCWAFSTVA VEGI +I TG+L+SLSEQELVDCD 
Sbjct: 154 TSAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD- 212

Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
            ++ GC+GG+   A ++I  NGG+ +E DYPY G  + C+ ++ +   VSI G   V+  
Sbjct: 213 TLDDGCDGGISYRALRWIASNGGITTETDYPYTGTTDACNRAKLSHNAVSIAGLRRVATR 272

Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDY 316
            E SL  AVA QPV+V+IEAGG  FQHY+ GV+ G CG+ L+HGV  VGYG E   G  Y
Sbjct: 273 SEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAGGDRY 332

Query: 317 WLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           W+V+NSWG  WG++GY+++++++     G CGIA+  SYP+
Sbjct: 333 WIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373


>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
          Length = 262

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 138/233 (59%), Positives = 167/233 (71%), Gaps = 8/233 (3%)

Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK 199
           +LP SVDWR+KGAV  VKDQG CGSCWAFSTV +VEGIN I TG L+SLSEQEL+DCD  
Sbjct: 3   DLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTA 62

Query: 200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR---RNAKVVSIDGYEDVS 256
            N GC GGLMD AF++I  NGG+ +E  YPY  A   C+ +R    +  VV IDG++DV 
Sbjct: 63  DNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVP 122

Query: 257 PFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVD 315
              E  L +AVA+QPVSVA+EA G+AF  Y  GVFTGECG+ LDHGV  VGYG  E+G  
Sbjct: 123 ANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKA 182

Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
           YW V+NSWG  WGE GY++++++    + G CGIAMEASYPVK     +KPKP
Sbjct: 183 YWTVKNSWGPSWGEQGYIRVEKD-SGASGGLCGIAMEASYPVK---TYSKPKP 231


>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
 gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
          Length = 475

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 148/350 (42%), Positives = 217/350 (62%), Gaps = 34/350 (9%)

Query: 16  LFFISSSS-AADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQ 74
           L F+S     ++ SI+++D N      + ++++V+ ++Q W  +H K          R +
Sbjct: 19  LTFLSCYGIPSEYSILAFDLN-----KFPSEEQVVELFQQWKKEHQKFYIHPEEAALRLE 73

Query: 75  IFKDNLRFIDEHNSLNRT---YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
            FK NL++I E N++  +   + +GLN+FAD++NEE++  ++            SKV S 
Sbjct: 74  NFKRNLKYIVERNAMRNSPVGHHLGLNRFADMSNEEFKNKFI------------SKVES- 120

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
                  D+ P S+DWR+KG V  VKDQG+CGSCW+FS+  A+EG+N IVTG+LISLSEQ
Sbjct: 121 ------CDDAPYSLDWRKKGVVTGVKDQGNCGSCWSFSSTGAIEGVNAIVTGDLISLSEQ 174

Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
           ELVDCD   N GC GG MDYAF+++I NGG+D+E DYPY+G    C+ ++   KVV+IDG
Sbjct: 175 ELVDCD-TTNDGCEGGYMDYAFEWVINNGGIDTEADYPYIGVGGTCNVTKEETKVVTIDG 233

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA---LDHGVVAVGY 308
           Y DV+  D  +L  A   QP+SV I+     FQ Y  G++ G+C S    +DH V+ VGY
Sbjct: 234 YTDVTQSDS-ALFCATVKQPISVGIDGSTLDFQLYTGGIYDGDCSSNPDDIDHAVLIVGY 292

Query: 309 GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           G++   DYW+V+NSWG+ WG  G++ ++RN  +   G C I   AS+P K
Sbjct: 293 GSDGNQDYWIVKNSWGTSWGIEGFIYIRRN-TNLKYGVCAINYMASFPTK 341


>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
 gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
          Length = 352

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 150/350 (42%), Positives = 208/350 (59%), Gaps = 19/350 (5%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           LVFLF       A  S  S D            D +M  ++ W+A++G+         +R
Sbjct: 7   LVFLFLFLCVMWASPSAASRD---------EPSDPMMKRFEEWMAEYGRVYKDNDEKMRR 57

Query: 73  FQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
           FQIFK+N+  I+  N+ N  +Y +G+NKF D+TN E+ A Y G  S   R L   K    
Sbjct: 58  FQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGIS---RPLNIEKEPVV 114

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
            +       + +S+DWR+ GAV  VKDQ  CGSCWAFS +A VEGI KIVTG L+SLSEQ
Sbjct: 115 SFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQ 174

Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
           E++DC   ++ GC+GG +D A+ FII N G+ SE DYPY   +  C  +        I G
Sbjct: 175 EVLDC--AVSNGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDC-AANSWPNSAYITG 231

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
           Y  V   DE S+K AV +QP++ AI+A G  FQ+Y  GVF+G CG++L+H +  +GYG +
Sbjct: 232 YSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQD 291

Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
            +G  YW+V+NSWGS WGE GY+++ R +  +++G CGIAM+  YP   S
Sbjct: 292 SSGTQYWIVKNSWGSSWGERGYIRMARGV--SSSGLCGIAMDPLYPTLQS 339


>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
 gi|223948637|gb|ACN28402.1| unknown [Zea mays]
 gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
          Length = 354

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 157/366 (42%), Positives = 218/366 (59%), Gaps = 33/366 (9%)

Query: 3   TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTI-YQTWLAKHGK 61
           T   F A++  +       + A D+S          +S+    +E M + +Q W+A+HG+
Sbjct: 10  TVIAFTAVALTILAVKTMMAEARDLS---------STSTGGYGEEAMKVRHQQWMAEHGR 60

Query: 62  TSNGMGHNEKRFQIFKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSD 118
           T         RFQ+FK N  F+D  N+     ++Y++ LN+FAD+TN+E+ AMY G R  
Sbjct: 61  TYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAMYTGLRPV 120

Query: 119 AKRRLMKSKVASQRYA---CKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVE 175
                   K+A  +Y        D+  ++VDWR+KGAV  +K+QG CG CWAF+ VAAVE
Sbjct: 121 PAG---AKKMAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVE 177

Query: 176 GINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
           GI++I TG L+SLSEQ+++DCD + N GCNGG +D AFQ+I  NGG+ +E  YPY  A+ 
Sbjct: 178 GIHQITTGNLVSLSEQQVLDCDTEGNNGCNGGYIDNAFQYIAGNGGLATEDAYPYTAAQA 237

Query: 236 KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGEC 295
            C   +    V +I GY+DV   DE +L  AVA+QPVSVAI+A    FQ Y  GV T   
Sbjct: 238 MCQSVQ---PVAAISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTAAS 292

Query: 296 GSA---LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
            S    L+H V AVGYGT E+G  YWL++N WG +WGE GY++L+R         CG+A 
Sbjct: 293 CSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLER-----GANACGVAQ 347

Query: 352 EASYPV 357
           +ASYPV
Sbjct: 348 QASYPV 353


>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
 gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
          Length = 374

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 142/332 (42%), Positives = 204/332 (61%), Gaps = 20/332 (6%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKF 100
           D  ++  +Q W A + K+   +    +RF+++  N+ +I+  N+       TY++G   +
Sbjct: 43  DSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAAGLTYELGETAY 102

Query: 101 ADLTNEEYRAMYLG---TRSDAKRRLMKSKVASQRYACKAGDELP----------ESVDW 147
            DLTN+E+ AMY      +  A   ++ ++         A  +LP           SVDW
Sbjct: 103 TDLTNQEFMAMYTAPALAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSASAPASVDW 162

Query: 148 REKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGG 207
           R  GAV PVK+QG CGSCWAFSTVA VEGI +I TG+L+SLSEQELVDCD  ++ GC+GG
Sbjct: 163 RASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD-TLDDGCDGG 221

Query: 208 LMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAV 267
           +   A ++I  NGG+ +E DYPY G  + C+ ++ +   VSI G   V+   E SL  AV
Sbjct: 222 ISYRALRWIASNGGITTEADYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEASLANAV 281

Query: 268 ADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGS 325
           A QPV+V+IEAGG  FQHY+ GV+ G CG+ L+HGV  VGYG E   G  YW+V+NSWG 
Sbjct: 282 AGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAAGDRYWIVKNSWGQ 341

Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            WG++GY+++++++     G CGIA+  SYP+
Sbjct: 342 GWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373


>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
 gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
          Length = 333

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 148/315 (46%), Positives = 205/315 (65%), Gaps = 21/315 (6%)

Query: 52  YQT----WLAKHGKTSNGMGHNE--KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
           YQT    W+ KH +  +   H E   R+Q FK+N+ FI + NS      +GL KFADLTN
Sbjct: 29  YQTSFIGWMRKHDRAYS---HEEFTDRYQAFKENMDFIHKWNSQESDTVLGLTKFADLTN 85

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
           EEY+  YLG + + K+ L  ++   + +        P+S+DWREKGAV+ VKDQG CGSC
Sbjct: 86  EEYKKHYLGIKVNVKKNLNAAQKGLKFFKFTG----PDSIDWREKGAVSQVKDQGQCGSC 141

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
           W+FST  AVEG ++I +G ++SLSEQ LVDC  +  N GC GGLM  AF++II NGG+ +
Sbjct: 142 WSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGGIAT 201

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           E  YPY  A+ +C  + ++    +I GY+++   +E SL  A+A QPVSVAI+A   +FQ
Sbjct: 202 ESSYPYTAAQGRCKFT-KSMNGANIIGYKEIPQGEEDSLTAALAKQPVSVAIDASHMSFQ 260

Query: 285 HYESGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
            Y SGV+    C S ALDHGV+AVGYGT  G DY++++NSWG  WG++GY+ + RN  + 
Sbjct: 261 LYSSGVYDEPACSSEALDHGVLAVGYGTLEGKDYYIIKNSWGPTWGQDGYIFMSRNAQN- 319

Query: 343 NTGKCGIAMEASYPV 357
              +CG+A  ASYP+
Sbjct: 320 ---QCGVATMASYPI 331


>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
          Length = 533

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 144/307 (46%), Positives = 197/307 (64%), Gaps = 8/307 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVGLNKFADLTNEEYR 109
           +  W++ HG T +      +R + +  N  +I EHN+ N     K+G N F+ ++ +E++
Sbjct: 28  FSAWMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVKLGHNAFSHMSFDEFK 87

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
               G         ++ ++AS+     +  E+P +VDW +KG V PVK+QG CGSCWAFS
Sbjct: 88  FKMTGLV--LPEGYLEQRLASRVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCWAFS 145

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           T  AVEG   + +G+L+SLSEQELVDCD   + GCNGGLMD+AFQ+I  +GG+ SE DY 
Sbjct: 146 TTGAVEGATFVSSGKLLSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYE 205

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y      C   R+   VV + G++DV+P DE +LK AVA QPVSVAIEA  +AFQ Y+SG
Sbjct: 206 YKAKAQVC---RKCDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSG 262

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           VF   CG+ LDHGV+AVGYG +NG  +W V+NSWG+ WGE GY++L R   +   G+CGI
Sbjct: 263 VFNLTCGTRLDHGVLAVGYGNDNGQKFWKVKNSWGASWGEQGYIRLARE-ENGPAGQCGI 321

Query: 350 AMEASYP 356
           A   SYP
Sbjct: 322 ASVPSYP 328


>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
 gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
          Length = 353

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 152/313 (48%), Positives = 199/313 (63%), Gaps = 5/313 (1%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNE 106
           +++ ++ W+A+HG+T        +R +IF+ N  FID  N   + ++++  N+FADLT+E
Sbjct: 43  MVSRHEKWMAEHGRTYTDEAEKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDE 102

Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           E+RA   G R               RY   +  +  +SVDWR  GAV  VKDQG CG CW
Sbjct: 103 EFRAARTGFRPRPAPAAAAGSGGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCW 162

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           AFS VAAVEG+NKI TG L+SLSEQELVDCD    + GC GGLMD AFQFI + GG+ SE
Sbjct: 163 AFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASE 222

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
             YPY G +  C  S   A+  SI G+EDV   +E +L  AVA+QPVSVAI     AF+ 
Sbjct: 223 SGYPYQGDDGSCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRF 282

Query: 286 YESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
           Y+SGV  GECG+ L+H + AVGYGT  +G  YWL++NSWG+ WGE GYV+++R +     
Sbjct: 283 YDSGVLGGECGTDLNHAITAVGYGTAADGSKYWLMKNSWGTSWGEGGYVRIRRGV--RGE 340

Query: 345 GKCGIAMEASYPV 357
           G CG+A   SYPV
Sbjct: 341 GVCGLAKLPSYPV 353


>gi|357477225|ref|XP_003608898.1| Cysteine proteinase, partial [Medicago truncatula]
 gi|355509953|gb|AES91095.1| Cysteine proteinase, partial [Medicago truncatula]
          Length = 260

 Score =  277 bits (709), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 139/275 (50%), Positives = 183/275 (66%), Gaps = 23/275 (8%)

Query: 97  LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
           LNKFAD+TN E+R++Y  ++ +  R        +  +  +  + +P S+DWR+ GAV  V
Sbjct: 2   LNKFADMTNYEFRSIYADSKVNHHRMFRGMSHDNGPFMYENVEGVPSSIDWRKIGAVTGV 61

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFI 216
           KDQG CGSCWAFST+ AVEGIN+I T +L+SLSEQELVDCD ++N GCNGGLM+YAF+FI
Sbjct: 62  KDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTEVNQGCNGGLMEYAFEFI 121

Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
            QN G+ +E +YPY   +  C+  + N   VSIDG+E+V   +E +L KA A+QP+SVAI
Sbjct: 122 KQN-GITTETNYPYAAKDGTCNIQKENKPAVSIDGHENVPANNEKALLKAAANQPISVAI 180

Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
           +AGG  FQ Y  GVFTG CG+ L+HGV                 NSWGS+WGE GY+++Q
Sbjct: 181 DAGGSDFQFYSEGVFTGHCGTELNHGV-----------------NSWGSEWGEQGYIRMQ 223

Query: 337 RNLLDTNTGKCGIAMEASYPV----KNSQNSAKPK 367
           R  +    G CGIAMEASYP+    KN   S+ PK
Sbjct: 224 R-AISHKQGLCGIAMEASYPIKKSSKNPTKSSLPK 257


>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
          Length = 341

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 149/309 (48%), Positives = 197/309 (63%), Gaps = 8/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A+HG+         +R ++F+ N   ID  N+    ++++  N+FADLT EE+RA
Sbjct: 38  HEKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVEEFRA 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
              G R    R    +     RY   +  +  +SVDWR  GAV  VKDQG+CG CWAFS 
Sbjct: 98  ARTGLR---PRPAPSAGAGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGACGCCWAFSA 154

Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAAVEG+NKI TG L+SLSEQELVDCD   ++ GC+GGLMD AFQF+ + GG+ SE  YP
Sbjct: 155 VAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G +  C  S   A+  SI G+EDV   +E +L  AVA+QPVSVAI     AF+ Y+SG
Sbjct: 215 YQGRDGPCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDMAFRFYDSG 274

Query: 290 VFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           V  G CG+ L+H + AVGYGT N G  YWL++NSWG+ WGE GYV+++R +     G CG
Sbjct: 275 VLGGACGTDLNHAITAVGYGTANDGTRYWLMKNSWGASWGEGGYVRIRRGV--RGEGVCG 332

Query: 349 IAMEASYPV 357
           +A   SYPV
Sbjct: 333 LAKLPSYPV 341


>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
 gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 345

 Score =  277 bits (708), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 144/359 (40%), Positives = 219/359 (61%), Gaps = 21/359 (5%)

Query: 4   ASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS 63
           AS+ + ++ L+ LF     S A    + +            +  ++  ++ W+A+  +  
Sbjct: 2   ASIMVLVTVLIILFTGFRISQATSRTVIF-----------REQSMVDKHEQWMARFSREY 50

Query: 64  NGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAK-- 120
                   R  +FK NL+FI+  N   N++YK+G+N+FAD TNEE+ A++ G +   +  
Sbjct: 51  RDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVS 110

Query: 121 -RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
             +++   ++SQ +     D + ES DWR +GAV PVK QG CG CWAFS VAAVEG+ K
Sbjct: 111 PSKVVAKTISSQTW--NVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAK 168

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           I  G L+SLSEQ+L+DCDR+ + GC+GG+M  AF +++QN G+ SE DY Y G++  C  
Sbjct: 169 IAGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRS 228

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
           + R A  +S  G++ V   +E +L +AV+ QPVSV+++A G  F HY  GV+ G CG++ 
Sbjct: 229 NARPAARIS--GFQTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSS 286

Query: 300 DHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           +H V  VGYGT ++G  YWL +NSWG  WGE GY++++R++     G CG+A  A YPV
Sbjct: 287 NHAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQ-GMCGVAQYAFYPV 344


>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
          Length = 233

 Score =  276 bits (707), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 131/229 (57%), Positives = 165/229 (72%), Gaps = 5/229 (2%)

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
           RY   + D LP ++DWR KGAV P+KDQG CG CWAFS VAA EGI KI TG+L+SL+EQ
Sbjct: 8   RYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQ 67

Query: 192 ELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
           ELVDCD    + GC GGLMD AF+FII+NGG+ +E  YPY  A+ KC     +A   +I 
Sbjct: 68  ELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNSA--ATIK 125

Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG- 309
           GYEDV   DE +L KAVA+QPVSVA++ G   FQ Y  GV TG CG+ LDHG+ A+GYG 
Sbjct: 126 GYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGK 185

Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           T +G  YWL++NSWG+ WGENGY+++++++ D   G CG+AME SYP K
Sbjct: 186 TSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKR-GMCGLAMEPSYPTK 233


>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
           vinifera]
          Length = 340

 Score =  276 bits (707), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 144/310 (46%), Positives = 201/310 (64%), Gaps = 9/310 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A++ +        E+RF +FKDN+ FI   ++  N   K+G+N  AD+T+EE+RA
Sbjct: 35  HEQWMARYSRNYKDDAEEERRFXMFKDNVDFIQTFDTAGNMPNKLGVNALADMTHEEFRA 94

Query: 111 MYLGTRSDAKRRL-MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
              G        L ++S+  S R+  +    +P ++DWR+K  V  +K+Q  CG CWAFS
Sbjct: 95  S--GNTFKIPPNLGLRSETTSFRH--QNVTRIPSTMDWRKKRTVTHIKNQLQCGGCWAFS 150

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
            VAA+EGI K+ T + ISLSEQELVDCD    N GC GG MD AF+FIIQN G++SE  Y
Sbjct: 151 AVAAMEGIAKLQTSKSISLSEQELVDCDIFGSNIGCEGGCMDDAFKFIIQNRGLNSEARY 210

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
            Y G E  C+  + +++   I+ YE++  F E +L K VA QP+SVAI+AGG AFQ YE 
Sbjct: 211 LYKGVEGHCNKKKESSRAARINDYENMPEFSEKALLKVVAHQPISVAIDAGGSAFQFYEI 270

Query: 289 GVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           G+ T E G+ LD+GV   GYG + +G  +WLV+NSWG+DWGENGY +++R +  T TG C
Sbjct: 271 GIITXESGNDLDYGVTTDGYGRSADGKKHWLVKNSWGTDWGENGYTRMERGVKAT-TGLC 329

Query: 348 GIAMEASYPV 357
           G  M+ASYP 
Sbjct: 330 GFTMQASYPT 339


>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 147/291 (50%), Positives = 186/291 (63%), Gaps = 19/291 (6%)

Query: 72  RFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
           RF  FK N+  I  HN+L N +Y +GLN+FADL+ EE++  Y G +   +R   +S    
Sbjct: 61  RFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYFGYK-HVEREFARSNNLH 119

Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE--LISL 188
           Q       +  P S+DWR   AV P+KDQG CGSCWAFS   ++EG   ++ G+  L SL
Sbjct: 120 QEV-----EAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIEGA-WVLQGKHTLTSL 173

Query: 189 SEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           SEQ+LVDC     NAGCNGGLMDYAF++II N G+ +E  YPY G    C  S    KVV
Sbjct: 174 SEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYPYKGVGGLCQKS--CTKVV 231

Query: 248 SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           +I GY+DV+  DE SL  AV    PVSVAIEA    FQ Y SGVF+G CG  LDHGV+AV
Sbjct: 232 TISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAV 291

Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GYGT    DYW+V+NSWG+ WGE+GY+++ R     N  +CGIA++ SYP 
Sbjct: 292 GYGTTGSQDYWIVKNSWGTSWGESGYIRMIR-----NKNQCGIAIQPSYPT 337


>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
          Length = 330

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 141/297 (47%), Positives = 187/297 (62%), Gaps = 16/297 (5%)

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
           S+ +G  E  F+    NLR I+ HN+ N ++ +G+ +FADLT  E+ A         KR 
Sbjct: 38  SSQLGLCEPAFRCHLANLRVIEAHNAGNSSFTMGITQFADLTAAEFSAY-------VKRF 90

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
            M   V   R      +   + VDWR+K AV  +K+QG CGSCW+FST  +VEG + I T
Sbjct: 91  PMN--VTRPRNEVWITEAPLQEVDWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIAT 148

Query: 183 GELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
           G+L+SLSEQ+L+DC  R  N GCNGGLMDYAF+++I NGG+D+E+DYPY   + KC+  +
Sbjct: 149 GKLVSLSEQQLMDCSTRYGNHGCNGGLMDYAFEYVIANGGLDTEEDYPYTAEDGKCNTEK 208

Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
                  I G+ +V    E  L  AV+  PVSVAIEA    FQHY SGVF G+CG++LDH
Sbjct: 209 EKKHAAEIHGFRNVPKEHEDQLAAAVSIGPVSVAIEADQAGFQHYTSGVFDGKCGTSLDH 268

Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           GV+ VGY      DYW+V+NSWG  WGE GY++L+R +     G CGI M+ASYP K
Sbjct: 269 GVLVVGYSD----DYWIVKNSWGKSWGEEGYIRLKRGV--DKKGMCGITMQASYPEK 319


>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
           [Oryza sativa Japonica Group]
 gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
          Length = 350

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 157/359 (43%), Positives = 214/359 (59%), Gaps = 18/359 (5%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           +A+ST V    + S +AA    ++ D     +++   D  + + ++ W+AKHGKT     
Sbjct: 1   MALSTFVLAVLVMSGAAALGRELAGDGAAAAAAA---DVAMASRHEKWMAKHGKTYKDEE 57

Query: 68  HNEKRFQIFKDNLRFIDEHNSLNRT-----YKVGLNKFADLTNEEYRAMYLG-TRSDAKR 121
              +R ++F+ N + ID  N+         +++  N+FADLT++E+RA   G  R  A  
Sbjct: 58  EKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDEFRAARTGYQRPPAAV 117

Query: 122 RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
                    + ++  A    P+S+DWR  GAV  VKDQGSCG CWAFS VAAVEG+ KI 
Sbjct: 118 AGAGGGFLYENFSLAAA---PQSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLAKIR 174

Query: 182 TGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           TG+L+SLSEQELVDCD R  + GC GGLMD AFQ+I + GG+ +E  YPY G +     +
Sbjct: 175 TGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAESSYPYRGVDGA-CRA 233

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGSAL 299
                  SI G++DV   DE +L  AVA QPVSVAI   G  F+ Y+ GV  G  CG+ L
Sbjct: 234 AAGRAAASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYDRGVLGGAGCGTEL 293

Query: 300 DHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           +H V AVGYGT  +G  YWL++NSWG+ WGE GYV+++R +     G CGIA  ASYPV
Sbjct: 294 NHAVTAVGYGTASDGTGYWLMKNSWGASWGEGGYVRIRRGV--GREGACGIAQMASYPV 350


>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
          Length = 298

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 156/360 (43%), Positives = 206/360 (57%), Gaps = 66/360 (18%)

Query: 1   MATASMFLAIS-TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKH 59
           MA+ + +  +S  L+F+    +S A   S+      H+ S   R +D        W+A++
Sbjct: 1   MASTNQYQYVSMALLFILAAWASQATSRSL------HEASMYERHED--------WMARY 46

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           G+        EKRF+IFKDN+                    A  T  +Y  +        
Sbjct: 47  GRMYKDANEKEKRFKIFKDNV--------------------AQATTFKYENV-------- 78

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
                                +P ++DWR+KGAV P+KDQ  CGSCWAFS VAA EGI +
Sbjct: 79  -------------------TAVPSTIDWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQ 119

Query: 180 IVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           I TG+LISLSEQELVDCD    N GC+GGL D AF+FI  +G + SE  YPY G +  C+
Sbjct: 120 ITTGKLISLSEQELVDCDTGGENQGCSGGLXDDAFRFIXIHG-LASEATYPYEGDDGTCN 178

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             +       I GYEDV   +E +L+KAVA QPV+VAI+AGG  FQ Y SGVFTG+CG+ 
Sbjct: 179 SKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTE 238

Query: 299 LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           LDHGV AVGYG  ++G+ YWLV+NSWG+ WGE GY+++QR++     G CGIAM+ASYP 
Sbjct: 239 LDHGVAAVGYGIGDDGMXYWLVKNSWGTGWGEEGYIRMQRDVT-AKEGLCGIAMQASYPT 297


>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
 gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
          Length = 321

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 161/356 (45%), Positives = 211/356 (59%), Gaps = 47/356 (13%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           LAI+ LV +F   +S A    +I+             +D ++  ++ W+A+HG+T     
Sbjct: 9   LAIALLV-VFSTWASQAMARQLIN-------------EDALVEKHEQWMARHGRTYQDSE 54

Query: 68  HNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
             E+RFQIFK NL +ID  N + N+TY++GLN FADL++EEY A Y              
Sbjct: 55  EKERRFQIFKSNLEYIDNFNKASNQTYQLGLNNFADLSHEEYVATYT------------- 101

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
                  A K   E+PES+DWR+ GAV P+K+Q  CG CWAFS  AAVEGI  +  G  +
Sbjct: 102 -------ARKMPVEVPESIDWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGI--VANG--V 150

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           SLS Q+L+DC    N GC GG M+ AF +IIQN G+  E DYPY   +  C  SR  A  
Sbjct: 151 SLSAQQLLDCVSD-NQGCKGGWMNNAFNYIIQNQGIALETDYPYQQMQQMCS-SRMAA-- 206

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA-FQHYESGVFTGE-CGSALDHGVV 304
             I G+EDV+P DE +L +AVA QPVSV I+A     F+ Y+ GVFT   CG+   H V 
Sbjct: 207 AQISGFEDVTPKDEEALMRAVAKQPVSVTIDATSNPNFKLYKEGVFTAAGCGNGHSHAVT 266

Query: 305 AVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
            VGYGT E+G  YWL +NSWG  WGE+GY++LQR+ +    G CGIA+ ASYP  N
Sbjct: 267 LVGYGTSEDGTKYWLAKNSWGETWGESGYMRLQRD-IGLEGGPCGIALYASYPTIN 321


>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
          Length = 340

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 149/350 (42%), Positives = 206/350 (58%), Gaps = 20/350 (5%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           LVFLF       A  S  S D            D +M  ++ W+A++G+         +R
Sbjct: 7   LVFLFLFLCVMWASPSAASRD---------EPSDPMMKRFEEWMAEYGRVYKDNDEKMRR 57

Query: 73  FQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
           FQIFK+N+  I+  N+ N  +Y +G+NKF D+TN E+   Y G          +  V S 
Sbjct: 58  FQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVTQYTGV--SLPLNFKREPVVS- 114

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
            +       + +S+DWR+ GAV  VKDQ  CGSCWAFS +A VEGI KIVTG L+SLSEQ
Sbjct: 115 -FDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQ 173

Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
           E++DC   ++ GC+GG +D A+ FII N G+ SE DYPY   E  C  +        I G
Sbjct: 174 EVLDC--AVSNGCDGGFVDNAYDFIISNNGVASEADYPYQAYEGDCTANSW-PNSAYITG 230

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
           Y  V   DE S+K AV +QP++ AI+A G  FQ+Y  GVF+G CG++L+H +  +GYG +
Sbjct: 231 YSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQD 290

Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
            +G  YW+V+NSWGS WGE GYV++ R +  +++G CGIAM+  YP   S
Sbjct: 291 SSGTQYWIVKNSWGSSWGERGYVRMARGV--SSSGLCGIAMDPLYPTLQS 338


>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 323

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 154/310 (49%), Positives = 202/310 (65%), Gaps = 24/310 (7%)

Query: 59  HGKTSNGMGHNEKRF--QIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMY 112
           HGK+    GH+E+ F  Q+F  ++  I+ HN  +     TY++GLNKF D+T+EE+R  +
Sbjct: 26  HGKS---YGHDEEHFRRQLFYKSVAKINAHNLRHDLGLTTYRMGLNKFTDMTSEEFRN-F 81

Query: 113 LGTRSDAKRRLMKSKVASQRYACKA-GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
            G + DA     K+K    R+  +  G+ LP  VDWREKG V PVK+QG CGSCWAFST 
Sbjct: 82  KGLKFDA----TKTKRNGTRFQKELLGEALPTQVDWREKGYVTPVKNQGQCGSCWAFSTT 137

Query: 172 AAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
            ++EG +   TG+L+SLSEQ LVDC R + N GCNGGLMD  F +I QNGG+D+E+ YPY
Sbjct: 138 GSLEGQHFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTYIQQNGGIDTEESYPY 197

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESG 289
            G +  C     N+    + G+ DV   DE +L+ AVA   PVSVAI+A   +FQ+Y+ G
Sbjct: 198 TGKDGDC-AFNENSVGARVKGFVDVPQRDEAALQAAVASVGPVSVAIDASNDSFQYYKEG 256

Query: 290 VF-TGECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           V+    C  S LDHGV+ VGYGTENGVDYWLV+NSWG  WG++GY+K+ RN       +C
Sbjct: 257 VYDEPSCSFSQLDHGVLVVGYGTENGVDYWLVKNSWGPTWGQDGYIKMMRN----KENQC 312

Query: 348 GIAMEASYPV 357
           GIA  ASYP 
Sbjct: 313 GIASMASYPT 322


>gi|428170119|gb|EKX39047.1| hypothetical protein GUITHDRAFT_154556 [Guillardia theta CCMP2712]
          Length = 352

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 143/326 (43%), Positives = 202/326 (61%), Gaps = 13/326 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNK 99
            DDE+   + +W  K  K  +G  H   RF +FK N+  I  HN+L      T+ +  N+
Sbjct: 27  VDDEIHLAFISWKNKFEKVYDGAEH-LARFAVFKANMEIIRAHNALYELGEETFSMAANQ 85

Query: 100 FADLTNEEYRAMYLGTRSDAK-RRLMKSKVASQRYACKAGDEL-PESVDWREKGAVNPVK 157
           FAD+T EE++   LG + + K +RL++   + +    ++ +   P+++DWR K AV PVK
Sbjct: 86  FADMTAEEFKRTVLGYKPELKGKRLLQGLNSGKNCTHRSNNSTRPKAIDWRTKSAVTPVK 145

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           +QG CGSCW+FST  AVEG   +    LISLSE+ELV CD K + GCNGGLMD A+ +II
Sbjct: 146 NQGQCGSCWSFSTTGAVEGAWVVAGHPLISLSEEELVQCDTKSDQGCNGGLMDNAYAWII 205

Query: 218 QNGGMDSEQDYPYL---GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSV 274
           QNGG+ +E  YPY+   G    C  +  + KV SI  + D+ P DE  L+ A+  QPV+V
Sbjct: 206 QNGGIAAEDVYPYISGNGTTGVCHVAFLSKKVASISDWCDLKPEDESDLELALVQQPVAV 265

Query: 275 AIEAGGRAFQHYESGVFTG-ECGSALDHGVVAVGYG--TENGVDYWLVRNSWGSDWGENG 331
           AIEA   +FQ Y  GV    +CG+ LDHGV+AVGYG   ++ + YW+V+NSWG++WG+ G
Sbjct: 266 AIEADQSSFQFYNGGVLPAKKCGTKLDHGVLAVGYGYDKKHKMHYWIVKNSWGAEWGDEG 325

Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPV 357
           Y++L++    T    CGIA  ASYP 
Sbjct: 326 YIRLEKMPKKTKHSACGIAKAASYPT 351


>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
 gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
          Length = 366

 Score =  274 bits (701), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 155/332 (46%), Positives = 199/332 (59%), Gaps = 23/332 (6%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           +++ +  +Y+ W A H   +   G   +RF +FK+N R I EHN   N TY +GLN+F+D
Sbjct: 40  SEESLWALYERWCA-HYNMARDHGEKTRRFDLFKENARRIYEHNHQGNATYTLGLNRFSD 98

Query: 103 LTNEEY-RAMYLGTRS------DAKRRLMKSKVASQRYAC------KAGDEL--PESVDW 147
           +T+EE+ R+ Y G  +      D    L       +            G +L  P +VDW
Sbjct: 99  MTDEEFNRSPYGGCLTAPRMSDDEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDW 158

Query: 148 REKGAVNPVKDQG-SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNG 206
           R + AV  VKDQG +CGSCWAFS +AAVEGIN I T  L+ LSEQ+LVDCD K+N GCNG
Sbjct: 159 RGR-AVTRVKDQGPTCGSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCD-KLNHGCNG 216

Query: 207 GLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKA 266
           GLM  AF F+++N G+  E  YPY+G E +C      A  V+I GY+ V  FD  +L  A
Sbjct: 217 GLMTTAFSFVVRNRGVVPEGAYPYMGREGRC--KHVMAPPVTIYGYQRVPRFDANALMNA 274

Query: 267 VADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSD 326
           VA QPVSVAIEA    F+HY+ GVF G CG  L H   AVGYG + G  +W+V+NSWG  
Sbjct: 275 VAAQPVSVAIEASSFEFRHYQGGVFNGNCGGRLGHAATAVGYGADAGGPFWIVKNSWGPG 334

Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           WGE GYV++ RN      G CGI  E SYPVK
Sbjct: 335 WGEGGYVRISRN-TPVRQGVCGILTENSYPVK 365


>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
 gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
          Length = 330

 Score =  274 bits (700), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 148/317 (46%), Positives = 205/317 (64%), Gaps = 24/317 (7%)

Query: 52  YQT----WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNE 106
           YQT    W+ KH ++ +    N K +Q FKDN+ FI   N+  N    +GL +FADLTNE
Sbjct: 29  YQTSFLGWMKKHDRSYHHHEFNNK-YQAFKDNMDFIHNWNTNKNSKTVLGLTQFADLTNE 87

Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL-PESVDWREKGAVNPVKDQGSCGSC 165
           EYR +YLGT         K  VA +++         P+S+DWR KGAV+ VKDQG CGSC
Sbjct: 88  EYRKIYLGT---------KVNVAPEKHNFNMIHFTGPDSIDWRTKGAVSHVKDQGQCGSC 138

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
           W+FST  +VEG ++I TG +++LSEQ LVDC  K  N GC+GGLM  AF+FI+  GG+ +
Sbjct: 139 WSFSTTGSVEGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFKFIMSQGGVAT 198

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           E  YPY   + KC  ++      +I GY++++   E+ L+ A+  QPVS+AI+A  ++FQ
Sbjct: 199 EDSYPYNAVQGKCKFTKSMVG-ANISGYKEITQGSELELQAALTKQPVSIAIDASQQSFQ 257

Query: 285 HYESGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
            Y+SGV+   EC S  LDHGV+AVGYGTENG DY++V+NSW   WG++GY+ + RN  + 
Sbjct: 258 LYKSGVYDEPECSSYQLDHGVLAVGYGTENGKDYYIVKNSWADSWGQDGYIFMSRNAKN- 316

Query: 343 NTGKCGIAMEASYPVKN 359
              +CG+A  ASYP+ N
Sbjct: 317 ---QCGVATMASYPISN 330


>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
          Length = 294

 Score =  274 bits (700), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 140/296 (47%), Positives = 189/296 (63%), Gaps = 28/296 (9%)

Query: 71  KRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
           KR   F+ NL FI++HN+ +     +Y VG+N+FADLT +E+ A+Y+ ++ +  R +  +
Sbjct: 17  KRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALYVPSKFN--RTMPYN 74

Query: 127 KVASQRYACKAGDELP----ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
            V            LP    +SVDWR KGAV P+K+QG CGSCW+FST  + EG + I T
Sbjct: 75  TVY-----------LPATSEDSVDWRTKGAVTPIKNQGQCGSCWSFSTTGSTEGAHAIAT 123

Query: 183 GELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
           G L+SLSEQ+LVDC     N GCNGGLMD AF++II N G+D+E+DYPY   +  C+  +
Sbjct: 124 GNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTAQDGTCNKEK 183

Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
                 +I  Y DV   +E  L  AVA  PVSVAIEA    FQ Y+SGVF G CG+ LDH
Sbjct: 184 EAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGVFDGNCGTNLDH 243

Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GV+ VGY      DYW+V+NSWG+ WG  GY+ ++R +  + +G CGIAM+ SYP+
Sbjct: 244 GVLVVGYTD----DYWIVKNSWGTTWGVEGYINMKRGV--SASGICGIAMQPSYPI 293


>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 335

 Score =  273 bits (699), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 157/361 (43%), Positives = 211/361 (58%), Gaps = 37/361 (10%)

Query: 5   SMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSN 64
           S+ L + TL+ L  +++S+        Y+N  D       D   M +++ W+AK GKT  
Sbjct: 3   SIVLLVCTLMALQAMAASA-------YYNNGSD-------DGVTMQMFEEWMAKFGKTYK 48

Query: 65  GMGHNEKRFQIFKDNLRFIDEHNSLNRTY--KVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
             G  E RF IF+DN+ FI  +     TY   VG+N+FADLTN+E+ A Y G        
Sbjct: 49  CHGEKEHRFGIFRDNVHFIRGYKP-QVTYDSAVGINQFADLTNDEFVATYTG-------- 99

Query: 123 LMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
              +K    + A +  D +  P  +DWR +GAV  VKDQG+CGSCWAF+ VAA+EG+ KI
Sbjct: 100 ---AKPPHPKEAPRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKI 156

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC--D 238
            TG+L  LSEQELVDCD   N GC GG  D AF+ +   GG+ +E DY Y G + KC  D
Sbjct: 157 RTGQLTPLSEQELVDCDTNSN-GCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVD 215

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
               N    SI GY  V P DE  L  AVA QPV+V I+A G AFQ Y+SGVF G CG++
Sbjct: 216 DMLFN-HAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGAS 274

Query: 299 LDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
            +H V  VGY  +  +G  YWL +NSWG  WG+ GY+ L+++++  + G CG+A+   YP
Sbjct: 275 SNHAVTLVGYCQDGASGKKYWLAKNSWGKTWGQQGYILLEKDIVQPH-GTCGLAVSPFYP 333

Query: 357 V 357
            
Sbjct: 334 T 334


>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  273 bits (699), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 146/291 (50%), Positives = 186/291 (63%), Gaps = 19/291 (6%)

Query: 72  RFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
           RF  FK N+  I  HN+L N +Y +GLN+FADL+ EE++  Y G +   +R   +S    
Sbjct: 61  RFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYFGYK-HVEREFARSNNLH 119

Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE--LISL 188
           Q       +  P S+DWR   AV P+KDQG CGSCWAFS   ++EG   ++ G+  L SL
Sbjct: 120 QEV-----EAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIEGA-WVLQGKHTLTSL 173

Query: 189 SEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           SEQ+LVDC     +AGCNGGLMDYAF++II N G+ +E  YPY G    C  S    KVV
Sbjct: 174 SEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKGICAESAYPYKGVGGLCQKS--CTKVV 231

Query: 248 SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           +I GY+DV+  DE SL  AV    PVSVAIEA    FQ Y SGVF+G CG  LDHGV+AV
Sbjct: 232 TISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAV 291

Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GYGT    DYW+V+NSWG+ WGE+GY+++ R     N  +CGIA++ SYP 
Sbjct: 292 GYGTTGSQDYWIVKNSWGTSWGESGYIRMIR-----NKNQCGIAIQPSYPT 337


>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 517

 Score =  273 bits (699), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 147/365 (40%), Positives = 221/365 (60%), Gaps = 23/365 (6%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHS------SSWRTDDEVMTIYQTWLAKH 59
           M   + T +FL F+   S    + + Y    ++S        + +++ V+ ++Q W  ++
Sbjct: 1   MGCQLKTQLFLLFLVWGS---WTFLCYGLPSEYSILALEIDKFPSEEGVIELFQRWKEEN 57

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT---YKVGLNKFADLTNEEYRAMYLGTR 116
            K        + RF+ FK NL++I E NS   +     +GLN+FAD++NEE+++ +  T 
Sbjct: 58  KKIYRSPDQEKLRFENFKRNLKYIAEKNSKRISPYGQSLGLNRFADMSNEEFKSKF--TS 115

Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
              K    ++ ++ + ++C   ++ P S+DWR+KG V  VKDQG CG CWAFS+  A+EG
Sbjct: 116 KVKKPFSKRNGLSGKDHSC---EDAPYSLDWRKKGVVTAVKDQGYCGCCWAFSSTGAIEG 172

Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
           IN IV+G+LISLSE ELVDCDR  N GC+GG MDYAF++++ NGG+D+E +YPY GA+  
Sbjct: 173 INAIVSGDLISLSEPELVDCDR-TNDGCDGGHMDYAFEWVMHNGGIDTETNYPYSGADGT 231

Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
           C+ ++   KV+ IDGY +V   D  SL  A   QP+S  I+     FQ Y  G++ G+C 
Sbjct: 232 CNVAKEETKVIGIDGYYNVEQSDR-SLLCATVKQPISAGIDGSSWDFQLYIGGIYDGDCS 290

Query: 297 S---ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEA 353
           S    +DH ++ VGYG+E   DYW+V+NSWG+ WG  GY+ ++RN  +   G C I   A
Sbjct: 291 SDPDDIDHAILVVGYGSEGDEDYWIVKNSWGTSWGMEGYIYIRRN-TNLKYGVCAINYMA 349

Query: 354 SYPVK 358
           SYP K
Sbjct: 350 SYPTK 354


>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
          Length = 336

 Score =  273 bits (698), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 159/365 (43%), Positives = 213/365 (58%), Gaps = 38/365 (10%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MA+A   L + TL+ L  +++S+        Y+N  D       D   M +++ W+AK G
Sbjct: 1   MASA-FLLVVCTLMALQAMAASA-------YYNNGSD-------DGVTMQMFEEWMAKFG 45

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTY--KVGLNKFADLTNEEYRAMYLGTRSD 118
           KT    G  E RF IF+DN+ FI  +     TY   VG+N+FADLTN+E+ A Y G    
Sbjct: 46  KTYKCHGEKEHRFGIFRDNVHFIRGYKP-QVTYDSAVGINQFADLTNDEFVATYTG---- 100

Query: 119 AKRRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
                  +K    + A +  D +  P  +DWR +GAV  VKDQG+CGSCWAF+ VAA+EG
Sbjct: 101 -------AKPPHPKEAPRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEG 153

Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
           + KI TG+L  LSEQELVDCD   N GC GG  D AF+ +   GG+ +E DY Y G + K
Sbjct: 154 LTKIRTGQLTPLSEQELVDCDTNSN-GCGGGHTDRAFELVASKGGITAESDYRYEGFQGK 212

Query: 237 C--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE 294
           C  D    N    SI GY  V P DE  L  AVA QPV+V I+A G AFQ Y+SGVF G 
Sbjct: 213 CRVDDMLFN-HAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGP 271

Query: 295 CGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
           CG++ +H V  VGY  +  +G  YW+ +NSWG  WG+ GY+ L++++L  + G CG+A+ 
Sbjct: 272 CGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPH-GTCGLAVS 330

Query: 353 ASYPV 357
             YP 
Sbjct: 331 PFYPT 335


>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
           parachinensis]
          Length = 260

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 134/260 (51%), Positives = 178/260 (68%), Gaps = 4/260 (1%)

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVK 157
           +FA++TN+E+R+MY G + D+         ++  RY   +   LP +VDWR+KGAV P+K
Sbjct: 1   QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           +QGSCG CWAFS VAA+EG  +I  G+LISLSEQ+LVDCD   + GC+GGL+D AF+ I+
Sbjct: 61  NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCSGGLIDTAFEHIM 119

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
             GG+ +E +YPY G +  C          SI GYEDV   DE +L KAVA QPVSV IE
Sbjct: 120 ATGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVGIE 179

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQ 336
            GG  FQ Y SGVFTGEC + LDH V AVGY  +  G  YW+++NSWG+ WGE GY++++
Sbjct: 180 GGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRIK 239

Query: 337 RNLLDTNTGKCGIAMEASYP 356
           +++ D   G CG+AM+ASYP
Sbjct: 240 KDIKDKE-GLCGLAMKASYP 258


>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 140/262 (53%), Positives = 174/262 (66%), Gaps = 16/262 (6%)

Query: 71  KRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
           +RF IF DNL FI  HN+       T+ VG+N+FADLTNEEYR +YL  R      L + 
Sbjct: 39  RRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEEYRQLYL--RPYPTELLGRE 96

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +         AG     SVDWR+KGAV P+K+QG CGSCW+FST  +VEG + I TG L+
Sbjct: 97  RQEVWLDGPNAG-----SVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLV 151

Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           SLSEQ+LVDC     N GCNGGLMD AF++II NGG+D+EQDYPY   +  CD S+ +  
Sbjct: 152 SLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTARDGVCDKSKESKH 211

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
            VSI GY+DV   +E  L  AV   PVSVAIEA  ++FQ Y SGVF+G CG+ LDHGV+ 
Sbjct: 212 AVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGPCGTNLDHGVLV 271

Query: 306 VGYGTENGVDYWLVRNSWGSDW 327
           VGY +    DYW+V+NSWG+ W
Sbjct: 272 VGYTS----DYWIVKNSWGASW 289


>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
 gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
          Length = 323

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 141/318 (44%), Positives = 201/318 (63%), Gaps = 27/318 (8%)

Query: 44  TDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
           +DD  M   ++ W+A++G+         +RF++FK N+ FI+  N+ N  + +G+N+FAD
Sbjct: 28  SDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGVNQFAD 87

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGS 161
           LTN+E+R+    T+++       ++V +  R      D LP ++DWR KG V P+KDQG 
Sbjct: 88  LTNDEFRS----TKTNKGFIPSTTRVPTGFRNENVNIDALPATMDWRTKGVVTPIKDQGQ 143

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
           CG CWAFS VAA+E                ELVDCD    + GC GGLMD AF+FII+NG
Sbjct: 144 CGCCWAFSAVAAME----------------ELVDCDVHGEDQGCEGGLMDDAFKFIIKNG 187

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+ +E +YPY   ++K      +  V SI GYEDV   +E +L KAVA+QPVSVA++ G 
Sbjct: 188 GLTTESNYPYAAVDDKFKSVSNS--VASIKGYEDVPANNEAALMKAVANQPVSVAVDGGD 245

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             FQ Y+ GV TG CG+ LDHG+VA+GYG   +G  YWL++NSWG  WGENG++++++++
Sbjct: 246 MTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKDI 305

Query: 340 LDTNTGKCGIAMEASYPV 357
            D   G CG+AME SYP 
Sbjct: 306 SDKR-GMCGLAMEPSYPT 322


>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
          Length = 382

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 155/385 (40%), Positives = 214/385 (55%), Gaps = 32/385 (8%)

Query: 1   MATASMFLAISTLVFL--FFISSSSAADMSIIS-YDNNHDHSSSWRTDDEVMTIYQTWLA 57
           MA AS F     L+ L  FFI  SS     + S    N D   +  T   +M ++Q W A
Sbjct: 1   MAAASFFSMPCLLILLGVFFIGCSSGTARRVTSDTAANTDGEPAATT---MMEMFQRWKA 57

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGT- 115
           ++ ++        +R +++  N+R+I+  N+     Y++G   + DLTN+E+ AMY    
Sbjct: 58  EYNRSYATPEEERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDLTNDEFMAMYTAPP 117

Query: 116 -RSDAKRRLMKSKVASQRYACKAGDE-------------LPESVDWREKGAVNPVKDQGS 161
            RS A      +            DE              P SVDWR  GAV  VKDQG 
Sbjct: 118 LRSAADDDDDAATTTIITTRAGPVDEHQQPEVYFNESAGAPASVDWRASGAVTEVKDQGR 177

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
           CGSCWAFSTVA VEGI KI  G+L+SLSEQELVDCD  +++GC+GG+   A ++I  NGG
Sbjct: 178 CGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCD-TLDSGCDGGVSYRALEWITANGG 236

Query: 222 MDSEQDYPYLG-AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           + +  DYPY G A   CD ++      +I G   V+   E SL+ A A QPV+V+IEAGG
Sbjct: 237 ITTRDDYPYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNAAAAQPVAVSIEAGG 296

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYGTEN--------GVDYWLVRNSWGSDWGENGY 332
             FQHY  GV+ G CG+ L+HGV  VGYG E         G  YW+++NSWG +WG+ GY
Sbjct: 297 DNFQHYRKGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWIIKNSWGKNWGDQGY 356

Query: 333 VKLQRNLLDTNTGKCGIAMEASYPV 357
           +K+++++     G CGIA+  S+P+
Sbjct: 357 IKMKKDVAGKPEGLCGIAIRPSFPL 381


>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
 gi|194706676|gb|ACF87422.1| unknown [Zea mays]
 gi|413920745|gb|AFW60677.1| vignain [Zea mays]
          Length = 363

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 139/320 (43%), Positives = 199/320 (62%), Gaps = 16/320 (5%)

Query: 46  DEVMTI--YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT-YKVGLNKFAD 102
           DE M +  Y+ W+A++ +          RFQ+FK N  FID  N+  +  Y +G N+FAD
Sbjct: 51  DEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFAD 110

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKV--ASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
           LT++E+ AMY G R  A       ++  A  +Y      +    VDWR++GAV PVK+QG
Sbjct: 111 LTSKEFAAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDVQVDWRQQGAVTPVKNQG 170

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
            CG CWAFS V A+EG+  I TG L+SLSEQ+++DCD    N GCNGG MD AFQ++I N
Sbjct: 171 QCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVINN 230

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
           GG+ +E  YPY   +  C   +  A   +I G++D+   DE +L  AVA+QPVSV ++ G
Sbjct: 231 GGVTTEDAYPYSAVQGTCQNVQPAA---TISGFQDLPSGDENALANAVANQPVSVGVDGG 287

Query: 280 GRAFQHYESGVFTGE-CGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQR 337
              FQ Y+ G++ G+ CG+ ++H V A+GYG ++ G  YW+++NSWG+ WGENG+++LQ 
Sbjct: 288 SSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQLQM 347

Query: 338 NLLDTNTGKCGIAMEASYPV 357
            +     G CGI+  ASYP 
Sbjct: 348 GV-----GACGISTMASYPT 362


>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
 gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
          Length = 345

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 140/310 (45%), Positives = 198/310 (63%), Gaps = 12/310 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+  HG+        E RF+ FK+N+ FI+  N +  + YK+ +NK+ADLT EE+  
Sbjct: 41  HENWMVHHGRVYKDDIEKEHRFKTFKENVEFIESFNKNGTQRYKLAVNKYADLTTEEFTT 100

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
            ++G  +    +  +S   +  +   +  E+P S+DWR++G+V  VKDQG CG CWAFS 
Sbjct: 101 SFMGLDTSLLSQ-QESTATTTSFKYDSVTEVPNSMDWRKRGSVTGVKDQGVCGCCWAFSA 159

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN--GGMDSEQDY 228
            AA+EG  +I   ELISLSEQ+L+DC  + N GC GGLM  A+ F++QN  GG+ +E +Y
Sbjct: 160 AAAIEGAYQIANNELISLSEQQLLDCSTQ-NKGCEGGLMTVAYDFLLQNNGGGITTETNY 218

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY  A+N C   +  A  V+I+GYE V P DE SL KAV +QP+SV I A    F  Y S
Sbjct: 219 PYEEAQNVCKTEQPAA--VTINGYE-VVPSDESSLLKAVVNQPISVGI-AANDEFHMYGS 274

Query: 289 GVFTGECGSALDHGVVAVGYGT--ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
           G++ G C S L+H V  +GYGT  E+G  YW+V+NSWGSDWGE GY+++ R+ +  + G 
Sbjct: 275 GIYDGSCNSRLNHAVTVIGYGTSEEDGTKYWIVKNSWGSDWGEEGYMRIARD-VGVDGGH 333

Query: 347 CGIAMEASYP 356
           CGIA  AS+P
Sbjct: 334 CGIAKVASFP 343


>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
 gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
          Length = 326

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 145/318 (45%), Positives = 203/318 (63%), Gaps = 22/318 (6%)

Query: 44  TDDEVMTIYQTWLAKHGK--TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
           +  +  T +Q W+ KH K  T++  G    R+ +F+DN+  + + N       +GLN  A
Sbjct: 24  SQKQYQTAFQNWMVKHQKSYTNDEFG---SRYSVFQDNMDIVAKWNQKGSNTILGLNVMA 80

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           DLTNEE++ +YLGT         K+ V  ++        LP SVDWR  GAV  VK+QG 
Sbjct: 81  DLTNEEFKKLYLGT---------KANVTYKKKTLVGVSGLPASVDWRANGAVTAVKNQGQ 131

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNG 220
           CG C+AFST  +VEGI++I + +L+ LSEQ+++DC   + N GC+GGLM  +F++II  G
Sbjct: 132 CGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDCSGSEGNNGCDGGLMTNSFEYIIAVG 191

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+D+E  YPY G   KC  +++N    +I GY++V    E  L+ AVA QPVSVAI+A  
Sbjct: 192 GLDTEASYPYTGEVGKCKFNKKNIG-ATITGYKNVESGSESDLQTAVAAQPVSVAIDASQ 250

Query: 281 RAFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
            +FQ Y SGV +  EC S  LDHGV+AVGYG+++G DYW+V+NSWG+DWGENG++ + RN
Sbjct: 251 SSFQLYASGVYYEPECSSTQLDHGVLAVGYGSQSGQDYWIVKNSWGADWGENGFILMARN 310

Query: 339 LLDTNTGKCGIAMEASYP 356
             D N   CGIA  AS+P
Sbjct: 311 -KDNN---CGIATMASFP 324


>gi|302779822|ref|XP_002971686.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
 gi|300160818|gb|EFJ27435.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
          Length = 214

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 127/216 (58%), Positives = 160/216 (74%), Gaps = 3/216 (1%)

Query: 144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAG 203
           SVDWR+KG V  +KDQG CG+CWAFS +AAVEG+  + TG L+SLSEQELVDCD  +N G
Sbjct: 1   SVDWRKKGGVTEIKDQGDCGNCWAFSAIAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQG 60

Query: 204 CNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSL 263
           C+GG+MDYAFQ++I+NGG+ S+ +YPY      CD  +      +I+G++ + P  E  L
Sbjct: 61  CDGGMMDYAFQYMIRNGGITSQSNYPYRAQRGACDKDKVKYHAATINGFQAIPPQSEELL 120

Query: 264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNS 322
            +AVA+QPVSVAIEAGG+ FQ Y SGVFTGECGS LDHGV  VGYGT+  G  YWLV+NS
Sbjct: 121 LRAVANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNS 180

Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           WGS WGE+GYV+++R       G CGI ++ASYP K
Sbjct: 181 WGSGWGESGYVRMERQ--GPGAGVCGINLDASYPTK 214


>gi|2098464|pdb|1PCI|A Chain A, Procaricain
 gi|2098465|pdb|1PCI|B Chain B, Procaricain
 gi|2098466|pdb|1PCI|C Chain C, Procaricain
          Length = 322

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 149/338 (44%), Positives = 207/338 (61%), Gaps = 20/338 (5%)

Query: 26  DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
           D SI+ Y  + D +S+ R    ++ ++ +W+  H K    +     RF+IFKDNL +IDE
Sbjct: 1   DFSIVGYSQD-DLTSTER----LIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDE 55

Query: 86  HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE----L 141
            N  N +Y +GLN+FADL+N+E+   Y+G+  DA           Q Y  +  +E    L
Sbjct: 56  TNKKNNSYWLGLNEFADLSNDEFNEKYVGSLIDA--------TIEQSYDEEFINEDIVNL 107

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PE+VDWR+KGAV PV+ QGSCGSCWAFS VA VEGINKI TG+L+ LSEQELVDC+R+ +
Sbjct: 108 PENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-S 166

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GC GG   YA +++ +NG +     YPY   +  C   +    +V   G   V P +E 
Sbjct: 167 HGCKGGYPPYALEYVAKNG-IHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEG 225

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           +L  A+A QPVSV +E+ GR FQ Y+ G+F G CG+ +D  V AVGYG   G  Y L++N
Sbjct: 226 NLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDGAVTAVGYGKSGGKGYILIKN 285

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           SWG+ WGE GY++++R     + G CG+   + YP KN
Sbjct: 286 SWGTAWGEKGYIRIKR-APGNSPGVCGLYKSSYYPTKN 322


>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
          Length = 338

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 145/312 (46%), Positives = 191/312 (61%), Gaps = 11/312 (3%)

Query: 49  MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
           M ++  +  K+GK  NG+  +  RF IFK N+  I   N+ N T+ +G+N+F DLT EE 
Sbjct: 24  MMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEEL 83

Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
            A Y G +  +    +  ++++  Y    G  L  SVDW  +G V PVK+QG CGSCW+F
Sbjct: 84  AASYTGLKPASLWSGLP-RLSTHEYN---GAPLASSVDWTTQGVVTPVKNQGQCGSCWSF 139

Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           ST  A+EG   + TG L+SLSEQ+ VDCD   ++GCNGG MD AF F  +N  + +E  Y
Sbjct: 140 STTGALEGAWALSTGNLVSLSEQQFVDCDTT-DSGCNGGWMDNAFSFAKKNS-ICTEGSY 197

Query: 229 PYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
           PY   +  C+ S     +    + GY DVS   E ++  AVA QPVS+AIEA   +FQ Y
Sbjct: 198 PYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLY 257

Query: 287 ESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
            SGV T  CG+ LDHGV+AVGYG+E G DYW V+NSWGS WGE GYV+LQR       G+
Sbjct: 258 SSGVLTASCGTRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRG--KGGAGE 315

Query: 347 CG-IAMEASYPV 357
           CG +A   SYPV
Sbjct: 316 CGLLAGPPSYPV 327


>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
          Length = 533

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 143/307 (46%), Positives = 193/307 (62%), Gaps = 8/307 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVGLNKFADLTNEEYR 109
           +  W+  HG T +      +R + +  N  +I EHN+ N      +G N F+ ++ +E++
Sbjct: 28  FSAWMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTGVTLGHNAFSHMSFDEFK 87

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
               G         ++ ++AS+     +  E+P +VDW +KG V PVK+QG CGSCWAFS
Sbjct: 88  FKMTGLV--LPEGYLEQRLASRVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCWAFS 145

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           T  AVEG   + +G+L SLSEQELVDCD   + GCNGGLMD+AFQ+I  +GG+ SE DY 
Sbjct: 146 TTGAVEGATFVSSGKLPSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYE 205

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y      C   R    VV + G++DV+P DE +LK AVA QPVSVAIEA  +AFQ Y+SG
Sbjct: 206 YKAKAQVC---RECDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSG 262

Query: 290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           VF   CG+ LDHGV+AVGYG +NG  +W V+NSWG+ WGE GY++L R   +   G+CGI
Sbjct: 263 VFNLTCGTRLDHGVLAVGYGNDNGHKFWKVKNSWGASWGEQGYIRLARE-ENGPAGQCGI 321

Query: 350 AMEASYP 356
           A   SYP
Sbjct: 322 ASVPSYP 328


>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
          Length = 362

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 138/322 (42%), Positives = 202/322 (62%), Gaps = 21/322 (6%)

Query: 46  DEVMTI--YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           DE M +  Y+ W+A++ +          RFQ+FK N  FID  N+   + Y +G N+FAD
Sbjct: 51  DEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFAD 110

Query: 103 LTNEEYRAMYLGTRSDAK----RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           LT++E+ AMY G R  A      + + +    Q +  +  D++   VDWR++GAV PVK+
Sbjct: 111 LTSKEFAAMYTGLRKPAAVPSGAKQIPAGFKYQNFT-RLDDDV--QVDWRQQGAVTPVKN 167

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFII 217
           QG CG CWAFS V A+EG+  I TG L+SLSEQ+++DCD    N GCNGG MD AFQ+++
Sbjct: 168 QGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVV 227

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
            NGG+ +E  YPY   +  C   +  A   +I G++D+   DE +L  AVA+QPVSV ++
Sbjct: 228 NNGGVTTEDAYPYSAVQGTCQNVQPAA---TISGFQDLPSGDENALANAVANQPVSVGVD 284

Query: 278 AGGRAFQHYESGVFTGE-CGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKL 335
            G   FQ Y+ G++ G+ CG+ ++H V A+GYG ++ G  YW+++NSWG+ WGENG+++L
Sbjct: 285 GGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQL 344

Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
           Q  +     G CGI+  ASYP 
Sbjct: 345 QMGV-----GACGISTMASYPT 361


>gi|18202414|sp|P82473.1|CPGP1_ZINOF RecName: Full=Zingipain-1; AltName: Full=Cysteine proteinase GP-I
          Length = 221

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 130/220 (59%), Positives = 166/220 (75%), Gaps = 3/220 (1%)

Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
           D LP+S+DWREKGAV PVK+QG CGSCWAF  +AAVEGIN+IVTG+LISLSEQ+LVDC  
Sbjct: 1   DVLPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCST 60

Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
           + N GC GG    AFQ+II NGG++SE+ YPY G    CD ++ NA VVSID Y +V   
Sbjct: 61  R-NHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCD-TKENAHVVSIDSYRNVPSN 118

Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
           DE SL+KAVA+QPVSV ++A GR FQ Y +G+FTG C  + +H     G  TEN  DYW 
Sbjct: 119 DEKSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTVGGRETENDKDYWT 178

Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           V+NSWG +WGE+GY++++RN+ ++ +GKCGIA+  SYP+K
Sbjct: 179 VKNSWGKNWGESGYIRVERNIAES-SGKCGIAISPSYPIK 217


>gi|222632170|gb|EEE64302.1| hypothetical protein OsJ_19139 [Oryza sativa Japonica Group]
          Length = 1105

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 122/218 (55%), Positives = 160/218 (73%), Gaps = 1/218 (0%)

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           +P++VDWR+ GAV  VKDQGSCG+CW+FS   A+EGINKI TG LISLSEQEL+DCDR  
Sbjct: 129 VPDAVDWRQSGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELIDCDRSY 188

Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
           N+GC GGLMDYA++F+++NGG+D+E DYPY   +  C+ ++   +VV+IDGY+DV   +E
Sbjct: 189 NSGCGGGLMDYAYKFVVKNGGIDTEADYPYRETDGTCNKNKLKRRVVTIDGYKDVPANNE 248

Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
             L +AVA QPVSV I    RAFQ Y  G+F G C ++LDH ++ VGYG+E G DYW+V+
Sbjct: 249 DMLLQAVAQQPVSVGICGSARAFQLYSKGIFDGPCPTSLDHAILIVGYGSEGGKDYWIVK 308

Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           NSWG  WG  GY+ + RN  ++N G CGI    S+P K
Sbjct: 309 NSWGESWGMKGYMYMHRNTGNSN-GVCGINQMPSFPTK 345


>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
 gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
          Length = 328

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 151/352 (42%), Positives = 210/352 (59%), Gaps = 34/352 (9%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGK--TSNGMG 67
           I  LVF F I +  +A                  +  +  T +Q W+ KH K  T++  G
Sbjct: 4   ILALVFCFLIVNCISAARVF--------------SQKQYQTAFQNWMVKHQKSYTNDEFG 49

Query: 68  HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
               R+ IF+DN+ F+ + N       +GLN  ADLTN+EY+ +YLGT++  K+  +   
Sbjct: 50  ---SRYTIFQDNMDFVTKWNQKGSDTILGLNSMADLTNQEYQRIYLGTKTTVKKPNLIIG 106

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
           V     A       P SVDWR  GAV  VK+QG CG C++FST  +VEGI++I + +L+S
Sbjct: 107 VTDVSKA-------PASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVS 159

Query: 188 LSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           LSEQ+++DC   + N GC+GGLM  +F++II  GG+D+E  YPY G   KC  ++ N   
Sbjct: 160 LSEQQILDCSGSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYEGVVGKCKFNKANIG- 218

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVV 304
            +I GY++V    E  L+ AVA QPVSVAI+A   +FQ Y SGV +   C S  LDHGV+
Sbjct: 219 ATITGYKNVKSGSESDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVL 278

Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           AVGYG+++G DYW+V+NSWG+DWGE G++ + RN        CGIA  ASYP
Sbjct: 279 AVGYGSQSGQDYWIVKNSWGADWGEKGFILMARN----KHNNCGIATMASYP 326


>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 345

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 142/359 (39%), Positives = 217/359 (60%), Gaps = 21/359 (5%)

Query: 4   ASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS 63
           AS+ + ++ L+ LF     S A    + +            +  ++  ++ W+A+  +  
Sbjct: 2   ASIMVLVTVLIILFTGFRISQATSRTVIF-----------REQSMVDKHEQWMARFSREY 50

Query: 64  NGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAK-- 120
                   R  +FK NL+FI+  N   N++YK+G+N+FAD TNEE+ A++ G +   +  
Sbjct: 51  RDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVS 110

Query: 121 -RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
             +++   ++SQ +     D + ES DWR +GAV PVK QG CG CWAFS VAAVEG+ K
Sbjct: 111 PSKVVAKTISSQTW--NVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAK 168

Query: 180 IVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
           I  G L+SLSEQ+L+DCDR+ +  C+GG+M  AF +++QN G+ SE DY Y G++  C  
Sbjct: 169 IAGGNLVSLSEQQLLDCDREYDRDCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRS 228

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL 299
           + R A  +S  G++ V   +E +L +AV+ QPVSV+++A G  F HY  GV+ G CG++ 
Sbjct: 229 NARPAARIS--GFQTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSS 286

Query: 300 DHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           +H V  VGYGT ++G  YWL +NSWG  W E GY++++R++     G CG+A  A YPV
Sbjct: 287 NHAVTFVGYGTSQDGTKYWLAKNSWGETWEEKGYIRIRRDVA-WPQGMCGVAQYAFYPV 344


>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
          Length = 329

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 146/320 (45%), Positives = 198/320 (61%), Gaps = 19/320 (5%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEK---RFQIFKDNLRFIDEHNSLNRTYKVGLNKF 100
           T D +  ++  W+  + K+ +    NE+   R+ ++++N + I+EHN  N+T  + +NKF
Sbjct: 22  THDPLTGVFAEWMRDNSKSYS----NEEFVFRWNVWRENQQLIEEHNRSNKTSFLAMNKF 77

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
            DLTN E+  ++ G   D       +K A+++     G  L    DWR+KGAV  VK+QG
Sbjct: 78  GDLTNAEFNKLFKGLAFD--YSFHANKAAAEKAVPAPG--LSADFDWRQKGAVTHVKNQG 133

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
            CGSCW+FST  + EG N + TG L SLSEQ L+DC     N GCNGGLMDYAF++II N
Sbjct: 134 QCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINN 193

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
            G+D+E  YPY  A+  C  +  N+   S+  Y DVS  DE +L  AVA +P SVAI+A 
Sbjct: 194 KGIDTEASYPYQTAQYTCQYNPANSG-GSLTSYTDVSSGDENALLNAVATEPTSVAIDAS 252

Query: 280 GRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
             +FQ Y  GV +   C S  LDHGV+AVG+GTE+G DYWLV+NSWG+DWG  GY+K+ R
Sbjct: 253 HNSFQFYSGGVYYESACSSTQLDHGVLAVGWGTEDGQDYWLVKNSWGADWGLAGYIKMAR 312

Query: 338 NLLDTNTGKCGIAMEASYPV 357
           N     +  CGIA  ASYP 
Sbjct: 313 N----RSNNCGIATSASYPT 328


>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
 gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
          Length = 364

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 148/290 (51%), Positives = 196/290 (67%), Gaps = 8/290 (2%)

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
           EKR +IFK+NL +I+  N+  N++YK+GLN+++DLT++E+ A + G +    ++L  SK+
Sbjct: 80  EKRKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGLK--VSKQLSSSKM 137

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
            S        D++P + DWR++GAV  VKDQGSCG CWAFS VAAVEG  KI TGELISL
Sbjct: 138 RSAAVPFNLNDDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGELISL 197

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           SEQ+LVDCD + N+GC+GG MD AF++IIQ  G+ SE DYPY      C  + +      
Sbjct: 198 SEQQLVDCDER-NSGCHGGNMDSAFKYIIQK-GIVSEADYPYQEGSQTCQLNDQMKFEAQ 255

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
           I  + DV   DE  L +AVA QPVSV IE G   FQHY   V++G CG +++H V AVGY
Sbjct: 256 ITNFIDVPANDEQQLLQAVAQQPVSVGIEVGDE-FQHYMGDVYSGTCGQSMNHAVTAVGY 314

Query: 309 G-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           G +E+G  YWL++NSWG  WGE GY+KL R   +   G+CGIA  ASYP+
Sbjct: 315 GVSEDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPG-GQCGIAAHASYPI 363


>gi|308082013|ref|NP_001183396.1| uncharacterized protein LOC100501813 [Zea mays]
 gi|238011208|gb|ACR36639.1| unknown [Zea mays]
          Length = 291

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 124/178 (69%), Positives = 150/178 (84%), Gaps = 1/178 (0%)

Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           +ISLSEQELVDCD   N GCNGGLMDYAF+FII NGG+D+E+DYPY G + +CD +R+NA
Sbjct: 1   MISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNA 60

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVV 304
           KVV+ID YEDV    E SL+KAVA+QP+SVAIEAGGRAFQ Y SG+FTG CG+ALDHGV 
Sbjct: 61  KVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVT 120

Query: 305 AVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
           AVGYGTENG DYW+V+NSWGS WGE+GYV+++RN +  ++GKCGIA+E SYP+K   N
Sbjct: 121 AVGYGTENGKDYWIVKNSWGSSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGAN 177


>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
          Length = 319

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 150/323 (46%), Positives = 197/323 (60%), Gaps = 24/323 (7%)

Query: 44  TDDEV-MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTY--KVGLNKF 100
           +DD V M +++ W+AK GKT    G  E RF IF+DN+ FI  +     TY   VG+N+F
Sbjct: 11  SDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKP-QVTYDSAVGINQF 69

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKD 158
           ADLTN+E+ A Y G           +K    + A +  D +  P  +DWR +GAV  VKD
Sbjct: 70  ADLTNDEFVATYTG-----------AKPPHPKEAPRPVDPIWTPCCIDWRFRGAVTGVKD 118

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QG+CGSCWAF+ VAA+EG+ KI TG+L  LSEQELVDCD   N GC GG  D AF+ +  
Sbjct: 119 QGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTNSN-GCGGGHTDRAFELVAS 177

Query: 219 NGGMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
            GG+ +E DY Y G + KC  D    N    SI GY  V P DE  L  AVA QPV+V I
Sbjct: 178 KGGITAESDYRYEGFQGKCRVDDMLFN-HAASIGGYRAVPPNDERQLATAVARQPVTVYI 236

Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVK 334
           +A G AFQ Y+SGVF G CG++ +H V  VGY  +  +G  YWL +NSWG  WG+ GY+ 
Sbjct: 237 DASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYWLAKNSWGKTWGQQGYIL 296

Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
           L+++++  + G CG+A+   YP 
Sbjct: 297 LEKDIVQPH-GTCGLAVSPFYPT 318


>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
          Length = 338

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 144/312 (46%), Positives = 191/312 (61%), Gaps = 11/312 (3%)

Query: 49  MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY 108
           M ++  +  K+GK  NG+  +  RF IFK N+  I   N+ N T+ +G+N+F DLT EE+
Sbjct: 24  MMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLTQEEF 83

Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
            A Y G +  +    +  ++++  Y    G  L  SVDW  +G V PVK+QG CGSCW+F
Sbjct: 84  AASYTGLKPASLWSGLP-RLSTHEYN---GAPLASSVDWTTQGVVTPVKNQGQCGSCWSF 139

Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           ST  A+EG   + TG L+SLSEQ+  DCD   ++GCNGG MD AF F  +N  + +E  Y
Sbjct: 140 STTGALEGAWALSTGNLVSLSEQQFEDCD-TTDSGCNGGWMDNAFSFAKKNS-ICTEGSY 197

Query: 229 PYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
           PY   +  C+ S     +    + GY DVS   E ++  AVA QPVS+AIEA   +FQ Y
Sbjct: 198 PYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLY 257

Query: 287 ESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
            SGV T  CG+ LDHGV+AVGYG+E G DYW V+NSWGS WGE GYV+LQR       G+
Sbjct: 258 SSGVLTASCGTRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRG--KGGAGE 315

Query: 347 CG-IAMEASYPV 357
           CG +A   SYPV
Sbjct: 316 CGLLAGPPSYPV 327


>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
          Length = 342

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 155/357 (43%), Positives = 208/357 (58%), Gaps = 31/357 (8%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEV-MTIYQTWLAKHGKTSNGMGH 68
           +++ V L   +  +   M   +Y NN        +DD V M +++ W+AK GKT    G 
Sbjct: 7   MASAVLLVVCTLMALQAMGADAYYNNG-------SDDGVTMQMFEEWMAKFGKTYKCHGE 59

Query: 69  NEKRFQIFKDNLRFIDEHNSLNRTY--KVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
            E RF IF+DN+ FI  +     TY   VG+N+FADLTN+E+ A Y G           +
Sbjct: 60  KEHRFGIFRDNVHFIRGYKP-QVTYDSAVGINQFADLTNDEFVATYTG-----------A 107

Query: 127 KVASQRYACKAGDEL--PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
           K    + A +  D +  P  +DWR +GAV  VKDQG+CGSCWAF+ VAA+EG+ KI TG+
Sbjct: 108 KPPHPKEAPRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQ 167

Query: 185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC--DPSRR 242
           L  LSEQELVDCD   N GC GG  D AF+ +   GG+ +E DY Y G + KC  D    
Sbjct: 168 LTPLSEQELVDCDTNSN-GCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLF 226

Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHG 302
           N     I GY  V P DE  L  AVA QPV+V I+A G AFQ Y+SGVF G CG++ +H 
Sbjct: 227 N-HAARIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHA 285

Query: 303 VVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           V  VGY  +  +G  YW+ +NSWG  WG+ GY+ L++++L  + G CG+A+   YP 
Sbjct: 286 VTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPH-GTCGLAVSPFYPT 341


>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
          Length = 319

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 150/323 (46%), Positives = 197/323 (60%), Gaps = 24/323 (7%)

Query: 44  TDDEV-MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTY--KVGLNKF 100
           +DD V M +++ W+AK GKT    G  E RF IF+DN+ FI  +     TY   VG+N+F
Sbjct: 11  SDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKP-QVTYDSAVGINQF 69

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKD 158
           ADLTN+E+ A Y G           +K    + A +  D +  P  +DWR +GAV  VKD
Sbjct: 70  ADLTNDEFVATYTG-----------AKPPHPKEAPRPVDPIWTPCCIDWRFRGAVTGVKD 118

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QG+CGSCWAF+ VAA+EG+ KI TG+L  LSEQELVDCD   N GC GG  D AF+ +  
Sbjct: 119 QGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTNSN-GCGGGHTDRAFELVAS 177

Query: 219 NGGMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
            GG+ +E DY Y G + KC  D    N    SI GY  V P DE  L  AVA QPV+V I
Sbjct: 178 KGGITAESDYRYEGFQGKCRVDDMLFN-HAASIGGYRAVPPNDERQLATAVARQPVTVYI 236

Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVK 334
           +A G AFQ Y+SGVF G CG++ +H V  VGY  +  +G  YW+ +NSWG  WG+ GY+ 
Sbjct: 237 DASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYIL 296

Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
           L++++L  + G CG+A+   YP 
Sbjct: 297 LEKDVLQPH-GTCGLAVSPFYPT 318


>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 351

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 141/362 (38%), Positives = 215/362 (59%), Gaps = 19/362 (5%)

Query: 3   TASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKT 62
           T   FL +  ++  F  +   + ++              + ++  +M +Y+ W + H + 
Sbjct: 2   TVMKFLIVPLVLIAFLCNICESFELE----------RKDFESEKSLMQLYKRW-SSHHRI 50

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
           S        RF++FK+N + + + N + ++ K+ LN+FAD++++E+R MY    +  K  
Sbjct: 51  SRNANEMHNRFKVFKNNAKHVFKVNLMGKSLKLKLNQFADMSDDEFRNMYSSNITYYKDL 110

Query: 123 LMKSKVASQR----YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN 178
             K   A+      +  +  + +P S+DWR+KGAVN +K+QG CGSCWAF+ VAAVE I+
Sbjct: 111 HAKKIEATGGRIGGFMYEHANNIPSSIDWRKKGAVNAIKNQGRCGSCWAFAAVAAVESIH 170

Query: 179 KIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
           +I T EL+SLSE+E++DCD + + GC GG  + AF+F++ N G+  E +YPY      C 
Sbjct: 171 QIKTNELVSLSEEEVLDCDYR-DGGCRGGFYNSAFEFMMDNDGVTIEDNYPYYEGNGYCR 229

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE--CG 296
                 K V IDGYE+V   +E +L KAVA QPV+VAI +GG  F+ Y  G+FT    CG
Sbjct: 230 RRGGRNKRVRIDGYENVPRNNEYALMKAVAHQPVAVAIASGGSDFKFYGGGMFTENDFCG 289

Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
             +DH VV VGYGT+   DYW++RN +G  WG NGY+K+QR    +  G CG+AM+ +YP
Sbjct: 290 FNIDHTVVVVGYGTDEDGDYWIIRNQYGHRWGMNGYMKMQRG-AHSPQGVCGMAMQPAYP 348

Query: 357 VK 358
           VK
Sbjct: 349 VK 350


>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 346

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 152/359 (42%), Positives = 217/359 (60%), Gaps = 25/359 (6%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDE--VMTIYQTWLAKHGKTSNGMG 67
           +++++F+F         ++I+S       ++S  T  E  V   +Q W+ +  +  +   
Sbjct: 1   MTSILFMF-------VSLTILSMSLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDEL 53

Query: 68  HNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRS---DAKRRL 123
             + RF +FK NL+FI++ N   +RTYK+G+N+FAD T EE+ A + G +          
Sbjct: 54  EKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTKEEFIATHTGLKGFNGIPSSEF 113

Query: 124 MKSKVASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
           +   + S  +     AG   PE  DWR +GAV PVK QG CG CWAFS+VAAVEG+ KIV
Sbjct: 114 VDEMIPSWNWNVSDVAG---PEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIV 170

Query: 182 TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
            G L+SLSEQ+L+DCDR+ + GCNGG+M  AF +II+N G+ SE  YPY   E  C   R
Sbjct: 171 GGNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQETEGTC---R 227

Query: 242 RNAKVVS-IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGSAL 299
            NAK  + I G++ V   +E +L +AV+ QPVSV+I+A G  F HY  GV+    CG+ +
Sbjct: 228 YNAKPSAWIRGFQTVPSNNERALLEAVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDV 287

Query: 300 DHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           +H V  VGYGT   G+ YWL +NSWG  WGENGY++++R++     G CG+A  A YPV
Sbjct: 288 NHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQ-GMCGVAQYAFYPV 345


>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
          Length = 232

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 129/229 (56%), Positives = 165/229 (72%), Gaps = 5/229 (2%)

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
           RY   + D +P ++DWR  GAV P+KDQG CG CWAFS VAA EGI KI TG+LISLSEQ
Sbjct: 7   RYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQ 66

Query: 192 ELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
           ELVDCD    + GC GGLMD AF+FII+NGG+ +E +YPY  A+ KC     +A   +I 
Sbjct: 67  ELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSNSA--ANIK 124

Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG- 309
           GYEDV   DE +L KAVA+QPVSVA++ G   FQ Y  GV TG CG+ LDHG+ A+GYG 
Sbjct: 125 GYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGK 184

Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           T +G  YWL++NSWG+ WGENGY+++++++ D   G CG+A+E SYP +
Sbjct: 185 TSDGTKYWLMKNSWGTTWGENGYLRMEKDISD-KKGMCGLAIEPSYPTE 232


>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
 gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
          Length = 325

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 145/312 (46%), Positives = 197/312 (63%), Gaps = 20/312 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           +  W   HGKT  G   + +R  I+ DNL  + +HN+ N +YK+ +N FADLT  E++  
Sbjct: 27  WHAWKDFHGKTYTGEEEDLRR-AIWNDNLEIVKKHNAENHSYKLDMNHFADLTVTEFKQR 85

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           ++G R+ +      +      +   +  +LP  VDWR+KG V  VK+QG CGSCWAFS+ 
Sbjct: 86  FMGYRAAS------NSTGGSTFLPLSNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSST 139

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
            ++EG +   TG+L+SLSEQ LVDC +K  N GC GGLMDYAF++I  N G+D+EQ YPY
Sbjct: 140 GSLEGQHFRKTGKLVSLSEQNLVDCSKKYGNNGCEGGLMDYAFKYIKNNDGIDTEQSYPY 199

Query: 231 LGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYE 287
              + +C   P    A V    GY DV    E  L+ AVA   P+SVAI+AG  +FQ Y+
Sbjct: 200 TARDGQCHFKPGSVGATVT---GYTDVQRGSEGDLQSAVATVGPISVAIDAGHSSFQLYK 256

Query: 288 SGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           +GV++  +C S  LDHGV+AVGYG E+G DYWLV+NSWG  WG NGY+K+ RN       
Sbjct: 257 TGVYSEPDCSSTQLDHGVLAVGYGAEDGKDYWLVKNSWGEGWGMNGYIKMSRN----KDN 312

Query: 346 KCGIAMEASYPV 357
           +CGIA +ASYP+
Sbjct: 313 QCGIATQASYPL 324


>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 334

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 143/311 (45%), Positives = 192/311 (61%), Gaps = 12/311 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRA 110
           ++ W    GK+ +       R  +++ N   +D HN     +Y +G+N FADLT+EE++ 
Sbjct: 30  FEAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFKR 89

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
            YLGT+ D  R   +S  +S          LP+SVDWR  G V PVKDQG CGSCW+FST
Sbjct: 90  FYLGTKVDLNRP--RSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWSFST 147

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
             +VEG +   TG+L+SLSEQ LVDC + + N GCNGGLMD AFQ+II N G+D+E  YP
Sbjct: 148 TGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEASYP 207

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYES 288
           Y   +  C  +  N    ++  ++D++   E  L+ AVA   PVSVAI+A   +FQ Y S
Sbjct: 208 YTAKDGTCKFNAANVG-ATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQLYTS 266

Query: 289 GVFT-GECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
           GV+   +C S +LDHGV+A GYGT NG  YWLV+NSWGS WG+ GY+ + RN       +
Sbjct: 267 GVYNEKKCSSTSLDHGVLAAGYGTSNGTPYWLVKNSWGSSWGQAGYIWMSRNA----NNQ 322

Query: 347 CGIAMEASYPV 357
           CGIA  ASYP+
Sbjct: 323 CGIATSASYPI 333


>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
           [Brachypodium distachyon]
          Length = 334

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 143/324 (44%), Positives = 199/324 (61%), Gaps = 13/324 (4%)

Query: 45  DDEVMT-IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-----RTYKVGLN 98
           DD+ M   Y+ W+A+ G+T        +RF++FK N  FID HN+          K+  N
Sbjct: 12  DDKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTN 71

Query: 99  KFADLTNEEYRAMYL-GTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           KFADLT +E+R +Y+ G R + +   + +    +  A    D +P S+DWR +GAV  VK
Sbjct: 72  KFADLTEDEFRNIYVTGHRVNYRPTSLVTDTVFKFGAVSLSD-VPPSIDWRARGAVTSVK 130

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           DQ  C  CWAFS+ AAVEGI++I TG  +SLS Q+LVDC    N  C  G +D A+++I 
Sbjct: 131 DQHLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEKCKAGEIDKAYEYIA 190

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           ++GG+ ++QDYPY G    C    + A V  I G++ V   +E +L  AVA QPVSVA++
Sbjct: 191 RSGGLVADQDYPYEGHSGTCRVYGKQA-VARISGFQYVPARNETALLLAVAHQPVSVALD 249

Query: 278 AGGRAFQHYESGVF--TGE-CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYV 333
              RA QH  +G+F   GE C + L+H +  VGYGT E+G  YWL++NSWGSDWG+ GYV
Sbjct: 250 GLSRALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGTDEHGTRYWLMKNSWGSDWGDKGYV 309

Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
           K  R++     G CG+A+EASYPV
Sbjct: 310 KFARDVASEINGVCGLALEASYPV 333


>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
          Length = 330

 Score =  268 bits (686), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 148/318 (46%), Positives = 197/318 (61%), Gaps = 18/318 (5%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEK---RFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
           D +  ++  W+  H K+ +    NE+   R+ ++++N  FI E N  N +Y + +NKF D
Sbjct: 24  DPLTGVFADWMRTHTKSYS----NEEFVFRWNVWRENYNFIQEENRKNNSYYLTMNKFGD 79

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           LTN E+  +Y G   D    ++K+K A+          LP + DWR+KGAV  VK+QG C
Sbjct: 80  LTNAEFNKVYKGLAFDYSAHILKAKAATPA---APAPGLPANFDWRQKGAVTHVKNQGQC 136

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
           GSCW+FST  + EG N +  G L+SLSEQ L+DC     N GCNGGLMDYAF++II N G
Sbjct: 137 GSCWSFSTTGSTEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKG 196

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           +D+E  YPY  A+  C  +  N+   S+  Y DVS  DE +L  AVA +P SVAI+A   
Sbjct: 197 IDTEASYPYETAQYNCRYNPANSG-GSLTSYTDVSSGDENALLNAVAIEPTSVAIDASHN 255

Query: 282 AFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
           +FQ Y  GV +   C S  LDHGV+AVG+GTENG DYWLV+NSWG+DWG  GY+K+ RN 
Sbjct: 256 SFQFYSGGVYYESSCSSTQLDHGVLAVGWGTENGQDYWLVKNSWGADWGLQGYIKMARNR 315

Query: 340 LDTNTGKCGIAMEASYPV 357
            +     CGIA  ASYP 
Sbjct: 316 HN----NCGIATAASYPT 329


>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
 gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
          Length = 330

 Score =  268 bits (686), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 140/316 (44%), Positives = 192/316 (60%), Gaps = 12/316 (3%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
           DE    +Q W   H K    +     R  I++DNL+ I +HN+   ++ + +N   DLT 
Sbjct: 22  DEDEQQWQAWKLFHTKKYTTVTEEGARKAIWRDNLKKIQKHNAEGHSFTLAMNHLGDLTQ 81

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
           +E+R  Y G RS       K   A   +   +  ++P++VDWR++G V PVK+QG CGSC
Sbjct: 82  DEFRYFYTGMRSHYSNYTKKQGSA---FLAPSHVQVPDTVDWRKEGYVTPVKNQGQCGSC 138

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
           WAFST  ++EG N   TG+L+SLSEQ LVDC     N GC GGLMDYAF++I +NGG+D+
Sbjct: 139 WAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCQGGLMDYAFKYIKENGGIDT 198

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAF 283
           E+ YPY    ++C   + N   V   G+ DV+  DE +LK A     P+SVAI+AG  +F
Sbjct: 199 EESYPYEARNDRCRFQKSNIGAVDT-GFVDVTHGDEEALKTAAGTVGPISVAIDAGHMSF 257

Query: 284 QHYESGVF--TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           Q Y SGV+   G   ++LDHGV+ VGYGT  G DYWLV+NSWG  WG  GY+ + RN   
Sbjct: 258 QFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQGSDYWLVKNSWGERWGMEGYIMMSRN--- 314

Query: 342 TNTGKCGIAMEASYPV 357
               +CG+A +ASYP+
Sbjct: 315 -KNNQCGVATQASYPL 329


>gi|384247445|gb|EIE20932.1| hypothetical protein COCSUDRAFT_18161 [Coccomyxa subellipsoidea
           C-169]
          Length = 387

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 153/325 (47%), Positives = 192/325 (59%), Gaps = 41/325 (12%)

Query: 72  RFQIFKDNLRFIDEHNSLNRTYK------------------------------VGLNKFA 101
           R  IFK N+ +I   NS  ++Y+                              +GLN+FA
Sbjct: 20  RLNIFKTNVDYITSVNSAQQSYQASKHFSENTQQTALSSLFLSQLAHTDLLPQLGLNEFA 79

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP-ESVDWREKGAVNPVKDQG 160
           D T EE+ + +LG  +        S     R+A    D  P  S++W E GAV PVK+Q 
Sbjct: 80  DQTWEEFSSTHLGLNAGEDGSFRSSANTGFRHA----DVTPANSINWVEAGAVTPVKNQA 135

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
            CGSCWAFST  +VEG N + TG+L+SLSEQ+LVDCD K + GC GGLMDYAF +II+NG
Sbjct: 136 FCGSCWAFSTTGSVEGANFLATGDLVSLSEQQLVDCDTKKDQGCGGGLMDYAFDYIIKNG 195

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           G+D+E+DY Y      C+  R    VVSIDGYEDV   DE++L KAV+ QPVSVAI A  
Sbjct: 196 GLDTEEDYSYWSVGGFCNKLREERTVVSIDGYEDVPVNDEVALAKAVSKQPVSVAICA-S 254

Query: 281 RAFQHYESGVFT--GECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQR 337
            A Q Y SGV    G C   L+HGV+A GY   E+G  YWLV+NSWG  WG  GY+KL++
Sbjct: 255 EAMQFYSSGVIAAKGSC-IGLNHGVLAAGYDVDESGKPYWLVKNSWGGTWGMQGYMKLEK 313

Query: 338 NLLDTNTGKCGIAMEASYPVKNSQN 362
           +      G CGIAM ASYPVK+S N
Sbjct: 314 D-SSVKEGACGIAMAASYPVKSSPN 337


>gi|110743577|dbj|BAE98346.1| RD21A-like cysteine protease [Triticum aestivum]
          Length = 184

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 134/184 (72%), Positives = 156/184 (84%), Gaps = 1/184 (0%)

Query: 140 ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-R 198
           ELPES+DWREKGAV PVK+QG CGSCWAFS V+ VE IN+IVTGE+++LSEQELV+CD  
Sbjct: 1   ELPESIDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDIN 60

Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
             ++GCNGGLMD AF+FII+NGG+D+E DYPY   + +CD  R+NAKVVSIDG+EDV   
Sbjct: 61  GGSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPEN 120

Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
           DE SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG+ LDHGVVAVGYGTENG DYW+
Sbjct: 121 DEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYWI 180

Query: 319 VRNS 322
           VRNS
Sbjct: 181 VRNS 184


>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
 gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score =  268 bits (684), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 139/313 (44%), Positives = 199/313 (63%), Gaps = 18/313 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W  K+G     +   +K FQIFK N+ +ID  N+  N+ YK+ +N+F D   E+   
Sbjct: 42  FEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAINRFVDKPIED--- 98

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
                 SD       +   +  +  +   ++P +VDWR++GAV P+K+QG CGSCWAFS 
Sbjct: 99  ------SDDGFERTTTTTPTTTFKYENVTDIPATVDWRKRGAVTPIKNQGKCGSCWAFSA 152

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAA+EGI KI +G L+SLSEQ+LVDCDR     GC+ G M  AF+FI++NGG+ +E +YP
Sbjct: 153 VAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGGIATEANYP 212

Query: 230 YLG-AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           Y    +  C   ++ +  V I  YE+V    E SL KAVA+QPVSV I+  G  F+ Y S
Sbjct: 213 YKRVVKGTC---KKVSHKVQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRG-MFKFYSS 268

Query: 289 GVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           G+FTGECG+  +H +  VGYGT ++G+ YWLV+NSW   WGE GY++++R+ +D   G C
Sbjct: 269 GIFTGECGTKPNHALTIVGYGTSKDGIKYWLVKNSWSKRWGEKGYIRIKRD-IDAKEGLC 327

Query: 348 GIAMEASYPVKNS 360
           GIAM+ SYP+ N+
Sbjct: 328 GIAMKPSYPIINN 340


>gi|356557743|ref|XP_003547170.1| PREDICTED: LOW QUALITY PROTEIN: xylem cysteine proteinase 1-like
           [Glycine max]
          Length = 400

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 154/367 (41%), Positives = 223/367 (60%), Gaps = 26/367 (7%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHS------SSWRTDDEVMTIYQTWLAKH 59
           M   + T +FL FI   S    S + YD   ++S        + +++ V+ ++Q W  ++
Sbjct: 1   MGCQLKTHLFLLFIVWGS---WSFLCYDLPSEYSILALEIDKFPSEEGVVELFQRWKEEN 57

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT---YKVGLNKFADLTNEEYRAMYLGTR 116
            K        + RF+ FK NL++I E NS   +     +GLN+FAD++NEE+++ ++   
Sbjct: 58  KKIYRNPEEEKLRFENFKRNLKYIVEKNSKRISPYGQSLGLNQFADMSNEEFKSKFM--- 114

Query: 117 SDAKRRLMKSK-VASQRYACKAGDELPESVDWREKGAVN-PVKDQGSCGSCWAFSTVAAV 174
           S  K+   K   V+S+ ++C+  DE P S+DWR+KG V   VKDQG CGS WAFS+  A+
Sbjct: 115 SKVKKPFSKRNGVSSKDHSCE--DE-PYSLDWRKKGVVTLAVKDQGYCGSYWAFSSTDAI 171

Query: 175 EGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE 234
           EGIN IVT +LISLSEQELVDCD   N GC+GG MDYAF++++ NGG+D+E +YPY+GA+
Sbjct: 172 EGINAIVTADLISLSEQELVDCD-STNDGCDGGXMDYAFEWVMYNGGIDTETNYPYIGAD 230

Query: 235 NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE 294
             C+ ++   KV+ IDGY DV   D  SL  A   QP+S  I+     FQ Y  G++ G+
Sbjct: 231 GTCNVTKEKTKVIGIDGYYDVGQSDS-SLLCATVKQPISAGIDGTSWDFQLYIGGIYDGD 289

Query: 295 CGS---ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
           C S    +DH ++ VGYG+E   DYW+V+NSW + WG  G + L++N  +   G C I  
Sbjct: 290 CSSDPDDIDHAILVVGYGSEGDDDYWIVKNSWRTSWGMEGCIYLRKN-TNLKYGXCAINY 348

Query: 352 EASYPVK 358
            ASYP K
Sbjct: 349 MASYPTK 355


>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
 gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
          Length = 415

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 148/339 (43%), Positives = 203/339 (59%), Gaps = 19/339 (5%)

Query: 9   AISTLVFL-FFISSSSAADMSIISYDNNHDHSS-----SWRTDDEVMTIYQTWLAKHGKT 62
           A+S LVF  F I      D  +       DH +      W+ ++     + ++ A +GK+
Sbjct: 71  AVSLLVFASFLIQWQGDDDRGVFPPSPVEDHKTPVNIWEWK-EEHFQNAFGSFRATYGKS 129

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
                  +KR+ IFK+NL +I  HN    +Y + +N F DL+ EE+R  YLG     K R
Sbjct: 130 YATEEETQKRYAIFKNNLAYIHTHNQQGYSYSLKMNHFGDLSREEFRRKYLGYN---KSR 186

Query: 123 LMKSK---VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
            +KS    VA++       D +P +VDWREKG V PVKDQ  CGSCWAFS   A+EG + 
Sbjct: 187 NLKSNNLGVATELLKVSPSD-VPSAVDWREKGCVTPVKDQRDCGSCWAFSATGALEGAHC 245

Query: 180 IVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
             TGEL+SLSEQELVDC   + N GC+GG M+ AFQ+++ +GG+ SE+ YPYL  + +C 
Sbjct: 246 AKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPYLARDGEC- 304

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             R   KVV+I G++DV    E ++K A+A  PVS+AIEA    FQ Y  GVF   CG+ 
Sbjct: 305 -KRACKKVVTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQFYHEGVFDASCGTD 363

Query: 299 LDHGVVAVGYGT--ENGVDYWLVRNSWGSDWGENGYVKL 335
           LDHGV+ VGYGT  E   D+W+++NSWGS WG +GY+ +
Sbjct: 364 LDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYM 402


>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
 gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
          Length = 327

 Score =  267 bits (683), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 139/314 (44%), Positives = 198/314 (63%), Gaps = 15/314 (4%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           ++ +   HGK          R  IF+DN + I EHN       R+Y +G+N+F DL + E
Sbjct: 20  WEAFKLTHGKQYKSPDEENVRRAIFRDNNQMIKEHNQEAAMGRRSYFMGMNQFGDLAHSE 79

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           Y  + +G        L  S  +   +    G ++ ++VDWR+KGAV P+KDQG CGSCWA
Sbjct: 80  YLELVVGP---GLLPLNLSTPSENVFESTPGLQVDDTVDWRQKGAVTPIKDQGHCGSCWA 136

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FST  ++EG + + TG+L+SLSEQ L+DC R+  N GC GGLMD AF++I  NGG+D+E+
Sbjct: 137 FSTTGSLEGQHFMKTGKLVSLSEQNLLDCSRRFGNKGCEGGLMDQAFRYIKSNGGIDTEE 196

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
            YPY+  + K    + +    ++  Y D+   DEM+L +AV    PVSVAI+A  ++ + 
Sbjct: 197 CYPYMAKDEKVCDYKTSCSGATLSSYTDIKAMDEMALMQAVGTVGPVSVAIDASHKSLRF 256

Query: 286 YESGVF-TGECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           Y+SG++   EC  + LDHGV+AVGYG+ +G+DYWLV+NSWGS WG+ GYVK+ RN     
Sbjct: 257 YKSGIYDEPECSRTKLDHGVLAVGYGSMDGMDYWLVKNSWGSAWGDMGYVKMTRN----K 312

Query: 344 TGKCGIAMEASYPV 357
             +CGIA +ASYPV
Sbjct: 313 NNQCGIATKASYPV 326


>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 332

 Score =  267 bits (683), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 141/287 (49%), Positives = 191/287 (66%), Gaps = 20/287 (6%)

Query: 76  FKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYA 134
           FK+N+ +I+  +N+ N+ YK G+N+FA              R+  K  +  S +    + 
Sbjct: 58  FKENVNYIEACNNAANKPYKRGINQFA-------------PRNRFKGHMCSSIIRITTFK 104

Query: 135 CKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELV 194
            +     P +VD R+KGAV P+KDQG CG CWAFS VAA EGI+ +  G+LISLSEQELV
Sbjct: 105 FENVTATPSTVDCRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELV 164

Query: 195 DCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYP-YLGAENKCDPSRRNAKVVS-IDG 251
           DCD K ++ GC GGLMD AF+FIIQN G+      P Y+G + KC+ +       + I G
Sbjct: 165 DCDTKGVDXGCEGGLMDDAFKFIIQNHGLKHXSQLPLYMGVDGKCNANEAAKNAATIITG 224

Query: 252 YEDVSPFDEMS-LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG- 309
           YEDV   +E + L+KAVA+ PVS AI+A G  FQ Y+SGVFTG CG+ LDHGV AVGYG 
Sbjct: 225 YEDVPANNEKAHLQKAVANNPVSEAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGV 284

Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           +++G +YWLV+NSWG++WGE GY+++QR  +D+    CGIA++ASYP
Sbjct: 285 SDDGTEYWLVKNSWGTEWGEEGYIRMQRG-VDSEEALCGIAVQASYP 330


>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
 gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
 gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
 gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
 gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
          Length = 422

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 144/358 (40%), Positives = 210/358 (58%), Gaps = 14/358 (3%)

Query: 9   AISTLVFL-FFISSSSAADMSIISYDNNHDHSSS-----WRTDDEVMTIYQTWLAKHGKT 62
           A+S LVF  F I      D ++       DH        W+ +      + ++ A + K+
Sbjct: 69  AVSLLVFASFLIQWQGEDDRAVFPPSPVEDHQPPANIWEWK-EAHFQDAFSSFQAMYAKS 127

Query: 63  SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
                  ++R+ IFK+NL +I  HN    +Y + +N F DL+ +E+R  YLG +     +
Sbjct: 128 YATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFRRKYLGFKKSRNLK 187

Query: 123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
                VA++        ELP  VDWR +G V PVKDQ  CGSCWAFST  A+EG +   T
Sbjct: 188 SHHLGVATELLNVLPS-ELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKT 246

Query: 183 GELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
           G+L+SLSEQEL+DC R + N  C+GG M+ AFQ+++ +GG+ SE  YPYL  + +C  ++
Sbjct: 247 GKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEECR-AQ 305

Query: 242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDH 301
              KVV I G++DV    E ++K A+A  PVS+AIEA    FQ Y  GVF   CG+ LDH
Sbjct: 306 SCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDASCGTDLDH 365

Query: 302 GVVAVGYGT--ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GV+ VGYGT  E+  D+W+++NSWG+ WG +GY+ +   +     G+CG+ ++AS+PV
Sbjct: 366 GVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMA--MHKGEEGQCGLLLDASFPV 421


>gi|325185016|emb|CCA19507.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
          Length = 492

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 145/326 (44%), Positives = 199/326 (61%), Gaps = 37/326 (11%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
           +T  E  + + +WL  H  T +      KR + +  N  +I  HN    ++K+G N F+ 
Sbjct: 24  KTFKEYESDFVSWLKTHHLTFSDAFEYAKRLETYIANDIYILTHNLQESSFKLGHNAFSH 83

Query: 103 LTNEEYRAMYLGTRSD---AKRRLMKSKVASQ---RYACKAGDELPESVDWREKGAVNPV 156
           LTNEE+R  + G ++      +RL +S VAS    +Y      +LPESVDW EKGAV  V
Sbjct: 84  LTNEEFRQRFNGFKASDDYLTKRLAQSNVASSTNFQYI-----DLPESVDWVEKGAVTGV 138

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFI 216
           K+QG CGSCWAFST  A+EG   I +G+L+SLSEQELVDCD   + GCNGGLMD+AF +I
Sbjct: 139 KNQGMCGSCWAFSTTGAIEGATFISSGKLVSLSEQELVDCDHNGDHGCNGGLMDHAFSWI 198

Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
            ++ G+ SE+DY Y+ +++ C   R    VVS                      PV+VAI
Sbjct: 199 SEHDGICSEEDYAYIHSQSLC---RSCKPVVS----------------------PVAVAI 233

Query: 277 EAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
           +AG R+FQ Y+SGV+   CG+ LDHGV+ VGYG E+G  YW V+NSWG+ WGE GY++L 
Sbjct: 234 DAGDRSFQFYQSGVYNKTCGTQLDHGVLTVGYGVEDGQKYWKVKNSWGNSWGEKGYIRLS 293

Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQN 362
           R+  +  +G+CGIAM  SYP  + +N
Sbjct: 294 RD-QNGRSGQCGIAMVPSYPTASLRN 318


>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 157/355 (44%), Positives = 214/355 (60%), Gaps = 11/355 (3%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           +++STL+ L  ++ SSA                +   D  +++ ++ W+A+HG+T     
Sbjct: 1   MSLSTLI-LALLAMSSAVAAPRALAARQLAGDEAITVDSAMVSRHEKWMAEHGRTYANEE 59

Query: 68  HNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
              +R ++F+ N + ID  NS  + T+++  N+FADLT+EE+RA   G R          
Sbjct: 60  EKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLTDEEFRAARTGLRRPPAAAAGAG 119

Query: 127 KVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
             A   RY   +  +   S+DWR  GAV  VKDQGSCG CWAFS VAAVEG+ KI TG L
Sbjct: 120 SGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLTKIRTGRL 179

Query: 186 ISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           +SLSEQ+LVDCD    + GC GGLMD AF+++I  GG+ +E  YPY G +  C   RR+A
Sbjct: 180 VSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGGLTTESSYPYRGTDGSC---RRSA 236

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGSALDHGV 303
              SI GYEDV   +E +L  AVA QPVSVAI  G   F+ Y+SGV  G  CG+ L+H +
Sbjct: 237 SAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDSVFRFYDSGVLGGSGCGTELNHAI 296

Query: 304 VAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            AVGYGT  +G  YW+++NSWG  WGE GYV+++R +     G CG+A  ASYPV
Sbjct: 297 TAVGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRGV--RGEGVCGLAQLASYPV 349


>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
          Length = 523

 Score =  266 bits (681), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 133/286 (46%), Positives = 181/286 (63%), Gaps = 4/286 (1%)

Query: 72  RFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
           RF++F  N + I+ HN   + ++ +G N+++ LT +E++ +  G R        ++K A 
Sbjct: 47  RFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFKKLRTGLRVSPSYIQSRAKYAL 106

Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
              A    D +P  +DW E+G V PVK+QG CGSCWAFST  A+EG   + + +L+S+SE
Sbjct: 107 MAPAVNMTD-VPNEMDWVEQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFVSSKQLVSVSE 165

Query: 191 QELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
           QELVDCD   + GCNGGLMD AF+++  + G+  E+DYPY   E  C   ++   V  + 
Sbjct: 166 QELVDCDHNGDMGCNGGLMDNAFKWVKTHKGLCKEEDYPYHAKEGTC-ALKKCKPVTKVT 224

Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
            + DV   DE +LK AVA QPVSVAIEA    FQ Y+SGVF   CG+ LDHGV+ VGYG 
Sbjct: 225 AFHDVPANDEQALKAAVAKQPVSVAIEADQPEFQFYKSGVFDKSCGTKLDHGVLVVGYGE 284

Query: 311 ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           E G  YW V+NSWG+DWG+ GY+KL R      TG+CG+AM  SYP
Sbjct: 285 EGGKKYWKVKNSWGADWGDKGYIKLARE-FGPETGQCGVAMVPSYP 329


>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
          Length = 312

 Score =  266 bits (681), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 138/307 (44%), Positives = 193/307 (62%), Gaps = 10/307 (3%)

Query: 56  LAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLG 114
           +A++G+         +RFQIFK+N+  I+  N+ N  +Y +G+NKF D+TN E+ A Y G
Sbjct: 1   MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60

Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
             S   R L   K     +       + +S+DWR+ GAV  VKDQ  CGSCWAFS +A V
Sbjct: 61  GIS---RPLNIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATV 117

Query: 175 EGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE 234
           EGI KIVTG L+SLSEQE++DC   ++ GC+GG +D A+ FII N G+ SE DYPY   +
Sbjct: 118 EGIYKIVTGYLVSLSEQEVLDC--AVSNGCDGGFVDNAYDFIISNNGVASEADYPYQAYQ 175

Query: 235 NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE 294
             C  +        I GY  V   DE S+K AV +QP++ AI+A G  FQ+Y  GVF+G 
Sbjct: 176 GDC-AANSWPNSAYITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGP 234

Query: 295 CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEA 353
           CG++L+H +  +GYG + +G  YW+V+NSWGS WGE GY+++ R +  +++G CGIAM+ 
Sbjct: 235 CGTSLNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGV--SSSGLCGIAMDP 292

Query: 354 SYPVKNS 360
            YP   S
Sbjct: 293 LYPTLQS 299


>gi|157829826|pdb|1AEC|A Chain A, Crystal Structure Of Actinidin-E-64 Complex+
          Length = 218

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 126/219 (57%), Positives = 158/219 (72%), Gaps = 3/219 (1%)

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           LP  VDWR  GAV  +K QG CG CWAFS +A VEGINKIVTG LISLSEQEL+DC R  
Sbjct: 1   LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQ 60

Query: 201 NA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
           N  GCNGG +   FQFII NGG+++E++YPY   + +C+   +N K V+ID YE+V   +
Sbjct: 61  NTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNN 120

Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
           E +L+ AV  QPVSVA++A G AF+ Y SG+FTG CG+A+DH V  VGYGTE G+DYW+V
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWIV 180

Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +NSW + WGE GY+++ RN+     G CGIA   SYPVK
Sbjct: 181 KNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 217


>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
 gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 153/323 (47%), Positives = 203/323 (62%), Gaps = 22/323 (6%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADL 103
           +D V   ++ W+A+HG+T       E+RF IFK NL+ I+  +N+ NRTYK+GLN FADL
Sbjct: 31  EDAVAEKHEQWMARHGRTYQDDEEKERRFHIFKKNLKHIENFNNAFNRTYKLGLNHFADL 90

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL-----PESVDWREKGAVNPVKD 158
           T+EE+ A Y G +    + L  + + ++    ++ D L     PES+DWR +G V PVK+
Sbjct: 91  TDEEFLATYTGYK--MPKVLPTANITTK--TTQSSDVLYEANVPESIDWRTRGVVTPVKN 146

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
           QG CG CWAFS  AAVEGI     G  +SLS Q+L+DC    N GCNGG MD AF++IIQ
Sbjct: 147 QGRCGCCWAFSAAAAVEGI----IGNGVSLSAQQLLDCVPDSN-GCNGGFMDNAFRYIIQ 201

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           N G+ S   YPY      C PS   A+   I GY DV+P DE +LK AVA QPVS A++A
Sbjct: 202 NQGLASATYYPYQLMREMCRPSNNAAR---ISGYVDVTPADEETLKSAVARQPVSAAVDA 258

Query: 279 GGRA-FQHYESGVFTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKL 335
                F++Y  G+F  + CGS L H +  VGYGT   G  YWL++NSWG  WGE GY++L
Sbjct: 259 TSELNFKYYGGGIFPPQDCGSTLTHAITIVGYGTSAEGTKYWLIKNSWGEGWGEGGYMRL 318

Query: 336 QRNLLDTNTGKCGIAMEASYPVK 358
           QR+ + +  G CGIA+ ASYP +
Sbjct: 319 QRD-VGSYGGACGIALRASYPTR 340


>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
           Precursor
 gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
          Length = 351

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 142/355 (40%), Positives = 213/355 (60%), Gaps = 20/355 (5%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           LVFLF    +  A  S  S D           +D +M  ++ W+A++G+         +R
Sbjct: 7   LVFLFLFLCAMWASPSAASRD---------EPNDPMMKRFEEWMAEYGRVYKDDDEKMRR 57

Query: 73  FQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
           FQIFK+N++ I+  NS N  +Y +G+N+F D+T  E+ A Y G        + +  V S 
Sbjct: 58  FQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVAQYTGV--SLPLNIEREPVVS- 114

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
            +       +P+S+DWR+ GAVN VK+Q  CGSCW+F+ +A VEGI KI TG L+SLSEQ
Sbjct: 115 -FDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQ 173

Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
           E++DC   ++ GC GG ++ A+ FII N G+ +E++YPYL  +  C+ +        I G
Sbjct: 174 EVLDC--AVSYGCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCN-ANSFPNSAYITG 230

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
           Y  V   DE S+  AV++QP++  I+A    FQ+Y  GVF+G CG++L+H +  +GYG +
Sbjct: 231 YSYVRRNDERSMMYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQD 289

Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
            +G  YW+VRNSWGS WGE GYV++ R  + +++G CGIAM   +P   S  +A+
Sbjct: 290 SSGTKYWIVRNSWGSSWGEGGYVRMARG-VSSSSGVCGIAMAPLFPTLQSGANAE 343


>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
 gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
 gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 346

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 134/314 (42%), Positives = 193/314 (61%), Gaps = 7/314 (2%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNE 106
           ++  +Q W+ +  +  +     + R Q+  +NL+FI+  N++ N++YK+G+N+F D T E
Sbjct: 35  IVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTKE 94

Query: 107 EYRAMYLGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
           E+ A Y G R  +              +     D L  + DWR +GAV PVK QG CG C
Sbjct: 95  EFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNKDWRNEGAVTPVKSQGECGGC 154

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFS +AAVEG+ KI  G LISLSEQ+L+DC R+ N GC GG    AF +II++ G+ SE
Sbjct: 155 WAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNAFNYIIKHRGISSE 214

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
            +YPY   E  C  + R A  + I G+E+V   +E +L +AV+ QPV+VAI+A    F H
Sbjct: 215 NEYPYQVKEGPCRSNARPA--ILIRGFENVPSNNERALLEAVSRQPVAVAIDASEAGFVH 272

Query: 286 YESGVFTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           Y  GV+    CG++++H V  VGYGT   G+ YWL +NSWG  WGENGY++++R+ ++  
Sbjct: 273 YSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRD-VEWP 331

Query: 344 TGKCGIAMEASYPV 357
            G CG+A  ASYPV
Sbjct: 332 QGMCGVAQYASYPV 345


>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
          Length = 246

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 133/271 (49%), Positives = 181/271 (66%), Gaps = 28/271 (10%)

Query: 89  LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWR 148
           ++++YK+ +N+FADLTNEE+      +R+  K  +  ++  S +Y  +    +P + DWR
Sbjct: 1   MDKSYKLSINEFADLTNEEFGT----SRNRFKAHICSTEATSFKY--ENVTAVPSTXDWR 54

Query: 149 EKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGG 207
           +KGAV P+KDQG CGSCWAFS VAA+EGI ++ TG+LISLSEQELVDCD    + GC G 
Sbjct: 55  KKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXGA 114

Query: 208 LMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAV 267
                              +YPY G +  C+  +       I+GYEDV   +E +L+KAV
Sbjct: 115 -------------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAV 155

Query: 268 ADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSD 326
           A QP++VAI+AGG  FQ Y SGVFTG+CG+ LDHGV AVGYGT ++G+ YWLV+NSWG+ 
Sbjct: 156 AHQPIAVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDDGMKYWLVKNSWGTG 215

Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           WGE GY+++QR++     G CGIAM+ASYP 
Sbjct: 216 WGEEGYIRMQRDVT-AKEGLCGIAMQASYPT 245


>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
          Length = 324

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 138/322 (42%), Positives = 206/322 (63%), Gaps = 11/322 (3%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLT 104
           D +M  ++ W+A++G+  N      +RFQIFK+N+  I+  N+ +  +Y +G+N+F D+T
Sbjct: 4   DPMMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTDMT 63

Query: 105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
           N E+ A Y G  +     + +  V S  +       +P+S+DWR+ GAV  VK+QGSCGS
Sbjct: 64  NNEFLARYTG--ASLPLNIERDPVVS--FDDVDISAVPQSIDWRDYGAVTSVKNQGSCGS 119

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
           CWAFS +A VEGI KI  G LISLSEQE++DC   ++ GC+GG ++ A+ FII N G+ S
Sbjct: 120 CWAFSAIATVEGIYKIKAGNLISLSEQEVLDC--ALSYGCDGGWVNKAYDFIISNNGVTS 177

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
             + PY G +  C+ +    K   I GY  V   +E S+  AVA+QP++  I+AGG  FQ
Sbjct: 178 FANLPYKGYKGPCNHNDLPNKAY-ITGYTYVQSNNERSMMIAVANQPIAALIDAGGD-FQ 235

Query: 285 HYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           +Y+SGVFTG CG++L+H +  +GYG T +G  YW+V+NSWG+ WGE GY+++ R+ + + 
Sbjct: 236 YYKSGVFTGSCGTSLNHAITVIGYGQTSSGTKYWIVKNSWGTSWGERGYIRMARD-VSSP 294

Query: 344 TGKCGIAMEASYPVKNSQNSAK 365
            G CGIAM   +P   S  +A+
Sbjct: 295 YGLCGIAMAPLFPTLQSGANAE 316


>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
 gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
          Length = 340

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 144/318 (45%), Positives = 198/318 (62%), Gaps = 18/318 (5%)

Query: 45  DDEVMTI---YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKF 100
           +D+ +T+   Y+ W  K+          EK  QIFK N+ +ID  N+  N++YK+ +N+F
Sbjct: 29  NDQSLTLSERYKHWKIKYRVIYKDDAEEEKHIQIFKHNVAYIDSFNAAGNKSYKLTINRF 88

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
           ADL  E     +       KR+L      S  +  K   ++P +VDWR++GAV PVK+Q 
Sbjct: 89  ADLPTEPSDDGF------KKRKL--EPTTSSLFKYKNITDIPAAVDWRKRGAVTPVKNQR 140

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
            CGSCWAFS V A+EGI +I +G L+SLSEQELVD  R     GCNGG +  AF+F+++N
Sbjct: 141 ECGSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVRSNWTNGCNGGYLIDAFEFVLEN 200

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
           GG+ +E  YPY G   K + S++ ++ V I  YE V    E SL K VA+QPVSV I+  
Sbjct: 201 GGIATEASYPYRGV--KGNNSKKVSRQVQIKSYEQVPRNSEDSLLKVVANQPVSVGIDIS 258

Query: 280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRN 338
           G   + Y SG+FTGECG+  +H V+ VGYGT N G  YWLV+NSWG  WGE  Y++++R+
Sbjct: 259 G-MIRFYSSGIFTGECGTKPNHAVIIVGYGTSNDGTKYWLVKNSWGIRWGEKRYIRMKRD 317

Query: 339 LLDTNTGKCGIAMEASYP 356
            +D   G CGI M+ASYP
Sbjct: 318 -IDAKEGLCGIPMDASYP 334


>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  265 bits (676), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 147/314 (46%), Positives = 191/314 (60%), Gaps = 17/314 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           +  W  +HGK          R  I++ NL  + +HN      + TY +G+N+F DL NEE
Sbjct: 28  WNEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGINQFTDLQNEE 87

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           + AM  G R     +  K    S         ELP++VDWR KG V PVKDQG CGSCWA
Sbjct: 88  FVAMMTGFRVSGTSKAAK---GSTFLPPNNVGELPKTVDWRTKGYVTPVKDQGQCGSCWA 144

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
           FST  +VEG +   TG+L+SLSEQ LVDC  + +AGC+GG MD AFQ+II  GG+D+E  
Sbjct: 145 FSTTGSVEGQHFKATGKLVSLSEQNLVDCSGR-DAGCDGGFMDRAFQYIIDAGGIDTEAS 203

Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHY 286
           YPY   + KC   + N    ++ GY DV+   E +L+KAVA   P+SVAI+A   +FQHY
Sbjct: 204 YPYKAVDGKCHFKKANVG-ATVTGYTDVTSGSEKALQKAVAHVGPISVAIDASHMSFQHY 262

Query: 287 ESGVFT--GECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           +SGV+   G   + LDHGV+AVGYGT  +G DYW+V+NSW   WG NGYV + RN     
Sbjct: 263 KSGVYNEPGCDSTVLDHGVLAVGYGTSSDGTDYWIVKNSWAETWGMNGYVWMSRN----K 318

Query: 344 TGKCGIAMEASYPV 357
             +CGIA  ASYP+
Sbjct: 319 DNQCGIATNASYPL 332


>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 156/355 (43%), Positives = 213/355 (60%), Gaps = 11/355 (3%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           +++STL+ L  ++ SSA                +   D  +++ ++ W+A+HG+T     
Sbjct: 1   MSLSTLI-LALLAMSSAVAAPRALAARQLAGDEAITVDAAMVSRHEKWMAEHGRTYANEE 59

Query: 68  HNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
              +R ++F+ N + ID  NS  + T+++  N+FADLT+EE+RA   G R          
Sbjct: 60  EKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLTDEEFRAARTGLRRPPAAAAGAG 119

Query: 127 KVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
             A   RY   +  +   S+DWR  GAV  VKDQGSCG CWAFS VAAVEG+ KI TG L
Sbjct: 120 SGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLTKIRTGRL 179

Query: 186 ISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           +SLSEQ+LVDCD    + GC GGLMD AF+++I  GG+ +E  YPY G +  C   RR+A
Sbjct: 180 VSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGGLTTESSYPYRGTDGSC---RRSA 236

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGSALDHGV 303
              SI GYEDV   +E +L  AVA QPVSVAI  G   F+ Y+SGV  G  CG+ L+H +
Sbjct: 237 SAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDSVFRFYDSGVLGGSGCGTELNHAI 296

Query: 304 VAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            A GYGT  +G  YW+++NSWG  WGE GYV+++R +     G CG+A  ASYPV
Sbjct: 297 TAAGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRGV--RGEGVCGLAQLASYPV 349


>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
          Length = 352

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 142/350 (40%), Positives = 206/350 (58%), Gaps = 19/350 (5%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           LVFLF       A  S  S D            D +M  ++ W+A++G+         +R
Sbjct: 7   LVFLFLFLCVMWASPSAASRD---------EPSDPMMKRFEEWMAEYGRVYKDNDEKMRR 57

Query: 73  FQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
           FQIFK+N+  I+  NS N  +Y +G+N+F D+T  E+ A Y G  S       +  V+  
Sbjct: 58  FQIFKNNVNHIETFNSHNGNSYTLGINQFTDMTKSEFVAQYTGGISRPLNIEREPVVSFD 117

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
                A   +P+S+DWR+ GAVN VK+Q  CGSCWAF+ +A VEGI KI TG L+SLSEQ
Sbjct: 118 DVNISA---VPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQ 174

Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
           E++DC   ++ GC GG ++ A+ FII N G+ +E++YPY   +  C+ +        I G
Sbjct: 175 EVLDC--AVSYGCKGGWVNKAYDFIISNNGVTTEENYPYQAYQGTCN-ANSFPNSAYITG 231

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
           Y  V   DE S+  AV++QP++  I+A    FQ+Y  GVF+G CG++L+H +  +GYG +
Sbjct: 232 YSYVRRNDERSMMYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQD 290

Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
            +G  YW+VRNSWGS WGE GYV++ R  + +++G CGIAM   +P   S
Sbjct: 291 SSGTKYWIVRNSWGSSWGEGGYVRMARG-VSSSSGACGIAMSPLFPTLQS 339


>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 355

 Score =  264 bits (675), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 142/315 (45%), Positives = 197/315 (62%), Gaps = 16/315 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           +Q W+ +  +  +     + RF +FK NL+FI++ N   +RTYK+G+N+FAD T EE+ A
Sbjct: 47  HQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIA 106

Query: 111 MYLGTRS---DAKRRLMKSKVASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSC 165
            + G +          +   + S  +     AG E   + DWR +GAV PVK QG CG C
Sbjct: 107 THTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGRE---TKDWRYEGAVTPVKYQGQCGCC 163

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFS+VAAVEG+ KIV   L+SLSEQ+L+DCDR+ + GCNGG+M  AF +II+N G+ SE
Sbjct: 164 WAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASE 223

Query: 226 QDYPYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
             YPY  AE  C   R N K  + I G++ V   +E +L +AV+ QPVSV+I+A G  F 
Sbjct: 224 ASYPYQAAEGTC---RYNGKPSAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGFM 280

Query: 285 HYESGVFTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           HY  GV+    CG+ ++H V  VGYGT   G+ YWL +NSWG  WGENGY++++R++   
Sbjct: 281 HYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWP 340

Query: 343 NTGKCGIAMEASYPV 357
             G CG+A  A YPV
Sbjct: 341 Q-GMCGVAQYAFYPV 354


>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
          Length = 318

 Score =  264 bits (675), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 142/329 (43%), Positives = 203/329 (61%), Gaps = 20/329 (6%)

Query: 1   MATASMFLAISTLVFLFFISSS----SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
           MAT S F   S L+F+    S     S    SI+ Y  +   S+     ++++ ++ +W+
Sbjct: 1   MATISSF---SKLLFVAICLSVHMGLSYGAFSIVGYSPDDLTST-----EKLINLFDSWM 52

Query: 57  AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
            ++ K    +     RF+IFKDNL++IDE N  N TY +GL  F DLTN+E++  Y+G  
Sbjct: 53  VEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKYVG-- 110

Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
           S  +      +   + +       +P S+DWR+KGAV PV++QGSCGSCW FS+VAAVEG
Sbjct: 111 SIPENWSTTEEPNDKEFIYDDVVNIPASIDWRQKGAVTPVRNQGSCGSCWTFSSVAAVEG 170

Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
           INKIVTG+L+SLSEQEL+DC+R+ + GC GG   YA Q+ + N G+   Q YPY G + +
Sbjct: 171 INKIVTGQLVSLSEQELLDCERR-SYGCRGGFPPYALQY-VANSGIHLRQYYPYEGVQRQ 228

Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
           C  ++     V  DG   V   +E +L + +A QPVS+ +EA GRAFQ+Y  G+F G CG
Sbjct: 229 CRAAQAKGPKVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEAKGRAFQNYRGGIFAGPCG 288

Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGS 325
           +++DH V AVGYG  NG  Y L++NSWG+
Sbjct: 289 TSIDHAVAAVGYG--NG--YILIKNSWGT 313


>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 335

 Score =  264 bits (675), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 146/315 (46%), Positives = 197/315 (62%), Gaps = 17/315 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           +  W  +HGK          R  I++ NL  + +HN      + TY +G+N+FADL NEE
Sbjct: 28  WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLKNEE 87

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           + AM  G R +   +  K    S         ELP++VDWR KG V PVKDQG CGSCWA
Sbjct: 88  FVAMMTGFRVNGTSKAAK---GSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCWA 144

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FST  ++EG +   TG+L+SLSEQ LVDC  ++ N GC+GGLMD AFQ+II+ GG+D+E+
Sbjct: 145 FSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKEGNEGCDGGLMDQAFQYIIKAGGIDTEE 204

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
            YPY   + +C   + N    ++ GY DV+   E +L+KAVA   P+SVAI+A   +FQ 
Sbjct: 205 SYPYKAVDGECHFKKANIG-ATVTGYTDVTSDSETALQKAVAHIGPISVAIDASHMSFQL 263

Query: 286 YESGVFT-GECGSA-LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           Y+SGV+   +C S  LDHGV+AVGYG T +G DYW+V+NSW   WG NGY+ + RN    
Sbjct: 264 YKSGVYNEPDCSSTLLDHGVLAVGYGTTSDGTDYWIVKNSWAETWGMNGYLWMSRN---- 319

Query: 343 NTGKCGIAMEASYPV 357
              +CGIA +ASYP+
Sbjct: 320 KDNQCGIATQASYPL 334


>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 325

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 137/310 (44%), Positives = 192/310 (61%), Gaps = 14/310 (4%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++ W + HGK  +  G ++ R  +F  N++ I  HN+   T+K+ +N+F+DLT +E+   
Sbjct: 25  WEAWKSFHGKKYHNQGEDDFRHYVFLQNIKTIAAHNA-KSTFKMAINEFSDLTRKEFVKT 83

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           Y G R   K+   K       +       +P  VDWR++G V P+K+QG CGSCWAFST 
Sbjct: 84  YNGYRLSMKKSTNKPST----FMAPLNTNMPTEVDWRKEGYVTPIKNQGRCGSCWAFSTT 139

Query: 172 AAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
            ++EG +   TG+L+SLSEQ L+DC   + N GC GG MD AF++I  N G+D+E  YPY
Sbjct: 140 GSLEGQHFRKTGKLVSLSEQNLIDCSAAEGNDGCGGGFMDDAFEYIKLNNGIDTEASYPY 199

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESG 289
            G ++ C   + N   +   GY D+  + E  LK AVA   P+SVAI+A  ++F  Y +G
Sbjct: 200 EGRDDICRYKKTNKGAIDT-GYMDIKQYSEDDLKAAVATVGPISVAIDASHKSFHMYHTG 258

Query: 290 VF-TGECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           V+   EC  + LDHGV+ VGYGTENG DYWLV+NSWG+DWG NGY+K+ RN     +  C
Sbjct: 259 VYHEPECSQTVLDHGVLVVGYGTENGEDYWLVKNSWGTDWGMNGYIKMSRN----RSNNC 314

Query: 348 GIAMEASYPV 357
           GIA  ASYP+
Sbjct: 315 GIATNASYPL 324


>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  264 bits (674), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 146/351 (41%), Positives = 207/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +S L+ LFF+          IS  N+   + S +    V   ++ W+++HG+        
Sbjct: 8   MSILITLFFV----------ISMFNSQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-SGLCDIAKMSSYP 341


>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
          Length = 318

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 141/329 (42%), Positives = 203/329 (61%), Gaps = 20/329 (6%)

Query: 1   MATASMFLAISTLVFLFFISSS----SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
           MAT   F   S L+F+    S     S    SI+ Y  +   S+     ++++ ++ +W+
Sbjct: 1   MATIXSF---SKLLFVAICLSVHMGLSYGAFSIVGYSPDDLTST-----EKLINLFDSWM 52

Query: 57  AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
            ++ K    +     RF+IFKDNL++IDE N  N TY +GL  F DLTN+E++  Y+G+ 
Sbjct: 53  VEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKYVGSI 112

Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
            +      +S    + +       +P S+DWR+KGAV PV++QGSCGSCW FS+VAAVEG
Sbjct: 113 PENWSTTEESN--DKEFIYDDVVNIPASIDWRQKGAVTPVRNQGSCGSCWTFSSVAAVEG 170

Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
           INKIVTG+L+SLSEQEL+DC+R+ + GC GG   YA Q+ + N G+   Q YPY G + +
Sbjct: 171 INKIVTGQLVSLSEQELLDCERR-SYGCRGGFPPYALQY-VANSGIHLRQYYPYEGVQRQ 228

Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
           C  ++     V  DG   V   +E +L + +A QPVS+ +EA GRAFQ+Y  G+F G CG
Sbjct: 229 CRAAQAKGPKVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEAKGRAFQNYRGGIFAGPCG 288

Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGS 325
           +++DH V AVGYG  NG  Y L++NSWG+
Sbjct: 289 TSIDHAVAAVGYG--NG--YILIKNSWGT 313


>gi|356515116|ref|XP_003526247.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 333

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 149/338 (44%), Positives = 193/338 (57%), Gaps = 39/338 (11%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           +  WL  +G         E RF I++ N+ +I    S   +Y +  NKFADLTNEE+ + 
Sbjct: 5   FDRWLKXNGXNYEDKEEWEIRFVIYQANVEYIGCKKSQKNSYNLTDNKFADLTNEEFVST 64

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG-------- 163
           YLG  +    RL    +   R+       LP S DWR++GAV  +KDQG+CG        
Sbjct: 65  YLGFAT----RL----IPHTRFKYHEHGNLPXSKDWRKEGAVTDIKDQGNCGKHSTWFSP 116

Query: 164 ---------------------SCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKIN 201
                                S WAFS VAAVE INKI +G+L+SLSEQELVD D    N
Sbjct: 117 EISHNLRNILTNYNTINFRDISFWAFSVVAAVERINKIKSGKLVSLSEQELVDYDVANKN 176

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GC GGLMD  F FI +NGG+ + +DYPY G +  C+  +     V+I GYE     DE 
Sbjct: 177 QGCEGGLMDTTFAFIKKNGGLTTSKDYPYEGVDGSCNKEKALHHAVNISGYERAPSKDEA 236

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
            LK A A+QP+SVAI+AGG AFQ Y  GVF+G CG  L+HGV  VGY       Y  V+N
Sbjct: 237 MLKVAAANQPISVAIDAGGYAFQLYSQGVFSGVCGKKLNHGVTIVGYDKGTFDKYRTVKN 296

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           S G+DWGE+GY++++R+  D   G CGIAM+ASYP+K+
Sbjct: 297 SXGADWGESGYIRMKRDAFD-KAGTCGIAMKASYPLKD 333


>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
          Length = 344

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 146/351 (41%), Positives = 208/351 (59%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +S L+ LFF+          IS  N+   + S +    V   ++ W+++HG+        
Sbjct: 8   MSILITLFFV----------ISMFNSQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T+EE+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK+QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDISDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C + ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCANRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT ENG  YWL++NSWG+ WGE G++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDENGQKYWLLKNSWGTSWGEKGFMKIIRDYGNP-SGLCDIAKLSSYP 341


>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
 gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
 gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
          Length = 344

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 148/351 (42%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D +P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGGLM  AF FII+NGG+  E DY YLG +  C  SR     
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQYTCR-SREKTAA 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGNCADQINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  D  +G C IA  +SYP
Sbjct: 292 GYGTDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341


>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
          Length = 331

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 142/315 (45%), Positives = 197/315 (62%), Gaps = 16/315 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           +Q W+ +  +  +     + RF +FK NL+FI++ N   +RTYK+G+N+FAD T EE+ A
Sbjct: 23  HQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIA 82

Query: 111 MYLGTRS---DAKRRLMKSKVASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSC 165
            + G +          +   + S  +     AG E   + DWR +GAV PVK QG CG C
Sbjct: 83  THTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGRE---TKDWRYEGAVTPVKYQGQCGCC 139

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFS+VAAVEG+ KIV   L+SLSEQ+L+DCDR+ + GCNGG+M  AF +II+N G+ SE
Sbjct: 140 WAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASE 199

Query: 226 QDYPYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
             YPY  AE  C   R N K  + I G++ V   +E +L +AV+ QPVSV+I+A G  F 
Sbjct: 200 ASYPYQAAEGTC---RYNGKPSAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGFM 256

Query: 285 HYESGVFTGE-CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           HY  GV+    CG+ ++H V  VGYGT   G+ YWL +NSWG  WGENGY++++R++   
Sbjct: 257 HYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWP 316

Query: 343 NTGKCGIAMEASYPV 357
             G CG+A  A YPV
Sbjct: 317 Q-GMCGVAQYAFYPV 330


>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
          Length = 345

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 147/351 (41%), Positives = 206/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +S L+ LFF+          IS  N+   + S +    V   ++ W+++HG+        
Sbjct: 8   MSILITLFFV----------ISMFNSQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T+EE+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
            S  +     + D++P ++DWRE GAV  VK+QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 PSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQGKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A     Q Y  G + G C + ++H V A+
Sbjct: 234 VQISNYQ-VVPEGETSLLQAVTKQPVSIGI-AASHDLQFYAGGTYDGSCANRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341


>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 146/351 (41%), Positives = 206/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT ENG  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDENGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
          Length = 344

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 146/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISIFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKTNDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYSGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  D  +G C IA  +SYP
Sbjct: 292 GYGTDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341


>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 146/351 (41%), Positives = 206/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N    + S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S V
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPV 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|167345242|gb|ABZ69061.1| cysteine protease [Pinus sylvestris]
          Length = 214

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 125/206 (60%), Positives = 156/206 (75%), Gaps = 9/206 (4%)

Query: 21  SSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNL 80
           S+S AD SIIS       +   R DD +M +Y+ WLA+H K  NG+   +KRF +FKDN 
Sbjct: 18  SASRADFSIIS-------NKDLREDDAIMELYELWLAEHKKAYNGLDEKQKRFTVFKDNF 70

Query: 81  RFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE 140
            +I EHN  NR+YK+GLN+FADL++EE++A YLG + D K+RL++S   S RY    G++
Sbjct: 71  LYIHEHNQGNRSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLLRS--PSPRYQYSDGED 128

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           LP+S+DWREKGAV PVKDQG CGSCWAFSTVAAVEGIN+IVTG+LISLSEQELVDCD   
Sbjct: 129 LPKSIDWREKGAVAPVKDQGQCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSY 188

Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           N GCNGGL DYAF+FII NGG+ + +
Sbjct: 189 NQGCNGGLRDYAFEFIINNGGLTARR 214


>gi|357439999|ref|XP_003590277.1| Cysteine protease [Medicago truncatula]
 gi|355479325|gb|AES60528.1| Cysteine protease [Medicago truncatula]
          Length = 514

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 144/384 (37%), Positives = 219/384 (57%), Gaps = 62/384 (16%)

Query: 25  ADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID 84
           ++ SI+++D N      + ++++V+ ++Q W  +H K          R + FK NL++I 
Sbjct: 30  SEYSILAFDLN-----KFPSEEQVVELFQQWKKEHQKFYIHPEEAALRLENFKRNLKYIV 84

Query: 85  EHNSLNRT---YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
           E N++  +   + +GLN+FAD++NEE++  ++   S  K+ + K + ++     ++ D+ 
Sbjct: 85  ERNAMRNSPVGHHLGLNRFADMSNEEFKNKFI---SKVKKPISK-RASNLHVKVESCDDA 140

Query: 142 PESVDWREKGAVNPVKDQGSCG-------------------------------------- 163
           P S+DWR+KG V  VKDQG+CG                                      
Sbjct: 141 PYSLDWRKKGVVTGVKDQGNCGKLLYFMHFKSFLVIYILELTTNFPLYSFESQFCILEKK 200

Query: 164 ------SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
                 SCW+FS+  A+EG+N IVTG+LISLSEQELVDCD   N GC GG MDYAF+++I
Sbjct: 201 KLDFVGSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTT-NDGCEGGYMDYAFEWVI 259

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
            NGG+D+E DYPY+G    C+ ++   KVV+IDGY DV+  D  +L  A   QP+SV I+
Sbjct: 260 NNGGIDTEADYPYIGVGGTCNVTKEETKVVTIDGYTDVTQSDS-ALFCATVKQPISVGID 318

Query: 278 AGGRAFQHYESGVFTGECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
                FQ Y  G++ G+C S    +DH V+ VGYG++   DYW+V+NSWG+ WG  G++ 
Sbjct: 319 GSTLDFQLYTGGIYDGDCSSNPDDIDHAVLIVGYGSDGNQDYWIVKNSWGTSWGIEGFIY 378

Query: 335 LQRNLLDTNTGKCGIAMEASYPVK 358
           ++RN  +   G C I   AS+P K
Sbjct: 379 IRRN-TNLKYGVCAINYMASFPTK 401


>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 143/322 (44%), Positives = 200/322 (62%), Gaps = 18/322 (5%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLNK 99
           T+ E+   ++ + +  G+          R  IF+ NL+FI  HN    + + T+ V +N 
Sbjct: 25  TEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNN 84

Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           F DL+NEE+RA + G      RRL    +A   +A    + LP +VDW  KG V P+K+Q
Sbjct: 85  FTDLSNEEFRATFNG-----YRRLAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQ 139

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQ 218
             CGSCWAFS VA++EG + + TG+L+SLSEQ LVDC   + + GC+GG MDYAF+++IQ
Sbjct: 140 QQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQ 199

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIE 277
           N G+D+E  YPY   +  C+  +RN+   +I  + DV   DE +L+ AVA   P+SVAI+
Sbjct: 200 NRGIDTEASYPYKAIDESCE-FKRNSIGATIHSFVDVKTGDESALQNAVASIGPISVAID 258

Query: 278 AGGRAFQHYESGVFT-GECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
           A   +FQ Y SGV+   +C +  LDHGV AVGYGT NGV YW V+NSWG+ WG+ GY+ +
Sbjct: 259 ASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGVPYWKVKNSWGTSWGQKGYIFM 318

Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
            RN       +CGIA +ASYPV
Sbjct: 319 SRN----KQNQCGIATKASYPV 336


>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
 gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
          Length = 350

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 142/315 (45%), Positives = 194/315 (61%), Gaps = 7/315 (2%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           T+  V+  +Q W+ K+ +T       EKR +IFK+NL +I+  N++ N++YK+GLN+++D
Sbjct: 25  TESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYIENFNNVGNKSYKLGLNRYSD 84

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           LT+EE+ A + G +     +L  SK+ S        D++P + DWREKG V  VK+Q  C
Sbjct: 85  LTSEEFIASHTGFK--VSDQLSDSKMRSVAIPFNLNDDVPTNFDWREKGVVTDVKNQRQC 142

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           G CWAF+ VAAVEGI KI  G LISLSEQ+LVDCDR+ ++GC GG    AF  II++ G+
Sbjct: 143 GCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDCDRQ-SSGCGGGDFVLAFDSIIKSRGI 201

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
             E DYPY   + +     +      I+GY  V   DE  L +AV  QPVSVAI      
Sbjct: 202 VKEDDYPYKANDVQTCQLGQIPGAAQINGYFKVPANDEQQLLRAVLQQPVSVAIST-SYD 260

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           F HY  GV+ G CG  L+H V  +GYG +E G  YWL++NSWG  WGE GY+K+ R    
Sbjct: 261 FHHYMGGVYEGSCGPKLNHAVTIIGYGVSEAGKKYWLIKNSWGETWGEKGYMKVLRESSA 320

Query: 342 TNTGKCGIAMEASYP 356
           T  G+C IA+ A+YP
Sbjct: 321 TG-GQCSIAVHAAYP 334


>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
 gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
          Length = 340

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/309 (46%), Positives = 193/309 (62%), Gaps = 9/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A+HG+         +R ++F+ N   ID  N+    ++++  N+FADLT +E+RA
Sbjct: 38  HEKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVQEFRA 97

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
              G R    R    +     RY   +  +  +SVDWR  GAV  VKDQG+ G CWAFS 
Sbjct: 98  ARTGLR---PRPAPSAGAGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGASGCCWAFSA 154

Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAAVEG+NKI TG L+SLSEQELVDCD   ++ GC+GGLMD AFQF+ + GG+ SE  YP
Sbjct: 155 VAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYP 214

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   +  C  S   A   SI G+EDV   +E +L  AVA QPVSVAI     AF+ Y+SG
Sbjct: 215 YQCRDGPCR-SSAAAAAASIRGHEDVPRNNEAALAAAVAHQPVSVAINGEDMAFRFYDSG 273

Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           V  G CG+ L+H + AVGYGT  +G  YWL++NSWG+ WGE GYV+++R +     G CG
Sbjct: 274 VLGGACGTDLNHAITAVGYGTAADGTRYWLMKNSWGASWGEGGYVRIRRGV--RGEGVCG 331

Query: 349 IAMEASYPV 357
           +A   SYPV
Sbjct: 332 LAKLPSYPV 340


>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 146/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +S L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MSILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|414591548|tpg|DAA42119.1| TPA: hypothetical protein ZEAMMB73_388689, partial [Zea mays]
          Length = 229

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 126/208 (60%), Positives = 156/208 (75%), Gaps = 8/208 (3%)

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           GSCWAFS +AAVEG+NKI+TG+L+SLSEQELVDCD   N GC+GGLMDYAFQ+I +NGG+
Sbjct: 13  GSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAFQYIQRNGGV 72

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +E +YPYL  +  C+ ++  +  V+IDGYEDV   +E +L+KAVA QPV+VAIEA G+ 
Sbjct: 73  TTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAVAIEASGQD 132

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           FQ Y  GVFTG CG+ LDHGV AVGYGT  +G  YW V+NSWG DWGE GY+++QR + D
Sbjct: 133 FQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGYIRMQRGVPD 192

Query: 342 TNTGKCGIAMEASYPVKNSQNSAKPKPH 369
           +  G CGIAME SYP K      KP  H
Sbjct: 193 SR-GLCGIAMEPSYPTK------KPAGH 213


>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 345

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 131/317 (41%), Positives = 194/317 (61%), Gaps = 7/317 (2%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
           +  +   +Q W+    +  +     + R ++F +NL+FI+  N++ +++YK+G+NKF D 
Sbjct: 31  EPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSYKLGVNKFTDW 90

Query: 104 TNEEYRAMYLGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           T EE+ A + G    +           +  +     D L  + DWR +GAV PVK QG C
Sbjct: 91  TKEEFLATHTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEGAVTPVKYQGEC 150

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
           G CWAFS +AAVEG+ KI  G LISLSEQ+L+DC R+ N GC GG M  AF +I++NGG+
Sbjct: 151 GGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCAREQNNGCKGGTMIEAFNYIVKNGGV 210

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            SE  YPY   E  C  +  +   + I G+E+V   +E +L +AV+ QPV+V I+A    
Sbjct: 211 SSENAYPYQVKEGPCRSN--DIPAIVIRGFENVPSNNERALLEAVSRQPVAVDIDASETG 268

Query: 283 FQHYESGVFTG-ECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
           F HY  GV+   +CG++++H V  VGYGT + G+ YWL +NSWG  WGENGY++++R+ +
Sbjct: 269 FIHYSGGVYNARDCGTSVNHAVTLVGYGTSQEGIKYWLAKNSWGKTWGENGYIRIRRD-V 327

Query: 341 DTNTGKCGIAMEASYPV 357
           +   G CG+A  ASYPV
Sbjct: 328 EWPQGMCGVAQYASYPV 344


>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
           Neff]
          Length = 326

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 137/321 (42%), Positives = 202/321 (62%), Gaps = 26/321 (8%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEK---RFQIFKDNLRFIDEHNSLNRTYKVGLNKF 100
           + D +  ++  W+ +H K+      NE+   R+ ++++N  +I+ HN  N+++ + +NKF
Sbjct: 22  SHDPLTGVFADWMQEHQKSYA----NEEFVYRWNVWRENYLYIEAHNHQNKSFHLAMNKF 77

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
            DLTN E+  ++ G    A +   +S +A           LP   DWR+KGAV  VK+QG
Sbjct: 78  GDLTNAEFNKLFKGLSITADQAKQESDIA-------PAPGLPADFDWRQKGAVTHVKNQG 130

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
            CGSCW+FST  + EG N +  G L SLSEQ LVDC     N GCNGGLMDYAF++II+N
Sbjct: 131 QCGSCWSFSTTGSTEGANFLKHGRLTSLSEQNLVDCSTSYGNHGCNGGLMDYAFEYIIRN 190

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNA--KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
            G+D+E+ YPY  ++  C  +++++  ++VS   Y +V   +E +L  AVA QP SVAI+
Sbjct: 191 KGIDTEESYPYHASQGTCRYNKQHSGGELVS---YTNVPSGNEGALLNAVATQPTSVAID 247

Query: 278 AGGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
           A   +FQ Y+ GV+    C S+ LDHGV+AVG+G  +G DYWLV+NSWG+DWG +GY+++
Sbjct: 248 ASHSSFQFYKGGVYDEPACSSSRLDHGVLAVGWGVRDGKDYWLVKNSWGADWGLSGYIEM 307

Query: 336 QRNLLDTNTGKCGIAMEASYP 356
            RN       +CGIA  AS+P
Sbjct: 308 SRN----KHNQCGIATAASHP 324


>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 206/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N    + S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
          Length = 421

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 139/349 (39%), Positives = 204/349 (58%), Gaps = 13/349 (3%)

Query: 17  FFISSSSAADMSIISYDNNHDHSSS-----WRTDDEVMTIYQTWLAKHGKTSNGMGHNEK 71
           F I      D ++       DH        W+ +      + ++ A + K+       ++
Sbjct: 77  FLIQWQGEDDRAVFPPSPVEDHQPPANIWEWK-EAHFQDAFSSFQAMYAKSYATEEEKQR 135

Query: 72  RFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
           R+ IFK+NL +I  HN    +Y + +N F DL+ +E+R  YLG +     +     VA++
Sbjct: 136 RYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFRRKYLGFKKSRNLKSHHLGVATE 195

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
                   ELP  VDWR +G V PVKDQ  CGSCWAFST  A+EG +   TG+L+SLSEQ
Sbjct: 196 LLNVLPS-ELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQ 254

Query: 192 ELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
           EL+DC R + N  C+GG M+ AFQ+++ +GG+ SE  YPYL  + +C  ++   KVV I 
Sbjct: 255 ELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEECR-AQSCEKVVKIL 313

Query: 251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT 310
           G++DV    E ++K A+A  PVS+AIEA    FQ Y  GVF   CG+ LDHGV+ VGYGT
Sbjct: 314 GFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDASCGTDLDHGVLLVGYGT 373

Query: 311 --ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             E+  D+W+++NSWG+ WG +GY+ +   +     G+CG+ ++AS+PV
Sbjct: 374 DKESKKDFWIMKNSWGTGWGRDGYMYMA--MHKGEEGQCGLLLDASFPV 420


>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GC+GG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT ENG  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDENGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341


>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
          Length = 335

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 148/321 (46%), Positives = 199/321 (61%), Gaps = 28/321 (8%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
           +  W  K G++        +R QI+ +N + +  HN L     ++Y++G+ +FAD+ NEE
Sbjct: 27  FHAWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMDNEE 86

Query: 108 YRAMY-LGT----RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           Y+++  LG      + A RR       S  +    G  LP +VDWR+KG V  VKDQ  C
Sbjct: 87  YKSLISLGCLRAFNTSAPRR------GSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQC 140

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
           GSCWAFS   ++EG N   TG+L+SLSEQ+LVDC     N GCNGGLMDYAF++I +NGG
Sbjct: 141 GSCWAFSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGG 200

Query: 222 MDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
           +D+E+ YPY   + +C   P    AK     GY DV+  DE +LK+AVA   PVSV I+A
Sbjct: 201 IDTEKSYPYEAEDGQCRFKPENVGAKCT---GYVDVTVGDEDALKEAVATIGPVSVGIDA 257

Query: 279 GGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
              +FQ Y+SGV+   +C S  LDHGV+AVGYGT+NG DYWLV+NSWG  WG+ GY+ + 
Sbjct: 258 SHSSFQLYDSGVYDEQDCSSQDLDHGVLAVGYGTDNGQDYWLVKNSWGLGWGQEGYIMMS 317

Query: 337 RNLLDTNTGKCGIAMEASYPV 357
           RN       +CGIA  ASYP+
Sbjct: 318 RN----KDNQCGIATAASYPL 334


>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKKNMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG+L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAEGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|302831223|ref|XP_002947177.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
           nagariensis]
 gi|300267584|gb|EFJ51767.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
           nagariensis]
          Length = 514

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 161/355 (45%), Positives = 212/355 (59%), Gaps = 46/355 (12%)

Query: 52  YQTWLAKHGKT-SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
           +  W  ++G+T         +R  IF DN+R I E +  +    + LN++ADLT EE+ +
Sbjct: 38  FTLWSRQYGRTYVEQSPEYTRRLSIFSDNVRAIQESHEKDPGVTLALNEYADLTWEEFSS 97

Query: 111 MYLGTRSDAKR-----RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
             LG R D  +     R   S+  + RYA  A  + P+++DWREKGAV  VK+QG CGSC
Sbjct: 98  TRLGLRIDQDQLDRRSRRSASRRNAWRYA--AAVDNPKAIDWREKGAVAEVKNQGQCGSC 155

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCD---RKI---------------------- 200
           WAFST  A+EGIN IVTG+L SLSEQ+LVDCD   R +                      
Sbjct: 156 WAFSTTGAIEGINAIVTGQLQSLSEQQLVDCDTGKRTVTRSKRSCTVILPSYSSNSCRNE 215

Query: 201 -NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY---LGAENKCDPSRRNAK-VVSIDGYEDV 255
            N GC+GGLMD AF+++IQNGG+D+EQDY Y    G    C+  ++  +  VSIDGYEDV
Sbjct: 216 SNMGCSGGLMDDAFKYVIQNGGLDTEQDYAYWSGYGLGFWCNKRKQTDRPAVSIDGYEDV 275

Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGV 314
            P  E +L KAVA QPV+VAI AG  + Q Y  GV +  C   L+HGV+ VGY  +++G 
Sbjct: 276 -PQGEDNLLKAVAHQPVAVAICAGA-SMQFYSRGVIS-TCCEGLNHGVLTVGYNVSQDGE 332

Query: 315 DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPH 369
            YW+V+NSWG+ WGE GY +L+  + +  TG CGIA  ASYP K S N  KP P 
Sbjct: 333 KYWIVKNSWGAGWGEQGYFRLKMGVGE--TGLCGIASAASYPTKTSPN--KPVPE 383


>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
           supertexta]
          Length = 347

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 146/310 (47%), Positives = 204/310 (65%), Gaps = 21/310 (6%)

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SL-NRTYKVGLNKFADLTNEEYRAMYL 113
           +HG+        E+RF+IFK NL++I+EHN   SL  ++Y +G+N+FAD+ NEE+R MY 
Sbjct: 48  QHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFR-MYN 106

Query: 114 GTRSDAK--RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           G R D    R +  S   +  Y        P+ VDWR+KG V  VK+QG CGSCW+FST 
Sbjct: 107 GLRRDYNYSREVQCSNHLTPEYLVA-----PDEVDWRKKGYVTAVKNQGQCGSCWSFSTT 161

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
            ++EG +   +G+L+SLSEQ+LVDC  K  N GCNGGLMD AF++II NGG+++E++YPY
Sbjct: 162 GSLEGQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGIETEEEYPY 221

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESG 289
              + +C   +++    +  G  DV   DE  LK +VA+  PVS+AI+A  ++FQ Y  G
Sbjct: 222 DARQERCH-FKKSEVAATASGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQLYSGG 280

Query: 290 VF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           V+   +C S  LDHGV+ VGYGT++G DYWLV+NSWG+ WG  GYVK+ RN       +C
Sbjct: 281 VYDEPKCSSTELDHGVLVVGYGTDDGQDYWLVKNSWGTTWGLEGYVKMSRN----QDNQC 336

Query: 348 GIAMEASYPV 357
           G+A +ASYP+
Sbjct: 337 GVATQASYPL 346


>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
          Length = 343

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 22/351 (6%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +S L+ LFF+          IS  N+   + S +    V   ++ W+++HG+        
Sbjct: 8   MSILITLFFV----------ISMFNSQTTARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T+EE+   + G   +    L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFTGI--NIPSYLSPSPM 114

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK+QG CG CWAFS V ++EG  KI TG L+
Sbjct: 115 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLM 174

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+ SE DY Y G +  C    + A  
Sbjct: 175 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQYTCRSQEKTA-A 232

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 233 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 290

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 291 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPG-GHCDIAKMSSYP 340


>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S + +  V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPELSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G        L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNI-PNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GC+GG M  AF FI +NGG+ SE DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  D  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341


>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 371

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 146/369 (39%), Positives = 218/369 (59%), Gaps = 26/369 (7%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMT-IYQTWLAKH 59
           +AT ++ + +   +F+F  +   AA M+  +      H      DD +M   +  W A H
Sbjct: 15  LATTAVLM-LRGCLFVFLTALPPAAIMTPAA-----GHVV--ELDDMLMLDRFVRWQAAH 66

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYL----- 113
            +T        +RFQ+++ N+ +I+  N     TY++G N+FADLT+EE+ +MY      
Sbjct: 67  NRTYGDAEERLRRFQVYRANIEYIEATNRRGGLTYELGENQFADLTSEEFLSMYASSYDA 126

Query: 114 GTRSDAKRRLMKSKVASQRYACKAGDE---LPESVDWREKGAVNPVKDQG-SCGSCWAFS 169
           G R+D +  L+ + VA    A   GD     P S DWR KGAV P K+QG +C SCWAF 
Sbjct: 127 GDRADDEAALITTDVAGDG-AWSDGDLEALPPPSWDWRAKGAVTPPKNQGPTCSSCWAFV 185

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           TVA +EG+  I TG+LISLSEQ+LVDCD   + GCN G     F+++++NGG+ +E +YP
Sbjct: 186 TVATIEGLTFIKTGKLISLSEQQLVDCD-MYDGGCNTGSYSRGFRWVLENGGLTTEAEYP 244

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y  A   C+ ++       I G   + P +E+ ++KAVA QPV VAIE  G   Q Y++G
Sbjct: 245 YTAARGPCNRAKSAHHAAKITGQGRIPPQNELVMQKAVAGQPVGVAIEV-GSGMQFYKTG 303

Query: 290 VFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           V++G CG+ L H V  VGYG +  +G  YW+V+NSWG  WGE G+++++R++     G C
Sbjct: 304 VYSGPCGTNLAHAVTVVGYGVDPASGAKYWIVKNSWGQAWGERGFIRMRRDV--GGPGLC 361

Query: 348 GIAMEASYP 356
           GIA++ +YP
Sbjct: 362 GIALDVAYP 370


>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341


>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
 gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
          Length = 345

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 147/352 (41%), Positives = 204/352 (57%), Gaps = 22/352 (6%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYAC---KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
           +S  +      + D +P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L
Sbjct: 116 SSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNL 175

Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           +  SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A 
Sbjct: 176 MEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA- 233

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
            V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A
Sbjct: 234 AVQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGNCADRINHAVTA 291

Query: 306 VGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           +GYGT E G  YWL++NSWG+ WGENGY+K+ R+  D  +G C IA  +SYP
Sbjct: 292 IGYGTDEEGQKYWLLKNSWGTSWGENGYMKIIRDSGDP-SGLCDIAKMSSYP 342


>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N    + S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  D  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341


>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 142/322 (44%), Positives = 198/322 (61%), Gaps = 18/322 (5%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLNK 99
           T+ E+   ++ + +  G+          R  IF+ NL+FI  HN    + + T+ V +N 
Sbjct: 25  TEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNN 84

Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           F DL+NEE+RA + G      RRL    +A   +A    + LP +VDW  KG V P+K+Q
Sbjct: 85  FTDLSNEEFRATFNG-----YRRLAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQ 139

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQ 218
             CGSCWAFS VA++EG + + TG+L+SLSEQ LVDC   + + GC+GG MDYAF+++IQ
Sbjct: 140 QQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQ 199

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIE 277
           N G+D+E  YPY   +  C+  +RN+   +I  + DV   DE +L+ AVA   P+SVAI+
Sbjct: 200 NRGIDTEASYPYKAIDESCE-FKRNSVGATIHSFVDVKTGDESALQNAVASIGPISVAID 258

Query: 278 AGGRAFQHYESGVFT-GECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
           A   +FQ Y SGV+   +C +  LDHGV AVGYGT NG  YW V+NSWG+ WG  GY+ +
Sbjct: 259 AAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGAPYWKVKNSWGTSWGRKGYIFM 318

Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
            RN       +CGIA +ASYPV
Sbjct: 319 SRN----KQNQCGIATKASYPV 336


>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  D  +G C I   +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDITKMSSYP 341


>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
 gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +S L+ LFF+          IS  N+   + S +    V   ++ W+++HG+        
Sbjct: 8   MSILITLFFV----------ISMFNSQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341


>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
 gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
 gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
 gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
 gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
 gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
          Length = 334

 Score =  261 bits (666), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 149/353 (42%), Positives = 204/353 (57%), Gaps = 25/353 (7%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           M LA+  +V L  +S +  A  ++ S       + +++T       +  W+ KH K  + 
Sbjct: 1   MRLAVFLIVSLVILSINVCAATNLFS-------AQTYQTS------FLGWMKKHNKAYHH 47

Query: 66  MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
              N+K +Q FKDN+ FI   NS      +GLN+FADLTNEEY+  YLG   +   R  +
Sbjct: 48  HEFNDK-YQTFKDNMDFIHNWNSKESDTVLGLNRFADLTNEEYKKTYLGMSINVNLRANQ 106

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
             +    +    G   P S+DWR+ GAV  VKDQG CGSCWAF+T  AVEG ++I TG +
Sbjct: 107 VPMNGLNFERFTG---PSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNM 163

Query: 186 ISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           ++ SEQ LVDC  R  N GC+GGLM  AF++II N G+ +E+ YPY   +N+C       
Sbjct: 164 VTFSEQHLVDCSGRYGNNGCDGGLMTSAFKYIIDNDGIATEEAYPYTATQNRC-VYNTTM 222

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT-GECGS-ALDHG 302
              +I GY+DV    E +L  A++ QPV+VAI+A    FQ Y+SGV+    C S  L+HG
Sbjct: 223 LGTAISGYKDVPRGSESALTAAISKQPVAVAIDASPITFQLYKSGVYQEATCSSYRLNHG 282

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
           V+AVGYGT  G DY++V+NSW   WG  GY+ + RN        CGIA  ASY
Sbjct: 283 VLAVGYGTLEGKDYYIVKNSWAETWGNQGYILMARNA----NNHCGIATMASY 331


>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
          Length = 324

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 132/318 (41%), Positives = 199/318 (62%), Gaps = 11/318 (3%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADL 103
           +D +M  ++ W+A++G+         +RFQIFK+N++ I+  NS N  +Y +G+N+F D+
Sbjct: 3   NDPMMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDM 62

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           T  E+ A Y G        + +  V S  +       +P+S+DWR+ GAVN VK+Q  CG
Sbjct: 63  TKSEFVAQYTGV--SLPLNIEREPVVS--FDDVNISAVPQSIDWRDYGAVNEVKNQNPCG 118

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
           SCWAF+ +A VEGI KI TG L+SLSEQE++DC   ++ GC GG ++ A+ FII N G+ 
Sbjct: 119 SCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDC--AVSYGCKGGWVNKAYDFIISNNGVT 176

Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
           +E++YPY   +  C+ +        I GY  V   DE S+  AV++QP++  I+A    F
Sbjct: 177 TEENYPYQAYQGTCN-ANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDA-SENF 234

Query: 284 QHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           Q+Y  GVF+G CG++L+H +  +GYG + +G  YW+VRNSWGS WGE GYV++ R  + +
Sbjct: 235 QYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARG-VSS 293

Query: 343 NTGKCGIAMEASYPVKNS 360
           ++G CGIAM   +P   S
Sbjct: 294 SSGACGIAMSPLFPTLQS 311


>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
 gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
          Length = 354

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 153/333 (45%), Positives = 204/333 (61%), Gaps = 21/333 (6%)

Query: 36  HDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR---- 91
           H   S  +  DE    +  +    GK+      N+   + F  N+  I+EHN  +R    
Sbjct: 31  HRQKSLRQKIDEAFNKWDDYKETFGKSYEPDEEND-YMEAFVKNVIHIEEHNKEHRLGRK 89

Query: 92  TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWRE 149
           T+++GLN+ ADL   +YR +  G R    RR     + S   ++      ++PESVDWRE
Sbjct: 90  TFEMGLNEIADLPFSQYRKLN-GYRM---RRQFGDSLQSNGTKFLVPFNVQIPESVDWRE 145

Query: 150 KGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGL 208
           +G V PVK+QG CGSCWAFS+  A+EG +   TG+L+SLSEQ LVDC  K  N GCNGGL
Sbjct: 146 EGLVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGL 205

Query: 209 MDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA 268
           MD AF++I +N G+D+E  YPY+G E KC   +RNA      G+ D+   DE +LKKAVA
Sbjct: 206 MDLAFEYIKENHGVDTEDSYPYVGRETKCH-FKRNAVGADDKGFVDLPEGDEEALKKAVA 264

Query: 269 DQ-PVSVAIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE-NGVDYWLVRNSWG 324
            Q P+S+AI+AG R+FQ Y+ GV F  EC S  LDHGV+ VGYGT+    DYWLV+NSWG
Sbjct: 265 TQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWG 324

Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             WGE GY+++ RN        CG+A +ASYP+
Sbjct: 325 PTWGEKGYIRIARN----RNNHCGVATKASYPL 353


>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341


>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
          Length = 345

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S + +  V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPELSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G        L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNI-PNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GC+GG M  AF FI +NGG+ SE DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  D  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341


>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
           [Brachypodium distachyon]
          Length = 377

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 139/327 (42%), Positives = 195/327 (59%), Gaps = 24/327 (7%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNRTYKVGLNKFADLTNEEY 108
           +Q W A+HG+         +R +++  N+R+I+  N   +   TY++G   + DLT +E+
Sbjct: 53  FQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGETAYTDLTADEF 112

Query: 109 RAMY------LGTRSDAKRRLMKSKVASQRYACKAGDE----------LPESVDWREKGA 152
            AMY      L    D     M   + ++  A  AG +           P SVDWR KGA
Sbjct: 113 TAMYTSPSPVLSAHDDEAAGAMM--ITTRAGAVDAGGQQVYFNVSTAGAPASVDWRAKGA 170

Query: 153 VNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYA 212
           V  VK+QG CGSCWAFSTVA VEGI++I TG LISLSEQELVDCD  ++ GC+GG+  +A
Sbjct: 171 VTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCD-TLDYGCDGGVSYHA 229

Query: 213 FQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPV 272
            ++I  NGG+ +E DYPY G +  C  ++      +I G+  V+   E SL  AVA QPV
Sbjct: 230 LEWIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARVATRSEPSLANAVAAQPV 289

Query: 273 SVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV--GYGTENGVDYWLVRNSWGSDWGEN 330
           +V+IEAGG  FQHY  GV+ G CG+ L+HGV  V  G    +G  YW+V+NSWG  WG+ 
Sbjct: 290 AVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDGEKYWIVKNSWGKKWGDG 349

Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GY ++++++     G CGIA+  S+P+
Sbjct: 350 GYFRMKKDVAGKPEGLCGIAIRPSFPL 376


>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
 gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 334

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 146/315 (46%), Positives = 195/315 (61%), Gaps = 16/315 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
           +  W  K  ++ +       R QI+ +N +F+  HN L     ++Y++G+  FAD+ NEE
Sbjct: 26  FHAWKLKFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADMENEE 85

Query: 108 Y-RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           Y R +  G        L   +  S  +    G +LP++VDWR+KG V  VKDQ  CGSCW
Sbjct: 86  YKRVISQGCLHSFNASL--PRRGSTFFRLPEGTDLPDAVDWRDKGYVTDVKDQKQCGSCW 143

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
           AFS   ++EG +   TG L+SLSEQ+LVDC     N GC GGLMDYAFQ+I  NGG+D+E
Sbjct: 144 AFSATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDYAFQYIQANGGIDTE 203

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
           + YPY     KC  +  N    S  GY +VS  DE +LK+AVA   P+SV I+A   +FQ
Sbjct: 204 ESYPYEAENGKCRYNPDNIGATST-GYTEVSQGDEDALKEAVATIGPISVGIDASQMSFQ 262

Query: 285 HYESGVFT-GECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
            YESGV+   +C S  LDHGV+AVGYGTE+G DYWLV+NSWG +WG+ GY+K+ RN    
Sbjct: 263 FYESGVYNEPDCSSLELDHGVLAVGYGTEDGNDYWLVKNSWGLEWGDKGYIKMSRN---- 318

Query: 343 NTGKCGIAMEASYPV 357
            + +CGIA  ASYP+
Sbjct: 319 KSNQCGIATAASYPL 333


>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
          Length = 345

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 145/352 (41%), Positives = 206/352 (58%), Gaps = 22/352 (6%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYAC---KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
           +S  +      + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG+L
Sbjct: 116 SSTEFKKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKL 175

Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           +  SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A 
Sbjct: 176 MEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA- 233

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
            V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A
Sbjct: 234 AVQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTA 291

Query: 306 VGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           +GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 IGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 342


>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  260 bits (664), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 144/314 (45%), Positives = 192/314 (61%), Gaps = 17/314 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           ++ W  +HGK          R  I++ NL  +  HN      + TY +G+N+FADL N+E
Sbjct: 28  WKEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIRHNLKYDLGHFTYDLGMNQFADLQNKE 87

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           + AM  G R +   +  K    S         +LP++VDWR KG V PVKDQG CGSCWA
Sbjct: 88  FVAMMTGFRVNGTSKAAK---GSTFLPPNNVGKLPKTVDWRTKGYVTPVKDQGQCGSCWA 144

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
           FS   ++EG +   TG+L+SLSEQ LVDC  K N GCNGGLMD AFQ+II  GG+D+E+ 
Sbjct: 145 FSATGSLEGQHFKKTGKLVSLSEQNLVDCSDK-NYGCNGGLMDRAFQYIIDAGGIDTEES 203

Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHY 286
           YPY+  +  C     N    ++ GY DV+   E +L+KAVA   P+SVAI+A   +FQ Y
Sbjct: 204 YPYIAMDGNCHFKTANVG-ATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHFSFQLY 262

Query: 287 ESGVFT--GECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           +SGV+   G   + LDHGV+AVGYGT  +G DYW+V+NSW   WG NGY+ + RN     
Sbjct: 263 QSGVYNEPGCSSTLLDHGVLAVGYGTTIDGTDYWIVKNSWAETWGMNGYIWMSRN----K 318

Query: 344 TGKCGIAMEASYPV 357
             +CGIA +ASYP+
Sbjct: 319 DNQCGIATQASYPL 332


>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
 gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  D  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341


>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 206/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +S L+ LFF+          IS  N+   + S +    V   ++ W+++HG+        
Sbjct: 8   MSILITLFFV----------ISMFNSQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENIKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GC+GG M  AF FI +NGG+ SE DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341


>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  D  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDIAKMSSYP 341


>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
          Length = 324

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 147/320 (45%), Positives = 206/320 (64%), Gaps = 25/320 (7%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFA 101
           DE+ T+++T    H KT      + +RF I++ +L  I++HN    L + T+ +G+N++ 
Sbjct: 21  DEMWTLFKT---THSKTYATEAEDMRRF-IWERHLNMINQHNIEADLGKHTFSLGMNEYG 76

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           DLT  EY AM       +  ++ KS V S  +      ++P++VDWREKG V PVK+QG 
Sbjct: 77  DLTQHEYAAM-------SGYKMAKSSVGSS-FLEPENLQVPKTVDWREKGYVTPVKNQGQ 128

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNG 220
           CGSCWAFS+  ++EG     TG L S+SEQ LVDC R + N GC+GGLMD AF +I +N 
Sbjct: 129 CGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGCSGGLMDNAFTYIKKNM 188

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
           G+DSE+ YPY   + +C   +++  V +  G+ D+   DE +L+ AVA   PVSVAI+A 
Sbjct: 189 GIDSEKSYPYEAVDGECR-YKKSDSVTTDSGFVDIPHGDETALRTAVASVGPVSVAIDAS 247

Query: 280 GRAFQHYESGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
             +FQ Y++GV+T   C S  LDHGV+ VGYG ENG DYWLV+NSWG+ WGE GY+KL R
Sbjct: 248 HTSFQFYKTGVYTEANCSSTQLDHGVLVVGYGVENGQDYWLVKNSWGASWGEAGYIKLAR 307

Query: 338 NLLDTNTGKCGIAMEASYPV 357
           N    +  +CGIA +ASYP+
Sbjct: 308 N----HGNQCGIASQASYPL 323


>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
          Length = 355

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 152/333 (45%), Positives = 203/333 (60%), Gaps = 21/333 (6%)

Query: 36  HDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR---- 91
           H   S  +  DE    +  +    GK+      N+   + F  N+  I+EHN  +R    
Sbjct: 32  HRQKSLRQKIDEAFNKWDDYKETFGKSYEPEEEND-YMEAFVKNVIHIEEHNKEHRLGRK 90

Query: 92  TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWRE 149
           T+++GLN+ ADL   +YR +  G R    RR     + S   ++      ++PESVDWRE
Sbjct: 91  TFEMGLNEIADLPFSQYRKLN-GYRM---RRQFGDSMQSNGTKFLVPFNVQIPESVDWRE 146

Query: 150 KGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGL 208
           +G V PVK+QG CGSCWAFS+  A+EG +   TG+L+SLSEQ LVDC  K  N GCNGGL
Sbjct: 147 EGLVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGL 206

Query: 209 MDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA 268
           MD AF++I +N G+D+E  YPY+G E KC   +RN       G+ D+   DE +LKKAVA
Sbjct: 207 MDLAFEYIKENHGVDTEDSYPYVGRETKCH-FKRNTVGADDKGFVDLPEGDEEALKKAVA 265

Query: 269 DQ-PVSVAIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE-NGVDYWLVRNSWG 324
            Q P+S+AI+AG R+FQ Y+ GV F  EC S  LDHGV+ VGYGT+    DYWLV+NSWG
Sbjct: 266 TQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWG 325

Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             WGE GY+++ RN        CG+A +ASYP+
Sbjct: 326 PTWGEKGYIRIARN----RNNHCGVATKASYPL 354


>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
          Length = 330

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 137/317 (43%), Positives = 189/317 (59%), Gaps = 12/317 (3%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
           T D +  ++  W+ ++ K++    ++ + F I++ N+   +EHN  N++Y + +N+F DL
Sbjct: 22  THDPLTGVFAKWMRENTKSNYRFVYSNEEF-IYRWNVWRDEEHNRQNKSYFLAMNQFGDL 80

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           TN E+  ++ G   D  +       A +  A      +P   DWR+KGAV  VK+QG CG
Sbjct: 81  TNAEFNRLFKGLAFDYSKHAKIHTAAPEAPATG----IPSEFDWRQKGAVTHVKNQGQCG 136

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
           SCW+FST  + EG N + TG L+SLSEQ L+DC     N GCNGGLMDYAF++II N G+
Sbjct: 137 SCWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNGCNGGLMDYAFEYIINNRGI 196

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
           D+E  YPY  A           K  S+ GY DV+  DE +L  A   +PVSVAI+A   +
Sbjct: 197 DTEASYPYQTAGPLTCQYNAANKGGSLTGYTDVTSGDENALLNAAVKEPVSVAIDASHNS 256

Query: 283 FQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
           FQ Y  GV +   C S  LDHGV+ VG+G+ENG D+W V+NSWG+ WG NGY+K+ RN  
Sbjct: 257 FQFYSGGVYYESACSSTQLDHGVLVVGWGSENGQDFWWVKNSWGASWGLNGYIKMSRN-- 314

Query: 341 DTNTGKCGIAMEASYPV 357
                 CGIA  ASYP 
Sbjct: 315 --QNNNCGIATAASYPT 329


>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S        + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GC+GG M  AF FII+NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N    + S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341


>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S + +  V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPELSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY Y G +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
          Length = 344

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 143/351 (40%), Positives = 206/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +S L+ LFF+          IS  N    + S +    V   ++ W+++HG+        
Sbjct: 8   MSILITLFFV----------ISMFNTQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     +  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYVSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK+QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C + ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCANRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGE+G++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNP-AGLCDIAKVSSYP 341


>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
 gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
 gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
 gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
 gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 203/351 (57%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341


>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 203/351 (57%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  D  +G C I   +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDITKMSSYP 341


>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 203/351 (57%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341


>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 141/307 (45%), Positives = 183/307 (59%), Gaps = 17/307 (5%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLG 114
           W   H K  +  G    R+ I+KDN R I EHN     + + +N+F D+TN E++     
Sbjct: 30  WKMAHNKAYSHDGEETVRYTIWKDNERRIREHNLQGGDFLLEMNQFGDMTNNEFK----- 84

Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
              D    L    V+   +        P+SVDWR +G V PVKDQG CGSCWAFST  ++
Sbjct: 85  ---DFNGYLSHKHVSGSTFLTPNSFVAPDSVDWRNEGYVTPVKDQGQCGSCWAFSTTGSL 141

Query: 175 EGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
           EG N   TG+L+SLSEQ LVDC     N GCNGGLMD AF +I +N G+DSE  YPY   
Sbjct: 142 EGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENNGIDSEASYPYTAK 201

Query: 234 ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFT 292
           + KC  ++ N       G+ D+   DE  LK+AVA   P+SVAI+A   +FQ Y  GV+ 
Sbjct: 202 DGKCAFTKPNVAATDT-GFVDIPSGDENKLKEAVASVGPISVAIDASHFSFQFYRKGVYN 260

Query: 293 G-ECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
             +C S  LDHGV+ VGYGTE+G DYWLV+NSW + WG+ GY+K+ RN  +    +CGIA
Sbjct: 261 ERKCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMSRNAKN----QCGIA 316

Query: 351 MEASYPV 357
             ASYP+
Sbjct: 317 TNASYPL 323


>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
 gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
 gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 203/351 (57%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  D  +G C I   +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDP-SGLCDITKMSSYP 341


>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCDIAKMSSYP 341


>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
 gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
          Length = 498

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 151/329 (45%), Positives = 198/329 (60%), Gaps = 21/329 (6%)

Query: 52  YQTWLAKHGKT-SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
           +  W  +H +T S G     +R  +F DN+R I E N  N    + LN++AD T EE+ A
Sbjct: 40  FGLWATQHARTYSEGSPEYTRRLGVFADNVRAIAEQNRRNTGITLALNEYADETWEEFAA 99

Query: 111 MYLGTRSDAKR------RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
             LG +   ++      R   S  +S RYA     + P +VDWR K AV  VK+QG CGS
Sbjct: 100 KRLGLKISQEQLKAREARSSSSSSSSWRYA---QVQTPAAVDWRAKNAVTQVKNQGQCGS 156

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
           CWAFS V ++EG N + TG+L++LSEQ+LVDCD   N GC+GGLMD AF++++ NGG+D+
Sbjct: 157 CWAFSAVGSIEGANALATGQLVALSEQQLVDCDTASNMGCSGGLMDDAFKYVLDNGGIDT 216

Query: 225 EQDYPY---LGAENKCDPSRRNAK-VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
           E+DY Y    G    C+  ++  +  VSIDGYEDV P  E +L KAVA QPV+VAI A  
Sbjct: 217 EEDYSYWSGYGFGFWCNKRKQTDRPAVSIDGYEDV-PTSEPALLKAVAGQPVAVAICASA 275

Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD-YWLVRNSWGSDWGENGYVKLQRNL 339
              Q Y SGV    C   L+HGV+AVGY T +    YW+V+NSWG  WGE GY +L+   
Sbjct: 276 N-MQFYSSGVIN-SCCEGLNHGVLAVGYDTSDKAQPYWIVKNSWGGSWGEQGYFRLKMG- 332

Query: 340 LDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
            +   G CGIA  ASY VK S  + KP P
Sbjct: 333 -EGPKGLCGIASAASYAVKTSAVN-KPVP 359


>gi|313507179|pdb|2ACT|A Chain A, Crystallographic Refinement Of The Structure Of Actinidin
           At 1.7 Angstroms Resolution By Fast Fourier
           Least-Squares Methods
          Length = 220

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 122/219 (55%), Positives = 157/219 (71%), Gaps = 3/219 (1%)

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           LP  VDWR  GAV  +K QG CG  WAFS +A VEGINKI +G LISLSEQEL+DC R  
Sbjct: 1   LPSYVDWRSAGAVVDIKSQGECGGXWAFSAIATVEGINKITSGSLISLSEQELIDCGRTQ 60

Query: 201 NA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
           N  GC+GG +   FQFII +GG+++E++YPY   +  CD + ++ K V+ID YE+V   +
Sbjct: 61  NTRGCDGGYITDGFQFIINDGGINTEENYPYTAQDGDCDVALQDQKYVTIDTYENVPYNN 120

Query: 260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
           E +L+ AV  QPVSVA++A G AF+ Y SG+FTG CG+A+DH +V VGYGTE GVDYW+V
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYASGIFTGPCGTAVDHAIVIVGYGTEGGVDYWIV 180

Query: 320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           +NSW + WGE GY+++ RN+     G CGIA   SYPVK
Sbjct: 181 KNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 217


>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
 gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
 gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 203/351 (57%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341


>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 325

 Score =  257 bits (657), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 144/316 (45%), Positives = 191/316 (60%), Gaps = 28/316 (8%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA- 110
           +  W   H K  +       R+ I+KDN+  I E+NS ++   + +N F D+TN E+RA 
Sbjct: 27  WYVWKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSKSKNVILRMNHFGDMTNTEFRAK 86

Query: 111 ---MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
              + L    +    L+ S  A+           P++VDWR +G V PVK+QG CGSCWA
Sbjct: 87  MNGLLLHKHQNGSTFLVPSHTAA-----------PDAVDWRSEGYVTPVKNQGQCGSCWA 135

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS+  A+EG +   TG L+SLSEQ LVDC     N GCNGGLMD AF +I  NGG+D+E 
Sbjct: 136 FSSTGALEGQHFKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTET 195

Query: 227 DYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAF 283
            YPY G +  C   R +   +  D  G+ D+   DE +LK+AVA   PVSVAI+A   +F
Sbjct: 196 GYPYEGQDGTC---RYSKSSIGADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSF 252

Query: 284 QHYESGVF-TGECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           Q Y SGV+   +C  SALDHGV+ VGYGT+NG DYWLV+NSWG+ WG  GY+ + RN   
Sbjct: 253 QFYHSGVYDEPQCSPSALDHGVLVVGYGTDNGKDYWLVKNSWGTGWGTEGYIYMSRN--- 309

Query: 342 TNTGKCGIAMEASYPV 357
            N  +CGIA +ASYP+
Sbjct: 310 -NQNQCGIASKASYPL 324


>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
          Length = 342

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 152/324 (46%), Positives = 199/324 (61%), Gaps = 21/324 (6%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADL 103
           VM  ++++  +H K          R +IF +N + I  HN L    ++TYK+G+NK+ D+
Sbjct: 25  VMEEWESFKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDM 84

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE---LPESVDWREKGAVNPVKDQG 160
            + E+  M  G R++      K+    Q        E   +P+SVDWREKGAV  VKDQG
Sbjct: 85  LHHEFVNMMNGFRANTSGAGYKANRGFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKDQG 144

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
           SCGSCWAFS   A+EG +   TG+L+SLSEQ LVDC  K  N GCNGGLMD AFQ+I  N
Sbjct: 145 SCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIKVN 204

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
           GG+D+E+ YPY   E + +P R N      D  G+ DV   +E +LKKA+A   PVSVAI
Sbjct: 205 GGIDTEKSYPY---EAEDEPCRYNPANAGADDRGFVDVREGNENALKKAIATIGPVSVAI 261

Query: 277 EAGGRAFQHYESGVFTGECGSA--LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYV 333
           +A   +FQ Y+ GV++    SA  LDHGV+AVGYG TE+G DYWLV+NSW   WG+ GY+
Sbjct: 262 DASQDSFQFYQHGVYSDPDCSAENLDHGVLAVGYGTTEDGQDYWLVKNSWSKSWGDQGYI 321

Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
           K+ RN        CGIA  ASYP+
Sbjct: 322 KIARN----QNNMCGIASAASYPL 341


>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 204/351 (58%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +S L+ LFF+          IS  N+   + S +    V   ++ W+++HG+        
Sbjct: 8   MSILITLFFV----------ISMFNSQTRARS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q    G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFCAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341


>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
 gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
          Length = 351

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 147/329 (44%), Positives = 203/329 (61%), Gaps = 20/329 (6%)

Query: 36  HDHSSSWRTDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRT 92
            D SSS    +E MT  ++ W+ +HG+T        +RFQ+FK N  F+D  N+    + 
Sbjct: 35  RDLSSSTGYGEEAMTARHEKWMVEHGRTYKDEAEKARRFQVFKANAAFVDTSNAAAGGKK 94

Query: 93  YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYA-CKAGDELPESVDWREKG 151
           Y + +N+FAD+T++E+ A Y G +          K+   +YA      E  ++VDWR+KG
Sbjct: 95  YHLAINRFADMTHDEFMARYTGFKP---LPATGKKMPGFKYANVTLSSEDQQAVDWRKKG 151

Query: 152 AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMD 210
           AV  VK+Q  CG CWAFS VAA+EG+++I TGEL+SLSEQ+LVDC     N GC GG M+
Sbjct: 152 AVTDVKNQQKCGCCWAFSAVAAIEGMHQINTGELVSLSEQQLVDCSTNGNNNGCGGGTME 211

Query: 211 YAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ 270
            AFQ++I N G+ +E  YPY   +  C   +     V++  Y+ V   DE +L  AVA Q
Sbjct: 212 DAFQYVIGNNGIATEAAYPYTAMQGMCQNVQ---PAVAVRSYQQVPRDDEDALAAAVAGQ 268

Query: 271 PVSVAIEAGGRAFQHYESGVFTGE-CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWG 328
           PVSVA++A    FQ Y+ GV T + CG+ L+H V AVGYGT E+G  YWL++N WGS WG
Sbjct: 269 PVSVAVDANN--FQFYKGGVMTADSCGTNLNHAVTAVGYGTAEDGTPYWLLKNQWGSTWG 326

Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           E GY++LQR +     G CG+A +ASYPV
Sbjct: 327 EEGYLRLQRGV-----GACGVAKDASYPV 350


>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 202/351 (57%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG         
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGHVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G        L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNI-PNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GC+GG M  AF FI +NGG+ SE DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341


>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
 gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
 gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 202/351 (57%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S        + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341


>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
 gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
          Length = 327

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 137/313 (43%), Positives = 192/313 (61%), Gaps = 17/313 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTY--KVGLNKFADLTNEEYR 109
           +++W  +HGK  N       R  I++ N +++DEHN+    +   VG+N+FADL + E+ 
Sbjct: 22  WESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLESSEFG 81

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
            +Y G  +    +  +SKV    ++ K GD LP SVDWR KG V  +K+QG CGSCWAFS
Sbjct: 82  RLYNGYNNKPSMKKAQSKV----FSTKVGD-LPTSVDWRTKGFVTAIKNQGQCGSCWAFS 136

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
            VA +EG +   TG L+SLSEQ LVDC   + N GCNGGLMD AFQ++I+NGG+D+E  Y
Sbjct: 137 AVAGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGIDTEASY 196

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPF--DEMSLKKAVADQPVSVAIEAGGRAFQHY 286
           PY   + KC  +  N    +  G+ D+ P   +           P+SVAI+A   +FQ Y
Sbjct: 197 PYKAVDQKCKFNAANVG-STCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHTSFQLY 255

Query: 287 ESGVFTGECGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
           +SGV++    S  +LDHGV AVGY + +GV YW+V+NSWG+ WG+ GY+ + RN      
Sbjct: 256 KSGVYSESACSQTSLDHGVTAVGYDSSSGVAYWIVKNSWGTTWGQAGYIWMSRN----KN 311

Query: 345 GKCGIAMEASYPV 357
            +CGIA  ASYP+
Sbjct: 312 NQCGIATAASYPI 324


>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
          Length = 340

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 141/322 (43%), Positives = 194/322 (60%), Gaps = 16/322 (4%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVG 96
           S S  T D+    +  ++++  K        E R Q +K N+ FI+ HNS N   ++ +G
Sbjct: 29  SQSLYTADQDHIDFVHFMSRFSKAYKSKEEFEMRLQQYKSNIAFINNHNSQNDGTSFTLG 88

Query: 97  LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
            N  AD T++EY+ M LG +        ++K   + Y+     ++PES+DWREKGAVN V
Sbjct: 89  PNHLADYTHDEYKKM-LGYKP-------RNKTGKEVYSTPNLKDIPESIDWREKGAVNAV 140

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFI 216
           KDQG CGSCWAFST+A++E    I TG+L SLSEQ+LVDC +  N GCNGG M  A  +I
Sbjct: 141 KDQGQCGSCWAFSTIASLESRYFIETGKLQSLSEQQLVDCSKNGNEGCNGGDMGLAMDYI 200

Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVADQPVSVA 275
              GG+++E+DYPY+G +  C  +   +K V+ D G+ ++ P    +L+ A+A+ PVSVA
Sbjct: 201 ASAGGVETEKDYPYVGKDQTC--AFEASKEVATDKGHINIVPGKFATLQAAIAEGPVSVA 258

Query: 276 IEAGGRAFQHYESGVFTGE-CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
           IEA    FQ Y SG+F    CG+ LDHGV AVGYG +NG  Y++VRNSW   WG  GY+ 
Sbjct: 259 IEADSLFFQFYRSGIFDSSWCGTNLDHGVAAVGYGVDNGKQYYIVRNSWSDSWGLKGYIN 318

Query: 335 LQRNLLDTNTGKCGIAMEASYP 356
           +  N      G CGI ME   P
Sbjct: 319 IIAN--GDGNGMCGIQMEPVVP 338


>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 202/351 (57%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S        + D++P ++DWRE GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341


>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
           heavy chain; Contains: RecName: Full=Cathepsin L light
           chain; Flags: Precursor
 gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
          Length = 339

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 148/324 (45%), Positives = 199/324 (61%), Gaps = 19/324 (5%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
           D +   + T+  +H K          R +IF +N   I +HN L      +YK+GLNK+A
Sbjct: 22  DLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYA 81

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSK--VASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           D+ + E++    G  +   R+LM+ +  +    Y   A   +P+SVDWRE GAV  VKDQ
Sbjct: 82  DMLHHEFKETMNG-YNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQ 140

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
           G CGSCWAFS+  A+EG +    G L+SLSEQ LVDC  K  N GCNGGLMD AF++I  
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 200

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVADQ-PVSVAI 276
           NGG+D+E+ YPY G ++ C  ++  A + + D G+ D+   DE  +KKAVA   PVSVAI
Sbjct: 201 NGGIDTEKSYPYEGIDDSCHFNK--ATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAI 258

Query: 277 EAGGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYV 333
           +A   +FQ Y  GV+   EC    LDHGV+ VGYGT E+G+DYWLV+NSWG+ WGE GY+
Sbjct: 259 DASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYI 318

Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
           K+ RN       +CGIA  +SYP 
Sbjct: 319 KMARN----QNNQCGIATASSYPT 338


>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
          Length = 330

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 146/317 (46%), Positives = 191/317 (60%), Gaps = 25/317 (7%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEE 107
           +  + A HGKT         R +IF DN + I+ HN+       +YK+ +N F DL   E
Sbjct: 27  WHVFKAMHGKTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMVHE 86

Query: 108 YRAMYLGTR--SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
           ++A+  G +   D KR        +      +   LP++VDWR+KGAV PVKDQG CGSC
Sbjct: 87  FKALMNGFKMSPDTKR--------NGELYFPSNSNLPKTVDWRQKGAVTPVKDQGQCGSC 138

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
           W+FS   ++EG   + TG+L+SLSEQ LVDC     N GC GGLMD AFQ++  N G+D+
Sbjct: 139 WSFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQAFQYVSDNKGIDT 198

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRA 282
           E  YPY   EN C    +  KV   D G+ D+   DE +L+ A+A   P+SVAI+A   +
Sbjct: 199 EASYPYEARENTC--RFKKNKVGGTDKGHVDIPAGDEKALQNALATVGPISVAIDANHGS 256

Query: 283 FQHYESGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
           FQ Y  GV+    C S  LDHGV+AVGYGTENG DYWLV+NSWG  WGENGY+K+ RN  
Sbjct: 257 FQFYSKGVYNEPNCSSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGENGYIKIARN-- 314

Query: 341 DTNTGKCGIAMEASYPV 357
             ++  CGIA  ASYP+
Sbjct: 315 --HSNHCGIASMASYPL 329


>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 133/308 (43%), Positives = 188/308 (61%), Gaps = 8/308 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+++HG+         +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A
Sbjct: 39  HELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLA 98

Query: 111 MYLGTRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
            + G              +++ +    + D++P ++DWRE GAV  VK QG CG CWAFS
Sbjct: 99  KFTGLNIPNSYLSPSPLSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFS 158

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
            V ++EG  KI TG L+  SEQEL+DC    N GCNGG M  AF FII+NGG+  E DY 
Sbjct: 159 AVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYE 217

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           YLG +  C    + A  V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G
Sbjct: 218 YLGQQYTCRSQEKTA-AVQISSYK-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGG 274

Query: 290 VFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
            + G C   ++H V A+GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C 
Sbjct: 275 TYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLCD 333

Query: 349 IAMEASYP 356
           IA  +SYP
Sbjct: 334 IAKMSSYP 341


>gi|52076128|dbj|BAD46641.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|52076135|dbj|BAD46648.1| putative cysteine proteinase [Oryza sativa Japonica Group]
          Length = 374

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 150/336 (44%), Positives = 198/336 (58%), Gaps = 26/336 (7%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNG---MGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLN 98
            +++ + ++YQ W   +G  S+    +     RF++FK N R+I + N     +YK+GLN
Sbjct: 34  ESEESMWSLYQRWRHVYGAASSSPRDLADKGSRFEVFKKNARYIHDFNRKKGMSYKLGLN 93

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           KFADLT EE+ A Y G        L K+   S   A  AGD  P + DWRE GAV  VKD
Sbjct: 94  KFADLTLEEFTAKYTGANPGPITGL-KNGTGSPPLAAVAGDA-PPAWDWREHGAVTRVKD 151

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAG-CNGGLMDYAFQFII 217
           QG CGSCWAFS V AVEGIN I+TG L++LSEQ+++DC     AG C+GG   YAF + +
Sbjct: 152 QGPCGSCWAFSVVEAVEGINAIMTGNLLTLSEQQVLDCS---GAGDCSGGYTSYAFDYAV 208

Query: 218 QNG---------GMDSEQDYPYLGAENKCDPSR---RNAKVVSIDGYEDVSPFDEMSLKK 265
            NG             E  + Y   E   +P R     A +V ID Y  V P DE +LK+
Sbjct: 209 SNGITLDQCFSPPTTGENYFYYPAYEAVQEPCRFDPNKAPIVKIDSYSFVDPNDEEALKQ 268

Query: 266 AVADQ-PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSW 323
           AV  Q PVSV IEA    F  Y+ GVF+G CG+ L+H V+ VGY  TE+G  YW+V+NSW
Sbjct: 269 AVYSQGPVSVLIEA-SYEFMIYQGGVFSGPCGTELNHAVLVVGYDETEDGTPYWIVKNSW 327

Query: 324 GSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           G+ WGE+GY+++ RN +    G CGIAM   YP+K+
Sbjct: 328 GAGWGESGYIRMIRN-IPAPEGICGIAMYPIYPIKS 362


>gi|150261413|pdb|2PNS|A Chain A, 1.9 Angstrom Resolution Crystal Structure Of A Plant
           Cysteine Protease Ervatamin-C Refinement With Cdna
           Derived Amino Acid Sequence
 gi|150261414|pdb|2PNS|B Chain B, 1.9 Angstrom Resolution Crystal Structure Of A Plant
           Cysteine Protease Ervatamin-C Refinement With Cdna
           Derived Amino Acid Sequence
 gi|166007115|pdb|2PRE|A Chain A, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
           Complexed With Irreversible Inhibitor E-64 At 2.7 A
           Resolution
 gi|166007116|pdb|2PRE|B Chain B, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
           Complexed With Irreversible Inhibitor E-64 At 2.7 A
           Resolution
          Length = 208

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 126/218 (57%), Positives = 155/218 (71%), Gaps = 11/218 (5%)

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           LPE +DWR+KGAV PVK+QG CGSCWAFSTV+ VE IN+I TG LISLSEQ+LVDC++K 
Sbjct: 1   LPEQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGNLISLSEQQLVDCNKK- 59

Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
           N GC GG   YA+Q+II NGG+D+E +YPY   +  C   R   KVV IDGY+ V   +E
Sbjct: 60  NHGCKGGAFVYAYQYIIDNGGIDTEANYPYKAVQGPC---RAAKKVVRIDGYKGVPHCNE 116

Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
            +LKKAVA QP  VAI+A  + FQHY+SG+F+G CG+ L+HGVV VGY      DYW+VR
Sbjct: 117 NALKKAVASQPSVVAIDASSKQFQHYKSGIFSGPCGTKLNHGVVIVGYWK----DYWIVR 172

Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           NSWG  WGE GY++++R       G CGIA    YP K
Sbjct: 173 NSWGRYWGEQGYIRMKR---VGGCGLCGIARLPYYPTK 207


>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
          Length = 337

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 142/349 (40%), Positives = 202/349 (57%), Gaps = 24/349 (6%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +S L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MSILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G        +  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLN------IPNSYL 110

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
           +       + D++P ++DWRE GAV  VK+QG CG CWAFS V ++EG  KI TG L+  
Sbjct: 111 SPSPINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEF 170

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  V 
Sbjct: 171 SEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTA-AVQ 228

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
           I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C + ++H V A+GY
Sbjct: 229 ISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCANRINHAVTAIGY 286

Query: 309 GT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GT E G  YWL++NSWG+ WGE+G++K+ R+  +   G C IA  +SYP
Sbjct: 287 GTDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNP-AGLCDIAKVSSYP 334


>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
          Length = 334

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 141/320 (44%), Positives = 197/320 (61%), Gaps = 26/320 (8%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           +  W  K G++ N     +KR QI+  N   +  HN++    + TY++G+  +ADL +EE
Sbjct: 26  FHAWKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLEHEE 85

Query: 108 YR----AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           ++     + LG+ + +K R   S +   R+       LP+++DWR+ G V PVK+QGSCG
Sbjct: 86  FKQTVFGVCLGSFNASKPRGGSSFLKMHRFY-----NLPQTIDWRQWGFVTPVKNQGSCG 140

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
           SCW+FS+  A+EG N   TG L+SLSEQELVDC     N GCNGG MD AF++I+  GG+
Sbjct: 141 SCWSFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYIVNKGGI 200

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
            +E  YPY G   +C   R N   +  +  GY D+   +E +LK+AVA   PVSVAI A 
Sbjct: 201 HTEDSYPYEGQVGQC---RANYGEIGATCTGYYDIPSGNEHALKEAVATFGPVSVAIHAS 257

Query: 280 GRAFQHYESGVFTGE--CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
            ++FQ Y SGV+      G+ALDH V+ VGYGTE G DYWLV+NSWG  WG+ GY+K+ R
Sbjct: 258 DQSFQLYHSGVYNNPYCSGTALDHAVLIVGYGTEYGQDYWLVKNSWGPAWGDQGYIKMSR 317

Query: 338 NLLDTNTGKCGIAMEASYPV 357
           N  +    +CGIA  AS+P+
Sbjct: 318 NRYN----QCGIASAASFPL 333


>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
          Length = 337

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 142/349 (40%), Positives = 202/349 (57%), Gaps = 24/349 (6%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +S L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MSILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G        +  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLN------IPNSYL 110

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
           +       + D++P ++DWRE GAV  VK+QG CG CWAFS V ++EG  KI TG L+  
Sbjct: 111 SPSPINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEF 170

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  V 
Sbjct: 171 SEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTA-AVQ 228

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
           I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C + ++H V A+GY
Sbjct: 229 ISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCANRINHAVTAIGY 286

Query: 309 GT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GT E G  YWL++NSWG+ WGE+G++K+ R+  +   G C IA  +SYP
Sbjct: 287 GTDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNP-AGLCDIAKVSSYP 334


>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 325

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 142/312 (45%), Positives = 189/312 (60%), Gaps = 14/312 (4%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRA 110
           +  W A H +          R +I+  NL  I+EHN+  R +Y +G+N+F DL + E+ A
Sbjct: 21  FAEWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAA 80

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
            YLG R +          AS  Y  +    LP+SVDWR  G V PVK+QG CGSCW+FST
Sbjct: 81  KYLGVRFNGVN--ATKSFASSTYLPRM-VSLPDSVDWRTAGIVTPVKNQGQCGSCWSFST 137

Query: 171 VAAVEGINKIVTGELISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
             +VEG +   TG L+SLSEQ LVDC  ++ N GCNGGLMD AF++II+NGG+D+E  YP
Sbjct: 138 TGSVEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYP 197

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYES 288
           Y      C  +  N    ++  Y+D+    E  L+ AVA   PVSVAI+A    FQ Y +
Sbjct: 198 YTATTGTCKFNAANIG-ATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFT 256

Query: 289 GVFT-GECGSA-LDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           GV+   +C +  LDHGV+AVGYGT   G DYWLV+NSWG+ WG+ GY+ + RN  +    
Sbjct: 257 GVYNEKKCSTTQLDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKAGYIWMSRNADN---- 312

Query: 346 KCGIAMEASYPV 357
           +CGIA  ASYP+
Sbjct: 313 QCGIATSASYPL 324


>gi|294885991|ref|XP_002771503.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
 gi|239875207|gb|EER03319.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
          Length = 337

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 143/313 (45%), Positives = 197/313 (62%), Gaps = 17/313 (5%)

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
           KHGK+ +      KR  IF DNL +I+E N+ N +YK+G+N++ DLT EE+ A+ L + +
Sbjct: 33  KHGKSYDNKDEEMKRAAIFHDNLNYIEEVNAQNLSYKLGVNEYTDLTLEEFAALKL-SST 91

Query: 118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
           D    +    VA    A      LP SVDWR+KG +NPVKDQG CGSCWAFS + A+E  
Sbjct: 92  DMSEGMGDGFVAG---AGPTTTTLPTSVDWRKKGVLNPVKDQGYCGSCWAFSAIGALEPR 148

Query: 178 NKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
             I TG+L+SLSEQ+LVDC     N GCNGGLMD AF++ I+  G+D E  YPY+G++  
Sbjct: 149 YAIATGKLLSLSEQQLVDCAGAYGNEGCNGGLMDKAFEY-IKATGVDKESTYPYVGSDET 207

Query: 237 CDPSRRNA----KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
           C  +  N      V  + G + +    E +L + VA  PVS+A+ A  ++FQHY+SGV++
Sbjct: 208 CQATVENKTDGLPVGEVTGNQMLHQ-TEKALMEGVAAAPVSIAMYANLQSFQHYKSGVYS 266

Query: 293 -GEC---GSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
              C   G ++DHGVVAVGYGTENG DY+++RNSWG  WG++GYV L+R +   + G+C 
Sbjct: 267 DPNCNAKGGSIDHGVVAVGYGTENGQDYFIIRNSWGRSWGQDGYVYLKRGV--GSFGQCN 324

Query: 349 IAMEASYPVKNSQ 361
           I      P   S+
Sbjct: 325 IYKYMCVPTLKSR 337


>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 134/309 (43%), Positives = 190/309 (61%), Gaps = 10/309 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+++HG+         +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A
Sbjct: 39  HELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLA 98

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
            + G  +     L  S ++S  +     + D++P ++DWRE GAV  VK QG CG CWAF
Sbjct: 99  KFTGL-NIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAF 157

Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           S V ++EG  KI TG L+  SEQEL+DC    N GC+GG M  AF FI +NGG+  E DY
Sbjct: 158 SAVGSLEGAYKIATGNLMEFSEQELLDCTTN-NYGCDGGFMTNAFDFIKENGGISRESDY 216

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
            YLG +  C    + A  V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  
Sbjct: 217 EYLGEQYTCRSQEKTA-AVQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAG 273

Query: 289 GVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           G + G C   ++H V A+GYGT E G  YWL++NSWG+ WGENG++K+ R+  +  +G C
Sbjct: 274 GTYDGSCADRINHAVTAIGYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-SGLC 332

Query: 348 GIAMEASYP 356
            IA  +SYP
Sbjct: 333 DIAKMSSYP 341


>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
 gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
 gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 357

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 147/371 (39%), Positives = 216/371 (58%), Gaps = 31/371 (8%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMT-IYQTWLAKH 59
           +A++S  LA   L+ + +   +   + +          +++   DD  M   Y+ W A H
Sbjct: 3   IASSSFSLAAILLIIIMYCCPTGLVEAA------RKGPAAAGGGDDSAMRERYEKWAADH 56

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFADLTNEEYRAMY---LG 114
           G+T        +RF++F+ N  FID  N+    ++ ++  NKFADLTNEE+   Y     
Sbjct: 57  GRTYKDSLEKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEFAEYYGRPFS 116

Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
           T        M   V +         ++P +++WR++GAV  VK+Q  C SCWAFS VAAV
Sbjct: 117 TPVIGGSGFMYGNVRTS--------DVPANINWRDRGAVTQVKNQKDCASCWAFSAVAAV 168

Query: 175 EGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
           EGI++I +  L++LS Q+L+DC   + N GCN G MD AF++I  NGG+ +E DYPY   
Sbjct: 169 EGIHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAESDYPY--E 226

Query: 234 ENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
           +      R + K V  SI G++ V P +E +L  AVA QPVSVA++  G+  Q + SGVF
Sbjct: 227 DRALGTCRASGKPVAASIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKVSQFFSSGVF 286

Query: 292 TG----ECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
                  C + L+H + AVGYGT E+G  YWL++NSWG+DWGE GY+K+ R++  +NTG 
Sbjct: 287 GAMQNETCTTDLNHAMTAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKIARDVA-SNTGL 345

Query: 347 CGIAMEASYPV 357
           CG+AM+ SYPV
Sbjct: 346 CGLAMQPSYPV 356


>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
           purpuratus]
          Length = 336

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 147/356 (41%), Positives = 209/356 (58%), Gaps = 29/356 (8%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           +S L FL  I   + A  + +    N D  S W            W   H K+     H 
Sbjct: 1   MSNLTFLVAIGLVACATAAFVK-PTNPDLDSRWLE----------WKIAHTKSYTNDMHE 49

Query: 70  EKRFQIFKDNLRFIDEHN---SLNRT-YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
            +R  ++++N++ I+ HN   SL++  +++G+N++ D+   E R+   G +S        
Sbjct: 50  LERRLVWEENVKMINMHNLDHSLHKKGFRLGMNEYGDMRLHEVRSTMNGYKSSNV----- 104

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
           +KV    +   +  ++P++VDWR KG V PVK+QG CGSCWAFST  ++EG     T +L
Sbjct: 105 TKVQGSTFLTPSNIQVPDTVDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKL 164

Query: 186 ISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           +SLSEQ LVDC R + N GC GGLMD  FQ++I N G+DSE  YPY   +  C   + + 
Sbjct: 165 VSLSEQNLVDCSRTEGNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDETCH-YKASC 223

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVF-TGECGSA-LDH 301
               + G+ DV+  DE +L +AVA   PVSVAI+A  ++FQ YESGV+   EC S+ LDH
Sbjct: 224 DSAEVTGFTDVTSGDEQALMEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSSSELDH 283

Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GV+ VGYGT+ G DYWLV+NSWG  WG +GY+K+ RN     + +CGIA  ASYP+
Sbjct: 284 GVLVVGYGTDGGKDYWLVKNSWGETWGLSGYIKMSRN----KSNQCGIATSASYPL 335


>gi|348531523|ref|XP_003453258.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 341

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 145/317 (45%), Positives = 198/317 (62%), Gaps = 20/317 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
           +  W  K  K+ +      +R QI+ +N + +  HN L     ++Y++G+ +FAD+ NEE
Sbjct: 33  FHAWKLKFEKSYDSESDEAQRKQIWLNNRKHVLVHNILADQGLKSYRLGMTQFADMENEE 92

Query: 108 Y-RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           Y R +  G        L   +  S  +    G  LP++VDWR+KG V  V++Q  CGSCW
Sbjct: 93  YKRLVSQGCLHSFNSSL--PRRGSTFFRLPKGTVLPDTVDWRDKGYVTNVQNQMDCGSCW 150

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
           AFS   ++EG +   TG+L+SLS+Q+LVDC  +  N GCNGGLMD AFQ+I  NGG+D+E
Sbjct: 151 AFSATGSLEGQHFRKTGKLVSLSKQQLVDCSGEFGNEGCNGGLMDSAFQYIQANGGIDTE 210

Query: 226 QDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRA 282
           + YPY   + KC   R N K    +  GY DV P +E +LK+AVA   P+SVAI+A   +
Sbjct: 211 ESYPYEAEDGKC---RYNPKSTGATCTGYVDVQPANEETLKEAVATIGPISVAIDAFHPS 267

Query: 283 FQHYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
           FQ YESGV+   +C S  LDH V+AVGYGTENG+DYWLV+NS G  WGE GY+K+ RN  
Sbjct: 268 FQFYESGVYDEPDCSSTMLDHAVLAVGYGTENGLDYWLVKNSAGVGWGEKGYIKMSRN-- 325

Query: 341 DTNTGKCGIAMEASYPV 357
              + +CGIA  ASYP+
Sbjct: 326 --KSNQCGIATAASYPL 340


>gi|449469176|ref|XP_004152297.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 340

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 133/319 (41%), Positives = 197/319 (61%), Gaps = 12/319 (3%)

Query: 42  WRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
           + ++  +M +Y+ W + H + S       KRF+IF+DN + + + N + ++ K+ LN+FA
Sbjct: 31  FESEKSLMQLYKRW-SSHHRISRNAHEMHKRFKIFQDNAKRVFKVNHMGKSLKLRLNQFA 89

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           DL+++E+  MY    +       K+      +  +    +P S+DWREKGAVN +K+QG 
Sbjct: 90  DLSDDEFSMMYGSNITHYNNLHAKAGGRVGGFMYERAMNIPFSIDWREKGAVNAIKNQGL 149

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
           C        VAAVE I++I T EL+SLSEQE+VDCD K+  GC GG  D AF+FI+QNGG
Sbjct: 150 C-------AVAAVESIHQIKTNELVSLSEQEVVDCDYKV-GGCRGGNYDSAFEFIMQNGG 201

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           +  E++YPY      C     N++ V+IDGYE V   +E +L KAVA QPV+V++ + G 
Sbjct: 202 ITIEENYPYFAGNGYCRRRGPNSERVTIDGYECVPQNNEYALMKAVAHQPVAVSVASSGS 261

Query: 282 AFQHYESGVFT--GECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
            F+ Y  G+      CG  +DH VV VGYG++   DYW++RN +G+ WG NGY+K+QR  
Sbjct: 262 DFRFYGEGMLREGSFCGYRIDHTVVVVGYGSDEEGDYWIIRNQYGTQWGMNGYMKMQRGT 321

Query: 340 LDTNTGKCGIAMEASYPVK 358
            +   G CG+AM+ S+PVK
Sbjct: 322 RNPQ-GVCGMAMQPSFPVK 339


>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 332

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 137/289 (47%), Positives = 192/289 (66%), Gaps = 22/289 (7%)

Query: 75  IFKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY 133
           +FK+N+ +I+  +N+ ++ YK  +N+FA              +   K  +  S +    +
Sbjct: 57  VFKENVNYIEACNNAADKPYKRDINQFA-------------PKKRFKGHMCSSIIRITTF 103

Query: 134 ACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS-EQE 192
             +     P +VD R+K AV P+KDQG CG  WA S VAA EGI+ +  G+LI LS EQE
Sbjct: 104 KFENVTATPSTVDCRQKVAVTPIKDQGQCGCFWALSAVAATEGIHALXAGKLILLSSEQE 163

Query: 193 LVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP--SRRNAKVVSI 249
           LVDCD K ++  C GGLMD AF+FIIQN G+++E +YPY G + KC+   + +NA  + I
Sbjct: 164 LVDCDTKGVDQDCQGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNAYEADKNAATI-I 222

Query: 250 DGYEDVSPFDEMS-LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
            GYEDV   +E + L+KAVA+ PVSVAI+A G  FQ Y+SGVFTG CG+ LDHGV AVGY
Sbjct: 223 TGYEDVPANNEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGY 282

Query: 309 G-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           G +++G +YWLV+NS G++WGE GY+++QR  +D+    CGIA++ASYP
Sbjct: 283 GVSDDGTEYWLVKNSRGTEWGEEGYIRMQRG-VDSEEALCGIAVQASYP 330


>gi|294885989|ref|XP_002771502.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
 gi|239875206|gb|EER03318.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
          Length = 337

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 143/313 (45%), Positives = 197/313 (62%), Gaps = 17/313 (5%)

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
           KHGK+ +      KR  IF DNL +I+E N+ N +YK+G+N++ DLT EE+ A+ L + +
Sbjct: 33  KHGKSYDNKEEEMKRAAIFHDNLNYIEEVNAQNLSYKLGVNEYTDLTLEEFAALKL-SST 91

Query: 118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
           D    +    VA    A      LP SVDWR+KG +NPVKDQG CGSCWAFS + A+E  
Sbjct: 92  DMSEGMGDGFVAG---AGPTTTTLPTSVDWRKKGVLNPVKDQGYCGSCWAFSAIGALEPR 148

Query: 178 NKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
             I TG+L+SLSEQ+LVDC     N GCNGGLMD AF++ I+  G+D E  YPY+G++  
Sbjct: 149 YAIATGKLLSLSEQQLVDCAGAYGNEGCNGGLMDKAFEY-IKATGVDKESTYPYVGSDET 207

Query: 237 CDPSRRNA----KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT 292
           C  +  N      V  + G + +    E +L + VA  PVS+A+ A  ++FQHY+SGV++
Sbjct: 208 CQATVENKTDGLPVGEVTGNQMLHQ-TEKALMEGVAAAPVSIAMYANLQSFQHYKSGVYS 266

Query: 293 -GEC---GSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
              C   G ++DHGVVAVGYGTENG DY+++RNSWG  WG++GYV L+R +   + G+C 
Sbjct: 267 DPNCNAKGGSIDHGVVAVGYGTENGQDYFIIRNSWGRSWGQDGYVYLKRGV--GSFGQCN 324

Query: 349 IAMEASYPVKNSQ 361
           I      P   S+
Sbjct: 325 IYKYMCVPTLKSR 337


>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
          Length = 324

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 142/311 (45%), Positives = 191/311 (61%), Gaps = 16/311 (5%)

Query: 53  QTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT--YKVGLNKFADLTNEEYRA 110
           + W A+HGK+         R   ++ N ++IDEHN       Y + +N+F DL N E+++
Sbjct: 23  RAWKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLENSEFKS 82

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
           +Y G R     R  K  V + R       +LP SVDW +KG V PVK+QG CGSCW+FS 
Sbjct: 83  LYNGYRMSNAPRKGKPFVPAARV-----QDLPASVDWSKKGWVTPVKNQGQCGSCWSFSA 137

Query: 171 VAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
             ++EG +   TG L+SLSEQ LVDC   + N GCNGGLMD AF+++I+N G+D+E  YP
Sbjct: 138 TGSMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGIDTEASYP 197

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYES 288
           Y   ++ C  +  +    +I GY DV+   E  L+ AVA   PVSVAI+A   +FQ Y S
Sbjct: 198 YRAVDSTCKFNTADVG-ATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYSS 256

Query: 289 GVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
           GV+    C S  LDHGV+AVGYGT+   DYWLV+NSWG+ WG +GY+++ RN    +  K
Sbjct: 257 GVYDPLICSSTNLDHGVLAVGYGTDGSKDYWLVKNSWGASWGMSGYIEMVRN----HNNK 312

Query: 347 CGIAMEASYPV 357
           CGIA  ASYPV
Sbjct: 313 CGIATSASYPV 323


>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
           occidentalis]
          Length = 469

 Score =  254 bits (650), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 142/319 (44%), Positives = 196/319 (61%), Gaps = 25/319 (7%)

Query: 49  MTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS---LNRTYKVGLNKFADLTN 105
           +T ++ +    GKT  G  H  ++  IF+ NL  I++ N+    +R Y +G+ +FAD++ 
Sbjct: 163 LTNFEHFKEHFGKTYEGDEHALRQ-GIFQRNLAHIEKFNAEKAASRGYTLGITQFADMST 221

Query: 106 EEYRAMYLGTRSDAK-----RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
            E+R  YLG R +A      R+L +  VA  R       +LPE+VDWR+KGAV+PVKDQG
Sbjct: 222 AEFRQTYLGLRMNASTIAKLRKLQREVVADDR-------DLPEAVDWRDKGAVSPVKDQG 274

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
            CGSCWAFST  A+EG + +  GEL+SLSEQ++VDC   ++ GCNGG    A +++  NG
Sbjct: 275 QCGSCWAFSTSGAIEGQHFLKNGELLSLSEQQMVDCSW-LDFGCNGGQPMLAMEYVRFNG 333

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
           G++ E  YPY G    C   +++A    I G+     + E +L+KAVA   P+SV ++A 
Sbjct: 334 GLELETAYPYKGVGGSCHSDKKSA-AAKITGFWMAGFYSESALQKAVAKVGPISVGMDAS 392

Query: 280 GRAFQHYESGVFTGE-CGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
           G  FQHY+SG++  E C S  LDH V+AVGYGT +  DYWLV+NSW + WGE GY KL R
Sbjct: 393 GEDFQHYKSGIYNPESCSSIGLDHAVLAVGYGTSDDGDYWLVKNSWNTSWGEKGYFKLPR 452

Query: 338 NLLDTNTGKCGIAMEASYP 356
           N       KCGIA    YP
Sbjct: 453 N----KGNKCGIATTPIYP 467


>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
 gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
          Length = 326

 Score =  254 bits (650), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 137/307 (44%), Positives = 183/307 (59%), Gaps = 15/307 (4%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLG 114
           W + HGK+ + +     R  I++ NL  I  HN+ + +YK+ +N   DLT +E+R  YLG
Sbjct: 30  WKSYHGKSYSDVHEERTRMAIWQQNLEKIKRHNAEDHSYKMAMNHLGDLTEDEFRYFYLG 89

Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
            R+        +K     Y   +  ++P SVDW +KG V  VK+QG CGSCWAFST  +V
Sbjct: 90  VRAHHN----STKRGWATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSV 145

Query: 175 EGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
           EG +   TG L+SLSEQ L+DC     N GC GGLMD AF++I  NGG+D+E  YPYLG 
Sbjct: 146 EGQHFRKTGSLVSLSEQNLIDCSGSYGNNGCQGGLMDNAFRYIESNGGIDTESSYPYLGQ 205

Query: 234 ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFT 292
           +  C  S  +     + GY+D+    E +L+ AVA   PVSVA++A    +Q Y SGV+ 
Sbjct: 206 QGSCHFSSSHVG-ARVTGYQDIPQGSEQALQSAVATVGPVSVAVDA--SQWQFYSSGVYD 262

Query: 293 GE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
              C S  LDHGV+ +GYG  NG DYWLV+NSWG  WG  GY+ + RN       +CGIA
Sbjct: 263 NPYCSSTQLDHGVLVIGYGNYNGQDYWLVKNSWGYSWGVEGYIMMSRN----KNNQCGIA 318

Query: 351 MEASYPV 357
             ASYP+
Sbjct: 319 SSASYPL 325


>gi|307103885|gb|EFN52142.1| hypothetical protein CHLNCDRAFT_139276 [Chlorella variabilis]
          Length = 388

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 138/311 (44%), Positives = 191/311 (61%), Gaps = 19/311 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           +  W   HG++        KR  +F +N + + E N+ N    + LN+FADLT EE+ A 
Sbjct: 46  FSQWQMTHGRSYKSASEARKRQAVFVENAKHVAEQNARNSGLVLALNQFADLTLEEFAAT 105

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           +LG  + + R   +    S +YA    ++LP +VDWR+K AV PVK+Q  CGSCWAFS  
Sbjct: 106 HLG-YNPSLREGKEHTTTSFQYA--DANDLPSTVDWRKKNAVTPVKNQAMCGSCWAFSAT 162

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
            AVEGIN I TG+L+SLSEQ+LVDCD + + GC GGLMD+AF +I +NGG+DSE DY Y 
Sbjct: 163 GAVEGINAIRTGKLVSLSEQQLVDCDSEKDLGCGGGLMDFAFDYITKNGGIDSEDDYSYW 222

Query: 232 GAENKCDPSRR-NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
           G    C   +  +  VV+IDG+EDV   D  +LKKA+A QPVS+           Y SGV
Sbjct: 223 GYGLICQRRKEADRHVVTIDGFEDVPKNDGEALKKAIAHQPVSL-----------YHSGV 271

Query: 291 FTGE-CGSALDHGVVAVGY--GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
              + C   L+HGV+AVGY  G++ G  +++++NSWG  WGE G+ +L     +  +G C
Sbjct: 272 VGDDACCQDLNHGVLAVGYDDGSKGGTPHYVIKNSWGEGWGEQGFFRLAAKSSEA-SGAC 330

Query: 348 GIAMEASYPVK 358
           G+   ASYP+K
Sbjct: 331 GVYKAASYPLK 341


>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
          Length = 337

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 144/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRF--QIFKDNLRFIDEHNSL----NRTYKVGLNK 99
           D +   +QT+  +H K  N +   E+RF  +IF +N   I +HN L      ++K+GLNK
Sbjct: 21  DVIKEEWQTFKMEHRK--NFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNK 78

Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           ++D+   E++    G     ++ L     +   Y   A  ++P+SVDWR+ GAV  VKDQ
Sbjct: 79  YSDMLYHEFKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQ 138

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
           G CGSCWAFS+ AA+EG +    G L+SLSEQ LVDC  K  N GCNGGLMD AF++I  
Sbjct: 139 GHCGSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 198

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIE 277
           NGG+D+E+ YPY G ++ C  ++         G+ D+   DE +L KAVA   PVSVAI+
Sbjct: 199 NGGIDTEKSYPYEGIDDSCHFTKSGVGATDT-GFVDIPQGDEEALMKAVATMGPVSVAID 257

Query: 278 AGGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVK 334
           A   +FQ Y  GV+   EC +  LDHGV+ VGYGT+  G+DYWLV+NSWG+ WG+ GY+K
Sbjct: 258 ASHESFQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTWGDQGYIK 317

Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
           + RN       +CGIA  +SYP 
Sbjct: 318 MARN----QDNQCGIATASSYPT 336


>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
          Length = 338

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 142/314 (45%), Positives = 191/314 (60%), Gaps = 15/314 (4%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           +  + A H K        + R +I+ +N   + +HN L     ++Y+V +NKF DL + E
Sbjct: 31  WHLFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHE 90

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           +R++  G +   K++      ++  +   A  E+PESVDWREKGA+ PVKDQG CGSCWA
Sbjct: 91  FRSIMNGYQH--KKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQCGSCWA 148

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS+  A+EG     TG+LISLSEQ L+DC  K  N GCNGGLMD AFQ+I  N G+D+E 
Sbjct: 149 FSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTEN 208

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
            YPY   ++ C  + RN   V   G+ D+   +E  LK AVA   PVSVAI+A   +FQ 
Sbjct: 209 TYPYEAEDDVCRYNPRNRGAVD-RGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQF 267

Query: 286 YESGV-FTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           Y  GV +   C S  LDHGV+ VGYG++NG DYWLV+NSW   WG+ GY+K+ RN     
Sbjct: 268 YSKGVYYEPSCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARN----R 323

Query: 344 TGKCGIAMEASYPV 357
              CG+A  ASYP+
Sbjct: 324 KNHCGVATAASYPL 337


>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 143/351 (40%), Positives = 202/351 (57%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S  +     + D++P ++DWRE GAV  VK QG CG CWAFS V ++E   KI TG L+
Sbjct: 116 SSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341


>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 337

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 128/309 (41%), Positives = 191/309 (61%), Gaps = 7/309 (2%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A+HGK        E+  QIF++N+ FI+  +   ++++ +  N+FADL +EE++A
Sbjct: 32  HEKWMAQHGKVYKDAAEKERCLQIFENNMEFIESFDVCGDKSFNLSTNQFADLHDEEFKA 91

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS- 169
           + L      +  L  +     RY      ++P S+DWR++G V P+KDQG C SCWAFS 
Sbjct: 92  L-LTNGHKKEHSLWTTTETLFRY--DNVTKIPASMDWRKRGVVTPIKDQGKCLSCWAFSL 148

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
            VA +EG+++I+T EL+ LSEQELVD  +  + GC G  ++ AF+FI + G ++SE  YP
Sbjct: 149 CVATIEGLHQIITSELVPLSEQELVDFVKGESEGCYGDYVEDAFKFITKKGRIESETHYP 208

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y G  N C   +    V  I GY+ V    E +L KAVA+Q VSV++EA   AFQ Y SG
Sbjct: 209 YKGVNNTCKVKKETHGVAQIKGYKKVPSKSENALLKAVANQLVSVSVEARDSAFQFYSSG 268

Query: 290 VFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           +FTG+CG+  DH V    YG + +G  YWL +NSWG++WGE GY++++ + +    G CG
Sbjct: 269 IFTGKCGTDTDHRVALASYGESGDGTKYWLAKNSWGTEWGEKGYIRIKXD-IPAKEGLCG 327

Query: 349 IAMEASYPV 357
           IA    YP+
Sbjct: 328 IAKYPYYPI 336


>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
          Length = 362

 Score =  254 bits (649), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 156/371 (42%), Positives = 217/371 (58%), Gaps = 33/371 (8%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMT----IYQTWL---AK 58
           M+  I+ +  L  +S++  A           +H+S+ +   +  T     ++TW      
Sbjct: 1   MYTLIAVICVLTVVSAAPQAVNWFEIQPAKVEHASNLKLQVKASTRLGPYHETWKEFKTL 60

Query: 59  HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLG 114
            GK  + +    KRF IF+D L  I+EHN       ++Y +G+N+F+D++++EY      
Sbjct: 61  FGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFSDMSHDEY------ 114

Query: 115 TRSDAKRRLMKSKVASQRYAC----KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
            R +  RR   ++  S+   C    K+G +L + VDWR+KG V PVK+QG CGSCW+FST
Sbjct: 115 LRHNGLRR--GNRKYSKGEGCDSYTKSGKQLDDKVDWRDKGYVTPVKNQGQCGSCWSFST 172

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
             ++EG +   TG+LISLSEQ+LVDC     N GCNGGLMD AF++I   GG++ E DYP
Sbjct: 173 TGSLEGQHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDNAFEYIKSIGGLEGEDDYP 232

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYES 288
           Y   + KC   +   K     G  DV   DE +LK A+A   P+SVAI+A   +FQ Y+ 
Sbjct: 233 YTAKQGKCHLKKSLFKANDT-GCTDVESGDEDALKDALASVGPISVAIDASHASFQSYDG 291

Query: 289 GVF-TGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           GV+   EC S  LDHGV+ VGYGT ENG DYWLV+NSWG  WGE GY+K+ RN       
Sbjct: 292 GVYDEEECSSQNLDHGVLTVGYGTEENGGDYWLVKNSWGEMWGEEGYIKMSRN----KDN 347

Query: 346 KCGIAMEASYP 356
           +CGIA +ASYP
Sbjct: 348 QCGIATQASYP 358


>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
          Length = 339

 Score =  254 bits (649), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 146/313 (46%), Positives = 195/313 (62%), Gaps = 20/313 (6%)

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYL 113
           +H K  +       R +I+  N   I +HN         Y++ +NK+ADL +EE+     
Sbjct: 33  QHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVN 92

Query: 114 G-TRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           G  R+D+K+ L   ++     +   A  E+P +VDWR+KGAV PVKDQG CGSCW+FS  
Sbjct: 93  GFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSCWSFSAT 152

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
            A+EG +   TG+L+SLSEQ LVDC  K  N GCNGG+MDYAFQ+I  NGG+D+E+ YPY
Sbjct: 153 GALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGGIDTEKSYPY 212

Query: 231 LGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYE 287
              ++ C     N K V  +  GY D+   DE +LKKA+A   PVS+AI+A   +FQ Y 
Sbjct: 213 EAIDDTC---HFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHESFQFYS 269

Query: 288 SGV-FTGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            GV +  +C S  LDHGV+AVGYGT E G DYWLV+NSWG+ WG+ GYVK+ RN    + 
Sbjct: 270 EGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMARN----HD 325

Query: 345 GKCGIAMEASYPV 357
             CG+A  ASYP+
Sbjct: 326 NHCGVATCASYPL 338


>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
          Length = 319

 Score =  254 bits (649), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 135/268 (50%), Positives = 172/268 (64%), Gaps = 14/268 (5%)

Query: 72  RFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
           RF  FK ++  I  HN+L N +Y +GLN+FADL+ EE++  Y G +   +R   +S    
Sbjct: 61  RFNQFKASVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYFGCK-HVEREFARSNNLH 119

Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE--LISL 188
           Q       +  P S+DWR   AV P+KDQG CGSCWAFS   ++EG   ++ G+  L SL
Sbjct: 120 QEV-----EAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIEGA-WVLQGKHTLTSL 173

Query: 189 SEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           SEQ+LVDC     NAGCNGGLMDYAF++II N G+ +E  YPY G    C  S    KVV
Sbjct: 174 SEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYPYKGVGGLCQKS--CTKVV 231

Query: 248 SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           +I G++DV+  DE S   AV    PVSVAIEA    FQ Y SGVF+G CG  LDHGV+AV
Sbjct: 232 TISGHKDVASGDEASSLNAVGTVGPVSVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAV 291

Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVK 334
           GYGT    DYW+V+NSWG+ WGE+GY++
Sbjct: 292 GYGTTGSQDYWIVKNSWGTSWGESGYIR 319


>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
          Length = 333

 Score =  254 bits (649), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 144/320 (45%), Positives = 190/320 (59%), Gaps = 20/320 (6%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADL 103
           + T ++ + A H K+         RF+IF +N   +  HN        +YK+G+N+F DL
Sbjct: 23  LRTQWEAFKATHKKSYQSNMEELLRFKIFSENSLLVARHNEKYARGLVSYKLGMNQFGDL 82

Query: 104 TNEEYRAMYLGTRS--DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
              E+  M+ G R    A R       A+  Y+      LP+S+DWREKGAV PVK+QG 
Sbjct: 83  LPHEFARMFNGYRGARTAGRGSTFLPPANVNYS-----SLPQSMDWREKGAVTPVKNQGQ 137

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNG 220
           CGSCWAFST  ++EG + + TG L+SLSEQ LVDC     N GC GGLMD AFQ+I  NG
Sbjct: 138 CGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSETFGNHGCEGGLMDNAFQYIKANG 197

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
           G+D+E+ YPY   + +C   ++N       G+ D+    E  LKKAVA   PVSVAI+A 
Sbjct: 198 GIDTEKSYPYEAEDGECRFKKQNVGATDT-GFVDIEQGSEDDLKKAVATVGPVSVAIDAS 256

Query: 280 GRAFQHYESGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
             +FQ Y  GV+   EC S  LDHGV+ VGYG E+G  YWLV+NSW   WG+NGY+K+ R
Sbjct: 257 HSSFQLYSEGVYDETECSSEQLDHGVLVVGYGVEDGKKYWLVKNSWAESWGDNGYIKMSR 316

Query: 338 NLLDTNTGKCGIAMEASYPV 357
           +       +CGIA  ASYP+
Sbjct: 317 D----KDNQCGIASAASYPL 332


>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
 gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
          Length = 339

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 146/313 (46%), Positives = 194/313 (61%), Gaps = 20/313 (6%)

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYL 113
           +H K  +       R +I+  N   I +HN         Y++ +NK+ADL +EE+     
Sbjct: 33  QHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVN 92

Query: 114 G-TRSDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           G  R+D+K+ L   ++     +   A  E+P +VDWR+KGAV PVKDQG CGSCW+FS  
Sbjct: 93  GFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSCWSFSAT 152

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
            A+EG +   TG+L+SLSEQ LVDC  K  N GCNGG+MDYAFQ+I  NGG+D+E+ YPY
Sbjct: 153 GALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGGIDTEKSYPY 212

Query: 231 LGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYE 287
              ++ C     N K V  +  GY D+   DE +LKKA+A   PVS+AI+A   +FQ Y 
Sbjct: 213 EAIDDTC---HFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHESFQFYS 269

Query: 288 SGV-FTGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            GV +  +C S  LDHGV+AVGYGT E G DYWLV+NSWG+ WG+ GYVK+ RN      
Sbjct: 270 EGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMARN----RD 325

Query: 345 GKCGIAMEASYPV 357
             CG+A  ASYP+
Sbjct: 326 NHCGVATCASYPL 338


>gi|298705581|emb|CBJ28832.1| Cathepsin L-like proteinase [Ectocarpus siliculosus]
          Length = 553

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 132/331 (39%), Positives = 195/331 (58%), Gaps = 19/331 (5%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN--RTYKVGLNKFAD 102
           D +V   ++ W+A+HG T    G  ++R +IF +N   ID HN+ N   T+ +  N+F+ 
Sbjct: 36  DAKVANRFRAWMAQHGVTFGTKGEFDRRLKIFAENSDLIDTHNTANDGSTFTLSHNEFSH 95

Query: 103 LTNEEYRAMYLG----------TRSDAKRRLMKSKVASQRYACK-AGDELPESVDWREKG 151
           L+ +E++  + G           R   +RR M+     +R   +  G E+P+ VDW  +G
Sbjct: 96  LSWDEFKETHFGYKRSSDKPKPARQTPERRPMEKVAGGRRRLVELTGSEIPDEVDWVREG 155

Query: 152 AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDY 211
           AV PV++QG CGSCWAFST+ A+EG   + T +LI  SE++LVDCD K++ GC GG M+ 
Sbjct: 156 AVTPVQNQGMCGSCWAFSTIGAMEGAYYLATDDLIKFSEEQLVDCD-KVDKGCFGGDMEQ 214

Query: 212 AFQFIIQNGGMDSEQDYPYLG---AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA 268
           AF +I +NGG+  E +YPY+G       C  +    +   +  +  V   DE  +     
Sbjct: 215 AFDWIKENGGVCPEDEYPYVGLWPPFKTCATTCTPVEGSQVKEWAQVKATDEALMTALAT 274

Query: 269 DQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDW 327
             P+++AIEA   AFQ Y  GV+T  CG  LDHGV+AVGYGT E+G DYW V+NSWG  W
Sbjct: 275 VGPIAIAIEADQMAFQFYSDGVYTAPCGDKLDHGVLAVGYGTWEDGTDYWKVKNSWGDSW 334

Query: 328 GENGYVKLQR-NLLDTNTGKCGIAMEASYPV 357
           G+ GY+ L+R +  +   G+CG+ +EA YP+
Sbjct: 335 GQGGYILLERADSEEDEGGQCGLLIEAIYPI 365


>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
 gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
          Length = 345

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 140/355 (39%), Positives = 207/355 (58%), Gaps = 20/355 (5%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           LVFLF       A  S  S D            D +M  ++ W+A++G+          R
Sbjct: 7   LVFLFLFLCVMWASPSAASCD---------EPSDPMMKQFEEWMAEYGRVYKDNDEKMLR 57

Query: 73  FQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
           FQIFK+N+  I+  N+ N  +Y +G+N+F D+TN E+ A Y G        + +  V S 
Sbjct: 58  FQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGL--SLPLNIKREPVVS- 114

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
            +       +P+S+DWR+ GAV  VK+QG CGSCWAF+++A VE I KI  G L+SLSEQ
Sbjct: 115 -FDDVDISSVPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQ 173

Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
           +++DC   ++ GC GG ++ A+ FII N G+ S   YPY  A+  C  +        I  
Sbjct: 174 QVLDC--AVSYGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCK-TNGVPNSAYITR 230

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
           Y  V   +E ++  AV++QP++ A++A G  FQHY+ GVFTG CG+ L+H +V +GYG +
Sbjct: 231 YTYVQRNNERNMMYAVSNQPIAAALDASGN-FQHYKRGVFTGPCGTRLNHAIVIIGYGQD 289

Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
            +G  +W+VRNSWG+ WGE GY++L R+ + ++ G CGIAM+  YP   S  S +
Sbjct: 290 SSGKKFWIVRNSWGAGWGEGGYIRLARD-VSSSFGLCGIAMDPLYPTLQSGPSVE 343


>gi|219362839|ref|NP_001136636.1| uncharacterized protein LOC100216764 precursor [Zea mays]
 gi|194696462|gb|ACF82315.1| unknown [Zea mays]
 gi|413934556|gb|AFW69107.1| hypothetical protein ZEAMMB73_554980 [Zea mays]
          Length = 361

 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 153/361 (42%), Positives = 212/361 (58%), Gaps = 26/361 (7%)

Query: 11  STLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE 70
           + LV +  +S++ AA  S I Y   HD +S    ++ +  +Y+ W A H   +  +G   
Sbjct: 13  AALVVVIALSTTPAA--SAIDY-TEHDLAS----EESLWALYERWCA-HYNMARDLGEKT 64

Query: 71  KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEY------RAMYLGTR--SDAKRR 122
           +RF +FK+N   I EHN  N TY +GLN+F+D+T+EE+      R ++   +  SD +  
Sbjct: 65  RRFNLFKENAHRIYEHNQGNATYTLGLNRFSDMTDEEFSRSPYGRCLFAPVQRISDGENE 124

Query: 123 LMKS----KVASQRYACKAGDELPESVDWREKGAVNPVKDQG-SCGSCWAFSTVAAVEGI 177
            ++               A   LP SVDWR + +V  VKDQG +CGSCWAF+ +AAVEGI
Sbjct: 125 ELQQHEDVSFNLTHGGATAALGLPPSVDWRGR-SVTRVKDQGLTCGSCWAFAAIAAVEGI 183

Query: 178 NKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
           N I T  L++LSEQ+LVDCD  ++ GC GG +  A  FI++N G+  E  YPY+G + +C
Sbjct: 184 NAIRTWSLVTLSEQQLVDCD-NVDHGCAGGWIPSALDFIVRNRGIVPEGTYPYIGTQGRC 242

Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS 297
                 A  V+IDGY  V PFD  +L  AVA QPV+VA+E+   AF+HY+ GVF G CG 
Sbjct: 243 --RHVMAPPVTIDGYRRVLPFDVNALMSAVAAQPVAVAMESSAWAFRHYQGGVFNGNCGG 300

Query: 298 ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            L H    VGYG   G  +W+V+NSWG  WGE GYV++ RN  +   G CGI  +  YPV
Sbjct: 301 RLGHAAAVVGYGDGAGGPFWIVKNSWGPKWGEGGYVRISRNAPN-RLGICGILTQPLYPV 359

Query: 358 K 358
           K
Sbjct: 360 K 360


>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
          Length = 323

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 144/314 (45%), Positives = 193/314 (61%), Gaps = 21/314 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN--SLNRTYKVGLNKFADLTNEEYR 109
           ++ W  +H K  +       R++I++ N + I+ HN  S    + +G+NKF DL + E+ 
Sbjct: 22  WEDWKNEHNKKYSDDLEELTRYKIWQGNQKIIEVHNANSDKFGFTLGMNKFGDLESHEFA 81

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
            M+ G    A+    K  VA   Y      +   +VDWR KGAV  VK+QG CGSCWAFS
Sbjct: 82  EMFNGYMMQARSNSTKVFVADPNY------KADPTVDWRTKGAVTGVKNQGQCGSCWAFS 135

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           T  ++EG + + TG+L+SLSEQ LVDC  ++ N GCNGGLMD AF++I +NGG+D+E  Y
Sbjct: 136 TTGSLEGQHFLKTGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYIKKNGGIDTEASY 195

Query: 229 PYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
           PY   + +C   R  A  V  +  GY D+   DE +L +AV    PVSVAI+A   +FQ 
Sbjct: 196 PYQAHDERC---RFKASDVGATCTGYVDIKREDENALMQAVEKIGPVSVAIDASHSSFQL 252

Query: 286 YESGV-FTGECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           Y SGV +  EC  +ALDHGV+A+GYGTE G DYWLV+NSWG+DWG  GY+ + RN     
Sbjct: 253 YRSGVYYERECSQTALDHGVLAIGYGTEGGSDYWLVKNSWGTDWGMEGYIMMSRN----R 308

Query: 344 TGKCGIAMEASYPV 357
              CGIA EASYP 
Sbjct: 309 NNNCGIATEASYPT 322


>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
 gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
          Length = 340

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 145/314 (46%), Positives = 194/314 (61%), Gaps = 21/314 (6%)

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYL 113
           +H K  +       R +I+  N   I +HN         +++ +NK+ DL +EE+     
Sbjct: 33  QHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLHEEFVQTLN 92

Query: 114 G-TRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
           G  R++AK+ ++K     +   Y   A  E+P++VDWREKGAV PVKDQG CGSCW+FS 
Sbjct: 93  GFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQGHCGSCWSFSA 152

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
             A+EG +   TG+L+SLSEQ LVDC  K  N GCNGG+MD+AFQ+I  NGG+D+E+ YP
Sbjct: 153 TGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDNGGIDTEKAYP 212

Query: 230 YLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHY 286
           Y   ++ C     N K V  +  G+ D+   DE +L KA+A   PVSVAI+A   +FQ Y
Sbjct: 213 YEAIDDTC---HYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAIDASHESFQFY 269

Query: 287 ESGV-FTGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
             GV +  +C S  LDHGV+AVGYGT E G DYWLV+NSWG+ WG+ GYVK+ RN     
Sbjct: 270 SEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMARN----R 325

Query: 344 TGKCGIAMEASYPV 357
              CGIA  ASYP+
Sbjct: 326 DNHCGIATAASYPL 339


>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
 gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
           proteinase II; Flags: Precursor
 gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
          Length = 337

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 143/355 (40%), Positives = 204/355 (57%), Gaps = 22/355 (6%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           M L+I+ +  L  +S S  +  ++ S+    D    W   +     ++ ++         
Sbjct: 1   MRLSITLIFTLIVLSISFISAGNVFSHKQYQDSFIDWMRSNNKAYTHKEFMP-------- 52

Query: 66  MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
                 R++ FK N+ ++   NS      +GLN+ ADL+NEEYR  YLGTR+  K     
Sbjct: 53  ------RYEEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIKLNGYH 106

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
            +    R   +   + P +VDWREK AV PVKDQG CGSC++FST  +VEG+  I TG+L
Sbjct: 107 KRNLGLRLN-RPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKL 165

Query: 186 ISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
           +SLSEQ ++DC     N GCNGGLM  AF++II+N G++SE+ YPY    N     +  +
Sbjct: 166 VSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGS 225

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA--LDHG 302
               I  Y+++   DE  L+ A+   PVSVAI+A   +FQ Y +GV+     S+  LDHG
Sbjct: 226 VAAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHG 285

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           V+AVG GT+NG DY++V+NSWG  WG NGY+ + RN  D N   CGI+  ASYP+
Sbjct: 286 VLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHMARN-KDNN---CGISTMASYPI 336


>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 143/351 (40%), Positives = 201/351 (57%), Gaps = 21/351 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           ++ L+ LFF+          IS  N      S +    V   ++ W+++HG+        
Sbjct: 8   MNILITLFFV----------ISMFNTQTRGRS-QPKLSVSERHELWMSRHGRVYKDEVEK 56

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            +RF IFK+N++FI+  N   N +YK+G+N+FAD+T++E+ A + G  +     L  S +
Sbjct: 57  VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGL-NIPNSYLSPSPM 115

Query: 129 ASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           +S        + D++P ++DW E GAV  VK QG CG CWAFS V ++EG  KI TG L+
Sbjct: 116 SSTELKINDLSDDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 175

Query: 187 SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             SEQEL+DC    N GCNGG M  AF FI +NGG+  E DY YLG +  C    + A  
Sbjct: 176 EFSEQELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTA-A 233

Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
           V I  Y+ V P  E SL +AV  QPVS+ I A  +  Q Y  G + G C   ++H V A+
Sbjct: 234 VQISSYQ-VVPEGETSLLQAVTKQPVSIGI-AASQDLQFYAGGTYDGSCADRINHAVTAI 291

Query: 307 GYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           GYGT E G  YWL++NSWG+ WGENG++K+ R+  +   G C IA  +SYP
Sbjct: 292 GYGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNP-AGLCDIAKMSSYP 341


>gi|351629617|gb|AEQ54772.1| KDEL-tailed cysteine proteinase CP4, partial [Coffea canephora]
          Length = 215

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 124/211 (58%), Positives = 156/211 (73%), Gaps = 3/211 (1%)

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
           G CGSCWAFSTV  VEGINKI TG+L+SLSEQELVDC+   N GCNGGLM+ A++FI ++
Sbjct: 1   GKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCETD-NEGCNGGLMENAYEFIKKS 59

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
           GG+ +E+ YPY   +  CD S+ NA  V+IDG+E V   DE +L KAVA+QPVSVAI+A 
Sbjct: 60  GGITTERLYPYKARDGSCDSSKMNAPAVTIDGHEMVPANDENALMKAVANQPVSVAIDAS 119

Query: 280 GRAFQHYESGVFTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQR 337
           G   Q Y  GV+TG+ CG+ LDHGV  VGYGT  +G  YW+V+NSWG+ WGE GY+++QR
Sbjct: 120 GSDMQFYSEGVYTGDSCGNELDHGVAVVGYGTALDGTKYWIVKNSWGTGWGEQGYIRMQR 179

Query: 338 NLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
            +     G CGIAMEASYP+K S ++ KP P
Sbjct: 180 GVDAAEGGVCGIAMEASYPLKLSSHNPKPSP 210


>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
          Length = 324

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 138/314 (43%), Positives = 193/314 (61%), Gaps = 17/314 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           +++W  K+GK+  G G    R ++++ NL+ + +HN L       Y++G+N +ADL NEE
Sbjct: 19  WESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEE 78

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           + A+    +        K K ++Q +    G  LP SVDWR +G V PVKDQG CGSCW 
Sbjct: 79  FMAL----KGSGGLLQAKDKSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWT 134

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS   ++EG +   TG L+SLSEQ+LVDC  R  N GCNGGLM+ A+ +I   GG++ E 
Sbjct: 135 FSATGSLEGQHFAKTGNLLSLSEQQLVDCAGRYGNYGCNGGLMESAYDYIKGVGGVELES 194

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
            YPY   + +C   R    V +  GY  +   DE +L +AV    PV+V+I+A G +FQ 
Sbjct: 195 AYPYTARDGRCKFDRSKV-VATCKGYVVIPVGDEQALMQAVGTIGPVAVSIDASGYSFQL 253

Query: 286 YESGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           YESGV+    C S  LDHGV+AVGYGTE G +YWLV+NSWG  WG+ GY+K+ +   D N
Sbjct: 254 YESGVYDFRRCSSTNLDHGVLAVGYGTEGGQNYWLVKNSWGPGWGDQGYIKMSK---DKN 310

Query: 344 TGKCGIAMEASYPV 357
             +CGIA ++ YP+
Sbjct: 311 N-QCGIATDSCYPL 323


>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
          Length = 324

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 137/314 (43%), Positives = 190/314 (60%), Gaps = 17/314 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           +++W  K+GK+  G G    R ++++ NL+ + +HN L       Y++G+N +ADL NEE
Sbjct: 19  WESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEE 78

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           + A+    +  +     K + ++Q +    G  LP SVDWR +G V PVKDQG CGSCW+
Sbjct: 79  FMAL----KGSSGILQAKDQSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWS 134

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS   ++EG +   TG L+SLSEQ+LVDC     N GC+GGLM+ A+ +I   GG+  E 
Sbjct: 135 FSATGSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYGCSGGLMESAYDYIRDAGGVQLES 194

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
            YPY     +C   +  A V +  G+  +   DE SL +AV    PV+VAI+A G  FQ 
Sbjct: 195 AYPYTAQNGRCHFDQSKA-VATCTGHVAIPSGDEQSLMQAVGTVGPVAVAIDASGYDFQL 253

Query: 286 YESGVF-TGEC-GSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           YESGV+    C  S+LDHGV+A GYGTE G DYWLV+NSWG  WG  GY+K+ RN     
Sbjct: 254 YESGVYDRSRCSSSSLDHGVLAAGYGTEGGNDYWLVKNSWGPGWGAQGYIKMSRN----K 309

Query: 344 TGKCGIAMEASYPV 357
           + +CGIA  A YP+
Sbjct: 310 SNQCGIATMACYPL 323


>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
          Length = 336

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 151/331 (45%), Positives = 200/331 (60%), Gaps = 39/331 (11%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEK----RFQIFKDNLRFID----EHNSLNRTYKVGL 97
           DE   +++ W +K         ++EK    R  +++ NL+ I+    EH+    TY +G+
Sbjct: 25  DEHWNLWKDWHSKK--------YHEKEEGWRRMVWEKNLKKIELHNLEHSMGKHTYSLGM 76

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           N F D+T+EE+R +  G +  ++R+L  S      +      E P SVDWR+KG V PVK
Sbjct: 77  NHFGDMTHEEFRQIMNGYKLKSQRKLRGSLFMEPNFL-----EAPRSVDWRDKGYVTPVK 131

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFI 216
           DQG CGSCWAFST  A+EG +   TG L+SLSEQ LVDC R + N GCNGGLMD AFQ+I
Sbjct: 132 DQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI 191

Query: 217 IQNGGMDSEQDYPYLGA-ENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
             NGG+DSE+ YPYLG  E  C  DPS  +A      G+ DV    E +L KAVA   PV
Sbjct: 192 KDNGGLDSEESYPYLGTDEGPCHYDPSYNSANDT---GFVDVPSGSERALMKAVASVGPV 248

Query: 273 SVAIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
           SVAI+AG  +FQ Y SG+ +  EC S  LDHGV+ VGYG E    +G  YW+V+NSW  +
Sbjct: 249 SVAIDAGHESFQFYHSGIYYDKECSSEELDHGVLVVGYGFEGKDVDGKKYWIVKNSWSEN 308

Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           WG+ GY+ + ++        CGIA  ASYP+
Sbjct: 309 WGDKGYIYMAKD----KKNHCGIATAASYPL 335


>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
          Length = 360

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 145/309 (46%), Positives = 202/309 (65%), Gaps = 22/309 (7%)

Query: 59  HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLG 114
           H KT + +    +RF+IF++N++ I+EHN L     ++Y +G+N+F+DL +EE+   Y G
Sbjct: 63  HDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHEEF-VKYNG 121

Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
                K+  +K    S   A     E P+SVDWR+KG V  VK+QG CGSCW+FST  ++
Sbjct: 122 L----KKTSLKDGGCSSYLAANNLVE-PDSVDWRKKGYVTDVKNQGQCGSCWSFSTTGSL 176

Query: 175 EGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
           EG +   +G+L+SLSE +LVDC +   N GCNGGLMD AF++I   GG++SE+DYPY   
Sbjct: 177 EGQHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIKSVGGLESEEDYPYKPK 236

Query: 234 ENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVF 291
           +  C     + KV + D G  DV    E +LKKAV++  PVSVAI+A   +FQ Y  GV+
Sbjct: 237 QGTC--KFDDTKVAATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSFQSYAGGVY 294

Query: 292 -TGECGS-ALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
              EC S  LDHGV+ VGYGT++ G DYW+V+NSWG++WGE+GYVK+ RN       +CG
Sbjct: 295 DEPECSSEQLDHGVLCVGYGTDDQGQDYWIVKNSWGAEWGEDGYVKMSRN----KKNQCG 350

Query: 349 IAMEASYPV 357
           IA +ASYP+
Sbjct: 351 IATQASYPL 359


>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
          Length = 351

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 145/320 (45%), Positives = 191/320 (59%), Gaps = 15/320 (4%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS----LNRTYKVGLNKFADLTNEE 107
           + T+  +H K          R +IF DN   I +HNS       +YK+ +NK+ D+ + E
Sbjct: 34  WMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDMLHHE 93

Query: 108 YRAMYLGTRSDAKRRLMKSKV-ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           +  +  G       +L   ++     +   A   LP+ VDWR++GAV PVKDQG CGSCW
Sbjct: 94  FVNILNGFNKSINTQLRSERLPVGASFIEPANVVLPKKVDWRKEGAVTPVKDQGHCGSCW 153

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
           +FS   A+EG +   TG L+SLSEQ L+DC  K  N GCNGGLMD AFQ+I  N G+D+E
Sbjct: 154 SFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTE 213

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
             YPY    +KC  +  N+  + + GY D+   DE  LK AVA   PVSVAI+A  ++FQ
Sbjct: 214 ASYPYEAENDKCRYNPANSGAIDV-GYIDIPTGDEKLLKAAVATIGPVSVAIDASHQSFQ 272

Query: 285 HYESGV-FTGECGS-ALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
            Y  GV +  EC S  LDHGV+ +GYGT ENG DYWLV+NSWG  WG NGY+K+ RN L+
Sbjct: 273 FYSEGVYYEPECSSEELDHGVLVIGYGTNENGQDYWLVKNSWGETWGNNGYIKMARNKLN 332

Query: 342 TNTGKCGIAMEASYPVKNSQ 361
                CGIA  ASYP+  S+
Sbjct: 333 ----HCGIASSASYPLVGSK 348


>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
 gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
          Length = 331

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 136/310 (43%), Positives = 192/310 (61%), Gaps = 12/310 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++ W + HGK          R  I+++NL+ I  HN    ++K+ +N   D+T+ E    
Sbjct: 29  WKAWKSFHGKEYPNKNEETMRNFIWQNNLKKIVTHNEGKHSFKLAMNHLGDMTSLEISQT 88

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
            LG +    ++  +S+     +   A  ++ +S+DWR KG V PVK+QG CGSCWAFST 
Sbjct: 89  LLGLKL---KKHAESQPKGATFLPPANVKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTT 145

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
            A+EG +   TG+L+SLSEQ LVDC  K  N GC GGLMD AFQ+I +NGG+D+E+ YPY
Sbjct: 146 GALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPY 205

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESG 289
           L  +  C    ++A      G+ D+   DE +L++A+A   P+S+AI+A    F  Y  G
Sbjct: 206 LAKDGVCH-YNKSAIGAKDTGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQG 264

Query: 290 VF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           V+   +C S  LDHGV+AVGYGT++G DYWLV+NSWG  WGE GY+K+ RN  D    KC
Sbjct: 265 VYDDPDCSSTRLDHGVLAVGYGTDDGKDYWLVKNSWGPSWGEEGYIKIARNDHD----KC 320

Query: 348 GIAMEASYPV 357
           G+A +ASYP+
Sbjct: 321 GVASKASYPL 330


>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
          Length = 343

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 154/358 (43%), Positives = 202/358 (56%), Gaps = 28/358 (7%)

Query: 14  VFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRF 73
           +FLF I +  A   +I  ++              V   + T+  +H K          R 
Sbjct: 3   LFLFLIVAVLATAQAISFFE-------------LVNQEWTTFKMEHNKVYKNDVEERFRM 49

Query: 74  QIFKDNLRFIDEHNS----LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
           +IF DN   I +HN        +YK+ +NK+ D+ + E+     G       +L   ++ 
Sbjct: 50  KIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLP 109

Query: 130 -SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
            +  +   A   LP++VDWRE GAV PVKDQG CGSCW+FS   A+EG +   TG LI L
Sbjct: 110 IAASFIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPL 169

Query: 189 SEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           SEQ L+DC  K  N GCNGGLMD AFQ+I  N G+D+E  YPY    +KC  +  N+   
Sbjct: 170 SEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGAR 229

Query: 248 SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVV 304
            + GY D+   +E  LK AVA   PVSVAI+A  ++FQ Y  GV +  EC S  LDHGV+
Sbjct: 230 DV-GYVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVL 288

Query: 305 AVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           AVGYGT ENG DYWLV+NSWG  WG+NGY+K+ RN L+     CGIA  ASYP+  SQ
Sbjct: 289 AVGYGTDENGQDYWLVKNSWGETWGDNGYIKMARNKLN----HCGIASTASYPLVGSQ 342


>gi|47230018|emb|CAG10432.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 294

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 147/297 (49%), Positives = 187/297 (62%), Gaps = 20/297 (6%)

Query: 72  RFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEY-RAMYLGTRSDAKRRLMKS 126
           R QI+  N + +  HN L     ++Y++G+ +FAD+ NEEY R + LG          + 
Sbjct: 6   RRQIWLSNRKLVLVHNILADQGIKSYRLGMTQFADMDNEEYKRLISLGCLGAFNASAPRK 65

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
             A  R A   G  LP +VDWR+KG V  VKDQ  CGSCWAFS   ++EG N   TG+L+
Sbjct: 66  GSAFFRLA--EGTPLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGSLEGQNYRKTGKLV 123

Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC--DPSRRN 243
           SLSEQ+LVDC     N GC GGLMD AF++I +NGG+D+E+ YPY   + KC   P    
Sbjct: 124 SLSEQQLVDCSGDYGNMGCGGGLMDSAFKYIQENGGIDTEESYPYEAEDGKCRFKPQNIG 183

Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTG-ECGSA-LD 300
           AK     GY DV+  DE +LK+AVA   PVSVAI+A   +FQ YESGV+   EC S  LD
Sbjct: 184 AKCT---GYVDVTAGDEDALKEAVATIGPVSVAIDASHSSFQLYESGVYDELECSSEDLD 240

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           HGV+AVGYGT+NG DYWLV+NSWG  WG+ GY+ + RN       +CGIA  ASYP+
Sbjct: 241 HGVLAVGYGTDNGQDYWLVKNSWGLGWGQKGYIMMSRN----KHNQCGIASMASYPL 293


>gi|229367042|gb|ACQ58501.1| Cathepsin L precursor [Anoplopoma fimbria]
          Length = 334

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 145/315 (46%), Positives = 191/315 (60%), Gaps = 16/315 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
           +  W  + G++ N      +R +I+  N R +  HN +     ++Y++G+  FAD+ NEE
Sbjct: 26  FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85

Query: 108 Y-RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           Y R +  G        L +   A  R     G +LP SVDWREKG V  VKDQ  CGSCW
Sbjct: 86  YKRQISQGCLGSFNASLPRRGSAYLRLP--EGADLPNSVDWREKGYVTEVKDQKQCGSCW 143

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
           AFST  ++EG     TG+L+SLSEQ+LVDC     N GC GGLMD AF++I  NGG+D+E
Sbjct: 144 AFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDTE 203

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
             YPY   + +C  +  N    +  GY DV   DE +LK+AVA   PVSVAI+A   +FQ
Sbjct: 204 DSYPYEAEDGQCRYNSANIG-ATCTGYVDVKQGDEDALKEAVATIGPVSVAIDASHSSFQ 262

Query: 285 HYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
            YESGV+   EC S+ LDHGV+AVGYG++NG DYWLV+NSWG  WG  GY+ + RN    
Sbjct: 263 LYESGVYDEPECSSSELDHGVLAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRN---- 318

Query: 343 NTGKCGIAMEASYPV 357
              +CGIA  +SYP+
Sbjct: 319 KHNQCGIATASSYPL 333


>gi|403344237|gb|EJY71457.1| Cathepsin L [Oxytricha trifallax]
          Length = 341

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 143/309 (46%), Positives = 187/309 (60%), Gaps = 16/309 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRA 110
           +  +LAK+GK+       E R   +K N+  I  HNS N  T+ +  NKF D T ++YR 
Sbjct: 43  FANFLAKYGKSYGTREEFEFRLNQYKTNMALISAHNSKNGETFTLAANKFTDYTPQQYRK 102

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
           + LG +S       K++  +++YA     ++P SVDWREK AV PVKDQG CGSCWAFST
Sbjct: 103 L-LGYKSK------KNQNDAKKYATFNLTDVPSSVDWREKNAVTPVKDQGQCGSCWAFST 155

Query: 171 VAAVEGINKIVTGELISLSEQELVDCD--RKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
             ++EG + I +G L S SEQ+LVDCD  +  N GCNGG M  A  +  +N  +D E DY
Sbjct: 156 TGSLEGRDAIASGVLQSYSEQQLVDCDFSKDGNQGCNGGDMGLAMAYSAKN-PLDLESDY 214

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY G +  C   +   K  +  G   V P     LK A+A+ PVSVAIEA    FQ Y  
Sbjct: 215 PYEGVDGTCRAKQGQGKSKN-SGSTYVKPNSPDDLKAAIAEGPVSVAIEADSLFFQFYSK 273

Query: 289 GVFTGE-CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           GVF+ + CG+ LDHGV+AVGYGTENG DY+LV+NSW S WG +GY+K+    +  N G C
Sbjct: 274 GVFSSKYCGTNLDHGVLAVGYGTENGSDYYLVKNSWSSGWGLDGYIKIG---VAANEGIC 330

Query: 348 GIAMEASYP 356
           GI ME  +P
Sbjct: 331 GIQMEPVFP 339


>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
          Length = 326

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 144/316 (45%), Positives = 187/316 (59%), Gaps = 23/316 (7%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           +Q + A+HG+    +     R  +F+ N +FID+HN+       T+ + +N+F D+T+EE
Sbjct: 23  WQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSEE 82

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE-LPESVDWREKGAVNPVKDQGSCGSCW 166
             A   G      RR             KA DE LPE VDWR KGAV PVKDQ  CGSCW
Sbjct: 83  IVATMNGFLGAPTRRPAA--------VLKADDETLPEKVDWRTKGAVTPVKDQKQCGSCW 134

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
           AFST  ++EG + +  G+L+SLSEQ LVDC  K  N GC GGLMD AF++I  N G+D+E
Sbjct: 135 AFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGIDTE 194

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
             YPY   + KC     N       GY DV    E +LKKAVA   P+SV I+A    F 
Sbjct: 195 DSYPYEAQDGKCRFDASNVGATDT-GYVDVEHGSESALKKAVATIGPISVGIDASQSTFH 253

Query: 285 HYESGVFTGE-CGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
            Y +GV+  + C S  LDHGV+AVGYG+ ENG D+WLV+NSW + WG+ GY+K+ RN   
Sbjct: 254 FYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRN--- 310

Query: 342 TNTGKCGIAMEASYPV 357
                CGIA +ASYP+
Sbjct: 311 -RNNNCGIASQASYPL 325


>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
          Length = 316

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 147/318 (46%), Positives = 189/318 (59%), Gaps = 26/318 (8%)

Query: 53  QTWLA---KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTN 105
           Q WLA    HGK          R ++F DN + IDEHN+       +YK+ +N   DL  
Sbjct: 11  QEWLAFKAMHGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLGDLMV 70

Query: 106 EEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
            E++A+  G  ++    R  K  V S        + LP+SVDWR++GAV PVKDQG CGS
Sbjct: 71  HEFKALMNGFKKTPNAERNGKIYVPSN-------ENLPKSVDWRQRGAVTPVKDQGHCGS 123

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMD 223
           CW+FS   ++EG   + TG L+SLSEQ LVDC +   N+GC GGLM+ AFQ++  N G+D
Sbjct: 124 CWSFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKGID 183

Query: 224 SEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGR 281
           +E  YPY   EN C    +  KV   D GY D+    E  L+ AVA   P+SV I+A   
Sbjct: 184 TEASYPYEARENNC--RFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVRIDASHE 241

Query: 282 AFQHYESGVFTGE-CG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
           +FQ Y  GV+  + C  S LDHGV+ VGYGTENG DYWLV+NSWG  WGE+GY+K+ RN 
Sbjct: 242 SFQFYSEGVYKEQYCSPSQLDHGVLTVGYGTENGQDYWLVKNSWGPSWGESGYIKIARN- 300

Query: 340 LDTNTGKCGIAMEASYPV 357
              +   CGIA  ASYPV
Sbjct: 301 ---HKNHCGIASMASYPV 315


>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
          Length = 332

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 145/315 (46%), Positives = 192/315 (60%), Gaps = 20/315 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
           ++ W   H K       + +R +I++DNL+ + +HN+ +     +Y +G+NK+ADL  EE
Sbjct: 28  WEAWKQTHSKQYTKEEEDNRR-KIWEDNLQKVSKHNTEHSLGLHSYTLGMNKYADLRGEE 86

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           +  M  G + DA R     K  S      A  + P+SVDWR++G V PVKDQG CGSCWA
Sbjct: 87  FVQMMNGLKFDASRERQGIKFLSY-----AKFQAPDSVDWRDEGYVTPVKDQGQCGSCWA 141

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FST  ++EG +   TG L SLSEQ LVDC     N GC GGLMDYAFQ+I  N G+D+E 
Sbjct: 142 FSTTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIKDNLGIDTED 201

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKA-VADQPVSVAIEAGGRAFQH 285
            YPY   ++ C  S  N       GY DV   DE +LK+A  A+ P+SVAI+A   +FQ 
Sbjct: 202 KYPYEAEDDTCRFSPDNVGATD-SGYVDVDSGDEDALKEACAANGPISVAIDASHESFQL 260

Query: 286 YESGVFTGECGSA--LDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           YESGV+  E  S+  LDHGV+ VGYGT++ G DYW+V+NSWG  WG+ GY+ + RN    
Sbjct: 261 YESGVYDEESCSSIELDHGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYIWMSRN---- 316

Query: 343 NTGKCGIAMEASYPV 357
              +CGIA  ASYP 
Sbjct: 317 KDNQCGIATSASYPT 331


>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
          Length = 325

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 144/316 (45%), Positives = 187/316 (59%), Gaps = 23/316 (7%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           +Q + A+HG+    +     R  +F+ N +FID+HN+       T+ + +N+F D+T+EE
Sbjct: 22  WQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSEE 81

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE-LPESVDWREKGAVNPVKDQGSCGSCW 166
             A   G      RR             KA DE LPE VDWR KGAV PVKDQ  CGSCW
Sbjct: 82  IVATMNGFLGAPTRRPAA--------VLKADDETLPEKVDWRTKGAVTPVKDQKQCGSCW 133

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
           AFST  ++EG + +  G+L+SLSEQ LVDC  K  N GC GGLMD AF++I  N G+D+E
Sbjct: 134 AFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKANKGIDTE 193

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
             YPY   + KC     N       GY DV    E +LKKAVA   P+SV I+A    F 
Sbjct: 194 DSYPYEAQDGKCRFDASNVGATDT-GYVDVEHGSESALKKAVATIGPISVGIDASQSTFH 252

Query: 285 HYESGVFTGE-CGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
            Y +GV+  + C S  LDHGV+AVGYG+ ENG D+WLV+NSW + WG+ GY+K+ RN   
Sbjct: 253 FYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRN--- 309

Query: 342 TNTGKCGIAMEASYPV 357
                CGIA +ASYP+
Sbjct: 310 -RNNNCGIASQASYPL 324


>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 185/308 (60%), Gaps = 19/308 (6%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLG 114
           W   H K  +  G    R+ I+KDN R I EHN     + + +N+F D+TN E++A    
Sbjct: 30  WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFLLKMNQFGDMTNSEFKAF--- 86

Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
                   L    V    +        P++VDWR +G V PVKDQG CGSCWAFST  ++
Sbjct: 87  -----NGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSL 141

Query: 175 EGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
           EG +   TG+L+SLSEQ LVDC     N GCNGGLMD AF +I +N G+DSE  YPY   
Sbjct: 142 EGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENKGIDSEASYPYTAE 201

Query: 234 ENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVF 291
           + KC    +   V + D G+ D+   +E  LK+AVA   P+SVAI+A   +FQ Y SGV+
Sbjct: 202 DGKC--VFKKPSVAATDTGFVDLPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVY 259

Query: 292 T-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
               C S  LDHGV+ VGYGTE+G DYWLV+NSW + WG+ GY+K++RN  +    +CGI
Sbjct: 260 NEPSCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNAKN----QCGI 315

Query: 350 AMEASYPV 357
           A +ASYP+
Sbjct: 316 ATKASYPL 323


>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
          Length = 338

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 142/322 (44%), Positives = 195/322 (60%), Gaps = 15/322 (4%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT----YKVGLNKFA 101
           D V   + ++  +H K  +       R +IF +N   + +HN L       +K+GLNK+A
Sbjct: 21  DLVQEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHNKLFSQGFVKFKLGLNKYA 80

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVA-SQRYACKAGDELPESVDWREKGAVNPVKDQG 160
           D+ + E+ +   G        L  S +  + R+   A  +LP++VDWR+KGAV  VKDQG
Sbjct: 81  DMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTEVKDQG 140

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQN 219
            CGSCW+FS   ++EG +   TG+L+SLSEQ LVDC  R  N GCNGGLMD AF++I  N
Sbjct: 141 HCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYIKDN 200

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
           GG+D+E+ YPYL  + KC    +N+      G+ D+   +E  LK AVA   PVS+AI+A
Sbjct: 201 GGIDTEKSYPYLAEDEKCHYKAQNSGATD-KGFVDIEEANEDDLKAAVATVGPVSIAIDA 259

Query: 279 GGRAFQHYESGVFTG-ECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKL 335
               FQ Y  GV++  EC S  LDHGV+ VGYGT ++G DYWLV+NSWG  WG NGY+K+
Sbjct: 260 SHETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDGQDYWLVKNSWGPSWGLNGYIKM 319

Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
            RN        CG+A +ASYP+
Sbjct: 320 ARN----QDNMCGVASQASYPL 337


>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
 gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
          Length = 349

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 132/314 (42%), Positives = 191/314 (60%), Gaps = 10/314 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           +Q W A++ +T       ++RF ++ +N++FI+  N    +Y++G N+FADLT EE++  
Sbjct: 37  FQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSSYELGENQFADLTEEEFKDT 96

Query: 112 YLGTRSD--AKRRLMKSKVASQRYACKAG----DELPESVDWREKGAVNPVKDQGSCGSC 165
           YL    +  +    M   V +   A  +G    +E P SVDWR KGAV PVK Q  CGSC
Sbjct: 97  YLMKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVKSQQHCGSC 156

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLM-DYAFQFIIQNGGMDS 224
           WAF+ VA++EG++KI TG L+SLSEQE+VDCDR  N     G     A +++ +NGG+ +
Sbjct: 157 WAFAAVASIEGVHKIKTGRLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVTRNGGLTT 216

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           E DYPY+G + +C   +       I G + V   +E +L+ AVA +PV+V+I A  RAFQ
Sbjct: 217 ESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSINA-SRAFQ 275

Query: 285 HYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
            Y+ G+F+G C +  +H V  VGYG   +G  YW+V+NSWG  WGE GYV++QR  +   
Sbjct: 276 FYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRMQRG-VRAR 334

Query: 344 TGKCGIAMEASYPV 357
            G CGIA+   Y V
Sbjct: 335 EGVCGIAIAPFYAV 348


>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
 gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
          Length = 325

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 142/320 (44%), Positives = 194/320 (60%), Gaps = 23/320 (7%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADL 103
            +  +Q + A++GK       +  R  +++ N  FI+ HN        ++ + +N+F D+
Sbjct: 18  TLNEWQQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDM 77

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           T EE  A   G  S  K      KV          DELP++VDWR+KGAV PVKDQ +CG
Sbjct: 78  TTEEINAAMNGFLSAGK------KVPRGTMYQPLVDELPDTVDWRDKGAVTPVKDQKACG 131

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQNGGM 222
           SCWAFS   ++EG + + TG+L+SLSEQ LVDC D+  N GC GGLMD AF++I  N G+
Sbjct: 132 SCWAFSATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGI 191

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAG 279
           D+E+ YPY   E K  P R N+  V  ++  Y D+    E  L+KAVA++ PVSVAI+A 
Sbjct: 192 DTEESYPY---EAKNGPCRFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDAS 248

Query: 280 GRAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
              F  Y  G++  E C S+ LDHGV+AVGYGT++  DYWLV+NSW   WG++GY+K+ R
Sbjct: 249 TSTFHFYSRGIYYDEKCSSSFLDHGVLAVGYGTDDSSDYWLVKNSWNETWGDSGYIKMSR 308

Query: 338 NLLDTNTGKCGIAMEASYPV 357
           N        CGIA +ASYPV
Sbjct: 309 N----RNNNCGIASQASYPV 324


>gi|388509526|gb|AFK42829.1| unknown [Lotus japonicus]
          Length = 333

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 147/354 (41%), Positives = 208/354 (58%), Gaps = 38/354 (10%)

Query: 14  VFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRF 73
           + LFF+   SA   ++       +  S W        +++T   K   T+  +     R 
Sbjct: 7   ICLFFVCVYSAPTFNV-------ELDSHW-------ALFKTTFGKQYSTAEEI----TRR 48

Query: 74  QIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
             ++ N+  I +HN  +     TY +GLN +ADLTN E+  +  G R +A     ++K A
Sbjct: 49  LAWEANVAIIRQHNLEHDLGLHTYTLGLNNYADLTNAEFNQVMNGLRVNAS----QTKSA 104

Query: 130 SQR-YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
           ++R Y    G ELP SVDWR KG V P+KDQG CGSCWAFS+  ++EG +   TG+L+SL
Sbjct: 105 NRRTYVAPVGVELPTSVDWRTKGYVTPIKDQGQCGSCWAFSSTGSLEGQHFAKTGQLVSL 164

Query: 189 SEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
           SEQ L DC +K  N GCNGGLMD AF +I +N G+D+E  YPY   + KC    + A V 
Sbjct: 165 SEQNLTDCSQKQGNMGCNGGLMDQAFTYIKENNGIDTESSYPYKAVDEKCH--FKAADVG 222

Query: 248 SID-GYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTGECGSA--LDHGV 303
           + D GY D++  DE +L+ A+A   P+SVAI+A   +FQ Y SG +     SA  LDHGV
Sbjct: 223 ATDTGYTDIAQQDENALQSAIATVGPISVAIDASHSSFQLYRSGAYNERACSATQLDHGV 282

Query: 304 VAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           +AVGY +E+G DY++V+NSWG+ WG+ GY+ + RN       +CGIA  ++YP 
Sbjct: 283 LAVGYDSEDGKDYYIVKNSWGTSWGQKGYIWMTRN----KNNQCGIATMSTYPT 332


>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
          Length = 334

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 148/357 (41%), Positives = 203/357 (56%), Gaps = 35/357 (9%)

Query: 12  TLVFLF---FISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH 68
           TL+FL    F+  S+A  ++ +  D  H                  + A H K       
Sbjct: 1   TLIFLLGAVFVQLSAALSLTNLLADEWH-----------------LFKATHKKEYPSQLE 43

Query: 69  NEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
            + R +I+ +N   + +HN L     ++Y+V +NKF DL + E+R++  G +   K++  
Sbjct: 44  EKFRMKIYLENKHKVAKHNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQH--KKQNS 101

Query: 125 KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE 184
               ++  +   A  E+PESVDWREKGA+ PVKDQG CG CWAFS+  A+EG     TG+
Sbjct: 102 SRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGK 161

Query: 185 LISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRN 243
           L+SL EQ L+DC  K  N GCNGGLMD AFQ+I  N G+D+E  YPY   ++ C  + RN
Sbjct: 162 LVSLREQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRN 221

Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGS-ALD 300
              V   G+ D+   +E  LK AVA   PVSVAI+A   +FQ Y  GV +   C S  LD
Sbjct: 222 RGAVD-RGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLD 280

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           HGV+ VGYG++NG DYWLV+NSW   WG+ GY+K+ RN        CG+A  ASYP+
Sbjct: 281 HGVLVVGYGSDNGKDYWLVKNSWSEHWGDQGYIKIARN----RKNHCGVATAASYPL 333


>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
          Length = 338

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 140/314 (44%), Positives = 189/314 (60%), Gaps = 15/314 (4%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           +  + A H K        + R +I+ +N   + +HN L     ++Y+V +NKF DL + E
Sbjct: 31  WHLFKATHKKEYPSQLEEKLRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHE 90

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           +R++  G +   K++      ++  +   A  E+PESVDWREKGA+ PVKDQG CGSCWA
Sbjct: 91  FRSIMNGYQH--KKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQCGSCWA 148

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS+  A+EG     TG+L+SLSEQ L+DC  K  N GCNGGLMD AFQ+I  N G+D+E 
Sbjct: 149 FSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTEN 208

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQH 285
            YPY   +  C  + RN   V   G+ D+   +E  LK AVA   PVSVAI+A   +FQ 
Sbjct: 209 TYPYEAEDGVCRYNPRNRGAVD-RGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQF 267

Query: 286 YESG-VFTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           Y  G  +   C S  LDHGV+ VGYG++NG DYWLV+NSW   WG+ GY+K+ RN     
Sbjct: 268 YSKGXYYEPSCDSDDLDHGVLVVGYGSDNGEDYWLVKNSWSEHWGDEGYIKIARN----R 323

Query: 344 TGKCGIAMEASYPV 357
              CG+A  ASYP+
Sbjct: 324 KNHCGVATAASYPL 337


>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
 gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
          Length = 341

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 139/299 (46%), Positives = 185/299 (61%), Gaps = 18/299 (6%)

Query: 72  RFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRR----L 123
           R +I+ ++   I +HN        +YK+G+NK+ D+ + E+     G    AK      +
Sbjct: 47  RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYM 106

Query: 124 MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTG 183
               V   ++   A  +LPE VDWR+ GAV  +KDQG CGSCW+FST  A+EG +   +G
Sbjct: 107 KGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSG 166

Query: 184 ELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
            L+SLSEQ L+DC  +  N GCNGGLMD AF++I  NGG+D+EQ YPY G ++KC  + +
Sbjct: 167 YLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPK 226

Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFT-GECGSA-L 299
           N     + G+ D+   DE  L +AVA   PVSVAI+A   +FQ Y SGV+   EC S  L
Sbjct: 227 NTGAEDV-GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDL 285

Query: 300 DHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           DHGV+ VGYGT E GVDYWLV+NSWG  WGE GY+K+ RN       +CGIA  ASYP+
Sbjct: 286 DHGVLVVGYGTDEQGVDYWLVKNSWGRSWGELGYIKMIRN----KNNRCGIASSASYPL 340


>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
          Length = 324

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 137/307 (44%), Positives = 186/307 (60%), Gaps = 17/307 (5%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLG 114
           W   H K  +  G    R+ I+KDN R I EHN     + + +N+F D+TN E++A    
Sbjct: 30  WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFILKMNQFGDMTNSEFKAF--- 86

Query: 115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
                   L    V    +        P++VDWR +G V PVKDQG CGSCWAFST  ++
Sbjct: 87  -----NGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSL 141

Query: 175 EGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
           EG +   TG+L+SLSEQ LVDC     N GC+GGLMD AF +I +N G+DSE  YPY   
Sbjct: 142 EGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDNAFTYIKENKGIDSEASYPYTAE 201

Query: 234 ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFT 292
           + KC   ++++   +  G+ D+   +E  LK+AVA   P+SVAI+A   +FQ Y SGV+ 
Sbjct: 202 DGKC-VFKKSSVAATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVYN 260

Query: 293 -GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
              C S  LDHGV+ VGYGTE+G DYWLV+NSW + WG+ GY+K++RN  +    +CGIA
Sbjct: 261 EPSCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNAKN----QCGIA 316

Query: 351 MEASYPV 357
            +ASYP+
Sbjct: 317 TKASYPL 323


>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
 gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
          Length = 307

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 143/297 (48%), Positives = 184/297 (61%), Gaps = 18/297 (6%)

Query: 71  KRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
           +R +IF++N + I+ HN+       TY +G N+FA +TN+E+ A  +G      R   KS
Sbjct: 18  RRMEIFENNTKLINLHNNEADLGMHTYWLGHNQFAHMTNDEFVANVIGG-CLLDRNASKS 76

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
                        ELP++VDWR KG V PVK+Q  CGSCWAFST  ++EG     TG+L+
Sbjct: 77  TADRVHQYDSNLVELPDTVDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKKTGKLV 136

Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC--DPSRRN 243
           SLSEQ LVDC  +  N GCNGGLMD AF++I  NGG+D+E  YPY   + KC   P+   
Sbjct: 137 SLSEQNLVDCSGEFGNQGCNGGLMDDAFKYIKANGGIDTEDSYPYEARDGKCRFKPADVG 196

Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGSA-LD 300
           A V    GY D+S  DE +L +AVA   P+SVAI+A    FQ Y  GV +  +C S  LD
Sbjct: 197 ATVT---GYTDISEGDEGALTQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSSTELD 253

Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           HGV+AVGYGTE G DYWLV+NSWG  WG+NGY+ + RN       +CGIA  ASYP+
Sbjct: 254 HGVLAVGYGTEGGKDYWLVKNSWGEVWGQNGYIMMSRN----KNNQCGIATSASYPL 306


>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 141/294 (47%), Positives = 184/294 (62%), Gaps = 19/294 (6%)

Query: 71  KRFQIFKDNLRFIDEHNSLNRT---YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           +RF IF+DNL  I+E N +N +   + +G+N+FAD+TN E+  M LG     + ++    
Sbjct: 48  RRF-IFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEFSNMLLGL--GGRNKIAGDS 104

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
           V    +      +LP  VDW +KG V  VK+QG CGSCWAFST  ++EG     TG+L+S
Sbjct: 105 VFESSHV----QDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTTGSLEGQVFKKTGKLVS 160

Query: 188 LSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           LSEQ LVDC   + N GCNGGLMD AF +I +NGG+D+E  YPY G++  C     N   
Sbjct: 161 LSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPYTGSDGTCR-FLENKVG 219

Query: 247 VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTG-ECGSA-LDHGV 303
            ++ G+ DV   DE +LK+AVA   P+SVAI+A    FQ Y  GV+    C S  LDHGV
Sbjct: 220 ATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVYNPWFCSSTELDHGV 279

Query: 304 VAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + VGYGTE G DYWLV+NSWGS WG  GY+K+ RN       +CGIA +ASYP 
Sbjct: 280 LVVGYGTEGGKDYWLVKNSWGSSWGLKGYIKMVRN----KKNRCGIATQASYPT 329


>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
          Length = 345

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 144/320 (45%), Positives = 191/320 (59%), Gaps = 15/320 (4%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS----LNRTYKVGLNKFADLTNEE 107
           + T+  +H K          R +IF DN   I +HNS       +YK+ +NK+ D+ + E
Sbjct: 28  WMTFKMEHKKAYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDMLHHE 87

Query: 108 YRAMYLGTRSDAKRRLMKSKVA-SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           +  +  G       +L   ++     +   A   LP+ VDWR++GAV PVKDQG CGSCW
Sbjct: 88  FVNILNGFNKSINTQLRSERMPIGASFIEPANVALPKKVDWRKEGAVTPVKDQGHCGSCW 147

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
           +FS   A+EG +   TG L+SLSEQ L+DC  K  N GCNGGLMD AFQ+I  N G+D+E
Sbjct: 148 SFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTE 207

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
             YPY    +KC  +  N+  + + GY D+   +E  LK AVA   PVSVAI+A  ++FQ
Sbjct: 208 ASYPYEAENDKCRYNPANSGAIDV-GYIDIPTGNEKLLKAAVATIGPVSVAIDASHQSFQ 266

Query: 285 HYESGV-FTGECGS-ALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
            Y  GV +  EC S  LDHGV+ +GYGT ENG DYWLV+NSWG  WG NGY+K+ RN L+
Sbjct: 267 FYSEGVYYEPECSSEELDHGVLVIGYGTNENGEDYWLVKNSWGETWGNNGYIKMARNKLN 326

Query: 342 TNTGKCGIAMEASYPVKNSQ 361
                CGIA  ASYP+  S+
Sbjct: 327 ----HCGIASSASYPLVGSK 342


>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
          Length = 328

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 141/317 (44%), Positives = 193/317 (60%), Gaps = 25/317 (7%)

Query: 53  QTWLA---KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTN 105
           + WLA   + GK+         R  ++K+N R IDEHN        +YK+ +N F DL  
Sbjct: 24  EEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYKLKMNHFGDLMQ 83

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
            E++A+      +  +R  K + + + +    G +LP  VDWR+KGAV PVKD G CGSC
Sbjct: 84  HEFKAL------NKLKRSAKQQNSGEVFRATGG-KLPAKVDWRQKGAVTPVKDPGQCGSC 136

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
           WAFS+  ++ G   +   +L+SLSEQ+LVDC     N GC+GG+M  AFQ+I  NGG+D+
Sbjct: 137 WAFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYIKGNGGIDT 196

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRA 282
           E  YPY   ++KC    +   V   D GY D++  DE +LK+AVA+  P+SVAI+AG  +
Sbjct: 197 EGSYPYEAEDDKC--RYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVAIDAGNLS 254

Query: 283 FQHYESGVFTGE--CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
           FQ Y  G++       + LDHGV+ VGYGTENG DYWLV+NSWG  WGENGY+K+ RN  
Sbjct: 255 FQFYSEGIYDEPFCSNTELDHGVLVVGYGTENGQDYWLVKNSWGPSWGENGYIKIARN-- 312

Query: 341 DTNTGKCGIAMEASYPV 357
             +   CGIA  ASYP+
Sbjct: 313 --HNNHCGIASMASYPI 327


>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
          Length = 343

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 147/320 (45%), Positives = 189/320 (59%), Gaps = 15/320 (4%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS----LNRTYKVGLNKFADLTNEE 107
           + T+  +H K          R +IF DN   I +HN        +YK+ +NK+ D+ + E
Sbjct: 28  WTTFKMEHNKVYKNDIEERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHE 87

Query: 108 YRAMYLGTRSDAKRRLMKSKVA-SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           +     G       +L   ++     +   A   LP++VDWRE GAV PVKDQG CGSCW
Sbjct: 88  FVNTLNGFNKSINTQLRSERLPIGASFIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCW 147

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
           +FS   A+EG +   TG LI LSEQ L+DC  K  N GCNGGLMD AFQ+I  N G+D+E
Sbjct: 148 SFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTE 207

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
             YPY    +KC  +  N+    + GY D+   +E  LK AVA   PVSVAI+A  ++FQ
Sbjct: 208 VTYPYEAENDKCRYNAANSGARDV-GYVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQ 266

Query: 285 HYESGV-FTGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
            Y  GV +  EC S  LDHGV+AVGYGT ENG DYWLV+NSWG  WG+NGY+K+ RN L+
Sbjct: 267 FYSEGVYYEPECSSENLDHGVLAVGYGTDENGQDYWLVKNSWGETWGDNGYIKMARNKLN 326

Query: 342 TNTGKCGIAMEASYPVKNSQ 361
                CGIA  ASYP+  SQ
Sbjct: 327 ----HCGIASTASYPLVGSQ 342


>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
 gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
          Length = 349

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 132/314 (42%), Positives = 191/314 (60%), Gaps = 10/314 (3%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           +Q W A++ +T       ++RF ++ +N++FI+  N    +Y++G N+FADLT EE++  
Sbjct: 37  FQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSSYELGENRFADLTEEEFKDT 96

Query: 112 YLGTRSD--AKRRLMKSKVASQRYACKAG----DELPESVDWREKGAVNPVKDQGSCGSC 165
           YL    +  +    M   V +   A  +G    +E P SVDWR KGAV PVK Q  CGSC
Sbjct: 97  YLMKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVKSQQHCGSC 156

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLM-DYAFQFIIQNGGMDS 224
           WAF+ VA++EG++KI TG L+SLSEQE+VDCDR  N     G     A +++ +NGG+ +
Sbjct: 157 WAFAAVASIEGVHKIKTGLLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVTRNGGLTT 216

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           E DYPY+G + +C   +       I G + V   +E +L+ AVA +PV+V+I A  RAFQ
Sbjct: 217 ESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSINA-SRAFQ 275

Query: 285 HYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
            Y+ G+F+G C +  +H V  VGYG   +G  YW+V+NSWG  WGE GYV++QR  +   
Sbjct: 276 FYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRMQRG-VRAR 334

Query: 344 TGKCGIAMEASYPV 357
            G CGIA+   Y V
Sbjct: 335 EGVCGIAIAPFYAV 348


>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
          Length = 338

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 141/314 (44%), Positives = 189/314 (60%), Gaps = 15/314 (4%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           +  + A H K        + R +I+ +N   + +HN L     ++Y+V +NKF DL + E
Sbjct: 31  WHLFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHE 90

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           +R++  G +   K++      ++  +   A  E+PESVDWR KGA+ PVKDQG CGSCWA
Sbjct: 91  FRSIMNGYQH--KKQNSSRAESTFTFMEPANVEVPESVDWRVKGAITPVKDQGQCGSCWA 148

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS+  A+EG     TG+LISLSEQ L+DC  K  N GCNGGLMD AFQ+I  N G+D+E 
Sbjct: 149 FSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTEN 208

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQH 285
            YPY   +N C  + RN   +   G+  +   +E  LK AVA   PVSVAI+A   +FQ 
Sbjct: 209 TYPYEAEDNVCRYNPRNRGAID-RGFVHIPSGEEDKLKAAVATVGPVSVAIDASHESFQF 267

Query: 286 YESGV-FTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           Y  GV +   C S  LDHGV+ VGYG++NG DYWLV+NSW   WG+ GY+K+ RN     
Sbjct: 268 YSKGVYYEPSCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARN----R 323

Query: 344 TGKCGIAMEASYPV 357
              CGIA  ASYP+
Sbjct: 324 KNHCGIATAASYPL 337


>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
          Length = 337

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 143/327 (43%), Positives = 201/327 (61%), Gaps = 26/327 (7%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLNKF 100
           D ++   ++ W   HGK  +      +R  +++ NL+ I+    EH+    TY++G+N+F
Sbjct: 22  DKQLDNHWEQWKNWHGKKYHEKEEGWRRM-VWEKNLQKIELHNLEHSMGTHTYRLGMNRF 80

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
            D+T+EE+R +  G +   +RR   S      +      E+P S+DWREKG V PVKDQG
Sbjct: 81  GDMTHEEFRQVMNGYKHKKERRFRGSLFMEPNFL-----EVPNSLDWREKGYVTPVKDQG 135

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
            CGSCWAFST  A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I   
Sbjct: 136 ECGSCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDQ 195

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
            G+DSE+ YPY+G +++  P   + K  + +  G+ D+    E +L KA+A   PVSVAI
Sbjct: 196 NGLDSEESYPYVGTDDQ--PCHYDPKYSAANDTGFVDIPSGKEHALMKAIAAVGPVSVAI 253

Query: 277 EAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
           +AG  +FQ Y+SG+ +  EC S  LDHGV+AVGYG E    +G  YW+V+NSW  +WG+ 
Sbjct: 254 DAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSENWGDK 313

Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GYV + ++        CGIA  ASYP+
Sbjct: 314 GYVYMAKD----RHNHCGIATAASYPL 336


>gi|318816588|ref|NP_001187996.1| cathepsin L precursor [Ictalurus punctatus]
 gi|308324547|gb|ADO29408.1| cathepsin L [Ictalurus punctatus]
          Length = 334

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 142/317 (44%), Positives = 195/317 (61%), Gaps = 20/317 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
           + +W  K GK    +    +R   + +N + +  HN L     ++Y++G+  FAD+ N+E
Sbjct: 26  FHSWKLKFGKIYKSVEEESQRKNTWLENRKLVLVHNMLADQGIKSYRLGMTYFADMDNQE 85

Query: 108 YR-AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           YR +++ G      R   K   AS       G  LP++VDWR+KG V  VKDQ +CGSCW
Sbjct: 86  YRQSVFKGCLGSFNR--TKGHRASTFLLQAGGAVLPDTVDWRDKGYVAEVKDQKNCGSCW 143

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
           AFS   ++EG     TG+L+SLSEQ+LVDC  K  N GC GGLMD AF++I  N G+D+E
Sbjct: 144 AFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGKYGNMGCGGGLMDLAFEYIEDNKGIDTE 203

Query: 226 QDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRA 282
           + YPY   +  C   P+   A   +  GY D++  DE +L+KAVA+  P+SVAI+AG  +
Sbjct: 204 ESYPYEATDGDCRFKPATVGA---TCTGYVDINSEDENALQKAVANIGPISVAIDAGHIS 260

Query: 283 FQHYESGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
           FQ Y SG++    C S  LDHGV+AVGYGT+N  DYWLV+NSWG DWG+ GY+K+ RN  
Sbjct: 261 FQLYGSGIYNEPNCSSEDLDHGVLAVGYGTDNQQDYWLVKNSWGLDWGDQGYIKMTRN-- 318

Query: 341 DTNTGKCGIAMEASYPV 357
                +CGIA  ASYP+
Sbjct: 319 --KNNQCGIATAASYPL 333


>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
          Length = 351

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 134/295 (45%), Positives = 189/295 (64%), Gaps = 14/295 (4%)

Query: 71  KRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
           +R ++F++NL+ I+ HN L+     +Y++G+N+FAD+  +E+ ++  G R + + ++ + 
Sbjct: 62  QRKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEVKEFASVVNGFRMNNRTKV-RD 120

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
            + S   +      LP  VDWR++G V P+KDQG CGSCW+FST  A+EG +   TG+L+
Sbjct: 121 HLHSHYISPAIPVSLPAEVDWRKEGYVTPIKDQGHCGSCWSFSTTGALEGQHFRKTGKLV 180

Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           SLSEQ L+DC     N GCNGG+MDYAFQ+I  N G D+E  YPY  A+  C   +    
Sbjct: 181 SLSEQNLIDCSTSYGNNGCNGGVMDYAFQYIKDNDGDDTEDSYPYEAADGPCRFKKEYVG 240

Query: 246 VVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTG-EC-GSALDHG 302
                GY D+   DE  +K+AVA   PVSVAI+A   +FQ Y+SGV+   EC    LDHG
Sbjct: 241 ATDT-GYTDLPKGDEEKMKEAVAMVGPVSVAIDASHTSFQMYQSGVYDEVECDPEGLDHG 299

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           V+ VGYGTE G DYWLV+NSWG+ WG+ GY+K+ RN       +CGI+  ASYP+
Sbjct: 300 VLVVGYGTELGQDYWLVKNSWGTKWGDEGYIKMSRN----KNNQCGISSMASYPL 350


>gi|229366214|gb|ACQ58087.1| Cathepsin L precursor [Anoplopoma fimbria]
          Length = 334

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 144/315 (45%), Positives = 191/315 (60%), Gaps = 16/315 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
           +  W  + G++ N      +R +I+  N R +  HN +     ++Y++G+  FAD+ NEE
Sbjct: 26  FHAWKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEE 85

Query: 108 Y-RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           Y R +  G        L +   A  R     G +LP SVDWREKG V  VKDQ  CGSCW
Sbjct: 86  YKRQISQGCLGSFNASLPRRGSAYLRLP--EGADLPNSVDWREKGYVTDVKDQKQCGSCW 143

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
           AFST  ++EG     TG+L+SLSEQ+LVDC     N GC GGLMD AF++I  NGG+D+E
Sbjct: 144 AFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDTE 203

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
             YPY   + +C  +  N    +  GY DV   DE +LK+A+A   PVSVAI+A   +FQ
Sbjct: 204 DSYPYEAEDGQCRYNSANIG-ATCTGYVDVKQGDEDALKEALATIGPVSVAIDASHSSFQ 262

Query: 285 HYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
            YESGV+   EC S+ LDHGV+AVGYG++NG DYWLV+NSWG  WG  GY+ + RN    
Sbjct: 263 LYESGVYDEPECSSSELDHGVLAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRN---- 318

Query: 343 NTGKCGIAMEASYPV 357
              +CGIA  +SYP+
Sbjct: 319 KHNQCGIATASSYPL 333


>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
          Length = 353

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 143/327 (43%), Positives = 201/327 (61%), Gaps = 21/327 (6%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLN 98
           R D ++ + +Q W + H K  +    + +R  +++ NL+ I+ HN   SL + +YK+G+N
Sbjct: 35  RVDPDLDSHWQLWKSWHSKDYHEREESWRRV-VWEKNLKMIELHNLDHSLGKHSYKLGMN 93

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           +F D+T EE+R +  G +     R    K    ++   +  E P SVDWREKG V PVKD
Sbjct: 94  QFGDMTAEEFRQLMNGYKHKKSER----KYRGSQFLEPSFLEAPRSVDWREKGYVTPVKD 149

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFII 217
           QG CGSCWAFST  A+EG +   TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ++ 
Sbjct: 150 QGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQ 209

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAI 276
            NGG+DSE+ YPY   +++    +      +  G+ D+    E +L KAVA   PVSVAI
Sbjct: 210 DNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAI 269

Query: 277 EAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
           +AG  +FQ Y+SG+ +  +C S  LDHGV+ VGYG E    +G  YW+V+NSWG  WG+ 
Sbjct: 270 DAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDK 329

Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GY+ + ++        CGIA  ASYP+
Sbjct: 330 GYIYMAKD----RKNHCGIATAASYPL 352


>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
          Length = 354

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 156/370 (42%), Positives = 216/370 (58%), Gaps = 35/370 (9%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHS-------SSWRTDDEVMTIYQTWLAK 58
           MF  +S LV L       A+  + I   + HDH+       S  +  DE   ++  +   
Sbjct: 1   MFRLLS-LVLL------CASVFASIDSGSRHDHTIRLHRVKSLRQKIDEAFKLWDDYKES 53

Query: 59  HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLG 114
            GK+ N    N+   + F  N+  IDEHN  +R    T+++GLN  ADL   +YR +  G
Sbjct: 54  FGKSYNKDEEND-YMEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKLN-G 111

Query: 115 TRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
            R    RR     + S   ++      E+P+SVDWR+KG V  VK+QG CGSCWAFS   
Sbjct: 112 YR---HRRNFGDSMQSNGTKWLAPFNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATG 168

Query: 173 AVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
           A+EG +   +G+++SLSEQ LVDC  K  N GCNGGLMD AF++I  N G+D+E+ YPY+
Sbjct: 169 ALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYV 228

Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGV 290
           G E KC   +++       G+ D+   DE +LK AVA Q P+S+AI+AG R FQ Y+ GV
Sbjct: 229 GRETKCHFKKKDIGAED-KGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGV 287

Query: 291 FTG-ECGS-ALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           +   EC S  LDHGV+ VGYGT+    DYWL++NSWG  WGE GY+++ RN     +  C
Sbjct: 288 YYDEECSSEELDHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEKGYIRIARN----RSNHC 343

Query: 348 GIAMEASYPV 357
           G+A +ASYP+
Sbjct: 344 GVATKASYPL 353


>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
 gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
          Length = 340

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 146/324 (45%), Positives = 197/324 (60%), Gaps = 18/324 (5%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
           D +   +QT+  +H K          R +IF +N   I +HN L      ++K+GLNK+A
Sbjct: 22  DVIKEEWQTFKLEHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEVSFKMGLNKYA 81

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKS--KVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           D+ + E+     G      ++L  S        +      +LP+SVDWR KGAV  VKDQ
Sbjct: 82  DMLHHEFHETMNGFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQ 141

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
           G CGSCWAFS+  A+EG +   TG LISLSEQ LVDC  K  N GCNGGLMD AF++I  
Sbjct: 142 GHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 201

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
           NGG+D+E+ YPY G ++ C  ++    + + D G+ D+   DE  L +AVA   PVSVAI
Sbjct: 202 NGGIDTEKSYPYEGIDDSCHFNK--GTIGATDRGFTDIPQGDEKKLAQAVATIGPVSVAI 259

Query: 277 EAGGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYV 333
           +A   +FQ Y +GV+   +C    LDHGV+ VGYGT ENG DYWLV+NSWG+ WG+ G++
Sbjct: 260 DASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTDENGKDYWLVKNSWGTTWGDKGFI 319

Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
           K+ RN  D N  +CGIA  +SYP+
Sbjct: 320 KMARN--DDN--QCGIATASSYPL 339


>gi|348542776|ref|XP_003458860.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 334

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 148/320 (46%), Positives = 192/320 (60%), Gaps = 26/320 (8%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
           +  W  K GK+ +       R QI+  N + +  HN L     ++Y++G+  FAD+ NEE
Sbjct: 26  FHAWRLKFGKSYDSPSEESHRKQIWLTNRKHVLMHNILADQGFKSYRLGMTYFADMENEE 85

Query: 108 YRAMY----LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           Y+ +     LG+ + +  R       S       G +LP++VDWRE+G V  VKDQ  CG
Sbjct: 86  YKKLVSRGCLGSFNASLPRR-----GSTFLRLPEGIDLPDAVDWREQGYVTGVKDQKQCG 140

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
           SCWAFS   A+EG +   TG L+SLSEQ+LVDC     N GCNGG MD AF++I  NGG+
Sbjct: 141 SCWAFSATGALEGQHFRKTGILVSLSEQQLVDCSGAYGNEGCNGGWMDSAFRYIEANGGI 200

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
           D+E  YPY   +  C   R N   V  +  GY DV+ +DE +LK+AVA   PVSVAI+A 
Sbjct: 201 DTEASYPYEAEDWLC---RYNPASVGATCSGYVDVNKYDEEALKEAVATIGPVSVAIDAS 257

Query: 280 GRAFQHYESGVF--TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
             +FQ Y SGV+   G     LDHGV+AVGYGTENG DYWLV+NSWG  WGE GY+K+ R
Sbjct: 258 HASFQFYTSGVYDEPGCSSIELDHGVLAVGYGTENGHDYWLVKNSWGRGWGEMGYIKMSR 317

Query: 338 NLLDTNTGKCGIAMEASYPV 357
           N       +CGIA  ASYP+
Sbjct: 318 N----KHNQCGIASAASYPL 333


>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
          Length = 337

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 143/296 (48%), Positives = 191/296 (64%), Gaps = 16/296 (5%)

Query: 72  RFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLG-TRSDAKRRLMKS 126
           R +IF +N   + +HN L      ++K+G+NK+AD+ + E+  +  G  R+ +  R  +S
Sbjct: 47  RMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGES 106

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
              S  +   A  +LP  +DWR+KGAV PVKDQG CGSCW+FS   ++EG +   +G+L+
Sbjct: 107 D-DSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLV 165

Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           SLSEQ LVDC  K  N GCNGGLMD AF++I  NGG+D+EQ YPY   + KC    +N K
Sbjct: 166 SLSEQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKN-K 224

Query: 246 VVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECG-SALDHG 302
             +  GY D+   +E  L+ AVA   PVSVAI+A  ++FQ Y  GV +  EC  S LDHG
Sbjct: 225 GATDRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHG 284

Query: 303 VVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           V+ VGYGTE +G DYWLV+NSWG  WG+ GY+K+ RN  D N   CGIA EASYP+
Sbjct: 285 VLVVGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARN-RDNN---CGIATEASYPL 336


>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
          Length = 334

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 140/314 (44%), Positives = 189/314 (60%), Gaps = 15/314 (4%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           +  + A H K        + R +I+ +N   + +HN L     ++Y V +NKF DL + E
Sbjct: 27  WHLFKATHKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYHVAMNKFGDLLHHE 86

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           +R++  G +   K++      ++  +   A   +PESVDWREKGA+ PVKDQG CGSCWA
Sbjct: 87  FRSIMNGYQH--KKQNSSRAESTFTFMEPANVTVPESVDWREKGAITPVKDQGQCGSCWA 144

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS+  A+EG     TG+L+SLSEQ L+DC  K  N GCNGGLMD AFQ+I  N G+D+E 
Sbjct: 145 FSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTEN 204

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
            YPY   ++ C  + RN   V   G+ D+   +E  LK AVA   PVSVAI+A   +FQ 
Sbjct: 205 TYPYEAEDDVCRYNPRNRGAVD-RGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQF 263

Query: 286 YESGV-FTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           Y  GV +   C S  LDHGV+ VGYG++NG DYWLV+NSW   WG+ GY+K+ RN     
Sbjct: 264 YSKGVYYEPSCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKMARN----R 319

Query: 344 TGKCGIAMEASYPV 357
              CG+A  ASYP+
Sbjct: 320 KNHCGVASAASYPL 333


>gi|3097321|dbj|BAA25899.1| Bd 30K [Glycine max]
 gi|84371705|gb|ABC56139.1| 34 kDa maturing seed protein [Glycine max]
 gi|195957142|gb|ACG59282.1| major allergen Gly m Bd 30K [Glycine max]
 gi|223452512|gb|ACM89583.1| maturing seed protein [Glycine max]
 gi|226432468|gb|ACO55749.1| Gly m Bd 30K allergen [Glycine max]
 gi|320090153|gb|ADW08728.1| P34 allergen [Glycine max]
 gi|320090155|gb|ADW08729.1| P34 allergen [Glycine max]
 gi|320090157|gb|ADW08730.1| P34 allergen [Glycine max]
 gi|320090159|gb|ADW08731.1| P34 allergen [Glycine max]
          Length = 379

 Score =  250 bits (639), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 138/345 (40%), Positives = 210/345 (60%), Gaps = 28/345 (8%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNK 99
           T  +V +++Q W ++HG+  +      KR +IFK+NL +I + N+ NR    ++++GLNK
Sbjct: 36  TQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNLNYIRDMNA-NRKSPHSHRLGLNK 94

Query: 100 FADLTNEEYRAMYLGTRSDAKR--RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           FAD+T +E+   YL    D  +  ++   K+  ++Y+C   D  P S DWR+KG +  VK
Sbjct: 95  FADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSC---DHPPASWDWRKKGVITQVK 151

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
            QG CGS WAFS   A+E  + I TG+L+SLSEQELVDC  + + GC  G    +F++++
Sbjct: 152 YQGGCGSGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGCYNGWHYQSFEWVL 210

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD-------EMSLKKAVADQ 270
           ++GG+ ++ DYPY   E +C  ++   K V+IDGYE +   D       E +   A+ +Q
Sbjct: 211 EHGGIATDDDYPYRAKEGRCKANKIQDK-VTIDGYETLIMSDESTESETEQAFLSAILEQ 269

Query: 271 PVSVAIEAGGRAFQHYESGVFTGE-CGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDW 327
           P+SV+I+A  + F  Y  G++ GE C S   ++H V+ VGYG+ +GVDYW+ +NSWG DW
Sbjct: 270 PISVSIDA--KDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYWIAKNSWGEDW 327

Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN---SAKPKPH 369
           GE+GY+ +QRN  +   G CG+   ASYP K       SA+ K H
Sbjct: 328 GEDGYIWIQRNTGNL-LGVCGMNYFASYPTKEESETLVSARVKGH 371


>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 340

 Score =  250 bits (639), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 139/330 (42%), Positives = 202/330 (61%), Gaps = 19/330 (5%)

Query: 40  SSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SL-NRTYKV 95
           S+ + + + ++++QTW     K    +   E++   + +N   I EHN   SL  ++Y++
Sbjct: 17  SAMQLNQQHVSLFQTWKNLWKKVYQTVEEEEQKMATWFNNWNKISEHNMQYSLKQKSYRL 76

Query: 96  GLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY----ACKAGDELPESVDWREKG 151
            +N++ DLT+EE+ +M  G R+D   RL +       Y    +  +  +LP  VDWR+ G
Sbjct: 77  EMNEYGDLTSEEFSSMMNGYRNDI--RLKRKSTGGSTYLNLLSFGSQIQLPTLVDWRKHG 134

Query: 152 AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMD 210
            V PVK+QG CGSCW+FS   ++EG +K  TG+L+SLSEQ L+DC   + N GCNGGLMD
Sbjct: 135 LVTPVKNQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLIDCSTPEGNDGCNGGLMD 194

Query: 211 YAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD- 269
            AF++I   GG+D+E  YPY   ++ C  +  ++      G+ D+   DE  LK+A A  
Sbjct: 195 QAFKYIKIQGGIDTEAYYPYEAKDDTCRFNITDSGATDT-GFVDIKSGDEEMLKEAAATV 253

Query: 270 QPVSVAIEAGGRAFQHYESGVF--TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDW 327
            P+SVAI+A   +FQ Y +GV+  T    + LDHGV+ VGYGTENG DYWLV+NSWG  W
Sbjct: 254 GPISVAIDASHTSFQFYSNGVYSETACSSTMLDHGVLVVGYGTENGKDYWLVKNSWGEGW 313

Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GE GY+K+ RN       +CGIA +ASYP+
Sbjct: 314 GEAGYIKMSRNA----DNQCGIATQASYPL 339


>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 327

 Score =  250 bits (639), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 132/311 (42%), Positives = 199/311 (63%), Gaps = 18/311 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++ W  K+GKT   +  +  R +I+  N  +++EHNS++ ++++ +N+FADLT EE+ ++
Sbjct: 29  WRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYVNEHNSMDSSFQLEVNEFADLTAEEFSSI 88

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
           Y G     + R         RY    G  +P+SVDWR KG V PVK+Q  CGSCWAFST 
Sbjct: 89  YNG-YGKGRNRENHENTTIYRYT---GGAIPDSVDWRTKGLVTPVKNQKQCGSCWAFSTT 144

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
            ++EG +   TG+L+SLSEQ LVDCD+K + GC GGLM  AF++I +N G+D+E+ YPY 
Sbjct: 145 GSLEGAHAKKTGKLVSLSEQNLVDCDKK-DHGCQGGLMTTAFKYIEENKGIDTEESYPYK 203

Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV 290
               +C+  +++    +++ +  +   D  +LKKAVA+  P+SVA++A   +FQ Y+SG+
Sbjct: 204 AKNGRCE-FKKDDIGATVERHVSILTTDCEALKKAVAEIGPISVAMDASHSSFQLYKSGI 262

Query: 291 FTGECGSA--LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL--QRNLLDTNTGK 346
           +  +  S+  LDHGV+ VGYG E+G +YWLV+NSWG +WG  GY K+  ++NL       
Sbjct: 263 YDPKICSSRKLDHGVLVVGYGKEDGEEYWLVKNSWGKNWGMEGYFKIASKKNL------- 315

Query: 347 CGIAMEASYPV 357
           CGI   A YPV
Sbjct: 316 CGICTSACYPV 326


>gi|326503122|dbj|BAJ99186.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326512552|dbj|BAJ99631.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 389

 Score =  250 bits (639), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 132/338 (39%), Positives = 193/338 (57%), Gaps = 24/338 (7%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLNKFA 101
           D +M  +  W+    ++         RF++++ N+R+I+    E  +   TY++G   F 
Sbjct: 54  DLMMARFHVWMTVQNRSYPTSSEKAHRFKVYRSNMRYIEALNAEATTSGFTYELGEGPFT 113

Query: 102 DLTNEEYRAMYLGTRSDAKRR---LMKSKVASQRYACKAGDE-----------LPESVDW 147
           DLT+EE+ ++Y G   D   R   +   ++ +       G E            P  +DW
Sbjct: 114 DLTDEEFISLYTGKIPDDDHREDGVHDEQIITTHAGSVNGAEGVTVYANFSAGAPIRMDW 173

Query: 148 REKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGG 207
           R++GAV PVKDQG CGSCWAF TVA +EGI+KI  G L+SLSEQ+LVDCD  ++ GCNGG
Sbjct: 174 RKRGAVTPVKDQGKCGSCWAFPTVATIEGIHKIKRGRLVSLSEQQLVDCDF-LDGGCNGG 232

Query: 208 LMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAV 267
               AFQ+IIQNGG+ +   Y Y  AE +C  +R+ A    I GY  V    E+S+   V
Sbjct: 233 WPRNAFQWIIQNGGITTTSSYTYKAAEGQCKGNRKPA--AKITGYRKVKSNSEVSMVNIV 290

Query: 268 ADQPVSVAIEAGGRAFQHYESGVFTGECG-SALDHGVVAVGYGTEN-GVDYWLVRNSWGS 325
           A+QP++ +I   G  FQHY+ G++ G C  S L+H +  VGYG +  G  YW+V+NSWG+
Sbjct: 291 ANQPIAASIVVHGGQFQHYKGGIYNGPCATSKLNHVITIVGYGQQAYGAKYWIVKNSWGA 350

Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNS 363
            WG  GY+ ++R   +   G+CGIA+   +P+ N   S
Sbjct: 351 AWGNKGYMLMKRGTKNP-LGQCGIAVRPIFPLMNGGRS 387


>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
          Length = 351

 Score =  250 bits (639), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 147/356 (41%), Positives = 211/356 (59%), Gaps = 19/356 (5%)

Query: 12  TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEK 71
           TL+F+       + D+   S+ +N   +     D EV   +  +  +H K   G+     
Sbjct: 3   TLIFVTLFCCVLSKDLHWESHRDNLYSNFQEVLDAEVA--WHKFKLEHNKVYVGIEEESL 60

Query: 72  RFQIFKDNLRFIDEHNSLNRT----YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           R  IF  N +FI +HN+L+ T    + VG+N+FAD+T  E+  M  G + D+ R      
Sbjct: 61  RKTIFATNYKFIKDHNALHATGEKSFTVGVNEFADMTVHEFAQMMNGLKPDSTR------ 114

Query: 128 VASQRYACKAGD-ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           V+   Y     D  LP  VDWR KG V+ VK+QGSCGSCWAFST  ++EG +   TG ++
Sbjct: 115 VSGSTYLSPNIDAPLPVEVDWRTKGLVSEVKNQGSCGSCWAFSTTGSLEGQHMRKTGTMV 174

Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
            LSEQ LVDC     N GCNGGLM  AF++I  N G+D+E+ YPY G +  C   ++N  
Sbjct: 175 DLSEQNLVDCSTSYGNDGCNGGLMTNAFKYIKDNKGIDTEEAYPYAGRDGDCK-FKKNKV 233

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVF-TGECGSA-LDHG 302
             ++ G+ ++   +E  L++A+A   PVSVAI+A  ++F  Y+SGV+   EC SA LDHG
Sbjct: 234 GATVTGFVEIPAGNEKKLQEALATVGPVSVAIDANHQSFMLYKSGVYDEPECDSAQLDHG 293

Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL-DTNTGKCGIAMEASYPV 357
           V+AVGYG+ +G DY++V+NSWG+ WGE GY++     + D   G CGI ++ASYPV
Sbjct: 294 VLAVGYGSIHGKDYYIVKNSWGTTWGEQGYIRFSTTAVPDAIGGICGILLDASYPV 349


>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
          Length = 443

 Score =  250 bits (639), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 143/329 (43%), Positives = 201/329 (61%), Gaps = 23/329 (6%)

Query: 42  WRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGL 97
           W+ D E+   +Q W + H K  +      +R  +++ NL+ I+ HN   +L + +YK+G+
Sbjct: 124 WQVDPELDGHWQLWKSWHRKDYHEREEGWRRV-VWEKNLKMIEIHNLDHALGKHSYKLGM 182

Query: 98  NKFADLTNEEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
           N+F D+T EE+R +  G     ++R+   S+     +      E P SVDWREKG V PV
Sbjct: 183 NQFGDMTTEEFRQLMNGYVHKKSERKYRGSQFLEPNFL-----EAPRSVDWREKGYVTPV 237

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
           KDQG CGSCWAFST  A+EG +   TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+
Sbjct: 238 KDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQY 297

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
           +  NGG+DSE+ YPY   +++    +      +  G+ D+    E +L KAVA   PVSV
Sbjct: 298 VQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSV 357

Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
           AI+AG  +FQ Y+SG+ +  +C S  LDHGV+ VGYG E    +G  YW+V+NSWG  WG
Sbjct: 358 AIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWG 417

Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + GY+ + ++        CGIA  ASYP+
Sbjct: 418 DKGYIYMAKD----RKNHCGIATAASYPL 442


>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
          Length = 341

 Score =  250 bits (638), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 141/321 (43%), Positives = 193/321 (60%), Gaps = 16/321 (4%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLNKFADL 103
           V+  ++ +  +H K  +       R +IF +N   I  HN      + TYK+ +NK+ D+
Sbjct: 25  VLEEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDM 84

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVNPVKDQGS 161
            + E+ +   G R +       ++  +     +  D  +LP++VDWR KGAV P+KDQG 
Sbjct: 85  LHHEFVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTPIKDQGQ 144

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNG 220
           CGSCWAFS   A+EG     TG+L+SLSEQ LVDC RK  N GCNGGLMD AF+++ +NG
Sbjct: 145 CGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKENG 204

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
           G+D+E+ YPY   + KC  + R A      G+ DV    E +LKKAVA   PVSVAI+A 
Sbjct: 205 GIDTEESYPYDAEDEKCHYNPRAAGAED-KGFVDVREGSEHALKKAVATVGPVSVAIDAS 263

Query: 280 GRAFQHYESGVFT-GECG-SALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQ 336
             +FQ Y  GV+   EC    LDHGV+ VGYG  ++G DYWLV+NSWG+ WG+ GYVK+ 
Sbjct: 264 HESFQFYSHGVYIEPECSPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWGDQGYVKMA 323

Query: 337 RNLLDTNTGKCGIAMEASYPV 357
           RN       +CGIA  AS+P+
Sbjct: 324 RN----RDNQCGIASSASFPL 340


>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
          Length = 324

 Score =  250 bits (638), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 142/309 (45%), Positives = 192/309 (62%), Gaps = 20/309 (6%)

Query: 57  AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMY 112
           AKH KT +G     +R+ I++ NL+ I+ HN L      TY +G NK+AD+TNEE+R   
Sbjct: 27  AKHNKTYSGDEDIIRRY-IWQTNLQKIEAHNELYAKGLSTYFLGENKYADMTNEEFRRTL 85

Query: 113 LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
            G R D  + L      S  +     D LP +VDWR++G V  VKDQG CGSCWAFST  
Sbjct: 86  SGLRVD--KELTPGDFVSGMFK----DSLPTAVDWRKEGYVTEVKDQGQCGSCWAFSTTG 139

Query: 173 AVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
           ++EG +   T +L+SLSE  LVDC +K  N GCNGGLMD AF++I  N G+D+E+ YPY 
Sbjct: 140 SLEGQHFKATKQLVSLSESNLVDCSKKWGNQGCNGGLMDNAFKYIADNKGIDTEKSYPYK 199

Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV 290
             + KC+  + N        Y+D++   E +L++AVA   P+SVAI+A   +FQ Y  GV
Sbjct: 200 PEDRKCNFKKANVGATD-KLYKDITSGSEDALQEAVATIGPISVAIDASHDSFQLYSGGV 258

Query: 291 FTGECGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
           +  +  S   LDHGV+AVGY ++NG DYW+V+NSWG  WG +GY+ + RN       +CG
Sbjct: 259 YNEKACSTKTLDHGVLAVGYDSKNGDDYWIVKNSWGKSWGIDGYIWMSRN----KKNQCG 314

Query: 349 IAMEASYPV 357
           IA  ASYPV
Sbjct: 315 IATMASYPV 323


>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
          Length = 337

 Score =  250 bits (638), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 142/296 (47%), Positives = 191/296 (64%), Gaps = 16/296 (5%)

Query: 72  RFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLG-TRSDAKRRLMKS 126
           R +IF +N   + +HN L      ++K+G+NK+AD+ + E+  +  G  R+ +  R  +S
Sbjct: 47  RMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGES 106

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
              S  +   A  +LP  +DWR+KGAV PVKDQG CGSCW+FS   ++EG +   +G+L+
Sbjct: 107 D-DSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLV 165

Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           SLSEQ LVDC  K  N GCNGGLMD AF++I  NGG+D+EQ YPY   + KC    +N K
Sbjct: 166 SLSEQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKN-K 224

Query: 246 VVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGEC-GSALDHG 302
             +  GY D+   +E  L+ AVA   PVSVAI+A  ++FQ Y  GV +  +C  S LDHG
Sbjct: 225 GATDRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHG 284

Query: 303 VVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           V+ VGYGTE +G DYWLV+NSWG  WG+ GY+K+ RN  D N   CGIA EASYP+
Sbjct: 285 VLVVGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARN-RDNN---CGIATEASYPL 336


>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
          Length = 354

 Score =  250 bits (638), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 144/323 (44%), Positives = 198/323 (61%), Gaps = 21/323 (6%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFA 101
           DE   ++  +    GK+ N    N+   + F  N+  IDEHN  +R    T+++GLN  A
Sbjct: 41  DEAFKLWDDYKEAFGKSYNKDEEND-YMEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIA 99

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQ 159
           DL   +YR +  G R    RR     + S   ++      E+P+SVDWR+KG V  VK+Q
Sbjct: 100 DLPFSQYRKLN-GYRH---RRNFGDSMQSNGTKWLAPFNVEIPDSVDWRDKGLVTDVKNQ 155

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
           G CGSCWAFS   A+EG +   +G+++SLSEQ LVDC  K  N GCNGGLMD AF++I  
Sbjct: 156 GMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKD 215

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIE 277
           N G+D+E+ YPY+G E KC   +++       G+ D+   DE +LK AVA Q P+S+AI+
Sbjct: 216 NHGIDTEESYPYVGRETKCHFKKKDIGAED-KGFVDLPEGDEEALKVAVATQGPISIAID 274

Query: 278 AGGRAFQHYESGVFTG-ECGS-ALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVK 334
           AG R FQ Y+ GV+   EC S  LDHGV+ VGYGT+    DYWL++NSWG  WGE GY++
Sbjct: 275 AGHRTFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEKGYIR 334

Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
           + RN     +  CG+A +ASYP+
Sbjct: 335 IARN----RSNHCGVATKASYPL 353


>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
          Length = 337

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 149/353 (42%), Positives = 207/353 (58%), Gaps = 29/353 (8%)

Query: 15  FLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQ 74
           FL F++   A   ++  +D   +   +++     MT        H K          R +
Sbjct: 3   FLIFLAICVAGSQAVSFFDLVQEQWGAFK-----MT--------HNKQYQSETEERFRMK 49

Query: 75  IFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLG-TRSDAKRRLMKSKVA 129
           IF +N   + +HN L      ++K+G+NK+AD+ + E+  +  G  R+ +  R  +S   
Sbjct: 50  IFMENSHTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESD-D 108

Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
           S  +   A  +LP  +DWR+KGAV PVKDQG CGSCW+FS   ++EG +   +G+L+SLS
Sbjct: 109 SVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLS 168

Query: 190 EQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           EQ LVDC  K  N GCNGGLMD AF++I  NGG+D+EQ YPY   + KC    +N K  +
Sbjct: 169 EQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKN-KGAT 227

Query: 249 IDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGEC-GSALDHGVVA 305
             GY D+   +E  L+ AVA   PVSVAI+A  ++FQ Y  GV +  +C  S LDHGV+ 
Sbjct: 228 DRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLV 287

Query: 306 VGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           VGYGTE +G DYWLV+NSWG  WG+ GY+K+ RN        CGIA EASYP+
Sbjct: 288 VGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARN----RNNNCGIATEASYPL 336


>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
 gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
          Length = 345

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 148/328 (45%), Positives = 197/328 (60%), Gaps = 22/328 (6%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT----YKVGLNKFA 101
           D VM  +Q + A+H K  N     + R +IF DN + I +HN+  +     YK+GLNK++
Sbjct: 21  DLVMEEWQLFKAEHKKNYNNDVEEKFRMKIFMDNKQKITKHNTKYQRGEVGYKLGLNKYS 80

Query: 102 DLTNEEYRAMYLG-TRSDAKRRLM----KSKVASQRYACKAGDELPESVDWREKGAVNPV 156
           D+ + E+   + G  +S     L     K+ +    +   A  +LP+ VDW + GAV PV
Sbjct: 81  DMLHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFFIPPANVKLPKHVDWVKLGAVTPV 140

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQF 215
           KDQG CGSCWAFS   A+EG++   T  L+SLSEQ L+DC   + N GCNGGLMD AFQ+
Sbjct: 141 KDQGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCSTEEGNNGCNGGLMDQAFQY 200

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSV 274
           +  NGG+D+E+ YPY G  + C     N+  +   GY DV   DE +LK AVA   PVSV
Sbjct: 201 VRINGGIDTERSYPYEGNNDVCRYEPENSGAIDT-GYTDVPLGDEDALKSAVATVGPVSV 259

Query: 275 AIEAGGRAFQHYESGV-FTGECGS---ALDHGVVAVGYGT--ENGVDYWLVRNSWGSDWG 328
           AI+A   +FQ Y SGV F   C +   +LDHGV+ VGYGT  E   DYWLV+NSWG  WG
Sbjct: 260 AIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGTDEETQQDYWLVKNSWGDSWG 319

Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           ENGY+K+ RN       +CGIA + S+P
Sbjct: 320 ENGYIKMARNA----DNQCGIATQPSFP 343


>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
          Length = 334

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 143/324 (44%), Positives = 192/324 (59%), Gaps = 25/324 (7%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
           D V++ +++W   H K  +     + R +IF +N   I  HN+       TY + +N + 
Sbjct: 23  DVVLSDWESWKLTHQKGYDSSVEEKLRLKIFMENSLRISRHNAEAIQGRHTYFMKMNHYG 82

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           DL + E+ AM  G   + K  L  + + S+         LPE VDWRE+GAV PVK+QG 
Sbjct: 83  DLLHHEFVAMVNGYIYNNKTTLGGTFIPSKNI------NLPEHVDWREEGAVTPVKNQGQ 136

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNG 220
           CGSCW+FS   ++EG +   TG+LISLSEQ LVDC RK  N GC GGLMDYAF++I  N 
Sbjct: 137 CGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRKYGNNGCEGGLMDYAFKYIQDNN 196

Query: 221 GMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIE 277
           G+D+E  YPY G +  C  DP  +    +   G+ D+    E  L+KA+A   P+SVAI+
Sbjct: 197 GIDTEASYPYEGIDGHCHYDPKNKGGSDI---GFVDIKKGSEKDLQKALATVGPISVAID 253

Query: 278 AGGRAFQHYESGVFTGECGSA--LDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYV 333
           A   +FQ Y  GV++ +  S   LDHGV+AVGYGT+   G DYWLV+NSW   WGE+GY+
Sbjct: 254 ASHMSFQFYSHGVYSEKKCSPENLDHGVLAVGYGTDEVTGEDYWLVKNSWSEKWGEDGYI 313

Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
           K+ RN        CGIA  ASYPV
Sbjct: 314 KMARN----KDNMCGIASSASYPV 333


>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
          Length = 357

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 139/352 (39%), Positives = 206/352 (58%), Gaps = 22/352 (6%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           LVFLF       A  S  S D            D +M  ++ W+A++G+         +R
Sbjct: 7   LVFLFLFLCVMWASPSAASRD---------EPSDPMMKRFEEWMAEYGRVYKDNDEKMRR 57

Query: 73  FQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
           FQIFK+N+  I+  NS N  +Y +G+N+F D+TN E+ A Y G        + +  V S 
Sbjct: 58  FQIFKNNVNHIETFNSRNGNSYTLGINQFTDMTNNEFVAQYTGV--SLPLNIEREPVVS- 114

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
            +       +P+S+DWR  GAV  VK+   CGSCWAF+ +A VE I KI  G LISLSEQ
Sbjct: 115 -FDDVDISAVPQSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQ 173

Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS--I 249
           +++DC   ++ GC+GG ++ A+ FII N G+ S   YPY  ++ +    R N    S  I
Sbjct: 174 QVLDC--AVSYGCDGGWVNKAYDFIISNKGVASAAIYPYKASQGQ-GTCRINGVPNSAYI 230

Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
            GY  V   +E S+  AV++QP++ +IEA G  FQHY+ GVF+G CG++L+H +  +GYG
Sbjct: 231 TGYTRVQSNNERSMMYAVSNQPIAASIEASGD-FQHYKRGVFSGPCGTSLNHAITIIGYG 289

Query: 310 TE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
            + +G  +W+VRNSWG+ WGE GY+++ R+ + +++G CGIA+   YP   S
Sbjct: 290 QDSSGKKFWIVRNSWGASWGERGYIRMARD-VSSSSGLCGIAIRPLYPTLQS 340


>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 360

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 145/322 (45%), Positives = 191/322 (59%), Gaps = 20/322 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--------TYKVGLNKFADL 103
           +++W+A+HG+T        +R +IF+ N   ID  NS           ++++  N+FADL
Sbjct: 43  HESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATNRFADL 102

Query: 104 TNEEYRAMYLGTR-SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           T+EE+RA   G R   A    +      + ++ +A  +   S+DWR  GAV  VKDQGSC
Sbjct: 103 TDEEFRAARTGLRRPAAVAGAVGGGFRYENFSLQA--DAAGSMDWRAMGAVTGVKDQGSC 160

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGG 221
           G CWAFS VAA+EG+ KI TG L+SLSEQ+LVDCD    + GC GGLMD AFQ+I + GG
Sbjct: 161 GCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYISRQGG 220

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + SE  YPY G +     S R     SI G+EDV   +E +L  AVA QPVSVAI  G  
Sbjct: 221 LASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVSVAINGGDY 280

Query: 282 AFQHYE----SGVFTGECGSA-LDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKL 335
            F+ Y+         G C S  LDH + AVGYG   +G  YWL++NSWGS WGE+GYV++
Sbjct: 281 VFRFYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTGYWLMKNSWGSGWGESGYVRI 340

Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
           +R       G CG+A  ASYPV
Sbjct: 341 RRG--SRGEGVCGLAKLASYPV 360


>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 333

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 138/322 (42%), Positives = 200/322 (62%), Gaps = 22/322 (6%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHN----SLNRTYKVGLN 98
           D  + + +  W A+H +T      NE   R   ++ NL+ I+ HN    +   ++++G+N
Sbjct: 22  DQTLDSQWHQWKAQHRRT---YAANEDGWRRATWEKNLKMIEMHNLEYSAGKHSFQLGMN 78

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
           KF D+T EE++ +  G  S+  ++  K  +  +        +LP+SVDWREKG V PVK+
Sbjct: 79  KFGDMTTEEFKQVMNGYNSNGSQKRTKGSLYREPLLA----QLPKSVDWREKGYVTPVKN 134

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFII 217
           QG CGSCWAFS   ++EG     T +L+SLSEQ LVDC   + N GC+GGLMD AF+++ 
Sbjct: 135 QGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTSEGNNGCSGGLMDNAFEYVK 194

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAI 276
            NGG+D+EQ YPYLG +N+C   R      ++ G+ D+   +E +L KAVA+  P+SVAI
Sbjct: 195 NNGGIDTEQAYPYLGQDNECK-YRAECSGANVTGFVDIPSMNERALMKAVANVGPISVAI 253

Query: 277 EAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
           +AG  +FQ YESGV +  +C S+ LDHGV+ VGYG+    +YW+V+NSWG +WG+ GYV 
Sbjct: 254 DAGNPSFQFYESGVYYEPQCSSSQLDHGVLVVGYGSIGKDEYWIVKNSWGEEWGKKGYVL 313

Query: 335 LQRNLLDTNTGKCGIAMEASYP 356
           + +         CGIA  ASYP
Sbjct: 314 MAK----FRNNHCGIATAASYP 331


>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
          Length = 319

 Score =  249 bits (636), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 143/328 (43%), Positives = 201/328 (61%), Gaps = 23/328 (7%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLN 98
           R D E+   +Q W + H K  +    + +R  +++ NL+ I+ HN   +L + +YK+G+N
Sbjct: 1   RADPELDGHWQLWKSWHNKDYHEREESWRRV-VWEKNLKMIELHNLDHTLGKHSYKLGMN 59

Query: 99  KFADLTNEEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           +F D+T EE+R +  G     ++R+   S+     +      E P SVDWREKG V PVK
Sbjct: 60  QFGDMTTEEFRQLMNGYAHKKSERKYRGSQFLEPSFL-----EAPRSVDWREKGYVTPVK 114

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFI 216
           DQG CGSCWAFST  A+EG +   TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ++
Sbjct: 115 DQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYV 174

Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVA 275
             NGG+DSE+ YPY   +++    +      +  G+ D+    E +L KAVA   PVSVA
Sbjct: 175 QDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVA 234

Query: 276 IEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGE 329
           I+AG  +FQ Y+SG+ +  +C S  LDHGV+ VGYG E    +G  YW+V+NSWG  WG+
Sbjct: 235 IDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGD 294

Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            GY+ + ++        CGIA  ASYP+
Sbjct: 295 KGYIYMAKD----RKNHCGIATAASYPL 318


>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
           [Tribolium castaneum]
 gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
          Length = 337

 Score =  249 bits (636), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 145/322 (45%), Positives = 194/322 (60%), Gaps = 16/322 (4%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFA 101
           D V   +  +   H K          R +IF +N   + +HN L      ++K+G+NK++
Sbjct: 21  DLVQEQWGAFKVTHKKQYESETEERFRMKIFMENAHKVAKHNKLYAQGLVSFKLGVNKYS 80

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKV-ASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
           D+ N E+    L   + +K  L   ++  S  +   A  ELP+ +DWR+ GAV PVKDQG
Sbjct: 81  DMLNHEF-VHTLNGYNRSKTPLRSGELDESITFIPPANVELPKQIDWRKLGAVTPVKDQG 139

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
            CGSCW+FST  ++EG +   + +L+SLSEQ L+DC  K  N GCNGGLMD AF++I  N
Sbjct: 140 QCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCSEKYGNNGCNGGLMDNAFRYIKDN 199

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
           GG+D+EQ YPY   + KC    RN K  +  G+ D+   DE  LK AVA   P+SVAI+A
Sbjct: 200 GGIDTEQSYPYKAEDEKCHYKPRN-KGATDRGFVDIESGDEEKLKAAVATVGPISVAIDA 258

Query: 279 GGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKL 335
               FQ Y  GV +  EC S  LDHGV+ VGYGT E+G DYWLV+NSWG  WG+ GY+K+
Sbjct: 259 SHPTFQQYSEGVYYEPECSSEQLDHGVLVVGYGTDEDGNDYWLVKNSWGDSWGDQGYIKM 318

Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
            RN  D N   CGIA +ASYP+
Sbjct: 319 ARN-RDNN---CGIATQASYPL 336


>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
 gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
          Length = 338

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 140/321 (43%), Positives = 193/321 (60%), Gaps = 29/321 (9%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLNKFADLTNE 106
           +++ W +KH   S        R  +++ NL+ I+    EH     +Y++G+N F D+TNE
Sbjct: 32  LWKNWHSKHYHESE----EGWRRMVWEKNLKKIEIHNLEHTMGKHSYRLGMNHFGDMTNE 87

Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           E+R    G +   +R+   S      Y      + P++VDWREKG V PVKDQGSCGSCW
Sbjct: 88  EFRQTMNGYKQTTERKFKGSLFMEPNYL-----QAPKAVDWREKGYVTPVKDQGSCGSCW 142

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSE 225
           AFST  A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I  N G+D+E
Sbjct: 143 AFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTE 202

Query: 226 QDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRA 282
           + YPY+G +   DP     +  + +  G+ D+    E ++ KAVA   PVSVAI+AG  +
Sbjct: 203 ESYPYVGTDE--DPCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVSVAIDAGHES 260

Query: 283 FQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQ 336
           FQ YESG+ +  EC S  LDHGV+ VGYG E    +G  YW+V+NSW   WG+ GY+ + 
Sbjct: 261 FQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMA 320

Query: 337 RNLLDTNTGKCGIAMEASYPV 357
           ++        CGIA  +SYP+
Sbjct: 321 KD----RKNHCGIATASSYPL 337


>gi|281204396|gb|EFA78592.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
          Length = 330

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 134/294 (45%), Positives = 185/294 (62%), Gaps = 16/294 (5%)

Query: 70  EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
           + R+  FK+NL  I + NS   +  +G+N  ADL+NEEYR +YLG + DA R  +  + A
Sbjct: 46  QDRYNAFKNNLDLIHKWNSQGHSTVLGVNHLADLSNEEYRNLYLGVKVDASR--LPQQAA 103

Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
           S +   K    +  S+DWR  GAV  VKDQG CGSCW+FST  ++EG N+I TG   SLS
Sbjct: 104 SIKLN-KVFAPVAASLDWRSSGAVGRVKDQGQCGSCWSFSTTGSIEGANQIATGNFASLS 162

Query: 190 EQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN---KCDPSRRNAK 245
           EQ+L+DC R   N GCNGGLMD A +++I  GG+D+E+ YPY  +++   K +P+   AK
Sbjct: 163 EQQLMDCSRDYGNEGCNGGLMDAAMKYVIAQGGLDTEESYPYTMSDSYTCKFNPANIGAK 222

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA--LDHGV 303
           + S   Y DV    E  L   +   PVSVAI+A   +FQ Y+SGV+     S+  LDHGV
Sbjct: 223 ISS---YIDVQRGSETDLAAKLNKGPVSVAIDASHSSFQLYKSGVYYEPACSSYNLDHGV 279

Query: 304 VAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           +AVGYGTE   +YW+V+NSWG +WG +GY+ + ++     +  CGI+  AS PV
Sbjct: 280 LAVGYGTEGSSNYWIVKNSWGPNWGLSGYIWMAKD----KSNHCGISSMASIPV 329


>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
 gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
          Length = 339

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 141/323 (43%), Positives = 193/323 (59%), Gaps = 16/323 (4%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
           D +   + T+  +H KT         R +IF +N   I +HN        T+K+ +NK+A
Sbjct: 21  DVIKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYA 80

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKS--KVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           D+ + E+R    G      + L  S        +   A  +LP+SVDWREKGAV  VKDQ
Sbjct: 81  DMLHHEFRETMNGFNYTLHKELRASDPSFTGITFISPAHVKLPKSVDWREKGAVTAVKDQ 140

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
           G CGSCWAFS+  A+EG +   TG L+SLSEQ LVDC  K  N GCNGGLMD AF++I  
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKD 200

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIE 277
           NGG+D+E+ YPY G ++ C  ++ +       G+ D+   +E  + +AVA   PVSVAI+
Sbjct: 201 NGGIDTEKSYPYEGIDDSCHFNKDSVGATD-RGFADIPQGNEKKMAEAVATIGPVSVAID 259

Query: 278 AGGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVK 334
           A   +FQ Y  G++   EC S  LDHGV+ VGYGT E+G DYWLV+NSWG+ WG+ G++K
Sbjct: 260 ASHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESGKDYWLVKNSWGTTWGDKGFIK 319

Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
           + RN       +CGIA  +SYP+
Sbjct: 320 MARN----EDNQCGIASASSYPL 338


>gi|414591039|tpg|DAA41610.1| TPA: hypothetical protein ZEAMMB73_356414 [Zea mays]
          Length = 376

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 150/335 (44%), Positives = 201/335 (60%), Gaps = 27/335 (8%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
           +++ + ++Y+ W + H   S  +   + RF+ FK N R I E N   +  YK+GLNKFAD
Sbjct: 37  SEESMWSLYERWRSVH-TVSRDLREKQSRFEAFKANARHIGEFNKRKDVPYKLGLNKFAD 95

Query: 103 LTNEEYRAMYLGTR---SDAKRRLMK------SKVASQRYACKAGDELPESVDWREKGAV 153
           LT EE+ + Y G +   S+A  RL        S  +  + A   GD  P++ DWR+ GAV
Sbjct: 96  LTQEEFVSKYTGAKVVDSEAAARLASGVRVSSSDESPPQLAASVGDA-PDAWDWRDHGAV 154

Query: 154 NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAG-CN-GGLMDY 211
             VKDQG CGSCWAFS V AVE +N IVTG L++LSEQ+++DC     AG C  GG   Y
Sbjct: 155 TAVKDQGQCGSCWAFSAVGAVESVNAIVTGNLLTLSEQQMLDCS---GAGDCTYGGYTYY 211

Query: 212 AFQFIIQNG-GMDSEQDYPYLGAENKCD--PSRRNAK---VVSIDGYEDVSPFDEMSLKK 265
           A  + I NG  +D     PY    +     P R +AK   VV ID    ++  DE +LK+
Sbjct: 212 AMLYAISNGLTLDQCGKTPYYQRYDAQQHLPCRFDAKKPPVVKIDSMYVMNNADEAALKR 271

Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWG 324
           AV  QPVSV I+AGG    +Y  GVFTG CG++L+H V+ VGYG T +G  YW+V+NSWG
Sbjct: 272 AVYKQPVSVLIDAGG--IGYYSEGVFTGPCGTSLNHAVLLVGYGATADGTKYWIVKNSWG 329

Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
           +DWGE GY +L+R+ + T  G CGI M   YP+KN
Sbjct: 330 ADWGEKGYFRLKRD-VGTQGGLCGITMYPIYPIKN 363


>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
 gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
          Length = 338

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 141/329 (42%), Positives = 195/329 (59%), Gaps = 26/329 (7%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLN 98
           R D ++   +  W   H K+ +      +R  +++ NL+ I+    EH     +Y++G+N
Sbjct: 21  RFDSQLEDHWHLWKNWHSKSYHESEEGWRRM-VWEKNLKKIEMHNLEHTMGKHSYRLGMN 79

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
            F D+TNEE+R    G +   +R+   S      Y      + P++VDWREKG V PVKD
Sbjct: 80  HFGDMTNEEFRQTMNGYKQTTERKFKGSLFMEPNYL-----QAPKAVDWREKGYVTPVKD 134

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFII 217
           QGSCGSCWAFST  A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I 
Sbjct: 135 QGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQ 194

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSV 274
            N G+D+E+ YPY+G +   DP     +    +  G+ D+    E ++ KAVA   PVSV
Sbjct: 195 DNAGLDTEESYPYVGTDE--DPCHYKPEFSGANETGFVDIPSGKEHAMMKAVAAVGPVSV 252

Query: 275 AIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
           AI+AG  +FQ YESG+ +  EC S  LDHGV+ VGYG E    +G  YW+V+NSW   WG
Sbjct: 253 AIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWG 312

Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + GY+ + ++        CGIA  +SYP+
Sbjct: 313 DKGYIYMAKD----RKNHCGIATASSYPL 337


>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
          Length = 394

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 126/291 (43%), Positives = 182/291 (62%), Gaps = 9/291 (3%)

Query: 71  KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
           KR+ IFK+NL +I  HN    +Y + +NKF DLT EE+R  YLG +    R     +   
Sbjct: 108 KRYAIFKNNLTYIHNHNMQGYSYVLKMNKFGDLTLEEFRQRYLGYKKPDLR--TPPREVD 165

Query: 131 QRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSE 190
                   +++P  VDWR++G V  VKDQG CGSCWAFS   A+EG+    TG+L++LS+
Sbjct: 166 TTLESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATGAMEGVYCAKTGKLVNLSQ 225

Query: 191 QELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
           Q+LVDC R + N GC+GG M+ AF+++++NGG+ S ++YPY+  +  C  S+  + V +I
Sbjct: 226 QQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENYPYMRKDGVCKSSQCTS-VATI 284

Query: 250 DGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGY 308
            GY  V    E S+K A+A   PVSVAI+A   AFQ Y  G+F   CG+ LDHGV+ VGY
Sbjct: 285 TGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYYDGIFDAPCGTNLDHGVLLVGY 344

Query: 309 GTENG--VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             E     DYW+++NSWG+ WG+ GY+ +   +     G+CG+ ++ S+PV
Sbjct: 345 SAETAGQGDYWIMKNSWGAAWGKGGYMLMA--MHKGPAGQCGVLLDGSFPV 393


>gi|161172356|pdb|3BCN|A Chain A, Crystal Structure Of A Papain-Like Cysteine Protease
           Ervatamin-A Complexed With Irreversible Inhibitor E-64
 gi|161172357|pdb|3BCN|B Chain B, Crystal Structure Of A Papain-Like Cysteine Protease
           Ervatamin-A Complexed With Irreversible Inhibitor E-64
          Length = 209

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 125/218 (57%), Positives = 149/218 (68%), Gaps = 11/218 (5%)

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           LPE VDWR KGAV P+K+QG CGSCWAFSTV  VE IN+I TG LISLSEQ+LVDC +K 
Sbjct: 1   LPEHVDWRAKGAVIPLKNQGKCGSCWAFSTVTTVESINQIRTGNLISLSEQQLVDCSKK- 59

Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
           N GC GG  D A+Q+II NGG+D+E +YPY   +    P R   KVV IDG + V   +E
Sbjct: 60  NHGCKGGYFDRAYQYIIANGGIDTEANYPYKAFQG---PCRAAKKVVRIDGCKGVPQCNE 116

Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
            +LK AVA QP  VAI+A  + FQHY+ G+FTG CG+ L+HGVV VGYG     DYW+VR
Sbjct: 117 NALKNAVASQPSVVAIDASSKQFQHYKGGIFTGPCGTKLNHGVVIVGYGK----DYWIVR 172

Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           NSWG  WGE GY +++R       G CGIA    YP K
Sbjct: 173 NSWGRHWGEQGYTRMKR---VGGCGLCGIARLPFYPTK 207


>gi|390994425|gb|AFM37362.1| cathepsin L2 [Dictyocaulus viviparus]
          Length = 352

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 140/296 (47%), Positives = 187/296 (63%), Gaps = 20/296 (6%)

Query: 73  FQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            + F  N+  I+EHN  +R    T+++GLN  ADL   EYR +  G R    RRL    +
Sbjct: 65  MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRKLN-GYRH---RRLFGDSM 120

Query: 129 ASQ--RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
                ++      ++P+SVDWRE   V PVK+QG CGSCWAFS   A+EG +   TG+L+
Sbjct: 121 RKNGTKFLVPFNVKVPDSVDWREHNLVTPVKNQGMCGSCWAFSATGALEGQHFRATGKLV 180

Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           SLSEQ LVDC  K  N GCNGGLMD AF++I  N G+D+E+ YPY+G E +C   +R+  
Sbjct: 181 SLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEEGYPYVGKEMRCHFKKRDIG 240

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGV-FTGECGS-ALDHG 302
                G+ D+   DE +LK AVA Q P+S+AI+AG R+FQ Y+ GV F  EC S  LDHG
Sbjct: 241 AED-RGFVDLPEGDEDALKVAVATQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHG 299

Query: 303 VVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           V+ VGYGT+    DYW+++NSWG+ WGE GYV++ RN        CG+A +ASYP+
Sbjct: 300 VLLVGYGTDPEAGDYWIIKNSWGTKWGEKGYVRIARN----RNNHCGVATKASYPL 351


>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 333

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 139/302 (46%), Positives = 186/302 (61%), Gaps = 32/302 (10%)

Query: 72  RFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE-------YRAMYLGTRSDAK 120
           R   ++ NL+ + EHN        TY +G+NK+AD+T  E       Y A   G R+  +
Sbjct: 47  RRATWEGNLQKVQEHNLQADLGVHTYWLGMNKYADMTVTEFVKVMNGYNATMRGQRTQDR 106

Query: 121 RRL-MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
                 SK+A           LP++VDWR+KG V  VKDQG CGSCWAFST  A+EG + 
Sbjct: 107 HTFSFNSKIA-----------LPDTVDWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHF 155

Query: 180 IVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
             TG+L+SLSEQ LVDC  ++ N GCNGGLMD AF++I +N G+D+E  YPY   +N+C 
Sbjct: 156 KQTGKLVSLSEQNLVDCSGKQGNMGCNGGLMDQAFEYIKENNGIDTEDSYPYEAVDNQCR 215

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGECGS 297
               N       G+ D++  DE +L++AVA   P+SVAI+AG  +FQ Y+ GV+     S
Sbjct: 216 FKAANVGATDT-GFTDITSKDESALQQAVATVGPISVAIDAGHTSFQLYKHGVYNEPFCS 274

Query: 298 A--LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
              LDHGV+AVGYGT++G DYWLV+NSWG  WG+ GY+K+ RN       +CGIA  ASY
Sbjct: 275 QTRLDHGVLAVGYGTDSGKDYWLVKNSWGEGWGDKGYIKMTRN----KRNQCGIATAASY 330

Query: 356 PV 357
           P+
Sbjct: 331 PL 332


>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
          Length = 316

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 136/308 (44%), Positives = 193/308 (62%), Gaps = 19/308 (6%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
           ++QT+ AK+GK        E R ++   N+ +I++ NS   ++ +G+  FAD+TN E+  
Sbjct: 26  LFQTFEAKYGKNYLS-SEREYRKKVLAYNMDWIEKFNSDEHSFTLGMTPFADMTNTEF-- 82

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
                 + +K      K  + + A    +   ES+DWREKGAV PVK+QGSCGSCWAFS 
Sbjct: 83  ------ATSKLCGCMKKPLNHKQARVLNNMAVESIDWREKGAVTPVKNQGSCGSCWAFSA 136

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
             A+EG N + TG+L+SLSEQ+LVDCD + +AGC GG MD AF+++++  G+ +E+DYPY
Sbjct: 137 TGALEGGNFVATGKLVSLSEQQLVDCDTE-DAGCGGGFMDTAFEYVMKK-GLCTEEDYPY 194

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
              +  C   +  + V+SI GYEDV   D ++LK+A+   PVSVAI+A    FQ Y  GV
Sbjct: 195 HAKDEDCKDDQCTS-VISITGYEDVPANDGVALKQALTKAPVSVAIQADSFVFQMYTGGV 253

Query: 291 FTGE-CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
              + CG++L+HGV+AVGY  E    Y +V+NSWG+ WG+ GYVK+     D   G CGI
Sbjct: 254 LDSDMCGTSLNHGVLAVGYAKE----YIIVKNSWGASWGDKGYVKIAHR--DQGEGICGI 307

Query: 350 AMEASYPV 357
            M ASYP 
Sbjct: 308 NMAASYPT 315


>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
 gi|255645733|gb|ACU23360.1| unknown [Glycine max]
          Length = 362

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 150/368 (40%), Positives = 209/368 (56%), Gaps = 33/368 (8%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           M L  + L   F +  S    +S+    N  +  +S   ++EV  ++Q W  +H +    
Sbjct: 2   MSLQRTKLFPFFIVLVSFTCSLSLAMSSNQLEQFAS---EEEVFQLFQAWQKEHKREYGN 58

Query: 66  MGHNEKRFQIFKDNLRFIDEHNSLNRT----YKVGLNKFADLTNEEYRAMYLG------T 115
                KRFQIF+ NLR+I+E N+  ++    +++GLNKFAD++ EE+   YL       +
Sbjct: 59  QEEKAKRFQIFQSNLRYINEMNAKRKSPTTQHRLGLNKFADMSPEEFMKTYLKEIEMPYS 118

Query: 116 RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVE 175
             +++++L K   A     C   D LP SVDWR+KGAV  V+DQG C S WAFS   A+E
Sbjct: 119 NLESRKKLQKGDDAD----C---DNLPHSVDWRDKGAVTEVRDQGKCQSHWAFSVTGAIE 171

Query: 176 GINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
           GINKIVTG L+SLS Q++VDCD   + GC GG    AF ++I+NGG+D+E  YPY     
Sbjct: 172 GINKIVTGNLVSLSVQQVVDCD-PASHGCAGGFYFNAFGYVIENGGIDTEAHYPYTAQNG 230

Query: 236 KCDPSRRNA-KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE 294
            C   + NA KVVSID    V   +E  L + V+ QPVSV+I+A G   Q Y  GV+ GE
Sbjct: 231 TC---KANANKVVSIDNLLVVVGPEEALLCR-VSKQPVSVSIDATG--LQFYAGGVYGGE 284

Query: 295 -CGSALDHGVVA---VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT-NTGKCGI 349
            C        +    VGYG+  G DYW+V+NSWG DWGE GY+ ++RN+ D    G C I
Sbjct: 285 NCSKNSTKATLVCLIVGYGSVGGEDYWIVKNSWGKDWGEEGYLLIKRNVSDEWPYGVCAI 344

Query: 350 AMEASYPV 357
                +P+
Sbjct: 345 NAAPGFPI 352


>gi|21483188|gb|AAK77918.1| cathepsin L 1 [Dictyocaulus viviparus]
          Length = 347

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 140/296 (47%), Positives = 187/296 (63%), Gaps = 20/296 (6%)

Query: 73  FQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            + F  N+  I+EHN  +R    T+++GLN  ADL   EYR +  G R    RRL    +
Sbjct: 60  MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRKLN-GYRH---RRLFGDSM 115

Query: 129 ASQ--RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
                ++      ++P+SVDWRE   V PVK+QG CGSCWAFS   A+EG +   TG+L+
Sbjct: 116 RKNGTKFLVPFNVKVPDSVDWREHNLVTPVKNQGMCGSCWAFSATGALEGQHFRATGKLV 175

Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           SLSEQ LVDC  K  N GCNGGLMD AF++I  N G+D+E+ YPY+G E +C   +R+  
Sbjct: 176 SLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEEGYPYVGKEMRCHFKKRDIG 235

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGV-FTGECGS-ALDHG 302
                G+ D+   DE +LK AVA Q P+S+AI+AG R+FQ Y+ GV F  EC S  LDHG
Sbjct: 236 AED-RGFVDLPEGDEDALKVAVATQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHG 294

Query: 303 VVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           V+ VGYGT+    DYW+++NSWG+ WGE GYV++ RN        CG+A +ASYP+
Sbjct: 295 VLLVGYGTDPEAGDYWIIKNSWGTKWGEKGYVRIARN----RNNHCGVATKASYPL 346


>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
          Length = 359

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 149/372 (40%), Positives = 215/372 (57%), Gaps = 31/372 (8%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MATAS  LA+  L     + + +A     I+                ++  ++ W A++ 
Sbjct: 3   MATASASLALVMLFACSLLLAGTAFSDDTIAIP--------------LLERFKAWQAEYN 48

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSD 118
           +T       ++RF ++ +NLRFI   N L+   +Y++G N+F DLT EE++  YL    +
Sbjct: 49  RTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDE 108

Query: 119 --AKRRLMKSKVASQRYACKA-GD---ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
                  M   V +   A  + GD   E P SVDWR KGAV PVK+Q  CGSCWAF+TVA
Sbjct: 109 QPPAAEAMPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVA 168

Query: 173 AVEGINKIVTGELISLSEQELVDCDRKIN-AGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
           ++EG+++I TG L+SLSEQE+VDCDR  N  GC GG    A +++ +NGG+ +E DYPY+
Sbjct: 169 SIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGGLTTESDYPYV 228

Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
           G++ +C   +       I GY+ V   +E  L++AVA +PV+V I+A  RAFQ Y+ GVF
Sbjct: 229 GSQRQCMSGKLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDA-SRAFQFYKRGVF 287

Query: 292 TGECG-SALDHGVVAVGYGTENGV-----DYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           +G C  + ++H V  VGYG+          YW+V+NSWG  WGENGYV++ R +     G
Sbjct: 288 SGPCNTTTVNHAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRARE-G 346

Query: 346 KCGIAMEASYPV 357
            C IA+E  YPV
Sbjct: 347 MCAIAIEPYYPV 358


>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
          Length = 356

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 138/351 (39%), Positives = 203/351 (57%), Gaps = 19/351 (5%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           +VFLF       A  S  S D            D +M  ++ W+ ++G+         +R
Sbjct: 7   VVFLFLFLCVMWASPSAASAD---------EPSDPMMKRFEEWMVEYGRVYKDNDEKMRR 57

Query: 73  FQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
           FQIFK+N+  I+  NS N  +Y +G+N+F D+TN E+ A Y G  S   R L   +    
Sbjct: 58  FQIFKNNVNHIETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGIS---RPLNIEREPVV 114

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
            +       +P+S+DWR+ GAV  VK+Q  CG+CWAF+ +A VE I KI  G L  LSEQ
Sbjct: 115 SFDDVDISAVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQ 174

Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
           +++DC +    GC GG    AF+FII N G+ S   YPY  A+  C  +        I G
Sbjct: 175 QVLDCAK--GYGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTCK-TNGVPNSAYITG 231

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
           Y  V   +E S+  AV+ QP++VA++A    FQ+Y+SGVF G CG++L+H V A+GYG +
Sbjct: 232 YARVPRNNESSMMYAVSKQPITVAVDANAN-FQYYKSGVFNGPCGTSLNHAVTAIGYGQD 290

Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
            NG  YW+V+NSWG+ WGE GY+++ R+ + +++G CGIA+++ YP   S+
Sbjct: 291 SNGKKYWIVKNSWGARWGEAGYIRMARD-VSSSSGICGIAIDSLYPTLESR 340


>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
          Length = 358

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 141/312 (45%), Positives = 188/312 (60%), Gaps = 20/312 (6%)

Query: 57  AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMY 112
           A HGK  +       R +I+ +N   I  HN        +YK+ +N+F DL + E+    
Sbjct: 55  ALHGKEYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDLLHHEF---- 110

Query: 113 LGTRSDAKRRLMKSKVASQRYACKAGDE---LPESVDWREKGAVNPVKDQGSCGSCWAFS 169
           + TR+  KR    +      Y    G E   LP++VDWR+KGAV PVK+QG CGSCWAFS
Sbjct: 111 VSTRNGFKRNYRSTPREGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFS 170

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           T  ++EG +   TG ++SLSEQ LVDC  K  N GC GGLMD AF++I  NGG+D+E  Y
Sbjct: 171 TTGSLEGQHFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYIKANGGIDTELSY 230

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYE 287
           PY G +  C   + +       G+ D+   +E  LKKAVA   PVSVAI+A   +FQ Y 
Sbjct: 231 PYNGTDGICHFEKSDVGATDT-GFVDIPEGNEQLLKKAVATVGPVSVAIDASHESFQFYS 289

Query: 288 SGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
            GV+   EC S +LDHGV+ VGYGT++G DYWLV+NSWG+ WG++GY+ + RN       
Sbjct: 290 QGVYDEPECSSESLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDDGYIYMTRN----KEN 345

Query: 346 KCGIAMEASYPV 357
           +CGIA  ASYP+
Sbjct: 346 QCGIASSASYPL 357


>gi|46576373|sp|P83654.1|ERVC_TABDI RecName: Full=Ervatamin-C; Short=ERV-C
 gi|46014979|pdb|1O0E|A Chain A, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
           Protease Ervatamin C
 gi|46014980|pdb|1O0E|B Chain B, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
           Protease Ervatamin C
          Length = 208

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 123/218 (56%), Positives = 153/218 (70%), Gaps = 11/218 (5%)

Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
           LPE +DWR+KGAV PVK+QGSCGSCWAFSTV+ VE IN+I TG LISLSEQELVDCD+K 
Sbjct: 1   LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59

Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
           N GC GG   +A+Q+II NGG+D++ +YPY   +  C  +   +KVVSIDGY  V   +E
Sbjct: 60  NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAA---SKVVSIDGYNGVPFCNE 116

Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
            +LK+AVA QP +VAI+A    FQ Y SG+F+G CG+ L+HGV  VGY      +YW+VR
Sbjct: 117 XALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQA----NYWIVR 172

Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           NSWG  WGE GY+++ R       G CGIA    YP K
Sbjct: 173 NSWGRYWGEKGYIRMLR---VGGCGLCGIARLPYYPTK 207


>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
          Length = 333

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 141/319 (44%), Positives = 192/319 (60%), Gaps = 18/319 (5%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADL 103
           + T ++ + ++H K  +       RF+IF +N   + +HN+       +YK+ +NKF DL
Sbjct: 23  LRTEWEAFKSQHNKAYSSHVEELLRFKIFTENTLLVAKHNAKYAKGLVSYKLAMNKFGDL 82

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
              E+  M  G R    +    + +     A      LP +VDWR+KGAV PVK+QG CG
Sbjct: 83  LPHEFAKMVNGYRGKQNKEQRPTFIPP---ANLNDSSLPTTVDWRKKGAVTPVKNQGQCG 139

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQNGGM 222
           SCWAFST  ++EG +   TG+L+SLSEQ LVDC D   N GCNGGLMD  FQ+I  NGG+
Sbjct: 140 SCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFGNQGCNGGLMDNGFQYIKANGGI 199

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGG 280
           D+E+ +PY   +  C    + A V + D G+ D+    E  LKKAVA   PVSVAI+A  
Sbjct: 200 DTEESHPYTAQDGDC--KFKKADVGATDAGFVDIQQGSEDDLKKAVATVGPVSVAIDASH 257

Query: 281 RAFQHYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
            +FQ Y  GV+   +C S+ LDHGV+ VGYG +NG  YWLV+NSWG DWG+NGY+ + R+
Sbjct: 258 GSFQLYSQGVYDEPDCSSSQLDHGVLTVGYGVKNGKKYWLVKNSWGGDWGDNGYILMSRD 317

Query: 339 LLDTNTGKCGIAMEASYPV 357
                  +CGIA  ASYP+
Sbjct: 318 ----KDNQCGIASSASYPL 332


>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
          Length = 371

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 139/319 (43%), Positives = 193/319 (60%), Gaps = 24/319 (7%)

Query: 50  TIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTN 105
           +++Q +L K+ +  +     E+R  IF +N   I EHN L      +Y +G+N F+D TN
Sbjct: 65  SMWQAFLEKYKRVYDSKLEEERRLGIFTENFIRISEHNLLFEKGEVSYSMGINAFSDKTN 124

Query: 106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
            E   +  G R  +K     S+  SQ     A    P  VDWR KGAV PVK+QG CGSC
Sbjct: 125 SELDVLR-GFRHSSK----ASRSGSQYIPFDAAP--PAEVDWRTKGAVTPVKNQGDCGSC 177

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           WAFS    +EG + + TG+L+SLSEQ+LVDC    N GC+GGLMD AF+++ ++ G+D+E
Sbjct: 178 WAFSATGGIEGQHYLATGKLVSLSEQQLVDCSSS-NDGCDGGLMDLAFEYVKEHKGIDTE 236

Query: 226 QDYPYL----GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGG 280
             YPY+    G   +C    + A  V++ GY D+    E+ L++AV    P+SV I AG 
Sbjct: 237 VHYPYVSGNTGYARQCSFDPKYA-AVNVTGYVDIPEGQELLLQQAVGFHGPISVGINAGL 295

Query: 281 RAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
            +F  YESG+++   C    LDHGV+ VGYG +NGV YWL++NSWG DWGENGYV++ RN
Sbjct: 296 PSFMAYESGIYSDHRCNPHDLDHGVLVVGYGVDNGVPYWLIKNSWGEDWGENGYVRILRN 355

Query: 339 LLDTNTGKCGIAMEASYPV 357
               +   CG+A  ASYP+
Sbjct: 356 ----HNNLCGVATMASYPL 370


>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
          Length = 356

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 138/351 (39%), Positives = 203/351 (57%), Gaps = 19/351 (5%)

Query: 13  LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
           LVFLF       A  S  S D            D +M  ++ W+ ++G+         +R
Sbjct: 7   LVFLFLFLCVMWASPSAASAD---------EPSDPMMKRFEEWMVEYGRVYKDNDEKMRR 57

Query: 73  FQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
           FQIFK+N+  I+  NS N+ +Y +G+N+F D+TN E+ A Y G  S   R L   +    
Sbjct: 58  FQIFKNNVNHIETFNSRNKDSYTLGINQFTDMTNNEFVAQYTGGIS---RPLNIEREPVV 114

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
            +       +P+S+DWR+ GAV  VK+Q  CG+CWAF+ +A VE I KI  G L  LSEQ
Sbjct: 115 SFDDVDISAVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQ 174

Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
           +++DC +    GC GG    AF+FII N G+ S   YPY  A+  C  +        I G
Sbjct: 175 QVLDCAK--GYGCKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTCK-TNGVPNSAYITG 231

Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
           Y  V   +E S+  AV+ QP++VA++A   + Q+Y SGVF G CG++L+H V A+GYG +
Sbjct: 232 YARVPRNNESSMMYAVSKQPITVAVDANANS-QYYNSGVFNGPCGTSLNHAVTAIGYGQD 290

Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
            NG  YW+V+NSWG+ WGE GY+++ R+ + +++G CGIA+++ YP   S+
Sbjct: 291 SNGKKYWIVKNSWGARWGEAGYIRMARD-VSSSSGICGIAIDSLYPTLESR 340


>gi|298709635|emb|CBJ31444.1| Cathepsin L-like proteinase [Ectocarpus siliculosus]
          Length = 475

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 140/315 (44%), Positives = 192/315 (60%), Gaps = 17/315 (5%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE---HNSLNRTYKVGLNKFADLTNEEYRAM 111
           W  K+G++    G   + F   ++  R  D+   HN  +  Y +  N ++ ++ +E+R  
Sbjct: 164 WTYKYGQS---WGSVHEAFHALQNYARADDKIALHNHEDAGYTLAHNAYSHMSWQEFREH 220

Query: 112 Y-LGTRSDAKRRLMKSKVASQRYACKAGDEL------PESVDWREKGAVNPVKDQGSCGS 164
           + +G         + ++ A +    KA  EL      P+ VDW  KGAV PVK+QGSCGS
Sbjct: 221 FSIGKDMVVPPDQLPAEFALRPRGEKAPKELLRGAPIPDEVDWVAKGAVTPVKNQGSCGS 280

Query: 165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
           CW+FST  ++EG + I  G L  LSEQELVDCD   + GCNGGLMDY+F +I QNGG+ S
Sbjct: 281 CWSFSTTGSMEGAHFIKHGNLAVLSEQELVDCD-TYDMGCNGGLMDYSFHWIQQNGGICS 339

Query: 225 EQDYPYLGAENKCDPSRRN-AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
           E+DYPY  A + C  S  +  +   +D + DV+  DE +L +AVA QPVS+AIEA   +F
Sbjct: 340 EEDYPYTAAGDLCKKSTCDVVEGTMVDKWVDVASDDEQALMEAVAQQPVSIAIEADQMSF 399

Query: 284 QHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           Q Y  GV T  CG+ LDHGV+ VGYG +E+GV YW V+NSWG +WG  GY+ L+R   D 
Sbjct: 400 QLYSGGVLTAACGTNLDHGVLLVGYGVSEDGVKYWKVKNSWGPEWGAEGYILLKRE-ADQ 458

Query: 343 NTGKCGIAMEASYPV 357
             G+CGI  +ASYPV
Sbjct: 459 EGGECGILEQASYPV 473


>gi|50539796|ref|NP_001002368.1| cathepsin L.1 precursor [Danio rerio]
 gi|49900360|gb|AAH75887.1| Cathepsin L.1 [Danio rerio]
          Length = 334

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 139/320 (43%), Positives = 195/320 (60%), Gaps = 26/320 (8%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
           +  W  K GK+         R   +  N + +  HN +     ++Y++G+  FAD++NEE
Sbjct: 26  FHAWKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNMMADQGLKSYRLGMTYFADMSNEE 85

Query: 108 YRAMY----LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           YR +     LG+ ++ K R       S  +  +    +P++VDWR+KG V  +KDQ  CG
Sbjct: 86  YRQLVFRGCLGSMNNTKAR-----GGSTFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCG 140

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
           SCWAFS   ++EG     TG+L+SLSEQ+LVDC     N GC+GGLMD AFQ+I  N G+
Sbjct: 141 SCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGL 200

Query: 223 DSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
           D+E  YPY   + +C  +PS   A   S  GY D++  DE +L++AVA   P+SVAI+AG
Sbjct: 201 DTEDSYPYEAQDGECRFNPSTVGA---SCTGYVDIASGDESALQEAVATIGPISVAIDAG 257

Query: 280 GRAFQHYESGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
             +FQ Y SGV+   +C S+ LDHGV+AVGYG+ NG DYW+V+NSWG DWG  GY+ + R
Sbjct: 258 HSSFQLYSSGVYNEPDCSSSELDHGVLAVGYGSSNGDDYWIVKNSWGLDWGVQGYILMSR 317

Query: 338 NLLDTNTGKCGIAMEASYPV 357
           N     + +CGIA  ASYP+
Sbjct: 318 N----KSNQCGIATAASYPL 333


>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
          Length = 334

 Score =  248 bits (633), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 139/294 (47%), Positives = 188/294 (63%), Gaps = 15/294 (5%)

Query: 72  RFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           R +IF +N + I++HNS  +    ++K+ LN  AD+   EY  +YLG    +K     +K
Sbjct: 47  RKKIFLENKKRIEKHNSRYKQGKVSFKLKLNHLADMLIHEYSDVYLGFNKSSKAN--NNK 104

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
           + S  +   A   L + VDWR KGAV PVK+QG CGSCWAFST  A+EG N   TG+L+S
Sbjct: 105 LQSYTFIPPAHVTLNKEVDWRTKGAVTPVKNQGHCGSCWAFSTTGALEGQNFRKTGKLVS 164

Query: 188 LSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           LSEQ LVDC     N GC GGLMD AFQ+I +N G+D+E+ YPY G +  C   R+ +  
Sbjct: 165 LSEQNLVDCSGSYGNNGCEGGLMDNAFQYIKENHGIDTEKSYPYEGEDETC-RFRKTSIG 223

Query: 247 VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGSA-LDHGV 303
            +  G+ D++  DE +L +AVA   P+SVAI+A  ++FQ Y  GV +  EC S  LDHGV
Sbjct: 224 ATDSGFVDITQGDEEALMQAVATIGPISVAIDASHQSFQFYSEGVYYEPECSSENLDHGV 283

Query: 304 VAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + VGYG E+   YWLV+NSWG+ WG+ GY+K+ R+  D N   CGIA +ASYP+
Sbjct: 284 LVVGYGVEDNQKYWLVKNSWGTQWGDGGYIKMARD-QDNN---CGIATQASYPL 333


>gi|116788286|gb|ABK24823.1| unknown [Picea sitchensis]
          Length = 294

 Score =  248 bits (633), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 132/276 (47%), Positives = 174/276 (63%), Gaps = 14/276 (5%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
           I  LV L  + SS  A    I+Y N  D S     ++ +++++  W   HGKT       
Sbjct: 7   ILKLVMLLLVFSSVTA----ITY-NPRDLS-----ENGLLSLFDRWCNHHGKTYTAK-QR 55

Query: 70  EKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
             RFQ+FK+NL +I EHNS  N T+ +GLN F+DLT++E+R   +G R       +KS+ 
Sbjct: 56  PLRFQVFKENLFYISEHNSRGNHTFWLGLNAFSDLTSDEFRTQQMGLRGHPPS--LKSRR 113

Query: 129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
              +        +P S+DWR+K AV  VKDQG+CG CWAFS   A+EGINKIVTG L+SL
Sbjct: 114 REPKSGLLELYNIPSSLDWRDKDAVTGVKDQGACGDCWAFSATGAIEGINKIVTGSLVSL 173

Query: 189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVS 248
           SEQEL DCD   N+GC+GGLMDYAFQ++I NGG+D+E DYPY G +  C+  + N +VV+
Sbjct: 174 SEQELCDCDTSYNSGCDGGLMDYAFQWVIVNGGIDTEVDYPYKGVQKACNSKKVNRRVVT 233

Query: 249 IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
           ID Y DV   +E +L +AV  QPVSV I  G RAFQ
Sbjct: 234 IDDYIDVPANNERALLQAVVGQPVSVGISGGERAFQ 269


>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
          Length = 337

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 143/327 (43%), Positives = 195/327 (59%), Gaps = 26/327 (7%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKF 100
           D ++   ++ W   HGK  +      +R  I++ NLR I  HN  +     TY++G+N F
Sbjct: 22  DKQLDDHWEQWKTWHGKNYHEKEEGWRRM-IWEKNLRKIQFHNLEHSMGIHTYRLGMNHF 80

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
            D+ +EE+R +  G +   +R+   S      +      E+P  +DWREKG V PVKDQG
Sbjct: 81  GDMNHEEFRQVMNGYKHKTERKFKGSLFMEPNFL-----EVPSKLDWREKGYVTPVKDQG 135

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
            CGSCWAFST  A+EG      G+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I  N
Sbjct: 136 ECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDN 195

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
            G+DSE+ YPYLG +++  P   + K  + +  G+ D+    E +L KAVA   PVSVAI
Sbjct: 196 NGLDSEEAYPYLGTDDQ--PCHYDPKYNAANDTGFVDIPSGKEHALMKAVASVGPVSVAI 253

Query: 277 EAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
           +AG  +FQ Y+SG+ F  EC S  LDHGV+ VGYG E    +G  YW+V+NSW   WG+ 
Sbjct: 254 DAGHESFQFYQSGIYFEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSESWGDK 313

Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GY+ + ++        CGIA  ASYP+
Sbjct: 314 GYIYMAKD----RKNHCGIATAASYPL 336


>gi|198432215|ref|XP_002130162.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 331

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 143/317 (45%), Positives = 196/317 (61%), Gaps = 21/317 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLNKFADLTNEE 107
           ++ W   +GK         KR  I+ +NL+++ +HN        TYKV  N+FADL+N+E
Sbjct: 24  WEEWKTLYGKVYRAE-EELKRQYIWLENLKYVTQHNLEADEGKHTYKVDTNQFADLSNDE 82

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKDQGSCGSC 165
           +R +     S   R   +    +  +    GD +  P++VDWR++G V PVKDQ  CGSC
Sbjct: 83  WREL---MTSQVTRPTNQMSFCNMTFM-TVGDHVIAPKNVDWRKEGYVTPVKDQKQCGSC 138

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
           WAFST  ++EG +   TG+L+SLSEQ LVDC  K  N GC GGLMD  F++I  NGG+D+
Sbjct: 139 WAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSMKEGNHGCQGGLMDLGFEYIFDNGGIDT 198

Query: 225 EQDYPYLGA-ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRA 282
           E  YPY+   E +C   R N+   ++ G  D+    E +L KAVAD  P+SVAI+AG ++
Sbjct: 199 ESSYPYMAKNEPQCMYKRSNSG-ATLTGCVDIKRGSESALMKAVADVGPISVAIDAGHKS 257

Query: 283 FQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
           FQ Y+SGV +   C S  LDHGV+AVG+G +NG D+WLV+NSWG  WG  GY+ + RN  
Sbjct: 258 FQMYKSGVYYEPSCSSVKLDHGVLAVGFGADNGEDFWLVKNSWGPIWGMEGYIMMSRN-R 316

Query: 341 DTNTGKCGIAMEASYPV 357
           D N   CGIA +ASYP+
Sbjct: 317 DNN---CGIATQASYPL 330


>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
          Length = 336

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 150/360 (41%), Positives = 202/360 (56%), Gaps = 35/360 (9%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMG 67
           +  STL+    + +S+A  +S                 D V++ +++W   HGKT +   
Sbjct: 1   MKCSTLLLSVLVIASTANAVSFF---------------DVVLSDWESWKLMHGKTYSSSI 45

Query: 68  HNEKRFQIFKDNLRFIDEHNS--LN--RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRL 123
             + R +I+ +N   I  HNS  LN    Y + +N + DL + E+ AM  G +   K   
Sbjct: 46  EEKLRLKIYMENSLKISRHNSEALNGIHPYYMKMNHYGDLLHHEFVAMVNGYQYANKTAS 105

Query: 124 MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTG 183
           +        Y      +LP  VDWRE+GAV PVK+QG CGSCW+FS   A+EG +   TG
Sbjct: 106 LGG-----TYIPNKNIQLPTHVDWREEGAVTPVKNQGQCGSCWSFSATGALEGQDFRKTG 160

Query: 184 ELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
           +LISLSEQ LVDC RK  N GC GGLMD+AF +I  N G+D+E  YPY G +  C  + +
Sbjct: 161 KLISLSEQNLVDCSRKFGNNGCEGGLMDFAFTYIRDNKGIDTEASYPYEGIDGHCHYNPK 220

Query: 243 NAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFT-GECGS-AL 299
           N     I G+ D+    E  LKKAVA   P+SVAI+A   +FQ Y  GV+   +C S  L
Sbjct: 221 NKGGSDI-GFVDIKKGSEKDLKKAVAGVGPISVAIDASHMSFQFYSHGVYVESKCSSEEL 279

Query: 300 DHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           DHGV+ VG+GT+  +G DYWLV+NSW   WG+ GY+K+ RN        CGIA  ASYPV
Sbjct: 280 DHGVLVVGFGTDSVSGEDYWLVKNSWSEKWGDQGYIKMARN----KENMCGIASSASYPV 335


>gi|169659203|dbj|BAG12786.1| putative cysteine protease [Sorogena stoianovitchae]
          Length = 293

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 135/293 (46%), Positives = 188/293 (64%), Gaps = 19/293 (6%)

Query: 65  GMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM 124
           G   ++ R  +F +++R ++  N+   +Y +GLN+FADLT EE+ ++YLG        ++
Sbjct: 18  GGAEDKHRLALFAESVRIVETENAKGHSYTLGLNQFADLTTEEFSSLYLGL-------VL 70

Query: 125 KSKV-ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTG 183
           ++KV AS+    + GD   E+VDWR+KGAV PVKDQ SCGSCWAFS   A+EG     TG
Sbjct: 71  ENKVQASESVVLQDGDS-EENVDWRQKGAVTPVKDQKSCGSCWAFSATGAMEGALVKSTG 129

Query: 184 ELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRN 243
           +LI+LSEQ+LVDC  K N GCNGGLM  AF +++   G  +E+DYPY G + +C  +  +
Sbjct: 130 KLINLSEQQLVDCVTKCN-GCNGGLMTAAFDYVLGR-GRATEKDYPYKGVDGRCKQTATD 187

Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGV 303
            K   I GY +V   +  +LK AVA  P+SVA+ A G   Q Y+SGV    CG+ LDHGV
Sbjct: 188 NK---IKGYNNVPQNNYKALKAAVAS-PLSVAVNAAG-TIQRYKSGVIDANCGTRLDHGV 242

Query: 304 VAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           +AVGY    G DYW+V+NSWG+ +GENGY +++    +   G CGI M A+ P
Sbjct: 243 LAVGY---QGEDYWIVKNSWGNGYGENGYFRVKMGTQNGGAGVCGINMMAAQP 292


>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
          Length = 350

 Score =  248 bits (632), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 137/317 (43%), Positives = 196/317 (61%), Gaps = 19/317 (5%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNE 106
           ++Q +   H +T  G     +R ++F++NL+ I  HN L+      Y++G+N+FAD+   
Sbjct: 42  LWQDFKTVHERTY-GETEESQRKEVFRNNLKKIQAHNHLHEQGKSPYRMGINQFADMEAN 100

Query: 107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           E+ ++  G R +  R  ++  + +   +      +P  VDWR++G V PVK+QG CGSCW
Sbjct: 101 EFASIMNGFRMN-NRTEVRDHLHANYISPAIPVSVPAEVDWRKEGYVTPVKNQGQCGSCW 159

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
           AFST  ++EG +   TG+L+SLSEQ LVDC     N GCNGG++DYAFQ+I  N G D+E
Sbjct: 160 AFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDYAFQYIKDNDGDDTE 219

Query: 226 QDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRA 282
             YPY   +  C   R  +  V  +  GY D+   DE  +K+AVA   PVSVAI+A   +
Sbjct: 220 ACYPYEAVDGTC---RFKSVCVGATCTGYTDLPKGDEAKMKEAVALVGPVSVAIDASHSS 276

Query: 283 FQHYESGVFT-GECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
           FQ Y+SG++   EC    LDH V+ VGYGTE G DYWLV+NSWG+ WG+ GY+K+ RN+ 
Sbjct: 277 FQMYQSGIYVEQECSPKQLDHAVLVVGYGTEQGQDYWLVKNSWGTTWGDEGYIKMARNM- 335

Query: 341 DTNTGKCGIAMEASYPV 357
                +CGIA +ASYP+
Sbjct: 336 ---DNQCGIASQASYPL 349


>gi|356560855|ref|XP_003548702.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
          Length = 357

 Score =  248 bits (632), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 145/361 (40%), Positives = 204/361 (56%), Gaps = 22/361 (6%)

Query: 4   ASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTS 63
           + +F+     + L   SSSS   +       N D   S    DE + ++Q W  +HG   
Sbjct: 7   SKLFIFFFICITLICFSSSSNFPVQYSILGPNLDKLPS---QDETIQLFQLWRKEHGLVY 63

Query: 64  NGMGHNEKRFQIFKDNLRFIDEHNSLNRT---YKVGLNKFADLTNEEYRAMYLGTRSDAK 120
             +    KRF+IF  NL +I E N+   +   Y +GLN FAD +  E++ +YL +     
Sbjct: 64  KDLKEMAKRFEIFLSNLNYIIEFNAKRSSPSGYLLGLNNFADWSPSEFQEIYLHSLDMPT 123

Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
                 K+     +C A    P S+DWR K AV  +K+QGSCGSCWAFS   A+EGI+ I
Sbjct: 124 DS--APKLNGPLLSCIA----PASLDWRNKVAVTAIKNQGSCGSCWAFSAAGAIEGIHAI 177

Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE-NKCDP 239
            TGELISLSEQELV+CDR ++ GCNGG ++ AF ++I NGG+  E +YPY G +   C+ 
Sbjct: 178 TTGELISLSEQELVNCDR-VSKGCNGGWVNKAFDWVISNGGITLEAEYPYTGKDGGNCNS 236

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTG-ECGSA 298
            ++     +IDGYE V   D   L  ++  QP+S+ + A    FQ YESG+F G +C S+
Sbjct: 237 DKQVPIKATIDGYEQVEQSDN-GLLCSIVKQPISICLNA--TDFQLYESGIFDGQQCSSS 293

Query: 299 ---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
               +H V+ VGY + NG DYW+V+NSWG+ WG NGY+ ++RN      G CG+   A  
Sbjct: 294 SKYTNHCVLIVGYDSSNGEDYWIVKNSWGTKWGINGYIWIKRN-TGLPYGVCGMNAWAYN 352

Query: 356 P 356
           P
Sbjct: 353 P 353


>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  248 bits (632), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 134/321 (41%), Positives = 197/321 (61%), Gaps = 18/321 (5%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADL 103
           D  +M  ++ W A H ++        +RF++++ N+ +ID  N     TY++G N+FADL
Sbjct: 38  DMLMMDRFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRGGLTYELGENQFADL 97

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD-----ELPESVDWREKGAVNPVKD 158
           T EE+ A Y G  + +   +  +  A   ++    D     + P SVDWR KGAV PVK+
Sbjct: 98  TGEEFLARYAGGHTGSA--ITTAAEADGLWSSGGSDGSLEADPPASVDWRAKGAVTPVKN 155

Query: 159 QGS-CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
           QGS C SCWAFS VA +E +  I TG+L++LSEQ+LVDCD K + GCN G    AFQ+I+
Sbjct: 156 QGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQQLVDCD-KYDGGCNKGYYHRAFQWIM 214

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
           +NGG+ +   YPY      C  ++     V+I G+  V+  +E++L+ AVA QP+ VAIE
Sbjct: 215 ENGGITTAAQYPYKAVRGACSAAK---PAVTITGHLAVAK-NELALQSAVARQPIGVAIE 270

Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
               + Q Y+SGVF+  CG  + H VV VGYG + +G+ YWLV+NSWG  WGE GY++++
Sbjct: 271 V-PISMQFYKSGVFSAACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGYIRMR 329

Query: 337 RNLLDTNTGKCGIAMEASYPV 357
           R++     G CGIA++ +YP 
Sbjct: 330 RDV--GGGGLCGIALDTAYPT 348


>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 333

 Score =  248 bits (632), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 137/290 (47%), Positives = 189/290 (65%), Gaps = 24/290 (8%)

Query: 76  FKDNLRFIDE-HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYA 134
           F  N+ +I+  +N+ ++ YK G+N+F               R+  K  +  S +    + 
Sbjct: 57  FXGNVNYIEACNNAADKPYKXGINQFP-------------PRNRFKGHMCSSIIRITTFK 103

Query: 135 CKAGDELPESVDWREKGAVNP--VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS-EQ 191
            +     P +VD R+KGAV P  VKDQG CG  WA S VAA EGI+ +  G+LI LS E 
Sbjct: 104 FENVTATPSTVDCRQKGAVTPYTVKDQGQCGCFWALSAVAATEGIHALXAGKLILLSXEP 163

Query: 192 ELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR--RNAKVVS 248
           ELVDCD K ++ GC GGL D AF+FIIQN G+++E +YPY G + KC+ +   +NA  + 
Sbjct: 164 ELVDCDTKGVDQGCEGGLTDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEADKNAATI- 222

Query: 249 IDGYEDVSPFDEMS-LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
           I GY+DV   +E + L+KAVA+ PVSVAI+A G  FQ Y+SGVFTG CG+ LDHGV AVG
Sbjct: 223 ITGYDDVPANNEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVG 282

Query: 308 YG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           YG +++G +YWLV+NS G +WGE GY+++QR  +D+    CGIA++ASYP
Sbjct: 283 YGVSDDGTEYWLVKNSRGPEWGEEGYIRMQRG-VDSEEALCGIAVQASYP 331


>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 140/329 (42%), Positives = 204/329 (62%), Gaps = 27/329 (8%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKF 100
           D  + T ++ W + HGK+        +R  +++++LR I+ HN   SL + ++++G+N F
Sbjct: 22  DPGLDTHWEQWKSWHGKSYEQKEETWRRM-VWEEHLRVIEIHNLEHSLGKHSFRLGMNHF 80

Query: 101 ADLTNEEYRAMYLGTR-SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
            D+ NEE+R +  G +     ++L  S      +      E+P+ VDWR++G V PVKDQ
Sbjct: 81  GDMPNEEFRQLMNGYKYKQTHKKLQGSHFLEPNFL-----EVPKHVDWRDEGYVTPVKDQ 135

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQ 218
           G CGSCWAFST  A+EG +   TG+L+SLSEQ LV+C + + N GCNGGLMD AFQ++  
Sbjct: 136 GQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKD 195

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVA 275
           NGG+DSE  YPY+G ++   P   N +  + +  G+ D+    E +L KA+A   PVSVA
Sbjct: 196 NGGIDSEDSYPYVGTDDT--PCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVA 253

Query: 276 IEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGE 329
           I+AG  +FQ Y+SG+ F  EC S  LDHGV+ VGYG E    +G  YW+V+NSW   WG+
Sbjct: 254 IDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQ 313

Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           NGY+ + ++        CGIA  ASYP++
Sbjct: 314 NGYILMAKD----KDNHCGIATAASYPLE 338


>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
          Length = 344

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 148/305 (48%), Positives = 193/305 (63%), Gaps = 29/305 (9%)

Query: 71  KRFQIFKDNLRFI----DEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
           + F++F+ NL  I    +E+N   ++Y++GLN FA LT EE+ A YLG    A+    K+
Sbjct: 50  RAFEVFQKNLDMIMKHNEEYNQGLQSYEMGLNGFAHLTFEEFSAQYLGY-GGAEVEQPKT 108

Query: 127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
           + A  ++  K+  E+P SVDWREKGAV  VK+QG+CGSCWAFS VAA+EG + + +GELI
Sbjct: 109 RRAG-KHERKSRSEIPASVDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGELI 167

Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM--DSEQDYPYLGAENKCDPSRRN 243
           SLSEQ+LVDC +K  N GC GG MD AF++ + N G   DSE+DYPY G + KC  S   
Sbjct: 168 SLSEQQLVDCSKKFGNHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKCKFSADG 227

Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVF---TGECGSAL 299
            +  +I GY DV   +E  L  AVA+  PVSVAI AG  A Q Y  GVF    G C   L
Sbjct: 228 VR-ATISGYNDVKQGNETDLLDAVANVGPVSVAIHAGA-ALQFYLRGVFNGVAGTCFGPL 285

Query: 300 DHGVVAVGYGTEN-----GVDYWLVRNSWGSDWGENGYVKLQR--NLLDTNTGKCGIAME 352
           +HGV AVGYGT +      +DYW+++NSWG  WGE G+V+  R  NL       CG+A  
Sbjct: 286 NHGVTAVGYGTASLRFGRKMDYWIIKNSWGMGWGEKGFVRFARGKNL-------CGVANG 338

Query: 353 ASYPV 357
           ASYP+
Sbjct: 339 ASYPL 343


>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 139/314 (44%), Positives = 186/314 (59%), Gaps = 17/314 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           +  W  +HGK          R  I++ NL  + +HN      + TY +G+N+FADL NEE
Sbjct: 28  WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLQNEE 87

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           + AM  G R +   +  K    S        D+LP++VDWR KG V PVKDQG CGSCWA
Sbjct: 88  FVAMMTGFRVNGTSKAAK---GSTFLPSNNVDKLPKTVDWRTKGYVTPVKDQGQCGSCWA 144

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
           FS   ++EG     TG+L+SLSEQ LVDC  + N GC+GG MD AFQ+II  GG+D+E  
Sbjct: 145 FSATGSLEGQQFKKTGKLVSLSEQNLVDCSYR-NYGCHGGFMDRAFQYIIDAGGIDTEAT 203

Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHY 286
           Y Y   +  C   + N    ++ GY DV+   E +L+KAVA   P+SVAI+A  + F+ Y
Sbjct: 204 YSYRAVDGNCHFKKANVG-ATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHKFFKFY 262

Query: 287 ESGVFT--GECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           +SGV+   G   + L H V+ VGYG T +G DYW+V+NSW   WG NGY+ + RN     
Sbjct: 263 KSGVYNEPGCSTTRLGHAVLVVGYGTTSDGTDYWIVKNSWAKTWGMNGYLWMSRN----K 318

Query: 344 TGKCGIAMEASYPV 357
             +CGIA EASYP+
Sbjct: 319 DNQCGIASEASYPM 332


>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
          Length = 324

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 137/318 (43%), Positives = 188/318 (59%), Gaps = 20/318 (6%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADL 103
             T +  +  ++G+          R  ++  N+ FI+ HN        TY + +N+F D+
Sbjct: 18  TFTSFHQFKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGDM 77

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           TNEE  A+  G    ++ R +   V   R      D LP  VDWR KGAV PVKDQ +CG
Sbjct: 78  TNEEINAVMNGLLPASESRGVA--VLGGR-----DDTLPAEVDWRTKGAVTPVKDQKACG 130

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
           SCWAFS   ++EG + +  G+L+SLSEQ LVDC  K  + GC GGLMD+AF +I  NGG+
Sbjct: 131 SCWAFSATGSLEGQHFLKDGKLVSLSEQNLVDCSTKQGDHGCGGGLMDFAFTYIKDNGGI 190

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGR 281
           D+E  YPY   + KC  +  N+   ++ GY DV    E +L+KAVA   P+SVAI+A   
Sbjct: 191 DTEASYPYEATDGKCQYNPANSG-ATVTGYVDVEHDSEDALQKAVATIGPISVAIDASRS 249

Query: 282 AFQHYESGV-FTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
            F  Y  GV +  EC S +LDHGV+AVGYGT++G DYWLV+NSW   WG +G++++ RN 
Sbjct: 250 TFHFYHKGVYYDKECSSTSLDHGVLAVGYGTQDGTDYWLVKNSWNITWGNHGFIEMSRN- 308

Query: 340 LDTNTGKCGIAMEASYPV 357
                  CGIA +ASYP+
Sbjct: 309 ---RNNNCGIATQASYPL 323


>gi|21483190|gb|AAL14223.1| cathepsin L [Dictyocaulus viviparus]
          Length = 347

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 140/296 (47%), Positives = 186/296 (62%), Gaps = 20/296 (6%)

Query: 73  FQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKV 128
            + F  N+  I+EHN  +R    T+++GLN  ADL   EYR +  G R    RRL    +
Sbjct: 60  MEAFVKNMIHIEEHNHEHRLGRKTFEMGLNNIADLPFSEYRKLN-GYRH---RRLFGDSM 115

Query: 129 ASQ--RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
                ++      + P+SVDWRE   V PVK+QG CGSCWAFS   A+EG +   TG+L+
Sbjct: 116 RKNGTKFLVPFNVKAPDSVDWREHNLVTPVKNQGMCGSCWAFSATGALEGQHFRATGKLV 175

Query: 187 SLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
           SLSEQ LVDC  K  N GCNGGLMD AF++I  N G+D+E+ YPY+G E +C   +R+  
Sbjct: 176 SLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEEGYPYVGKEMRCHFKKRDIG 235

Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGV-FTGECGS-ALDHG 302
                G+ D+   DE +LK AVA Q P+S+AI+AG R+FQ Y+ GV F  EC S  LDHG
Sbjct: 236 AED-RGFVDLPEGDEDALKVAVATQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHG 294

Query: 303 VVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           V+ VGYGT+    DYW+++NSWG+ WGE GYV++ RN        CG+A +ASYP+
Sbjct: 295 VLLVGYGTDPEAGDYWIIKNSWGTKWGEKGYVRIARN----RNNHCGVATKASYPL 346


>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
 gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
          Length = 337

 Score =  247 bits (631), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 141/326 (43%), Positives = 196/326 (60%), Gaps = 24/326 (7%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKF 100
           D ++   +  W   H K  +      +R  I++ NL+ I+ HN  +     TY++G+N F
Sbjct: 22  DQQLNDHWDQWKKWHSKKYHATEEGWRRV-IWEKNLKKIEMHNLEHSMGIHTYRLGMNHF 80

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
            D+T+EE+R +  G +    RR   S      +      E+P  +DWREKG V PVKDQG
Sbjct: 81  GDMTHEEFRQVMNGFKHKKDRRFRGSLFMEPNFI-----EVPNKLDWREKGYVTPVKDQG 135

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
            CGSCWAFST  A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ++   
Sbjct: 136 ECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQ 195

Query: 220 GGMDSEQDYPYLGAENK-CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIE 277
            G+DSE+ YPYLG +++ C    +N+   +  G+ D+    E +L KA+A   PVSVAI+
Sbjct: 196 NGLDSEESYPYLGTDDQPCHFDPKNS-AANDTGFVDIPSGKERALMKAIAAVGPVSVAID 254

Query: 278 AGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENG 331
           AG  +FQ Y+SG+ +  EC S  LDHGV+AVGYG E    +G  YW+V+NSW  +WG+ G
Sbjct: 255 AGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSENWGDKG 314

Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPV 357
           Y+ + ++        CGIA  ASYP+
Sbjct: 315 YIYMAKD----RHNHCGIATAASYPL 336


>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
          Length = 338

 Score =  247 bits (631), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 146/328 (44%), Positives = 203/328 (61%), Gaps = 33/328 (10%)

Query: 46  DEVMTIYQTW-LAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKF 100
           DE   ++++W   K+ +   G      R  +++ NL+ I+ HN   S+ + TY++G+N F
Sbjct: 27  DEHWNLWKSWHTKKYHEKEEGW-----RRMVWEKNLKKIELHNLDHSMGKHTYRLGMNHF 81

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
            D+TNEE+R +  G +  A+R++  S      +      E P S+DWR+KG V PVKDQG
Sbjct: 82  GDMTNEEFRQLMNGYKHKAERKVKGSLFLEPNFL-----EAPRSLDWRDKGYVTPVKDQG 136

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
            CGSCWAFS   A+EG     TG+++ LSEQ LV+C R + N GCNGGLMD AFQ++  N
Sbjct: 137 QCGSCWAFSATGALEGQQFRKTGKMVQLSEQNLVECSRPEGNEGCNGGLMDQAFQYVKDN 196

Query: 220 GGMDSEQDYPYLGAEN-KC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAV-ADQPVSVA 275
            G+DSE+ YPYLG ++ KC  DP R NA  V+  G+ D+    E +L KAV A  P+SVA
Sbjct: 197 QGLDSEESYPYLGTDDQKCHYDP-RYNA--VNDTGFVDIKSGSEHALMKAVTAVGPISVA 253

Query: 276 IEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGE 329
           I+AG  +FQ Y+SG+ +  EC S  LDHGV+ VGYG E    +G  YW+V+NSW   WG+
Sbjct: 254 IDAGHESFQFYQSGIYYEPECSSEELDHGVLLVGYGFEGEDVDGKKYWIVKNSWSEKWGD 313

Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            GYV + ++        CGIA  ASYP+
Sbjct: 314 KGYVYMAKD----RQNHCGIATAASYPL 337


>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  247 bits (631), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 144/330 (43%), Positives = 199/330 (60%), Gaps = 37/330 (11%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEK----RFQIFKDNLRFID----EHNSLNRTYKVGL 97
           DE   ++++W  K         ++EK    R  +++ NL+ I+    EH+    TY++G+
Sbjct: 25  DEHWDLWKSWHTKK--------YHEKEEGWRRMVWEKNLKKIELHNLEHSMGEHTYRLGM 76

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           N F D+T+EE+R +  G +  ++R+   S      +      E P SVDWR+ G V PVK
Sbjct: 77  NHFGDMTHEEFRQIMYGYKRKSERKFKGSLFMEPNFL-----EAPRSVDWRDNGYVTPVK 131

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFI 216
           DQG CGSCWAFST  A+EG +   TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I
Sbjct: 132 DQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI 191

Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVS 273
             N G+DSE  YPYLG +++  P   + K  S +  G+ D+    E +L KAVA   PVS
Sbjct: 192 KDNQGLDSEDSYPYLGTDDQ--PCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVS 249

Query: 274 VAIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDW 327
           VAI+AG  +FQ Y+SG+ +  EC S  LDHGV+ VGYG E    +G  YW+V+NSW   W
Sbjct: 250 VAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKW 309

Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           G+ GY+ + ++        CGIA  ASYP+
Sbjct: 310 GDKGYIYMAKD----RKNHCGIATAASYPL 335


>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
          Length = 338

 Score =  247 bits (631), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 140/329 (42%), Positives = 195/329 (59%), Gaps = 26/329 (7%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLN 98
           R D ++   +  W   H K  +      +R  +++ NL+ I+    EH     ++++G+N
Sbjct: 21  RFDSQLEDHWHLWKNWHSKNYHASEEGWRRM-VWEKNLKKIEIHNLEHTMGKHSHRLGMN 79

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
            F D+TNEE+R    G +   +R+   S      Y      + P++VDWREKG V PVKD
Sbjct: 80  HFGDMTNEEFRQTMNGYKQTTERKFKGSLFMEPNYL-----QAPKAVDWREKGYVTPVKD 134

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFII 217
           QGSCGSCWAFST  A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I 
Sbjct: 135 QGSCGSCWAFSTTGAMEGQPFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQ 194

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSV 274
            N G+D+E+ YPY+G +   DP     +  + +  G+ D+    E ++ KAVA   PVSV
Sbjct: 195 DNAGLDTEESYPYVGTDE--DPCHYKPEFSAANETGFVDIPSGKEHAMMKAVAAVGPVSV 252

Query: 275 AIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
           AI+AG  +FQ YESG+ +  EC S  LDHGV+ VGYG E    +G  YW+V+NSW   WG
Sbjct: 253 AIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWG 312

Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + GY+ + ++        CGIA  +SYP+
Sbjct: 313 DKGYIYMAKD----RKNHCGIATASSYPL 337


>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  247 bits (631), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 140/328 (42%), Positives = 202/328 (61%), Gaps = 25/328 (7%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKF 100
           D  + T ++ W + HGK+        +R  +++ +LR I+ HN   SL + ++++G+N F
Sbjct: 22  DPGLDTHWEQWKSWHGKSYEQKEETWRRM-VWEKHLRVIEIHNLEHSLGKHSFRLGMNHF 80

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
            D+ NEE+R +  G     K +    K+    +      E+P+ VDWR++G V PVKDQG
Sbjct: 81  GDMPNEEFRQLMNGY----KYKQTHKKLQGSHFLEPNFQEVPKHVDWRDEGYVTPVKDQG 136

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
            CGSCWAFST  A+EG +   TG+L+SLSEQ LV+C + + N GCNGGLMD AFQ++  N
Sbjct: 137 QCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDN 196

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
           GG+DSE  YPY+G ++   P   N +  + +  G+ D+    E +L KA+A   PVSVAI
Sbjct: 197 GGIDSEDSYPYVGTDDT--PCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAI 254

Query: 277 EAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
           +AG  +FQ Y+SG+ F  EC S  LDHGV+ VGYG E    +G  YW+V+NSW   WG+N
Sbjct: 255 DAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQN 314

Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           GY+ + ++        CGIA  ASYP++
Sbjct: 315 GYILMAKD----KDNHCGIATAASYPLE 338


>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
 gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
          Length = 336

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 141/327 (43%), Positives = 198/327 (60%), Gaps = 26/327 (7%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLNKF 100
           D ++   +Q W   H K  +      +R  +++ NLR I+    EH+    +Y++G+N F
Sbjct: 21  DPQLDQHWQLWKGWHSKNYHEKEEGWRRL-VWEKNLRKIELHNLEHSMGKHSYRLGMNHF 79

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
            D+T+EE+R +  G +   +R+   S      +      E P +VDWR+KG V PVKDQG
Sbjct: 80  GDMTHEEFRQIMNGYKRREQRKYSGSLFMEPNFL-----EAPRAVDWRDKGYVTPVKDQG 134

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
            CGSCWAFST  A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ++  N
Sbjct: 135 QCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDN 194

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
            G+DSE  YPY G +++  P + NA+  +++  G+ D+    E +L KAVA   PVSVAI
Sbjct: 195 QGLDSEDFYPYKGTDDQ--PCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAI 252

Query: 277 EAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
           +AG  +FQ Y+SG+ F  EC S  LDHGV+ VGYG E    +G  YW+V+NSW   WG+ 
Sbjct: 253 DAGHESFQFYQSGIYFEKECSSDELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDK 312

Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
           G++ + ++        CGIA  ASYP+
Sbjct: 313 GFIYMAKD----RHNHCGIATAASYPL 335


>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 304

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 137/310 (44%), Positives = 189/310 (60%), Gaps = 28/310 (9%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
           ++ W+++  +  +       RF+IFK NL+F++  N + N TYK+ +NKF+DLT+EE++A
Sbjct: 18  HEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFSDLTDEEFQA 77

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
            Y+G   +      + K  S RY  +   E  ES+DWR +GAV PVKDQG CG CWAF+ 
Sbjct: 78  RYMGLVPEGMTGDSQ-KTVSFRY--ENVSETGESMDWRLEGAVTPVKDQGQCGCCWAFAA 134

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           VAAVEG+ KI  GEL+SLSEQ+LVDC     N GC+GGL   A+ +I +N G+ SE++YP
Sbjct: 135 VAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQGITSEENYP 194

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
           Y   +  C  +   A  +S  GYE V   DE +L KAV+               QH   G
Sbjct: 195 YQAVQQTCKSTDPAAATIS--GYEAVPKDDEEALLKAVS---------------QH---G 234

Query: 290 VFTGE-CGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           +F  E CG+   H V  VGYGT E G+ YWL++NSWG  WGENGY++++R+ +D   G C
Sbjct: 235 IFEDEYCGTDSHHAVTIVGYGTSEEGIKYWLLKNSWGESWGENGYMRIKRD-VDEPQGMC 293

Query: 348 GIAMEASYPV 357
           G+A  A YPV
Sbjct: 294 GLAHRAYYPV 303


>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 139/295 (47%), Positives = 190/295 (64%), Gaps = 19/295 (6%)

Query: 72  RFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           R  IF+ N++ I+ HN L      +Y++GLN FAD+T +E+   Y GTR +A     +++
Sbjct: 45  RRAIFEANIKKINAHNLLYDLGRSSYRLGLNGFADMTPDEFEK-YRGTRFEAN----EAR 99

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
           V+  ++       +P++VDWR +G V PVK+QG CGSCWAFST  A+EG +   +G+L+S
Sbjct: 100 VSKLQHRDNRSMHVPDTVDWRTEGYVTPVKNQGVCGSCWAFSTTGALEGQHFRRSGDLVS 159

Query: 188 LSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           LSEQ LVDC     NAGCNGGLMD AF+FI   GG+++E+ YPY G +  C    R    
Sbjct: 160 LSEQMLVDCSAVYGNAGCNGGLMDNAFRFIKDAGGLETEKSYPYTGKDGTCHFDARGIG- 218

Query: 247 VSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVFTG-ECGS-ALDHGV 303
             + G+ DV   DE +LK+A     PVSVAI+A G+ FQ Y+ GV+    C S +LDHGV
Sbjct: 219 AKLTGFVDVPSRDEEALKEAAGVVGPVSVAIDASGQNFQFYKDGVYDEITCSSTSLDHGV 278

Query: 304 VAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + VGYG T +G DYWLV+NSWGS WG++GY+++ RN       +CGIA  ASYP 
Sbjct: 279 LVVGYGTTRDGKDYWLVKNSWGSSWGQSGYIQMSRN----KENQCGIATMASYPT 329


>gi|348531519|ref|XP_003453256.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 334

 Score =  247 bits (630), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 139/317 (43%), Positives = 188/317 (59%), Gaps = 20/317 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
           +  W  K  K+ +       R Q++ +N +F+  HN L     ++Y++G+  FAD+ NEE
Sbjct: 26  FHAWKLKFEKSYDSESDEAHRKQVWLNNRKFVLMHNILADQGLKSYRLGMTHFADMDNEE 85

Query: 108 YRAMY-LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
           Y+ +   G        L +    S       G  LP++VDWR+KG V  VKDQ  CGSCW
Sbjct: 86  YKQLVSQGCLHTFNASLPER--GSAFLGLPEGTALPDTVDWRDKGYVTEVKDQKQCGSCW 143

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
           AFST   +EG +   TG+L+SLSEQ+L+DC     N GCNGG +  A Q+I  NGG+D+E
Sbjct: 144 AFSTTGVLEGQHFRKTGKLVSLSEQQLMDCSHSFGNNGCNGGSVKRALQYIQANGGIDTE 203

Query: 226 QDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRA 282
             YPY     +C   P    AK     GY  V P +E +LKKAVA   P+SV I+A   +
Sbjct: 204 TSYPYKAKGQRCRYKPDGIGAKCT---GYVHVKPSNEETLKKAVATLGPISVGIDASRHS 260

Query: 283 FQHYESGVF-TGECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
           FQ Y+SGV+   +C  + LDHG +AVGYGTENG DYWL++NSWG  WG+ GY+K+ RN  
Sbjct: 261 FQFYQSGVYDDPDCSKTVLDHGALAVGYGTENGHDYWLIKNSWGLRWGDKGYIKMSRN-- 318

Query: 341 DTNTGKCGIAMEASYPV 357
              + +CGIA EASYP+
Sbjct: 319 --KSNQCGIASEASYPL 333


>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
          Length = 339

 Score =  247 bits (630), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 144/317 (45%), Positives = 186/317 (58%), Gaps = 17/317 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS---LNR-TYKVGLNKFADLTNEE 107
           + T+   H K  +       R +IF +N   I  HN    LN  +YK+G+NK+ D+ + E
Sbjct: 28  WNTFKVTHRKAYDSKIEESFRMKIFMENWHKIALHNQKYELNEVSYKLGMNKYGDMLHHE 87

Query: 108 YRAMYLGTRSD--AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
           +     G      A+ R  +  + S R+   A  E+P SVDWR  GAV P+KDQG CGSC
Sbjct: 88  FINTLNGFNKSVSAQLRAQRRPIGS-RFIEPANVEIPSSVDWRTHGAVTPIKDQGHCGSC 146

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDS 224
           W+FS   A+EG +  +TG+L+SLSEQ L+DC  R  N GCNGGLMD AFQ+I  N G+D+
Sbjct: 147 WSFSATGALEGQHYRITGKLVSLSEQNLIDCSGRYGNNGCNGGLMDQAFQYIKDNHGLDT 206

Query: 225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAF 283
           E  YPY    +KC  + RN       GY D+   +E  LK AVA   PVSVAI+A   +F
Sbjct: 207 EISYPYEAENDKCRYNPRNNGATD-SGYVDIPEGNEKKLKAAVATIGPVSVAIDASAESF 265

Query: 284 QHYESGV-FTGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
           Q Y  GV +   C S  LDHGV+ VGYGT +N  DYWLV+NSWG  WG+ GY+K+ RN  
Sbjct: 266 QFYREGVYYEPRCSSENLDHGVLVVGYGTDDNDQDYWLVKNSWGVTWGDEGYIKMARN-- 323

Query: 341 DTNTGKCGIAMEASYPV 357
                 CGIA  ASYP+
Sbjct: 324 --KDNHCGIASSASYPL 338


>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 340

 Score =  247 bits (630), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 133/315 (42%), Positives = 192/315 (60%), Gaps = 17/315 (5%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADL 103
           D  +M  ++ W A H ++        +RF++++ N+ +ID  N     TY++G N+FADL
Sbjct: 38  DMLMMDRFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRGGLTYELGENQFADL 97

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS-C 162
           T EE+ A Y G  + +          S         + P SVDWR KGAV PVK+QGS C
Sbjct: 98  TGEEFLARYAGGHTGSAITTAAEADGSLE------ADPPASVDWRAKGAVTPVKNQGSQC 151

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGM 222
            SCWAFS VA +E +  I TG+L++LSEQ+LVDCD K + GCN G    AFQ+I++NGG+
Sbjct: 152 YSCWAFSAVATMESLYFIKTGKLVALSEQQLVDCD-KYDGGCNKGYYHRAFQWIMENGGI 210

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
            +   YPY      C  ++     V+I G+  V+  +E++L+ AVA QP+ VAIE    +
Sbjct: 211 TTAAQYPYKAVRGACSAAK---PAVTITGHLAVAK-NELALQSAVARQPIGVAIEV-PIS 265

Query: 283 FQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
            Q Y+SGVF+  CG  + H VV VGYG + +G+ YWLV+NSWG  WGE GY++++R++  
Sbjct: 266 MQFYKSGVFSAACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGYIRMRRDV-- 323

Query: 342 TNTGKCGIAMEASYP 356
              G CGIA++ +YP
Sbjct: 324 GGGGLCGIALDTAYP 338


>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
          Length = 401

 Score =  247 bits (630), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 142/320 (44%), Positives = 190/320 (59%), Gaps = 20/320 (6%)

Query: 47  EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN---RTYKVGLNKFADL 103
           E    +  W+  H K+ +   H   RF+I+K N R+I   N  +    ++ V +N+F DL
Sbjct: 90  EEQRAFTEWMRTHRKSYH-HDHFLPRFEIWKTNNRWITHWNKKHANASSFTVAINQFGDL 148

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
           T++E+  +Y G    +  +  +     +++A  AG  +PES DWR+KG V+ VKDQG CG
Sbjct: 149 TSDEFNRLYNGLHVFSAPKASEKVERPRQWANTAG--IPESGDWRQKGVVSRVKDQGMCG 206

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI--NAGCNGGLMDYAFQFIIQNGG 221
           SCWAFST  + EGIN I T  L+ LSEQ LVDC      N GCNGG MD AF++II N G
Sbjct: 207 SCWAFSTTGSTEGINAITTSRLVPLSEQNLVDCATAAYDNYGCNGGFMDNAFRYIIDNKG 266

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVV---SIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           +DSE  YPY+ A+ +C   R N K V        + +   DE +L  A A QP+SV I+A
Sbjct: 267 IDSEASYPYVAADGQC---RFNPKTVYGGKGGTLKSLPKGDEKALLVAAARQPISVGIDA 323

Query: 279 GGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
           G  +FQ Y  GV+   EC S  L+HGV+ VG+G E G  YWLV+NSWG  WG +GY+K+ 
Sbjct: 324 GRPSFQFYSKGVYNEPECSSTELNHGVLIVGWGVERGQAYWLVKNSWGQTWGMDGYIKMS 383

Query: 337 RNLLDTNTGKCGIAMEASYP 356
           R   D N  +CGIA  ASYP
Sbjct: 384 R---DKNN-QCGIATLASYP 399


>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
          Length = 330

 Score =  246 bits (629), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 137/315 (43%), Positives = 191/315 (60%), Gaps = 21/315 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEE 107
           ++T+   HGK          R +IF +N + I+ HN+       +YK+ +N F DL + E
Sbjct: 27  WETFKVVHGKNYKNQFEEMFRRKIFMNNKKRIEAHNAKYEQGEVSYKMKMNHFGDLMSHE 86

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
            +A+  G +      +  +     +    + D+LP+SVDWR+KGAV PVKDQG CGSCW+
Sbjct: 87  IKALMNGFK------MTPNTKREGKIYFPSNDKLPKSVDWRQKGAVTPVKDQGQCGSCWS 140

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS   ++EG   +  G+L+SLSEQ L+DC ++  N GC GGLMD AFQ++  N G+D+E 
Sbjct: 141 FSATGSLEGQIFLKKGKLVSLSEQNLMDCSKEYGNNGCEGGLMDKAFQYVSDNKGIDTES 200

Query: 227 DYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
            YPY   +  C    +  KV   D GY D+   DE +L+ A+A   P+SVAI+A   +F 
Sbjct: 201 SYPYEARDYAC--RFKKDKVGGTDKGYVDIPEGDEKALQNALATVGPISVAIDASHESFH 258

Query: 285 HYESGVFTGECGSA--LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
            Y  GV+     S+  LDHGV+AVGYGTENG DYWLV+NSWG  WGE+GY+K+ RN    
Sbjct: 259 FYSEGVYNEPYCSSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGESGYIKIARN---- 314

Query: 343 NTGKCGIAMEASYPV 357
           ++  CGIA  ASYP+
Sbjct: 315 HSNHCGIASMASYPI 329


>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
 gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
 gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  246 bits (629), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 140/329 (42%), Positives = 203/329 (61%), Gaps = 27/329 (8%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKF 100
           D  + T ++ W + HGK+        +R  +++ +LR I+ HN   SL + ++++G+N F
Sbjct: 22  DPGLDTHWEQWKSWHGKSYEQKEETWRRM-VWEKHLRVIEIHNLEHSLGKHSFRLGMNHF 80

Query: 101 ADLTNEEYRAMYLGTR-SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
            D+ NEE+R +  G +     ++L  S      +      E+P+ VDWR++G V PVKDQ
Sbjct: 81  GDMPNEEFRQLMNGYKYKQTHKKLQGSHFLEPNFL-----EVPKHVDWRDEGYVTPVKDQ 135

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQ 218
           G CGSCWAFST  A+EG +   TG+L+SLSEQ LV+C + + N GCNGGLMD AFQ++  
Sbjct: 136 GQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKD 195

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVA 275
           NGG+DSE  YPY+G ++   P   N +  + +  G+ D+    E +L KA+A   PVSVA
Sbjct: 196 NGGIDSEDSYPYVGTDDT--PCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVA 253

Query: 276 IEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGE 329
           I+AG  +FQ Y+SG+ F  EC S  LDHGV+ VGYG E    +G  YW+V+NSW   WG+
Sbjct: 254 IDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQ 313

Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           NGY+ + ++        CGIA  ASYP++
Sbjct: 314 NGYILMAKD----KDNHCGIATAASYPLE 338


>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
          Length = 333

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 141/318 (44%), Positives = 190/318 (59%), Gaps = 16/318 (5%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADL 103
           + T ++ + + H KT         RF+IF +N  FI +HN        +YK+G+N+FADL
Sbjct: 23  LRTEWEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADL 82

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
              E+  M  G +    +RL          A      LP++VDWR+KGAV PVKDQG CG
Sbjct: 83  LPHEFVKMMNGYQG---KRLAGRGSTYLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCG 139

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
           SCWAFS+  ++EG + + TG+L+SLSEQ LVDC     N GCNGGLMD +F +I  NGG+
Sbjct: 140 SCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYIKANGGI 199

Query: 223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGR 281
           D+E  YPY   +  C   + +       G+ D+    E  L+KAVA   PVSVAI+A  +
Sbjct: 200 DTEDSYPYEAEDGDCRYKKEDVGATDT-GFVDIKEGSEKDLQKAVATVGPVSVAIDASQQ 258

Query: 282 AFQHYESGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
           +FQ Y  GV+    C S +LDHGV+AVGYG +NG  YWLV+NSW   WG++GY+ + R  
Sbjct: 259 SFQLYSEGVYDEPNCSSESLDHGVLAVGYGVKNGKKYWLVKNSWAETWGQDGYILMSR-- 316

Query: 340 LDTNTGKCGIAMEASYPV 357
            D N  +CGIA  ASYP+
Sbjct: 317 -DKNN-QCGIASSASYPL 332


>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
 gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
          Length = 338

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 140/329 (42%), Positives = 194/329 (58%), Gaps = 26/329 (7%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLN 98
           R D ++   +  W   H K+ +      +R  +++ NL+ I+    EH     +Y++G+N
Sbjct: 21  RFDSQLEDHWHLWKNWHSKSYHESEEGWRRM-VWEKNLKKIEMHNLEHTMGKHSYRLGMN 79

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
            F D+TNEE+R    G +   +R+   S      Y      + P++VDWREKG V PVKD
Sbjct: 80  HFGDMTNEEFRQTMNGYKQTTERKFKGSLFMEPNYL-----QAPKAVDWREKGYVTPVKD 134

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFII 217
           QGSCGSCWAFST  A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I 
Sbjct: 135 QGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQ 194

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSV 274
            N G+D+E+ YPY+G +   DP     +    +  G+ D+    E ++ KAVA   PVSV
Sbjct: 195 DNAGLDTEESYPYVGTDE--DPCHYKPEFSGANETGFVDIPSGKEHAMMKAVAAVGPVSV 252

Query: 275 AIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
           AI+AG  +FQ YE G+ +  EC S  LDHGV+ VGYG E    +G  YW+V+NSW   WG
Sbjct: 253 AIDAGHESFQFYEFGIYYEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWG 312

Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + GY+ + ++        CGIA  +SYP+
Sbjct: 313 DKGYIYMAKD----RKNHCGIATASSYPL 337


>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
          Length = 382

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 149/372 (40%), Positives = 215/372 (57%), Gaps = 34/372 (9%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MATAS  LA+     L    ++ + D   I                 ++  ++ W A++ 
Sbjct: 29  MATASASLALMFACSLLLAGTAFSDDTIAIP----------------LLERFKAWQAEYN 72

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSD 118
           +T       ++RF I+ +N+RFI   N L+   +Y++G N+F DLT EE++  YL  + D
Sbjct: 73  RTYATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYL-MKLD 131

Query: 119 AKRRLMKSKVASQRYACKAG-------DELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
            +    ++   +      AG        E P SVDWR KGAV  VKDQ  CGSCWAF+TV
Sbjct: 132 EQPPAAEAMPPTVGTMSTAGMSNGNNTGEAPNSVDWRTKGAVTRVKDQQQCGSCWAFATV 191

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKIN-AGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           A++EG+++I TG L+SLSEQE+VDCDR  N  GC GG    A +++ +NGG+ +E DYPY
Sbjct: 192 ASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVTRNGGLTTESDYPY 251

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
           +G++ +C   +       I GY+ V   +E  L++AVA QPV+V ++A  RAFQ Y+SGV
Sbjct: 252 VGSQRQCMSGKLGHHAARIRGYQAVQRNNEAELERAVAGQPVAVFVDA-SRAFQFYKSGV 310

Query: 291 FTGEC-GSALDHGVVAVGYGT----ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           F+G C  + ++H V  VGYG+      G  YW+V+NSWG  WGENGYV++ R +     G
Sbjct: 311 FSGPCDTTTVNHVVTVVGYGSTGSDSGGRKYWIVKNSWGQGWGENGYVRMARRVRARE-G 369

Query: 346 KCGIAMEASYPV 357
            C IA+E  YPV
Sbjct: 370 MCAIAIEPYYPV 381


>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 332

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 140/321 (43%), Positives = 189/321 (58%), Gaps = 23/321 (7%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADL 103
           + T ++ +   H K+         RF+IF +N   I +HN+       +YK+G+N+F DL
Sbjct: 23  LRTQWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDL 82

Query: 104 TNEEYRAMYLGTRSDAKRR---LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              E+  ++ G R     R    M     +          LP +VDWR+KGAV PVKDQG
Sbjct: 83  LAHEFAKIFNGYRGQRTSRGSTFMPPANVNDS-------SLPSTVDWRKKGAVTPVKDQG 135

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
            CGSCWAFS   ++EG + +  GEL+SLSEQ LVDC +   N GC GGLMD AF++I  N
Sbjct: 136 QCGSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFKYIKAN 195

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
            G+D+E+ YPY   ++KC   + +       G+ D+    E  LKKAVA   P+SVAI+A
Sbjct: 196 DGIDAEESYPYEAMDDKCRFKKEDVGATDT-GFVDIEGGSEDDLKKAVATVGPISVAIDA 254

Query: 279 GGRAFQHYESGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
           G  +FQ Y  GV+   EC S  LDHGV+AVGYG ++G  YWLV+NSWG  WG+NGY+ + 
Sbjct: 255 GHSSFQLYSEGVYDEPECSSEELDHGVLAVGYGVKDGKKYWLVKNSWGGSWGDNGYILMS 314

Query: 337 RNLLDTNTGKCGIAMEASYPV 357
           R   D N  +CGIA  ASYP+
Sbjct: 315 R---DKNN-QCGIASAASYPL 331


>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
 gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
 gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
          Length = 344

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 144/331 (43%), Positives = 193/331 (58%), Gaps = 42/331 (12%)

Query: 52  YQTWLAKHGK--TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYR 109
           +  W+  H K  TS   G    R+ IFK N+ ++ + NS      +GLN FAD+TNEEYR
Sbjct: 30  FTDWMITHQKSYTSEEFG---ARYNIFKANMDYVQQWNSKGSETVLGLNNFADITNEEYR 86

Query: 110 AMYLGTRSDAKRRL--MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
             YLGT+ DA   +   + KV +   A         S DWR +GAV PVK+QG CG CW+
Sbjct: 87  NTYLGTKFDASSLIGTQEEKVFTTSSAA--------SKDWRSEGAVTPVKNQGQCGGCWS 138

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
           FST  + EG +    GEL+SLSEQ L+DC  + N+GC+GGLM YAF++II N G+D+E  
Sbjct: 139 FSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYAFEYIINNNGIDTESS 197

Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
           YPY     KC+    N+   ++  Y+ V+   E SL+ AV   PVSVAI+A  ++FQ Y 
Sbjct: 198 YPYKAENGKCEYKSENSG-ATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYT 256

Query: 288 SGV-FTGECGSA-LDHGVVAVGYGTENGV-------------------DYWLVRNSWGSD 326
           SG+ +  EC S  LDHGV+AVGYG+ +G                    +YW+V+NSWG+ 
Sbjct: 257 SGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTS 316

Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           WG  GY+ + RN  D N   CGIA  AS+PV
Sbjct: 317 WGIEGYILMSRN-RDNN---CGIASSASFPV 343


>gi|301789679|ref|XP_002930256.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
 gi|281343339|gb|EFB18923.1| hypothetical protein PANDA_020645 [Ailuropoda melanoleuca]
          Length = 334

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 145/331 (43%), Positives = 197/331 (59%), Gaps = 25/331 (7%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
           S++ + D  +   +  W A H +   GM     R  +++ N++ ID HN         + 
Sbjct: 16  SAAPKLDQSLTEQWYQWKATHRRLY-GMNEEGWRRAVWEKNMKMIDLHNREYSQGQHGFT 74

Query: 95  VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
           + +N F D+TNEE+R +  G R+   R   K KV  +        E+P+SVDW  KG V 
Sbjct: 75  MAMNAFGDMTNEEFRQVMNGFRNQKPR---KGKVFQEPLFA----EIPKSVDWTLKGYVT 127

Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
           PVK+QG CGSCWAFS   A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRSQGNEGCNGGLMDNAF 187

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
           Q++ +NGG+DSE+ YPYLG +      +      +  G+ D+ P  E +L KAVA   P+
Sbjct: 188 QYVKENGGLDSEESYPYLGTDTDSCKYKPECSAANDTGFVDI-PQREKALMKAVATVGPI 246

Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
           SVAI+AG ++FQ Y+SG+ +  +C S  LDHGV+ VGYG E    N   +W+V+NSWG +
Sbjct: 247 SVAIDAGHQSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPE 306

Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           WG NGYVK+ +   D N   CGIA  ASYP 
Sbjct: 307 WGTNGYVKMAK---DQNN-HCGIATAASYPT 333


>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
 gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
          Length = 335

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 139/298 (46%), Positives = 189/298 (63%), Gaps = 23/298 (7%)

Query: 72  RFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           R  +++ NLR I+ HN   SL + +Y++G+N+F D+TNEE+R +  G ++   ++++K  
Sbjct: 48  RRVLWEKNLRTIEFHNLDHSLGKHSYRLGMNQFGDMTNEEFRQLMNGYKN---QKMIKGS 104

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
                +      E P++VDWREKG V PVKDQG CGSCWAFST  A+EG +    G+LIS
Sbjct: 105 T----FLAPNNFEAPKTVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAGKLIS 160

Query: 188 LSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           LSEQ LVDC R + N GCNGGLMD AFQ++  NGG+DSE  YPY   +++      N   
Sbjct: 161 LSEQNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNS 220

Query: 247 VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGSA-LDHGV 303
            +  G+ DV    E  L KAVA   PVSVA++AG ++FQ Y+SG+ +  EC S  LDHGV
Sbjct: 221 ANDTGFVDVPSGSEKDLMKAVASVGPVSVAVDAGHKSFQFYQSGIYYDPECSSEDLDHGV 280

Query: 304 VAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + VGYG E    +G  YW+V+NSW   WG NGY+K+ ++        CGIA  ASYP+
Sbjct: 281 LVVGYGFEGEDVDGKRYWIVKNSWSEKWGNNGYIKIAKD----RHNHCGIATAASYPL 334


>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
          Length = 360

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 140/310 (45%), Positives = 186/310 (60%), Gaps = 28/310 (9%)

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
           ++GK    +   ++RF++F DNL+ I  HN    +YK+G+N+F DLT +E+R   LG   
Sbjct: 67  RYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQ 126

Query: 118 D----AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
           +     K  L  + V            LPE+ DWRE G V+PVK+QG CGSCW FST  A
Sbjct: 127 NCSATTKGNLKVTNVV-----------LPETKDWREAGIVSPVKNQGKCGSCWTFSTTGA 175

Query: 174 VEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
           +E       G+ ISLSEQ+LVDC     N GCNGGL   AF++I  NGG+D+E+ YPY G
Sbjct: 176 LEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 235

Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVF 291
               C  S  N  V  ID   +++   E  LK AVA  +PVS+A E   + F+ Y+SGV+
Sbjct: 236 KNGLCKFSSENVGVKVIDSV-NITLGAEDELKYAVALVRPVSIAFEV-IKGFKQYKSGVY 293

Query: 292 TG-ECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           T  ECG+    ++H V+AVGYG ENGV YWL++NSWG+DWG+NGY K++          C
Sbjct: 294 TSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKME-----MGKNMC 348

Query: 348 GIAMEASYPV 357
           GIA  ASYPV
Sbjct: 349 GIATCASYPV 358


>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
          Length = 337

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 140/296 (47%), Positives = 180/296 (60%), Gaps = 18/296 (6%)

Query: 72  RFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           R +I+ DN R I EHN    LN  TYK+G+NK+ D+ + E+     G        +    
Sbjct: 49  RMKIYMDNKRKIAEHNRKYELNEVTYKLGMNKYGDMLHHEFVNTLNGFNKSVTAGIETEG 108

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
           V    +   A  +LP+ VDW ++GAV  VKDQG CGSCWAFS+  A+EG +   TG L+S
Sbjct: 109 VT---FISPANVKLPDEVDWTKQGAVTAVKDQGHCGSCWAFSSTGALEGQHFRSTGYLVS 165

Query: 188 LSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           LSEQ L+DC  K  N GCNGGLMDYAFQ+I  N G+D+E+ YPY    ++C  + RN+  
Sbjct: 166 LSEQNLIDCSGKYGNNGCNGGLMDYAFQYIKDNKGLDTEKTYPYEAENDRCRYNPRNSGA 225

Query: 247 VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGECGSA--LDHGV 303
               GY D+   DE  LK AVA   P+SVAI+A   +FQ Y  GV+     SA  LDHGV
Sbjct: 226 TD-KGYVDIPQGDEEKLKAAVATIGPISVAIDASHESFQLYSEGVYYDPDCSAENLDHGV 284

Query: 304 VAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + VGYGT+  +G DYWLV+NSWG  WG+ GY+K+ RN        CGIA  ASYP+
Sbjct: 285 LIVGYGTDETSGHDYWLVKNSWGKTWGQKGYIKMARN----KNNHCGIASSASYPL 336


>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
          Length = 335

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 150/365 (41%), Positives = 211/365 (57%), Gaps = 45/365 (12%)

Query: 6   MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
           MF  + TL    +IS+  AA    I  D   DH +SW++             +HGK+ + 
Sbjct: 2   MFALLVTL----YISAVFAAPSIDIQLD---DHWNSWKS-------------QHGKSYHE 41

Query: 66  MGHNEKRFQIFKDNLRFIDEHN---SL-NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR 121
                +R  I+++NLR I++HN   SL N T+K+G+N+F D+TNEE+R    G + D  R
Sbjct: 42  DVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDPNR 100

Query: 122 RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
                     ++        P+ VDWR++G V PVKDQ  CGSCW+FS+  A+EG     
Sbjct: 101 TSQGPLFMEPKFFAA-----PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRK 155

Query: 182 TGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
           TG+LIS+SEQ LVDC R   N GCNGGLMD AFQ++ +N G+DSEQ YPYL  ++   P 
Sbjct: 156 TGKLISMSEQNLVDCSRPHGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDL--PC 213

Query: 241 RRNAK--VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTGE-CG 296
           R + +  V  I G+ D+   +E++L  AVA   PVSVAI+A  ++ Q Y+SG++    C 
Sbjct: 214 RYDPRFNVAKITGFVDIPKGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACT 273

Query: 297 SALDHGVVAVGYGTEN----GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
           S LDH V+ VGYG +     G  YW+V+NSW   WG+ GY+ + ++        CGIA  
Sbjct: 274 SQLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKD----KNNHCGIATM 329

Query: 353 ASYPV 357
           ASYP+
Sbjct: 330 ASYPL 334


>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
          Length = 329

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 145/314 (46%), Positives = 187/314 (59%), Gaps = 25/314 (7%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKFADLTNEEYRA 110
           W   H KT         R +I++ NLR I  HN   SL   TY +G+N   D+T EE   
Sbjct: 29  WKKNHSKTYTSELEELGRREIWERNLRLITVHNLEASLGMHTYDLGMNHMGDMTREEILQ 88

Query: 111 MYLGTR--SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAF 168
           M+ GTR   +  RR       S  +   AG  +P+SVDWREKG V  VK+QGSCGSCWAF
Sbjct: 89  MFAGTRVRPNLTRR-------SSPFVASAGISVPDSVDWREKGYVTEVKNQGSCGSCWAF 141

Query: 169 STVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQD 227
           S   A+EG  K  TG++ SLS Q LVDC  K  N GCNGG M  AFQ++I +GG+DS++ 
Sbjct: 142 SAAGALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTQAFQYVIDDGGIDSDEA 201

Query: 228 YPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
           YPY   + +C  D S+R A   S   Y  VS  DE +LK+AVA   P+SVAI+A    F 
Sbjct: 202 YPYTAMDGQCRYDQSQRAANCSS---YNYVSEGDEEALKQAVATIGPISVAIDATRPMFI 258

Query: 285 HYESGVFTGE-CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
            Y SGV++   C   ++HGV+ VGYG+ NG DYWLV+NSWG+ +G+ GY+++ RN     
Sbjct: 259 LYHSGVYSDPTCTQNVNHGVLVVGYGSLNGEDYWLVKNSWGTRFGDGGYIRIARN----K 314

Query: 344 TGKCGIAMEASYPV 357
              CGIA  A YP+
Sbjct: 315 GNMCGIANYACYPL 328


>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 144/330 (43%), Positives = 199/330 (60%), Gaps = 37/330 (11%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEK----RFQIFKDNLRFID----EHNSLNRTYKVGL 97
           DE   ++++W  K         ++EK    R  +++ NL+ I+    EH+    TY++G+
Sbjct: 25  DEHWDLWKSWHTKK--------YHEKEEGWRRMVWEKNLKKIELHNLEHSMGEHTYRLGM 76

Query: 98  NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           N F D+T+EE+R +  G +  ++R+   S      +      E P SVDWR+ G V PVK
Sbjct: 77  NHFGDMTHEEFRQIMNGYKRKSERKFKGSLFMEPNFL-----EAPRSVDWRDNGYVTPVK 131

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFI 216
           DQG CGSCWAFST  A+EG +   TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ+I
Sbjct: 132 DQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI 191

Query: 217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVS 273
             N G+DSE  YPYLG +++  P   + K  S +  G+ D+    E +L KAVA   PVS
Sbjct: 192 KDNQGLDSEDSYPYLGTDDQ--PCHYDPKYNSANDTGFIDIPSGKERALMKAVAAVGPVS 249

Query: 274 VAIEAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDW 327
           VAI+AG  +FQ Y+SG+ +  EC S  LDHGV+ VGYG E    +G  YW+V+NSW   W
Sbjct: 250 VAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKW 309

Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           G+ GY+ + ++        CGIA  ASYP+
Sbjct: 310 GDKGYIYMAKD----RKNHCGIATAASYPL 335


>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
          Length = 334

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 139/319 (43%), Positives = 196/319 (61%), Gaps = 24/319 (7%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
           +  W  K G+T +      +R Q + +N + +  HN L     ++Y++G+  FAD+ NEE
Sbjct: 26  FHAWRLKFGRTYSSPTEEAQRRQTWLNNRKLVLVHNILADQGIKSYRLGMTYFADMENEE 85

Query: 108 YRAMY----LGT-RSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           Y+ +     LG+  +   RR       S  +      +LP +VDWR+KG V  VKDQ  C
Sbjct: 86  YKRLISQGCLGSFNASLPRR------GSTFFRLPENKDLPAAVDWRDKGYVTDVKDQKQC 139

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
           GSCWAFS   ++EG     TG+L+SLSEQ+LVDC     N GC GGLMD AF++I   GG
Sbjct: 140 GSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNMGCGGGLMDDAFRYIQATGG 199

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGG 280
           +D+E+ YPY   + +C   + +A   +  GY DVS  DE +L++AVA   P+SV I+A  
Sbjct: 200 IDTEESYPYEAEDGECR-YKPDAVGATCTGYVDVSSGDEDALQEAVATIGPISVGIDASH 258

Query: 281 RAFQHYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
            +FQ YESG++   +C S+ LDHGV+AVGYG+ENG DYWLV+NSWG  WG+ GY+K+ +N
Sbjct: 259 ISFQLYESGLYDEPQCSSSELDHGVLAVGYGSENGQDYWLVKNSWGLTWGDQGYIKMSKN 318

Query: 339 LLDTNTGKCGIAMEASYPV 357
                + +CGIA  ASYP+
Sbjct: 319 ----KSNQCGIATAASYPL 333


>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
          Length = 328

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 142/320 (44%), Positives = 187/320 (58%), Gaps = 23/320 (7%)

Query: 50  TIYQTWL---AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFAD 102
           ++ Q W    A+HG+    +     R  +F+ N +FID+HN+       T+ + +N+F D
Sbjct: 19  SLRQQWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGD 78

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           +T+EE+ A   G  +   RR      A         + LP+ VDWR KGAV PVKDQ  C
Sbjct: 79  MTSEEFTATMNGFLNVPSRRPTAILRAD------PDETLPKEVDWRTKGAVTPVKDQKQC 132

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
           GSCWAFST  ++EG + +  G+L+SLSEQ LVDC  K  N GC GGLMD AF++I  N G
Sbjct: 133 GSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKG 192

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGG 280
           +D+E  YPY   + KC     N       GY DV    E +LKKAVA   P+SVAI+A  
Sbjct: 193 IDTEDSYPYEAQDGKCRFDASNVGATDT-GYVDVEHGSESALKKAVATIGPISVAIDASQ 251

Query: 281 RAFQHYESGVFTGE-CGSA-LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQR 337
            +FQ Y  GV+  E C S  LDHGV+AVGYG TE G  YWLV+NSW + WG  GY+++ R
Sbjct: 252 PSFQFYHDGVYYEEGCSSTMLDHGVLAVGYGETEKGEAYWLVKNSWNTSWGNKGYIQMSR 311

Query: 338 NLLDTNTGKCGIAMEASYPV 357
           +        CGIA +ASYP+
Sbjct: 312 D----KKNNCGIASQASYPL 327


>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
          Length = 324

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 136/314 (43%), Positives = 190/314 (60%), Gaps = 24/314 (7%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEE 107
           +Q++  KHGKT        KRF IF++NLR I+ HN+  +    +Y  G+NKFAD+T  E
Sbjct: 26  FQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAE 85

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           ++AM L T+   K     S VA++ +    G  +PES+DWR +  V P+KDQ  CGSCWA
Sbjct: 86  FKAM-LATQVKTK----PSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWA 140

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
           F+ V + EG   + TG+L   SEQ+LVDC   +N GC+GG +D  F + IQ  G++ E D
Sbjct: 141 FAVVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYGCDGGYLDDTFPY-IQTNGLELESD 199

Query: 228 YPYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQH 285
           YPY G +  C  S  ++KVV+ +  Y  V P +E +L +AV    PV++AI A     Q 
Sbjct: 200 YPYTGYDGYC--SYESSKVVTKVSSYVSV-PANEQALLEAVGTAGPVAIAINADD--LQF 254

Query: 286 YESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           Y SG+   + C    LDHGV+AVGY +ENG DYWL++NSWG+DWGE+GY +  R      
Sbjct: 255 YFSGIIDDKYCDPEYLDHGVLAVGYDSENGRDYWLIKNSWGADWGESGYFRFLR-----G 309

Query: 344 TGKCGIAMEASYPV 357
              CG+  +A YP+
Sbjct: 310 QNICGVKEDAVYPL 323


>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
 gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
          Length = 363

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 135/321 (42%), Positives = 192/321 (59%), Gaps = 20/321 (6%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
           RT D +   +  +  ++GK+       +KRF+IF ++L+ +   N    +Y++G+N+F+D
Sbjct: 55  RTRDALR--FARFAVRYGKSYESAAEVQKRFRIFSESLQLVRSTNRKGLSYRLGINRFSD 112

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           ++ EE+RA  LG   +    L     A       A   LP++ DWRE G V+PVK+QG C
Sbjct: 113 MSWEEFRATRLGAAQNCSATL-----AGNHRMRAAAVALPKTKDWREDGIVSPVKNQGHC 167

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGG 221
           GSCW FST  A+E      TG+ ISLSEQ+LVDC +  N  GCNGGL   AF++I  NGG
Sbjct: 168 GSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGKPFNNFGCNGGLPSQAFEYIKYNGG 227

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGG 280
           +D+E+ YPY G    CD    N  V  +D   +++   E  LK AVA  +PVSVA +   
Sbjct: 228 LDTEESYPYKGVNGICDFKAENVGVKVLDSV-NITLGAEDELKDAVALVRPVSVAFQV-V 285

Query: 281 RAFQHYESGVFTGE-CGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
             F+ Y+SGV+T + CG+    ++H V+AVGYG ENGV YWL++NSWG+DWG+ GY K++
Sbjct: 286 NGFRQYKSGVYTSDSCGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDKGYFKME 345

Query: 337 RNLLDTNTGKCGIAMEASYPV 357
                     CG+A  ASYP+
Sbjct: 346 -----MGKNMCGVATCASYPI 361


>gi|23452059|gb|AAN32912.1| cathepsin [Danio rerio]
          Length = 310

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 137/299 (45%), Positives = 186/299 (62%), Gaps = 23/299 (7%)

Query: 72  RFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           R   +K NL+ I+ HN  +     TY++G+N F D+T+EE+R +  G +    RR   S 
Sbjct: 21  RRIFWKKNLKXIEMHNLXHSMGIHTYRLGMNHFGDMTHEEFRQVMNGFKHKKDRRFRGSL 80

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
                +      E+P  +DWREKG V PVKDQG CGSCWAFST  A+EG     TG+L+S
Sbjct: 81  FMEPXFI-----EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVS 135

Query: 188 LSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK-CDPSRRNAK 245
           LSEQ LVDC R + N GCNGGLMD AFQ++    G+DSE+ YPYLG +++ C    +N+ 
Sbjct: 136 LSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNS- 194

Query: 246 VVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGS-ALDHG 302
             +  G+ D+    E +L KA+A   PVSVAI+AG  +FQ Y+SG+ +  EC S  LDHG
Sbjct: 195 AANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHG 254

Query: 303 VVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           V+AVGYG E    +G  YW+V+NSW  +WG+ GY+ + ++        CGIA  ASYP+
Sbjct: 255 VLAVGYGFEGEDVDGKKYWIVKNSWSENWGDKGYIYMAKD----RHNHCGIATAASYPL 309


>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
 gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
 gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
          Length = 352

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 137/318 (43%), Positives = 199/318 (62%), Gaps = 19/318 (5%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNE 106
           +M  + +W A + ++       ++RFQ+++ N+  I+  N   N TY +G N+FADLT E
Sbjct: 45  MMDRFLSWQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEE 104

Query: 107 EYRAMY----LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG-S 161
           E+  +Y    +  R DA ++  ++ V+S   A  A    P SVDWR KGAV P+K+QG S
Sbjct: 105 EFLDLYTMKGMPVRRDAGKK--RANVSSSAAAVDA----PTSVDWRSKGAVTPIKNQGPS 158

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
           C SCWAF T A +E I KI TG+L+SLSEQEL+DCD   + GCN G     ++++IQNGG
Sbjct: 159 CSSCWAFVTAATIESITKITTGKLVSLSEQELIDCD-PYDGGCNLGYFVNGYRWVIQNGG 217

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           + +E +YPY      C  SR      +I  Y  + P  E  L++AVA QPV+ AIE GG 
Sbjct: 218 LTTEANYPYQARRYACSRSRAAQHAATISDYVQL-PAGEGQLQQAVAQQPVAAAIEMGG- 275

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNL 339
           + Q Y  GVF+G+CG+ ++H +  VGYG +  +G+ YWLV+NSWG  WGE GY++++R++
Sbjct: 276 SLQFYSGGVFSGQCGTRMNHAITVVGYGADSSSGLKYWLVKNSWGQSWGERGYLRMRRDV 335

Query: 340 LDTNTGKCGIAMEASYPV 357
                G CGIA++ +YPV
Sbjct: 336 --GRGGLCGIALDLAYPV 351


>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
          Length = 333

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 145/315 (46%), Positives = 192/315 (60%), Gaps = 26/315 (8%)

Query: 55  WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRA 110
           W AKH K   GM     R  +++ N++ I+ HN         + + +N F D+TNEE+R 
Sbjct: 32  WKAKHRKLY-GMREEGWRRAVWEKNMKMIEVHNQEYSQGKHGFTMAMNAFGDMTNEEFRQ 90

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
           +  G R+   +   K KV  +     +  E+P+SVDWREKG V PVK+QG CGSCWAFS 
Sbjct: 91  VMNGFRNQKHK---KGKVFQE----PSFLEVPKSVDWREKGYVTPVKNQGQCGSCWAFSA 143

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
             A+EG     TG+LISLSEQ LVDC R + N GC+GGLMDYAFQ+I +NGG+DSE+ YP
Sbjct: 144 TGALEGQMFRKTGKLISLSEQNLVDCSRPQGNEGCDGGLMDYAFQYIKENGGLDSEESYP 203

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYES 288
           Y   +  C   R    V +  G+ D+ P +E +L KAVA   P+SVAI+AG  +FQ Y+ 
Sbjct: 204 YDAMDESCK-YRPEYSVANDTGFVDI-PKEEKALMKAVATVGPISVAIDAGHESFQFYKE 261

Query: 289 GV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
           GV F  EC S  +DHGV+ VGYG E    +   +WLV+NSWG +WG  GY+K+ ++    
Sbjct: 262 GVYFEPECSSDNVDHGVLVVGYGYEETESDNNKFWLVKNSWGEEWGLGGYIKMTKD---- 317

Query: 343 NTGKCGIAMEASYPV 357
               CGIA  ASYP 
Sbjct: 318 QKNHCGIATAASYPT 332


>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
          Length = 324

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 135/314 (42%), Positives = 191/314 (60%), Gaps = 24/314 (7%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEE 107
           +Q++  KHGKT        KRF IF++NLR I+ HN+  +    +Y  G+NKFAD+T  E
Sbjct: 26  FQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAE 85

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           ++AM L T+   K     S VA++ +    G  +PES+DWR +  V P+KDQ  CGSCW+
Sbjct: 86  FKAM-LATQVKTK----PSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWS 140

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
           F+ V + EG   + TG+L   SEQ+LVDC   +N GC+GG +D  F + IQ  G++ E D
Sbjct: 141 FAVVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYGCDGGYLDDTFPY-IQTNGLELESD 199

Query: 228 YPYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQH 285
           YPY G +  C  S  ++KVV+ +  Y  V P +E +L +AV    PV++AI A     Q 
Sbjct: 200 YPYTGYDGSC--SYDSSKVVTKVSSYVSV-PANEQALLEAVGTAGPVAIAINADD--LQF 254

Query: 286 YESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
           Y SG+   + C    LDHGV+AVGY +ENG+DYWL++NSWG+DWGE+GY +  R      
Sbjct: 255 YFSGIIDDKYCDPEWLDHGVLAVGYNSENGLDYWLIKNSWGADWGESGYFRFLR-----G 309

Query: 344 TGKCGIAMEASYPV 357
              CG+  +A YP+
Sbjct: 310 QNICGVKEDAVYPL 323


>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
 gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
          Length = 356

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 149/372 (40%), Positives = 216/372 (58%), Gaps = 34/372 (9%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MATAS  LA+     L    ++ + D   I                 ++  ++ W A++ 
Sbjct: 3   MATASASLALMFACSLLLAGTAFSDDTIAIP----------------LLERFKAWQAEYN 46

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSD 118
           +T       ++RF I+ +N+RFI   N L+   +Y++G N+F DLT EE++  YL  + D
Sbjct: 47  RTYATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYL-MKLD 105

Query: 119 AKRRLMKSKVASQRYACKAG-------DELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
            +    ++   +      AG        E P SVDWR KGAV  VKDQ  CGSCWAF+TV
Sbjct: 106 EQPPAAEAMGPTVGTMSTAGMSNGNNTGEAPNSVDWRTKGAVTRVKDQQQCGSCWAFATV 165

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKIN-AGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
           A++EG+++I TG L+SLSEQE+VDCDR  N  GC GG    A +++ +NGG+ +E DYPY
Sbjct: 166 ASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVTRNGGLTTESDYPY 225

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
           +G++ +C   +       I GY+ V   +E  L++AVA++PV+V I+A  RAFQ Y+SGV
Sbjct: 226 VGSQRQCMSGKLGHHAARIRGYQAVQRNNEAELERAVAERPVAVFIDA-SRAFQFYKSGV 284

Query: 291 FTGEC-GSALDHGVVAVGYGT----ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           F+G C  + ++H V  VGYG+      G  YW+V+NSWG  WGENGYV++ R +     G
Sbjct: 285 FSGPCDTTTVNHVVTVVGYGSTGSDSGGRKYWIVKNSWGQGWGENGYVRMARRVRARE-G 343

Query: 346 KCGIAMEASYPV 357
            C IA+E  YPV
Sbjct: 344 MCAIAIEPYYPV 355


>gi|5901663|gb|AAD55363.1| cysteine protease [Hordeum vulgare subsp. vulgare]
          Length = 163

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 117/163 (71%), Positives = 137/163 (84%), Gaps = 1/163 (0%)

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGG 221
           GSCWAFS V+ VE IN++VTGE+I+LSEQELV+C     N+GCNGGLMD AF FII+NGG
Sbjct: 1   GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 60

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
           +D+E+DYPY   + KCD +R NAKVVSIDG+EDV   DE SL+KAVA QPVSVAIEAGGR
Sbjct: 61  IDTEEDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGR 120

Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWG 324
            FQ Y SGVF+G CG++LDHGVVAVGYGT+NG DYW+VRNSWG
Sbjct: 121 EFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWG 163


>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
          Length = 362

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 130/325 (40%), Positives = 186/325 (57%), Gaps = 23/325 (7%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
           D  +M  ++ W   H ++        +RF +++ N  FID  N   + TY++  N+FADL
Sbjct: 44  DMVMMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYRLAENEFADL 103

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD---------ELPESVDWREKGAVN 154
           T EE+ A Y G  +          V        AGD         ++P SVDWR +GAV 
Sbjct: 104 TEEEFLATYTGYYAG------DGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVV 157

Query: 155 PVKDQGS-CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAF 213
           P K Q S C SCWAF T A +E +N I TG+L+SLSEQ+LVDCD   + GCN G    A+
Sbjct: 158 PPKSQTSTCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAY 216

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
           +++++NGG+ +E DYPY      C+ ++       I G+  V P +E +L+ AVA QPV+
Sbjct: 217 KWVVENGGLTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVA 276

Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENG 331
           VAIE  G   Q Y+ GV+TG CG+ L H V  VGYGT+  +G  YW ++NSWG  WGE G
Sbjct: 277 VAIEV-GSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERG 335

Query: 332 YVKLQRNLLDTNTGKCGIAMEASYP 356
           Y+++ R++     G CG+ ++ +YP
Sbjct: 336 YIRILRDV--GGPGLCGVTLDIAYP 358


>gi|52546920|gb|AAU81593.1| cysteine proteinase [Petunia x hybrida]
          Length = 210

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 120/209 (57%), Positives = 148/209 (70%), Gaps = 1/209 (0%)

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
           +HGK    +     RF+IFK+NL+ IDE N +   Y +GLN+F+DL+++E++ MYLG + 
Sbjct: 3   QHGKIYESIEEKLHRFEIFKENLKHIDERNKIVSNYWLGLNEFSDLSHDEFKKMYLGLKV 62

Query: 118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
           D    L   K + Q +  +   +LP+SVDWR+KGAV PVK+QG CGSCWAFSTVAAVEGI
Sbjct: 63  DHDL-LNNKKQSQQDFEYRDFVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGI 121

Query: 178 NKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC 237
           N+I TG L SLSEQEL+DCD   N GCNGGLMDYAFQFII NGG+  E DYPYL  E  C
Sbjct: 122 NQIKTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFQFIISNGGLHKEDDYPYLMEEGTC 181

Query: 238 DPSRRNAKVVSIDGYEDVSPFDEMSLKKA 266
           D  R  ++VV+IDGY DV   DE SL KA
Sbjct: 182 DEKRDESEVVTIDGYRDVPANDEQSLLKA 210


>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 137/323 (42%), Positives = 197/323 (60%), Gaps = 21/323 (6%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADL 103
           V   + ++  +H K          R +IF DN   + +HN L       YK+ +NK+ DL
Sbjct: 23  VQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVAKHNKLFEQGLYPYKLAMNKYGDL 82

Query: 104 TNEEYRAMYLG---TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
            + E+  +  G   T++  KR  ++  +    +   A  ++P++VDWR++GAV PVKDQG
Sbjct: 83  LHHEFVGLLNGFNRTKTYLKRGELQDSIT---FIEPAHVDIPDTVDWRQEGAVTPVKDQG 139

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
            CGSCW+FS   A+EG +   T +L+SLSEQ LVDC  +  N GCNGGLMD AF++I  N
Sbjct: 140 HCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRFGNNGCNGGLMDNAFRYIKNN 199

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
           GG+D+E  YPY+G + K   S +N +  +  G+ D+   DE  LK AVA   P+S+AI+A
Sbjct: 200 GGIDTEAAYPYMGEDEKFRYSAKN-RGATDKGFVDIPSGDEDKLKAAVATVGPISIAIDA 258

Query: 279 GGRAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVK 334
              +FQ Y +GV++   C S  LDHGV+ VGYGT+   G+DYWLV+NSWG  WG +GY+K
Sbjct: 259 SHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGTDEKTGMDYWLVKNSWGDTWGLDGYIK 318

Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
           + RN       +CG+A +ASYP+
Sbjct: 319 MARN----QDNQCGVATQASYPL 337


>gi|355681660|gb|AER96816.1| cathepsin L2 [Mustela putorius furo]
          Length = 334

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 146/331 (44%), Positives = 198/331 (59%), Gaps = 25/331 (7%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
           S++   D  + + +  W A H +   GM     R  +++ N++ I+ HN         + 
Sbjct: 16  SAAPELDQSLDSQWYQWKATHRRLY-GMNEEGWRRAVWEKNMKMIELHNREYSQGKHGFT 74

Query: 95  VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
           + +N F D+TNEE+R +  G R+   R   K KV  +        E+P+SVDW +KG V 
Sbjct: 75  MAMNAFGDMTNEEFRQVMNGFRNQKHR---KGKVFQEPLFA----EIPKSVDWTQKGYVT 127

Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
           PVK+QG CGSCWAFS   A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD+AF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRSQGNQGCNGGLMDFAF 187

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
           Q+I  NGG+DSE+ YPYL  +      +    V +  G+ D+ P  E +L KAVA   P+
Sbjct: 188 QYIKDNGGLDSEESYPYLARDTDSCNYKPEYSVANDTGFVDI-PQRERALMKAVATVGPI 246

Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
           SVAI+AG ++FQ Y+SG+ F  +C S  LDHGV+ VGYG E    N   +W+V+NSWG +
Sbjct: 247 SVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPE 306

Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           WG NGYVK+ +   D N   CGIA  ASYP 
Sbjct: 307 WGCNGYVKMAK---DQNN-HCGIATAASYPT 333


>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
          Length = 372

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 137/296 (46%), Positives = 182/296 (61%), Gaps = 17/296 (5%)

Query: 72  RFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           R +IF DN R I EHN         YK+G+NK+ D+ + E      G        + + +
Sbjct: 83  RMKIFLDNKRKIVEHNRKYEMKEVNYKLGMNKYGDMLHHELINTLNGFNKSV--TVSEEQ 140

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
           +    +   A  ELP+SVDWR+KGAV  +KDQG CGSCWAFS+  A+EG +   +G L+S
Sbjct: 141 LIGATFIEPANVELPKSVDWRKKGAVTAIKDQGQCGSCWAFSSTGALEGQHFRQSGVLVS 200

Query: 188 LSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           LSEQ L+DC  K  N GCNGGLMDYAF++I +N G+D+E+ YPY    ++C  + +N+  
Sbjct: 201 LSEQNLIDCSGKYGNNGCNGGLMDYAFRYIKENKGLDTEKSYPYEAENDQCRYNPKNSGA 260

Query: 247 VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGSA-LDHGV 303
             + G+ D+   DE  LK AVA   P+SVAI+A   +F  Y  GV +  EC  A LDHGV
Sbjct: 261 SDV-GFVDIPEGDEDKLKAAVATIGPISVAIDASHESFHFYSEGVYYEPECSPANLDHGV 319

Query: 304 VAVGYGTENGV--DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + VGYGT++G   DYWLV+NSWG  WGE GY+K+ RN        CGIA  ASYP+
Sbjct: 320 LIVGYGTDSGTGEDYWLVKNSWGETWGEKGYIKMARN----KENHCGIASSASYPL 371


>gi|294883322|ref|XP_002770704.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239873993|gb|EER02713.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 333

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 138/298 (46%), Positives = 188/298 (63%), Gaps = 16/298 (5%)

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
           K GK         KR  IF+ NL  I++ N+ + +YK+G+N+ ADLT+EE+ A+ LGT  
Sbjct: 34  KFGKNYESKEEEVKRNAIFQANLHHIEQVNAKDLSYKLGVNEHADLTHEEFAALKLGTLK 93

Query: 118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
            + RR  K  + +         +LP SVDWR K  + PVKDQGSCGSCWAFST  A+E  
Sbjct: 94  MSTRRDDKFVIEADT------TQLPTSVDWRNKNVLTPVKDQGSCGSCWAFSTTGALEAQ 147

Query: 178 NKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
             I TG+L+SLSEQ+LVDC     N GC GGLMD A+++ I++ G+D E  Y Y G ++ 
Sbjct: 148 YAIATGKLLSLSEQQLVDCSSGYGNNGCEGGLMDDAYEY-IKSAGLDQESTYSYNGTDDV 206

Query: 237 CDPS--RRNAKVVS--IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF- 291
           C  S  +R+  + +  + G+  +    E SL KA+AD PVSVA+ A    F+ Y+SGV+ 
Sbjct: 207 CQGSLAKRSDGIPAGEVTGFHMLDK-TEQSLMKALADAPVSVAMYAADPDFRFYKSGVYS 265

Query: 292 TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
           +  C   LDHGVVAVGYGTENG DY+++RNSWGS WG+ GY  L+R +  +  G+C I
Sbjct: 266 SATCNGKLDHGVVAVGYGTENGSDYFIIRNSWGSSWGQAGYFYLKRGV--SGYGECNI 321


>gi|348542774|ref|XP_003458859.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 330

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 141/316 (44%), Positives = 194/316 (61%), Gaps = 22/316 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
           +  W  K  K+ +      +R QI+ +N + + +HN+L     +++++G+  FAD+ NEE
Sbjct: 26  FHAWKLKFEKSYDSPSEETQRKQIWLNNRKLVLKHNALADLGLKSFRLGMTYFADMENEE 85

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           Y+   LG        L +     +R     G  LP++VDWR++G V  VKDQ  CGSCWA
Sbjct: 86  YKK--LGCLGSFNASLPRHGSTFRRLP--KGTVLPDTVDWRKQGYVTHVKDQKECGSCWA 141

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS   A+EG     TG+L+SLSEQ+LVDC RK  N GC GG   +AFQ+I  NGG+D+E+
Sbjct: 142 FSATGALEGQYFKKTGKLVSLSEQQLVDCSRKFRNNGCEGGEPHWAFQYIRYNGGLDTEE 201

Query: 227 DYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAF 283
            Y Y   + +C  +P    AK     GY +VSPF++ +LK+AVA   P+SVAI+    +F
Sbjct: 202 SYHYEAKDGQCHYNPDSVGAKC---SGYVNVSPFED-ALKEAVATIGPISVAIDISRVSF 257

Query: 284 QHYESGVFTGECGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
           Q Y SGV+     S   L+H V+AVGYGTENG DYWLV+NSWGS+WG  GY+K+ RN   
Sbjct: 258 QLYHSGVYDEPWCSNINLNHAVLAVGYGTENGHDYWLVKNSWGSEWGNKGYIKMTRN--- 314

Query: 342 TNTGKCGIAMEASYPV 357
               +CGIA EASYP+
Sbjct: 315 -KDNQCGIATEASYPL 329


>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
 gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
 gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
          Length = 362

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 130/325 (40%), Positives = 186/325 (57%), Gaps = 23/325 (7%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
           D  +M  ++ W   H ++        +RF +++ N  FID  N   + TY++  N+FADL
Sbjct: 44  DMVMMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADL 103

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD---------ELPESVDWREKGAVN 154
           T EE+ A Y G  +          V        AGD         ++P SVDWR +GAV 
Sbjct: 104 TEEEFLATYTGYYAG------DGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVV 157

Query: 155 PVKDQGS-CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAF 213
           P K Q S C SCWAF T A +E +N I TG+L+SLSEQ+LVDCD   + GCN G    A+
Sbjct: 158 PPKSQTSTCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAY 216

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
           +++++NGG+ +E DYPY      C+ ++       I G+  V P +E +L+ AVA QPV+
Sbjct: 217 KWVVENGGLTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVA 276

Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENG 331
           VAIE  G   Q Y+ GV+TG CG+ L H V  VGYGT+  +G  YW ++NSWG  WGE G
Sbjct: 277 VAIEV-GSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERG 335

Query: 332 YVKLQRNLLDTNTGKCGIAMEASYP 356
           Y+++ R++     G CG+ ++ +YP
Sbjct: 336 YIRILRDV--GGPGLCGVTLDIAYP 358


>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
          Length = 337

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 138/327 (42%), Positives = 199/327 (60%), Gaps = 26/327 (7%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLNKF 100
           D ++   +  W + H K  +      +R  +++ NL+ I+    EH+    +Y++G+N F
Sbjct: 22  DPQLDQHWNLWKSWHSKNYHQREEGWRRL-VWEKNLKKIELHNLEHSMGKHSYRLGMNHF 80

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
            D+T+EE++ +  G +  A+R+   S      +      E P SVDWREKG V PVKDQG
Sbjct: 81  GDMTHEEFKQIMNGYKHKAERKFKGSLFLEPNFL-----EAPRSVDWREKGYVTPVKDQG 135

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
            CGSCWAFST  A+EG     TG+L+SLS Q LV+C R + N GCNGGLMD AFQ++  N
Sbjct: 136 ECGSCWAFSTTGALEGQEFTRTGKLVSLSGQNLVECSRPEGNEGCNGGLMDQAFQYVKDN 195

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
            G+DSE  YPYLG +++  P   + K  + +  G+ D+   +E +L KAVA   PVSVAI
Sbjct: 196 QGLDSEDSYPYLGTDDQ--PCHYDPKFSAANDTGFVDIPSGNERALMKAVASVGPVSVAI 253

Query: 277 EAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
           +AG  +FQ Y+SG+ +  EC S  LDHGV+AVGYG +    +G  +W+V+NSW  +WG+ 
Sbjct: 254 DAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFQGEDVDGKKFWIVKNSWSENWGDK 313

Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GY+ + ++        CGIA  ASYP+
Sbjct: 314 GYIYMAKD----RKNHCGIATAASYPL 336


>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
          Length = 326

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 142/321 (44%), Positives = 187/321 (58%), Gaps = 23/321 (7%)

Query: 47  EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFAD 102
           E  + +  W A HGK  N       RF+IF++N   I +HN   R    TY +G+N F D
Sbjct: 18  EFSSEWLKWKATHGKVYNSADEESLRFKIFQENSLMITQHNEEYRQGFHTYILGMNHFGD 77

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           L + E+     G +                +       +P   +W  KGAV PVKDQG C
Sbjct: 78  LLHSEFLERSNGFQGGVS--------GGDVFTFDTNAPVPSYANWTAKGAVTPVKDQGKC 129

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGG 221
           GSCWAFS   +VEG   +   +L+SLSEQ+LVDC   + N GC GGLMD AF++ I N G
Sbjct: 130 GSCWAFSATGSVEGQIFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMDNAFKYFIANKG 189

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGG 280
           + +E+ YPY   +N C   +++  V +I  ++DV   DE  LK AVA+  PVSVAI+A  
Sbjct: 190 IANEKSYPYTAKDNDC-KYKKSMSVATISSFKDVKHKDEDQLKMAVANVGPVSVAIDASS 248

Query: 281 RAFQHYESGVFTGE-CGS-ALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQ 336
             FQ YESGV+  E C S  LDHGV+AVGYGT+  +G+D+WLV+NSW + WG NGY+K+ 
Sbjct: 249 SKFQFYESGVYYDENCSSEVLDHGVLAVGYGTDKKSGMDFWLVKNSWAASWGLNGYIKMA 308

Query: 337 RNLLDTNTGKCGIAMEASYPV 357
           RN  D N   CGIA  ASYP+
Sbjct: 309 RN-KDNN---CGIATMASYPI 325


>gi|1222694|gb|AAA92018.1| CP5 [Dictyostelium discoideum]
          Length = 344

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 143/331 (43%), Positives = 192/331 (58%), Gaps = 42/331 (12%)

Query: 52  YQTWLAKHGK--TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYR 109
           +  W+  H K  TS   G    R+ IF  N+ ++ + NS      +GLN FAD+TNEEYR
Sbjct: 30  FTDWMITHQKSYTSEEFG---ARYNIFTANMDYVQQWNSKGSETVLGLNNFADITNEEYR 86

Query: 110 AMYLGTRSDAKRRL--MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
             YLGT+ DA   +   + KV +   A         S DWR +GAV PVK+QG CG CW+
Sbjct: 87  NTYLGTKFDASSLIGTQEEKVHTNSSAA--------SKDWRSEGAVTPVKNQGQCGGCWS 138

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
           FST  + EG +    GEL+SLSEQ L+DC  + N+GC+GGLM YAF++II N G+D+E  
Sbjct: 139 FSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYAFEYIINNNGIDTESS 197

Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
           YPY     KC+    N+   ++  Y+ V+   E SL+ AV   PVSVAI+A  ++FQ Y 
Sbjct: 198 YPYKAENGKCEYKSENSG-ATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYT 256

Query: 288 SGV-FTGECGSA-LDHGVVAVGYGTENGV-------------------DYWLVRNSWGSD 326
           SG+ +  EC S  LDHGV+AVGYG+ +G                    +YW+V+NSWG+ 
Sbjct: 257 SGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTS 316

Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           WG  GY+ + RN  D N   CGIA  AS+PV
Sbjct: 317 WGIEGYILMSRN-RDNN---CGIASSASFPV 343


>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
          Length = 358

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 130/325 (40%), Positives = 186/325 (57%), Gaps = 23/325 (7%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
           D  +M  ++ W   H ++        +RF +++ N  FID  N   + TY++  N+FADL
Sbjct: 40  DMVMMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADL 99

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGD---------ELPESVDWREKGAVN 154
           T EE+ A Y G  +          V        AGD         ++P SVDWR +GAV 
Sbjct: 100 TEEEFLATYTGYYAG------DGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVV 153

Query: 155 PVKDQGS-CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAF 213
           P K Q S C SCWAF T A +E +N I TG+L+SLSEQ+LVDCD   + GCN G    A+
Sbjct: 154 PPKSQTSTCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCD-SYDGGCNLGSYGRAY 212

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
           +++++NGG+ +E DYPY      C+ ++       I G+  V P +E +L+ AVA QPV+
Sbjct: 213 KWVVENGGLTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVA 272

Query: 274 VAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENG 331
           VAIE  G   Q Y+ GV+TG CG+ L H V  VGYGT+  +G  YW ++NSWG  WGE G
Sbjct: 273 VAIEV-GSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERG 331

Query: 332 YVKLQRNLLDTNTGKCGIAMEASYP 356
           Y+++ R++     G CG+ ++ +YP
Sbjct: 332 YIRILRDV--GGPGLCGVTLDIAYP 354


>gi|33520126|gb|AAQ21040.1| cathepsin L precursor [Branchiostoma belcheri tsingtauense]
          Length = 327

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 140/312 (44%), Positives = 193/312 (61%), Gaps = 25/312 (8%)

Query: 59  HGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLG 114
           HGK  N    +  R  IF +N + + +HN        T+ + +NKF DLTNEE+R + +G
Sbjct: 27  HGKQYNEY-EDTARHAIFLENCKIVKQHNEEAAMGKHTFFMRMNKFGDLTNEEFRMLVIG 85

Query: 115 TRSDAKRRLMKSKVASQR----YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
           +       LM+S    Q     +    G ++ ++VDWR+KGAV  VK+Q  CGSCWAFST
Sbjct: 86  SG------LMQSNRTQQAEGGVFESIPGLKVNDTVDWRQKGAVTKVKNQEQCGSCWAFST 139

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
             ++EG + + +G L+SLSEQ LVDC RK  N GC GGLMD AF++I  NGG+D+E+ YP
Sbjct: 140 TGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGCKGGLMDQAFKYIKTNGGIDTEECYP 199

Query: 230 YLGA-ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYE 287
           Y G  E KC+  + +    ++  + DV   DE +LK+A A   P+SV I+A   +FQ Y+
Sbjct: 200 YKGRDERKCE-YKASCSGATLSSFVDVKTGDEDALKQASATIGPISVGIDASHPSFQLYD 258

Query: 288 SGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
            GV+    C S  LDHGV+ VGYGT++  DYWLV+NSWG+DWG  GY+ + RN       
Sbjct: 259 HGVYHEKRCSSKKLDHGVLVVGYGTQSTKDYWLVKNSWGADWGMEGYIMMSRN----KDN 314

Query: 346 KCGIAMEASYPV 357
           +CGIA +ASYPV
Sbjct: 315 QCGIATQASYPV 326


>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
          Length = 373

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 146/341 (42%), Positives = 200/341 (58%), Gaps = 33/341 (9%)

Query: 29  IISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS 88
           +I  D N   SS W+     MT Y+       +       +E+RF+IF +N   I +HN 
Sbjct: 53  VIGVDWNFTLSSIWK---HFMTTYK-------RNYIDPSEHERRFKIFANNFVRISKHNV 102

Query: 89  L----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPES 144
                  +Y +G+N+F+D T+EE +      R    R  + +     +Y   A    P  
Sbjct: 103 RFIQGQVSYTMGINEFSDKTDEELK------RLRCFRGSLNASRDGSKYITIAAPP-PSE 155

Query: 145 VDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAG 203
           +DWR KGAV PVK+QG+CGSCWAFS   A+EG N + TG L+SLSEQ+LVDC  +  N  
Sbjct: 156 IDWRNKGAVTPVKNQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNA 215

Query: 204 CNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN-KCDPS-RRNAK--VVSIDGYEDVSPFD 259
           CNGGLMD AF+++  + G+D+E  YPY+  E    +P+ R N K  VV + GY D+    
Sbjct: 216 CNGGLMDNAFKYVKDSNGIDTEASYPYVSGETGDANPTCRFNLKEAVVRVTGYIDLPRGQ 275

Query: 260 EMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFT-GECGS-ALDHGVVAVGYGTENGVDY 316
              LK+AV    P+SVAI AG  +F  Y+SGV++  +C S  LDHGV+ VGYG ENG+ Y
Sbjct: 276 VSELKQAVGHYGPISVAINAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYGEENGIPY 335

Query: 317 WLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           WL++NSWG  WGENGYVK+ R+    +   CG+A  ASYP+
Sbjct: 336 WLIKNSWGPHWGENGYVKILRD----HNNLCGVASMASYPL 372


>gi|281204231|gb|EFA78427.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
          Length = 329

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 131/292 (44%), Positives = 180/292 (61%), Gaps = 18/292 (6%)

Query: 72  RFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
           R+  FKDNL FI   N++N+  ++G   FADLTNEEYRA+YLG   DA     +     Q
Sbjct: 48  RYSAFKDNLDFIHRWNAVNKETELGATVFADLTNEEYRAVYLGMNVDASNFAAQPATLDQ 107

Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
            Y       +  ++DWR  GAV  VKDQG CGSCWAFST  AVEG ++I TG  +SLSEQ
Sbjct: 108 VY-----QPVRSTLDWRNNGAVGRVKDQGQCGSCWAFSTTGAVEGAHQIATGNFVSLSEQ 162

Query: 192 ELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN---KCDPSRRNAKVV 247
           +L+DC R   N GC GGLMD A  +I++ GG+++E+ YPY   ++   K +P+   AK  
Sbjct: 163 QLMDCSRSYGNHGCQGGLMDSAMSYIVKQGGINTEESYPYEMRDSYTCKYNPANNGAK-- 220

Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE-CGS-ALDHGVVA 305
            + GY ++    E  L   +   PV++A++A   +FQ Y+SGVF    C S +L HGV+A
Sbjct: 221 -LSGYSNIKRGSEADLAAKLNIGPVAIALDASHSSFQLYKSGVFYDPACSSTSLSHGVLA 279

Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           VGYGTE    YW+V+NSWG+ WG+ GY+ + +   D N   CG+A  +S P+
Sbjct: 280 VGYGTEGSSAYWIVKNSWGTRWGDAGYIWIAK---DRNN-HCGVATMSSIPI 327


>gi|94480716|emb|CAI91577.1| cathepsin L [Aphrocallistes vastus]
          Length = 329

 Score =  244 bits (624), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 138/312 (44%), Positives = 195/312 (62%), Gaps = 21/312 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           ++ W  K+ + S G+   E R +I+ +N+ ++ E N+   +YK+  N+FADLTN EYR +
Sbjct: 30  WEGWKLKYNR-SYGL-DEELRKKIWANNMLYVKEFNAEGHSYKLAANQFADLTNLEYRQI 87

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDE-LPESVDWREKGAVNPVKDQGSCGSCWAFST 170
           YLG  ++A+    +     QR   K  DE LP +VDWR KG V PVK+QG CGSCW+FS 
Sbjct: 88  YLGYDNEARLSRKREGKVFQR---KMKDEDLPTTVDWRSKGVVTPVKNQGQCGSCWSFSA 144

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
             ++EG   I +G+L+S SEQELVDC   + N GC GGLMDYAF++   N   + E DY 
Sbjct: 145 TGSLEGQYAIKSGKLVSFSEQELVDCSTSLGNHGCQGGLMDYAFKYWETNLA-EKESDYT 203

Query: 230 YLGAENKCDPSRRNAK--VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHY 286
           Y     KC   + NA+  V     + D+   +  +LK+AVA++ P++VA++A   +FQ Y
Sbjct: 204 YTAKNGKC---KYNAQLGVTKDSSFTDIPSENCDALKEAVANKGPIAVAMDASHTSFQMY 260

Query: 287 ESGVFTGECGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
            SG++T    S   LDHGV+ VGYGT+NGVDYWL++NSWG  WG +GY K++       +
Sbjct: 261 HSGIYTPFLCSKTKLDHGVLVVGYGTDNGVDYWLIKNSWGMAWGMDGYFKIE-----MKS 315

Query: 345 GKCGIAMEASYP 356
            KCGI  +ASYP
Sbjct: 316 DKCGICTQASYP 327


>gi|348531517|ref|XP_003453255.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 330

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 136/321 (42%), Positives = 189/321 (58%), Gaps = 32/321 (9%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
           +  W  K  K+ +      +R QI+  N + + +HN+L     +++++G+  FAD+ NEE
Sbjct: 26  FHAWKLKFEKSYDSSSEETQRKQIWLTNRKLVLKHNALADQGLKSFRLGMTYFADMENEE 85

Query: 108 YRAM--------YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           Y+ +         L  R+    RL K  V            LP++VDWRE+G V  VK Q
Sbjct: 86  YKKLGCLGSFNASLPCRASTLNRLPKVTV------------LPKTVDWREQGYVTDVKHQ 133

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
             CGSCWAFS   A+EG +   TG L+ LSEQ+LVDC RK  N GC+GG  ++AFQ+I  
Sbjct: 134 QQCGSCWAFSATGALEGQHFKKTGTLVPLSEQQLVDCSRKYRNNGCDGGEPNWAFQYIRD 193

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
           NGG+D+E+ Y Y   + +C   R N+     +GY DVSPF+E  ++      P+SV+I+ 
Sbjct: 194 NGGVDTEKSYRYEAKDGQCR-YRSNSIGAKCNGYVDVSPFEEALMEAVATIGPISVSIDD 252

Query: 279 GGRAFQHYESGVFTGECGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
              +FQ Y+SGV+     S   L+H V+AVGYGTENG DYWLV+NSWGS WG  GY+K+ 
Sbjct: 253 SRVSFQLYQSGVYDEPWCSNINLNHAVLAVGYGTENGHDYWLVKNSWGSGWGNKGYIKMT 312

Query: 337 RNLLDTNTGKCGIAMEASYPV 357
           RN       +CGIA EASYP+
Sbjct: 313 RN----KGNQCGIATEASYPL 329


>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
          Length = 336

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 136/300 (45%), Positives = 186/300 (62%), Gaps = 25/300 (8%)

Query: 72  RFQIFKDNLRFID----EHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           R  I++ NL  I+    EH+    +Y++G+N F D+T+EE+R +  G +   +R+ + S 
Sbjct: 47  RRMIWEKNLNKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYQRKTERKAIGSL 106

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
                +        P +VDWREKG V PVKDQG CGSCWAFST  A+ZG N    G+L+S
Sbjct: 107 FMEPNFMVA-----PSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVS 161

Query: 188 LSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           LSEQ LVDC R + N GC GGLMD AFQ++  N G+DSE  YPYLG +++  P   + K 
Sbjct: 162 LSEQNLVDCSRPEGNEGCGGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQ--PCHYDPKY 219

Query: 247 VSID--GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGS-ALDH 301
            S++  G+ D+    E +L KAVA   PVSVAI+AG  +FQ Y+SG+ +  EC S  LDH
Sbjct: 220 NSVNDTGFVDIPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDH 279

Query: 302 GVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GV+AVGYG E    +G  YW+V+NSW   WG+ GY+ + ++        CGIA  ASYP+
Sbjct: 280 GVLAVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKD----RKNHCGIATAASYPL 335


>gi|323451555|gb|EGB07432.1| hypothetical protein AURANDRAFT_2413 [Aureococcus anophagefferens]
          Length = 263

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 131/272 (48%), Positives = 173/272 (63%), Gaps = 10/272 (3%)

Query: 86  HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
           HN+ N TYK+G N+F+ +  +E+ A Y+G  + AK  + + +      A K  D +   V
Sbjct: 1   HNAKNSTYKLGHNEFSGMFWDEFVAQYVGDATGAKAYMERERNYDYTLA-KQVDAVASDV 59

Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
           DW   GAV  VK+QG CGSCW+FST  A+EG  +I    L SLSEQ LVDCD   ++GCN
Sbjct: 60  DWVASGAVTGVKNQGQCGSCWSFSTTGALEGAFEIAGNTLTSLSEQNLVDCDT-TDSGCN 118

Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
           GGLMD AF++I  NGG+ SE DY Y  A+  C  +    KV ++ G+ DV   DE +LK 
Sbjct: 119 GGLMDNAFKWIQSNGGICSEADYAYTAAKGTCKTTCD--KVATLSGHTDVPSGDEDALKT 176

Query: 266 AVADQPVSVAIEAGGRAFQHYESGVF-TGECGSALDHGVVAVGYGTENGVDYWLVRNSWG 324
           AVA  PVS+AIEA    FQ Y SG+  +  CG+ LDHGV+ VGYGT++G +YW V+NSWG
Sbjct: 177 AVAIGPVSIAIEADKSVFQSYSSGILDSSACGTNLDHGVLVVGYGTDDGSEYWKVKNSWG 236

Query: 325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
           + WGE+GYV++ R      +  CGIA E SYP
Sbjct: 237 TTWGESGYVRIAR-----GSNICGIASEPSYP 263


>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
          Length = 360

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 139/317 (43%), Positives = 189/317 (59%), Gaps = 28/317 (8%)

Query: 51  IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
           ++  +  ++GK    +   ++RF++F DNL+ I  HN    +YK+G+N+F D+T +E+R 
Sbjct: 60  LFARFAHRYGKRYETVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDITWDEFRR 119

Query: 111 MYLGTRSD----AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
             LG   +     K  L  + V            LPE+ DWRE G V+PVK+QG CGSCW
Sbjct: 120 DRLGAAQNCSATTKGNLKLTNVV-----------LPETKDWREAGIVSPVKNQGKCGSCW 168

Query: 167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
            FST  A+E       G+ ISLSEQ+LVDC     N GCNGGL   AF++I  NGG+D+E
Sbjct: 169 TFSTTGALEAAYGQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTE 228

Query: 226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQ 284
           + YPY G    C  S  N  V  ID   +++   E  LK AVA  +PVS+A E   + F+
Sbjct: 229 EAYPYTGKNGLCKFSSENVGVKVIDSV-NITLGAEDELKYAVALVRPVSIAFEV-IKGFK 286

Query: 285 HYESGVFTG-ECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
            Y+SGV+T  ECG+    ++H V+AVGYG ENGV YWL++NSWG+DWG+NGY K++    
Sbjct: 287 QYKSGVYTSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKME---- 342

Query: 341 DTNTGKCGIAMEASYPV 357
                 CGIA  ASYPV
Sbjct: 343 -MGKNMCGIATCASYPV 358


>gi|426362423|ref|XP_004048364.1| PREDICTED: cathepsin L2 isoform 1 [Gorilla gorilla gorilla]
 gi|426362425|ref|XP_004048365.1| PREDICTED: cathepsin L2 isoform 2 [Gorilla gorilla gorilla]
          Length = 334

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 147/328 (44%), Positives = 198/328 (60%), Gaps = 29/328 (8%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHNSL----NRTYKVG 96
           + D  + T +  W A H +     G NE+  R  +++ N++ I+ HN         + + 
Sbjct: 20  KFDQNLDTKWYQWKATHRRL---YGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMA 76

Query: 97  LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
           +N F D+TNEE+R M +G   + K R  K KV  +        +LP+SVDWR+KG V PV
Sbjct: 77  MNAFGDMTNEEFRQM-MGCFRNQKFR--KGKVFREPLFL----DLPKSVDWRKKGYVTPV 129

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
           K+Q  CGSCWAFS   A+EG     TG+L+SLSEQ LVDC R + N GCNGG M  AFQ+
Sbjct: 130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 189

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
           + +NGG+DSE+ YPY+  +  C     N+ V +  G+  V+P  E +L KAVA   P+SV
Sbjct: 190 VKENGGLDSEESYPYVAMDEICKYRPENS-VANDTGFTVVAPGKEKALMKAVATVGPISV 248

Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
           A++AG  +FQ Y+SG+ F  +C S  LDHGV+ VGYG E    N   YWLV+NSWG +WG
Sbjct: 249 AVDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWG 308

Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
            NGYVK+ +   D N   CGIA  ASYP
Sbjct: 309 SNGYVKIAK---DKNN-HCGIATAASYP 332


>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 358

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 142/328 (43%), Positives = 200/328 (60%), Gaps = 18/328 (5%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
           D  +M  ++ + A + +T        +RF++++ N+ +I+  N   + TY++G N+FADL
Sbjct: 33  DMLMMDRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADL 92

Query: 104 TNEEYRAMY-----LGTRSDA-KRRLMKSKVASQ------RYACKAGDEL-PESVDWREK 150
           T +E+RAMY     + +R DA +RR M + +A         Y   A +E  P SVDWR K
Sbjct: 93  TVQEFRAMYTMPARVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVDWRSK 152

Query: 151 GAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMD 210
           GAV PVKDQG CG CWAF+TVA +EG++KI TG+L+SLSEQELVD     + GC GGL +
Sbjct: 153 GAVTPVKDQGGCGCCWAFATVATIEGLHKIKTGQLVSLSEQELVD-CDDADDGCGGGLPE 211

Query: 211 YAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ 270
            A +++  NGG+ +E +YPY G   KCD  + +     I   + V    E  L++AVA Q
Sbjct: 212 IAMEWVAHNGGLTTEANYPYTGKAGKCDRGKASNHAAKIAAAQMVRANSEAELERAVARQ 271

Query: 271 PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGE 329
           PV+VAI A   +   Y+SGV++G C +  DH V  VGYG +N G  YW+++NSW   WGE
Sbjct: 272 PVAVAINAPD-SLMFYKSGVYSGPCTAEFDHAVTVVGYGADNKGHKYWIIKNSWAETWGE 330

Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYPV 357
            GY ++QR +     G CGIA  ASYPV
Sbjct: 331 KGYGRMQRGVA-AKEGLCGIATHASYPV 357


>gi|403333364|gb|EJY65772.1| Cathepsin L [Oxytricha trifallax]
          Length = 338

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 139/310 (44%), Positives = 189/310 (60%), Gaps = 21/310 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEY-R 109
           +  ++AK+GK+       + R ++FK NL  +  +N+ N  TY++GLNKFAD T  EY R
Sbjct: 43  FTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNARNDVTYRLGLNKFADYTEAEYKR 102

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
            +  G + +   R +K   A +           + V+W E+GAV PVKDQG CGSCW+FS
Sbjct: 103 LLGFGGQKNKNPRNIKVLGAPKN----------DGVNWVEQGAVTPVKDQGQCGSCWSFS 152

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
              A+EG  KI  G L SLSEQ+LVDC + + N GC GG MD AFQ++ Q   +++E  Y
Sbjct: 153 ATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVEQT-ALETEDQY 211

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY   ++ C  S  +A VV +D + DV+P +   LK A+   PVSVAIEA    FQ Y  
Sbjct: 212 PYEAVDDTCRAS--SAGVVKVDSFVDVTPNNVNELKAALDKGPVSVAIEADQMVFQFYSG 269

Query: 289 GVFT-GECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           GV     CG+ LDHGV+AVGYG E+G DY+LV+NSWG+ WGE GYVK+  +  +     C
Sbjct: 270 GVINDASCGTTLDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGYVKIAASPDNI----C 325

Query: 348 GIAMEASYPV 357
           GI  +ASYP+
Sbjct: 326 GILSQASYPI 335


>gi|261289789|ref|XP_002611756.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
 gi|229297128|gb|EEN67766.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
          Length = 308

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 132/310 (42%), Positives = 187/310 (60%), Gaps = 22/310 (7%)

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGT 115
           GK  N +     R  IF++N + + +HN        T+ + +NKF DLT EE+R + +G+
Sbjct: 8   GKQYNSLSEENARHSIFEENSKIVKQHNEEAAMGKHTFFMKMNKFGDLTTEEFRMIVIGS 67

Query: 116 RSDAKRRLMKSKVASQR----YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
                   M+S    Q     +    G ++ ++VDWR+KGAV  VK+Q  CGSCWAFS  
Sbjct: 68  G------FMQSNKTQQAEGGVFESLPGLKVDDTVDWRQKGAVTKVKNQEQCGSCWAFSAT 121

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
            ++EG + + T  L+SLSEQ LVDC R+  N GC GG MD AF++I  NGG+D+E+ Y Y
Sbjct: 122 GSLEGQHFLKTNNLVSLSEQNLVDCSRREGNKGCKGGSMDQAFKYIKMNGGIDTEECYSY 181

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESG 289
            G +      + +    ++  Y D+   DEM+L +AV+   P+SVAI+AG ++FQ Y  G
Sbjct: 182 RGRDESMCRYKSSCSGATLSSYTDIKTGDEMALMQAVSTVGPISVAIDAGHKSFQLYHHG 241

Query: 290 VF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           V+   +C S  LDHGV+AVGYG+ NG DYWLV+NSWG++WG  GY+ + RN       +C
Sbjct: 242 VYDEPKCSSTHLDHGVLAVGYGSSNGSDYWLVKNSWGTEWGMEGYIMMSRN----KHNQC 297

Query: 348 GIAMEASYPV 357
           GIA  A YPV
Sbjct: 298 GIATRAIYPV 307


>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
          Length = 338

 Score =  244 bits (623), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 141/325 (43%), Positives = 201/325 (61%), Gaps = 21/325 (6%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRF--QIFKDNLRFIDEHNSL----NRTYKVGLNK 99
           D +   +QT+  +H K  N +   E+RF  +IF +N   I +HN L      ++K+GLNK
Sbjct: 21  DVIKEEWQTFKMEHRK--NYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNK 78

Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSK-VASQRYACKAGDELPESVDWREKGAVNPVKD 158
           +AD+ + E++    G     ++ L   +      Y   A  ++P++VDWR+ GAV  VKD
Sbjct: 79  YADMLHHEFKETMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKD 138

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFII 217
           QG CGSCW+FS+  ++EG +    G L+SLSEQ LVDC  K  N GCNGGLMD AF++I 
Sbjct: 139 QGHCGSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIK 198

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVADQ-PVSVA 275
            NGG+D+E+ YPY G ++ C  ++  A V + D G+ D+   DE ++ KAVA   PV+VA
Sbjct: 199 DNGGVDTEKSYPYEGIDDSCHFNK--ATVGATDTGFVDIPQGDEEAMMKAVATMGPVAVA 256

Query: 276 IEAGGRAFQHYESGVFTG-ECGS-ALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGY 332
           I+A   +FQ Y  GV+    C S  LDHGV+ VGYGT+ +G DYWLV+NSWG+ WG+ GY
Sbjct: 257 IDASNESFQLYSEGVYNDPNCSSDNLDHGVLVVGYGTDKDGQDYWLVKNSWGTTWGDQGY 316

Query: 333 VKLQRNLLDTNTGKCGIAMEASYPV 357
           +K+ RN       +CGIA  +S+P 
Sbjct: 317 IKMARN----QDNQCGIATASSFPT 337


>gi|23110960|ref|NP_001324.2| cathepsin L2 preproprotein [Homo sapiens]
 gi|320118898|ref|NP_001188504.1| cathepsin L2 preproprotein [Homo sapiens]
 gi|12644075|sp|O60911.2|CATL2_HUMAN RecName: Full=Cathepsin L2; AltName: Full=Cathepsin U; AltName:
           Full=Cathepsin V; Flags: Precursor
 gi|3107915|dbj|BAA25909.1| cathepsin V [Homo sapiens]
 gi|3228672|gb|AAC23598.1| cathepsin U [Homo sapiens]
 gi|3869129|dbj|BAA34365.1| cathepsin L2 [Homo sapiens]
 gi|23958123|gb|AAH23504.1| CTSL2 protein [Homo sapiens]
 gi|37182404|gb|AAQ89004.1| cathepsin L2 [Homo sapiens]
 gi|83405150|gb|AAI10513.1| Cathepsin L2 [Homo sapiens]
 gi|119579235|gb|EAW58831.1| cathepsin L2, isoform CRA_a [Homo sapiens]
 gi|119579236|gb|EAW58832.1| cathepsin L2, isoform CRA_a [Homo sapiens]
          Length = 334

 Score =  244 bits (623), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 147/328 (44%), Positives = 198/328 (60%), Gaps = 29/328 (8%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHNSL----NRTYKVG 96
           + D  + T +  W A H +     G NE+  R  +++ N++ I+ HN         + + 
Sbjct: 20  KFDQNLDTKWYQWKATHRRL---YGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMA 76

Query: 97  LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
           +N F D+TNEE+R M +G   + K R  K KV  +        +LP+SVDWR+KG V PV
Sbjct: 77  MNAFGDMTNEEFRQM-MGCFRNQKFR--KGKVFREPLFL----DLPKSVDWRKKGYVTPV 129

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
           K+Q  CGSCWAFS   A+EG     TG+L+SLSEQ LVDC R + N GCNGG M  AFQ+
Sbjct: 130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 189

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
           + +NGG+DSE+ YPY+  +  C     N+ V +  G+  V+P  E +L KAVA   P+SV
Sbjct: 190 VKENGGLDSEESYPYVAVDEICKYRPENS-VANDTGFTVVAPGKEKALMKAVATVGPISV 248

Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
           A++AG  +FQ Y+SG+ F  +C S  LDHGV+ VGYG E    N   YWLV+NSWG +WG
Sbjct: 249 AMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWG 308

Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
            NGYVK+ +   D N   CGIA  ASYP
Sbjct: 309 SNGYVKIAK---DKNN-HCGIATAASYP 332


>gi|3087790|emb|CAA75029.1| cathepsin L2 [Homo sapiens]
          Length = 334

 Score =  244 bits (623), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 147/328 (44%), Positives = 198/328 (60%), Gaps = 29/328 (8%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHNSL----NRTYKVG 96
           + D  + T +  W A H +     G NE+  R  +++ N++ I+ HN         + + 
Sbjct: 20  KFDQNLDTKWYQWKATHRRL---YGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMA 76

Query: 97  LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
           +N F D+TNEE+R M +G   + K R  K KV  +        +LP+SVDWR+KG V PV
Sbjct: 77  MNAFPDMTNEEFRQM-MGCFRNQKFR--KGKVFREPLFL----DLPKSVDWRKKGYVTPV 129

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
           K+Q  CGSCWAFS   A+EG     TG+L+SLSEQ LVDC R + N GCNGG M  AFQ+
Sbjct: 130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 189

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
           + +NGG+DSE+ YPY+  +  C     N+ V +  G+  V+P  E +L KAVA   P+SV
Sbjct: 190 VKENGGLDSEESYPYVAVDEICKYRPENS-VANDTGFTVVAPGKEKALMKAVATVGPISV 248

Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
           A++AG  +FQ Y+SG+ F  +C S  LDHGV+ VGYG E    N   YWLV+NSWG +WG
Sbjct: 249 AMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWG 308

Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
            NGYVK+ +   D N   CGIA  ASYP
Sbjct: 309 SNGYVKIAK---DKNN-HCGIATAASYP 332


>gi|67605684|ref|XP_666697.1| cryptopain precursor [Cryptosporidium hominis TU502]
 gi|54657738|gb|EAL36466.1| cryptopain precursor [Cryptosporidium hominis]
          Length = 401

 Score =  244 bits (623), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 145/369 (39%), Positives = 205/369 (55%), Gaps = 25/369 (6%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDD-------EVMTIYQTWLAKHG 60
           + I+TL+ +F +       +S+   +N  D    +   D       E    ++ +  K+ 
Sbjct: 38  IIIATLIAIFIVL---VVTVSLYITNNTSDKIDDFVPGDYVDPATREYRKSFEEFKKKYN 94

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           KT + M    +RF+I+K N+ FI   NS   +Y + +N+F DL+ EE+ A + G   D+K
Sbjct: 95  KTYSSMEEENQRFEIYKQNMNFIKTTNSQGFSYVLEMNEFGDLSKEEFMARFTGYIKDSK 154

Query: 121 --RRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
              R+ KS   S   A +  +E   P S++W E G VNP+++Q +CGSCWAFS VAA+EG
Sbjct: 155 DDERVFKSSRVS---ASELEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAALEG 211

Query: 177 INKIVTGE-LISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE 234
                T   L SLSEQ+ VDC ++  N GC+GG M  AFQ+ I+N  + +  DYPY   E
Sbjct: 212 ATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQYAIKNKYLCTNDDYPYFAEE 271

Query: 235 NKC-DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFT 292
             C D    N   + +  Y+ V P +  +LK A+A   P+SVAI+A    FQ Y+SGVF 
Sbjct: 272 KTCMDSFCENYIEIPVKAYKYVFPRNINTLKTALAKYGPISVAIQADQTPFQFYKSGVFD 331

Query: 293 GECGSALDHGVVAVGYGTENGV--DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
             CG+ ++HGVV VGY  +     +YWLVRNSWG  WGE GY+KL   L     G CGI 
Sbjct: 332 APCGTKVNHGVVLVGYDMDEDTNKEYWLVRNSWGEAWGEKGYIKLA--LHSGKKGTCGIL 389

Query: 351 MEASYPVKN 359
           +E  YPV N
Sbjct: 390 VEPVYPVIN 398


>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
          Length = 360

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 139/310 (44%), Positives = 185/310 (59%), Gaps = 28/310 (9%)

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
           ++GK    +   ++RF++F DNL+ I  HN    +YK+G+N+F DLT +E+R   LG   
Sbjct: 67  RYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQ 126

Query: 118 D----AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
           +     K  L  + V            LPE+  WRE G V+PVK+QG CGSCW FST  A
Sbjct: 127 NCSATTKGNLKVTNVV-----------LPETKGWREAGIVSPVKNQGKCGSCWTFSTTGA 175

Query: 174 VEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
           +E       G+ ISLSEQ+LVDC     N GCNGGL   AF++I  NGG+D+E+ YPY G
Sbjct: 176 LEAAYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTG 235

Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVF 291
               C  S  N  V  ID   +++   E  LK AVA  +PVS+A E   + F+ Y+SGV+
Sbjct: 236 KNGLCKFSSENVGVKVIDSV-NITLGAEDELKYAVALVRPVSIAFEV-IKGFKQYKSGVY 293

Query: 292 TG-ECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           T  ECG+    ++H V+AVGYG ENGV YWL++NSWG+DWG+NGY K++          C
Sbjct: 294 TSTECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKME-----MGKNMC 348

Query: 348 GIAMEASYPV 357
           GIA  ASYPV
Sbjct: 349 GIATCASYPV 358


>gi|114625736|ref|XP_001153919.1| PREDICTED: cathepsin L2 isoform 2 [Pan troglodytes]
 gi|114625742|ref|XP_520130.2| PREDICTED: cathepsin L2 isoform 5 [Pan troglodytes]
          Length = 334

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 147/328 (44%), Positives = 198/328 (60%), Gaps = 29/328 (8%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHNSL----NRTYKVG 96
           + D  + T +  W A H +     G NE+  R  +++ N++ I+ HN         + + 
Sbjct: 20  KFDQNLDTKWYQWKATHRRL---YGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMA 76

Query: 97  LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
           +N F D+TNEE+R M +G   + K R  K KV  +        +LP+SVDWR+KG V PV
Sbjct: 77  MNAFGDMTNEEFRQM-MGCFRNQKFR--KGKVFREPLFL----DLPKSVDWRKKGYVTPV 129

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
           K+Q  CGSCWAFS   A+EG     TG+L+SLSEQ LVDC R + N GCNGG M  AFQ+
Sbjct: 130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 189

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
           + +NGG+DSE+ YPY+  +  C     N+ V +  G+  V+P  E +L KAVA   P+SV
Sbjct: 190 VKENGGLDSEESYPYVAMDEICKYRPENS-VANDTGFTVVTPGKEKALMKAVATVGPISV 248

Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
           A++AG  +FQ Y+SG+ F  +C S  LDHGV+ VGYG E    N   YWLV+NSWG +WG
Sbjct: 249 AMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWG 308

Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
            NGYVK+ +   D N   CGIA  ASYP
Sbjct: 309 SNGYVKIAK---DKNN-HCGIATAASYP 332


>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 194/323 (60%), Gaps = 27/323 (8%)

Query: 47  EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFAD 102
           E+ + +Q +L  HGK   G     +R  I++ NL +I++HN      + ++ +G+N++ D
Sbjct: 22  ELDSEWQLYLKAHGKQY-GAEEEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGD 80

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           +TNEE+R+    T +  K R   S+ +        GD LP++VDWR KG V P+K+QG C
Sbjct: 81  MTNEEFRS----TMNGYKMRNGTSRGSLYLPPSNIGD-LPDTVDWRPKGYVTPIKNQGQC 135

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
           GSCW+FS   ++EG     TG+L SLSEQ LVDC +K  N GC GGLMD AFQ+I  N G
Sbjct: 136 GSCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSG 195

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSI--DGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
           +D+E  YPY     KC   R NA  V     G+ D+    E  L+ AVA   P+SVAI+A
Sbjct: 196 IDTESSYPYEAKNGKC---RFNAANVGATDSGFTDIKSKSESDLQSAVATVGPISVAIDA 252

Query: 279 GGRAFQHYESGV----FTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
              +FQ Y SGV    F  E  + LDHGV+AVGYGTE+G DYWLV+NSWG  WG+ GY+ 
Sbjct: 253 SHMSFQLYRSGVYHEFFCSE--TRLDHGVLAVGYGTESGKDYWLVKNSWGESWGQKGYIM 310

Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
           + RN  +     CGIA  ASYP 
Sbjct: 311 MSRNKRNN----CGIATSASYPT 329


>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
          Length = 343

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 141/324 (43%), Positives = 189/324 (58%), Gaps = 18/324 (5%)

Query: 51  IYQTWL---AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFADL 103
           + Q W+    +H K          R +I+  N   I +HN    L + TY++ +NK+ D+
Sbjct: 24  VNQEWINFKMEHKKCYKHEAEERLRMKIYMKNKLQIAQHNCDYELKKVTYRLKINKYGDM 83

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKV-ASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
            N E++ M  G        L   ++     +      ELP+ VDWR+ GAV  VKDQG C
Sbjct: 84  LNHEFKNMLNGYNRTINHTLRNERLPVGAAFIEPCNVELPKMVDWRKCGAVTEVKDQGHC 143

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
           GSCWAFS   ++EG +   TG L+SLSEQ L+DC     N GCNGGLMD AF +I  N G
Sbjct: 144 GSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDCSGSYGNNGCNGGLMDQAFSYIKDNKG 203

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGG 280
           +D+E+ YPY G ++KC   +R++    + G+ D+   DE  LK AVA   PVSVAI+A  
Sbjct: 204 LDTEKTYPYEGEDDKCRYDKRSSGASDV-GFVDIPVGDEQKLKAAVATVGPVSVAIDASH 262

Query: 281 RAFQHYESGV-FTGECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQR 337
           ++FQ Y  G+ F  EC S  LDHGV+ VGYGT E G DYW+V+NSWG  WGE GY+K+ R
Sbjct: 263 QSFQFYSDGIYFEPECSSTNLDHGVLVVGYGTDEEGRDYWIVKNSWGESWGEKGYIKMAR 322

Query: 338 NLLDTNTGKCGIAMEASYPVKNSQ 361
           N+       CGIA  ASYP+  S+
Sbjct: 323 NI----DNHCGIASSASYPIVGSR 342


>gi|397499865|ref|XP_003820654.1| PREDICTED: cathepsin L2 isoform 1 [Pan paniscus]
 gi|397499867|ref|XP_003820655.1| PREDICTED: cathepsin L2 isoform 2 [Pan paniscus]
          Length = 334

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 145/328 (44%), Positives = 197/328 (60%), Gaps = 29/328 (8%)

Query: 43  RTDDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHNSL----NRTYKVG 96
           + D  + T +  W A H +     G NE+  R  +++ N++ I+ HN         + + 
Sbjct: 20  KFDQNLDTKWYQWKATHRRL---YGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMA 76

Query: 97  LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
           +N F D+TNEE+R M +G   + K R  K KV  +        +LP+SVDWR+KG V PV
Sbjct: 77  MNAFGDMTNEEFRQM-MGCFRNQKFR--KGKVFREPLFL----DLPKSVDWRKKGYVTPV 129

Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
           K+Q  CGSCWAFS   A+EG     TG+L+SLSEQ LVDC R + N GCNGG M  AFQ+
Sbjct: 130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 189

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
           + +NGG+DSE+ YPY+  +  C     N+ V +  G+  V+P  E +L KAVA   P+SV
Sbjct: 190 VKENGGLDSEESYPYVAMDEICKYRPENS-VANDTGFTVVTPGKEKALMKAVATVGPISV 248

Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
           A++AG  +FQ Y+SG+ F  +C S  LDHGV+ VGYG E    N   YWLV+NSWG +WG
Sbjct: 249 AMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWG 308

Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
            NGYVK+ ++        CGIA  ASYP
Sbjct: 309 SNGYVKIAKD----KKNHCGIATAASYP 332


>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
          Length = 341

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 138/298 (46%), Positives = 184/298 (61%), Gaps = 20/298 (6%)

Query: 72  RFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           R ++F DN   I  HN L +    +Y++ +N F DL + E+     G R  + RR+   +
Sbjct: 51  RMKVFMDNKHKIARHNKLFQNGEVSYELEMNHFGDLLHHEFVKTVNGYRH-SLRRVTGDE 109

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
           + S  +       +P+SVDWR +GAV  VK+QG CGSCWAFST  ++EG +   T +L S
Sbjct: 110 IDSVTFIPAYNVTVPDSVDWRTEGAVTEVKNQGQCGSCWAFSTTGSLEGQHFRNTKQLTS 169

Query: 188 LSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC--DPSRRNA 244
           LSEQ L+DC  K  N GC+GGLMD AF +I  N G+D+EQ YPY G ++KC   P    A
Sbjct: 170 LSEQNLIDCSGKYGNNGCSGGLMDNAFAYIKSNKGIDTEQSYPYEGIDDKCRYKPQESGA 229

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGE-CGSA---L 299
              +  G+ D+   DE  LK AVA   P+SVAI+A  ++FQ Y+ GV+  + CG+    L
Sbjct: 230 ---TDKGFVDIPQGDEEKLKLAVATVGPISVAIDASHQSFQFYKKGVYYDKGCGNGEEDL 286

Query: 300 DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           DHGV+AVGYGTENG DYWLV+NSWG  WG +GY+K+ RN        CGIA  ASYP+
Sbjct: 287 DHGVLAVGYGTENGKDYWLVKNSWGKRWGLDGYIKMARN----KHNHCGIATSASYPL 340


>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
          Length = 361

 Score =  244 bits (622), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 147/371 (39%), Positives = 213/371 (57%), Gaps = 31/371 (8%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
           MATAS  LA+  L     + + +A     I+                ++  ++ W A++ 
Sbjct: 3   MATASASLALVMLFACSLLLAGTAFSDDTIAIP--------------LLERFKAWQAEYN 48

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSD 118
           +T       ++RF ++ +NLRFI   N L+   +Y++G N+F DLT EE++  YL    +
Sbjct: 49  RTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDE 108

Query: 119 --AKRRLMKSKVASQRYACKA-GD---ELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
                  M   V +   A  + GD   E P SVDWR KGAV PVK+Q  CGSCWAF+TVA
Sbjct: 109 QPPAAEAMPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVA 168

Query: 173 AVEGINKIVTGELISLSEQELVDCDRKIN-AGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
           ++EG+++I TG L+SLSEQE+VDCDR  N  GC GG    A +++ +NGG+ +E DYPY+
Sbjct: 169 SIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGGLTTESDYPYV 228

Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
           G++ +C   +       I GY+ V   +E  L++AVA +PV+V I+A  RAFQ Y+ GVF
Sbjct: 229 GSQRQCMSGKLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDA-SRAFQFYKRGVF 287

Query: 292 TGECG-SALDHGVVAVGYGTENGV-----DYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           +G C  + ++H V  VGYG+          YW+V+NSWG  WGENGYV++ R +     G
Sbjct: 288 SGPCNTTTVNHAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRARE-G 346

Query: 346 KCGIAMEASYP 356
            C IA+E   P
Sbjct: 347 MCAIAIEPLLP 357


>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
          Length = 332

 Score =  243 bits (621), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 143/322 (44%), Positives = 188/322 (58%), Gaps = 25/322 (7%)

Query: 48  VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADL 103
           V++ +++W   HGK+       + R +I  +N   I  HN+       +Y + +N + DL
Sbjct: 23  VLSDWESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYGDL 82

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
            + E+ AM  G     K  L  S + S+        +LP  VDWRE GAV PVK+QG CG
Sbjct: 83  LHHEFVAMVNGYEYVNKTSLGGSFIPSKNV------KLPTHVDWREDGAVTPVKNQGQCG 136

Query: 164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGM 222
           SCWAFS+  ++EG     TG+LI LSEQ LVDC RK  N GC GGLMD+AF +I  N G+
Sbjct: 137 SCWAFSSTGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGI 196

Query: 223 DSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
           D+E  YPY G   +C  DPS++ +  +   G+ DV    E  L KAVA   PVSVAI+A 
Sbjct: 197 DTEGSYPYEGVGGRCHYDPSKKGSSDI---GFVDVKKGSEEELLKAVASVGPVSVAIDAS 253

Query: 280 GRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKL 335
             +FQ Y  GV F  +C    LDHGV+ VGYGT+  +G DYWLV+NSW  +WG+ GY+K+
Sbjct: 254 HMSFQFYSHGVYFESKCSPENLDHGVLVVGYGTDENSGEDYWLVKNSWSENWGDQGYIKM 313

Query: 336 QRNLLDTNTGKCGIAMEASYPV 357
            RN        CGIA  ASYPV
Sbjct: 314 ARN----KKNMCGIASSASYPV 331


>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
          Length = 335

 Score =  243 bits (621), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 146/329 (44%), Positives = 199/329 (60%), Gaps = 30/329 (9%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNK 99
           TD  +   + +W   H KT        +R  +++ NL+ I+ HN   SL + +Y++G+N+
Sbjct: 21  TDPALDNHWYSWKDWHKKTYAPKEEGWRRV-LWEKNLKMIEFHNLDHSLGKHSYRLGMNQ 79

Query: 100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
           F D+TNEE++ +  G ++       +  +    +      E P+SVDWR+KG V PVKDQ
Sbjct: 80  FGDMTNEEFKQLMNGYKN-------QKMIRGSTFLAPNNFEAPKSVDWRKKGYVTPVKDQ 132

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQ 218
           G CGSCWAFST  A+EG +   T +LISLSEQ LVDC R + N GCNGGLMD AFQ++  
Sbjct: 133 GQCGSCWAFSTTGALEGQHYRKTSKLISLSEQNLVDCSRAQGNEGCNGGLMDQAFQYVKD 192

Query: 219 NGGMDSEQDYPYLGAENK-C--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
           NGG+DSE  YPY   +++ C  DP+  +A      G+ DV    E  L KAVA   PVSV
Sbjct: 193 NGGIDSEDSYPYTAKDDQECHYDPNNNSANDT---GFVDVQSGCEKDLMKAVASVGPVSV 249

Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
           AI+AG ++FQ Y+SG+ +  EC S  LDHGV+ VGYG E    +G  YW+V+NSW   WG
Sbjct: 250 AIDAGHQSFQFYQSGIYYEPECSSEDLDHGVLVVGYGFESEDVDGKKYWIVKNSWSEKWG 309

Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           +NGY+    N+       CGIA  ASYP+
Sbjct: 310 DNGYI----NIAKDRHNHCGIATAASYPL 334


>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
          Length = 344

 Score =  243 bits (621), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 141/319 (44%), Positives = 194/319 (60%), Gaps = 27/319 (8%)

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEY----- 108
           +H K  +       R +I+  N   I +HN         +++ +NK+ADL +EE+     
Sbjct: 33  QHRKKYDSESEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLN 92

Query: 109 ---RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
              R+   G++   + +LM  +     +   A  ++P ++DWREKGAV PVKDQG CGSC
Sbjct: 93  GFNRSAAAGSKLLGREQLMTIE-EPITWIEPANVDVPTTIDWREKGAVTPVKDQGHCGSC 151

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
           W+FS   A+EG +   TG+L+SLSEQ LVDC  K  N GCNGGLMD AFQ++  N G+D+
Sbjct: 152 WSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYVKDNKGIDT 211

Query: 225 EQDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGR 281
           E+ YPY   +++C     N K +  +  G+ D+   DE +LKKA+A   PVSVAI+A   
Sbjct: 212 EKAYPYEAIDDEC---HYNPKAIGATDKGFVDIPQGDEKALKKALATVGPVSVAIDASHE 268

Query: 282 AFQHYESGV-FTGECGS-ALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRN 338
           +FQ Y  GV +  +C S  LDHGV+AVGYG TE+G DYWLV+NSWG+ WG+ GYVK+ RN
Sbjct: 269 SFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTEDGEDYWLVKNSWGTTWGDQGYVKMARN 328

Query: 339 LLDTNTGKCGIAMEASYPV 357
                   CGIA  ASYP+
Sbjct: 329 ----RENHCGIATTASYPL 343


>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
          Length = 337

 Score =  243 bits (621), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 141/324 (43%), Positives = 196/324 (60%), Gaps = 25/324 (7%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFA 101
           DE   ++++W +K+ +     G    R  +++ NL+ I+ HN   SL + +Y +G+N F 
Sbjct: 26  DEHWDLWKSWHSKNYQHEKEEGW---RRMVWEKNLKKIEMHNLEHSLGKHSYSLGMNHFG 82

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
           D+TNEE+R +  G +      L + K     +      E P+ VDWRE+G V PVKDQG 
Sbjct: 83  DMTNEEFRQVMNGYK------LQQRKFKGSLFLEPNNMEAPKQVDWREEGYVTPVKDQGQ 136

Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNG 220
           CGSCWAFST  A+EG     T +L+SLSEQ LVDC R + N GCNGGLMD AFQ+I  N 
Sbjct: 137 CGSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQDNS 196

Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAG 279
           G+DSE+ YPYLG +++    +      +  G+ D+    E +L KA+A   PVSVAI+AG
Sbjct: 197 GLDSEEAYPYLGTDDQPCNYKAEFSAANDTGFMDIPSGKEHALMKAIASVGPVSVAIDAG 256

Query: 280 GRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYV 333
             +FQ Y+SG+ +  EC S  LDHGV+AVGYG E    +G  YW+V+NSW   WG+ GY+
Sbjct: 257 HESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYI 316

Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
            + ++        CGIA  ASYP+
Sbjct: 317 LMAKD----RKNHCGIATAASYPL 336


>gi|328872971|gb|EGG21338.1| cysteine proteinase 5 precursor [Dictyostelium fasciculatum]
          Length = 358

 Score =  243 bits (621), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 138/349 (39%), Positives = 199/349 (57%), Gaps = 38/349 (10%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLN 98
           S+S  ++ +    +  W+ KH ++      N  R+ ++K N+ +++E NS      +GLN
Sbjct: 17  SASSYSEQQYRDSFTNWMQKHSRSYASHEFN-TRYSVYKKNMDYVNEWNSKGSETVLGLN 75

Query: 99  KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
             AD+TN+EY+A+YLGT++DA  RL  +  ++     K    LP S+DW  +GAV  VK+
Sbjct: 76  SLADMTNQEYQAIYLGTKTDATARLAAASASASF--GKVQGALPASIDWVAQGAVTQVKN 133

Query: 159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFII 217
           QG CGSCW+FS   + EG ++I T  L++LSEQ L+DC     N GCNGGLMD AF++II
Sbjct: 134 QGQCGSCWSFSATGSTEGAHQISTSNLVALSEQNLIDCSSSYGNDGCNGGLMDNAFKYII 193

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
            NGG+D+E  YPY+    KC  +  N+   ++  Y DV+   E +L+      PVSVAI+
Sbjct: 194 ANGGIDTEASYPYVAKVQKCKYNPANSG-ATLSSYVDVTSGSESALQSQTVKGPVSVAID 252

Query: 278 AGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGT------------------------- 310
           A  ++FQ Y+SGV +   C S  LDHGV+ VGYGT                         
Sbjct: 253 ASHQSFQLYDSGVYYEPACSSTNLDHGVLVVGYGTASANGSSDSDSSAASQSSSSESSDD 312

Query: 311 --ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
               G  +W V+NSWG +WG +GY+++ RN  D N   CGIA  AS P+
Sbjct: 313 QATQGAQFWKVKNSWGPEWGLSGYIQMARN-RDNN---CGIATTASQPI 357


>gi|357167707|ref|XP_003581294.1| PREDICTED: actinidain-like [Brachypodium distachyon]
          Length = 358

 Score =  243 bits (621), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 135/319 (42%), Positives = 194/319 (60%), Gaps = 19/319 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
           ++ W+A+ G++    G   +R ++F  N R +D  N   NRTY +GLN+F+DLT+ E+  
Sbjct: 42  HERWMARFGRSYTDAGEKARRQEVFGANARHVDAVNRAGNRTYTLGLNQFSDLTDHEFLQ 101

Query: 111 MYLG-TRSDAKRRLM--KSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
            +LG  R   +R L+  + +V  +  A   G ++P SVDWR KGAV  +K+Q SCGSCWA
Sbjct: 102 QHLGYGRHHGQRGLLLPEEEVMPKATALGYGQDMPYSVDWRAKGAVTEIKNQRSCGSCWA 161

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDC--DRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
           F+ VAA EG+ KI TG LIS+SEQ+++DC  DR   + C+ G +  A ++++ +GG+  E
Sbjct: 162 FAAVAATEGLVKIATGNLISMSEQQVLDCTGDR---SSCDSGYISDALRYVVTSGGLQRE 218

Query: 226 QDYPYLGAENKCDPSRRNAK---VVSIDGYEDVS-PFDEMSLKKAVADQPVSVAIEAGGR 281
             Y Y G +  C  SRR A+     S+ G    +   DE +L+   A QPV+V +EA   
Sbjct: 219 AAYAYTGQKGACG-SRRFARPNSAASVGGVHMATLNGDEGALQGLAARQPVAVIVEASEP 277

Query: 282 AFQHYESGVFTG--ECGSALDHGVVAVGYGTENGV-DYWLVRNSWGSDWGENGYVKLQRN 338
            F+HY SGV+ G   CG  L+H +  VGYGTENG  +YWLV+N WG+ WGENGY+++ R 
Sbjct: 278 DFRHYSSGVYAGSASCGRELNHALTVVGYGTENGAGEYWLVKNQWGTWWGENGYMRVARR 337

Query: 339 LLDTNTGKCGIAMEASYPV 357
             +     CGIA  A YP 
Sbjct: 338 --NGAGANCGIASVAFYPT 354


>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
 gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
          Length = 330

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 136/296 (45%), Positives = 184/296 (62%), Gaps = 19/296 (6%)

Query: 72  RFQIFKDNLRFIDEHN----SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           R  IF+DNL+ I+ HN    +   +Y +G+N+FAD+T+ EY    +G             
Sbjct: 43  RRLIFQDNLKTIESHNQEADTGKHSYWLGVNQFADMTHAEYLNQVIGGCLITSNLTKTGS 102

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
            A+ RY      ++ ++VDWR+KG V  +KDQG CGSCWAFST  ++EG +   TG L+S
Sbjct: 103 RATYRYM--PNMQVNDTVDWRDKGLVTDIKDQGQCGSCWAFSTTGSLEGQHAKATGTLVS 160

Query: 188 LSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC--DPSRRNA 244
           LSEQ LVDC R+  N GC GG MD  FQ+IIQN G+D+EQ YPY    ++C  D S   A
Sbjct: 161 LSEQNLVDCSRQEGNKGCEGGDMDQGFQYIIQNKGIDTEQCYPYKAKNHRCKFDNSCIGA 220

Query: 245 KVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTG-ECGSA-LDH 301
            + S   + DV+  DE +LK+A A+  P+SV I+A  ++FQ Y SGV+   EC S  LDH
Sbjct: 221 TMSS---FTDVTSGDEDALKQACANIGPISVGIDASHQSFQFYSSGVYNEFECSSTKLDH 277

Query: 302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GV+ VGYGT    DYWLV+NSWG+ WG  GY+ + RN       +CG+A +AS+PV
Sbjct: 278 GVLVVGYGTYGSKDYWLVKNSWGTVWGNEGYIMMSRN----KDNQCGVATDASFPV 329


>gi|403368476|gb|EJY84073.1| Cathepsin L [Oxytricha trifallax]
          Length = 338

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 139/310 (44%), Positives = 188/310 (60%), Gaps = 21/310 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEY-R 109
           +  ++AK+GK+       + R ++FK NL  +  +N  N  TY++GLNKFAD T  EY R
Sbjct: 43  FTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNVRNDVTYRLGLNKFADYTEAEYKR 102

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
            +  G + +   R +K   A +           + V+W E+GAV PVKDQG CGSCW+FS
Sbjct: 103 LLGFGGQKNKNPRNIKVLGAPKN----------DGVNWVEQGAVTPVKDQGQCGSCWSFS 152

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
              A+EG  KI  G L SLSEQ+LVDC + + N GC GG MD AFQ++ Q   +++E  Y
Sbjct: 153 ATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVEQT-ALETEDQY 211

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
           PY   ++ C  S  +A VV +D + DV+P +   LK A+   PVSVAIEA    FQ Y  
Sbjct: 212 PYEAVDDTCRAS--SAGVVKVDSFVDVTPNNVNELKAALDKGPVSVAIEADQMVFQFYSG 269

Query: 289 GVFT-GECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
           GV     CG+ LDHGV+AVGYG E+G DY+LV+NSWG+ WGE GYVK+  +  +     C
Sbjct: 270 GVINDASCGTTLDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGYVKIAASPDNI----C 325

Query: 348 GIAMEASYPV 357
           GI  +ASYP+
Sbjct: 326 GILSQASYPI 335


>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 144/323 (44%), Positives = 195/323 (60%), Gaps = 27/323 (8%)

Query: 47  EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFAD 102
           E+ + +Q +L  HGK   G     +R  I++ NL +I++HN      + ++ +G+N++ D
Sbjct: 22  ELDSEWQLYLKAHGKQY-GAEEEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGD 80

Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
           +TNEE+R+    T +  K R   S+ +        GD LP++VDWR KG V P+K+QG C
Sbjct: 81  MTNEEFRS----TMNGYKMRNGTSRGSLYLPPSNIGD-LPDTVDWRPKGYVTPIKNQGQC 135

Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
           GSCW+FS   ++EG     TG+L SLSEQ LVDC +K  N GC GGLMD AFQ+I  N G
Sbjct: 136 GSCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNG 195

Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSI--DGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
           +D+E  YPY     KC   R NA  V     G+ D+    E  L+ AVA   P++VAI+A
Sbjct: 196 IDTESSYPYEAKNGKC---RFNAANVGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDA 252

Query: 279 GGRAFQHYESGV----FTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
              +FQ Y+SGV    F  E  + LDHGV+AVGYGTE+G DYWLV+NSWG  WG+ GY+ 
Sbjct: 253 SHMSFQLYKSGVYHEFFCSE--TRLDHGVLAVGYGTESGKDYWLVKNSWGESWGQKGYIM 310

Query: 335 LQRNLLDTNTGKCGIAMEASYPV 357
           + RN  +     CGIA  ASYP 
Sbjct: 311 MSRNKRNN----CGIATSASYPT 329


>gi|47522698|ref|NP_999057.1| cathepsin L1 precursor [Sus scrofa]
 gi|2499874|sp|Q28944.1|CATL1_PIG RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
           heavy chain; Contains: RecName: Full=Cathepsin L1 light
           chain; Flags: Precursor
 gi|1468964|dbj|BAA07140.1| porcine cathepsin L [Sus scrofa]
 gi|15027272|emb|CAC44793.1| cathepsin L [Sus scrofa]
          Length = 334

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 143/331 (43%), Positives = 197/331 (59%), Gaps = 25/331 (7%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
           S++ + D  +   +  W A HG+   GM     R  +++ N++ I+ HN         + 
Sbjct: 16  SAAPKLDQNLDADWYKWKATHGRLY-GMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFS 74

Query: 95  VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
           + +N F D+TNEE+R +  G ++   +   K KV  +        E+P+SVDWREKG V 
Sbjct: 75  MAMNAFGDMTNEEFRQVMNGFQNQKHK---KGKVFHESLVL----EVPKSVDWREKGYVT 127

Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
            VK+QG CGSCWAFS   A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AF
Sbjct: 128 AVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAF 187

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
           Q++  NGG+D+E+ YPYLG E      +      +  G+ D+ P  E +L KAVA   P+
Sbjct: 188 QYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDI-PQREKALMKAVATVGPI 246

Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
           SVAI+AG  +FQ Y+SG+ +  +C S  LDHGV+ VGYG E    N   +W+V+NSWG +
Sbjct: 247 SVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPE 306

Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           WG NGYVK+ +   D N   CGI+  ASYP 
Sbjct: 307 WGWNGYVKMAK---DQNN-HCGISTAASYPT 333


>gi|357153071|ref|XP_003576329.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 398

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 139/381 (36%), Positives = 207/381 (54%), Gaps = 37/381 (9%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDE----VMTIYQTWLAKHGKTS 63
           + + T+  +  + SSS  ++   + D+   HS     D      +M  +Q W+A  G++ 
Sbjct: 16  VVLVTICQMLAVGSSS--ELMPPTTDDEMIHSDYSGRDKHNDLLMMGRFQGWMAAQGRSY 73

Query: 64  NGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
                  +RF+++K N+R+I+  N    +   T+++G   F DLT+EE+ A+Y G+    
Sbjct: 74  WTAEETARRFEVYKSNVRYIEAVNAEAATTGLTFELGEGPFTDLTHEEFSALYNGSMPPP 133

Query: 120 KRRL------------------MKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKDQ 159
           +                     +   VA        G     P S DWR+ GAV P+KDQ
Sbjct: 134 EEEEGDDIQEEDEQVIATVVDGVDVNVAVHTNLSAGGPRPWPPRSRDWRKHGAVTPIKDQ 193

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQN 219
           G CGSCWAF TVA +EG +KIV G L+SLSEQ+L+DCD   N+GC GG +  A+++I + 
Sbjct: 194 GRCGSCWAFPTVATIEGKHKIVRGNLVSLSEQQLIDCD-YTNSGCKGGFVIRAYRWIRKI 252

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
           GG+ +   YPY GA  KC   +R      I G+  V    E++L  AVA QPV+V I A 
Sbjct: 253 GGLTTSSAYPYKGARGKC--MKRRRAAARIAGWRSVRSRSEVALVNAVAGQPVAVYISAS 310

Query: 280 GRAFQHYESGVFTGECGSA-LDHGVVAVGYG--TENGVDYWLVRNSWGSDWGENGYVKLQ 336
           G+ FQHY+ G+  G C +A L+H V  VGYG   + G  YW+V+NSWG+ WG+ GY+ ++
Sbjct: 311 GKNFQHYKKGILNGPCDTARLNHAVTVVGYGRQADTGAKYWIVKNSWGTTWGQEGYILMK 370

Query: 337 RNLLDTNTGKCGIAMEASYPV 357
           R   +   G+CGIA    +P+
Sbjct: 371 RGTRNPR-GQCGIATSPVFPL 390


>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
 gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
 gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
          Length = 333

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 141/331 (42%), Positives = 200/331 (60%), Gaps = 26/331 (7%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYK 94
           S++   D  +   +  W A H +   GM     R  +++ N++ I++HN   R    ++ 
Sbjct: 16  SATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIEQHNQEYREGKHSFT 74

Query: 95  VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
           + +N F D+T+EE+R +  G ++   R+  K KV  +    +A    P SVDWREKG V 
Sbjct: 75  MAMNAFGDMTSEEFRQVMNGFQN---RKPRKGKVFQEPLFYEA----PRSVDWREKGYVT 127

Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAF 213
           PVK+QG CGSCWAFS   A+EG     TG+L+SLSEQ LVDC   + N GCNGGLMDYAF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEGCNGGLMDYAF 187

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
           Q++  NGG+DSE+ YPY   E  C  + + + V +  G+ D+ P  E +L KAVA   P+
Sbjct: 188 QYVQDNGGLDSEESYPYEATEESCKYNPKYS-VANDTGFVDI-PKQEKALMKAVATVGPI 245

Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
           SVA++AG ++FQ Y+ G+ F  +C S  +DHGV+ VGYG E    +   YWLV+NSWG +
Sbjct: 246 SVAVDAGHQSFQFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEE 305

Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           WG  GY+K+ ++  +     CGIA  ASYP 
Sbjct: 306 WGMGGYIKMAKDRRN----HCGIASAASYPT 332


>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 360

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 133/327 (40%), Positives = 194/327 (59%), Gaps = 19/327 (5%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
           D  +M  +  W A H ++        +RFQ+++DN+ +I+  N   + TY++G N+FADL
Sbjct: 35  DMLMMDRFLMWQATHNQSYRSAEERLRRFQVYRDNVEYIETTNRRGDLTYQLGENQFADL 94

Query: 104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYA--------CKAGDEL---PESVDWREKGA 152
           T EE+ A +     D  R      V +               GD++   P SVDWR KGA
Sbjct: 95  TREEFIARFTSYNGDDDRTGDDDSVITTAAVGGGDPDLWSSGGDDVSLDPPSVDWRAKGA 154

Query: 153 VNPVKDQGSCGSC-WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDY 211
           V P K Q S  S  WAF  VA +E ++ I TG+L++LSEQ+LVDCD + + GCN G    
Sbjct: 155 VVPPKSQSSSCSSSWAFVAVATIESLHAIKTGKLVALSEQQLVDCD-QYDGGCNRGTFRR 213

Query: 212 AFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQP 271
           AF ++IQNGG+ +E +YPY  A+  C+ ++ +  V +I G+  V   +E+++K AVA QP
Sbjct: 214 AFHWVIQNGGLTTEAEYPYTAAQGTCNSAKSDHHVAAISGHASVPGSNELAMKHAVATQP 273

Query: 272 VSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGE 329
           V+ AIE G    Q Y+SGV++G CG+ L+H V  VGYG +   G  YW+V+NSWG  WGE
Sbjct: 274 VAAAIELGSD-MQFYKSGVYSGPCGARLEHAVTVVGYGADESTGDKYWIVKNSWGQTWGE 332

Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYP 356
            GY+++QR +L    G CGI ++ +YP
Sbjct: 333 RGYIRMQRKIL--GPGLCGIMLDVAYP 357


>gi|197258082|gb|ACH56225.1| cathepsin L-like cysteine proteinase [Bursaphelenchus xylophilus]
          Length = 282

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 132/275 (48%), Positives = 179/275 (65%), Gaps = 21/275 (7%)

Query: 92  TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ---RYACKAGDELPESVDWR 148
           ++K+G+N  ADL   EYR      R +  RR     +AS+   ++      E+P++VDWR
Sbjct: 19  SFKIGINHIADLPFAEYR------RLNGFRRTFGDNIASRNATKWRAPLNFEVPDAVDWR 72

Query: 149 EKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGG 207
           ++G V PVK+QG CGSCWAFS   ++EG +K  TG+L+SLSEQ LVDC     N GCNGG
Sbjct: 73  DEGYVTPVKNQGMCGSCWAFSATGSLEGQHKRATGKLVSLSEQNLVDCSADFGNNGCNGG 132

Query: 208 LMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAV 267
           LMD+AF+++ QN G+D+E+ YPY   + KC   + N       G+ D+   DE  LK AV
Sbjct: 133 LMDFAFEYVKQNHGIDTEESYPYKAKQKKCHFQKANVGADDT-GFVDLPEADEEQLKAAV 191

Query: 268 ADQ-PVSVAIEAGGRAFQHYESGVFTGECGSA--LDHGVVAVGYGT--ENGVDYWLVRNS 322
           A Q PVSVAI+AG R+F+ Y++GV+  +  S   LDHGV+ VGYGT  E+G DYW+V+NS
Sbjct: 192 ASQGPVSVAIDAGHRSFRLYKTGVYYEKHCSPEQLDHGVLVVGYGTDPEHG-DYWIVKNS 250

Query: 323 WGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           WG +WGE GYV++ RN        CGIA +ASYP+
Sbjct: 251 WGEEWGEKGYVRIARN----RNNHCGIASKASYPL 281


>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
          Length = 314

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 132/294 (44%), Positives = 182/294 (61%), Gaps = 16/294 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEE 107
           ++++ AK+GKT     +   R  I+      + EHN+       +YK+GLN FAD+ N E
Sbjct: 27  WESYKAKYGKTYESNENEAARRTIYFMAKEKVMEHNARFEQGLVSYKLGLNSFADMHNGE 86

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           +R M  G R    R  +   V S          LP SVDWR KGAV P+K+QG CGSCWA
Sbjct: 87  FRKMMNGYRRGTPRNSVVVHVESNI-------TLPASVDWRTKGAVTPIKNQGQCGSCWA 139

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FST  ++EG + +  G+L+SLSEQELVDC   + N GC+GGLMD AF +I +N G+D+EQ
Sbjct: 140 FSTTGSLEGQHALKKGKLVSLSEQELVDCSAAEGNDGCDGGLMDDAFTYIKKNNGIDTEQ 199

Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
            YPY G +  C   + +    ++ G+ DV+   E  L+ A A   P+SVAI+A    FQ 
Sbjct: 200 SYPYTGEDGTCSFKKSDV-AATVTGFVDVTSGSESGLQDASATIGPISVAIDASSWDFQL 258

Query: 286 YESGVF-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
           YESGV+   +C +  LDHGV+ VGYGT++G  YWLV+NSWG+DWG +GY+++ R
Sbjct: 259 YESGVYDVSDCSTTELDHGVLVVGYGTDDGTAYWLVKNSWGTDWGHHGYIQMSR 312


>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
 gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
          Length = 335

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 136/319 (42%), Positives = 196/319 (61%), Gaps = 25/319 (7%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SL-NRTYKVGLNKFADLTNEE 107
           + +W ++HGK+ +      +R  I+++NLR I++HN   SL N T+K+G+N+F D+TNEE
Sbjct: 28  WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDMTNEE 86

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           +R    G + D  R    +      +        P+ VDWR++G V PVKDQ  CGSCW+
Sbjct: 87  FRQAMNGYKQDPNRTSKGALFMEPSFFAA-----PQQVDWRQRGYVTPVKDQKQCGSCWS 141

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS+  A+EG     TG+LIS+SEQ LVDC R + N GCNGG+MD AFQ++ +N G+DSEQ
Sbjct: 142 FSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSEQ 201

Query: 227 DYPYLGAENKCDPSRRNAK--VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAF 283
            YPYL  ++   P R + +  V  I G+ D+   +E++L  AVA   PVSVAI+A  ++ 
Sbjct: 202 SYPYLARDDL--PCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAIDASHQSL 259

Query: 284 QHYESGVFTGE-CGSALDHGVVAVGYGTEN----GVDYWLVRNSWGSDWGENGYVKLQRN 338
           Q Y+SG++    C S LDH V+ VGYG +     G  YW+V+NSW   WG+ GY+ + ++
Sbjct: 260 QFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKD 319

Query: 339 LLDTNTGKCGIAMEASYPV 357
                   CGIA  ASYP+
Sbjct: 320 ----KNNHCGIATMASYPL 334


>gi|351721011|ref|NP_001238219.1| P34 probable thiol protease precursor [Glycine max]
 gi|1199563|gb|AAB09252.1| 34 kDa maturing seed vacuolar thiol protease precursor [Glycine
           max]
          Length = 379

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 135/345 (39%), Positives = 207/345 (60%), Gaps = 28/345 (8%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNK 99
           T  +V +++Q W ++HG+  +      KR +IFK+N  +I + N+ NR    ++++GLNK
Sbjct: 36  TQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNA-NRKSPHSHRLGLNK 94

Query: 100 FADLTNEEYRAMYLGTRSDAKR--RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           FAD+T +E+   YL    D  +  ++   K+  ++Y+C   D  P S DWR+KG +  VK
Sbjct: 95  FADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSC---DHPPASWDWRKKGVITQVK 151

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
            QG CG  WAFS   A+E  + I TG+L+SLSEQELVDC  + + G   G    +F++++
Sbjct: 152 YQGGCGRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGSYNGWQYQSFEWVL 210

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD-------EMSLKKAVADQ 270
           ++GG+ ++ DYPY   E +C  ++   K V+IDGYE +   D       E +   A+ +Q
Sbjct: 211 EHGGIATDDDYPYRAKEGRCKANKIQDK-VTIDGYETLIMSDESTESETEQAFLSAILEQ 269

Query: 271 PVSVAIEAGGRAFQHYESGVFTGE-CGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDW 327
           P+SV+I+A  + F  Y  G++ GE C S   ++H V+ VGYG+ +GVDYW+ +NSWG DW
Sbjct: 270 PISVSIDA--KDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYWIAKNSWGEDW 327

Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN---SAKPKPH 369
           GE+GY+ +QRN  +   G CG+   ASYP K       SA+ K H
Sbjct: 328 GEDGYIWIQRNTGNL-LGVCGMNYFASYPTKEESETLVSARVKGH 371


>gi|224069140|ref|XP_002326284.1| predicted protein [Populus trichocarpa]
 gi|118482340|gb|ABK93094.1| unknown [Populus trichocarpa]
 gi|222833477|gb|EEE71954.1| predicted protein [Populus trichocarpa]
          Length = 358

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 152/371 (40%), Positives = 208/371 (56%), Gaps = 30/371 (8%)

Query: 1   MATASMFLAISTLVFLFFISSSSAADMS----IISYDNNHDHSSSWRT---DDEVMTIYQ 53
           MA  +  +  S L  L  +++ S+ D S    ++S D  HD  SS+            + 
Sbjct: 1   MARVAGLVVSSILFLLCCVAAGSSFDESNPIKLVS-DRLHDFESSFVKVLGQSRRALSFA 59

Query: 54  TWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYL 113
            +  +HGK     G  + RF IF ++L  I   N     Y +GLN+FAD T +E++   L
Sbjct: 60  RFAHRHGKRYETEGEMKLRFAIFSESLDLIRSTNKKGLPYTLGLNQFADWTWQEFQKYRL 119

Query: 114 GTRSDAKRRLMKSKVASQRYACKAGDEL-PESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
           G   +          A+ R   K  + L PE+ DWRE+G V+PVK+QG CGSCW FST  
Sbjct: 120 GAAQNCS--------ATTRGNHKLTNALLPETKDWREEGIVSPVKNQGHCGSCWTFSTTG 171

Query: 173 AVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
           A+E       G+ ISLSEQ+LVDC R   N GCNGGL   AF++I  NGG+D+E+ YPY 
Sbjct: 172 ALEAAYHQAFGKGISLSEQQLVDCARAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYT 231

Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGV 290
           G ++ C  S  N  V  ++   +++   E  LK AVA  +PVSVA E  G +F+ Y+ GV
Sbjct: 232 GKDDACKFSSENVGVRVVESV-NITLGAEDELKHAVAFVRPVSVAFEVVG-SFRLYKEGV 289

Query: 291 F-TGECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGK 346
           + T  CGS    ++H V+AVGYG ENG+ YWL++NSWG DWG+NGY K++          
Sbjct: 290 YTTSTCGSTPMDVNHAVLAVGYGVENGIPYWLIKNSWGEDWGDNGYFKME-----MGKNM 344

Query: 347 CGIAMEASYPV 357
           CGIA  ASYPV
Sbjct: 345 CGIATCASYPV 355


>gi|300175452|emb|CBK20763.2| unnamed protein product [Blastocystis hominis]
          Length = 313

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 134/307 (43%), Positives = 184/307 (59%), Gaps = 18/307 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           + ++ A++GK          R ++F  N+ +  + NS +  Y VG   FAD+TN E+   
Sbjct: 23  FNSFEARYGKNYINAAERAFRQKVFAYNMEWAQKINSEDHPYTVGATPFADMTNTEFAV- 81

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
                S     ++K K+   + A    +   E+VDWREKGAV PVK+Q SCGSCWAFS  
Sbjct: 82  -----SKLCGCMLKPKMT--KPATPIMEPAAEAVDWREKGAVTPVKNQASCGSCWAFSAT 134

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
            A+EG N +  GELISLSEQ+LVDCD + ++GC GGLM YAF++  +  GM  E+DYPY 
Sbjct: 135 GAMEGRNFVANGELISLSEQQLVDCDHQ-SSGCGGGLMTYAFEY-AKKKGMCKEEDYPYH 192

Query: 232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
             +  C    +   VV   GYE+V  FD  +LK+AV+  PVSVA+EA    FQ Y  GV 
Sbjct: 193 AVDEDCK-DDKCTPVVFPKGYEEVPRFDGAALKQAVSQGPVSVAVEADSIVFQMYTGGVI 251

Query: 292 -TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
            +  CG++L+HGV+AVGY    G DYW+V+NSWG  WG+ GY+K++    ++  G CGI 
Sbjct: 252 DSSACGTSLNHGVLAVGY----GADYWIVKNSWGESWGDKGYLKIKYT--ESGAGICGIN 305

Query: 351 MEASYPV 357
              SYP 
Sbjct: 306 QMNSYPT 312


>gi|72005575|ref|XP_783218.1| PREDICTED: cathepsin L2-like isoform 2 [Strongylocentrotus
           purpuratus]
 gi|390337647|ref|XP_003724610.1| PREDICTED: cathepsin L2-like isoform 1 [Strongylocentrotus
           purpuratus]
          Length = 334

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 139/316 (43%), Positives = 196/316 (62%), Gaps = 20/316 (6%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLNKFADLTNEE 107
           ++ W+  HGK  + MG   +R  I++DNLR I +HN        TY++G+N+F D+TN E
Sbjct: 28  WKEWVDYHGKEYSAMGEEMERRMIWEDNLRIITKHNLEHSQGKTTYRLGMNEFGDMTNAE 87

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           + A    TR+  K   +        +      +LP+SVDWR +G V PVKDQG CGSCWA
Sbjct: 88  FVA----TRTMKKMSGVPKVGQGSTFLPSEFLQLPDSVDWRTEGYVTPVKDQGQCGSCWA 143

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FSTV A+EG + + TG L+SLSEQ LVDC + + N GCNGG   +A ++I  NGG+D+E 
Sbjct: 144 FSTVGALEGQHFVKTGTLVSLSEQNLVDCSQAEGNDGCNGGWPAWADEYIKSNGGIDTEV 203

Query: 227 DYPYLGAENKCDPSRRNAKV-VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQ 284
            YPY G ++ C    R + V  +I G+ +V    E +L+KA+A   P+SV I+A   +FQ
Sbjct: 204 GYPYEGVDDSC--HYRTSDVGATITGFAEVEADSEKALEKALAQVGPISVCIDATQPSFQ 261

Query: 285 HYESGVF-TGECGS-ALDHGVVAVGY-GTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
            YESGV+   +C S ALDH V AVGY  T +G  Y++V+NSWG+ WG+ GY+ + R+   
Sbjct: 262 LYESGVYDEPDCSSTALDHCVTAVGYDSTADGDKYYIVKNSWGTTWGQEGYIWMSRD--- 318

Query: 342 TNTGKCGIAMEASYPV 357
               +CGIA  A+YP+
Sbjct: 319 -KQKQCGIATNATYPL 333


>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
          Length = 358

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 146/364 (40%), Positives = 206/364 (56%), Gaps = 30/364 (8%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDE-VMTI---------YQTWLAKH 59
           + ++V +  I++S+AAD+     +     S   R  +E V+ I         +  +  ++
Sbjct: 7   LPSVVLVILIAASAAADIGFDESNPIRMVSDGLREIEESVVQILGQSRHVLSFARFTHRY 66

Query: 60  GKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
           GK        + RF IFK+NL  I   N    +YK+G+N+FADLT +E++   LG   + 
Sbjct: 67  GKKYQNAEEIKLRFSIFKENLDLIRSTNKKRLSYKLGVNQFADLTWQEFQRNKLGAAQNC 126

Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
              L  S   ++         LPE+ DWRE G V+PVKDQG CGSCW FST  A+E    
Sbjct: 127 SATLKGSHKLTEA-------ALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYH 179

Query: 180 IVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
              G+ ISLSEQ+LVDC     N GCNGGL   AF++I  NGG+D+E+ YPY G +  C 
Sbjct: 180 QAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKDGTCK 239

Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVFT-GECG 296
            S  N  V  +D   +++   E  LK AV   +PVS+A E   ++F+ Y+SGV+T   CG
Sbjct: 240 YSAENVGVQVLDSV-NITLGAEDELKHAVGLVRPVSIAFEV-VKSFRLYKSGVYTDSHCG 297

Query: 297 SA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEA 353
           +    ++H V+AVGYG E+GV YWL++NSWG+DWG+ GY K++          CGIA  A
Sbjct: 298 NTPMDVNHAVLAVGYGIEDGVPYWLIKNSWGADWGDKGYFKME-----MGKNMCGIATCA 352

Query: 354 SYPV 357
           SYPV
Sbjct: 353 SYPV 356


>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
          Length = 360

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 137/307 (44%), Positives = 187/307 (60%), Gaps = 22/307 (7%)

Query: 58  KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
           ++GK    +   ++RF++F DNL+ I  HN    +YK+G+N+F DLT +E+R   LG   
Sbjct: 67  RYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRLGAAQ 126

Query: 118 DAKRRLMKSKVASQRYACKAGDE-LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
           +          A+ +   K  +  LPE+ DWRE G V+PVK+QG CGSCW FST  A+E 
Sbjct: 127 NCS--------ATTKGNVKLTNAVLPETKDWREDGIVSPVKNQGKCGSCWTFSTTGALEA 178

Query: 177 INKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
                 G+ ISLSEQ+LVDC     N GCNGGL   AF++I  NGG+D+E+ YPY G   
Sbjct: 179 AYSQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKNG 238

Query: 236 KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVFTG- 293
            C  S  N  V  ID   +++   E  LK AVA  +PVS+A E   + F+ Y+SGV++  
Sbjct: 239 LCKFSSENVGVKVIDSV-NITLGAEDELKYAVALVRPVSIAFEV-IKGFKQYKSGVYSST 296

Query: 294 ECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
           ECG+    ++H V+AVGYG ENGV YWL++NSWG+DWG++GY K++          CGIA
Sbjct: 297 ECGNTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDDGYFKME-----MGKNMCGIA 351

Query: 351 MEASYPV 357
             ASYPV
Sbjct: 352 TCASYPV 358


>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
 gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
 gi|228243|prf||1801240A Cys protease 1
          Length = 322

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 134/319 (42%), Positives = 188/319 (58%), Gaps = 30/319 (9%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEE 107
           ++ +  K G+    +     R  +F DNL++I+E N        TY + +N+F+D+TNE+
Sbjct: 20  WEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEK 79

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPES--VDWREKGAVNPVKDQGSCGSC 165
           + A+  G +   +   + +            D  PES  VDWR KGAV PVKDQG CGSC
Sbjct: 80  FNAVMKGYKKGPRPAAVFTST----------DAAPESTEVDWRTKGAVTPVKDQGQCGSC 129

Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDC--DRKINAGCNGGLMDYAFQFIIQNGGMD 223
           WAFST   +EG + + TG L+SLSEQ+LVDC      N GCNGG ++ A  ++  NGG+D
Sbjct: 130 WAFSTTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVD 189

Query: 224 SEQDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGG 280
           +E  YPY   +N C   R N+  +  +  GY  ++   E +LK A  D  P+SVAI+A  
Sbjct: 190 TESSYPYEARDNTC---RFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASH 246

Query: 281 RAFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
           R+FQ Y +GV +   C S+ LDH V+AVGYG+E G D+WLV+NSW + WGE+GY+K+ RN
Sbjct: 247 RSFQSYYTGVYYEPSCSSSQLDHAVLAVGYGSEGGQDFWLVKNSWATSWGESGYIKMARN 306

Query: 339 LLDTNTGKCGIAMEASYPV 357
                   CGIA +A YP 
Sbjct: 307 ----RNNNCGIATDACYPT 321


>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
 gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
          Length = 335

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 134/319 (42%), Positives = 194/319 (60%), Gaps = 25/319 (7%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
           + +W ++HGK+ +      +R  I+++NLR I++HN      N T+K+G+N+F D+TNEE
Sbjct: 28  WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEE 86

Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
           +R    G + D  R    +      +        P+ VDWR++G V PVKDQ  CGSCW+
Sbjct: 87  FRQAMNGYKQDPNRTSKGALFMEPSFFAA-----PQQVDWRQRGYVTPVKDQKQCGSCWS 141

Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
           FS+  A+EG     TG+LIS+SEQ LVDC R + N GCNGG+MD AFQ++ +N G+DSEQ
Sbjct: 142 FSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSEQ 201

Query: 227 DYPYLGAENKCDPSRRNAK--VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAF 283
            YPYL  ++   P R + +  V  I G+ D+   +E++L  AVA   PVSVAI+A  ++ 
Sbjct: 202 SYPYLARDDL--PCRYDPRFNVAKITGFVDIPKGNELALMNAVAAVGPVSVAIDASHQSL 259

Query: 284 QHYESGVFTGE-CGSALDHGVVAVGYGTEN----GVDYWLVRNSWGSDWGENGYVKLQRN 338
           Q Y+SG++    C S LDH V+ VGYG +     G  YW+V+NSW   WG+ GY+ + ++
Sbjct: 260 QFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKD 319

Query: 339 LLDTNTGKCGIAMEASYPV 357
                   CGIA  ASYP+
Sbjct: 320 ----KNNHCGIATMASYPL 334


>gi|443694581|gb|ELT95681.1| hypothetical protein CAPTEDRAFT_173171 [Capitella teleta]
          Length = 342

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 147/361 (40%), Positives = 212/361 (58%), Gaps = 34/361 (9%)

Query: 10  ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
            S L  L +IS   A+ +      + +D  SS     E+  ++  +   +GK+ + M  +
Sbjct: 4   FSLLCILTWISVE-ASSLKFQPLRHQNDVMSS-----ELNELWTEYKETYGKSYD-MKED 56

Query: 70  EKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
             R  +++ NLR I  HN  +     ++ +G+N+ +DLT  EYR   LG R     R  K
Sbjct: 57  VVRRSLWEGNLRHISMHNVKHDLGKHSFSMGINELSDLTPSEYRQR-LGLRPALGERTGK 115

Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
             V +       G+++PE VDWR+KG V PVK+QG+CGSCWAFS+  ++EG +  +TG+L
Sbjct: 116 KFVYN-------GEKVPEHVDWRDKGYVTPVKNQGACGSCWAFSSTGSLEGQHFRLTGQL 168

Query: 186 ISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKC----DPS 240
           +SLSEQ LVDC +K  NAGCNGG MD AF ++  N G+D+E  YPY G ++ C     P 
Sbjct: 169 VSLSEQNLVDCTKKYGNAGCNGGWMDNAFNYVKANNGIDTEAFYPYEGHDDWCGYDGSPG 228

Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVF--TGECGS 297
            + A      G+ DV   DE++LK+AVA   PVSV I+A  R+FQ Y+SG++       S
Sbjct: 229 HKGANCT---GHVDVQQGDELALKQAVATVGPVSVGIDATHRSFQLYKSGIYDEVACSNS 285

Query: 298 ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + DH V+ VGYG++ G DYWLV+NSWG+ WG +GY+ + RN       +C IA  ASYP 
Sbjct: 286 STDHAVLVVGYGSQGGHDYWLVKNSWGTSWGMDGYIMMSRN----KGNQCAIASYASYPT 341

Query: 358 K 358
           +
Sbjct: 342 E 342


>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
          Length = 375

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 139/324 (42%), Positives = 196/324 (60%), Gaps = 18/324 (5%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
           D VM  + T+  +H K          R +IF +N   I +HN        ++K+ +NK+A
Sbjct: 57  DVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 116

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQ 159
           DL + E+R +  G      ++L  +  + +   +   A   LP+SVDWR KGAV  VKDQ
Sbjct: 117 DLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQ 176

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
           G CGSCWAFS+  A+EG +   +G L+SLSEQ LVDC  K  N GCNGGLMD AF++I  
Sbjct: 177 GHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 236

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
           NGG+D+E+ YPY   ++ C  ++    V + D G+ D+   DE  + +AVA   PVSVAI
Sbjct: 237 NGGIDTEKSYPYEAIDDSCHFNK--GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 294

Query: 277 EAGGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYV 333
           +A   +FQ Y  GV+   +C +  LDHGV+ VG+GT E+G DYWLV+NSWG+ WG+ G++
Sbjct: 295 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 354

Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
           K+ RN       +CGIA  +SYP+
Sbjct: 355 KMLRN----KENQCGIASASSYPL 374


>gi|296189340|ref|XP_002742739.1| PREDICTED: cathepsin L1 [Callithrix jacchus]
          Length = 333

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 145/331 (43%), Positives = 197/331 (59%), Gaps = 26/331 (7%)

Query: 39  SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYK 94
           SS+   D  +   +  W A H +   GM   E R  +++ N++ I+    E+N    ++ 
Sbjct: 16  SSALTFDRSLEAQWIKWKAMHNRLY-GMNEEEWRRAVWEKNMKMIELHNHEYNQGKHSFT 74

Query: 95  VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
           + +N F D+TNEE+R +  G ++   R+    KV  +        E P SVDWREKG V 
Sbjct: 75  MAMNAFGDMTNEEFRQVMNGFQN---RKPRNGKVFQEPLF----HEAPRSVDWREKGYVT 127

Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAF 213
           PVK+QG CGSCWAFS   A+EG     TG+L+SLSEQ LVDC   + N GC+GGLMDYAF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNQGCDGGLMDYAF 187

Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
           Q++ +NGG+DSE+ YPY   E  C  +   + V +  G+ D+ P  E +L KAVA   P+
Sbjct: 188 QYVQENGGLDSEESYPYEATEESCKYNPEYS-VANDTGFVDI-PKLEKALMKAVATVGPI 245

Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE-NGVD---YWLVRNSWGSD 326
           SVAI+AG  +FQ Y+ G+ F  EC S  +DHGV+ VGYG E  G D   YWLV+NSWG  
Sbjct: 246 SVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGYGFERTGSDNSKYWLVKNSWGEK 305

Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           WG +GY+K+ ++        CGIA  ASYP 
Sbjct: 306 WGMDGYIKMAKD----RKNHCGIASAASYPT 332


>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
 gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
           Contains: RecName: Full=Cathepsin L heavy chain;
           Contains: RecName: Full=Cathepsin L light chain; Flags:
           Precursor
 gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
          Length = 371

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 139/324 (42%), Positives = 196/324 (60%), Gaps = 18/324 (5%)

Query: 46  DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
           D VM  + T+  +H K          R +IF +N   I +HN        ++K+ +NK+A
Sbjct: 53  DVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 112

Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQ 159
           DL + E+R +  G      ++L  +  + +   +   A   LP+SVDWR KGAV  VKDQ
Sbjct: 113 DLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQ 172

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
           G CGSCWAFS+  A+EG +   +G L+SLSEQ LVDC  K  N GCNGGLMD AF++I  
Sbjct: 173 GHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 232

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
           NGG+D+E+ YPY   ++ C  ++    V + D G+ D+   DE  + +AVA   PVSVAI
Sbjct: 233 NGGIDTEKSYPYEAIDDSCHFNK--GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 290

Query: 277 EAGGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYV 333
           +A   +FQ Y  GV+   +C +  LDHGV+ VG+GT E+G DYWLV+NSWG+ WG+ G++
Sbjct: 291 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 350

Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
           K+ RN       +CGIA  +SYP+
Sbjct: 351 KMLRN----KENQCGIASASSYPL 370


>gi|312100382|gb|ADQ27799.1| mitogenic proteinase [Vasconcellea cundinamarcensis]
          Length = 214

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 115/220 (52%), Positives = 152/220 (69%), Gaps = 7/220 (3%)

Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
           PES+DWR+KGAV PVKDQ  CGSCWAFSTVA VEGINKIVTG+LISLSEQEL+DCDR+ +
Sbjct: 2   PESIDWRQKGAVTPVKDQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-S 60

Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
            GCNGG    + Q+++ N G+ +E +YPY   +  C    +    V I GY+ V P DE+
Sbjct: 61  HGCNGGYQTTSLQYVVDN-GVHTEYEYPYEKKQGNCRAKDKKGLKVQITGYKRVPPNDEI 119

Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
           SL K +A+QPVSV IE+  R+F  Y  G++ G CG+ LDH V A+GYG     DY L++N
Sbjct: 120 SLIKVIANQPVSVLIESKDRSFHFYRGGIYKGPCGTRLDHAVTAIGYGK----DYILIKN 175

Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
           SWG +WGE GY++++R     + G CG+   + +P+K  Q
Sbjct: 176 SWGPNWGEKGYIRIKR-ASGKSEGICGVYKSSYFPIKGYQ 214


>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
          Length = 336

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 138/327 (42%), Positives = 198/327 (60%), Gaps = 26/327 (7%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID----EHNSLNRTYKVGLNKF 100
           D ++   ++ W + H K  +      +R  +++ NL+ I+    EH+    +Y++G+N F
Sbjct: 21  DPQLDDHWELWKSWHSKKYHEKEEGWRRM-VWEKNLKKIELHNLEHSMGTHSYRLGMNHF 79

Query: 101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
            D+T+EE+R +  G +  A+ +   S      +      E P+SVDWR+ G V PVKDQG
Sbjct: 80  GDMTHEEFRQLMNGYKRKAETKARGSLFLEPNFL-----EAPKSVDWRDNGYVTPVKDQG 134

Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
            CGSCWAFST  A+EG +   TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ++  N
Sbjct: 135 QCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDN 194

Query: 220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
            G+DSE  YPYLG +++  P   +    S++  G+ D+    E +L KAVA   PVSVAI
Sbjct: 195 QGLDSEDSYPYLGTDDQ--PCHYDPTYNSVNDTGFVDIPSGKERALMKAVAAVGPVSVAI 252

Query: 277 EAGGRAFQHYESGV-FTGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
           +AG  +FQ Y+SG+ +  EC S  LDHGV+ VGYG +    +G  YW+V+NSW   WG+ 
Sbjct: 253 DAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQGEDVDGKKYWIVKNSWSEKWGDK 312

Query: 331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GY+ + ++        CGIA  ASYP+
Sbjct: 313 GYIYMAKD----RKNHCGIATAASYPL 335


>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
          Length = 344

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 138/302 (45%), Positives = 183/302 (60%), Gaps = 21/302 (6%)

Query: 72  RFQIFKDNLRFIDEHNSLNR----TYKVGLNKF---ADLTNEEYRAMYLGTRSDAKRR-- 122
           R +I+ ++   I +HN        +YK+G+N +    D+ + E+     G    AK    
Sbjct: 47  RMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNSWWEHGDMLHHEFVKTMNGFNKTAKHNKN 106

Query: 123 --LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
             +    V   ++   A  +LPE VDWR+ GAV  +KDQG CGSCW+FST  A+EG +  
Sbjct: 107 LYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFR 166

Query: 181 VTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
            +G L+SLSEQ L+DC  +  N GCNGGLMD AF++I  NGG+D+EQ YPY G ++KC  
Sbjct: 167 QSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQAYPYEGVDDKCRY 226

Query: 240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFT-GECGS 297
           + +N     + G+ D+   DE  L +AVA   PVSVAI+A    FQ Y SGV+   EC S
Sbjct: 227 NPKNTGAEDV-GFVDIPEGDEQKLMEAVATVGPVSVAIDASHTHFQLYSSGVYNEEECSS 285

Query: 298 A-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
             LDHGV+ VGYGT E GVDYWLV+NSWG  WGE GY+K+ RN       +CGIA  ASY
Sbjct: 286 TDLDHGVLVVGYGTDEQGVDYWLVKNSWGRSWGELGYIKMIRN----KNNRCGIASSASY 341

Query: 356 PV 357
           P+
Sbjct: 342 PL 343


>gi|129353|sp|P22895.1|P34_SOYBN RecName: Full=P34 probable thiol protease; Flags: Precursor
          Length = 379

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 135/345 (39%), Positives = 207/345 (60%), Gaps = 28/345 (8%)

Query: 44  TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNK 99
           T  +V +++Q W ++HG+  +      KR +IFK+N  +I + N+ NR    ++++GLNK
Sbjct: 36  TQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNA-NRKSPHSHRLGLNK 94

Query: 100 FADLTNEEYRAMYLGTRSDAKR--RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
           FAD+T +E+   YL    D  +  ++   K+  ++Y+C   D  P S DWR+KG +  VK
Sbjct: 95  FADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSC---DHPPASWDWRKKGVITQVK 151

Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
            QG CG  WAFS   A+E  + I TG+L+SLSEQELVDC  + + G   G    +F++++
Sbjct: 152 YQGGCGRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGSYNGWQYQSFEWVL 210

Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD-------EMSLKKAVADQ 270
           ++GG+ ++ DYPY   E +C  ++   K V+IDGYE +   D       E +   A+ +Q
Sbjct: 211 EHGGIATDDDYPYRAKEGRCKANKIQDK-VTIDGYETLIMSDESTESETEQAFLSAILEQ 269

Query: 271 PVSVAIEAGGRAFQHYESGVFTGE-CGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDW 327
           P+SV+I+A  + F  Y  G++ GE C S   ++H V+ VGYG+ +GVDYW+ +NSWG DW
Sbjct: 270 PISVSIDA--KDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYWIAKNSWGFDW 327

Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN---SAKPKPH 369
           GE+GY+ +QRN  +   G CG+   ASYP K       SA+ K H
Sbjct: 328 GEDGYIWIQRNTGNL-LGVCGMNYFASYPTKEESETLVSARVKGH 371


>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 335

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 141/312 (45%), Positives = 184/312 (58%), Gaps = 20/312 (6%)

Query: 57  AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMY 112
           A HGK          R +I+ +N   I  HN        +YK+ +N+F DL + E+    
Sbjct: 32  ALHGKDYASDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDLLHHEF---- 87

Query: 113 LGTRSDAKRRLMKSKVASQRYACKAGDE---LPESVDWREKGAVNPVKDQGSCGSCWAFS 169
           + TR+  KR    S      +    G E   LP++VDWR+KGAV PVK+QG CGSCWAFS
Sbjct: 88  VSTRNGFKRNYRDSPREGSFFVEPEGFEDLQLPKTVDWRKKGAVTPVKNQGQCGSCWAFS 147

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           T  ++EG +   T +L+SLSEQ LVDC R   N GC GGLMD AF++I  N G+D+E  Y
Sbjct: 148 TTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGNNGCEGGLMDNAFKYIKSNKGIDTEWSY 207

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYE 287
           PY   +  C  +R +       G+ D+   DE  LKKAVA   PVSVAI+A   +FQ Y 
Sbjct: 208 PYNATDGVCHFNRSDVGATDT-GFVDIPEGDENKLKKAVAAVGPVSVAIDASHESFQFYS 266

Query: 288 SGVF-TGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
            GV+   EC S  LDHGV+ VGYGT++G DYWLV+NSWG+ WG+ GY+ + RN       
Sbjct: 267 EGVYDEPECSSEQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDEGYIYMTRN----KDN 322

Query: 346 KCGIAMEASYPV 357
           +CGIA  ASYP+
Sbjct: 323 QCGIASSASYPL 334


>gi|66475996|ref|XP_627814.1| cryptopain - cysteine proteinase secreted, possible transmembrane
           domain near N-terminus [Cryptosporidium parvum Iowa II]
 gi|32399065|emb|CAD98305.1| cryptopain precursor [Cryptosporidium parvum]
 gi|46229218|gb|EAK90067.1| cryptopain - cysteine proteinase secreted, possible transmembrane
           domain near N-terminus [Cryptosporidium parvum Iowa II]
 gi|76160841|gb|ABA40395.1| cryptopain-1 [Cryptosporidium parvum]
          Length = 401

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 144/369 (39%), Positives = 205/369 (55%), Gaps = 25/369 (6%)

Query: 8   LAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDD-------EVMTIYQTWLAKHG 60
           + I+TL+ +F +       +S+   +N  D    +   D       E    ++ +  K+ 
Sbjct: 38  IIIATLIAIFIVL---VVTVSLYITNNTSDKIDDFVPGDYVDPATREYRKSFEEFKKKYH 94

Query: 61  KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
           K  + M    +RF+I+K N+ FI   NS   +Y + +N+F DL+ EE+ A + G   D+K
Sbjct: 95  KVYSSMEEENQRFEIYKQNMNFIKTTNSQGFSYVLEMNEFGDLSKEEFMARFTGYIKDSK 154

Query: 121 --RRLMKSKVASQRYACKAGDEL--PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
              R+ KS   S   A ++ +E   P S++W E G VNP+++Q +CGSCWAFS VAA+EG
Sbjct: 155 DDERVFKSSRVS---ASESEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAALEG 211

Query: 177 INKIVTGE-LISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE 234
                T   L SLSEQ+ VDC ++  N GC+GG M  AFQ+ I+N  + +  DYPY   E
Sbjct: 212 ATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQYAIKNKYLCTNDDYPYFAEE 271

Query: 235 NKC-DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFT 292
             C D    N   + +  Y+ V P +  +LK A+A   P+SVAI+A    FQ Y+SGVF 
Sbjct: 272 KTCMDSFCENYIEIPVKAYKYVFPRNINALKTALAKYGPISVAIQADQTPFQFYKSGVFD 331

Query: 293 GECGSALDHGVVAVGYGTENGV--DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
             CG+ ++HGVV VGY  +     +YWLVRNSWG  WGE GY+KL   L     G CGI 
Sbjct: 332 APCGTKVNHGVVLVGYDMDEDTNKEYWLVRNSWGEAWGEKGYIKLA--LHSGKKGTCGIL 389

Query: 351 MEASYPVKN 359
           +E  YPV N
Sbjct: 390 VEPVYPVIN 398


>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 385

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 133/312 (42%), Positives = 191/312 (61%), Gaps = 14/312 (4%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRA 110
           ++ + A+H K          R  IF++N +FI++HNS     + +G+N F DLTN+EYR 
Sbjct: 81  WENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKEFDFYLGMNHFGDLTNKEYRE 140

Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
            YLG R         S + S+    +  +++P+ +DWR++G V PVK+QG CGSCWAFS 
Sbjct: 141 RYLGYRRPENTPSKASYIFSR---AEKIEDVPDQIDWRDQGFVTPVKNQGQCGSCWAFSA 197

Query: 171 VAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
           V ++EG +   TG+L+SLSEQ LVDC   + N+GCNGG MD AF+++  N G+D+E  YP
Sbjct: 198 VGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGWMDQAFEYVKDNHGIDTEDSYP 257

Query: 230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAV-ADQPVSVAIEAGGRAFQHYES 288
           Y+G +  C    ++    ++ G+ DV   DE +L++AV    PVSVAI+A    FQ Y  
Sbjct: 258 YVGTDGSCHFKNKSIG-ATLKGFMDVKEGDEEALRQAVGVAGPVSVAIDASSMLFQFYRG 316

Query: 289 GVFTGE-CG-SALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           GV+    C  S LDHGV+ VGYG +  G D+W+V+NSWG  WG  GY+++ RN       
Sbjct: 317 GVYNVPWCSTSELDHGVLVVGYGKQFQGKDFWMVKNSWGVGWGIYGYIEMSRN----KGN 372

Query: 346 KCGIAMEASYPV 357
           +CGIA +AS P 
Sbjct: 373 QCGIASKASIPT 384


>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
          Length = 301

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 135/300 (45%), Positives = 186/300 (62%), Gaps = 25/300 (8%)

Query: 72  RFQIFKDNLRFID----EHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           R  +++ NL+ I+    EH+    +Y++G+N F D+T+EE+R +  G +   +R+   S 
Sbjct: 12  RRMVWEKNLKKIEMHNLEHSMGTHSYRLGMNHFGDMTHEEFRQIMNGYKRKPQRKFTGSL 71

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
                +      E P +VDWR+ G V PVKDQG CGSCWAFST  A+EG +   TG+L+S
Sbjct: 72  FMEPNFL-----EAPRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVS 126

Query: 188 LSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           LSEQ LVDC R + N GCNGGLMD AFQ+I  N G+DSE  YPYLG +++  P   + K 
Sbjct: 127 LSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQ--PCHYDPKY 184

Query: 247 VSID--GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGS-ALDH 301
            S +  G+ D+    E +L KAVA   PVSVAI+AG  +FQ Y+SG+ +  +C S  LDH
Sbjct: 185 NSANDTGFVDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELDH 244

Query: 302 GVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           GV+ VGYG E    +G  YW+V+NSW   WG+ GY+ + ++        CGIA  ASYP+
Sbjct: 245 GVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKD----RKNHCGIATAASYPL 300


>gi|6851030|emb|CAB71032.1| cysteine protease [Lolium multiflorum]
          Length = 359

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 135/314 (42%), Positives = 187/314 (59%), Gaps = 23/314 (7%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
           +  +  +HGK+       ++RF+IF ++L  +   N    +YK+G+N+F+D+T EE++A 
Sbjct: 58  FARFAVRHGKSYGSAAEVQRRFRIFSESLDEVRSTNRKGLSYKLGINRFSDMTWEEFQAT 117

Query: 112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
            LG        L  + +       +  + LPE+ DWRE G V+PVKDQ SCGSCW FST 
Sbjct: 118 KLGAAQTCSATLAGNHLM------RDANALPETKDWRETGIVSPVKDQASCGSCWTFSTT 171

Query: 172 AAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
            A+E      TG+ ISLSEQ+LVDC    N  GCNGGL   AF++I  NGG+D+E+ YPY
Sbjct: 172 GALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYIKYNGGIDTEESYPY 231

Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESG 289
            G    C     NA V   D   +++   E  LK AV   +PVSVA E     F+ Y+SG
Sbjct: 232 KGVNGVCKYRPENAAVQVADSV-NITLNAEDELKNAVGLVRPVSVAFEV-IDGFKQYKSG 289

Query: 290 VFTGE-CGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ--RNLLDTN 343
           V+T + CG+  D   H V+AVGYG ENGV YWL++NSWG+DWGE+GY K++  +N+    
Sbjct: 290 VYTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGEDGYFKMEMGKNM---- 345

Query: 344 TGKCGIAMEASYPV 357
              C +A  ASYP+
Sbjct: 346 ---CAVATCASYPI 356


>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 139/329 (42%), Positives = 202/329 (61%), Gaps = 27/329 (8%)

Query: 45  DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKF 100
           D  + T ++ W + HGK+        +R  +++ +LR I+ HN   SL + ++++G+N F
Sbjct: 22  DPGLDTHWEQWKSWHGKSYEQKEETWRRM-VWEKHLRVIEIHNLEHSLGKHSFRLGMNHF 80

Query: 101 ADLTNEEYRAMYLGTR-SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
            D+ NEE+R +  G +     ++L  S      +      E+P+ VDWR++G V PVKDQ
Sbjct: 81  GDMPNEEFRQLMNGYKYKQTHKKLQGSHFLEPNFL-----EVPKHVDWRDEGYVTPVKDQ 135

Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQ 218
           G CGSCWAFST  A+EG +   TG+L+SLSEQ LV+C + + N GCNGGLMD AFQ++  
Sbjct: 136 GQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKD 195

Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID--GYEDVSPFDEMSLKKAVAD-QPVSVA 275
           NGG+DSE  YPY+G ++   P   N +  + +  G+ D+    E +L KA+A   PVSVA
Sbjct: 196 NGGIDSEDSYPYVGTDDT--PCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVA 253

Query: 276 IEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGE 329
           I+AG  +FQ Y+SG+ F  EC S  LDHGV+ VGYG E    +G  YW+V+NSW    G+
Sbjct: 254 IDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKLGQ 313

Query: 330 NGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
           NGY+ + ++        CGIA  ASYP++
Sbjct: 314 NGYILMAKD----KDNHCGIATAASYPLE 338


>gi|75994626|gb|ABA33834.1| cysteine protease Mir1 [Zea mays subsp. parviglumis]
          Length = 248

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 124/252 (49%), Positives = 172/252 (68%), Gaps = 21/252 (8%)

Query: 51  IYQTWLAKHGK--TSN-----GMGHNEK----RFQIFKDNLRFIDEHNSLN----RTYKV 95
           +Y+ W +KHG+  +SN       G  E+    R ++F+DNLR+ID+HN+       T+++
Sbjct: 1   MYEAWKSKHGRGGSSNDDCDIAPGEEEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFRL 60

Query: 96  GLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP 155
           GL  FADLT +EYR   LG R+  +R     +    R     GD LP+++DWR+ GAV  
Sbjct: 61  GLTPFADLTLDEYRGRVLGFRARGRRSGHGYRARRPR----GGDLLPDAIDWRQLGAVTE 116

Query: 156 VKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQF 215
           VKDQ  CG CWAFS VAA+EG+N I TG L+SLSEQE++DCD + ++GC+GG M+ AF+F
Sbjct: 117 VKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGGQMEDAFRF 175

Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSR-RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSV 274
           +I NGG+DSE DYP++G +  CD S+ +N KV +IDG  +V   +E +L++AVA QPVSV
Sbjct: 176 VIGNGGIDSEADYPFIGTDGTCDASKEKNEKVATIDGLVEVVSNNETALQEAVAIQPVSV 235

Query: 275 AIEAGGRAFQHY 286
           AI+A GRAFQHY
Sbjct: 236 AIDASGRAFQHY 247


>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 327

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 137/312 (43%), Positives = 190/312 (60%), Gaps = 17/312 (5%)

Query: 52  YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN--SLNRTYKVGLNKFADLTNEEYR 109
           +Q W  K+ K         +R  I++ N +F++ HN  S    + V +N+FADL   E+ 
Sbjct: 24  FQDWKVKYNKVYETKETELERQIIWESNKKFVENHNANSDKFGFTVAMNEFADLDAGEFG 83

Query: 110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
            ++ G           S   S      +G ++P++VDW+EKGAV P+K+QG CGSCW+FS
Sbjct: 84  RIFNGLLP------RPSSYNSTNIYKPSGVKVPDTVDWKEKGAVTPIKNQGQCGSCWSFS 137

Query: 170 TVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
           +  ++EG + I TG L+SLSEQ+L+DC  K  N GCNGGLMD +F+++    G ++E +Y
Sbjct: 138 STGSLEGQHFINTGTLVSLSEQQLMDCSTKYGNHGCNGGLMDNSFRYLKSVAGDETEDNY 197

Query: 229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYE 287
           PY  AEN       +  VV+   Y D+   DE SLK AVA+  P+SVAI+A   +FQ Y 
Sbjct: 198 PYT-AENGVCRYDSSLAVVTDKSYVDIPQGDEDSLKDAVANVGPISVAIDASHSSFQLYN 256

Query: 288 SGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
           SGV +   C S  LDHGV+A+GYGTE+G DYWLV+NSWG+ WG  GY+K+ RN       
Sbjct: 257 SGVYYASTCSSTQLDHGVLAIGYGTEDGKDYWLVKNSWGTSWGMEGYIKMSRN----RNN 312

Query: 346 KCGIAMEASYPV 357
            CGIA +ASYP 
Sbjct: 313 NCGIATQASYPT 324


>gi|52546918|gb|AAU81592.1| cysteine proteinase, partial [Petunia x hybrida]
          Length = 196

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 114/190 (60%), Positives = 141/190 (74%), Gaps = 2/190 (1%)

Query: 184 ELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRN 243
           +L+SLSEQELVDCD   N GCNGGLMD AF FI + GG+ +E++YPY+ A+ KCD  +RN
Sbjct: 4   KLVSLSEQELVDCDNGENQGCNGGLMDLAFDFIKKKGGITTEENYPYMAADGKCDLKKRN 63

Query: 244 AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGV 303
             VVSIDG+EDV P DE SL KAVA+QPVSVAIEA G  FQ Y  GVFTG+CG+ LDHGV
Sbjct: 64  TPVVSIDGHEDVPPNDEESLLKAVANQPVSVAIEASGSDFQFYSEGVFTGDCGTELDHGV 123

Query: 304 VAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
             VGYGT  +G  YW VRNSWG +WGE GY+++QR+ +D   G CGIAM+ SYP+K S +
Sbjct: 124 AIVGYGTTLDGTKYWTVRNSWGPEWGEKGYIRMQRD-IDAEEGLCGIAMQPSYPIKTSSD 182

Query: 363 SAKPKPHSSA 372
           +    P ++ 
Sbjct: 183 NPTGTPAATP 192


>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
           boliviensis]
 gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
           boliviensis]
 gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
           boliviensis]
          Length = 333

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 149/358 (41%), Positives = 205/358 (57%), Gaps = 41/358 (11%)

Query: 12  TLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEK 71
           TL+   F    ++A ++      NH   + W            W A H +   G    E 
Sbjct: 4   TLILAAFCLGLASAALTF-----NHSLEAQWIK----------WKAMHNRLY-GKNEEEW 47

Query: 72  RFQIFKDNLRFID----EHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
           R  +++ N++ I+    E+N    ++ + +N F D+TNEE+R +  G ++   R+    K
Sbjct: 48  RRAVWEKNMKTIELHNHEYNQGKHSFTMAMNTFGDMTNEEFRQVMNGFQN---RKPRNGK 104

Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
           V  +        E P SVDWREKG V PVK+QG CGSCWAFS   A+EG     TG+L+S
Sbjct: 105 VFQEPLL----HEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVS 160

Query: 188 LSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
           LSEQ LVDC   + N GCNGGLMDYAFQ++ +NGG+DSE+ YPY   E  C  + + + V
Sbjct: 161 LSEQNLVDCSGPQGNQGCNGGLMDYAFQYVQENGGLDSEESYPYEATEESCKYNPKYS-V 219

Query: 247 VSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGV-FTGECGSA-LDHGV 303
            +  G+ D+ P  E +L KAVA   P+SVAI+AG  +FQ Y+ G+ F  EC S  +DHGV
Sbjct: 220 ANDTGFVDI-PKLEKALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGV 278

Query: 304 VAVGYGTE-NGVD---YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
           + VGYG E  G D   YWLV+NSWG +WG +GY+K+ ++        CGIA  ASYP 
Sbjct: 279 LVVGYGFERTGSDNSKYWLVKNSWGEEWGMDGYIKMAKD----RKNHCGIASAASYPT 332


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.131    0.393 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,956,069,429
Number of Sequences: 23463169
Number of extensions: 256869948
Number of successful extensions: 643337
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6724
Number of HSP's successfully gapped in prelim test: 887
Number of HSP's that attempted gapping in prelim test: 612730
Number of HSP's gapped (non-prelim): 9208
length of query: 372
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 228
effective length of database: 8,980,499,031
effective search space: 2047553779068
effective search space used: 2047553779068
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)