BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy18108
         (102 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|242014216|ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
 gi|212512256|gb|EEB15049.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
          Length = 434

 Score = 71.2 bits (173), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 38/80 (47%), Positives = 50/80 (62%)

Query: 18  KSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT 77
           K +NE+  +N   L+ F+ F+  F+K Y +KEE  KRF +F  N+K I  LNK E GTA 
Sbjct: 118 KIDNEIINKNEYLLQSFKDFVLKFNKVYFSKEEFKKRFRIFRANMKKINFLNKAEKGTAQ 177

Query: 78  YGINHLSDLTREEMKSRLGL 97
           YGI   SDL+  E K+ LGL
Sbjct: 178 YGITEFSDLSVTEFKNYLGL 197


>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
          Length = 1036

 Score = 64.7 bits (156), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 53/95 (55%), Gaps = 12/95 (12%)

Query: 15  GQMKSNNELKTEN--------PEHLKQ---FEKFIRDFSKSYPTKEEVAKRFAVFEDNLK 63
           GQ +S   L+ +N           LK+   F +F+  + K Y  KEE   RF +F+DNL 
Sbjct: 701 GQNRSKRSLRGQNYSQKMLQQSRQLKEEILFHEFMGKYKKMYHNKEEKEMRFQIFKDNLN 760

Query: 64  LIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGL 97
           LIE+L + E GT  YG+   +DLT+ E K+R LGL
Sbjct: 761 LIEELQRNEMGTGRYGVTQFTDLTKAEFKARHLGL 795


>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
          Length = 475

 Score = 64.7 bits (156), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 30/75 (40%), Positives = 47/75 (62%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           S ++   E+ E L QF++F+  ++K Y ++EEV +R  +F +NLK  E L   + G+A Y
Sbjct: 162 STSQPLEESVELLGQFKEFMTKYNKVYSSQEEVDRRLRIFHENLKTAEKLQALDQGSAEY 221

Query: 79  GINHLSDLTREEMKS 93
           G+   SDLT EE +S
Sbjct: 222 GVTKFSDLTEEEFRS 236


>gi|17543258|ref|NP_502836.1| Protein Y40H7A.10 [Caenorhabditis elegans]
 gi|3880920|emb|CAA22062.1| Protein Y40H7A.10 [Caenorhabditis elegans]
          Length = 343

 Score = 64.3 bits (155), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 29/80 (36%), Positives = 45/80 (56%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           Q+   + + T + ++   F+ F+  + + YP + E+ KRF +F  NL L+E  NK + G 
Sbjct: 33  QILQRHHIPTPDVKYTNAFQNFLVKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGK 92

Query: 76  ATYGINHLSDLTREEMKSRL 95
            TY +N  SDLT EE K  L
Sbjct: 93  VTYELNDFSDLTEEEWKKYL 112


>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
           rotundata]
          Length = 884

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 1/65 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE F++ ++K+Y + +E A R+ VF  NLK+IE L K E GTA YG+   +DLT EE K+
Sbjct: 579 FEDFVKTYNKTYLSAKEKADRYKVFRKNLKMIEKLRKFEQGTAVYGVTMFADLTPEEFKT 638

Query: 94  R-LGL 97
           + LGL
Sbjct: 639 KYLGL 643


>gi|118350314|ref|XP_001008438.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89290205|gb|EAR88193.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 389

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 36/87 (41%), Positives = 49/87 (56%), Gaps = 2/87 (2%)

Query: 10  TLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDL 68
           ++A+  Q    N  K  N   +KQ F KF  +  K Y   EE  +RF +F  NL +I +L
Sbjct: 15  SMAIINQNFHYNSTKQLNLTQVKQLFSKFKAEHKKFYNFLEE-QRRFEIFRQNLDIISEL 73

Query: 69  NKGEHGTATYGINHLSDLTREEMKSRL 95
           N+ E GTA YGI   SD+T EE KS++
Sbjct: 74  NQVEEGTAEYGITQFSDMTTEEFKSQI 100


>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
          Length = 471

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/69 (47%), Positives = 43/69 (62%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           EHL  F KF   F ++Y T  E   RF +F+ NL+LIE+LN+ E G+A YGI   +D+T 
Sbjct: 163 EHL--FAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYGITEFADMTS 220

Query: 89  EEMKSRLGL 97
            E K R GL
Sbjct: 221 PEYKQRTGL 229


>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
          Length = 884

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 32/65 (49%), Positives = 44/65 (67%), Gaps = 1/65 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE FI+ F K+Y + +E   RF +F+ NLK+IE+L   E GTA YG+   +DLT +E K+
Sbjct: 579 FEAFIKKFGKTYNSADEKLDRFKIFKQNLKIIEELQTFERGTAEYGVTMFADLTPKEFKA 638

Query: 94  R-LGL 97
           R LGL
Sbjct: 639 RYLGL 643


>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
          Length = 471

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/69 (47%), Positives = 43/69 (62%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           EHL  F KF   F ++Y T  E   RF +F+ NL+LIE+LN+ E G+A YGI   +D+T 
Sbjct: 163 EHL--FAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYGITEFADMTS 220

Query: 89  EEMKSRLGL 97
            E K R GL
Sbjct: 221 PEYKQRTGL 229


>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
          Length = 473

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 28/75 (37%), Positives = 46/75 (61%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           +N++   E+ + L QF+ F+  + K Y ++EE  +R  +F++NLK  E L   + G+A Y
Sbjct: 160 TNSQPVEESVQLLGQFKDFMVKYKKDYSSQEEAERRLQIFQENLKTAEKLQALDQGSAEY 219

Query: 79  GINHLSDLTREEMKS 93
           G+   SDLT EE +S
Sbjct: 220 GVTKFSDLTEEEFRS 234


>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
          Length = 475

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 31/86 (36%), Positives = 51/86 (59%), Gaps = 8/86 (9%)

Query: 16  QMKSNNELK--------TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIED 67
           Q+K  NE++         E+ E L QF++F+  ++K Y +++E  +R ++F +NLK  E 
Sbjct: 151 QVKETNEVEDLSINPPLEESVELLGQFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEK 210

Query: 68  LNKGEHGTATYGINHLSDLTREEMKS 93
           L   + G+A YG+   SDLT EE +S
Sbjct: 211 LQSLDQGSAEYGVTKFSDLTEEEFRS 236


>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
          Length = 475

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 31/86 (36%), Positives = 51/86 (59%), Gaps = 8/86 (9%)

Query: 16  QMKSNNELK--------TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIED 67
           Q+K  NE++         E+ E L QF++F+  ++K Y +++E  +R ++F +NLK  E 
Sbjct: 151 QVKETNEVEDLSINPPLEESVELLGQFKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEK 210

Query: 68  LNKGEHGTATYGINHLSDLTREEMKS 93
           L   + G+A YG+   SDLT EE +S
Sbjct: 211 LQSLDQGSAEYGVTKFSDLTEEEFRS 236


>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
 gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
          Length = 496

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 26/60 (43%), Positives = 42/60 (70%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           QF++F++ F K Y +++E+ KR+ +F+ N+K +E L K E GTA YG+   +DLT EE +
Sbjct: 195 QFKEFLKTFKKWYLSEKELLKRYDIFKVNMKTVEMLQKNEQGTAVYGVTFFADLTPEEFR 254


>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
          Length = 1165

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 33/72 (45%), Positives = 43/72 (59%), Gaps = 2/72 (2%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
           ++  HL  FEKF    S+ Y +  E   RF +F++NL  IE LNK E GTA YGI H +D
Sbjct: 852 DHARHL--FEKFKLKHSREYQSTLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGITHFAD 909

Query: 86  LTREEMKSRLGL 97
           +T  E + R GL
Sbjct: 910 MTSAEYRQRTGL 921


>gi|223648298|gb|ACN10907.1| Cathepsin F precursor [Salmo salar]
          Length = 474

 Score = 61.6 bits (148), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 28/72 (38%), Positives = 45/72 (62%)

Query: 22  ELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
           E   ++ E L QF++F+  ++++Y ++EE  +R  VF +NLK  E L   + GTA YG+ 
Sbjct: 164 EESVDSVELLGQFKEFMVRYNRTYSSQEEADRRLRVFHENLKTAEKLQSLDQGTAEYGVT 223

Query: 82  HLSDLTREEMKS 93
             SDLT EE ++
Sbjct: 224 KFSDLTEEEFRT 235


>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
          Length = 325

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 49/77 (63%), Gaps = 3/77 (3%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P++ ++ +E+F RD+ K+Y   E+  KRFA+F+DNL   +     E GTA YG+   SDL
Sbjct: 25  PDNARELYEQFKRDYGKAY-ANEDDQKRFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDL 83

Query: 87  TREEMKSR-LGLNLSKH 102
           T EE +++ LGL + + 
Sbjct: 84  TPEEFEAKYLGLRIDEQ 100


>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
 gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
          Length = 953

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 35/99 (35%), Positives = 53/99 (53%), Gaps = 1/99 (1%)

Query: 5   ASAEATLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLK 63
           A    T A   + +S   LK ++  H+++ F+KF     + Y +  E   RF +F +NL 
Sbjct: 613 APTPVTTAPAVKRRSVRSLKIDDDAHVRRMFDKFRHHHRRQYASSMEHEMRFNIFRNNLF 672

Query: 64  LIEDLNKGEHGTATYGINHLSDLTREEMKSRLGLNLSKH 102
            IE LNK E GTA YG+   +D+T  E ++  GL + KH
Sbjct: 673 KIEQLNKFERGTAKYGVTKFADMTVAEYRAHTGLVVPKH 711


>gi|308454071|ref|XP_003089699.1| hypothetical protein CRE_27946 [Caenorhabditis remanei]
 gi|308269278|gb|EFP13231.1| hypothetical protein CRE_27946 [Caenorhabditis remanei]
          Length = 316

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 26/75 (34%), Positives = 44/75 (58%)

Query: 21 NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
          + + T + ++   F+ F+  + + Y T++E+ KRF +F  N+ L+E  NK + G  TY +
Sbjct: 10 HHIPTPDAKYTNAFQDFLVKYLRKYKTEDELVKRFTIFSRNMDLVERFNKEDLGKVTYEL 69

Query: 81 NHLSDLTREEMKSRL 95
          N  SDL+ EE K  L
Sbjct: 70 NDFSDLSDEEWKKFL 84


>gi|268534724|ref|XP_002632495.1| Hypothetical protein CBG13738 [Caenorhabditis briggsae]
          Length = 341

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 26/75 (34%), Positives = 45/75 (60%)

Query: 21  NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
           +++ T + ++ + F+ F+  + + Y ++EE+ KRF +F  N+ L+E  NK   G  TY +
Sbjct: 37  HQIPTPDVKYTEAFQNFLVKYLREYKSEEEIVKRFTIFSRNMDLVERYNKEGAGKVTYEL 96

Query: 81  NHLSDLTREEMKSRL 95
           N  SDL+ EE K  L
Sbjct: 97  NDFSDLSDEEWKQFL 111


>gi|380025691|ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12163-like [Apis florea]
          Length = 881

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 28/69 (40%), Positives = 44/69 (63%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
           +N ++   FE FI  F+K++ +  E   RF +F+ NLK+I++L   E GTA YG+   +D
Sbjct: 568 QNIKYETLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIIKELQTFEQGTAEYGVTMFAD 627

Query: 86  LTREEMKSR 94
           LT +E K+R
Sbjct: 628 LTPKEFKTR 636


>gi|341879557|gb|EGT35492.1| hypothetical protein CAEBREN_11857 [Caenorhabditis brenneri]
          Length = 340

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 27/75 (36%), Positives = 44/75 (58%)

Query: 21  NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
           +++ T + ++   F+ F+  + + Y ++EE+ KRF +F  N  L+E  NK + G  TY +
Sbjct: 36  HQIPTPDAKYTNAFQDFLVKYMREYKSEEEMVKRFTIFSRNADLVERYNKEDAGKVTYEL 95

Query: 81  NHLSDLTREEMKSRL 95
           N  SDLT EE K  L
Sbjct: 96  NDFSDLTDEEWKQFL 110


>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
          Length = 325

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 27/62 (43%), Positives = 39/62 (62%), Gaps = 1/62 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +E+F RD+ KSY   ++  KRFA+F+DNL   ++    E GTA YG+   SDLT EE  +
Sbjct: 32 YEQFKRDYGKSYANDDD-EKRFAIFKDNLVRAQNYQLQEQGTARYGVTQFSDLTPEEFAA 90

Query: 94 RL 95
          + 
Sbjct: 91 KF 92


>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
 gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
          Length = 1834

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 35/99 (35%), Positives = 53/99 (53%), Gaps = 1/99 (1%)

Query: 5    ASAEATLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLK 63
            A    T A   + +S   LK ++  H+++ F+KF     + Y +  E   RF +F +NL 
Sbjct: 1494 APTPVTTAPAVKRRSVRSLKIDDDAHVRRMFDKFRHHHRRQYASSMEHEMRFNIFRNNLF 1553

Query: 64   LIEDLNKGEHGTATYGINHLSDLTREEMKSRLGLNLSKH 102
             IE LNK E GTA YG+   +D+T  E ++  GL + KH
Sbjct: 1554 KIEQLNKFERGTAKYGVTKFADMTVAEYRAHTGLVVPKH 1592


>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
 gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
          Length = 1810

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 35/99 (35%), Positives = 53/99 (53%), Gaps = 1/99 (1%)

Query: 5    ASAEATLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLK 63
            A    T A   + +S   LK ++  H+++ F+KF     + Y +  E   RF +F +NL 
Sbjct: 1470 APTPVTTAPAVKRRSVRSLKIDDDAHVRRMFDKFRHHHRRQYASSMEHEMRFNIFRNNLF 1529

Query: 64   LIEDLNKGEHGTATYGINHLSDLTREEMKSRLGLNLSKH 102
             IE LNK E GTA YG+   +D+T  E ++  GL + KH
Sbjct: 1530 KIEQLNKFERGTAKYGVTKFADMTVAEYRAHTGLVVPKH 1568


>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
          Length = 325

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 28/67 (41%), Positives = 42/67 (62%), Gaps = 2/67 (2%)

Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
          P++ ++ +E+F RD+ K+Y   E+  KRFA+F+DNL   +     E GTA YG+   SDL
Sbjct: 25 PDNARELYEQFKRDYGKAY-ANEDDQKRFAIFKDNLVRAQQYQTQEQGTAKYGVTQFSDL 83

Query: 87 TREEMKS 93
          T EE  +
Sbjct: 84 TNEEFAA 90


>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
 gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
          Length = 463

 Score = 58.9 bits (141), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 28/78 (35%), Positives = 46/78 (58%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           Q   ++EL+ E  + L  F+ F+  ++K Y  +EE A+R  +F  NLK  + + + + GT
Sbjct: 148 QNVPSSELEDEMLKTLTLFKDFVTTYNKKYSDQEEAARRLQIFSQNLKKAQMIQEMDQGT 207

Query: 76  ATYGINHLSDLTREEMKS 93
           A YG+   SDLT +E +S
Sbjct: 208 AEYGVTKYSDLTEDEFRS 225


>gi|94420703|gb|ABF18679.1| cysteine protease [Medicago sativa]
          Length = 350

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 41/68 (60%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H   F +F   + K Y T +E+ +RF +F +NL+LIE  NK   G  T G+NH +D T E
Sbjct: 47  HAVSFARFANRYGKRYDTVDEMKRRFKIFSENLQLIESTNKKRLGY-TLGVNHFADWTWE 105

Query: 90  EMKS-RLG 96
           E +S RLG
Sbjct: 106 EFRSHRLG 113


>gi|195453400|ref|XP_002073772.1| GK14287 [Drosophila willistoni]
 gi|194169857|gb|EDW84758.1| GK14287 [Drosophila willistoni]
          Length = 610

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 32/80 (40%), Positives = 41/80 (51%), Gaps = 2/80 (2%)

Query: 18  KSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT 77
           K +N    +  EHL  F KF   F + Y    E   R  +F  NL++IE LN  E G+A 
Sbjct: 289 KKHNHHSLDKVEHL--FHKFQIKFERRYVNSVERQMRLRIFRQNLRIIEQLNANEMGSAK 346

Query: 78  YGINHLSDLTREEMKSRLGL 97
           YGI   +D+T  E K R GL
Sbjct: 347 YGITEFADMTSTEYKERTGL 366


>gi|328788558|ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163-like [Apis
           mellifera]
          Length = 881

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 27/61 (44%), Positives = 39/61 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE FI  F+K++ +  E   RF +F+ NLK+I +L   E GTA YG+   +DLT +E K+
Sbjct: 576 FEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIINELQTFEQGTAEYGVTMFADLTPKEFKT 635

Query: 94  R 94
           R
Sbjct: 636 R 636


>gi|194746631|ref|XP_001955780.1| GF16067 [Drosophila ananassae]
 gi|190628817|gb|EDV44341.1| GF16067 [Drosophila ananassae]
          Length = 620

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 31/69 (44%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           EHL  F KF   F + Y +  E   R  +F  NLK IE+LN  E G+A YGI   +D+T 
Sbjct: 311 EHL--FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTS 368

Query: 89  EEMKSRLGL 97
            E K R GL
Sbjct: 369 TEYKERTGL 377


>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1454

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 34   FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
            F+KF    +++Y +  E   RF +F++NL  IE LNK E GTA YGI H +D+T  E ++
Sbjct: 1146 FDKFKTRHNRTYQSSLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGITHFADMTSAEYRA 1205

Query: 94   RLGL 97
            R GL
Sbjct: 1206 RTGL 1209


>gi|308447426|ref|XP_003087427.1| hypothetical protein CRE_22755 [Caenorhabditis remanei]
 gi|308256596|gb|EFP00549.1| hypothetical protein CRE_22755 [Caenorhabditis remanei]
          Length = 324

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 25/75 (33%), Positives = 45/75 (60%)

Query: 21 NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
          + + T + ++   F+ F+  + + Y T++E+ KRF +F  N+ L+E  NK + G  TY +
Sbjct: 18 HHIPTPDAKYTNAFQDFLVKYLREYKTEDELVKRFTIFSRNMDLVETYNKEDLGKVTYEL 77

Query: 81 NHLSDLTREEMKSRL 95
          N  SDL+ +E K+ L
Sbjct: 78 NDFSDLSDKEWKTFL 92


>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
          Length = 325

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 28/67 (41%), Positives = 42/67 (62%), Gaps = 2/67 (2%)

Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
          P++ ++ +E+F RD+ K+Y   E+  KRFA+F+DNL   +     E GTA YG+   SDL
Sbjct: 25 PDNARELYEQFKRDYGKAY-ANEDDQKRFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDL 83

Query: 87 TREEMKS 93
          T EE  +
Sbjct: 84 TPEEFAA 90


>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
          Length = 473

 Score = 57.8 bits (138), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 25/67 (37%), Positives = 41/67 (61%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
           E+ E L  F+ F+  ++++Y ++EE  KR  +F+ N+K  + L   E G+A YGI   SD
Sbjct: 167 ESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSD 226

Query: 86  LTREEMK 92
           LT +E +
Sbjct: 227 LTEDEFR 233


>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
 gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
          Length = 473

 Score = 57.8 bits (138), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 25/67 (37%), Positives = 41/67 (61%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
           E+ E L  F+ F+  ++++Y ++EE  KR  +F+ N+K  + L   E G+A YGI   SD
Sbjct: 167 ESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSD 226

Query: 86  LTREEMK 92
           LT +E +
Sbjct: 227 LTEDEFR 233


>gi|291385469|ref|XP_002709277.1| PREDICTED: cathepsin F [Oryctolagus cuniculus]
          Length = 460

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 24/60 (40%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+KF+R ++++Y +KEE   R +VF  N+   + +   + GTA YGI   SDLT EE ++
Sbjct: 163 FKKFVRTYNRTYESKEEAQWRLSVFASNMVRAQKIQSLDRGTAQYGITKFSDLTEEEFRT 222


>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
 gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
          Length = 475

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 25/65 (38%), Positives = 42/65 (64%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E L QF++F+  ++++Y ++E+  +R  +F +NLK  E L   + GTA YG+   SDLT 
Sbjct: 172 ELLGQFKEFMVRYNRTYSSQEDTDRRLRIFHENLKTAEKLQSLDLGTAEYGVTKFSDLTE 231

Query: 89  EEMKS 93
           EE ++
Sbjct: 232 EEFRT 236


>gi|194898683|ref|XP_001978897.1| GG11133 [Drosophila erecta]
 gi|190650600|gb|EDV47855.1| GG11133 [Drosophila erecta]
          Length = 615

 Score = 57.8 bits (138), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 31/69 (44%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           +HL  F KF   F + Y +  E   R  +F  NLK IE+LN  E G+A YGI   +DLT 
Sbjct: 306 DHL--FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADLTS 363

Query: 89  EEMKSRLGL 97
            E K R GL
Sbjct: 364 SEYKERTGL 372


>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
          Length = 321

 Score = 57.8 bits (138), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 27/61 (44%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +E+F RD+ K Y   E+  KRFA+F+DNL   + L   + GTA YG+   SDLT EE  +
Sbjct: 27 YEQFKRDYGKVY-ANEDDQKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 85

Query: 94 R 94
          +
Sbjct: 86 K 86


>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
          Length = 322

 Score = 57.4 bits (137), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 27/61 (44%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +E+F RD+ K Y   E+  KRFA+F+DNL   + L   + GTA YG+   SDLT EE  +
Sbjct: 27 YEQFKRDYGKVY-ANEDDQKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 85

Query: 94 R 94
          +
Sbjct: 86 K 86


>gi|13507095|gb|AAK28439.1| cysteine protease 3 precursor [Clonorchis sinensis]
          Length = 320

 Score = 57.0 bits (136), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 46/84 (54%), Gaps = 4/84 (4%)

Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
          L  FG + SN   ++EN   L  +E+F   + KSY   ++   RF VF+DNL  I+    
Sbjct: 11 LGFFGVLGSNIP-ESENARQL--YEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQN 66

Query: 71 GEHGTATYGINHLSDLTREEMKSR 94
           E GTA YG+   SDLT +E K R
Sbjct: 67 MERGTAKYGVTQFSDLTAQEFKVR 90


>gi|116242322|gb|ABJ89818.1| cysteine proteinase 3 [Clonorchis sinensis]
          Length = 327

 Score = 57.0 bits (136), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 46/84 (54%), Gaps = 4/84 (4%)

Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
          L  FG + SN   ++EN   L  +E+F   + KSY   ++   RF VF+DNL  I+    
Sbjct: 11 LGFFGVLGSNIP-ESENARQL--YEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQN 66

Query: 71 GEHGTATYGINHLSDLTREEMKSR 94
           E GTA YG+   SDLT +E K R
Sbjct: 67 MERGTAKYGVTQFSDLTAQEFKVR 90


>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
          Length = 325

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 43/68 (63%), Gaps = 2/68 (2%)

Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
          P++ ++ +E+F RD+ K Y   ++  KRFA+F+DNL   + L   + GTA YG+   SDL
Sbjct: 25 PDNARELYEQFKRDYGKVYANDDD-QKRFAIFKDNLVRAQKLQLKDRGTARYGVTQFSDL 83

Query: 87 TREEMKSR 94
          T EE  ++
Sbjct: 84 TPEEFAAK 91


>gi|322801532|gb|EFZ22193.1| hypothetical protein SINV_14496 [Solenopsis invicta]
          Length = 781

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 42/65 (64%), Gaps = 1/65 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y + +E   R  +F +NL +IE L K E  T  YG+N  +D++REE ++
Sbjct: 524 FDDFVATYNRTYSSPDERNLRLQIFRENLGIIELLQKTEQATGRYGVNMFADMSREEFRT 583

Query: 94  R-LGL 97
           R LGL
Sbjct: 584 RYLGL 588


>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
          Length = 322

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 27/61 (44%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +E+F RD+ K Y   E+  KRFA+F+DNL   + L   + GTA YG+   SDLT EE  +
Sbjct: 27 YEQFKRDYGKVY-ANEDDQKRFAIFKDNLVRAQKLQLRDQGTARYGVTQFSDLTPEEFAA 85

Query: 94 R 94
          +
Sbjct: 86 K 86


>gi|30575716|gb|AAP33050.1| cysteine proteinase 3 [Clonorchis sinensis]
 gi|358339353|dbj|GAA47433.1| cathepsin F [Clonorchis sinensis]
          Length = 327

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 46/84 (54%), Gaps = 4/84 (4%)

Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
          L  FG + SN   ++EN   L  +E+F   + KSY   ++   RF VF+DNL  I+    
Sbjct: 11 LGFFGVLGSNIP-ESENARQL--YEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQN 66

Query: 71 GEHGTATYGINHLSDLTREEMKSR 94
           E GTA YG+   SDLT +E K R
Sbjct: 67 MERGTAKYGVTQFSDLTAQEFKVR 90


>gi|351693703|gb|AEQ59229.1| cysteine protease precursor [Clonorchis sinensis]
          Length = 327

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 34/84 (40%), Positives = 46/84 (54%), Gaps = 4/84 (4%)

Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
          L  FG + SN   ++EN   L  +E+F   + KSY   ++   RF VF+DNL  I+    
Sbjct: 11 LGFFGVLGSNIP-ESENARQL--YEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQN 66

Query: 71 GEHGTATYGINHLSDLTREEMKSR 94
           E GTA YG+   SDLT +E K R
Sbjct: 67 MERGTAKYGVTQFSDLTAQEFKVR 90


>gi|440792185|gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
          Length = 331

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 38/64 (59%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          +QF  F++ + KSY + EE  +RFA+F  NL     LN    G   +GI   +D+++EE 
Sbjct: 32 EQFNAFVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEGKTQFGITKFADMSQEEF 91

Query: 92 KSRL 95
          +SR+
Sbjct: 92 QSRV 95


>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
          Length = 478

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 29/62 (46%), Positives = 34/62 (54%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  FI    K Y  K EV KRF VF+ N K+I +L K E GTA YG    SD+T  E K 
Sbjct: 176 FLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKE 235

Query: 94  RL 95
            +
Sbjct: 236 TM 237


>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
          Length = 361

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 31/72 (43%), Positives = 44/72 (61%), Gaps = 3/72 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  F R ++K+Y  KEE   RF +F++NLK I   N+ E GTA YG+   SDL+  E + 
Sbjct: 34  FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 93

Query: 94  R-LGL--NLSKH 102
             LGL  +L++H
Sbjct: 94  HYLGLKKDLAEH 105


>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
          Length = 476

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 25/67 (37%), Positives = 42/67 (62%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
           E+ E L  F++F+  ++K Y ++EE  +R  +F++NLK  E +   + G+A YG+   SD
Sbjct: 170 ESVELLGLFKEFMTKYNKVYSSQEEADRRLQIFKENLKTAEKIQSLDEGSAEYGVTKFSD 229

Query: 86  LTREEMK 92
           LT EE +
Sbjct: 230 LTEEEFR 236


>gi|390178852|ref|XP_003736743.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
 gi|388859612|gb|EIM52816.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
          Length = 477

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 37/69 (53%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           +HL  F KF   F + Y    E   R  +F  NLK IE+LN  E G+A YGI   +D+T 
Sbjct: 168 DHL--FHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADMTS 225

Query: 89  EEMKSRLGL 97
            E K R GL
Sbjct: 226 TEYKERTGL 234


>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
          Length = 478

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 29/62 (46%), Positives = 34/62 (54%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  FI    K Y  K EV KRF VF+ N K+I +L K E GTA YG    SD+T  E K 
Sbjct: 176 FLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKE 235

Query: 94  RL 95
            +
Sbjct: 236 TM 237


>gi|308465858|ref|XP_003095186.1| hypothetical protein CRE_22071 [Caenorhabditis remanei]
 gi|308246042|gb|EFO89994.1| hypothetical protein CRE_22071 [Caenorhabditis remanei]
          Length = 326

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 25/75 (33%), Positives = 43/75 (57%)

Query: 21 NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
          + + T + ++   F+ F+  + + Y T++E+  RF +F  N+ L+E  NK + G  TY +
Sbjct: 20 HHIPTPDAKYTNAFQDFLVKYLREYKTEDELVMRFTIFSRNMDLVERYNKEDLGKVTYEL 79

Query: 81 NHLSDLTREEMKSRL 95
          N  SDL+ EE K  L
Sbjct: 80 NDFSDLSDEEWKKFL 94


>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
 gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
          Length = 477

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 28/59 (47%), Positives = 33/59 (55%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           F  F+    K Y  K EV KRF VF+ N K+I +L K E GTA YG    SD+T  E K
Sbjct: 174 FLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFK 232


>gi|24644155|ref|NP_730901.1| CG12163, isoform A [Drosophila melanogaster]
 gi|32699625|sp|Q9VN93.2|CPR1_DROME RecName: Full=Putative cysteine proteinase CG12163; Flags:
           Precursor
 gi|23170427|gb|AAF52055.2| CG12163, isoform A [Drosophila melanogaster]
 gi|27819876|gb|AAO24986.1| LP08529p [Drosophila melanogaster]
          Length = 614

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           +HL  F KF   F + Y +  E   R  +F  NLK IE+LN  E G+A YGI   +D+T 
Sbjct: 305 DHL--FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTS 362

Query: 89  EEMKSRLGL 97
            E K R GL
Sbjct: 363 SEYKERTGL 371


>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
          Length = 322

 Score = 56.2 bits (134), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 27/61 (44%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +E+F RD+ K Y   E+  KRFA+F+DNL   + L   + GTA YG+   SDLT EE  +
Sbjct: 27 YEQFKRDYGKVY-ANEDDQKRFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 85

Query: 94 R 94
          +
Sbjct: 86 K 86


>gi|444510192|gb|ELV09527.1| Cathepsin F [Tupaia chinensis]
          Length = 597

 Score = 56.2 bits (134), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 23/60 (38%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y TKEE   R +VF  N+   + +   +HGTA YG+   SDLT EE ++
Sbjct: 300 FKNFVTTYNRTYQTKEEAQWRLSVFASNMVRAQKIQALDHGTAQYGVTKFSDLTEEEFRT 359


>gi|195497262|ref|XP_002096026.1| GE25302 [Drosophila yakuba]
 gi|194182127|gb|EDW95738.1| GE25302 [Drosophila yakuba]
          Length = 615

 Score = 56.2 bits (134), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 31/80 (38%), Positives = 41/80 (51%), Gaps = 2/80 (2%)

Query: 18  KSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT 77
           K ++    +  +HL  F KF   F + Y +  E   R  +F  NLK IE LN  E G+A 
Sbjct: 295 KKHSHRALDKADHL--FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAK 352

Query: 78  YGINHLSDLTREEMKSRLGL 97
           YGI   +D+T  E K R GL
Sbjct: 353 YGITEFADMTSSEYKERTGL 372


>gi|146386354|gb|ABQ23965.1| cathepsin F [Oryctolagus cuniculus]
          Length = 248

 Score = 56.2 bits (134), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+KF+R ++++Y +KEE   R +VF  N+   + +   + GTA YGI   SDLT EE ++
Sbjct: 91  FKKFVRTYNRTYESKEEAQWRLSVFASNMVRAQKIQSLDRGTAQYGITKFSDLTEEEFRT 150


>gi|195343593|ref|XP_002038380.1| GM10654 [Drosophila sechellia]
 gi|194133401|gb|EDW54917.1| GM10654 [Drosophila sechellia]
          Length = 615

 Score = 56.2 bits (134), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           +HL  F KF   F + Y +  E   R  +F  NLK IE+LN  E G+A YGI   +D+T 
Sbjct: 306 DHL--FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTS 363

Query: 89  EEMKSRLGL 97
            E K R GL
Sbjct: 364 SEYKERTGL 372


>gi|198453932|ref|XP_002137768.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
 gi|198132577|gb|EDY68326.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
          Length = 629

 Score = 56.2 bits (134), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 37/69 (53%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           +HL  F KF   F + Y    E   R  +F  NLK IE+LN  E G+A YGI   +D+T 
Sbjct: 320 DHL--FHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADMTS 377

Query: 89  EEMKSRLGL 97
            E K R GL
Sbjct: 378 TEYKERTGL 386


>gi|195152617|ref|XP_002017233.1| GL22196 [Drosophila persimilis]
 gi|194112290|gb|EDW34333.1| GL22196 [Drosophila persimilis]
          Length = 627

 Score = 55.8 bits (133), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 37/69 (53%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           +HL  F KF   F + Y    E   R  +F  NLK IE+LN  E G+A YGI   +D+T 
Sbjct: 318 DHL--FHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADMTS 375

Query: 89  EEMKSRLGL 97
            E K R GL
Sbjct: 376 TEYKERTGL 384


>gi|330842703|ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
 gi|325076376|gb|EGC30167.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
          Length = 352

 Score = 55.8 bits (133), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 29/74 (39%), Positives = 41/74 (55%)

Query: 20 NNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
          NN  +T +      F  + +   K Y T EE  KRF+ F+ NLK IE+LN    G A++G
Sbjct: 25 NNAYRTIDGPSKDLFHHWTKQNGKIYETSEEFEKRFSNFKTNLKKIENLNNLHKGKASFG 84

Query: 80 INHLSDLTREEMKS 93
          +N  SDL+ EE  +
Sbjct: 85 MNKYSDLSEEEFSN 98


>gi|24644153|ref|NP_649521.1| CG12163, isoform B [Drosophila melanogaster]
 gi|23170426|gb|AAN13266.1| CG12163, isoform B [Drosophila melanogaster]
 gi|378548248|gb|AFC17498.1| FI18603p1 [Drosophila melanogaster]
          Length = 475

 Score = 55.8 bits (133), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           +HL  F KF   F + Y +  E   R  +F  NLK IE+LN  E G+A YGI   +D+T 
Sbjct: 166 DHL--FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTS 223

Query: 89  EEMKSRLGL 97
            E K R GL
Sbjct: 224 SEYKERTGL 232


>gi|344295816|ref|XP_003419606.1| PREDICTED: cathepsin F [Loxodonta africana]
          Length = 473

 Score = 55.8 bits (133), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 24/60 (40%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y TKEE   R +VF +N+   + L   + GTA YGI   SDLT EE ++
Sbjct: 176 FKNFVTTYNRTYETKEETKWRMSVFANNMIRAQKLQALDQGTAQYGITKFSDLTEEEFRT 235


>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus
           pulchellus]
          Length = 475

 Score = 55.8 bits (133), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 31/72 (43%), Positives = 44/72 (61%), Gaps = 3/72 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  F R ++K+Y  KEE   RF +F++NLK I   N+ E GTA YG+   SDL+  E + 
Sbjct: 166 FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 225

Query: 94  R-LGL--NLSKH 102
             LGL  +L++H
Sbjct: 226 HYLGLKKDLAEH 237


>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
          Length = 1032

 Score = 55.8 bits (133), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 26/65 (40%), Positives = 41/65 (63%), Gaps = 1/65 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE F+  ++++Y T+EE   R ++F +NL +I  L K E GT  YG+N  +D++ EE  +
Sbjct: 727 FENFVNTYNRTYATEEERNLRLSIFRENLGIIRLLRKNEQGTGQYGVNQFADVSTEEFHA 786

Query: 94  -RLGL 97
             LGL
Sbjct: 787 FYLGL 791


>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 359

 Score = 55.8 bits (133), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 30/68 (44%), Positives = 42/68 (61%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H   F +F   + KSY T EE+ +RF++F D+LK+I   NK +  + T G+N  +DLT E
Sbjct: 56  HSLAFARFAHRYGKSYETAEEMKRRFSIFVDSLKMIRSHNK-KGLSYTLGVNEFADLTWE 114

Query: 90  EM-KSRLG 96
           E  K RLG
Sbjct: 115 EFRKHRLG 122


>gi|38048171|gb|AAR09988.1| similar to Drosophila melanogaster CG12163, partial [Drosophila
           yakuba]
          Length = 213

 Score = 55.8 bits (133), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 38/72 (52%), Gaps = 2/72 (2%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
           +  +HL  F KF   F + Y +  E   R  +F  NLK IE LN  E G+A YGI   +D
Sbjct: 31  DKADHL--FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAKYGITEFAD 88

Query: 86  LTREEMKSRLGL 97
           +T  E K R GL
Sbjct: 89  MTSSEYKERTGL 100


>gi|348564702|ref|XP_003468143.1| PREDICTED: cathepsin F-like [Cavia porcellus]
          Length = 462

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 23/60 (38%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+KF+  ++++Y +KEE   R +VF  N+ L + +   + GTA YG+   SDLT EE ++
Sbjct: 165 FKKFVATYNRTYESKEETQWRLSVFTRNMILAQKIQALDRGTAQYGVTKFSDLTEEEFRT 224


>gi|390994427|gb|AFM37363.1| cathepsin F1 [Dictyocaulus viviparus]
          Length = 459

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 29/60 (48%), Positives = 34/60 (56%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           QF  F+    K Y +K +  KRF VF+ NLK I    + E GTA YGI   SDLT EE K
Sbjct: 156 QFVDFMGRHEKVYNSKHDTLKRFRVFKRNLKAIRSWQEKEEGTAVYGITQFSDLTPEEFK 215


>gi|407859260|gb|EKG06954.1| cysteine protease, putative [Trypanosoma cruzi]
          Length = 422

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 29/73 (39%), Positives = 41/73 (56%), Gaps = 2/73 (2%)

Query: 25 TENPEHLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
          T N ++L Q  FEK+I DF K Y   EE  KR A+F++NL  +   N     +   GIN 
Sbjct: 16 TSNEDYLAQYTFEKYIADFGKRYADPEEHRKRAAIFKENLAEVRAFNGVLGRSYRLGINK 75

Query: 83 LSDLTREEMKSRL 95
           SD+T+EE  ++ 
Sbjct: 76 FSDMTKEEFNAKF 88


>gi|357619727|gb|EHJ72186.1| cathepsin [Danaus plexippus]
          Length = 336

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 28/59 (47%), Positives = 41/59 (69%), Gaps = 2/59 (3%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          FE FIR+++K Y +KE+  +RF +F +NLK I DLN  +   A +GIN  +DL++EE K
Sbjct: 41 FENFIREYNKKYDSKEK-EERFKIFVNNLKRINDLNH-KSTNAVHGINKFTDLSKEEFK 97


>gi|305434754|gb|ADM53739.1| cathepsin L2 precursor [Lepeophtheirus salmonis]
          Length = 382

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 29/82 (35%), Positives = 44/82 (53%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          LFG         +     +++FE F++++SKSY  +   + +  VF DNL+ IE+ N   
Sbjct: 15 LFGLAALAAGTSSPTQREIQEFESFVKEYSKSYHNRALRSLKLKVFVDNLREIEEHNANP 74

Query: 73 HGTATYGINHLSDLTREEMKSR 94
            T   GIN  SDLT EE +S+
Sbjct: 75 KRTWDMGINEFSDLTDEEFESK 96


>gi|47212989|emb|CAF92720.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 142

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 40/64 (62%)

Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
          E L QF++F+  +SK Y ++EE   R  +F++NLK  E +   + G+A YGI   SDLT 
Sbjct: 2  ELLGQFKEFMMKYSKVYNSQEEADHRLKIFKENLKTAEKIQSLDEGSAEYGITKFSDLTE 61

Query: 89 EEMK 92
          EE +
Sbjct: 62 EEFR 65


>gi|155966155|gb|ABU41032.1| cysteine proteinase [Lepeophtheirus salmonis]
          Length = 372

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 29/82 (35%), Positives = 44/82 (53%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          LFG         +     +++FE F++++SKSY  +   + +  VF DNL+ IE+ N   
Sbjct: 6  LFGLAALAAGTSSPTQREIQEFESFVKEYSKSYHNRALRSLKLKVFVDNLREIEEHNANP 65

Query: 73 HGTATYGINHLSDLTREEMKSR 94
            T   GIN  SDLT EE +S+
Sbjct: 66 KRTWDMGINEFSDLTDEEFESK 87


>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
          Length = 364

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 25/60 (41%), Positives = 34/60 (56%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
            F  FI    K Y  + E  KRF +F+ NL++I    + + GTA YGIN  +DL+ EE K
Sbjct: 63  HFTSFIERHDKVYRNESEALKRFGIFKRNLEIIRSAQENDKGTAIYGINQFADLSPEEFK 122


>gi|195997891|ref|XP_002108814.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
 gi|190589590|gb|EDV29612.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
          Length = 333

 Score = 55.5 bits (132), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 3/66 (4%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          L +F+ FI D++++Y TKEE   RF  F+ N + I   N      ATYG+N  +D T EE
Sbjct: 33 LARFKSFITDYNRNYTTKEEHEFRFQTFKKNFRRIASTNA---NGATYGVNKFADWTDEE 89

Query: 91 MKSRLG 96
           K  LG
Sbjct: 90 FKELLG 95


>gi|71662527|ref|XP_818269.1| cysteine protease [Trypanosoma cruzi strain CL Brener]
 gi|70883510|gb|EAN96418.1| cysteine protease, putative [Trypanosoma cruzi]
          Length = 434

 Score = 55.5 bits (132), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 28/73 (38%), Positives = 41/73 (56%), Gaps = 2/73 (2%)

Query: 25  TENPEHLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
           T + ++L Q  FEK+I DF K Y   EE  KR A+F++NL  +   N     +   GIN 
Sbjct: 28  TSDEDYLAQYTFEKYIADFGKRYADPEEHRKRAAIFKENLAKVRAFNGALGRSYRLGINK 87

Query: 83  LSDLTREEMKSRL 95
            SD+T+EE  ++ 
Sbjct: 88  FSDMTKEEFNAKF 100


>gi|351710879|gb|EHB13798.1| Cathepsin F [Heterocephalus glaber]
          Length = 482

 Score = 55.1 bits (131), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 22/60 (36%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +K+E   R +VF  N+ L + +   +HGTA YG+   SDLT EE ++
Sbjct: 185 FKNFVATYNRTYESKKEAQWRLSVFTRNMVLAQRIQALDHGTAQYGVTKFSDLTEEEFRT 244


>gi|118429521|gb|ABK91808.1| cysteine proteinase prozyme precursor [Clonorchis sinensis]
          Length = 316

 Score = 55.1 bits (131), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 33/83 (39%), Positives = 45/83 (54%), Gaps = 4/83 (4%)

Query: 12 ALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG 71
            FG + SN   ++EN   L  +E+F   + KSY   ++   RF VF+DNL  I+     
Sbjct: 1  GFFGVLGSNIP-ESENARQL--YEEFKLKYKKSYSNDDD-EYRFRVFKDNLLRIKQFQNM 56

Query: 72 EHGTATYGINHLSDLTREEMKSR 94
          E GTA YG+   SDLT +E K R
Sbjct: 57 ERGTAKYGVTQFSDLTAQEFKVR 79


>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
 gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
          Length = 335

 Score = 55.1 bits (131), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 27/71 (38%), Positives = 48/71 (67%), Gaps = 3/71 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI--EDLNKGEHGTATYGINHLSDLTREEM 91
           FE F+ +++K+Y +  E  KR+++F+DNL  I  ++ N  +  TATYGIN  SDL++ E+
Sbjct: 35  FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYGINKFSDLSKSEL 94

Query: 92  KSRL-GLNLSK 101
            ++  GL++ +
Sbjct: 95  IAKFTGLSIPQ 105


>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 476

 Score = 55.1 bits (131), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 27/72 (37%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 23  LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
           L+TE  E +  +E+++    K Y    E  KRF +F+DNL+ I+D N  E  T   G+N 
Sbjct: 49  LRTEE-ELMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLNR 107

Query: 83  LSDLTREEMKSR 94
            +DLT EE +++
Sbjct: 108 FADLTNEEYRAK 119


>gi|407424636|gb|EKF39072.1| cysteine protease, putative [Trypanosoma cruzi marinkellei]
          Length = 438

 Score = 55.1 bits (131), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 30/73 (41%), Positives = 40/73 (54%), Gaps = 2/73 (2%)

Query: 25  TENPEHLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
           T N ++L Q  FEK+I DF K Y   EE  KR A+F +NL  I   N     +   GIN 
Sbjct: 32  TYNEDYLAQYTFEKYISDFGKRYADPEEHRKRNAIFNENLAKIRAFNGVLGRSYRLGINK 91

Query: 83  LSDLTREEMKSRL 95
            SD+T+EE  ++ 
Sbjct: 92  FSDMTKEEFNAKF 104


>gi|146215994|gb|ABQ10199.1| cysteine protease Cp1 [Actinidia deliciosa]
          Length = 358

 Score = 55.1 bits (131), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 30/71 (42%), Positives = 40/71 (56%), Gaps = 2/71 (2%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           +  H   F +F   + K Y T EE   RFA+F +NLKLI   NK +  + T G+NH +D 
Sbjct: 52  DSRHALSFARFAHRYGKRYETAEETKLRFAIFSENLKLIRSHNK-KGLSYTLGVNHFADW 110

Query: 87  TREEMKS-RLG 96
           T EE +  RLG
Sbjct: 111 TWEEFRRHRLG 121


>gi|242074968|ref|XP_002447420.1| hypothetical protein SORBIDRAFT_06g000780 [Sorghum bicolor]
 gi|241938603|gb|EES11748.1| hypothetical protein SORBIDRAFT_06g000780 [Sorghum bicolor]
          Length = 381

 Score = 55.1 bits (131), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 39/64 (60%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +++F  ++    +SYPT EE  +RF ++ DN+K IE +N+    T T G N  +DLT +E
Sbjct: 56  MERFHAWMAAHGRSYPTAEEKLRRFQIYRDNVKFIEAINRDTTKTFTCGENQFTDLTHQE 115

Query: 91  MKSR 94
             +R
Sbjct: 116 FLAR 119


>gi|345783063|ref|XP_533219.3| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Canis lupus
           familiaris]
          Length = 490

 Score = 55.1 bits (131), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 23/60 (38%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F++F+  ++++Y TKEE   R +VF +N+   + +   + GTA YGI   SDLT EE ++
Sbjct: 192 FKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRT 251


>gi|308454069|ref|XP_003089698.1| hypothetical protein CRE_27947 [Caenorhabditis remanei]
 gi|308269277|gb|EFP13230.1| hypothetical protein CRE_27947 [Caenorhabditis remanei]
          Length = 243

 Score = 55.1 bits (131), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 43/73 (58%)

Query: 23  LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
           + T + ++   F+ F+  + + Y T++E+ KRF +F  N+ L+E  NK + G  TY +N 
Sbjct: 39  IPTPDAKYTNAFQDFLVKYLREYKTEDELVKRFTIFSRNMDLVERYNKEDLGKVTYELND 98

Query: 83  LSDLTREEMKSRL 95
            SDL+ EE K  L
Sbjct: 99  FSDLSDEEWKKFL 111


>gi|403293523|ref|XP_003937763.1| PREDICTED: cathepsin W [Saimiri boliviensis boliviensis]
          Length = 373

 Score = 55.1 bits (131), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/70 (41%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F R F++SY T EE A+R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKEAFKFFQRQFNRSYLTPEEHARRLDIFAHNLAQAQQLQEEDFGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
          Length = 477

 Score = 54.7 bits (130), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 27/62 (43%), Positives = 33/62 (53%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  FI    K Y  K EV KRF  F+ N K+I +L K E G+A YG    SD+T  E K 
Sbjct: 174 FLDFIDRHEKRYSNKREVLKRFRTFKKNAKVIRELQKNEQGSAVYGFTKFSDMTTMEFKQ 233

Query: 94  RL 95
            +
Sbjct: 234 TM 235


>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
          Length = 537

 Score = 54.7 bits (130), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 29/67 (43%), Positives = 39/67 (58%), Gaps = 2/67 (2%)

Query: 34  FEKFIRDFSKSYPTKE-EVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           F  FI  +   Y     E+ KRF +F++N+K I +LN  E GT  Y +   +DLT EE K
Sbjct: 231 FFNFITTYKPEYINDHVEMTKRFEIFKENVKKIHELNTHERGTGVYAVTRFTDLTYEEFK 290

Query: 93  SR-LGLN 98
           S+ LGLN
Sbjct: 291 SKYLGLN 297


>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
 gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
          Length = 475

 Score = 54.7 bits (130), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 28/62 (45%), Positives = 32/62 (51%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  FI    K Y  K EV KRF  F+ N K I +L K E GTA YG    SD+T  E K 
Sbjct: 172 FLDFIDRHEKRYSNKREVLKRFRTFKKNAKAIRELQKNEQGTAVYGFTKFSDMTTMEFKQ 231

Query: 94  RL 95
            +
Sbjct: 232 TM 233


>gi|152926446|gb|ABS32280.1| cathepsin L protease inhibitor 2 [Diaprepes abbreviatus]
          Length = 91

 Score = 54.3 bits (129), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 50/76 (65%), Gaps = 6/76 (7%)

Query: 25 TENPEHL---KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY-- 78
          T+ P +L   +++EKF   F+++Y + +E AKRF +F+ NL+ I + N K E G  T+  
Sbjct: 5  TKAPSYLSDQEEWEKFKTGFNRNYDSSDEEAKRFNIFQQNLQSIREHNEKFERGETTFTQ 64

Query: 79 GINHLSDLTREEMKSR 94
          GIN  +DLT+EE K+R
Sbjct: 65 GINQFTDLTKEEFKAR 80


>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
          Length = 327

 Score = 54.3 bits (129), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 26/61 (42%), Positives = 37/61 (60%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +E+F R + K Y   E+  KRFA+F+DNL   + L   + GTA YG+   SDLT EE  +
Sbjct: 32 YEQFKRGYGKVY-ANEDDQKRFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 90

Query: 94 R 94
          +
Sbjct: 91 K 91


>gi|114679921|ref|YP_758371.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
 gi|39598652|gb|AAR28838.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
          Length = 359

 Score = 54.3 bits (129), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 29/75 (38%), Positives = 47/75 (62%), Gaps = 4/75 (5%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P+ ++  FE+F+RD++++Y    E  +R+  F  NLK I  LN  +   A+Y IN  SDL
Sbjct: 47  PDRMRDYFERFVRDYNRTYIDSVEREQRYETFVQNLKNINRLN--QKSQASYDINKFSDL 104

Query: 87  TREEMKSRL-GLNLS 100
           T++E+ +R  GL+ S
Sbjct: 105 TKDEVVARFTGLDPS 119


>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score = 54.3 bits (129), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 30/82 (36%), Positives = 45/82 (54%), Gaps = 1/82 (1%)

Query: 20  NNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
           + EL       +++ E+++  + K Y    E  KRF +F+DN++ IE  N   +     G
Sbjct: 27  SRELHETETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVEFIESFNAAGNKPYKLG 86

Query: 80  INHLSDLTREEMK-SRLGLNLS 100
           +NHL+DLT EE K SR GL  S
Sbjct: 87  VNHLADLTIEEFKASRNGLKRS 108


>gi|6649575|gb|AAF21461.1|U69120_1 cysteine proteinase PWCP1 [Paragonimus westermani]
          Length = 427

 Score = 53.9 bits (128), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 28/63 (44%), Positives = 40/63 (63%), Gaps = 2/63 (3%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE+F R F KSY +  + AKR+A+F+ NL  ++ + + E GTA YGI   SDL+ EE + 
Sbjct: 127 FEEFQRKFRKSYSS--DTAKRYALFKYNLLKMQLIQRLEKGTANYGITKFSDLSAEEFRH 184

Query: 94  RLG 96
            L 
Sbjct: 185 SLA 187


>gi|294661899|gb|ADF28790.1| RE01479p [Drosophila melanogaster]
          Length = 334

 Score = 53.9 bits (128), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F KF   F + Y +  E   R  +F  NLK IE+LN  E G+A YGI   +D+T  E K 
Sbjct: 169 FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKE 228

Query: 94  RLGL 97
           R GL
Sbjct: 229 RTGL 232


>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
          Length = 326

 Score = 53.9 bits (128), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          ++  +    +++ +N   L  +E+F   + K+Y   ++   RF +F+DNL   + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFTLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69

Query: 73 HGTATYGINHLSDLTREEMKSR 94
           GTA YG+   SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91


>gi|357624871|gb|EHJ75484.1| putative 26,29kDa proteinase [Danaus plexippus]
          Length = 553

 Score = 53.9 bits (128), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 30/94 (31%), Positives = 50/94 (53%), Gaps = 6/94 (6%)

Query: 14  FGQMKSNNELKT----ENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDL 68
           F  M + N +K      + EH+  +F++F+   +K Y ++ E  KR  +F  NL+LI   
Sbjct: 223 FRHMATFNPMKEFVHPASDEHVHHEFDRFVNKHNKQYASEVEKTKRINIFRQNLRLIHSH 282

Query: 69  NKGEHGTATYGINHLSDLTREEMKSRLGLNLSKH 102
           N+   G  +  +NHL+D T EE+ +R G   + H
Sbjct: 283 NRAHRGF-SLAVNHLADHTDEELAARRGRRYTGH 315


>gi|332373716|gb|AEE61999.1| unknown [Dendroctonus ponderosae]
          Length = 346

 Score = 53.5 bits (127), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 26/60 (43%), Positives = 37/60 (61%), Gaps = 1/60 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE-HGTATYGINHLSDLTREE 90
          +QF +++ DF+KSYP + E   RFA F+ +L  IE LN  +   +A YG+   SD T EE
Sbjct: 39 EQFHEYLSDFNKSYPQEAEFQFRFAAFKKSLANIEQLNANKTKSSAQYGLTKFSDFTAEE 98


>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
          Length = 378

 Score = 53.5 bits (127), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 28/88 (31%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 11  LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           L L   +   N ++  N + +  +E ++ +  KSY + +E   RF +F++NL++I+D N 
Sbjct: 19  LILSSAIDIENSVQRTNDQVMAMYESWLVEHGKSYNSLDEKEMRFEIFKENLRIIDDHNA 78

Query: 71  GEHGTATYGINHLSDLTREEMKSR-LGL 97
             + + + G+N  +DLT EE +S  LGL
Sbjct: 79  DANRSYSLGLNRFADLTDEEYRSTYLGL 106


>gi|301784869|ref|XP_002927853.1| PREDICTED: cathepsin F-like [Ailuropoda melanoleuca]
          Length = 394

 Score = 53.5 bits (127), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 22/60 (36%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F++F+  ++++Y +KEE   R +VF +N+   + +   + GTA YGI   SDLT EE ++
Sbjct: 97  FKEFVTTYNRTYESKEEAEWRMSVFSNNVMRAQKIQALDRGTAQYGITKFSDLTEEEFRT 156


>gi|355681647|gb|AER96812.1| cathepsin F [Mustela putorius furo]
          Length = 408

 Score = 53.5 bits (127), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F++F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 112 FKEFVTTYNRTYESKEETQWRMSVFSNNMMRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 171


>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
          Length = 586

 Score = 53.5 bits (127), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 1/68 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE FI   +K Y + EE ++RF +F  N+K ++ L   E G+A YG    +DLT+ E K 
Sbjct: 280 FENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFADLTKNEFKK 339

Query: 94  R-LGLNLS 100
           + LGL+ S
Sbjct: 340 KYLGLDSS 347


>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
          Length = 567

 Score = 53.5 bits (127), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 23/64 (35%), Positives = 36/64 (56%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E +  F+ F+  ++KSY    E  +R  +F  NL+L   L + + G+A YG+   SDLT 
Sbjct: 265 ELISLFKDFLTTYNKSYANATETQRRLGIFARNLELAHKLQELDQGSAQYGVTKFSDLTE 324

Query: 89  EEMK 92
           EE +
Sbjct: 325 EEFR 328


>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
          Length = 358

 Score = 53.5 bits (127), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 38/87 (43%), Positives = 46/87 (52%), Gaps = 4/87 (4%)

Query: 13  LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
           L  Q+  N E    N EH   F  F   FSKSY TKEE   RF VF+ NL +   L++  
Sbjct: 24  LIRQVVDNEEDHLLNAEH--HFTSFKSKFSKSYATKEEHDYRFGVFKANL-IKAKLHQKL 80

Query: 73  HGTATYGINHLSDLTREEMKSR-LGLN 98
             TA +GI   SDLT  E + + LGLN
Sbjct: 81  DPTAEHGITKFSDLTASEFRRQFLGLN 107


>gi|388521567|gb|AFK48845.1| unknown [Medicago truncatula]
          Length = 343

 Score = 53.5 bits (127), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/62 (43%), Positives = 39/62 (62%), Gaps = 2/62 (3%)

Query: 36  KFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS-R 94
           +F   + K Y T +E+ +RF +F +NL+LI+  NK   G  T G+NH +D T EE +S R
Sbjct: 46  RFANRYGKRYDTVDEMKRRFKIFSENLQLIKSTNKKRLGY-TLGVNHFADWTWEEFRSHR 104

Query: 95  LG 96
           LG
Sbjct: 105 LG 106


>gi|410974700|ref|XP_003993781.1| PREDICTED: cathepsin F [Felis catus]
          Length = 459

 Score = 53.5 bits (127), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 22/60 (36%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F++F+  ++++Y T+EE   R +VF +N+   + +   + GTA YGI   SDLT EE ++
Sbjct: 162 FKEFVTTYNRTYGTQEEAQWRLSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRA 221


>gi|356565778|ref|XP_003551114.1| PREDICTED: thiol protease aleurain-like [Glycine max]
          Length = 353

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H   F +F R   K Y + +E+  RF +F DNLKLI   N+    T T G+NH +D T E
Sbjct: 50  HALSFARFARRHGKRYRSVDEIRNRFRIFSDNLKLIRSTNR-RSLTYTLGVNHFADWTWE 108

Query: 90  EM-KSRLG 96
           E  + +LG
Sbjct: 109 EFTRHKLG 116


>gi|323454466|gb|EGB10336.1| hypothetical protein AURANDRAFT_22962 [Aureococcus anophagefferens]
          Length = 416

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 29/83 (34%), Positives = 44/83 (53%), Gaps = 1/83 (1%)

Query: 12  ALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG 71
           A F ++KS      +  +    F++FI ++SKSY T  E   RF +F  NL  I+ LN  
Sbjct: 67  ATFAKLKSVTYGTLDTRDQKSLFDQFIDEYSKSYDTTHEYNDRFTIFSKNLNYIDALNT- 125

Query: 72  EHGTATYGINHLSDLTREEMKSR 94
           ++  A +G+N  +D T EE   R
Sbjct: 126 QNPHALFGLNVFADQTEEERSKR 148


>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 369

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 31/67 (46%), Positives = 41/67 (61%), Gaps = 2/67 (2%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           KQFE FI++F K Y T EE   RF VF+ NL L    ++    TA++G+   SDLT EE 
Sbjct: 54  KQFESFIKEFGKVYHTVEEYEHRFKVFKSNL-LRALKHQALDPTASHGVTMFSDLTEEEF 112

Query: 92  KSR-LGL 97
            ++ LGL
Sbjct: 113 ATQYLGL 119


>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
          Length = 586

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 1/68 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE FI   +K Y + EE ++RF +F  N+K ++ L   E G+A YG    +DLT+ E K 
Sbjct: 280 FENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFADLTKNEFKK 339

Query: 94  R-LGLNLS 100
           + LGL+ S
Sbjct: 340 KYLGLDSS 347


>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
          Length = 381

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 25/83 (30%), Positives = 48/83 (57%)

Query: 11  LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           L L   +   N ++  N + +  +E ++ +  KSY + +E   RF +F++NL++I+D N 
Sbjct: 21  LILSSALDIKNSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRIIDDHNA 80

Query: 71  GEHGTATYGINHLSDLTREEMKS 93
             + + + G+N  +DLT EE +S
Sbjct: 81  DANRSYSLGLNRFADLTDEEYRS 103


>gi|55979119|gb|AAV69023.1| cysteine protease [Opisthorchis viverrini]
 gi|224923980|gb|ACN68966.1| cathepsin F-like cysteine protease [Opisthorchis viverrini]
          Length = 326

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +E+F   + K+Y   ++   RF +F+DNL+  + L   E GTA YG+   SDLT EE K+
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFRIFKDNLERAKRLQAMEQGTAEYGVTQFSDLTSEEFKT 90

Query: 94 R 94
          R
Sbjct: 91 R 91


>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
          Length = 340

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 30/84 (35%), Positives = 45/84 (53%), Gaps = 2/84 (2%)

Query: 14  FGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEH 73
             +  S+  L T + +H+  F  F+  FSK+Y +KEE   R   ++ N+  I + N    
Sbjct: 23  LSETSSSQSLYTADQDHI-DFVHFMSRFSKAYKSKEEFEMRLQQYKSNIAFINNHNSQND 81

Query: 74  GTA-TYGINHLSDLTREEMKSRLG 96
           GT+ T G NHL+D T +E K  LG
Sbjct: 82  GTSFTLGPNHLADYTHDEYKKMLG 105


>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
          Length = 887

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/65 (41%), Positives = 41/65 (63%), Gaps = 1/65 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  F+  ++++Y T EE   R  +F +NL +I+ L K E GTA Y +N  +D++ EE +S
Sbjct: 582 FNNFVVTYNRTYSTPEERNLRLRIFRENLGIIQLLRKTERGTAHYDVNMFADMSPEEFRS 641

Query: 94  R-LGL 97
           R LGL
Sbjct: 642 RYLGL 646


>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 445

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 24/67 (35%), Positives = 40/67 (59%)

Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
          NPE +K FE+++ +  K+Y    E  KRF +F DNLK +++ N   + +   G+   +DL
Sbjct: 30 NPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFADL 89

Query: 87 TREEMKS 93
          T EE ++
Sbjct: 90 TNEEFRA 96


>gi|335281454|ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]
 gi|350579927|ref|XP_003480717.1| PREDICTED: cathepsin F-like [Sus scrofa]
          Length = 490

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 22/60 (36%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F++F+  ++++Y TKEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 193 FKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDLTEEEFRT 252


>gi|6649577|gb|AAF21462.1|U69121_1 cysteine proteinase PWCP2 [Paragonimus westermani]
          Length = 260

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/61 (42%), Positives = 37/61 (60%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +E+F R + K Y   E+  KRFA+F+DNL   + L   + GTA YG+   SDLT EE  +
Sbjct: 6  YEQFKRXYGKVY-ANEDDQKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 64

Query: 94 R 94
          +
Sbjct: 65 K 65


>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
          Length = 326

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          ++  +    +++ +N   L  +E+F   + K+Y   ++   RF +F+DNL   + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69

Query: 73 HGTATYGINHLSDLTREEMKSR 94
           GTA YG+   SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91


>gi|4760897|gb|AAD29130.1| cysteine proteinase 1 precursor [Clonorchis sinensis]
          Length = 328

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          ++  +    +++ +N   L  +E+F   + K+Y   ++   RF +F+DNL   + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69

Query: 73 HGTATYGINHLSDLTREEMKSR 94
           GTA YG+   SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91


>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 391

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 30/79 (37%), Positives = 47/79 (59%), Gaps = 8/79 (10%)

Query: 27  NPEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
           +PE L Q       FE+++  + K+Y + EE  +RF VF+DNL  I++ N+ E  +   G
Sbjct: 72  SPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLG 131

Query: 80  INHLSDLTREEMKSR-LGL 97
           +N  +DLT +E K+  LGL
Sbjct: 132 LNAFADLTHDEFKATYLGL 150


>gi|326435242|gb|EGD80812.1| hypothetical protein PTSG_11722 [Salpingoeca sp. ATCC 50818]
          Length = 372

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 26/63 (41%), Positives = 36/63 (57%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE F  +F K+Y + EE   R ++FE  L  ++  N+ E  T   G+NH+SD T EE K 
Sbjct: 34 FEDFKLEFGKTYASHEEHEYRRSIFEQTLATVKAHNRDESKTWKQGVNHMSDWTDEEFKR 93

Query: 94 RLG 96
           LG
Sbjct: 94 LLG 96


>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
          Length = 326

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          ++  +    +++ +N   L  +E+F   + K+Y   ++   RF +F+DNL   + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69

Query: 73 HGTATYGINHLSDLTREEMKSR 94
           GTA YG+   SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91


>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
          Length = 326

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          ++  +    +++ +N   L  +E+F   + K+Y   ++   RF +F+DNL   + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69

Query: 73 HGTATYGINHLSDLTREEMKSR 94
           GTA YG+   SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91


>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
          Length = 326

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          ++  +    +++ +N   L  +E+F   + K+Y   ++   RF +F+DNL   + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69

Query: 73 HGTATYGINHLSDLTREEMKSR 94
           GTA YG+   SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91


>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          ++  +    +++ +N   L  +E+F   + K+Y   ++   RF +F+DNL   + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69

Query: 73 HGTATYGINHLSDLTREEMKSR 94
           GTA YG+   SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91


>gi|85068708|gb|ABC69434.1| cysteine protease [Clonorchis sinensis]
 gi|85068710|gb|ABC69435.1| cysteine protease [Clonorchis sinensis]
          Length = 328

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          ++  +    +++ +N   L  +E+F   + K+Y   ++   RF +F+DNL   + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69

Query: 73 HGTATYGINHLSDLTREEMKSR 94
           GTA YG+   SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91


>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          ++  +    +++ +N   L  +E+F   + K+Y   ++   RF +F+DNL   + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69

Query: 73 HGTATYGINHLSDLTREEMKSR 94
           GTA YG+   SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91


>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          ++  +    +++ +N   L  +E+F   + K+Y   ++   RF +F+DNL   + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69

Query: 73 HGTATYGINHLSDLTREEMKSR 94
           GTA YG+   SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91


>gi|85068706|gb|ABC69433.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          ++  +    +++ +N   L  +E+F   + K+Y   ++   RF +F+DNL   + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69

Query: 73 HGTATYGINHLSDLTREEMKSR 94
           GTA YG+   SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91


>gi|85068712|gb|ABC69436.1| cysteine protease [Clonorchis sinensis]
          Length = 328

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          ++  +    +++ +N   L  +E+F   + K+Y   ++   RF +F+DNL   + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69

Query: 73 HGTATYGINHLSDLTREEMKSR 94
           GTA YG+   SDLT EE K+R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFKTR 91


>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score = 52.8 bits (125), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 27/67 (40%), Positives = 40/67 (59%), Gaps = 1/67 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++ ++ K Y    E  KRF +F+DN++ IE  N   +     G+NHL+DLT EE 
Sbjct: 36  ERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEF 95

Query: 92  K-SRLGL 97
           K SR GL
Sbjct: 96  KDSRNGL 102


>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
          Precursor
 gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
          Length = 351

 Score = 52.8 bits (125), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 21/64 (32%), Positives = 39/64 (60%)

Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
          N   +K+FE+++ ++ + Y   +E  +RF +F++N+K IE  N     + T GIN  +D+
Sbjct: 30 NDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDM 89

Query: 87 TREE 90
          T+ E
Sbjct: 90 TKSE 93


>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
          Length = 378

 Score = 52.8 bits (125), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 26/80 (32%), Positives = 49/80 (61%), Gaps = 1/80 (1%)

Query: 21  NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
           N ++  N + +  +E ++ +  KSY + +E   RF +F++NL++I+D N   + + + G+
Sbjct: 29  NSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRIIDDHNADANRSYSLGL 88

Query: 81  NHLSDLTREEMKSR-LGLNL 99
           N  +DLT EE +S  LGL +
Sbjct: 89  NRFADLTDEEYRSTYLGLKM 108


>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
 gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|219884977|gb|ACL52863.1| unknown [Zea mays]
          Length = 377

 Score = 52.8 bits (125), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 30/79 (37%), Positives = 47/79 (59%), Gaps = 8/79 (10%)

Query: 27  NPEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
           +PE L Q       FE+++  + K+Y + EE  +RF VF+DNL  I++ N+ E  +   G
Sbjct: 58  SPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLG 117

Query: 80  INHLSDLTREEMKSR-LGL 97
           +N  +DLT +E K+  LGL
Sbjct: 118 LNAFADLTHDEFKATYLGL 136


>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
           kowalevskii]
          Length = 352

 Score = 52.8 bits (125), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 22/59 (37%), Positives = 34/59 (57%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           F+ F++ + K Y T+EE   R+ +F+DNL   E L + E  T  YG+    DL+ EE +
Sbjct: 54  FQDFMKTYDKKYDTEEEHQLRYQIFQDNLLKAERLQQTEQATGQYGVTKFMDLSEEEFR 112


>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
           Full=Turgor-responsive protein 15A; Flags: Precursor
 gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
          Length = 363

 Score = 52.8 bits (125), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 36/83 (43%), Positives = 44/83 (53%), Gaps = 4/83 (4%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           Q+  N E    N EH   F  F   FSKSY TKEE   RF VF+ NL +   L++    T
Sbjct: 32  QVVDNEEDHLLNAEH--HFTSFKSKFSKSYATKEEHDYRFGVFKSNL-IKAKLHQNRDPT 88

Query: 76  ATYGINHLSDLTREEMKSR-LGL 97
           A +GI   SDLT  E + + LGL
Sbjct: 89  AEHGITKFSDLTASEFRRQFLGL 111


>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
          Length = 344

 Score = 52.8 bits (125), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 27/67 (40%), Positives = 40/67 (59%), Gaps = 1/67 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++ ++ K Y    E  KRF +F+DN++ IE  N   +     G+NHL+DLT EE 
Sbjct: 36  ERHENWMAEYGKIYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEF 95

Query: 92  K-SRLGL 97
           K SR GL
Sbjct: 96  KDSRNGL 102


>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 337

 Score = 52.8 bits (125), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 37/105 (35%), Positives = 54/105 (51%), Gaps = 11/105 (10%)

Query: 1   MAEDASAEATLALF-------GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAK 53
           MA  +  + T+ALF        QM S    +T   E   + E+++ ++ K Y    E  K
Sbjct: 1   MAFTSQKQYTIALFLLLALGIPQMMSRKLHETSMRE---RHEQWMAEYGKVYKDAAEKEK 57

Query: 54  RFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK-SRLGL 97
           RF +F+ N++ IE  N   +     G+NHL+DLT EE K SR GL
Sbjct: 58  RFLIFKHNVEFIESFNAAANKPYKLGVNHLADLTVEEFKASRNGL 102


>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score = 52.8 bits (125), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 27/67 (40%), Positives = 40/67 (59%), Gaps = 1/67 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++ ++ K Y    E  KRF +F+DN++ IE  N   +     G+NHL+DLT EE 
Sbjct: 36  ERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEF 95

Query: 92  K-SRLGL 97
           K SR GL
Sbjct: 96  KDSRNGL 102


>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score = 52.8 bits (125), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 28/63 (44%), Positives = 36/63 (57%), Gaps = 5/63 (7%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN----KGEHGTATYGINHLSDLTRE 89
          F+ F   F K Y + EE A+RFA+F DNL  I   N    +G H T T G+N  +DLT E
Sbjct: 20 FDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLH-THTVGVNQFADLTNE 78

Query: 90 EMK 92
          E +
Sbjct: 79 EYR 81


>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 474

 Score = 52.8 bits (125), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 27/79 (34%), Positives = 45/79 (56%), Gaps = 7/79 (8%)

Query: 15  GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
           G ++S +E+K       + FE ++    KSY   +E  KRF +F DNLK I++ N  E+ 
Sbjct: 38  GLVRSEDEVK-------EMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENR 90

Query: 75  TATYGINHLSDLTREEMKS 93
           +   G+N  +D+T EE ++
Sbjct: 91  SYKLGLNRFADITNEEYRT 109


>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 496

 Score = 52.4 bits (124), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 39/68 (57%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           + E +  +E+++    K Y    E  KRF +F+DNL+ I+D N  E  T   G+N  +DL
Sbjct: 72  DEELMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADL 131

Query: 87  TREEMKSR 94
           T EE +++
Sbjct: 132 TNEEYRAK 139


>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 369

 Score = 52.4 bits (124), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 30/67 (44%), Positives = 41/67 (61%), Gaps = 2/67 (2%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           K+FE F++DF K Y + EE   RF VF+ NL L    ++    TA++G+   SDLT EE 
Sbjct: 54  KRFESFMKDFGKVYHSVEEYEHRFGVFKSNL-LKALKHQALDPTASHGVTMFSDLTEEEF 112

Query: 92  KSR-LGL 97
            S+ LGL
Sbjct: 113 TSKYLGL 119


>gi|395851695|ref|XP_003798388.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Otolemur garnettii]
          Length = 491

 Score = 52.4 bits (124), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 22/65 (33%), Positives = 39/65 (60%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           + L  F+ F+  ++++Y +KEE   R ++F +N+   + +   + GTA YGI   SDLT 
Sbjct: 189 QMLSVFKNFLTTYNRTYESKEETQWRLSIFINNMVRAQKIQALDQGTARYGITKFSDLTE 248

Query: 89  EEMKS 93
           EE ++
Sbjct: 249 EEFRT 253


>gi|338712411|ref|XP_001491536.3| PREDICTED: cathepsin F [Equus caballus]
          Length = 459

 Score = 52.4 bits (124), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 36/60 (60%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y TKEE   R ++F  N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 162 FKHFVTTYNRTYETKEEAQWRMSIFASNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 221


>gi|426252044|ref|XP_004019728.1| PREDICTED: cathepsin W [Ovis aries]
          Length = 375

 Score = 52.4 bits (124), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
          P+ LK+ F  F   +++SYP   E A+R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35 PQELKEVFRLFQMQYNRSYPNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDL 94

Query: 87 TREE 90
          T EE
Sbjct: 95 TEEE 98


>gi|195054270|ref|XP_001994049.1| GH22731 [Drosophila grimshawi]
 gi|193895919|gb|EDV94785.1| GH22731 [Drosophila grimshawi]
          Length = 617

 Score = 52.4 bits (124), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 36/69 (52%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           EHL  F KF   + + Y    E   R  +F  NL+ IE+LN  E G+A YGI   +D+T 
Sbjct: 308 EHL--FHKFQLKYKRQYANTAEHQMRLRIFRQNLRTIEELNANERGSAKYGITQFADMTS 365

Query: 89  EEMKSRLGL 97
            E K   GL
Sbjct: 366 TEYKLHAGL 374


>gi|431910221|gb|ELK13294.1| Cathepsin F [Pteropus alecto]
          Length = 458

 Score = 52.4 bits (124), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 22/60 (36%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F++F+  ++++Y TKEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 161 FKEFVITYNRTYETKEEAQWRMSVFINNMMRAQKIQALDRGTARYGVTKFSDLTEEEFRT 220


>gi|334347644|ref|XP_001379528.2| PREDICTED: cathepsin W-like [Monodelphis domestica]
          Length = 619

 Score = 52.4 bits (124), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 25/73 (34%), Positives = 38/73 (52%)

Query: 18  KSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT 77
           +S  +L     + + QF+ F   ++KSY    E  +RF +F DNL   + L +   G A 
Sbjct: 250 QSFEDLPPATQDLMDQFKAFQIQYNKSYADPAEQERRFEIFADNLAWAQQLTEKHGGMAQ 309

Query: 78  YGINHLSDLTREE 90
           +G+   SDLT EE
Sbjct: 310 FGVTQFSDLTEEE 322


>gi|438000427|ref|YP_007250532.1| v-cath protein [Thysanoplusia orichalcea NPV]
 gi|429842964|gb|AGA16276.1| v-cath protein [Thysanoplusia orichalcea NPV]
          Length = 323

 Score = 52.0 bits (123), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K+Y ++ E  +RF +F+ NL   E +NK ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKNYSSETEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>gi|118397739|ref|XP_001031201.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila]
 gi|89285525|gb|EAR83538.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila SB210]
          Length = 352

 Score = 52.0 bits (123), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 29/76 (38%), Positives = 45/76 (59%), Gaps = 1/76 (1%)

Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
          GQ   +    ++  +H ++FE+F + FSK+Y ++E    RFA F +NL  I+ LN  E  
Sbjct: 20 GQSNFDKNTFSQKHQHHQKFEQFKKSFSKAYESEEVQQFRFATFVENLNEIDRLN-AEVT 78

Query: 75 TATYGINHLSDLTREE 90
          TA + I+  SD T+EE
Sbjct: 79 TAQFDISFFSDYTKEE 94


>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 351

 Score = 52.0 bits (123), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 29/65 (44%), Positives = 43/65 (66%), Gaps = 2/65 (3%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FEK++    K+Y + EE   RF VF+DNLKLI+++N+ E  +   G+N  +DLT +E K+
Sbjct: 44  FEKWLAKHQKAYASFEEKLHRFEVFKDNLKLIDEINR-EVTSYWLGLNEFADLTHDEFKT 102

Query: 94  R-LGL 97
             LGL
Sbjct: 103 TYLGL 107


>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
          Length = 324

 Score = 52.0 bits (123), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 21/64 (32%), Positives = 39/64 (60%)

Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
          N   +K+FE+++ ++ + Y   +E  +RF +F++N+K IE  N     + T GIN  +D+
Sbjct: 3  NDPMMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDM 62

Query: 87 TREE 90
          T+ E
Sbjct: 63 TKSE 66


>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
          Length = 451

 Score = 52.0 bits (123), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 35/60 (58%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++KSY    E  +R  +F  NL+L   + + + G+A YG+   SDLT EE ++
Sbjct: 154 FKDFLTTYNKSYANATETQRRLGIFARNLELARKVQELDRGSAEYGVTKFSDLTEEEFRT 213


>gi|15320768|ref|NP_203280.1| V-CATH [Epiphyas postvittana NPV]
 gi|37077652|sp|Q91GE3.1|CATV_NPVEP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|15213236|gb|AAK85675.1| V-CATH [Epiphyas postvittana NPV]
          Length = 323

 Score = 52.0 bits (123), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 26/70 (37%), Positives = 44/70 (62%), Gaps = 3/70 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE+F+R ++K Y ++ E  +R+ +F+ NL  I  + K  + TA Y IN  SDL+++E  +
Sbjct: 28  FEEFVRQYNKQYDSEYEKLRRYKIFQHNLNDI--ITKNRNDTAVYKINKFSDLSKDETIA 85

Query: 94  RL-GLNLSKH 102
           +  GL+L  H
Sbjct: 86  KYTGLSLPLH 95


>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
 gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
          Length = 356

 Score = 52.0 bits (123), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 26/69 (37%), Positives = 46/69 (66%), Gaps = 3/69 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI--EDLNKGEHGTATYGINHLSDLTREEM 91
           FE F+ +++K+Y +  E  KR+++F+DNL  I  ++ N  +  TATY IN  SDL++ E+
Sbjct: 56  FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKFSDLSKSEL 115

Query: 92  KSRL-GLNL 99
            ++  GL++
Sbjct: 116 IAKFTGLSI 124


>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
 gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
 gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 341

 Score = 52.0 bits (123), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 23/64 (35%), Positives = 38/64 (59%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++ E+++  F++ Y    E   RF +F +NLK +E +N   + T T  +N  SDLT EE
Sbjct: 32 VEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTDEE 91

Query: 91 MKSR 94
           K+R
Sbjct: 92 FKAR 95


>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
          Length = 340

 Score = 52.0 bits (123), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 41/68 (60%)

Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          E+P   ++ E+++ ++ K Y    E  KRF +F+DN++ IE  N  ++      +NHL+D
Sbjct: 32 ESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLAD 91

Query: 86 LTREEMKS 93
          LT +E K+
Sbjct: 92 LTLDEFKA 99


>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
 gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score = 52.0 bits (123), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 41/68 (60%)

Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          E+P   ++ E+++ ++ K Y    E  KRF +F+DN++ IE  N  ++      +NHL+D
Sbjct: 32 ESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLAD 91

Query: 86 LTREEMKS 93
          LT +E K+
Sbjct: 92 LTLDEFKA 99


>gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
          str. Neff]
          Length = 330

 Score = 52.0 bits (123), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 26/62 (41%), Positives = 37/62 (59%), Gaps = 2/62 (3%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          +QF +F   + KSY + EE  +R  +F DNL  I+ LN    G A YG+N  +DLT +E 
Sbjct: 30 QQFRQFAAQYGKSYAS-EEFGERLRIFRDNLDRIDALNSANTG-ARYGVNKFADLTPKEF 87

Query: 92 KS 93
          K+
Sbjct: 88 KA 89


>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
 gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
          Length = 457

 Score = 52.0 bits (123), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 24  KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
           K  N E L  +E+++    KSY    E  KRF +F+DNLK I++ N G + T   G+   
Sbjct: 45  KRTNKEVLTMYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHN-GLNSTYRLGLTRF 103

Query: 84  SDLTREEMKSRL 95
           +DLT EE +S+ 
Sbjct: 104 ADLTNEEYRSKF 115


>gi|426252096|ref|XP_004019754.1| PREDICTED: cathepsin F isoform 2 [Ovis aries]
          Length = 477

 Score = 51.6 bits (122), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 20/60 (33%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y ++EE + R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 180 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 239


>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
 gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
 gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score = 51.6 bits (122), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 24  KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
           K  N E L  +E+++    KSY    E  KRF +F+DNLK I++ N G + T   G+   
Sbjct: 45  KRTNKEVLTMYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHN-GLNSTYRLGLTRF 103

Query: 84  SDLTREEMKSRL 95
           +DLT EE +S+ 
Sbjct: 104 ADLTNEEYRSKF 115


>gi|417401303|gb|JAA47542.1| Putative cathepsin f [Desmodus rotundus]
          Length = 459

 Score = 51.6 bits (122), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ FI  ++++Y T+EE   R ++F +N+   +++   + GTA YG+   SDLT EE ++
Sbjct: 162 FKHFIATYNRTYETEEEAQWRMSIFINNMVRAQEIQALDRGTAQYGVTKFSDLTEEEFRT 221


>gi|195395906|ref|XP_002056575.1| GJ11017 [Drosophila virilis]
 gi|194143284|gb|EDW59687.1| GJ11017 [Drosophila virilis]
          Length = 599

 Score = 51.6 bits (122), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 26/69 (37%), Positives = 36/69 (52%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           +HL  F KF   + + Y    E   R  +F  +LK I++LN  E G+A YGI   +D+T 
Sbjct: 290 DHL--FHKFQVKYKRRYANSAEHQMRLRIFRQSLKTIQELNANEQGSAKYGITEFADMTS 347

Query: 89  EEMKSRLGL 97
            E   R GL
Sbjct: 348 TEYAQRAGL 356


>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
 gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
          Length = 422

 Score = 51.6 bits (122), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 1/68 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          K FE + ++  K+Y +KE+   RF +FE+N + ++  N   + + T  +N  +DLT  E 
Sbjct: 30 KLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEF 89

Query: 92 K-SRLGLN 98
          K SRLGL+
Sbjct: 90 KASRLGLS 97


>gi|426252094|ref|XP_004019753.1| PREDICTED: cathepsin F isoform 1 [Ovis aries]
          Length = 460

 Score = 51.6 bits (122), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 20/60 (33%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y ++EE + R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 163 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 222


>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
 gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
          Length = 415

 Score = 51.6 bits (122), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 29/76 (38%), Positives = 46/76 (60%), Gaps = 3/76 (3%)

Query: 29  EHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
           EH +  F  F   + KSY T+EE  KR+A+F++NL  I   N+  + + +  +NH  DL+
Sbjct: 113 EHFQNAFGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQGY-SYSLKMNHFGDLS 171

Query: 88  REEMKSR-LGLNLSKH 102
           REE + + LG N S++
Sbjct: 172 REEFRRKYLGYNKSRN 187


>gi|115495381|ref|NP_001068884.1| cathepsin F precursor [Bos taurus]
 gi|111304901|gb|AAI20004.1| Cathepsin F [Bos taurus]
 gi|296471599|tpg|DAA13714.1| TPA: cathepsin F [Bos taurus]
          Length = 460

 Score = 51.6 bits (122), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 20/60 (33%), Positives = 38/60 (63%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y ++EE + R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 163 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTEEEFRT 222


>gi|356530431|ref|XP_003533785.1| PREDICTED: cysteine proteinase [Glycine max]
          Length = 354

 Score = 51.6 bits (122), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 26/65 (40%), Positives = 39/65 (60%), Gaps = 2/65 (3%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F +F+  F KSY ++EE+ +R+ +F  NL+ I   NK +    T  +NH +D T EE K
Sbjct: 54  KFARFVSRFGKSYQSEEEMKERYEIFSQNLRFIRSHNK-KRLPYTLSVNHFADWTWEEFK 112

Query: 93  S-RLG 96
             RLG
Sbjct: 113 RHRLG 117


>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 336

 Score = 51.6 bits (122), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 23/62 (37%), Positives = 38/62 (61%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          ++ E+++ ++ K Y    E  KRF +F+DN++ IE  N   +     G+NHL+DLT EE 
Sbjct: 36 ERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLGVNHLADLTVEEF 95

Query: 92 KS 93
          K+
Sbjct: 96 KA 97


>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score = 51.6 bits (122), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 26/82 (31%), Positives = 46/82 (56%), Gaps = 3/82 (3%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          ++  +    +++ +N   L  +E+F   + K+Y   ++   RF +F+DNL   + L + E
Sbjct: 13 IWSALARTTQVEPDNARAL--YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEME 69

Query: 73 HGTATYGINHLSDLTREEMKSR 94
           GTA YG+   SDLT EE ++R
Sbjct: 70 QGTAQYGVTQFSDLTSEEFETR 91


>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
 gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
 gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
 gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score = 51.6 bits (122), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 38/70 (54%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           + EK++  F KSY    E  KRF +F++N++ IE  N   +      INH +DLT EE K
Sbjct: 36  KHEKWMTQFGKSYKDAAEKEKRFQIFKNNVEFIELFNAVGNKPFNLSINHFADLTNEEFK 95

Query: 93  SRLGLNLSKH 102
           + L  N   H
Sbjct: 96  ASLNGNKKLH 105


>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score = 51.6 bits (122), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 1/80 (1%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           + +E  T +   + + EK++ +  ++Y  +EE A+R  VF  N KLI+  N  E  T   
Sbjct: 29  AGDEAITVDSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRL 88

Query: 79  GINHLSDLTREEMK-SRLGL 97
             N  +DLT EE + +R GL
Sbjct: 89  ATNRFADLTDEEFRAARTGL 108


>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score = 51.6 bits (122), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 1/80 (1%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           + +E  T +   + + EK++ +  ++Y  +EE A+R  VF  N KLI+  N  E  T   
Sbjct: 29  AGDEAITVDAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRL 88

Query: 79  GINHLSDLTREEMK-SRLGL 97
             N  +DLT EE + +R GL
Sbjct: 89  ATNRFADLTDEEFRAARTGL 108


>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
          Length = 363

 Score = 51.2 bits (121), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 36/83 (43%), Positives = 44/83 (53%), Gaps = 4/83 (4%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           Q+  N E    N EH   F  F   FSKSY TKEE   RF VF+ NL +   L++    T
Sbjct: 32  QVVDNEEDHLLNAEH--HFTSFKSKFSKSYSTKEEHDYRFGVFKSNL-IKAKLHQKLDPT 88

Query: 76  ATYGINHLSDLTREEMKSR-LGL 97
           A +GI   SDLT  E + + LGL
Sbjct: 89  AEHGITKFSDLTASEFRRQFLGL 111


>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
          Length = 1785

 Score = 51.2 bits (121), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 1/86 (1%)

Query: 18   KSNNELKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA 76
            +S   LK ++  +++ QFEKF     + Y +  E   R+ +F +NL  I+ LN+ E GT 
Sbjct: 1461 RSVRSLKIDDEAYVRRQFEKFKLHHQRQYASSFEHEMRYNIFRNNLYKIDQLNRHERGTG 1520

Query: 77   TYGINHLSDLTREEMKSRLGLNLSKH 102
             YG+   +D+T  E ++  GL + K 
Sbjct: 1521 KYGVTKFADMTTAEYRAHTGLIVPKQ 1546


>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
          Length = 363

 Score = 51.2 bits (121), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 36/83 (43%), Positives = 44/83 (53%), Gaps = 4/83 (4%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           Q+  N E    N EH   F  F   FSKSY TKEE   RF VF+ NL +   L++    T
Sbjct: 32  QVVDNEEDHLLNAEH--HFTSFKSKFSKSYSTKEEHDYRFGVFKSNL-IKAKLHQKLDPT 88

Query: 76  ATYGINHLSDLTREEMKSR-LGL 97
           A +GI   SDLT  E + + LGL
Sbjct: 89  AEHGITKFSDLTASEFRRQFLGL 111


>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
          Length = 774

 Score = 51.2 bits (121), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 25/74 (33%), Positives = 47/74 (63%), Gaps = 2/74 (2%)

Query: 25  TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
           +E+ +  + F  F+  ++++Y + E    RF +F +NL  IE+L + E GT  YG+N  +
Sbjct: 461 SEDMKAERLFNNFMTTYNRTYSSLER-NLRFKIFRENLNFIEELRETEQGTGIYGVNMFA 519

Query: 85  DLTREEMKSR-LGL 97
           D++++E ++R LGL
Sbjct: 520 DMSQKEFRTRYLGL 533


>gi|197258084|gb|ACH56226.1| cathepsin L-like cysteine proteinase [Radopholus similis]
          Length = 417

 Score = 51.2 bits (121), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 28/73 (38%), Positives = 42/73 (57%), Gaps = 3/73 (4%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA---TYGINH 82
           E PE +++F++  R FS+ + ++ E  +RF +FE NL  I  LN     T    TYG+N 
Sbjct: 89  ELPEVVREFDQIQRTFSREWNSERERWERFKLFERNLAEIARLNAEAKRTGRNMTYGVNG 148

Query: 83  LSDLTREEMKSRL 95
           ++D T EEM   L
Sbjct: 149 MADWTEEEMGRML 161


>gi|17978639|gb|AAL48318.1| berghepain-2 [Plasmodium berghei]
          Length = 468

 Score = 51.2 bits (121), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 1/68 (1%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  F  F+++++K Y + EE+ +RF +F +NLK IE  NK  H   T GIN  SD+
Sbjct: 145 NLESVNIFYNFMKEYNKQYNSAEEIQERFYIFSENLKKIEKHNKENH-LYTKGINAFSDM 203

Query: 87  TREEMKSR 94
             EE K +
Sbjct: 204 RHEEFKMK 211


>gi|441611591|ref|XP_003273955.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Nomascus leucogenys]
          Length = 548

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 257 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 316


>gi|426369382|ref|XP_004051670.1| PREDICTED: cathepsin F [Gorilla gorilla gorilla]
          Length = 517

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 220 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 279


>gi|54696066|gb|AAV38405.1| cathepsin F [synthetic construct]
          Length = 485

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246


>gi|6467382|gb|AAF13146.1|AF136279_1 cathepsin F precursor [Homo sapiens]
          Length = 484

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246


>gi|9630063|ref|NP_046281.1| cathepsin [Orgyia pseudotsugata MNPV]
 gi|2499880|sp|O10364.1|CATV_NPVOP RecName: Full=Viral cathepsin; Short=V-cath; AltName:
          Full=Cysteine proteinase; Short=CP; Flags: Precursor
 gi|7435821|pir||T10394 cathepsin - Orgyia pseudotsugata nuclear polyhedrosis virus
 gi|1911371|gb|AAC59124.1| cathepsin [Orgyia pseudotsugata MNPV]
          Length = 324

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 28/67 (41%), Positives = 43/67 (64%), Gaps = 2/67 (2%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE F+  F+K+Y ++ E   RF +F+ NL+ I + N+ +  TA Y IN  SDL++EE  S
Sbjct: 28 FEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKNQND-STAQYEINKFSDLSKEEAIS 86

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 87 KYTGLSL 93


>gi|6042196|ref|NP_003784.2| cathepsin F precursor [Homo sapiens]
 gi|12643325|sp|Q9UBX1.1|CATF_HUMAN RecName: Full=Cathepsin F; Short=CATSF; Flags: Precursor
 gi|4731642|gb|AAD26616.2|AF088886_1 cathepsin F precursor [Homo sapiens]
 gi|5305722|gb|AAD41790.1|AF132894_1 cathepsin F [Homo sapiens]
 gi|4826528|emb|CAB42883.1| cysteine proteinase [Homo sapiens]
 gi|15079738|gb|AAH11682.1| Cathepsin F [Homo sapiens]
 gi|22209085|gb|AAH36451.1| Cathepsin F [Homo sapiens]
 gi|61363874|gb|AAX42458.1| cathepsin F [synthetic construct]
 gi|123993139|gb|ABM84171.1| cathepsin F [synthetic construct]
 gi|189053904|dbj|BAG36411.1| unnamed protein product [Homo sapiens]
          Length = 484

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246


>gi|410045434|ref|XP_003313198.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pan troglodytes]
          Length = 548

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 251 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 310


>gi|402892718|ref|XP_003909556.1| PREDICTED: cathepsin F [Papio anubis]
          Length = 460

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 163 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 222


>gi|397517049|ref|XP_003828732.1| PREDICTED: cathepsin F [Pan paniscus]
          Length = 379

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 82  FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 141


>gi|355566270|gb|EHH22649.1| Cathepsin F [Macaca mulatta]
          Length = 484

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246


>gi|440907378|gb|ELR57532.1| Cathepsin W [Bos grunniens mutus]
          Length = 382

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
          P  LK+ F  F   +++SYP   E A+R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35 PLELKEVFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDL 94

Query: 87 TREE 90
          T EE
Sbjct: 95 TEEE 98


>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
          Length = 378

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 24/73 (32%), Positives = 43/73 (58%)

Query: 21  NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
           N  +  N +    +E ++ +  KSY + +E   RF +F+DNL++I+D N   + + + G+
Sbjct: 29  NSAQRTNDQVRDMYESWLVEQGKSYNSLDEKEMRFEIFKDNLRIIDDHNADANRSFSLGL 88

Query: 81  NHLSDLTREEMKS 93
           N  +DLT EE +S
Sbjct: 89  NRFADLTDEEYRS 101


>gi|68076993|ref|XP_680416.1| falcipain 2 precursor [Plasmodium berghei strain ANKA]
 gi|56501341|emb|CAI05700.1| falcipain 2 precursor, putative [Plasmodium berghei]
          Length = 470

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 1/68 (1%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  F  F+++++K Y + EE+ +RF +F +NLK IE  NK  H   T GIN  SD+
Sbjct: 147 NLESVNIFYNFMKEYNKQYNSAEEIQERFYIFSENLKKIEKHNKENH-LYTKGINAFSDM 205

Query: 87  TREEMKSR 94
             EE K +
Sbjct: 206 RHEEFKMK 213


>gi|228861649|ref|YP_002854669.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
 gi|226425097|gb|ACO53509.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
          Length = 334

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 28/67 (41%), Positives = 43/67 (64%), Gaps = 2/67 (2%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE F+ +++K+Y    E  KR+ +F+DNL+ I + NK  + TA Y IN  SDL+  E+ S
Sbjct: 37  FELFVANYNKNYTDPLEKTKRYHIFKDNLEEINNKNK-SNDTAVYRINKFSDLSTNELIS 95

Query: 94  RL-GLNL 99
           +  GLN+
Sbjct: 96  KYTGLNV 102


>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
 gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
 gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
 gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
          Length = 358

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 28/68 (41%), Positives = 45/68 (66%), Gaps = 2/68 (2%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           ++ FEK++  + K+Y + EE  +RF VF+DNL  I+D+NK +  +   G+N  +DLT +E
Sbjct: 48  IELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINK-KVTSYWLGLNEFADLTHDE 106

Query: 91  MKSR-LGL 97
            K+  LGL
Sbjct: 107 FKATYLGL 114


>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 1/75 (1%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           S +  +T++ E +  +E ++    KSY    E  KRF +F+DNL+ I++ N  E+ +   
Sbjct: 36  SKSSWRTDD-EVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKV 94

Query: 79  GINHLSDLTREEMKS 93
           G+N  +DLT EE +S
Sbjct: 95  GLNRFADLTNEEYRS 109


>gi|27819101|gb|AAO23117.1| cysteine proteinase [Bombyx mori NPV]
          Length = 323

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K+Y ++ E  +RF +F+ NL   E +NK ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>gi|9630927|ref|NP_047524.1| Cystein Protease [Bombyx mori NPV]
 gi|1168798|sp|P41721.1|CATV_NPVBM RecName: Full=Viral cathepsin; Short=V-cath; AltName:
          Full=Cysteine proteinase; Short=CP; Flags: Precursor
 gi|540066|gb|AAB49542.1| cysteine protease [Bombyx mori NPV]
 gi|3745946|gb|AAC63793.1| Cystein Protease [Bombyx mori NPV]
          Length = 323

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K+Y ++ E  +RF +F+ NL   E +NK ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>gi|309752918|gb|ADO85436.1| cathepsin [Pieris rapae granulovirus]
          Length = 339

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 32/89 (35%), Positives = 50/89 (56%), Gaps = 6/89 (6%)

Query: 6  SAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
          S   TL   G+  + N    EN +++  FE FI+ ++KSY T +E A ++  F++NLK+I
Sbjct: 13 SVMLTLCHLGETVTYN---LENSDNI--FEDFIKKYNKSYATDQERAIKYENFKNNLKMI 67

Query: 66 EDLNKGEHGTATYGINHLSDLTREEMKSR 94
           D N G    A + IN  SDL + ++  R
Sbjct: 68 NDKNNGSK-DAVFDINAFSDLNKNDLLRR 95


>gi|393717160|gb|AFN21082.1| V-Cath [Bombyx mori NPV]
 gi|393717442|gb|AFN21362.1| V-Cath [Bombyx mori NPV]
          Length = 323

 Score = 50.8 bits (120), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K+Y ++ E  +RF +F+ NL   E +NK ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>gi|393717301|gb|AFN21222.1| V-Cath [Bombyx mori NPV]
          Length = 323

 Score = 50.8 bits (120), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K+Y ++ E  +RF +F+ NL   E +NK ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>gi|328711164|ref|XP_003244460.1| PREDICTED: cathepsin O-like [Acyrthosiphon pisum]
          Length = 339

 Score = 50.8 bits (120), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 42/66 (63%), Gaps = 1/66 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          +F KFI+ ++KSY  + E  KRF  F+ +LK I+ L++  +G   YGI   SDL+ EE  
Sbjct: 35 KFNKFIKMYNKSYMNETEHNKRFEHFKKSLKTIQLLSQKCNGCTNYGITEFSDLSTEEF- 93

Query: 93 SRLGLN 98
          +++ LN
Sbjct: 94 TKIYLN 99


>gi|47779249|gb|AAT38521.1| cysteine protease [Bombyx mori NPV]
          Length = 323

 Score = 50.8 bits (120), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K+Y ++ E  +RF +F+ NL   E +NK ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>gi|237643659|ref|YP_002884349.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
 gi|229358205|gb|ACQ57300.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
          Length = 323

 Score = 50.8 bits (120), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K+Y ++ E  +RF +F+ NL   E +NK ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>gi|288804650|ref|YP_003429335.1| cathepsin [Pieris rapae granulovirus]
 gi|270161225|gb|ACZ63497.1| cathepsin [Pieris rapae granulovirus]
          Length = 339

 Score = 50.8 bits (120), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 32/89 (35%), Positives = 50/89 (56%), Gaps = 6/89 (6%)

Query: 6  SAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
          S   TL   G+  + N    EN +++  FE FI+ ++KSY T +E A ++  F++NLK+I
Sbjct: 13 SVMLTLCHLGETVTYN---LENSDNI--FEDFIKKYNKSYATDQERAIKYENFKNNLKMI 67

Query: 66 EDLNKGEHGTATYGINHLSDLTREEMKSR 94
           D N G    A + IN  SDL + ++  R
Sbjct: 68 NDKNNGSK-YAVFDINAFSDLNKNDLLRR 95


>gi|393660044|gb|AFN09033.1| V-Cath [Bombyx mori NPV]
          Length = 323

 Score = 50.8 bits (120), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K+Y ++ E  +RF +F+ NL   E +NK ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>gi|55735421|gb|AAV59468.1| cathepsin [Bombyx mori NPV]
          Length = 323

 Score = 50.8 bits (120), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 45/67 (67%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K+Y ++ E  +RF +F+ NL   E +NK ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKNYSSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>gi|355751926|gb|EHH56046.1| Cathepsin F, partial [Macaca fascicularis]
          Length = 381

 Score = 50.8 bits (120), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 84  FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 143


>gi|119594953|gb|EAW74547.1| cathepsin F, isoform CRA_a [Homo sapiens]
 gi|119594954|gb|EAW74548.1| cathepsin F, isoform CRA_a [Homo sapiens]
          Length = 392

 Score = 50.8 bits (120), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 95  FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 154


>gi|432091081|gb|ELK24293.1| Cathepsin F, partial [Myotis davidii]
          Length = 410

 Score = 50.8 bits (120), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 22/60 (36%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ FI  ++++Y T+EE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 113 FKYFITTYNRTYETEEEAQWRMSVFINNMIRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 172


>gi|49456321|emb|CAG46481.1| CTSF [Homo sapiens]
          Length = 338

 Score = 50.8 bits (120), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 41  FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 100


>gi|395742406|ref|XP_003777749.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pongo abelii]
          Length = 490

 Score = 50.8 bits (120), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 20/60 (33%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R ++F +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 193 FKNFVITYNRTYESKEEARWRLSIFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 252


>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
 gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
          Length = 358

 Score = 50.8 bits (120), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 38/64 (59%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N   + +F ++   +++SYPT EE  +RF V+  N++ IE  N+  + T T G N  +DL
Sbjct: 50  NKLMMDRFLRWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRAGNLTYTLGENQFADL 109

Query: 87  TREE 90
           T EE
Sbjct: 110 TEEE 113


>gi|145531433|ref|XP_001451483.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419138|emb|CAK84086.1| unnamed protein product [Paramecium tetraurelia]
          Length = 314

 Score = 50.8 bits (120), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 29/71 (40%), Positives = 39/71 (54%), Gaps = 4/71 (5%)

Query: 28 PEHLK---QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
          PE L    QF+K+   F K Y T  E A RF V++D +K I+ LN  E+ T  +G    +
Sbjct: 23 PESLDLRVQFDKYTNQFGKFY-TPAERAYRFQVYQDAMKQIQILNSEENSTTVFGETQFT 81

Query: 85 DLTREEMKSRL 95
          DLT EE  + L
Sbjct: 82 DLTNEEFAALL 92


>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 341

 Score = 50.8 bits (120), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 23/64 (35%), Positives = 36/64 (56%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++ E+++  F + Y    E   RF +F+ NLK +E  N   + T T  +N  SDLT EE
Sbjct: 32 IEKHEQWMSRFHRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNKTYTLDVNEFSDLTDEE 91

Query: 91 MKSR 94
           K+R
Sbjct: 92 FKAR 95


>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
          Length = 379

 Score = 50.8 bits (120), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/67 (35%), Positives = 39/67 (58%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  FE ++ ++ KSY    E  +RF +F+DNL+ +++ N   + +   G+N  SDL
Sbjct: 41  NDEVMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDL 100

Query: 87  TREEMKS 93
           T EE  S
Sbjct: 101 TLEEYSS 107


>gi|3916212|gb|AAC78838.1| cathepsin F [Homo sapiens]
          Length = 338

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 41  FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 100


>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
          Length = 473

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/65 (36%), Positives = 38/65 (58%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E ++ +E ++    K+Y    E  KRFA+F+DNL+ I+  N  +  T   G+N  +DLT 
Sbjct: 48  EVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFKVGLNKFADLTN 107

Query: 89  EEMKS 93
           EE +S
Sbjct: 108 EEFRS 112


>gi|341888719|gb|EGT44654.1| hypothetical protein CAEBREN_19265 [Caenorhabditis brenneri]
          Length = 396

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 40/64 (62%), Gaps = 1/64 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           +QF+ F + F + + + EE   RF VF+ NL+ IE+LN  ++ +  YGIN  SD T  E+
Sbjct: 86  QQFKDFNKKFGREHKSLEEYKMRFEVFQKNLRDIEELNL-KNPSVQYGINRFSDKTESEL 144

Query: 92  KSRL 95
           K+ L
Sbjct: 145 KNLL 148


>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
 gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
          Length = 2676

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/74 (35%), Positives = 46/74 (62%), Gaps = 4/74 (5%)

Query: 29   EHLKQFEKFIRDFSKSY-PTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
            EHL  F +F+  +   Y   + ++ +RF +F++N++ + +LN  E GTATYG+   +DLT
Sbjct: 2368 EHL--FYEFLSTYKPEYIDDRHQMRQRFEIFKENVRKMHELNTHERGTATYGVTRFADLT 2425

Query: 88   REEMKSR-LGLNLS 100
             EE  ++ +G+  S
Sbjct: 2426 YEEFSTKHMGMKAS 2439


>gi|96979798|ref|YP_611001.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
 gi|37077647|sp|Q91CL9.1|CATV_NPVAP RecName: Full=Viral cathepsin; Short=V-cath; AltName:
          Full=Cysteine proteinase; Short=CP; Flags: Precursor
 gi|16041073|dbj|BAB69773.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
 gi|94983331|gb|ABF50271.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
 gi|146229694|gb|ABQ12259.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
          Length = 324

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 28/68 (41%), Positives = 46/68 (67%), Gaps = 4/68 (5%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT-ATYGINHLSDLTREEMK 92
          FE+F+  F+K+Y ++ E  +RF +F+ NL+  E +NK ++ T A Y IN  SDL+++E  
Sbjct: 28 FEEFLHKFNKNYSSESEKLRRFKIFQHNLE--EIINKNQNDTSAQYEINKFSDLSKDETI 85

Query: 93 SRL-GLNL 99
          S+  GL+L
Sbjct: 86 SKYTGLSL 93


>gi|118350036|ref|XP_001008299.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila]
 gi|89290066|gb|EAR88054.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila SB210]
          Length = 332

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 7/63 (11%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          +++++F++  S +Y T EE   RFAVF DNLK IE       G + YGI    DLT EE 
Sbjct: 41 QKWQEFLKKHSITYKTIEEKLHRFAVFRDNLKKIE-------GHSNYGITKFMDLTSEEF 93

Query: 92 KSR 94
          + R
Sbjct: 94 QQR 96


>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
          Length = 340

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 22/69 (31%), Positives = 40/69 (57%)

Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
          +++   + + E+++  +S+ Y    E A+RF VF+ N+K IE  N G +     G+N  +
Sbjct: 28 SDDSAMVARHEQWMAQYSRVYKDASEKARRFEVFKANVKFIESFNAGGNNKFWLGVNQFA 87

Query: 85 DLTREEMKS 93
          DLT +E +S
Sbjct: 88 DLTNDEFRS 96


>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 9/101 (8%)

Query: 5   ASAEATLALFGQMKSNNELKTENPEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAV 57
           A + ATL+L      +  +   +PE L+        FE +I +F K+Y T EE   RF V
Sbjct: 15  ALSAATLSLSVAASHDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKLLRFEV 74

Query: 58  FEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS-RLGL 97
           F+DNLK I++ NK +  +   G+N  +DL+ EE K   LGL
Sbjct: 75  FKDNLKHIDETNK-KVKSYWLGLNEFADLSHEEFKKMYLGL 114


>gi|357619725|gb|EHJ72184.1| hypothetical protein KGM_03271 [Danaus plexippus]
          Length = 338

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/57 (45%), Positives = 38/57 (66%), Gaps = 2/57 (3%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          FE+FI+D++K Y   E+  +RF +F +NLK I  +N+     A YGIN  SDL++EE
Sbjct: 41 FEQFIKDYNKEYDESEK-EERFKIFVNNLKDINAMNE-RSSNAVYGINKFSDLSKEE 95


>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
 gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 25/66 (37%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
          FE + ++  KSY ++EE + R  VFEDN   +   N   + + +  +N  +DLT  E K 
Sbjct: 29 FETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFKT 88

Query: 93 SRLGLN 98
          SRLGL+
Sbjct: 89 SRLGLS 94


>gi|11359985|pir||T46294 hypothetical protein DKFZp434F0610.1 - human (fragment)
 gi|6808322|emb|CAB70900.1| hypothetical protein [Homo sapiens]
          Length = 308

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 27 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 86


>gi|351724281|ref|NP_001237820.1| cysteine protease-like precursor [Glycine max]
 gi|149393486|gb|ABR26679.1| putative cysteine protease [Glycine max]
          Length = 355

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/65 (40%), Positives = 38/65 (58%), Gaps = 2/65 (3%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F +F+  F KSY ++EE+ +R+ +F  NL+ I   NK      T  +NH +D T EE K
Sbjct: 54  KFARFMSRFGKSYRSEEEMRERYEIFSQNLRFIRSHNKNRL-PYTLSVNHFADWTWEEFK 112

Query: 93  S-RLG 96
             RLG
Sbjct: 113 RHRLG 117


>gi|390470786|ref|XP_003734355.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin W [Callithrix jacchus]
          Length = 373

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 28/70 (40%), Positives = 39/70 (55%), Gaps = 1/70 (1%)

Query: 28  PEHLKQFEKFIR-DFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+  KF +  F++SY T EE A+R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKEAFKFFQIQFNRSYLTPEEHARRLDIFAHNLVQAQRLQEEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|339244639|ref|XP_003378245.1| cathepsin F [Trichinella spiralis]
 gi|316972864|gb|EFV56510.1| cathepsin F [Trichinella spiralis]
          Length = 366

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/78 (30%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 22  ELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
           E+  +     + F++F+ +F+K Y T++  A+++ +F+ N+ + + L + E GTA YG  
Sbjct: 54  EMNAKEARSWENFKQFMVEFNKWYETEKLTAEKYNIFKSNMVIAKRLQEEEQGTAIYGPT 113

Query: 82  HLSDLTREEM-KSRLGLN 98
             +D+T EE  K+ L  N
Sbjct: 114 IFADMTPEEFRKTHLNFN 131


>gi|9627870|ref|NP_054157.1| viral cathepsin-like protein [Autographa californica
          nucleopolyhedrovirus]
 gi|114680178|ref|YP_758591.1| viral cathepsin [Plutella xylostella multiple
          nucleopolyhedrovirus]
 gi|115751|sp|P25783.1|CATV_NPVAC RecName: Full=Viral cathepsin; Short=V-cath; AltName:
          Full=Cysteine proteinase; Short=CP; Flags: Precursor
 gi|332491|gb|AAA46752.1| viral cathepsin [Autographa californica nucleopolyhedrovirus]
 gi|559196|gb|AAA66757.1| viral cathepsin-like protein [Autographa californica
          nucleopolyhedrovirus]
 gi|113015253|gb|ABE68510.1| viral cathepsin [Plutella xylostella multiple
          nucleopolyhedrovirus]
          Length = 323

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 44/67 (65%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K Y ++ E  +RF +F+ NL   E +NK ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKDYGSEVEKLRRFKIFQHNLN--EIINKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>gi|403293601|ref|XP_003937801.1| PREDICTED: cathepsin F [Saimiri boliviensis boliviensis]
          Length = 379

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 20/60 (33%), Positives = 35/60 (58%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  F+  ++++Y +KEE   R ++F  N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 82  FRNFVITYNRTYESKEEAQWRLSIFAHNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 141


>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 427

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 23/63 (36%), Positives = 39/63 (61%), Gaps = 1/63 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +FE+++    ++Y    E  +RF V+++NL LIE+ N G HG  T   N  +DLT EE +
Sbjct: 118 RFEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNSGGHGY-TLTDNKFADLTNEEFR 176

Query: 93  SRL 95
           +++
Sbjct: 177 AKM 179


>gi|3941390|gb|AAC82352.1| group 1 allergen Eur m 1 0102 [Euroglyphus maynei]
          Length = 327

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 32/73 (43%), Positives = 46/73 (63%), Gaps = 12/73 (16%)

Query: 28 PEHLKQFEKFIRDFSKSY--PTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          P  +K FE+F + F+KSY  P KEEVA++   F ++LK +E  NKG        INHLSD
Sbjct: 26 PASIKTFEEFKKAFNKSYATPEKEEVARK--NFLESLKYVES-NKG-------AINHLSD 75

Query: 86 LTREEMKSRLGLN 98
          L+ +E K++  +N
Sbjct: 76 LSLDEFKNQFLMN 88


>gi|158284547|ref|XP_307325.4| Anopheles gambiae str. PEST AGAP012577-PA [Anopheles gambiae str.
           PEST]
 gi|157021017|gb|EAA03137.4| AGAP012577-PA [Anopheles gambiae str. PEST]
          Length = 547

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 32/90 (35%), Positives = 45/90 (50%), Gaps = 4/90 (4%)

Query: 12  ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           A F  M+     ++E  EHL  +F +F     KSY +  E  +R  VF  NL+ I   N+
Sbjct: 223 ATFNPMQEFVHPRSE--EHLHNEFGRFKNKHGKSYASPLEHERRLNVFRQNLRFIHSHNR 280

Query: 71  GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
              G  T  +NHL+D T EE+K+  G   S
Sbjct: 281 ANRGF-TVAVNHLADRTEEELKALRGFRSS 309


>gi|341886805|gb|EGT42740.1| hypothetical protein CAEBREN_23878 [Caenorhabditis brenneri]
          Length = 396

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 40/64 (62%), Gaps = 1/64 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           +QF+ F + F + + + EE   RF VF+ NL+  E+LN+ ++ +  YGIN  SD T  E+
Sbjct: 86  QQFKDFNKKFGREHKSLEEYKMRFEVFQKNLREFEELNQ-KNPSVQYGINKFSDKTESEL 144

Query: 92  KSRL 95
           K+ L
Sbjct: 145 KNLL 148


>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
          Length = 352

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 20/71 (28%), Positives = 42/71 (59%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +K+FE+++ ++ + Y   +E  +RF +F++N+  IE  N     + T GIN  +D+T+ E
Sbjct: 34  MKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSHNGNSYTLGINQFTDMTKSE 93

Query: 91  MKSRLGLNLSK 101
             ++    +S+
Sbjct: 94  FVAQYTGGISR 104


>gi|326431661|gb|EGD77231.1| cysteine protease [Salpingoeca sp. ATCC 50818]
          Length = 347

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 26/62 (41%), Positives = 39/62 (62%), Gaps = 3/62 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
          FE+F   ++K Y + EE A+R A+F+++L  IE  N +   G  TY  G+N  +DLTREE
Sbjct: 31 FEEFKDKYNKVYESAEEEARRAAIFQESLDFIEKHNAEAAAGMHTYLVGVNEFADLTREE 90

Query: 91 MK 92
           +
Sbjct: 91 FR 92


>gi|296218871|ref|XP_002755611.1| PREDICTED: cathepsin F [Callithrix jacchus]
          Length = 489

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 35/60 (58%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  F+  ++++Y +KEE   R +VF  N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 193 FRNFVITYNRTYESKEEAQWRLSVFVHNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 252


>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
          Length = 472

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 22/62 (35%), Positives = 32/62 (51%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  FI+ F + Y +  E   RF  +  NL  +E L   E GTA YG+   SD++ EE + 
Sbjct: 170 FLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSDMSPEEFQK 229

Query: 94  RL 95
            +
Sbjct: 230 TM 231


>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
          Length = 437

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 22/62 (35%), Positives = 32/62 (51%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  FI+ F + Y +  E   RF  +  NL  +E L   E GTA YG+   SD++ EE + 
Sbjct: 135 FLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSDMSPEEFQK 194

Query: 94  RL 95
            +
Sbjct: 195 TM 196


>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 422

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 27/72 (37%), Positives = 43/72 (59%), Gaps = 3/72 (4%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEH-GTATY--GINHLSDLTRE 89
           +F++++    K+Y   +E AKR A+F DN + +   N+    G  ++   +NHL+DLTRE
Sbjct: 69  RFDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLADLTRE 128

Query: 90  EMKSRLGLNLSK 101
           E K  LG + SK
Sbjct: 129 EFKHMLGYDASK 140


>gi|195494228|ref|XP_002094747.1| GE21992 [Drosophila yakuba]
 gi|194180848|gb|EDW94459.1| GE21992 [Drosophila yakuba]
          Length = 549

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 27/69 (39%), Positives = 39/69 (56%), Gaps = 2/69 (2%)

Query: 29  EHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
           EH+ K F  F R    +YP++ E   R  +F  NL+ I   N+ +  T T  +NHL+D T
Sbjct: 239 EHVDKAFHHFKRKHGVAYPSETEHEHRKNIFRQNLRYIHSKNRAKL-TYTLAVNHLADKT 297

Query: 88  REEMKSRLG 96
            EE+K+R G
Sbjct: 298 EEELKARRG 306


>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
 gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
 gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
 gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
 gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
          Length = 365

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/71 (42%), Positives = 45/71 (63%), Gaps = 2/71 (2%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           ++ FEKF+  + K+Y + EE  +RF VF+DNL  I++ NK   G    G+N  +DLT +E
Sbjct: 49  MELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKKITGY-WLGLNEFADLTHDE 107

Query: 91  MKSR-LGLNLS 100
            K+  LGL L+
Sbjct: 108 FKAAYLGLTLT 118


>gi|397133545|gb|AFO10079.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus S2]
          Length = 323

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 44/67 (65%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K Y ++ E  +RF +F+ NL   E +NK ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKDYGSEVEKLRRFKIFQHNLN--EIINKDQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 457

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 30/74 (40%), Positives = 45/74 (60%), Gaps = 2/74 (2%)

Query: 25  TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
           + N   ++ FEK++    K+Y + EE   RF VF+DNLK I+ +N+ E  +   G+N  +
Sbjct: 141 SSNDRIIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNR-EVTSYWLGLNEFA 199

Query: 85  DLTREEMKSR-LGL 97
           DLT EE K+  LGL
Sbjct: 200 DLTHEEFKATYLGL 213


>gi|324514421|gb|ADY45863.1| Viral cathepsin [Ascaris suum]
          Length = 399

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 27/93 (29%), Positives = 46/93 (49%), Gaps = 4/93 (4%)

Query: 1   MAEDASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFED 60
           ++ D SA +   +   M    EL  + P ++  F KF++++ + Y + +E   RF  F  
Sbjct: 69  LSSDPSAGSLETILADM---GELSNDYPIYIDSFVKFMQEYDRQYSSNDETRLRFRNFVR 125

Query: 61  NLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           N+K I+   KG      +GI   +D +  EMKS
Sbjct: 126 NMKFIKKAQKGRD-NVVFGITRFTDWSEAEMKS 157


>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
          Length = 360

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H   F +F   + K Y + EE+ +RF VF DNLK+I   NK +  +   G+N  +DLT +
Sbjct: 57  HALSFARFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNK-KGLSYKLGVNEFTDLTWD 115

Query: 90  EM-KSRLG 96
           E  + RLG
Sbjct: 116 EFRRDRLG 123


>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
 gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
          Length = 323

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 28/74 (37%), Positives = 42/74 (56%), Gaps = 4/74 (5%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
           ++E F   F K Y   EE + R +VF D LK I++ N + + G  TY   IN+ SDLT E
Sbjct: 19  EWENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHE 78

Query: 90  E-MKSRLGLNLSKH 102
           E + ++ G+   +H
Sbjct: 79  EVLATKTGMTRRRH 92


>gi|171460937|ref|NP_001116343.1| cathepsin W precursor [Felis catus]
 gi|6165261|emb|CAB59816.1| cysteine protease [Felis catus]
          Length = 344

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LKQ F  F   +++SY   EE A+R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKQAFTLFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEEEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGRLYG 104


>gi|22653681|sp|Q9TST1.2|CATW_FELCA RecName: Full=Cathepsin W; Flags: Precursor
          Length = 374

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LKQ F  F   +++SY   EE A+R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKQAFTLFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEEEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGRLYG 104


>gi|3273233|dbj|BAA31161.1| tetrain [Tetrahymena pyriformis]
          Length = 330

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 29/75 (38%), Positives = 40/75 (53%), Gaps = 5/75 (6%)

Query: 25 TENPE---HLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
          T NP    HL+   F+KF R+F  +Y  + E + R +VF +NLK IE  N     T    
Sbjct: 22 TRNPNADGHLEHYAFQKFKRNFGVTYKNQGEESYRLSVFLENLKSIEANNANPLSTHVEE 81

Query: 80 INHLSDLTREEMKSR 94
          +N  +DLT EE  +R
Sbjct: 82 VNSFTDLTEEEFAAR 96


>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
          Length = 803

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 26/56 (46%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 43  KSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGL 97
           +SY T EE+ KRF +F  N+K  + L K E GTA YG+   SD++ +E K   LGL
Sbjct: 509 RSYKTTEELKKRFRIFRANMKKADYLQKTEQGTAKYGVTIFSDISSKEFKKHYLGL 564


>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
          Length = 359

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 23/73 (31%), Positives = 44/73 (60%), Gaps = 1/73 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA-TYGINHLSDLTRE 89
           L++F+ +  +++++Y T EE  +RF V+ +NL+ I+ +N+   G++   G N  +DLT E
Sbjct: 37  LERFKAWQAEYNRTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEE 96

Query: 90  EMKSRLGLNLSKH 102
           E K    + L + 
Sbjct: 97  EFKDTYLMKLDEQ 109


>gi|440798492|gb|ELR19560.1| papain family cysteine protease containing protein [Acanthamoeba
          castellanii str. Neff]
          Length = 385

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 21/65 (32%), Positives = 39/65 (60%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F++++ +F+K+Y + +EV  R A+FE  L  I+  N+    +   G+N L+D +  E++ 
Sbjct: 35 FDRYVVEFNKAYASDDEVVSRRAIFESRLAAIKAHNRDASKSWKQGVNQLTDRSEAEIRQ 94

Query: 94 RLGLN 98
           LG N
Sbjct: 95 LLGYN 99


>gi|3916214|gb|AAC78839.1| cathepsin F [Homo sapiens]
          Length = 302

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 37/60 (61%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F+ F+  ++++Y +KEE   R +VF +N+   + +   + GTA YG+   SDLT EE ++
Sbjct: 5  FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 64


>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
          Length = 361

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 23/73 (31%), Positives = 44/73 (60%), Gaps = 1/73 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA-TYGINHLSDLTRE 89
           L++F+ +  +++++Y T EE  +RF V+ +NL+ I+ +N+   G++   G N  +DLT E
Sbjct: 37  LERFKAWQAEYNRTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEE 96

Query: 90  EMKSRLGLNLSKH 102
           E K    + L + 
Sbjct: 97  EFKDTYLMKLDEQ 109


>gi|348565006|ref|XP_003468295.1| PREDICTED: cathepsin W-like [Cavia porcellus]
          Length = 375

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F++SY  + E A+R  +F  NL   + L + E GTA +G+   SDL
Sbjct: 35  PLELKEVFKLFQIQFNRSYSNQAEYARRLDIFVHNLATAQRLQEEELGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
          Length = 340

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 39/68 (57%)

Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          E+   + + E+++  +S+ Y    E A+RF VF+ N+K IE  N G +     GIN  +D
Sbjct: 29 EDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKFIESFNTGGNRKFWLGINQFAD 88

Query: 86 LTREEMKS 93
          LT +E ++
Sbjct: 89 LTNDEFRT 96


>gi|341888721|gb|EGT44656.1| hypothetical protein CAEBREN_22029 [Caenorhabditis brenneri]
          Length = 396

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 39/64 (60%), Gaps = 1/64 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           +QF+ F   F + + T EE   RF +F+ NL+ IE+LN  ++ +  YGIN  SD T  E+
Sbjct: 86  QQFKDFNAKFQREHKTLEEYKMRFEIFQKNLRDIEELNL-KNPSVQYGINKFSDKTESEL 144

Query: 92  KSRL 95
           K+ L
Sbjct: 145 KNLL 148


>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
          Length = 335

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 4/86 (4%)

Query: 13 LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
          L  Q+  +NE    N EH   F  F   FSK+Y TKEE   RF VF+ N++  + L+   
Sbjct: 3  LIRQVVDDNEDHVLNAEH--HFSTFKSKFSKTYATKEEHDYRFGVFKSNVRRAK-LHAKL 59

Query: 73 HGTATYGINHLSDLTREEMKSR-LGL 97
            +A +G+   SDLT  E + + LGL
Sbjct: 60 DPSAVHGVTKFSDLTPSEFRRQFLGL 85


>gi|341893155|gb|EGT49090.1| hypothetical protein CAEBREN_13400 [Caenorhabditis brenneri]
          Length = 372

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 33/98 (33%), Positives = 47/98 (47%), Gaps = 3/98 (3%)

Query: 1  MAEDASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFED 60
          M  ++S    L  F  + +   L   + E LK FEKF     K Y T EE  KR   F  
Sbjct: 1  MVPNSSCSLILFAFFSIFAEYSLAQHSQEVLKNFEKFQTHHKKHYRTAEEKKKRLGHFAK 60

Query: 61 NLKLIEDLN---KGEHGTATYGINHLSDLTREEMKSRL 95
          N + I++LN   K      T+G+N  +D+ +EE  +RL
Sbjct: 61 NHQRIKELNEEAKKAGRNVTFGLNKFADMPKEERHARL 98


>gi|195111686|ref|XP_002000409.1| GI10216 [Drosophila mojavensis]
 gi|193917003|gb|EDW15870.1| GI10216 [Drosophila mojavensis]
          Length = 605

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 25/69 (36%), Positives = 35/69 (50%), Gaps = 2/69 (2%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           +HL  F  F   + + Y    E   R  +F  NL+ I++LN  E G+A YGI   +D+T 
Sbjct: 296 DHL--FHVFQIKYKRRYANSMEHQMRLRIFRQNLRTIQELNDNEQGSAKYGITEFADMTS 353

Query: 89  EEMKSRLGL 97
            E   R GL
Sbjct: 354 SEYTQRAGL 362


>gi|165969032|ref|YP_001650932.1| peptidase [Orgyia leucostigma NPV]
 gi|164663528|gb|ABY65748.1| peptidase [Orgyia leucostigma NPV]
          Length = 328

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 42/66 (63%), Gaps = 2/66 (3%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE F+ ++ K+Y    E +KR+ +F+DNL+ I   N+  + TA Y IN  SDL++ E+ S
Sbjct: 29 FESFVANYQKNYNDDLEKSKRYTIFKDNLEEINVKNR-LNDTAVYRINKFSDLSKTEIIS 87

Query: 94 RL-GLN 98
          +  GLN
Sbjct: 88 KYTGLN 93


>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
 gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
          Length = 349

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 33/79 (41%), Positives = 45/79 (56%), Gaps = 9/79 (11%)

Query: 27  NPEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
           +PEHL         FE +I    K+Y + EE   RF VF++NLK I+  NK E  +   G
Sbjct: 33  SPEHLTSVDKLVELFESWISGHGKAYNSLEEKLHRFEVFKENLKHIDQRNK-EVTSYWLG 91

Query: 80  INHLSDLTREEMKSR-LGL 97
           +N  +DL+ EE KS+ LGL
Sbjct: 92  LNEFADLSHEEFKSKFLGL 110


>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
          Length = 329

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 2/66 (3%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM- 91
          Q+E+F   F +SY  +EE A+R  VF  N++LI + N   H T T G+N  +DLT EE  
Sbjct: 18 QWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGH-TYTLGVNQFADLTVEEFS 76

Query: 92 KSRLGL 97
          K+ +G 
Sbjct: 77 KTYMGF 82


>gi|340508003|gb|EGR33817.1| papain family cysteine protease, putative [Ichthyophthirius
           multifiliis]
          Length = 334

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 29/76 (38%), Positives = 48/76 (63%), Gaps = 2/76 (2%)

Query: 25  TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
           ++N  ++ +FE F   ++K Y ++++   R  VF +NLK IE  NK    + T G+N +S
Sbjct: 40  SQNVNYVSEFENFNFKYNKQYQSQQQYQYRLQVFTENLKYIEQQNKKSQ-SFTLGVNSIS 98

Query: 85  DLTREE-MKSRLGLNL 99
            LTREE +++ LGLN+
Sbjct: 99  HLTREEFIQTYLGLNI 114


>gi|73983670|ref|XP_540846.2| PREDICTED: cathepsin W [Canis lupus familiaris]
          Length = 374

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 32/93 (34%), Positives = 45/93 (48%), Gaps = 1/93 (1%)

Query: 5   ASAEATLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLK 63
           A + A+LA   +    N+     P  LKQ F  F   +++SY   EE A+R  +F  NL 
Sbjct: 12  ALSVASLAHGIKRSLKNQDPGPQPLELKQVFALFQIQYNRSYSNPEEYARRLDIFAHNLA 71

Query: 64  LIEDLNKGEHGTATYGINHLSDLTREEMKSRLG 96
             + L   + GTA +G+   SDLT EE     G
Sbjct: 72  QAQQLEDEDLGTAEFGVTPFSDLTEEEFGQFYG 104


>gi|312106123|ref|XP_003150646.1| hypothetical protein LOAG_15105 [Loa loa]
 gi|307754189|gb|EFO13423.1| hypothetical protein LOAG_15105 [Loa loa]
          Length = 139

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/59 (38%), Positives = 38/59 (64%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          F  FI+  ++ Y +K+E+ KRF +++ NL+L + + K E  TA YG    SD+T+EE +
Sbjct: 30 FANFIQQHNRKYRSKKELLKRFRIYKRNLRLAKLIQKNEQDTAIYGETPFSDMTQEEFR 88


>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 596

 Score = 49.7 bits (117), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 21/69 (30%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 34  FEKFIRDFSKSYPTK-EEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           F+ F+  + ++Y +  +E  +RF +F+ N ++++ LN+ E GTA YGI    D++ EE  
Sbjct: 169 FDMFLEKYPRTYSSSSDEYNERFEIFKTNYQVVQHLNEIERGTAVYGITKFMDMSEEEYH 228

Query: 93  SRLGLNLSK 101
             L    ++
Sbjct: 229 RTLAPGFTR 237


>gi|395545396|ref|XP_003774588.1| PREDICTED: cathepsin W [Sarcophilus harrisii]
          Length = 358

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 22/61 (36%), Positives = 33/61 (54%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++F+ F   ++KSYP   E   R  +F DNL   + L +   G A +G+   SDLT EE 
Sbjct: 42  ERFKAFQIQYNKSYPDAAEQECRLKIFADNLARAQQLTEEHQGLAQFGVTRFSDLTEEEF 101

Query: 92  K 92
           +
Sbjct: 102 R 102


>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 356

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 29/75 (38%), Positives = 46/75 (61%), Gaps = 2/75 (2%)

Query: 25  TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
           + N   ++ FEK++    K+Y + EE   RF VF+DNLK I+ +N+ E  +   G+N  +
Sbjct: 40  SSNERLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINR-EVTSYWLGLNEFA 98

Query: 85  DLTREEMKSR-LGLN 98
           DLT +E K+  LGL+
Sbjct: 99  DLTHDEFKAAYLGLD 113


>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
          Length = 385

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 23/67 (34%), Positives = 38/67 (56%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  FE ++ ++ KSY    E  +RF +F+DNL+ +++ N   + +   G+N  SDL
Sbjct: 41  NDEVIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDL 100

Query: 87  TREEMKS 93
           T  E  S
Sbjct: 101 TDAEYSS 107


>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
 gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
          Length = 338

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 22/67 (32%), Positives = 38/67 (56%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++ E ++ ++ + Y    E A+RF VF+DN+  +E  N  ++     GIN  +DLT EE
Sbjct: 33 VERHENWMVEYGRVYKDAAEKARRFEVFKDNVAFVESFNTNKNNKFWLGINQFADLTIEE 92

Query: 91 MKSRLGL 97
           K+  G 
Sbjct: 93 FKANKGF 99


>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
 gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 32/80 (40%), Positives = 47/80 (58%), Gaps = 4/80 (5%)

Query: 25  TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
           T N + +  FE +I  F + Y + EE  +RF +F+DNL  I+D NK        G+N  +
Sbjct: 38  TSNDKLIDLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKKVR-NYWLGLNEFA 96

Query: 85  DLTREEMKSR-LGL--NLSK 101
           DL+ EE K++ LGL  +LSK
Sbjct: 97  DLSHEEFKNKYLGLKPDLSK 116


>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
 gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
 gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
          Length = 352

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 23/60 (38%), Positives = 36/60 (60%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           + +F  +   +++SYPT EE  +RF V+  N++ IE  N+  + T T G N  +DLT EE
Sbjct: 46  MDRFLSWQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEEE 105


>gi|30387350|ref|NP_848429.1| cathepsin [Choristoneura fumiferana MNPV]
 gi|1168799|sp|P41715.1|CATV_NPVCF RecName: Full=Viral cathepsin; Short=V-cath; AltName:
          Full=Cysteine proteinase; Short=CP; Flags: Precursor
 gi|332509|gb|AAA96732.1| cathepsin [Choristoneura fumiferana MNPV]
 gi|30270084|gb|AAP29900.1| cathepsin [Choristoneura fumiferana MNPV]
          Length = 324

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 27/67 (40%), Positives = 42/67 (62%), Gaps = 2/67 (2%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE F+  F+KSY ++ E  +RF +F  NL+ I + N  +  TA Y IN  +DL+++E  S
Sbjct: 28 FEDFLHKFNKSYSSESEKLRRFQIFRHNLEEIINKNHND-STAQYEINKFADLSKDETIS 86

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 87 KYTGLSL 93


>gi|413944254|gb|AFW76903.1| hypothetical protein ZEAMMB73_202256 [Zea mays]
          Length = 151

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 39/68 (57%)

Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          E+   + + E+++  +S+ Y    E A+RF VF+ N+K IE  N G +     GIN  +D
Sbjct: 29 EDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKFIESFNTGGNRKFWLGINQFAD 88

Query: 86 LTREEMKS 93
          LT +E ++
Sbjct: 89 LTNDEFRT 96


>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
 gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 22/59 (37%), Positives = 36/59 (61%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E+++    ++Y   EE  +RF +F+ NLK IE+ N   + T   G+NH +DLT EE
Sbjct: 36 EKHEQWMARHGRTYQDDEEKERRFHIFKKNLKHIENFNNAFNRTYKLGLNHFADLTDEE 94


>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
          Length = 324

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 27/72 (37%), Positives = 44/72 (61%), Gaps = 3/72 (4%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSDLTRE 89
           +F+ F  +  K+Y  + E +KRF +F DN++ IE  N   E G  +Y  GIN  +D+++E
Sbjct: 25  KFQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQE 84

Query: 90  EMKSRLGLNLSK 101
           E K+ L L+ S+
Sbjct: 85  EFKTMLTLSASR 96


>gi|70935030|ref|XP_738656.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56515053|emb|CAH79945.1| hypothetical protein PC000617.03.0 [Plasmodium chabaudi chabaudi]
          Length = 221

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 39/68 (57%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  F  F++ F+K Y + EE+ +RF +F +NLK +E  NK +      GIN  SD+
Sbjct: 149 NLESVNIFYNFMKKFNKQYNSAEEMQERFYIFTENLKKVEKHNKEKKYMYKKGINPFSDM 208

Query: 87  TREEMKSR 94
             EE K R
Sbjct: 209 RPEEFKMR 216


>gi|71482942|gb|AAZ32410.1| cysteine proteinase aleuran type [Nicotiana benthamiana]
          Length = 360

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H   F +F   + K Y T EE+ +RF VF DNLK+I   NK +  +   G+N  +D+T +
Sbjct: 57  HALLFARFAHRYGKRYETVEEIKQRFEVFLDNLKMIRSHNK-KGLSYKLGVNEFTDITWD 115

Query: 90  EM-KSRLG 96
           E  + RLG
Sbjct: 116 EFRRDRLG 123


>gi|14424447|sp|P25780.2|PEPT1_EURMA RecName: Full=Peptidase 1; AltName: Full=Allergen Eur m I;
          AltName: Full=Mite group 1 allergen Eur m 1; AltName:
          Allergen=Eur m 1; Flags: Precursor
 gi|3941388|gb|AAC82351.1| group 1 allergen Eur m 1 0101 precursor [Euroglyphus maynei]
          Length = 321

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 31/73 (42%), Positives = 46/73 (63%), Gaps = 12/73 (16%)

Query: 28 PEHLKQFEKFIRDFSKSY--PTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          P  +K FE+F + F+K+Y  P KEEVA++   F ++LK +E  NKG        INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKTYATPEKEEVARK--NFLESLKYVES-NKG-------AINHLSD 69

Query: 86 LTREEMKSRLGLN 98
          L+ +E K++  +N
Sbjct: 70 LSLDEFKNQFLMN 82


>gi|323452406|gb|EGB08280.1| hypothetical protein AURANDRAFT_26549 [Aureococcus
          anophagefferens]
          Length = 339

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 24/61 (39%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F KF  DFS +Y + +E ++RF  F+ NL +I+ LNK  H  A +GI   +D + +E   
Sbjct: 21 FSKFQEDFSTTYSSPDETSERFTYFKKNLGMIDKLNK-VHPHALFGITKFADKSEDERAV 79

Query: 94 R 94
          R
Sbjct: 80 R 80


>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
 gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
          Length = 345

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 23/70 (32%), Positives = 41/70 (58%), Gaps = 1/70 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +KQFE+++ ++ + Y   +E   RF +F++N+  IE  N     + T GIN  +D+T  E
Sbjct: 34  MKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNE 93

Query: 91  MKSRL-GLNL 99
             ++  GL+L
Sbjct: 94  FVAQYTGLSL 103


>gi|389608785|dbj|BAM18004.1| unknown unsecreted protein [Papilio xuthus]
          Length = 88

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 38/64 (59%), Gaps = 1/64 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FEKF++DF K+Y    +  + +  F  +L  I ++N  + G+ATYG+N  +D T EE K 
Sbjct: 22 FEKFVKDFDKNYKDDADREEHYQAFIKSLHRINEMN-SKDGSATYGVNKFADYTEEETKQ 80

Query: 94 RLGL 97
            G+
Sbjct: 81 MRGM 84


>gi|301762528|ref|XP_002916735.1| PREDICTED: cathepsin W-like [Ailuropoda melanoleuca]
          Length = 374

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LKQ F  F   +++SY   EE A+R  +F  NL   + L   + GTA +G+   SDL
Sbjct: 35  PLELKQVFTLFQIQYNRSYSNPEEYARRLDIFARNLAQAQQLEAEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|281350618|gb|EFB26202.1| hypothetical protein PANDA_004780 [Ailuropoda melanoleuca]
          Length = 373

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LKQ F  F   +++SY   EE A+R  +F  NL   + L   + GTA +G+   SDL
Sbjct: 35  PLELKQVFTLFQIQYNRSYSNPEEYARRLDIFARNLAQAQQLEAEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
 gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
          Length = 364

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 35/87 (40%), Positives = 45/87 (51%), Gaps = 4/87 (4%)

Query: 13  LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
           L  Q+    E    N EH   F  F   FSK+Y TKEE   RF VF+ NL +   L++  
Sbjct: 32  LIRQVVDTAEDHILNAEH--HFTSFKSKFSKNYATKEEHDYRFGVFKSNL-IKAKLHQKL 88

Query: 73  HGTATYGINHLSDLTREEMKSR-LGLN 98
             +A +GI   SDLT  E + + LGLN
Sbjct: 89  DPSAQHGITKFSDLTASEFRRQFLGLN 115


>gi|158524604|gb|ABW71226.1| cysteine protease [Nicotiana tabacum]
          Length = 360

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H   F +F   + K Y + EE+ +RF VF DNLK+I   NK +  +   G+N  +DLT +
Sbjct: 57  HALSFVRFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNK-KGLSYKLGVNEFTDLTWD 115

Query: 90  EM-KSRLG 96
           E  + RLG
Sbjct: 116 EFRRDRLG 123


>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 54/99 (54%), Gaps = 9/99 (9%)

Query: 9   ATLALFGQMKSNNELKTENPEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDN 61
           ATL +   +  +  +   +PEHL         FE ++   SK+Y + EE   RF +F DN
Sbjct: 15  ATLFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYRSIEEKLHRFEIFLDN 74

Query: 62  LKLIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGLNL 99
           LK I++ NK +  +   G+N  +DL+ EE KS+ LGL +
Sbjct: 75  LKHIDETNK-KVSSYWLGLNEFADLSHEEFKSKYLGLRV 112


>gi|194870649|ref|XP_001972693.1| GG15663 [Drosophila erecta]
 gi|190654476|gb|EDV51719.1| GG15663 [Drosophila erecta]
          Length = 549

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 27/69 (39%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 29  EHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
           EH+ K F  F R    +YP+  E   R  +F  NL+ I   N+ +  T T  +NHL+D T
Sbjct: 239 EHVDKAFHHFKRKHGVAYPSDTEHEHRKNIFRQNLRYIHSKNRAKL-TYTLAVNHLADKT 297

Query: 88  REEMKSRLG 96
            EE+K+R G
Sbjct: 298 EEELKARRG 306


>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
          Length = 461

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 29/82 (35%), Positives = 39/82 (47%), Gaps = 4/82 (4%)

Query: 11  LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           LA+  Q   N E KT        F  FI+ F + Y + EE   RF ++  N+   + L  
Sbjct: 140 LAMNSQEWQNEEKKT----LWSDFMTFIKKFKREYSSIEEQLDRFRIYLQNMNFAKKLQF 195

Query: 71  GEHGTATYGINHLSDLTREEMK 92
            E GTA YG    SD+T EE +
Sbjct: 196 EEKGTAIYGATKFSDMTAEEFQ 217


>gi|312377879|gb|EFR24605.1| hypothetical protein AND_10691 [Anopheles darlingi]
          Length = 375

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 32/90 (35%), Positives = 44/90 (48%), Gaps = 4/90 (4%)

Query: 12  ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           A F  M+     ++E  EHL  +F +F     K+Y +  E   R  VF  NL+ I   N+
Sbjct: 52  ATFNPMQEFVHPRSE--EHLHDEFSRFKGKHQKTYASDREHEHRLNVFRQNLRFIHSHNR 109

Query: 71  GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
              G  T  +NHL+D T +EMKS  G   S
Sbjct: 110 ANRGF-TVAVNHLADRTEDEMKSLRGFRSS 138


>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
 gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 26/83 (31%), Positives = 44/83 (53%), Gaps = 2/83 (2%)

Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
          LA++    S  EL       +++ EK++    K Y   EE  +RF +F++N++ IE  N 
Sbjct: 18 LAMWADQASTRELHEST--MVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVEFIESSNA 75

Query: 71 GEHGTATYGINHLSDLTREEMKS 93
            + +   GIN  +DLT EE ++
Sbjct: 76 AGNNSYMLGINRFADLTNEEFRA 98


>gi|196014793|ref|XP_002117255.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
 gi|190580220|gb|EDV20305.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
          Length = 353

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 41/74 (55%)

Query: 20  NNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
           + +  T +    K + +FI++++KSY   +E+  R+ VF  N+       K ++ T  YG
Sbjct: 41  SQDTATHHDPMFKNYLQFIKEYNKSYNNIQELNYRYQVFTKNMARAMLFQKHDNATGRYG 100

Query: 80  INHLSDLTREEMKS 93
              LSDLT +E+KS
Sbjct: 101 FTKLSDLTDQEVKS 114


>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
          Length = 356

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 20/71 (28%), Positives = 41/71 (57%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +K+FE+++ ++ + Y   +E  +RF +F++N+  IE  N     + T GIN  +D+T  E
Sbjct: 34  MKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSRNENSYTLGINQFTDMTNNE 93

Query: 91  MKSRLGLNLSK 101
             ++    +S+
Sbjct: 94  FIAQYTGGISR 104


>gi|312282841|dbj|BAJ34286.1| unnamed protein product [Thellungiella halophila]
          Length = 358

 Score = 48.9 bits (115), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y   EE+  RF++F++NL LI   NK +  +   G+N  +DLT +
Sbjct: 55  HVLSFARFTHRYGKKYQNAEEIKLRFSIFKENLDLIRSTNK-KRLSYKLGVNQFADLTWQ 113

Query: 90  EM-KSRLG 96
           E  +++LG
Sbjct: 114 EFQRNKLG 121


>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
          Length = 343

 Score = 48.9 bits (115), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 22/66 (33%), Positives = 36/66 (54%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E+++  + + Y    E A+RF VF+DNL  +E  N  +      G+N  +DLT EE 
Sbjct: 39  ERHERWMAVYGRVYKDAAEKARRFEVFKDNLAFVESFNADKKNKFWLGVNQFADLTTEEF 98

Query: 92  KSRLGL 97
           K+  G 
Sbjct: 99  KANKGF 104


>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
          Length = 315

 Score = 48.9 bits (115), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 31/69 (44%), Positives = 44/69 (63%), Gaps = 4/69 (5%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY-GINHLSDLTRE 89
           ++ FE +I +F K+Y T EE   RF VF+DNLK I++ NK   G + + G+N  +DL+ E
Sbjct: 48  IELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNK--KGKSYWLGLNEFADLSHE 105

Query: 90  EMKS-RLGL 97
           E K   LGL
Sbjct: 106 EFKKMYLGL 114


>gi|332249835|ref|XP_003274061.1| PREDICTED: cathepsin W [Nomascus leucogenys]
          Length = 403

 Score = 48.9 bits (115), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 39/70 (55%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F++SY + EE A+R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 62  PLELKEAFKLFQIQFNRSYLSPEEHARRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 121

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 122 TEEEFGQLYG 131


>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 469

 Score = 48.9 bits (115), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 26/71 (36%), Positives = 40/71 (56%), Gaps = 1/71 (1%)

Query: 24  KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
           K  + E +  +E ++    KSY    E  +RF +F+DNL+ IE+ N   + T   G+N  
Sbjct: 44  KRTDAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHN-AVNRTYKVGLNRF 102

Query: 84  SDLTREEMKSR 94
           +DLT EE +SR
Sbjct: 103 ADLTNEEYRSR 113


>gi|351701945|gb|EHB04864.1| Cathepsin W [Heterocephalus glaber]
          Length = 373

 Score = 48.9 bits (115), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F+KSY    E A+R  +F  NL + + L + + GTA +G+   SDL
Sbjct: 35  PLELKEVFKLFQIQFNKSYSNPAEHARRLDIFVHNLAMAQRLQEEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|31096290|gb|AAP43630.1| chabaupain-2 [Plasmodium chabaudi chabaudi]
          Length = 471

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 39/68 (57%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  F  F++ F+K Y + EE+ +RF +F +NLK +E  NK +      GIN  SD+
Sbjct: 147 NLESVNIFYNFMKKFNKQYNSAEEMQERFYIFTENLKKVEKHNKEKKYMYKKGINPFSDM 206

Query: 87  TREEMKSR 94
             EE K R
Sbjct: 207 RPEEFKMR 214


>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
 gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
           Precursor
 gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
 gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
 gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
          Length = 356

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 31/69 (44%), Positives = 44/69 (63%), Gaps = 4/69 (5%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY-GINHLSDLTRE 89
           ++ FE +I +F K+Y T EE   RF VF+DNLK I++ NK   G + + G+N  +DL+ E
Sbjct: 48  IELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNK--KGKSYWLGLNEFADLSHE 105

Query: 90  EMKS-RLGL 97
           E K   LGL
Sbjct: 106 EFKKMYLGL 114


>gi|229594208|ref|XP_001031647.3| Papain family cysteine protease containing protein [Tetrahymena
          thermophila]
 gi|225567000|gb|EAR83984.3| Papain family cysteine protease containing protein [Tetrahymena
          thermophila SB210]
          Length = 331

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 35/89 (39%), Positives = 45/89 (50%), Gaps = 7/89 (7%)

Query: 11 LALFGQMKSNNELKTENPE---HLK--QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
          LAL G   +   L T NP    HL    F KF R F+  Y  + E + R +VF +NLK+I
Sbjct: 10 LALIG--AATVYLITRNPNGDGHLDMYSFLKFKRSFNVQYHNESEESYRLSVFLENLKMI 67

Query: 66 EDLNKGEHGTATYGINHLSDLTREEMKSR 94
          E  N     T    +N  +DLT EE +SR
Sbjct: 68 EKHNADSTRTYDQEVNQFADLTIEEFESR 96


>gi|297688135|ref|XP_002821545.1| PREDICTED: cathepsin W [Pongo abelii]
          Length = 376

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 39/70 (55%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F++SY + EE A R  +F +NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFANNLAQAQRLQEEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|37651368|ref|NP_932731.1| cathepsin [Choristoneura fumiferana DEF MNPV]
 gi|82024252|sp|Q6VTL7.1|CATV_NPVCD RecName: Full=Viral cathepsin; Short=V-cath; AltName:
          Full=Cysteine proteinase; Short=CP; Flags: Precursor
 gi|37499277|gb|AAQ91676.1| cathepsin [Choristoneura fumiferana DEF MNPV]
          Length = 324

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 44/68 (64%), Gaps = 4/68 (5%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT-ATYGINHLSDLTREEMK 92
          FE F+ +F+K+Y +K E   RF +F+ NL+  E +NK  + T A Y IN  SDL+++E  
Sbjct: 28 FEDFLHNFNKNYSSKSEKLHRFKIFQHNLE--EIINKNLNDTSAQYEINKFSDLSKDETI 85

Query: 93 SRL-GLNL 99
          S+  GL+L
Sbjct: 86 SKYTGLSL 93


>gi|8050826|gb|AAF71757.1| cysteine protease falcipain-3 [Plasmodium falciparum]
          Length = 488

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 44/77 (57%), Gaps = 1/77 (1%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
           +N E +  F  F+++ +K Y T EE+ KRF +F +N + IE  NK  +     G+N   D
Sbjct: 159 DNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGD 218

Query: 86  LTREEMKSRLGLNLSKH 102
           L+ EE +S+  LNL  H
Sbjct: 219 LSPEEFRSKY-LNLKTH 234


>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
          Length = 307

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 21/66 (31%), Positives = 36/66 (54%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          ++ E+++ ++ + Y    E A+RF VF+DN   +E  N  +      G+N  +DLT EE 
Sbjct: 3  ERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTEEF 62

Query: 92 KSRLGL 97
          K+  G 
Sbjct: 63 KANKGF 68


>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
          Length = 368

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 5/84 (5%)

Query: 15  GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
           GQ +S++ L T    H   F  F R F KSY ++EE   RF+VF+ NL+      K +  
Sbjct: 37  GQDESSSNLLTAEQHH---FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLD-P 92

Query: 75  TATYGINHLSDLTREEMKSR-LGL 97
           TA++G+   SDLT  E + + LGL
Sbjct: 93  TASHGVTQFSDLTSAEFRKQVLGL 116


>gi|402892809|ref|XP_003909601.1| PREDICTED: cathepsin W [Papio anubis]
          Length = 375

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 39/70 (55%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F++SY + EE A+R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKEAFKLFQIQFNRSYLSPEEHARRLDIFAHNLAQAQRLQEEDLGTAEFGVTLFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|357619726|gb|EHJ72185.1| cathepsin [Danaus plexippus]
          Length = 1118

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 26/57 (45%), Positives = 38/57 (66%), Gaps = 2/57 (3%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           FE+FI+D++K Y   E+  +RF +F +NLK I  +N+     A YGIN  SDL++EE
Sbjct: 302 FEQFIKDYNKEYDESEK-EERFKIFVNNLKDINAMNE-RSSNAVYGINKFSDLSKEE 356



 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 26/57 (45%), Positives = 38/57 (66%), Gaps = 2/57 (3%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           FE+FI+D++K Y   E+  +RF +F +NLK I  +N+     A YGIN  SDL++EE
Sbjct: 519 FEQFIKDYNKEYDESEK-EERFKIFVNNLKDINAMNE-RSSNAVYGINKFSDLSKEE 573



 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 38/57 (66%), Gaps = 2/57 (3%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           FE+FI+D++K Y   E+  +RF +F +NLK I  +N+     A YGIN  SDL+++E
Sbjct: 819 FEQFIKDYNKEYDESEK-EERFKIFVNNLKDINAMNE-RSSNAVYGINKFSDLSKDE 873


>gi|124803852|ref|XP_001347833.1| falcipain-3 [Plasmodium falciparum 3D7]
 gi|9255922|gb|AAF86352.1|AF282974_1 cysteine protease falcipain-3 [Plasmodium falciparum]
 gi|23496085|gb|AAN35746.1|AE014838_24 falcipain-3 [Plasmodium falciparum 3D7]
          Length = 492

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 44/77 (57%), Gaps = 1/77 (1%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
           +N E +  F  F+++ +K Y T EE+ KRF +F +N + IE  NK  +     G+N   D
Sbjct: 163 DNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGD 222

Query: 86  LTREEMKSRLGLNLSKH 102
           L+ EE +S+  LNL  H
Sbjct: 223 LSPEEFRSKY-LNLKTH 238


>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
 gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
          Length = 368

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 5/84 (5%)

Query: 15  GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
           GQ +S++ L T    H   F  F R F KSY ++EE   RF+VF+ NL+      K +  
Sbjct: 37  GQDESSSNLLTAEQHH---FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLD-P 92

Query: 75  TATYGINHLSDLTREEMKSR-LGL 97
           TA++G+   SDLT  E + + LGL
Sbjct: 93  TASHGVTQFSDLTSAEFRKQVLGL 116


>gi|2351557|gb|AAB68595.1| cathepsin [Choristoneura fumiferana MNPV]
          Length = 324

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 29/68 (42%), Positives = 44/68 (64%), Gaps = 4/68 (5%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT-ATYGINHLSDLTREEMK 92
          FE F+ +F+K+Y +K E   RF +F+ NL+  E +NK  + T A Y IN  SDL+++E  
Sbjct: 28 FEDFLHNFNKNYSSKSEKLHRFKIFQHNLE--EIINKNLNDTSAQYEINKFSDLSKDETI 85

Query: 93 SRL-GLNL 99
          S+  GL+L
Sbjct: 86 SKYTGLSL 93


>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
          Length = 433

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 35/59 (59%)

Query: 35  EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           E+++  +S+ Y    E A+RF VF+ N++ IE  N G +     G+N  +DLT +E +S
Sbjct: 131 EQWMAQYSRVYKDASEKARRFEVFKANVQFIESFNAGGNNKFWLGVNQFADLTNDEFRS 189


>gi|358347416|ref|XP_003637753.1| Cysteine proteinase [Medicago truncatula]
 gi|355503688|gb|AES84891.1| Cysteine proteinase [Medicago truncatula]
          Length = 323

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 38/63 (60%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          K  E++++DF ++Y    E  KRF +F  NL+ IE+ N+  + T   G+N   DLT++E 
Sbjct: 32 KTHEQWMKDFGRTYADDVEKEKRFKIFAKNLEYIENFNRAGNETYELGLNQFLDLTKKEF 91

Query: 92 KSR 94
           S+
Sbjct: 92 TSK 94


>gi|388491952|gb|AFK34042.1| unknown [Lotus japonicus]
          Length = 352

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 22/65 (33%), Positives = 39/65 (60%), Gaps = 1/65 (1%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H   F +F   + K Y + EE+  RF +F +NL+LI+  NK +  +   G+NH +DL+ +
Sbjct: 49  HAASFARFASKYGKRYDSVEEIQHRFRIFSENLELIKSTNK-KRLSYKLGLNHFADLSWD 107

Query: 90  EMKSR 94
           E +++
Sbjct: 108 EFRTQ 112


>gi|3377952|emb|CAA08906.1| cysteine proteinase [Cicer arietinum]
          Length = 362

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 34/86 (39%), Positives = 47/86 (54%), Gaps = 4/86 (4%)

Query: 13  LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
           L  Q+  + + +  N EH   F  F   FSKSY TKEE   RF VF+ NLK  + L++  
Sbjct: 28  LIRQVTDHEDDQLLNAEH--HFTTFKSKFSKSYATKEEHDYRFGVFKSNLKKAK-LHQKL 84

Query: 73  HGTATYGINHLSDLTREEMKSR-LGL 97
             +A +G+   SDLT  E + + LGL
Sbjct: 85  DPSAEHGVTKFSDLTASEFRRQFLGL 110


>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
 gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 47/81 (58%), Gaps = 9/81 (11%)

Query: 27  NPEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
           +PEHL         FE ++   SK+Y + EE   RF +F DNLK I++ NK +  +   G
Sbjct: 33  SPEHLASMDKTIELFESWMSKHSKAYRSIEEKLHRFEIFLDNLKHIDETNK-KVSSYWLG 91

Query: 80  INHLSDLTREEMKSR-LGLNL 99
           +N  +DL+ EE KS+ LGL +
Sbjct: 92  LNEFADLSHEEFKSKYLGLRV 112


>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
          Length = 603

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 25/69 (36%), Positives = 40/69 (57%), Gaps = 2/69 (2%)

Query: 25  TENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
           T  PE+ +Q +E+F + + K+Y   ++   RF+VF++NL     L   E GTA YG+   
Sbjct: 297 TPEPENARQLYEEFKQKYKKTYVNDDD-EYRFSVFKENLLRAHQLQTMEQGTAEYGVTQF 355

Query: 84  SDLTREEMK 92
            DLT +E +
Sbjct: 356 FDLTSQEFQ 364


>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
 gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
          Length = 328

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 21/67 (31%), Positives = 38/67 (56%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++ E ++ ++ + Y    E A+RF VF+DN+  +E  N  ++     G+N  +DLT EE
Sbjct: 33 VERHENWMVEYGRVYKDAAEKARRFQVFKDNVAFVESFNTNKNNKFWLGVNQFADLTTEE 92

Query: 91 MKSRLGL 97
           K+  G 
Sbjct: 93 FKANKGF 99


>gi|118397782|ref|XP_001031222.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila]
 gi|89285547|gb|EAR83559.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila SB210]
          Length = 331

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 21/67 (31%), Positives = 37/67 (55%)

Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
          E L+ + KF R++ + Y  + E   R A+F +N + I+D N     T   G+N  SD+T+
Sbjct: 27 EALQAYNKFTRNYPRIYLNEAESDYRLAIFLENYQKIQDHNNNPENTYQIGVNRFSDMTQ 86

Query: 89 EEMKSRL 95
          +E   ++
Sbjct: 87 QEFSQKI 93


>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
          Length = 461

 Score = 48.5 bits (114), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 27/83 (32%), Positives = 49/83 (59%), Gaps = 5/83 (6%)

Query: 11  LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           +++ G++ S+   +T++ E +  +E ++    KSY    E  KRF +F+DNL+ I++ N 
Sbjct: 27  MSIIGELSSS---RTDD-EVMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHN- 81

Query: 71  GEHGTATYGINHLSDLTREEMKS 93
            E  T   G+N  +DLT +E +S
Sbjct: 82  AESRTYKVGLNRFADLTNDEYRS 104


>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
          Length = 367

 Score = 48.5 bits (114), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 26/80 (32%), Positives = 42/80 (52%), Gaps = 7/80 (8%)

Query: 15  GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
           G + S N L       ++ F++++    K Y + EE A+R  +F  NL+ I   NK  + 
Sbjct: 31  GDINSGNGL-------VRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNS 83

Query: 75  TATYGINHLSDLTREEMKSR 94
           +   G+N  +DLT EE K+R
Sbjct: 84  SFRLGLNKFADLTNEEFKTR 103


>gi|118387039|ref|XP_001026636.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila]
 gi|89308403|gb|EAS06391.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila SB210]
          Length = 336

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 29/64 (45%), Positives = 39/64 (60%), Gaps = 3/64 (4%)

Query: 29 EHLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
          +HL+Q  F  F + F+K Y ++E    RF VF +NLK IE LNK E  TA + +   SD 
Sbjct: 33 KHLQQQSFLDFKKSFAKKYNSQEHELFRFNVFLENLKEIERLNK-EITTAKFDVTQFSDY 91

Query: 87 TREE 90
          T+EE
Sbjct: 92 TKEE 95


>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
          Length = 357

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 19/60 (31%), Positives = 36/60 (60%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +K+FE+++ ++ + Y   +E  +RF +F++N+  IE  N     + T GIN  +D+T  E
Sbjct: 34 MKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSRNGNSYTLGINQFTDMTNNE 93


>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
          Length = 356

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 20/71 (28%), Positives = 41/71 (57%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +K+FE+++ ++ + Y   +E  +RF +F++N+  IE  N     + T GIN  +D+T  E
Sbjct: 34  MKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSRNKDSYTLGINQFTDMTNNE 93

Query: 91  MKSRLGLNLSK 101
             ++    +S+
Sbjct: 94  FVAQYTGGISR 104


>gi|405977658|gb|EKC42097.1| Cathepsin F [Crassostrea gigas]
          Length = 715

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 21/63 (33%), Positives = 36/63 (57%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F++F   F + Y +K+E   RF +F +N++  + L   E GTA YG+   +D++  E K 
Sbjct: 418 FQQFQAAFKRLYMSKQEEKTRFKIFCENMRKAKKLQDVEKGTAVYGVTKFADMSESEFKQ 477

Query: 94  RLG 96
            +G
Sbjct: 478 YVG 480


>gi|11066228|gb|AAG28508.1|AF197480_1 cathepsin F [Mus musculus]
          Length = 462

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 34/60 (56%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y ++EE   R  VF  N+   + +   + GTA YGI   SDLT EE  +
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224


>gi|77628008|ref|NP_001029282.1| cathepsin F precursor [Rattus norvegicus]
 gi|71681040|gb|AAH99780.1| Cathepsin F [Rattus norvegicus]
 gi|149062007|gb|EDM12430.1| cathepsin F, isoform CRA_a [Rattus norvegicus]
 gi|159895422|gb|ABX09995.1| cathepsin F [Rattus norvegicus]
          Length = 462

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 34/60 (56%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y ++EE   R  VF  N+   + +   + GTA YGI   SDLT EE  +
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224


>gi|9845246|ref|NP_063914.1| cathepsin F precursor [Mus musculus]
 gi|12643321|sp|Q9R013.1|CATF_MOUSE RecName: Full=Cathepsin F; Flags: Precursor
 gi|6467384|gb|AAF13147.1|AF136280_1 cathepsin F precursor [Mus musculus]
 gi|7141165|gb|AAF37228.1|AF217224_1 cathepsin F [Mus musculus]
 gi|26344728|dbj|BAC36013.1| unnamed protein product [Mus musculus]
 gi|37589148|gb|AAH58758.1| Cathepsin F [Mus musculus]
 gi|148701127|gb|EDL33074.1| cathepsin F, isoform CRA_b [Mus musculus]
          Length = 462

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 34/60 (56%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y ++EE   R  VF  N+   + +   + GTA YGI   SDLT EE  +
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224


>gi|302774134|ref|XP_002970484.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
 gi|300162000|gb|EFJ28614.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
          Length = 343

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 26/65 (40%), Positives = 35/65 (53%), Gaps = 1/65 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
            F+ F++ F K Y T EE   R  VF+ NL  +  L K +  TA +GI   +DLT EE+ 
Sbjct: 45  HFKHFMQKFGKVYGTTEEYVHRLKVFQANLAHVMSLKK-QDPTAIHGITSFADLTPEELS 103

Query: 93  SRLGL 97
             LG 
Sbjct: 104 RFLGF 108


>gi|148701126|gb|EDL33073.1| cathepsin F, isoform CRA_a [Mus musculus]
          Length = 417

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 34/60 (56%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y ++EE   R  VF  N+   + +   + GTA YGI   SDLT EE  +
Sbjct: 120 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 179


>gi|4826565|emb|CAB42884.1| cathepsin F [Mus musculus]
          Length = 462

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 34/60 (56%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++++Y ++EE   R  VF  N+   + +   + GTA YGI   SDLT EE  +
Sbjct: 165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224


>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
          Length = 350

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H   F +F   + K Y + +E+  RF +F +NL+LI   NK    +   G+NH +D T E
Sbjct: 47  HAVSFARFANRYGKRYDSVDEMKLRFKIFSENLELIRSSNK-RRLSYKLGVNHFADWTWE 105

Query: 90  EMKS-RLG 96
           E +S RLG
Sbjct: 106 EFRSHRLG 113


>gi|388513209|gb|AFK44666.1| unknown [Lotus japonicus]
 gi|388514955|gb|AFK45539.1| unknown [Lotus japonicus]
          Length = 352

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 22/65 (33%), Positives = 39/65 (60%), Gaps = 1/65 (1%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H   F +F   + K Y + EE+  RF +F +NL+LI+  NK +  +   G+NH +DL+ +
Sbjct: 49  HAVSFARFASKYGKRYDSVEEIQHRFRIFSENLELIKSTNK-KRLSYKLGLNHFADLSWD 107

Query: 90  EMKSR 94
           E +++
Sbjct: 108 EFRTQ 112


>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1-like [Glycine max]
          Length = 343

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 4/82 (4%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           Q+KS    K  +    ++ E+++  + K Y    E+ KRF +FE+N++ IE  N   +  
Sbjct: 23  QVKSR---KLHDASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNKP 79

Query: 76  ATYGINHLSDLTREE-MKSRLG 96
               INHL+D T EE M S  G
Sbjct: 80  YKLSINHLADQTNEEFMASHKG 101


>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
 gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 24/66 (36%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
          FE +   + K+Y ++EE A R  VFE+N   +   N   + + T  +N  +DLT  E K 
Sbjct: 29 FEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFADLTHHEFKA 88

Query: 93 SRLGLN 98
          SRLG +
Sbjct: 89 SRLGFS 94


>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
          Length = 324

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 38/68 (55%), Gaps = 5/68 (7%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN----KGEHGTATYGINHLSDLTR 88
           F+ F     K+Y  + E  KRFA+F +NL+ IE  N    +G H + T GIN  +D+TR
Sbjct: 25 HFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIH-SYTQGINKFADMTR 83

Query: 89 EEMKSRLG 96
           E K+ L 
Sbjct: 84 AEFKAMLA 91


>gi|195128649|ref|XP_002008774.1| GI11630 [Drosophila mojavensis]
 gi|193920383|gb|EDW19250.1| GI11630 [Drosophila mojavensis]
          Length = 547

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 2/88 (2%)

Query: 14  FGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
           F      +E  + + EH+ K F  F R  +  Y T++E   R  +F  NL+ I   N+ +
Sbjct: 222 FATFNPMHEFISGSDEHVEKAFHHFKRKHAIDYSTEKEHEHRKNIFRQNLRYIHSKNRAK 281

Query: 73  HGTATYGINHLSDLTREEMKSRLGLNLS 100
             T    +NHL+D T EEMK+R G   S
Sbjct: 282 L-TYKLAVNHLADKTDEEMKARRGYKSS 308


>gi|113819972|gb|AAH04054.2| Ctsf protein [Mus musculus]
          Length = 332

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 34/60 (56%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F+ F+  ++++Y ++EE   R  VF  N+   + +   + GTA YGI   SDLT EE  +
Sbjct: 35 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 94


>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
 gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
          Length = 456

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 25/65 (38%), Positives = 38/65 (58%), Gaps = 1/65 (1%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E +  +E+++    K+Y    E  KRF +F+DNL  I+  N  E+ T T G+N  +DLT 
Sbjct: 37  EVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNS-ENRTYTVGLNRFADLTN 95

Query: 89  EEMKS 93
           EE +S
Sbjct: 96  EEFRS 100


>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
          Length = 465

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 25/65 (38%), Positives = 38/65 (58%), Gaps = 1/65 (1%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E +  +E+++    K+Y    E  KRF +F+DNL  I+  N  E+ T T G+N  +DLT 
Sbjct: 46  EVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNS-ENRTYTVGLNRFADLTN 104

Query: 89  EEMKS 93
           EE +S
Sbjct: 105 EEFRS 109


>gi|241111179|ref|XP_002399230.1| cysteine protease and A protease inhibitor, putative [Ixodes
           scapularis]
 gi|215492918|gb|EEC02559.1| cysteine protease and A protease inhibitor, putative [Ixodes
           scapularis]
          Length = 363

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 27/84 (32%), Positives = 48/84 (57%), Gaps = 3/84 (3%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPT-KEEVAKRFAVFEDNLKLIEDLNK-GEH 73
           + ++++  +T +P     FE++++ ++K+Y +   E +KR   F D L  IED N+ G H
Sbjct: 29  RTETDDTNRTADPSVEAAFEQYVKRYNKTYASGSAEYSKRLNAFRDALIRIEDRNRHGNH 88

Query: 74  GT-ATYGINHLSDLTREEMKSRLG 96
              A YG+   SDLT +E ++ L 
Sbjct: 89  SNGALYGLTPYSDLTPDEFRALLA 112


>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
 gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
          Length = 352

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 20/71 (28%), Positives = 41/71 (57%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +K+FE+++ ++ + Y   +E  +RF +F++N+  IE  N     + T GIN  +D+T  E
Sbjct: 34  MKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNE 93

Query: 91  MKSRLGLNLSK 101
             ++    +S+
Sbjct: 94  FVAQYTGGISR 104


>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
          Length = 379

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 14/81 (17%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG- 74
           ++K N+ L TE     K+F+ F++D+SK Y T EE   R  +F  N+     +   EH  
Sbjct: 42  ELKDNDLLTTE-----KKFKLFMKDYSKKYSTTEEYLLRLGIFAKNM-----VKAAEHQA 91

Query: 75  ---TATYGINHLSDLTREEMK 92
              TA +G+   SDL+ EE +
Sbjct: 92  LDPTAIHGVTQFSDLSEEEFE 112


>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
          Length = 422

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 23/61 (37%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +E+++    K+Y    E  KRF +F+DNL+ I+D N  ++ T   G+N  +DLT EE ++
Sbjct: 4  YEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHN-ADNRTYKLGLNRFADLTNEEYRA 62

Query: 94 R 94
          R
Sbjct: 63 R 63


>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
          Length = 374

 Score = 48.5 bits (114), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 5/84 (5%)

Query: 15  GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
           GQ +S+  L T    HL  F+   R F KSY ++EE   RF+VF+ NL+      K +  
Sbjct: 43  GQDESSPNLLTAEQHHLSLFK---RKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLD-P 98

Query: 75  TATYGINHLSDLTREEMKSR-LGL 97
           TA++G+   SDLT  E + + LGL
Sbjct: 99  TASHGVTQFSDLTSAEFRKQVLGL 122


>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
          Length = 374

 Score = 48.5 bits (114), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 22/71 (30%), Positives = 42/71 (59%)

Query: 23  LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
           L+++  +   ++E ++ +  ++Y    E  KRF +F+DNL+ IE+ N   + T   G+N 
Sbjct: 39  LQSDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQ 98

Query: 83  LSDLTREEMKS 93
            +DLT EE ++
Sbjct: 99  FADLTNEEYRT 109


>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
          Length = 358

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 28/69 (40%), Positives = 40/69 (57%), Gaps = 4/69 (5%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATYGINHLSDLTR 88
           H   F +F R + K Y + EE+ +RF +F DNL++I   N KG   +   G+N  SDLT 
Sbjct: 55  HALSFARFARRYGKRYDSVEEIKQRFDIFLDNLEMINSHNDKGL--SYKLGVNEFSDLTW 112

Query: 89  EEM-KSRLG 96
           +E  + RLG
Sbjct: 113 DEFRRDRLG 121


>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
          Length = 322

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 28/72 (38%), Positives = 42/72 (58%), Gaps = 3/72 (4%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSDLTRE 89
           +F+ F     K+Y  + E   RF +F+DNL+ IE  N   E G  +Y  GIN  +D+T+E
Sbjct: 24  KFQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQE 83

Query: 90  EMKSRLGLNLSK 101
           E ++ L L+ SK
Sbjct: 84  EFRAFLTLSSSK 95


>gi|324522685|gb|ADY48108.1| Cathepsin L, partial [Ascaris suum]
          Length = 308

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 36/63 (57%)

Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
          H    + FI  ++++Y  K+E+ KRF +++ NL+  +     E GTA YG    SDLT+ 
Sbjct: 3  HGISVDGFIGRYNRTYSNKKEMLKRFRIYKRNLRAAKIWQANEQGTAIYGETQFSDLTQA 62

Query: 90 EMK 92
          E +
Sbjct: 63 EFR 65


>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 351

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 29/70 (41%), Positives = 44/70 (62%), Gaps = 2/70 (2%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           ++ FE++I +  K Y T EE   RF VF+DNLK I++ NK +  +   G+N  +DLT +E
Sbjct: 45  IELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNK-KVTSYWLGVNEFADLTHQE 103

Query: 91  MKSR-LGLNL 99
            K+  LGL +
Sbjct: 104 FKNMYLGLKV 113


>gi|401758206|gb|AFQ01138.1| cathepsin L3-like protease, partial [Chilo suppressalis]
          Length = 330

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 25/68 (36%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F++F +  SK Y    E+AKR  +F  NL+ I   N+   G  T  +NHL+D T +EM 
Sbjct: 244 EFDRFAKKHSKQYQNDVELAKRLNIFRQNLRYIHSNNRARRGF-TLSVNHLADRTDDEMA 302

Query: 93  SRLGLNLS 100
           +  G   S
Sbjct: 303 ALRGRRYS 310


>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 22/59 (37%), Positives = 36/59 (61%), Gaps = 1/59 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +E ++    K+Y    E  +RF +F+DNL+ I++ N G+H T   G+N  +DLT EE +
Sbjct: 52  YESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDH-TYKLGLNKFADLTNEEYR 109


>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
          Length = 339

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 28/68 (41%), Positives = 41/68 (60%), Gaps = 5/68 (7%)

Query: 35  EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY--GINHLSDLTREEMK 92
           E+++  + + Y T+ E  KRF +F++N++ IE  NK   GT  Y  GIN  +DLT +E K
Sbjct: 38  EQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKA--GTKPYKLGINAFADLTNQEFK 95

Query: 93  -SRLGLNL 99
            SR G  L
Sbjct: 96  ASRNGYKL 103


>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
          Length = 340

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 19/64 (29%), Positives = 38/64 (59%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +K+FE+++ ++ + Y   +E  +RF +F++N+  IE  N     + T GIN  +D+T  E
Sbjct: 34 MKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNE 93

Query: 91 MKSR 94
            ++
Sbjct: 94 FVTQ 97


>gi|294462776|gb|ADE76932.1| unknown [Picea sitchensis]
          Length = 403

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 31/74 (41%), Positives = 38/74 (51%), Gaps = 7/74 (9%)

Query: 27  NPEHLKQ------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
           N EHL        F+KFI +  K Y T EE  +R  +FE NL L    N+    TA +GI
Sbjct: 77  NREHLLNLRSKTLFDKFIVEHGKVYSTIEEYVRRLRIFEKNL-LKAAENQALDPTAVHGI 135

Query: 81  NHLSDLTREEMKSR 94
              SDLT  E +SR
Sbjct: 136 TPFSDLTEYEFESR 149


>gi|302793594|ref|XP_002978562.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
 gi|300153911|gb|EFJ20548.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
          Length = 343

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 26/65 (40%), Positives = 35/65 (53%), Gaps = 1/65 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
            F+ F++ F K Y T EE   R  VF+ NL  +  L K +  TA +GI   +DLT EE+ 
Sbjct: 45  HFKHFMQKFGKVYGTTEEYVHRLKVFQANLVHVMSLKK-QDPTAIHGITSFADLTPEELS 103

Query: 93  SRLGL 97
             LG 
Sbjct: 104 RFLGF 108


>gi|440292376|gb|ELP85581.1| cathepsin L, putative [Entamoeba invadens IP1]
          Length = 421

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 24/70 (34%), Positives = 41/70 (58%), Gaps = 1/70 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           QF++F+++ +  Y T  E+ +R  +FE +L+ IE+ NK  H T   GI   SD T EE +
Sbjct: 16  QFKEFLKENNIVYTTPSELLRRRLIFEQSLREIEEFNKSPH-TFQIGITQFSDQTNEEFQ 74

Query: 93  SRLGLNLSKH 102
           ++  L + + 
Sbjct: 75  NQFSLTMDRQ 84


>gi|341893196|gb|EGT49131.1| hypothetical protein CAEBREN_18227 [Caenorhabditis brenneri]
          Length = 381

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 31/70 (44%), Positives = 39/70 (55%), Gaps = 3/70 (4%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG--EHGTAT-YGINHLSD 85
           E  K FE+F   F K Y T EE   R  VF  N   +  LNK   ++G  T +GIN  SD
Sbjct: 40  EAFKAFEEFKIRFHKKYKTPEEEKMRGEVFLKNHNTVGILNKKAEQNGQGTKFGINKFSD 99

Query: 86  LTREEMKSRL 95
           LT++E +SRL
Sbjct: 100 LTKKEFQSRL 109


>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
          Length = 505

 Score = 48.1 bits (113), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 26/82 (31%), Positives = 43/82 (52%), Gaps = 2/82 (2%)

Query: 11  LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           L +FG +  +N L     ++  +FE +I  F K Y    E  KRF++F+ N+  +   N 
Sbjct: 158 LLIFGLIAISNALLFSEEQYKNEFENWIDRFEKKYDV-SEFKKRFSIFKSNMDFVHSWNS 216

Query: 71  GEHGTATYGINHLSDLTREEMK 92
            ++     G+NHL+DLT  E +
Sbjct: 217 -KNSQTVLGLNHLADLTNLEYR 237


>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
          Length = 324

 Score = 48.1 bits (113), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 38/68 (55%), Gaps = 5/68 (7%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN----KGEHGTATYGINHLSDLTR 88
           F+ F     K+Y  + E  KRFA+F +NL+ IE  N    +G H + T GIN  +D+TR
Sbjct: 25 HFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIH-SYTQGINKFADMTR 83

Query: 89 EEMKSRLG 96
           E K+ L 
Sbjct: 84 AEFKAMLA 91


>gi|158519867|ref|NP_001103540.1| cathepsin W precursor [Bos taurus]
 gi|158455042|gb|AAI13313.1| CTSW protein [Bos taurus]
 gi|296471607|tpg|DAA13722.1| TPA: cathepsin W [Bos taurus]
          Length = 272

 Score = 48.1 bits (113), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F  F   +++SYP   E A+R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKEVFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDL 94

Query: 87  TREEMKSRLGLNLS 100
           T EE     G  ++
Sbjct: 95  TEEEFVQLYGSQVA 108


>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
          Length = 394

 Score = 48.1 bits (113), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           QF +F RD +K Y T+EE  KR+A+F++NL  I + N   + +    +N   DLT EE +
Sbjct: 88  QFYQFQRDHNKFYATEEERLKRYAIFKNNLTYIHNHNMQGY-SYVLKMNKFGDLTLEEFR 146

Query: 93  SR 94
            R
Sbjct: 147 QR 148


>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 348

 Score = 48.1 bits (113), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 29/70 (41%), Positives = 44/70 (62%), Gaps = 2/70 (2%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           ++ FE++I +  K Y T EE   RF VF+DNLK I++ NK +  +   G+N  +DLT +E
Sbjct: 42  IELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNK-KVTSYWLGVNEFADLTHQE 100

Query: 91  MKSR-LGLNL 99
            K+  LGL +
Sbjct: 101 FKNMYLGLKV 110


>gi|355751954|gb|EHH56074.1| Cathepsin W [Macaca fascicularis]
          Length = 375

 Score = 48.1 bits (113), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 1/73 (1%)

Query: 25  TENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
           +  P  LK+ F+ F   F++SY + EE A R  +F  NL   + L + + GTA +G+   
Sbjct: 32  SPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPF 91

Query: 84  SDLTREEMKSRLG 96
           SDLT EE     G
Sbjct: 92  SDLTEEEFGQLYG 104


>gi|109105377|ref|XP_001112560.1| PREDICTED: cathepsin W-like isoform 2 [Macaca mulatta]
 gi|355566302|gb|EHH22681.1| Cathepsin W [Macaca mulatta]
          Length = 375

 Score = 48.1 bits (113), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 1/73 (1%)

Query: 25  TENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
           +  P  LK+ F+ F   F++SY + EE A R  +F  NL   + L + + GTA +G+   
Sbjct: 32  SPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPF 91

Query: 84  SDLTREEMKSRLG 96
           SDLT EE     G
Sbjct: 92  SDLTEEEFGQLYG 104


>gi|20129967|ref|NP_610907.1| CG6357 [Drosophila melanogaster]
 gi|7303269|gb|AAF58330.1| CG6357 [Drosophila melanogaster]
          Length = 439

 Score = 48.1 bits (113), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 42/70 (60%), Gaps = 3/70 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--GINHLSDLTREE 90
           ++KF+ DF   Y  ++E  KR  +F DN K I++ N+  E G  ++  GIN  SDLT EE
Sbjct: 347 WKKFLIDFGAKYQDEKETEKRRTIFCDNWKAIQEHNEQFELGVESFKKGINQWSDLTVEE 406

Query: 91  MKSRLGLNLS 100
            K++   NL+
Sbjct: 407 WKTKQRPNLA 416



 Score = 44.3 bits (103), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 3/79 (3%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTAT 77
           S +E+  +N      +EKF+ DF  SY    E  KR  VF DN K I   N + + G  +
Sbjct: 237 STSEIDNDNIICQPAWEKFLIDFKPSYQDDTETEKRRNVFCDNFKSIHKHNVQFDLGNIS 296

Query: 78  Y--GINHLSDLTREEMKSR 94
           +  GIN  SDLT EE K++
Sbjct: 297 FKKGINQWSDLTVEEWKNK 315



 Score = 38.1 bits (87), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 35/64 (54%), Gaps = 3/64 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
           +++F+ DF   Y    E  KR  +F +N + + D N K + G  ++  GIN  SDLT EE
Sbjct: 72  WQRFLVDFDVHYDNDYERQKRRDIFCENWQKVRDHNLKYDLGVVSFKKGINQWSDLTFEE 131

Query: 91  MKSR 94
            K +
Sbjct: 132 WKEK 135


>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
          Length = 707

 Score = 48.1 bits (113), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           + +FE ++    K Y + EE   RF VF +NL  I++ NK E  +   G+N  +DL+ EE
Sbjct: 401 IARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNK-EVSSYWLGLNEFADLSHEE 459

Query: 91  MKSR-LGL 97
            KS+ LGL
Sbjct: 460 FKSKYLGL 467


>gi|168047065|ref|XP_001775992.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672650|gb|EDQ59184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 336

 Score = 47.8 bits (112), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 38/64 (59%), Gaps = 2/64 (3%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           F  F   + K Y T EE+  RF  F +++KL+E  NKG+H + +  +N  +D+T EE +
Sbjct: 28 HFAGFAAKYKKEYKTVEELKHRFVTFLESVKLVETHNKGQH-SYSLAVNEFADMTFEEFR 86

Query: 93 -SRL 95
           SRL
Sbjct: 87 DSRL 90


>gi|426369199|ref|XP_004051582.1| PREDICTED: cathepsin W [Gorilla gorilla gorilla]
          Length = 376

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F++SY + EE A R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|23110964|ref|NP_001326.2| cathepsin W preproprotein [Homo sapiens]
 gi|29476894|gb|AAH48255.1| Cathepsin W [Homo sapiens]
 gi|119594870|gb|EAW74464.1| cathepsin W (lymphopain), isoform CRA_b [Homo sapiens]
          Length = 376

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F++SY + EE A R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|2582045|gb|AAB82449.1| lymphopain [Homo sapiens]
 gi|2582181|gb|AAB82457.1| lymphopain [Homo sapiens]
 gi|3033547|gb|AAC32181.1| cathepsin W [Homo sapiens]
          Length = 376

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F++SY + EE A R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|397516975|ref|XP_003828695.1| PREDICTED: cathepsin W [Pan paniscus]
          Length = 376

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F++SY + EE A R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|259016196|sp|P56202.2|CATW_HUMAN RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
           Precursor
          Length = 376

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F++SY + EE A R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|114638622|ref|XP_001170363.1| PREDICTED: cathepsin W [Pan troglodytes]
          Length = 376

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F++SY + EE A R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|294874412|ref|XP_002766943.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239868318|gb|EEQ99660.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 366

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 34/57 (59%), Gaps = 1/57 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          F  F   F K Y +KEE  KR A+F+ NL  IE +N  ++ + T G+N  +DLT EE
Sbjct: 28 FTDFQHKFGKKYESKEEEMKRNAIFQANLHHIEQVN-AQNLSYTLGVNEYADLTHEE 83


>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila]
 gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila SB210]
          Length = 330

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 22/57 (38%), Positives = 36/57 (63%), Gaps = 2/57 (3%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          F+KF + ++K Y ++E    R ++F++NL+ IE  NK +   A +GI   +DLT EE
Sbjct: 30 FKKFTQTYNKKYSSEEHYNARLSIFKENLRRIELFNKNDE--AQHGITQFADLTHEE 84


>gi|285002340|ref|YP_003422404.1| cathepsin [Pseudaletia unipuncta granulovirus]
 gi|197343600|gb|ACH69415.1| cathepsin [Pseudaletia unipuncta granulovirus]
          Length = 338

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 32/95 (33%), Positives = 49/95 (51%), Gaps = 5/95 (5%)

Query: 9   ATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDL 68
           AT  +   M +N +    N E L  F++F+  + K Y    E   RF VF+ NL +I + 
Sbjct: 15  ATTPIVSSM-NNLQYDLSNSEVL--FDEFVTKYGKVYANDAERKSRFDVFKANLAIINER 71

Query: 69  NKGEHGTATYGINHLSDLTREE-MKSRLGLNLSKH 102
           N  E  +AT+GIN  SDL+  E ++ + G   + H
Sbjct: 72  NAQEE-SATFGINFYSDLSSNELLRKQTGFKTALH 105


>gi|13124026|sp|Q9WGE0.1|CATV_NPVHC RecName: Full=Viral cathepsin; Short=V-cath; AltName:
          Full=Cysteine proteinase; Short=CP; Flags: Precursor
 gi|4884631|gb|AAD31760.1|AF120926_1 cysteine proteinase [Hyphantria cunea nucleopolyhedrovirus]
          Length = 324

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 24/61 (39%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE F+  F+K Y ++ E  +RF +F+ NL+ I   N+ +  TA Y IN  SDL+++E  S
Sbjct: 28 FEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQND-TTAQYEINKFSDLSKDETIS 86

Query: 94 R 94
          +
Sbjct: 87 K 87


>gi|86355549|ref|YP_473217.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
 gi|86198154|dbj|BAE72318.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
          Length = 324

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 24/61 (39%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE F+  F+K Y ++ E  +RF +F+ NL+ I   N+ +  TA Y IN  SDL+++E  S
Sbjct: 28 FEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQND-TTAQYEINKFSDLSKDETIS 86

Query: 94 R 94
          +
Sbjct: 87 K 87


>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 342

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 28/82 (34%), Positives = 42/82 (51%), Gaps = 4/82 (4%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           Q+KS    K  +    ++ E+++  + K Y    E  KRF +FE+N++ IE  N   +  
Sbjct: 23  QVKSR---KLHDASMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNKP 79

Query: 76  ATYGINHLSDLTREE-MKSRLG 96
               INHL+D T EE M S  G
Sbjct: 80  YKLSINHLADQTNEEFMASHKG 101


>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
          Length = 437

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 50/89 (56%), Gaps = 1/89 (1%)

Query: 6   SAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
           +++ ++  + Q  +N+ ++T++ E +  +  ++    KSY    E   RF +F+DNL+ I
Sbjct: 22  ASDMSIINYDQTHTNSLIRTDD-EVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYI 80

Query: 66  EDLNKGEHGTATYGINHLSDLTREEMKSR 94
           ++ N     +   G+N  +DLT EE +++
Sbjct: 81  DNHNADPDRSYELGLNRFADLTNEEYRAK 109


>gi|118387041|ref|XP_001026637.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila]
 gi|89308404|gb|EAS06392.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila SB210]
          Length = 335

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 1/62 (1%)

Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
          +  + F  F ++F+K Y ++E    RF VF +NLK IE LNK E  +A + +   SD T+
Sbjct: 34 QQQQSFLDFKKNFAKKYHSQEHEQYRFNVFLENLKEIERLNK-EITSAKFAVTQFSDYTK 92

Query: 89 EE 90
          EE
Sbjct: 93 EE 94


>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
          Length = 340

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 1/76 (1%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
           EN   L++ E+++    + Y    E A RF +F  N++ IE  N   H     G+N  +D
Sbjct: 33  ENKSMLERHEQWMAQHGRVYKNAAEKAHRFEIFRANVERIESFNAENH-KFKLGVNQFAD 91

Query: 86  LTREEMKSRLGLNLSK 101
           LT EE K+R  L  SK
Sbjct: 92  LTNEEFKTRNTLKPSK 107


>gi|155970232|gb|ABU41785.1| cysteine protease [Rosa x borboniana]
          Length = 357

 Score = 47.8 bits (112), Expect = 9e-04,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H++ F +F   + K Y + EE+ +RF +F +N KLI   N+ +  +   G+N  +D T E
Sbjct: 54  HVRSFARFAYRYEKRYESVEEMGRRFEIFAENKKLIRSTNR-KGLSYKLGVNRFADWTWE 112

Query: 90  EM-KSRLG 96
           E  + RLG
Sbjct: 113 EFQRHRLG 120


>gi|357631369|gb|EHJ78914.1| cysteine protease [Danaus plexippus]
          Length = 329

 Score = 47.8 bits (112), Expect = 9e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 46/70 (65%), Gaps = 4/70 (5%)

Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
          K+ + E L  F +++  ++K Y  ++E  +RF +F++NL+ I +LN+  + T  YGINHL
Sbjct: 30 KSTDAEDL--FIEYVHKYNKRY-NEDEYDRRFQIFKENLENINELNRKSNLT-VYGINHL 85

Query: 84 SDLTREEMKS 93
          +DL  EE+ S
Sbjct: 86 TDLKYEEVAS 95


>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
 gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
          Length = 371

 Score = 47.8 bits (112), Expect = 9e-04,   Method: Composition-based stats.
 Identities = 28/68 (41%), Positives = 43/68 (63%), Gaps = 2/68 (2%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +K FE+++  + K+Y + EE   RF VF+DNL  I++ NK +  T   G+N  +DLT +E
Sbjct: 63  IKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANK-KVTTYWLGLNAFADLTHDE 121

Query: 91  MKSR-LGL 97
            K+  LGL
Sbjct: 122 FKATYLGL 129


>gi|167534377|ref|XP_001748864.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772544|gb|EDQ86194.1| predicted protein [Monosiga brevicollis MX1]
          Length = 340

 Score = 47.8 bits (112), Expect = 9e-04,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 34/67 (50%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE +  ++ K+Y T  E   R  VFE NL  I   N     T   G+NH+SD T EE + 
Sbjct: 32  FEHYKAEYKKAYATTTEHEYRRQVFEQNLAKIRAHNADTTKTWKEGVNHMSDWTSEEFRR 91

Query: 94  RLGLNLS 100
            LG + S
Sbjct: 92  LLGYDQS 98


>gi|326526731|dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 341

 Score = 47.8 bits (112), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/64 (34%), Positives = 36/64 (56%), Gaps = 2/64 (3%)

Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
          +  +F++F   FSK+Y + EE   R+A F DNL+ +  LN  + G   +G+    D+T  
Sbjct: 28 NFAKFQEFTARFSKNYKSVEEYTTRYATFLDNLERVAKLN--QDGRGVFGVTKFMDMTPA 85

Query: 90 EMKS 93
          E K+
Sbjct: 86 EFKA 89


>gi|156089449|ref|XP_001612131.1| papain family cysteine protease containing protein [Babesia bovis]
 gi|154799385|gb|EDO08563.1| papain family cysteine protease containing protein [Babesia bovis]
          Length = 435

 Score = 47.8 bits (112), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/69 (37%), Positives = 34/69 (49%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           QF  F RDF +   +  E  +RFA F  N+  I + N   H T T  IN  +D+T E+  
Sbjct: 119 QFNDFNRDFKRHDNSISEKIERFATFYRNVTRIREFNMNVHKTYTMKINQFADMTPEQFM 178

Query: 93  SRLGLNLSK 101
           S  G   SK
Sbjct: 179 SLQGTRASK 187


>gi|294874400|ref|XP_002766937.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239868312|gb|EEQ99654.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 347

 Score = 47.8 bits (112), Expect = 0.001,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 34/57 (59%), Gaps = 1/57 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          F  F   F K Y +KEE  KR A+F+ NL  IE +N  ++ + T G+N  +DLT EE
Sbjct: 28 FTDFQHKFGKKYESKEEEMKRNAIFQANLHHIEQVN-AQNLSYTLGVNEYADLTHEE 83


>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
 gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
          Length = 349

 Score = 47.8 bits (112), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/62 (35%), Positives = 38/62 (61%), Gaps = 1/62 (1%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          L +F+ +  +++++Y T EE  +RF V+ +N+K IE +N+    +   G N  +DLT EE
Sbjct: 34 LDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQ-PGSSYELGENQFADLTEEE 92

Query: 91 MK 92
           K
Sbjct: 93 FK 94


>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
          Length = 366

 Score = 47.8 bits (112), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
          T+N E +  +E+++    K Y    E  KRF VF+DNL  I++ N  ++ T   G+N  +
Sbjct: 32 TDN-EVMTMYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFA 90

Query: 85 DLTREEMK 92
          D+T EE +
Sbjct: 91 DMTNEEYR 98


>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
 gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
          Length = 328

 Score = 47.8 bits (112), Expect = 0.001,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 35/59 (59%)

Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          E+++  +S+ Y    E A+RF VF+ N+K IE  N G +     G+N  +DLT +E ++
Sbjct: 38 EQWMVQYSRVYKDTTEKARRFEVFKANVKFIESFNAGGNRKFWLGVNQFADLTNDEFRA 96


>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
 gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
          Length = 356

 Score = 47.8 bits (112), Expect = 0.001,   Method: Composition-based stats.
 Identities = 21/73 (28%), Positives = 44/73 (60%), Gaps = 1/73 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA-TYGINHLSDLTRE 89
           L++F+ +  +++++Y T EE  +RF ++ +N++ I+ +N+   G++   G N  +DLT E
Sbjct: 35  LERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEE 94

Query: 90  EMKSRLGLNLSKH 102
           E K    + L + 
Sbjct: 95  EFKDTYLMKLDEQ 107


>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
          Length = 322

 Score = 47.8 bits (112), Expect = 0.001,   Method: Composition-based stats.
 Identities = 29/74 (39%), Positives = 41/74 (55%), Gaps = 3/74 (4%)

Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSD 85
          +H   FE F  +  KSY  + E  +RF +F  N+  IE  N   E G  +Y   IN  +D
Sbjct: 21 KHQALFETFKVENGKSYRNQVEEVQRFNIFRANVLEIEQHNALYEQGLVSYKKAINQFTD 80

Query: 86 LTREEMKSRLGLNL 99
          LT+EE K+ LGL++
Sbjct: 81 LTQEEFKAYLGLHV 94


>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
          Length = 331

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/62 (41%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
          EH +Q+   +  FS+ Y  + E   RF VF+ NLK IE  NK    T   G+N  +D TR
Sbjct: 21 EHHQQW---MTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTR 77

Query: 89 EE 90
          EE
Sbjct: 78 EE 79


>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 355

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/62 (41%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           EH +Q+   +  FS+ Y  + E   RF VF+ NLK IE  NK    T   G+N  +D TR
Sbjct: 45  EHHQQW---MTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTR 101

Query: 89  EE 90
           EE
Sbjct: 102 EE 103


>gi|443691408|gb|ELT93269.1| hypothetical protein CAPTEDRAFT_181131 [Capitella teleta]
          Length = 541

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 2/86 (2%)

Query: 16  QMKSNNELKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
           Q+   +E   +N EH+   F+ + +D+SK Y    E A R  VF+ NL+ IE  N+    
Sbjct: 222 QVNPMHEYIHDNDEHIHGMFDGYKKDYSKDYKDDFEHASRLHVFKHNLRYIESQNR-RGL 280

Query: 75  TATYGINHLSDLTREEMKSRLGLNLS 100
           T T  +NHL+D    E+ S  G + S
Sbjct: 281 TYTLAMNHLADRKDRELVSLRGFHRS 306


>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 288

 Score = 47.4 bits (111), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 42/78 (53%), Gaps = 9/78 (11%)

Query: 28  PEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
           PEHL         FE ++ + SK+Y + EE   RF VF +NL  I+  N  E  +   G+
Sbjct: 38  PEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNN-EINSYWLGL 96

Query: 81  NHLSDLTREEMKSR-LGL 97
           N  +DLT EE K R LGL
Sbjct: 97  NEFADLTHEEFKGRYLGL 114


>gi|395852405|ref|XP_003798729.1| PREDICTED: cathepsin W [Otolemur garnettii]
          Length = 367

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/70 (37%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F++SY    E ++R  +F  NL   + L + + GTA +G+  LSDL
Sbjct: 35  PLELKEVFKLFQVQFNRSYSNPAEHSRRLDIFAHNLAKAQQLQEEDLGTAEFGMTSLSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGKIFG 104


>gi|407394331|gb|EKF26898.1| cysteine proteinase, putative [Trypanosoma cruzi marinkellei]
          Length = 392

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/63 (38%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F++F++++ K Y  KE V +R A+FE  L  +   N+  +     GINH+SD T EE  S
Sbjct: 55  FDRFLQEYGKKYDAKEYVRRR-AIFEQTLARVRTHNEAGNHLYVMGINHMSDWTPEEFTS 113

Query: 94  RLG 96
             G
Sbjct: 114 LNG 116


>gi|2511695|emb|CAB17077.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 377

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 6/77 (7%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           +++ N  L+TE     K+F  F+ ++ K Y T+EE  +R  +F  N+ L    N+    T
Sbjct: 40  KLQDNQLLRTE-----KKFNVFMENYGKKYSTREEYLQRLEIFAGNM-LRAPENQALDPT 93

Query: 76  ATYGINHLSDLTREEMK 92
           A +G+   SDLT +E +
Sbjct: 94  AIHGVTQFSDLTEDEFQ 110


>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
 gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
          Length = 356

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/68 (38%), Positives = 39/68 (57%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H  +F +F   + K Y T EE+  RF +F ++L+LI+  NK +  +   G+N  +D T E
Sbjct: 53  HALRFARFAHRYGKKYETAEEMKLRFGIFLESLELIKSTNK-QGLSYKLGVNQFADWTWE 111

Query: 90  EM-KSRLG 96
           E  K RLG
Sbjct: 112 EFRKHRLG 119


>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 467

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/60 (38%), Positives = 36/60 (60%), Gaps = 1/60 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           +E ++    KSY    E  +RF +F+DNL+ I++ N  E+ T   G+N  +DLT EE +S
Sbjct: 51  YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHN-AENRTYKVGLNRFADLTNEEYRS 109


>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 364

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/69 (37%), Positives = 42/69 (60%), Gaps = 2/69 (2%)

Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
          +EN E +  +E+++    K Y   +E  KRF VF+DNL  I+D N  ++ T T G+N  +
Sbjct: 28 SEN-EVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQDHN-AQNNTYTLGLNKFA 85

Query: 85 DLTREEMKS 93
          D+T EE ++
Sbjct: 86 DITNEEYRA 94


>gi|297793593|ref|XP_002864681.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310516|gb|EFH40940.1| hypothetical protein ARALYDRAFT_496172 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y   EE+  RF++F++NL LI   NK +  +   G+N  +DLT +
Sbjct: 55  HVLTFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADLTWQ 113

Query: 90  EM-KSRLG 96
           E  +++LG
Sbjct: 114 EFQRTKLG 121


>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
          Length = 469

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/60 (38%), Positives = 36/60 (60%), Gaps = 1/60 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           +E ++    KSY    E  +RF +F+DNL+ I++ N  E+ T   G+N  +DLT EE +S
Sbjct: 53  YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHN-AENRTYKVGLNRFADLTNEEYRS 111


>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
          Length = 382

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 21/73 (28%), Positives = 44/73 (60%), Gaps = 1/73 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA-TYGINHLSDLTRE 89
           L++F+ +  +++++Y T EE  +RF ++ +N++ I+ +N+   G++   G N  +DLT E
Sbjct: 61  LERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEE 120

Query: 90  EMKSRLGLNLSKH 102
           E K    + L + 
Sbjct: 121 EFKDTYLMKLDEQ 133


>gi|345314917|ref|XP_003429566.1| PREDICTED: cathepsin F-like, partial [Ornithorhynchus anatinus]
          Length = 219

 Score = 47.4 bits (111), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 37/65 (56%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E +  F++F+  +S+SY    E  +R  +F  NL+    + + + G+A YG+   SDLT 
Sbjct: 54  EVISLFKEFLTTYSRSYANATETQRRLGIFAHNLERARRIQELDQGSARYGVTKFSDLTE 113

Query: 89  EEMKS 93
           EE ++
Sbjct: 114 EEFRT 118


>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
 gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
          Length = 328

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 35/59 (59%)

Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          E+++  +S+ Y    E A+RF VF+ N+K IE  N G +     G+N  +DLT +E ++
Sbjct: 38 EQWMVQYSRVYKDTTEKARRFEVFKANVKFIESFNAGGNRKFWLGVNQFADLTNDEFRA 96


>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
 gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/60 (36%), Positives = 36/60 (60%), Gaps = 1/60 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA-TYGINHLSDLTREE 90
          ++ E+++  + K Y   +E  KRF +F +N+K IE  N G++  +   GIN  +DLT EE
Sbjct: 37 ERHERWMNHYGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLGINQFADLTNEE 96


>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
 gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
          Length = 349

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/62 (35%), Positives = 38/62 (61%), Gaps = 1/62 (1%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          L +F+ +  +++++Y T EE  +RF V+ +N+K IE +N+    +   G N  +DLT EE
Sbjct: 34 LDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQ-PGSSYELGENRFADLTEEE 92

Query: 91 MK 92
           K
Sbjct: 93 FK 94


>gi|354496134|ref|XP_003510182.1| PREDICTED: cathepsin F [Cricetulus griseus]
 gi|344250261|gb|EGW06365.1| Cathepsin F [Cricetulus griseus]
          Length = 462

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 21/57 (36%), Positives = 33/57 (57%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           F+ F+  ++++Y ++EE   R  VF  N+   + +   + GTA YGI   SDLT EE
Sbjct: 165 FKDFMITYNRTYESREETQWRLTVFTRNMVKAQKIEALDRGTAQYGITKFSDLTEEE 221


>gi|294883334|ref|XP_002770714.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239873999|gb|EER02719.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 330

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 39/64 (60%), Gaps = 2/64 (3%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F   F K+Y +KEE  KR A+F+ NL LIE +N  ++ +   G+N  +DLT EE  +
Sbjct: 28 FMGFQHKFGKNYESKEEEVKRNAIFQANLHLIEQVN-AKNLSYKLGVNEYADLTHEEFAA 86

Query: 94 -RLG 96
           +LG
Sbjct: 87 LKLG 90


>gi|189236657|ref|XP_970512.2| PREDICTED: similar to cathepsin o [Tribolium castaneum]
          Length = 329

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 48/82 (58%), Gaps = 3/82 (3%)

Query: 23  LKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATYGI 80
           ++ + P+  + QF+++++ F+K+Y        R   F+ +L+ IE LN K  +G+A YG+
Sbjct: 23  IRIKGPDQAESQFQEYLKRFNKTYDDPSVYQNRLHAFKQSLQTIETLNSKKRNGSALYGL 82

Query: 81  NHLSDLTREE-MKSRLGLNLSK 101
              SDL  EE  ++ L  NLS+
Sbjct: 83  TKFSDLLPEEFFQTYLQSNLSQ 104


>gi|270006364|gb|EFA02812.1| cathepsin O precursor [Tribolium castaneum]
          Length = 326

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 48/82 (58%), Gaps = 3/82 (3%)

Query: 23  LKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATYGI 80
           ++ + P+  + QF+++++ F+K+Y        R   F+ +L+ IE LN K  +G+A YG+
Sbjct: 23  IRIKGPDQAESQFQEYLKRFNKTYDDPSVYQNRLHAFKQSLQTIETLNSKKRNGSALYGL 82

Query: 81  NHLSDLTREE-MKSRLGLNLSK 101
              SDL  EE  ++ L  NLS+
Sbjct: 83  TKFSDLLPEEFFQTYLQSNLSQ 104


>gi|113931178|ref|NP_001039033.1| cathepsin W [Xenopus (Silurana) tropicalis]
 gi|89269052|emb|CAJ83515.1| cathepsin W [Xenopus (Silurana) tropicalis]
          Length = 303

 Score = 47.4 bits (111), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 31/51 (60%)

Query: 41 FSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          +++SY T+EE   R  +F +NLK    L + E GTA YG+   SDLT EE 
Sbjct: 4  YNRSYKTREEFKYRLRIFSENLKEASRLQREELGTAQYGVTKFSDLTDEEF 54


>gi|225444726|ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
 gi|147826441|emb|CAN62278.1| hypothetical protein VITISV_031382 [Vitis vinifera]
 gi|297738562|emb|CBI27807.3| unnamed protein product [Vitis vinifera]
          Length = 362

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/71 (36%), Positives = 37/71 (52%), Gaps = 2/71 (2%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           +  H   F  F   + KSY T +E+  RF +F +NLKLI   N+ +    T  +N  +D 
Sbjct: 56  DTRHAHSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNR-KGLPYTLAVNQFADW 114

Query: 87  TREEMKS-RLG 96
           T EE +  RLG
Sbjct: 115 TWEEFRRHRLG 125


>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
           Precursor
 gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
 gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
 gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 355

 Score = 47.4 bits (111), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 42/78 (53%), Gaps = 9/78 (11%)

Query: 28  PEHLKQ-------FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
           PEHL         FE ++ + SK+Y + EE   RF VF +NL  I+  N  E  +   G+
Sbjct: 38  PEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNN-EINSYWLGL 96

Query: 81  NHLSDLTREEMKSR-LGL 97
           N  +DLT EE K R LGL
Sbjct: 97  NEFADLTHEEFKGRYLGL 114


>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
          Length = 374

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/71 (30%), Positives = 41/71 (57%)

Query: 23  LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
           L+++  +   ++E ++ +  ++Y    E  KRF +F+DNL+ IE  N   + T   G+N 
Sbjct: 39  LQSDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQ 98

Query: 83  LSDLTREEMKS 93
            +DLT EE ++
Sbjct: 99  FADLTNEEYRT 109


>gi|359484377|ref|XP_003633102.1| PREDICTED: thiol protease aleurain-like isoform 2 [Vitis vinifera]
          Length = 318

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/71 (36%), Positives = 37/71 (52%), Gaps = 2/71 (2%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           +  H   F  F   + KSY T +E+  RF +F +NLKLI   N+ +    T  +N  +D 
Sbjct: 56  DTRHAHSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIRSTNR-KGLPYTLAVNQFADW 114

Query: 87  TREEMKS-RLG 96
           T EE +  RLG
Sbjct: 115 TWEEFRRHRLG 125


>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
 gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
          Length = 299

 Score = 47.0 bits (110), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 24  KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
           K  N E L  +E+++    KSY    E  KRF +F+DNLK I++ N G + T   G+   
Sbjct: 45  KRTNKEVLTMYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHN-GLNSTYRLGLTRF 103

Query: 84  SDLTREEMKSRL 95
           +DLT EE +S+ 
Sbjct: 104 ADLTNEEYRSKF 115


>gi|82705269|ref|XP_726900.1| berghepain-2 [Plasmodium yoelii yoelii 17XNL]
 gi|23482498|gb|EAA18465.1| berghepain-2 [Plasmodium yoelii yoelii]
          Length = 472

 Score = 47.0 bits (110), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  F  F++ ++K Y + EE+ +RF +F + LK IE  NK  H   T GIN  SD+
Sbjct: 149 NLESVNLFYSFMKKYNKEYSSAEEMQERFYIFSEKLKKIEKHNKENH-LYTKGINAFSDM 207

Query: 87  TREEMKSR 94
             EE K +
Sbjct: 208 RHEEFKMK 215


>gi|344295866|ref|XP_003419631.1| PREDICTED: cathepsin W-like [Loxodonta africana]
          Length = 376

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F  F   +++SY    E A+R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35  PLELKEVFALFQLQYNRSYSNPAEHARRLDIFARNLAQAQQLQEEDLGTAKFGVTPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE +   G
Sbjct: 95  TEEEFRQVYG 104


>gi|294874404|ref|XP_002766939.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
 gi|239868314|gb|EEQ99656.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
          Length = 339

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 34/57 (59%), Gaps = 1/57 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          F  F   F K Y +KEE  KR A+F+ NL  IE +N  ++ + T G+N  +DLT EE
Sbjct: 28 FTDFQHKFGKKYESKEEEMKRNAIFQANLHHIEQVN-AQNLSYTLGVNEYADLTHEE 83


>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
 gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
          Length = 321

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 19/60 (31%), Positives = 38/60 (63%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++ E+++    ++Y   EE  +RF +F+ NL+ I++ NK  + T   G+N+ +DL+ EE
Sbjct: 36 VEKHEQWMARHGRTYQDSEEKERRFQIFKSNLEYIDNFNKASNQTYQLGLNNFADLSHEE 95


>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
          max]
          Length = 337

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 34/59 (57%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E++++ + K Y    E  KR  +F+DN++ IE  N   +     GINHL+D T EE
Sbjct: 36 ERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLGINHLADQTNEE 94


>gi|334265690|ref|YP_004376219.1| cathepsin [Clostera anachoreta granulovirus]
 gi|315451014|gb|ADU24593.1| cathepsin [Clostera anachoreta granulovirus]
 gi|327553705|gb|AEB00299.1| cathepsin [Clostera anachoreta granulovirus]
          Length = 332

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/69 (37%), Positives = 43/69 (62%), Gaps = 3/69 (4%)

Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          +N E L  FE+F+ +F+K+Y +++E   R+ +F+ NL LI + N  E   AT+ IN  SD
Sbjct: 23 DNSETL--FEEFVTNFNKTYSSQDEKLIRYEIFKKNLALINNKNM-ESKHATFDINIYSD 79

Query: 86 LTREEMKSR 94
          L + ++  R
Sbjct: 80 LHKNDLLHR 88


>gi|281209544|gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum PN500]
          Length = 465

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 27/87 (31%), Positives = 47/87 (54%), Gaps = 8/87 (9%)

Query: 10 TLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN 69
          T+ L   M +  +L  E      QF +F   ++K Y T  E A+RFA F+ NLK+I++ N
Sbjct: 8  TVLLLVSMAAAKKLSLEE----TQFRQFQIKYNKQY-TSSEYAERFATFKSNLKVIDEKN 62

Query: 70 K---GEHGTATYGINHLSDLTREEMKS 93
          +       +  +G+N  +DL++ E ++
Sbjct: 63 RDAASRKSSVRFGVNEFADLSQSEFRA 89


>gi|375073980|gb|AFA34857.1| cathepsin L-like protein [Trypanosoma cruzi marinkellei]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYKSAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/89 (26%), Positives = 48/89 (53%), Gaps = 1/89 (1%)

Query: 5   ASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKL 64
           ++ + ++  + Q  ++      + E +  +E ++    K+Y    E  KRF +F+DNL+ 
Sbjct: 20  SAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKEKRFGIFKDNLRF 79

Query: 65  IEDLNKGEHGTATYGINHLSDLTREEMKS 93
           I++ N  ++ T   G+N  +DLT EE +S
Sbjct: 80  IDEHNS-QNLTYRLGLNRFADLTNEEYRS 107


>gi|194749983|ref|XP_001957411.1| GF24054 [Drosophila ananassae]
 gi|190624693|gb|EDV40217.1| GF24054 [Drosophila ananassae]
          Length = 549

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 28/84 (33%), Positives = 40/84 (47%), Gaps = 2/84 (2%)

Query: 14  FGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
           F       E  +   EH+ K F  F R    SY   +E   R  +F  NL+ I   N+ +
Sbjct: 224 FATFNPMQEFVSGVDEHVEKAFHHFKRKHGVSYNNDKEHEHRLNIFRQNLRYIHSKNRAK 283

Query: 73  HGTATYGINHLSDLTREEMKSRLG 96
             T T  +NHL+D T +E+K+R G
Sbjct: 284 L-TYTLAVNHLADKTEDELKARRG 306


>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
          Length = 380

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/67 (32%), Positives = 39/67 (58%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  +E ++  + KSY +  E   R  +F++NL+ I++ N   + + T G+N  +DL
Sbjct: 35  NDEVMALYESWLVKYGKSYNSLGEREMRIEIFKENLRFIDEHNADPNRSYTVGLNQFADL 94

Query: 87  TREEMKS 93
           T EE +S
Sbjct: 95  TDEEYRS 101


>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 341

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/63 (36%), Positives = 36/63 (57%)

Query: 28 PEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
          P  L++ E+++  FS+ Y  + E   R  VF+ NLK IE+ NK  + +   G+N  +D T
Sbjct: 33 PSSLEKHEQWMARFSRVYRDELEKQMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWT 92

Query: 88 REE 90
           EE
Sbjct: 93 NEE 95


>gi|11464864|gb|AAG35357.1|AF314929_1 cruzipain [Trypanosoma cruzi]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|8468605|gb|AAF75546.1| cruzipain [Trypanosoma cruzi]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|11464866|gb|AAG35358.1|AF314930_1 cruzipain [Trypanosoma cruzi]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|118157|sp|P25779.1|CYSP_TRYCR RecName: Full=Cruzipain; AltName: Full=Cruzaine; AltName:
          Full=Major cysteine proteinase; Flags: Precursor
 gi|162048|gb|AAA30181.1| cruzain [Trypanosoma cruzi]
 gi|29409382|gb|AAM33131.1| cysteine proteinase precursor [Trypanosoma cruzi]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|375073978|gb|AFA34856.1| cathepsin L-like protein [Trypanosoma cruzi]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 355

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 22  ELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
           E  T   + L+ FE ++ + SK Y + EE   RF VF +NL  I+  N  E  +   G+N
Sbjct: 39  EQLTSTEKLLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNN-EINSYWLGLN 97

Query: 82  HLSDLTREEMKSR-LGL 97
             +DLT EE K R LGL
Sbjct: 98  EFADLTHEEFKGRYLGL 114


>gi|294885989|ref|XP_002771502.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
 gi|239875206|gb|EER03318.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
          Length = 337

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 1/60 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F +   KSY  KEE  KR A+F DNL  IE++N  ++ +   G+N  +DLT EE  +
Sbjct: 27 FIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVN-AQNLSYKLGVNEYTDLTLEEFAA 85


>gi|19747207|gb|AAL96762.1|AC104496_8 Tcc1l8.8 [Trypanosoma cruzi]
          Length = 500

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 70  QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 128

Query: 93  SR 94
           SR
Sbjct: 129 SR 130


>gi|71666430|ref|XP_820174.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70885508|gb|EAN98323.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|71663165|ref|XP_818579.1| cruzipain precursor [Trypanosoma cruzi strain CL Brener]
 gi|70883838|gb|EAN96728.1| cruzipain precursor, putative [Trypanosoma cruzi]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|71663163|ref|XP_818578.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70883837|gb|EAN96727.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 467

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|71406896|ref|XP_805951.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70869552|gb|EAN84100.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 426

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|71660475|ref|XP_821954.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|3063559|gb|AAC14094.1| TcC31.13 [Trypanosoma cruzi]
 gi|70887345|gb|EAO00103.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 322

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 70  QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 128

Query: 93  SR 94
           SR
Sbjct: 129 SR 130


>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
 gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
 gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
 gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 37/62 (59%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          ++ E+++ +  K Y    E  KRF +F+DN++ IE  N  ++      +NHL+DLT +E 
Sbjct: 38 ERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKLSVNHLADLTLDEF 97

Query: 92 KS 93
          K+
Sbjct: 98 KA 99


>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/66 (36%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
          FE + +   K+Y ++EE   R  VF+DN   + + N   + + T  +N  +DLT  E K 
Sbjct: 30 FETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEFKA 89

Query: 93 SRLGLN 98
          SRLGL+
Sbjct: 90 SRLGLS 95


>gi|77379397|gb|ABA71355.1| cysteine protease [Brassica napus]
          Length = 359

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y   EE+  RF++F++NL LI   NK +  +   G+N  +D+T +
Sbjct: 56  HVLSFARFTHRYGKRYENAEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFTDMTWQ 114

Query: 90  EM-KSRLG 96
           E  +++LG
Sbjct: 115 EFQRTKLG 122


>gi|145508365|ref|XP_001440132.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407338|emb|CAK72735.1| unnamed protein product [Paramecium tetraurelia]
          Length = 321

 Score = 47.0 bits (110), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/85 (28%), Positives = 47/85 (55%), Gaps = 6/85 (7%)

Query: 6  SAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
          SA    + F +   +++ K      +KQ++++ + ++K YPT+ E   RF++++ N+  I
Sbjct: 15 SAGVYFSKFYEQNDHDQFKI-----IKQYQEWQQKYNKRYPTQNEQIYRFSIYQQNIMKI 69

Query: 66 EDLNKGEHGTATYGINHLSDLTREE 90
          ED N  ++ +    IN   DLT +E
Sbjct: 70 EDFNS-QNNSYKQKINKFGDLTDQE 93


>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
          Length = 324

 Score = 47.0 bits (110), Expect = 0.002,   Method: Composition-based stats.
 Identities = 18/64 (28%), Positives = 37/64 (57%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++FE+++ ++ + Y    E  +RF +F++N+  IE  N     + T G+N  +D+T  E
Sbjct: 7  MERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTDMTNNE 66

Query: 91 MKSR 94
            +R
Sbjct: 67 FLAR 70


>gi|145334857|ref|NP_001078774.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|332009932|gb|AED97315.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 361

 Score = 47.0 bits (110), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y   EE+  RF++F++NL LI   NK +  +   G+N  +DLT +
Sbjct: 55  HVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADLTWQ 113

Query: 90  EM-KSRLG 96
           E  +++LG
Sbjct: 114 EFQRTKLG 121


>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
 gi|1582620|prf||2119193A cathepsin L-related Cys protease
          Length = 324

 Score = 47.0 bits (110), Expect = 0.002,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 38/76 (50%), Gaps = 7/76 (9%)

Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--G 79
          L   NP     +E+F   F + Y   EE   R  VF DNL+ IE+ NK  E G  TY   
Sbjct: 13 LAAANP----SWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLA 68

Query: 80 INHLSDLTREEMKSRL 95
          IN  SDLT +E  S +
Sbjct: 69 INQFSDLTNDEFNSMM 84


>gi|23397070|gb|AAN31820.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
          Length = 358

 Score = 47.0 bits (110), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y   EE+  RF++F++NL LI   NK +  +   G+N  +DLT +
Sbjct: 55  HVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADLTWQ 113

Query: 90  EM-KSRLG 96
           E  +++LG
Sbjct: 114 EFQRTKLG 121


>gi|1136308|gb|AAB41119.1| cruzipain [Trypanosoma cruzi]
          Length = 467

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTAFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|334311632|ref|XP_001373241.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 328

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 29/81 (35%), Positives = 47/81 (58%), Gaps = 4/81 (4%)

Query: 23  LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--G 79
           L  +N +   ++E +   + K+Y  KEE  +R  V+E NLKLI D N+  + G  +Y  G
Sbjct: 18  LSPKNEKLDAEWEAWKTTYGKNYSEKEESFRR-QVWEKNLKLINDHNRLFKEGKKSYFMG 76

Query: 80  INHLSDLTREEMKSRLGLNLS 100
           +N   D+T +E +SRL L ++
Sbjct: 77  MNQFGDMTDKEFESRLNLRIA 97


>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/74 (33%), Positives = 43/74 (58%), Gaps = 2/74 (2%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           + E +  +EK++    K+Y    E  +RF +F+DNL+ +++ N    G+   G+N  +DL
Sbjct: 40  DAEAMAIYEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHN-AVAGSYRVGLNRFADL 98

Query: 87  TREEMKSR-LGLNL 99
           T EE +S  LG N+
Sbjct: 99  TNEEYRSMFLGGNM 112


>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
          Length = 380

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/75 (29%), Positives = 41/75 (54%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           + N  +  N E    +E ++  + KSY +  E  +RF +F++ L+ I++ N   + +   
Sbjct: 27  TKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86

Query: 79  GINHLSDLTREEMKS 93
           G+N  +DLT EE +S
Sbjct: 87  GLNQFADLTDEEFRS 101


>gi|23577865|ref|NP_703114.1| viral cathepsin [Rachiplusia ou MNPV]
 gi|37077115|sp|Q8B9D5.1|CATV_NPVR1 RecName: Full=Viral cathepsin; Short=V-cath; AltName:
          Full=Cysteine proteinase; Short=CP; Flags: Precursor
 gi|23476510|gb|AAN28057.1| viral cathepsin [Rachiplusia ou MNPV]
          Length = 323

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/67 (37%), Positives = 43/67 (64%), Gaps = 3/67 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K Y ++ E  +RF +F+ NL   E + K ++ +A Y IN  SDL+++E  +
Sbjct: 28 FEEFVHRFNKDYGSEVEKLRRFKIFQHNLN--EIIIKNQNDSAKYEINKFSDLSKDETIA 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>gi|375073982|gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii]
          Length = 467

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/62 (40%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF  F + + + Y +  E A R +VF  NL L   L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFADFKQRYGRVYKSAAEEAFRLSVFRKNL-LDAKLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|242044818|ref|XP_002460280.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
 gi|241923657|gb|EER96801.1| hypothetical protein SORBIDRAFT_02g025920 [Sorghum bicolor]
          Length = 363

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/65 (36%), Positives = 40/65 (61%), Gaps = 2/65 (3%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F +F   + KSY +  EV KRF +F ++L+L+   N+ +  +   GIN  SD++ EE +
Sbjct: 61  RFARFAVRYGKSYESAAEVQKRFRIFSESLQLVRSTNR-KGLSYRLGINRFSDMSWEEFR 119

Query: 93  -SRLG 96
            +RLG
Sbjct: 120 ATRLG 124


>gi|410493601|ref|YP_006908539.1| V-CATH [Epinotia aporema granulovirus]
 gi|354805035|gb|AER41457.1| V-CATH [Epinotia aporema granulovirus]
          Length = 329

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 26/71 (36%), Positives = 41/71 (57%), Gaps = 3/71 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F+ F+  ++K Y T EE A ++ +F +NL +I + N  +   A Y IN LSDL + E+  
Sbjct: 28  FDDFVIKYNKVYATDEERAAKYEIFRNNLVVINEKNS-KTTNALYDINRLSDLNKNELLR 86

Query: 94  RLG--LNLSKH 102
             G  +NL K+
Sbjct: 87  STGFSVNLKKN 97


>gi|118485796|gb|ABK94746.1| unknown [Populus trichocarpa]
          Length = 367

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 4/86 (4%)

Query: 13  LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
           L  Q+ S+ E    N EH   F  F   F K+Y T+EE   RF VF+ NL+  +  ++  
Sbjct: 32  LIRQVVSDGEDDLLNAEH--HFTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKK-HQMI 88

Query: 73  HGTATYGINHLSDLTREEMKSR-LGL 97
             TA +GI   SDLT +E + + LGL
Sbjct: 89  DPTAAHGITKFSDLTPKEFRRQFLGL 114


>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
 gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
          Length = 340

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 26/84 (30%), Positives = 47/84 (55%), Gaps = 2/84 (2%)

Query: 11 LALF-GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN 69
          LALF G   +  +L  ++   + + E+++  +++ Y    E A+RF VF+ N+K IE  N
Sbjct: 14 LALFCGAALAARDL-NDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKFIESFN 72

Query: 70 KGEHGTATYGINHLSDLTREEMKS 93
           G +     G+N  +DLT +E ++
Sbjct: 73 AGGNRKFWLGVNQFADLTNDEFRA 96


>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
           1; Flags: Precursor
 gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
          Length = 380

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/75 (29%), Positives = 41/75 (54%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           + N  +  N E    +E ++  + KSY +  E  +RF +F++ L+ I++ N   + +   
Sbjct: 27  AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86

Query: 79  GINHLSDLTREEMKS 93
           G+N  +DLT EE +S
Sbjct: 87  GLNQFADLTDEEFRS 101


>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
          Length = 380

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/75 (29%), Positives = 41/75 (54%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           + N  +  N E    +E ++  + KSY +  E  +RF +F++ L+ I++ N   + +   
Sbjct: 27  AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86

Query: 79  GINHLSDLTREEMKS 93
           G+N  +DLT EE +S
Sbjct: 87  GLNQFADLTDEEFRS 101


>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
 gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/67 (31%), Positives = 35/67 (52%)

Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          + P    + E+++  F K Y    E  +RF +F+DN++ IE  N   +      +N  +D
Sbjct: 30 QEPSMSARHEQWMETFGKVYADAAEKERRFEIFKDNVEYIESFNTAGNKPYKLSVNKFAD 89

Query: 86 LTREEMK 92
          LT EE+K
Sbjct: 90 LTNEELK 96


>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
           Precursor
 gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 1/74 (1%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E L  +E+++ +  K+Y    E  +RF +F+DNLK IE+ N   + +   G+N  SDLT 
Sbjct: 36  EVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTA 95

Query: 89  EEMK-SRLGLNLSK 101
           +E + S LG  + K
Sbjct: 96  DEFQASYLGGKMEK 109


>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
           Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
 gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
          Length = 380

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/75 (29%), Positives = 41/75 (54%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           + N  +  N E    +E ++  + KSY +  E  +RF +F++ L+ I++ N   + +   
Sbjct: 27  AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86

Query: 79  GINHLSDLTREEMKS 93
           G+N  +DLT EE +S
Sbjct: 87  GLNQFADLTDEEFRS 101


>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
 gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
          Length = 380

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/75 (29%), Positives = 41/75 (54%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           + N  +  N E    +E ++  + KSY +  E  +RF +F++ L+ I++ N   + +   
Sbjct: 27  AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86

Query: 79  GINHLSDLTREEMKS 93
           G+N  +DLT EE +S
Sbjct: 87  GLNQFADLTDEEFRS 101


>gi|297816790|ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 368

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 33/64 (51%), Gaps = 9/64 (14%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG----TATYGINHLSDLTR 88
           +F  F+ D+ K+Y T+EE   R  +F  N+     L   EH     TA +G+   SDLT 
Sbjct: 50  KFRVFMSDYGKNYSTREEYIHRLGIFAKNV-----LKAAEHQMMDPTAVHGVTQFSDLTE 104

Query: 89  EEMK 92
           EE K
Sbjct: 105 EEFK 108


>gi|224066056|ref|XP_002302004.1| predicted protein [Populus trichocarpa]
 gi|222843730|gb|EEE81277.1| predicted protein [Populus trichocarpa]
          Length = 367

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 4/86 (4%)

Query: 13  LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
           L  Q+ S+ E    N EH   F  F   F K+Y T+EE   RF VF+ NL+  +  ++  
Sbjct: 32  LIRQVVSDGEDDLLNAEH--HFTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKK-HQMI 88

Query: 73  HGTATYGINHLSDLTREEMKSR-LGL 97
             TA +GI   SDLT +E + + LGL
Sbjct: 89  DPTAAHGITKFSDLTPKEFRRQFLGL 114


>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
          Length = 457

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/65 (36%), Positives = 39/65 (60%), Gaps = 1/65 (1%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E +  +E ++    K+Y +  E  +RF VF+DNL+ I++ N  E+ T   G+N  +DLT 
Sbjct: 37  EVMAIYEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDEHNS-ENRTYRVGLNRFADLTN 95

Query: 89  EEMKS 93
           EE +S
Sbjct: 96  EEYRS 100


>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 26/72 (36%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 23  LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINH 82
            KT++ E    FE ++    KSY    E  KRF +F++NL+ I++ N  E      G+N 
Sbjct: 35  FKTDD-EATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNK 93

Query: 83  LSDLTREEMKSR 94
            +DLT EE +S+
Sbjct: 94  FADLTNEEYRSK 105


>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
 gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
          Length = 296

 Score = 46.6 bits (109), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 35/59 (59%)

Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          E+++  +S+ Y    E A+RF VF+ N+K IE  N G +     G+N  +DLT +E ++
Sbjct: 6  EQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTNDEFRA 64


>gi|18424347|ref|NP_568921.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|71152227|sp|Q8H166.2|ALEU_ARATH RecName: Full=Thiol protease aleurain; Short=AtALEU; AltName:
           Full=Senescence-associated gene product 2; Flags:
           Precursor
 gi|7230640|gb|AAF43041.1|AF233883_1 AALP protein [Arabidopsis thaliana]
 gi|13430722|gb|AAK25983.1|AF360273_1 putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|9757740|dbj|BAB08221.1| AALP protein [Arabidopsis thaliana]
 gi|21617934|gb|AAM66984.1| cysteine proteinase AALP [Arabidopsis thaliana]
 gi|23397068|gb|AAN31819.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|23397074|gb|AAN31822.1| putative cysteine proteinase AALP [Arabidopsis thaliana]
 gi|24417304|gb|AAN60262.1| unknown [Arabidopsis thaliana]
 gi|222423506|dbj|BAH19723.1| AT5G60360 [Arabidopsis thaliana]
 gi|222424411|dbj|BAH20161.1| AT5G60360 [Arabidopsis thaliana]
 gi|332009930|gb|AED97313.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 358

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y   EE+  RF++F++NL LI   NK +  +   G+N  +DLT +
Sbjct: 55  HVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADLTWQ 113

Query: 90  EM-KSRLG 96
           E  +++LG
Sbjct: 114 EFQRTKLG 121


>gi|375073976|gb|AFA34855.1| cathepsin L-like protein [Trypanosoma cruzi]
          Length = 467

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
          Length = 360

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 31/72 (43%), Positives = 39/72 (54%), Gaps = 4/72 (5%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N EH   F  F   F KSY T+EE   RF VF  NL+  + L+     +A +G+   SDL
Sbjct: 39  NAEH--HFTTFKTKFGKSYATQEEHDYRFGVFRANLRRAK-LHAKLDPSAEHGVTKFSDL 95

Query: 87  TREEMKSR-LGL 97
           T EE K + LGL
Sbjct: 96  TPEEFKRQYLGL 107


>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
 gi|255636658|gb|ACU18666.1| unknown [Glycine max]
          Length = 367

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           + E +  +E+++    K Y   EE  KRF +F+DNL  IE+ N   + T   G+N  SDL
Sbjct: 45  DEEVMSIYEEWLVKHGKVYNAVEEKEKRFQIFKDNLNFIEEHN-AVNRTYKVGLNRFSDL 103

Query: 87  TREEMKSR 94
           + EE +S+
Sbjct: 104 SNEEYRSK 111


>gi|407867877|gb|EKG08706.1| cysteine proteinase, putative [Trypanosoma cruzi]
          Length = 392

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 23/63 (36%), Positives = 37/63 (58%), Gaps = 1/63 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F++F++++ K Y  +E V +R A+FE  L  +   N+  +     GINH+SD T EE+ S
Sbjct: 55  FDRFLQEYGKKYDAREYVRRR-ALFEQTLARVRTHNEAGNHLYVMGINHMSDWTPEELAS 113

Query: 94  RLG 96
             G
Sbjct: 114 LNG 116


>gi|71415597|ref|XP_809860.1| cysteine proteinase [Trypanosoma cruzi strain CL Brener]
 gi|70874305|gb|EAN88009.1| cysteine proteinase, putative [Trypanosoma cruzi]
          Length = 392

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 23/63 (36%), Positives = 37/63 (58%), Gaps = 1/63 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F++F++++ K Y  +E V +R A+FE  L  +   N+  +     GINH+SD T EE+ S
Sbjct: 55  FDRFLQEYGKKYDAREYVRRR-ALFEQTLARVRTHNEAGNHLYVMGINHMSDWTPEELAS 113

Query: 94  RLG 96
             G
Sbjct: 114 LNG 116


>gi|71421935|ref|XP_811957.1| cysteine proteinase [Trypanosoma cruzi strain CL Brener]
 gi|70876682|gb|EAN90106.1| cysteine proteinase, putative [Trypanosoma cruzi]
          Length = 392

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 23/63 (36%), Positives = 37/63 (58%), Gaps = 1/63 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F++F++++ K Y  +E V +R A+FE  L  +   N+  +     GINH+SD T EE+ S
Sbjct: 55  FDRFLQEYGKKYDAREYVRRR-ALFEQTLARVRTHNEAGNHLYVMGINHMSDWTPEELAS 113

Query: 94  RLG 96
             G
Sbjct: 114 LNG 116


>gi|340380717|ref|XP_003388868.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
          Length = 337

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 26/62 (41%), Positives = 35/62 (56%), Gaps = 2/62 (3%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT-YGINHLSDLTREE 90
          + F  +++ + K+Y T EE  +R  V+  N   IE LNK EHG  T Y +N  SDLT  E
Sbjct: 33 ESFNMWMKKYEKTYSTMEEYNERLRVYTSNYYYIEQLNK-EHGPHTEYELNQFSDLTFAE 91

Query: 91 MK 92
           K
Sbjct: 92 FK 93


>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
          Length = 380

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/75 (29%), Positives = 41/75 (54%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           + N  +  N E    +E ++  + KSY +  E  +RF +F++ L+ I++ N   + +   
Sbjct: 27  AKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKV 86

Query: 79  GINHLSDLTREEMKS 93
           G+N  +DLT EE +S
Sbjct: 87  GLNQFADLTDEEFRS 101


>gi|323457344|gb|EGB13210.1| hypothetical protein AURANDRAFT_18666 [Aureococcus
          anophagefferens]
          Length = 346

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 2/63 (3%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGINHLSDLTREEM 91
          FE F  D+ KSY + E  A+RF +F  NL+  E LN  + +   A +G+    DLT  E 
Sbjct: 20 FELFKSDYVKSYNSTEAEAERFTIFSANLRKTEALNAQRVDEDDAEFGVTQFMDLTEAEF 79

Query: 92 KSR 94
          K++
Sbjct: 80 KAQ 82


>gi|440792913|gb|ELR14120.1| papain family cysteine protease subfamily protein [Acanthamoeba
          castellanii str. Neff]
          Length = 321

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 23/64 (35%), Positives = 30/64 (46%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          +QF  F+  F+K Y    E  +R A F  NL  IE  N        +GI   SD+T  E 
Sbjct: 31 EQFYAFVGRFNKKYANDNEYQQRLAAFTHNLAQIEAFNAKYGEKTQFGITQFSDMTPTEF 90

Query: 92 KSRL 95
          K R+
Sbjct: 91 KERV 94


>gi|20069912|ref|NP_613116.1| cathepsin [Mamestra configurata NPV-A]
 gi|37077373|sp|Q8QLK1.1|CATV_NPVMC RecName: Full=Viral cathepsin; Short=V-cath; AltName:
          Full=Cysteine proteinase; Short=CP; Flags: Precursor
 gi|20043306|gb|AAM09141.1| cathepsin [Mamestra configurata NPV-A]
 gi|33331744|gb|AAQ11052.1| putative cysteine proteinase [Mamestra configurata NPV-A]
          Length = 337

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 37/61 (60%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FEKFI  ++K Y +++E   R+ +F  N++ I   N   + +A Y IN  +D+T+ E+ +
Sbjct: 40 FEKFISQYNKQYSSEDEKKYRYNIFRHNIESINAKNS-RNDSAVYKINRFADMTKNEVVN 98

Query: 94 R 94
          R
Sbjct: 99 R 99


>gi|79331505|ref|NP_001032106.1| thiol protease aleurain [Arabidopsis thaliana]
 gi|332009931|gb|AED97314.1| thiol protease aleurain [Arabidopsis thaliana]
          Length = 357

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y   EE+  RF++F++NL LI   NK +  +   G+N  +DLT +
Sbjct: 55  HVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADLTWQ 113

Query: 90  EM-KSRLG 96
           E  +++LG
Sbjct: 114 EFQRTKLG 121


>gi|443694581|gb|ELT95681.1| hypothetical protein CAPTEDRAFT_173171 [Capitella teleta]
          Length = 342

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 33/96 (34%), Positives = 53/96 (55%), Gaps = 8/96 (8%)

Query: 6   SAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
           S EA+   F  ++  N++ +     L  + ++   + KSY  KE+V +R +++E NL+ I
Sbjct: 14  SVEASSLKFQPLRHQNDVMSSELNEL--WTEYKETYGKSYDMKEDVVRR-SLWEGNLRHI 70

Query: 66  EDLNK----GEHGTATYGINHLSDLTREEMKSRLGL 97
              N     G+H + + GIN LSDLT  E + RLGL
Sbjct: 71  SMHNVKHDLGKH-SFSMGINELSDLTPSEYRQRLGL 105


>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
 gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 23/79 (29%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 15  GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
           GQ+    E +T     L+ +E ++  + K+Y    E  +RF +F+DNLK ++  N   + 
Sbjct: 35  GQVPERTEAET-----LRLYEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNP 89

Query: 75  TATYGINHLSDLTREEMKS 93
           +   G+N  +DL+ EE ++
Sbjct: 90  SYKLGLNKFADLSNEEYRA 108


>gi|218137972|gb|ACK57563.1| cysteine protease-like protein [Arachis hypogaea]
          Length = 364

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 32/75 (42%), Positives = 42/75 (56%), Gaps = 10/75 (13%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNL---KLIEDLNKGEHGTATYGINHL 83
           N EH   F  F   FSK+Y TKEE   RF VF+ NL   K  ++L+     +A +G+   
Sbjct: 44  NAEH--HFSAFKTKFSKTYATKEEHDYRFGVFKSNLLRAKSHQELDP----SAIHGVTKF 97

Query: 84  SDLTREEMKSR-LGL 97
           SDLT  E +S+ LGL
Sbjct: 98  SDLTPSEFRSQFLGL 112


>gi|18141289|gb|AAL60582.1|AF454960_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 359

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y   EE+  RF++F++NL LI   NK +  +   G+N  +D+T +
Sbjct: 56  HVISFARFAHRYGKRYENAEEMKLRFSIFKENLDLIRSTNK-KGLSYKLGVNQFADMTWQ 114

Query: 90  EM-KSRLG 96
           E  +++LG
Sbjct: 115 EFQRTKLG 122


>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 34/59 (57%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E+++  ++K Y   EE  KRF +F++N+  IE  N   +     GIN  +DLT EE
Sbjct: 37 ERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLGINQFADLTNEE 95


>gi|28192371|gb|AAK07729.1| NTCP23-like cysteine proteinase [Nicotiana tabacum]
          Length = 360

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H     +F   + K Y + EE+ +RF VF DNLK+I   NK +  +   G+N  +DLT +
Sbjct: 57  HALSSARFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNK-KGLSYKLGVNEFTDLTWD 115

Query: 90  EM-KSRLG 96
           E  + RLG
Sbjct: 116 EFRRDRLG 123


>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 9/65 (13%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG----TATYGINHLSDLT 87
           ++F  F+  + KSYPT++E   RF +F  NL     +   EH     TA +G+   SDL+
Sbjct: 87  RKFVMFMEKYGKSYPTRKEYLHRFGIFVKNL-----IRAAEHQALDPTAVHGVTQFSDLS 141

Query: 88  REEMK 92
            EE +
Sbjct: 142 EEEFE 146


>gi|17978641|gb|AAL48319.1| vinckepain-2 [Plasmodium vinckei]
          Length = 470

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 26/69 (37%), Positives = 36/69 (52%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  F  F++ ++K Y + EE+ +RF +F + LK IE  NK         IN  SDL
Sbjct: 146 NLEAVNIFYNFMKKYNKQYNSAEEMQERFYIFSEKLKKIEKHNKENKYMYKKAINSFSDL 205

Query: 87  TREEMKSRL 95
             EE K R 
Sbjct: 206 HPEEFKMRF 214


>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 366

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
          T+N E +  +E+++    K Y    E  KRF VF+DNL  I++ N  ++ T   G+N  +
Sbjct: 32 TDN-EVMTMYEEWLVKHQKVYNGLGEKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNKFA 90

Query: 85 DLTREEMK 92
          D+T EE +
Sbjct: 91 DMTNEEYR 98


>gi|225707828|gb|ACO09760.1| Cathepsin S precursor [Osmerus mordax]
          Length = 282

 Score = 46.2 bits (108), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/63 (41%), Positives = 39/63 (61%), Gaps = 5/63 (7%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI----EDLNKGEHGTATYGINHLSDLTR 88
          Q+EK+   + KSY  K E   R  V+E NL+L+    E+ + G+H  A  G+NHL+D+T 
Sbjct: 29 QWEKWKDKYQKSYGNKVEDLHRRIVWEKNLRLVHKHNEETSTGQHSFAM-GVNHLTDMTA 87

Query: 89 EEM 91
          EE+
Sbjct: 88 EEV 90


>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
          Length = 394

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 27/66 (40%), Positives = 37/66 (56%), Gaps = 2/66 (3%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
            F  F++ F+K Y   EE A+RF++F+ NL       K +   A +GIN  SDLT EE  
Sbjct: 74  HFAHFVKKFNKEYSGAEEHARRFSIFKKNLHKALRHQKLDR-DAIHGINKFSDLTEEEFH 132

Query: 93  SR-LGL 97
            + LGL
Sbjct: 133 EQYLGL 138


>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
          Length = 448

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/63 (39%), Positives = 35/63 (55%), Gaps = 5/63 (7%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN----KGEHGTATYGINHLSDLTRE 89
          F+ F   F+K Y + EE A+RF+VF  N+  I   N    +G H T T  +N  +DLT E
Sbjct: 30 FDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVH-THTVDVNQFADLTNE 88

Query: 90 EMK 92
          E +
Sbjct: 89 EYR 91


>gi|156106765|gb|ABU49605.1| Der f 1 allergen [Dermatophagoides farinae]
          Length = 321

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)

Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          P  +K FE+F + F+K+Y T  +EEVA++   F ++LK +E  NKG        INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69

Query: 86 LTREEMKSR 94
          L+ +E K+R
Sbjct: 70 LSLDEFKNR 78


>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 9/65 (13%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG----TATYGINHLSDLT 87
           ++F  F+  + KSYPT++E   RF +F  NL     +   EH     TA +G+   SDL+
Sbjct: 87  RKFVMFMEKYGKSYPTRKEYLHRFGIFVKNL-----IRAAEHQALDPTAVHGVTQFSDLS 141

Query: 88  REEMK 92
            EE +
Sbjct: 142 EEEFE 146


>gi|121531592|gb|ABM55481.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
          Length = 318

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 3/68 (4%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
          Q+  F +   K+Y +  E   RF +F++NL+ IE+ N K + G  TY  G+N  +D+T E
Sbjct: 19 QWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEEHNAKYDKGEETYYMGVNQFADMTAE 78

Query: 90 EMKSRLGL 97
          E +  LGL
Sbjct: 79 EFRHMLGL 86


>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 455

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 23/61 (37%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +E+++    K Y    E  KRF +F+DNL+ I+  N  E+ T   G+N  +DLT EE ++
Sbjct: 40 YEEWLVKHGKLYNALGEKDKRFQIFKDNLRFIDQQN-AENRTYKLGLNRFADLTNEEYRA 98

Query: 94 R 94
          R
Sbjct: 99 R 99


>gi|119633262|gb|ABL84750.1| Der f 1 allergen [Dermatophagoides farinae]
          Length = 321

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)

Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          P  +K FE+F + F+K+Y T  +EEVA++   F ++LK +E  NKG        INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69

Query: 86 LTREEMKSR 94
          L+ +E K+R
Sbjct: 70 LSLDEFKNR 78


>gi|215401412|ref|YP_002332715.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
 gi|209483953|gb|ACI47386.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
          Length = 337

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 22/61 (36%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FEKFI  ++K Y T++E   R+ +F  N++ I   N   + +A Y IN  +D+T+ E+  
Sbjct: 40 FEKFIAQYNKKYKTEDEKKYRYNIFRHNMESINHKNS-RNDSAIYKINRFADMTKNEVVI 98

Query: 94 R 94
          R
Sbjct: 99 R 99


>gi|121531590|gb|ABM55480.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
          Length = 321

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 3/68 (4%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
          Q+  F +   K+Y +  E   RF +F++NL+ IE+ N K + G  TY  G+N  +D+T E
Sbjct: 22 QWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEEHNAKYDKGEETYYMGVNQFADMTAE 81

Query: 90 EMKSRLGL 97
          E +  LGL
Sbjct: 82 EFRHMLGL 89


>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
          lyrata]
 gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
          lyrata]
          Length = 346

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
          EH +Q+   +  FS+ Y  + E   RF VF+ NLK IE  NK    T   G+N  +D T+
Sbjct: 36 EHHQQW---MTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTK 92

Query: 89 EE 90
          EE
Sbjct: 93 EE 94


>gi|170064305|ref|XP_001867470.1| cathepsin l [Culex quinquefasciatus]
 gi|167881732|gb|EDS45115.1| cathepsin l [Culex quinquefasciatus]
          Length = 547

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 4/90 (4%)

Query: 12  ALFGQMKSNNELKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           A F  MK     ++E  EHL+ +F +F     K+Y  ++E  +R  +F  NL+ I   N+
Sbjct: 223 ATFNPMKEFIHPRSE--EHLQDEFTRFKYKHGKTYNGEKEHDRRQDIFRQNLRFIHSHNR 280

Query: 71  GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
              G  T  +NHL+D T EE+++  G   S
Sbjct: 281 ANKGY-TVAVNHLADRTDEEIQALRGFKSS 309


>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
          Length = 339

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 1/59 (1%)

Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          E+++  + + Y    E A+RF VF+ N+  IE  N G H     G+N  +DLT +E +S
Sbjct: 38 ERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNH-KFWLGVNQFADLTNDEFRS 95


>gi|432882407|ref|XP_004074015.1| PREDICTED: cathepsin K-like [Oryzias latipes]
          Length = 330

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 27/79 (34%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 28  PEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG---EHGTATYGINHLS 84
           P+  + +E++ +   KSY  + E+  R AV+E NL ++   N+    E  + T G+NHLS
Sbjct: 20  PDVNRLWEEWKQKHDKSYSNQTEMNFRRAVWEKNLHVVMKHNQQATEEKHSFTVGLNHLS 79

Query: 85  DLTREEMKSRL-GLNLSKH 102
           D+T EE+  +L G  + +H
Sbjct: 80  DMTAEEINEKLNGFKMEEH 98


>gi|440804881|gb|ELR25744.1| papain family cysteine protease subfamily protein [Acanthamoeba
           castellanii str. Neff]
          Length = 383

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 24/63 (38%), Positives = 33/63 (52%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE ++++F+K Y + EE   R AVFE  L  I   N     T   G+NHL+D    E + 
Sbjct: 41  FEAYVKEFNKVYASLEEREARRAVFEARLAKIRAHNADPTKTWKEGVNHLTDRHEHEFRR 100

Query: 94  RLG 96
            LG
Sbjct: 101 LLG 103


>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 26/72 (36%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           L  +E+++ +  K+Y    E  +RF +F+DNLK IE+ N   + +   G+N  SDLT +E
Sbjct: 38  LTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADE 97

Query: 91  MK-SRLGLNLSK 101
            + S LG  + K
Sbjct: 98  FQASYLGGKMEK 109


>gi|403223173|dbj|BAM41304.1| cysteine protease precursor TacP [Theileria orientalis strain
           Shintoku]
          Length = 463

 Score = 46.2 bits (108), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E L+ FEKF  D++K + T +E  +RF VF +N   +E L    H T T  +N  SDLT 
Sbjct: 140 EALRSFEKFKADYNKVHATDDERRERFLVFRNN--YLETLTHKGHETFTKSVNFFSDLTE 197

Query: 89  EEM 91
           EE+
Sbjct: 198 EEL 200


>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
          Length = 339

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 22/62 (35%), Positives = 33/62 (53%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          ++ E ++  + + Y    E  KRF +F+DN+  IE  NK    T    IN  +DLT EE 
Sbjct: 37 ERHEDWMARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEF 96

Query: 92 KS 93
          +S
Sbjct: 97 RS 98


>gi|119633264|gb|ABL84751.1| Der f 1 allergen [Dermatophagoides farinae]
          Length = 321

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)

Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          P  +K FE+F + F+K+Y T  +EEVA++   F ++LK +E  NKG        INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69

Query: 86 LTREEMKSR 94
          L+ +E K+R
Sbjct: 70 LSLDEFKNR 78


>gi|27530349|dbj|BAC53948.1| Der f 1 allergen preproenzyme [Dermatophagoides farinae]
          Length = 321

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)

Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          P  +K FE+F + F+K+Y T  +EEVA++   F ++LK +E  NKG        INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69

Query: 86 LTREEMKSR 94
          L+ +E K+R
Sbjct: 70 LSLDEFKNR 78


>gi|357630543|gb|EHJ78591.1| hypothetical protein KGM_15350 [Danaus plexippus]
          Length = 87

 Score = 46.2 bits (108), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FEKF +D++++Y  + +  + F  F   LK I   N  E   AT+ IN  +D T EE K+
Sbjct: 18 FEKFTKDYNRNYKDEADRQEHFQAFIKTLKSINKAN-AESSHATFDINKFADYTPEERKN 76

Query: 94 RLGLNL 99
            GLNL
Sbjct: 77 MFGLNL 82


>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
 gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
          Length = 336

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 1/59 (1%)

Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          E+++  + + Y    E A+RF VF+ N+  IE  N G H     G+N  +DLT +E +S
Sbjct: 38 ERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNH-KFWLGVNQFADLTNDEFRS 95


>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
          Length = 380

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 24/83 (28%), Positives = 42/83 (50%)

Query: 11  LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           L L     + N  K  N E    +E ++  + KSY +  E  +RF +F++ L+ I++ N 
Sbjct: 19  LVLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKETLRFIDEHNA 78

Query: 71  GEHGTATYGINHLSDLTREEMKS 93
             + +   G+N  +D T EE +S
Sbjct: 79  DTNRSYRVGLNQFADQTNEEFQS 101


>gi|22549430|ref|NP_689203.1| cath gene product [Mamestra configurata NPV-B]
 gi|215401259|ref|YP_002332563.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
 gi|22476609|gb|AAM95015.1| putative cysteine proteinase [Mamestra configurata NPV-B]
 gi|198448759|gb|ACH88549.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
 gi|390165231|gb|AFL64878.1| cathepsin [Mamestra brassicae MNPV]
 gi|401665635|gb|AFP95747.1| putative cysteine proteinase [Mamestra brassicae MNPV]
          Length = 341

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 37/61 (60%), Gaps = 1/61 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FEKFI  ++K Y +++E   R+ +F  N++ I   N   + +A Y IN  +D+T+ E+ +
Sbjct: 44  FEKFITQYNKQYSSEDEKKYRYNIFRHNIESINAKNS-RNDSAVYKINRFADMTKNEVVN 102

Query: 94  R 94
           R
Sbjct: 103 R 103


>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
 gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
          Length = 323

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 1/59 (1%)

Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          E+++  + + Y    E A+RF VF+ N+  IE  N G H     G+N  +DLT +E +S
Sbjct: 38 ERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNH-KFWLGVNQFADLTNDEFRS 95


>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
          Length = 423

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 21/51 (41%), Positives = 32/51 (62%)

Query: 43 KSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          K+Y       KRF +F+DNL+ I++ NKG + +   G+N  +DL+ EE KS
Sbjct: 16 KNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKS 66


>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
 gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
          Length = 414

 Score = 45.8 bits (107), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 41/71 (57%), Gaps = 5/71 (7%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK----GEHGTATYGINHLSDLTRE 89
           F ++ +   K+Y ++EE   R  +F DN + ++  N     GEH T   G+NHL+DLT++
Sbjct: 68  FHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEH-THFVGLNHLADLTKD 126

Query: 90  EMKSRLGLNLS 100
           E K  LG N +
Sbjct: 127 EFKKMLGYNAA 137


>gi|119964630|ref|YP_950826.1| cathepsin [Maruca vitrata MNPV]
 gi|119514473|gb|ABL76048.1| cathepsin [Maruca vitrata MNPV]
          Length = 324

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 25/67 (37%), Positives = 43/67 (64%), Gaps = 2/67 (2%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F+  F+K+Y ++ E  +RF +F+ NL  I + N+ +   A Y IN  SDL+++E  +
Sbjct: 28 FEEFVLQFNKNYGSEIEKLRRFKIFQHNLNEIINKNQND-SAAKYEINKFSDLSKDETIA 86

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 87 KYTGLSL 93


>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
 gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 377

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 2/66 (3%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM- 91
            F  F + F KSY +KEE   RF VF+ NLK  +  ++    +AT+G+   SDLT  E  
Sbjct: 59  HFSVFKQKFGKSYASKEEHDHRFRVFKANLKRAQR-HQALDPSATHGVTQFSDLTPSEFR 117

Query: 92  KSRLGL 97
           +S LGL
Sbjct: 118 RSFLGL 123


>gi|119633260|gb|ABL84749.1| Der f 1 allergen [Dermatophagoides farinae]
          Length = 321

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)

Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          P  +K FE+F + F+K+Y T  +EEVA++   F ++LK +E  NKG        INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69

Query: 86 LTREEMKSR 94
          L+ +E K+R
Sbjct: 70 LSLDEFKNR 78


>gi|1581746|prf||2117247B Cys protease:ISOTYPE=2
          Length = 467

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 25/62 (40%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF  F +   K Y +  E A R  VF++NL L   L+   +  A++G+   SDLTREE +
Sbjct: 37 QFAAFKQRHGKVYGSAAEEAFRLGVFKENL-LFARLHAAANPHASFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|403180727|gb|AEW46900.2| cathepsin-like protease, partial [Chilo suppressalis]
          Length = 100

 Score = 45.8 bits (107), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/64 (40%), Positives = 38/64 (59%), Gaps = 1/64 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FEKFI+D+++SY  + +    +  F  NL+ I +LNK ++  ATYGIN  +D T  E K 
Sbjct: 33 FEKFIKDYNRSYRDEYDKKVHYEAFVINLQEINELNK-KNPRATYGINKFADYTDAEKKR 91

Query: 94 RLGL 97
            G 
Sbjct: 92 MFGF 95


>gi|302754322|ref|XP_002960585.1| hypothetical protein SELMODRAFT_266583 [Selaginella
          moellendorffii]
 gi|300171524|gb|EFJ38124.1| hypothetical protein SELMODRAFT_266583 [Selaginella
          moellendorffii]
          Length = 330

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 28/68 (41%), Positives = 36/68 (52%), Gaps = 2/68 (2%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           F+ FI  F K+Y T E  A R  VFE NL      ++    +A +GI   SDLT EE K
Sbjct: 20 HFKSFIARFGKAYATAEAYAHRLKVFEANLVRAVS-HQALDPSAVHGITQFSDLTEEEFK 78

Query: 93 SR-LGLNL 99
           + LGL +
Sbjct: 79 QQFLGLRV 86


>gi|20428641|ref|NP_620470.1| 26-29kD-proteinase [Drosophila melanogaster]
 gi|6448467|dbj|BAA86910.1| homologue of Sarcophaga 26,29kDa proteinase [Drosophila
           melanogaster]
 gi|7294432|gb|AAF49777.1| 26-29kD-proteinase [Drosophila melanogaster]
 gi|21483518|gb|AAM52734.1| RE18380p [Drosophila melanogaster]
          Length = 549

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 5/90 (5%)

Query: 12  ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           A F  M+   E  +   EH+ K F  F R    +Y +  E   R  +F  NL+ I   N+
Sbjct: 225 ATFNPMQ---EFISGTDEHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNR 281

Query: 71  GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
            +  T T  +NHL+D T EE+K+R G   S
Sbjct: 282 AKL-TYTLAVNHLADKTEEELKARRGYKSS 310


>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 20/59 (33%), Positives = 35/59 (59%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E+++  ++K Y   +E  +RF +F++N+  IE  N   +   T GIN  +DLT EE
Sbjct: 37 ERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEE 95


>gi|1617037|emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium discoideum]
          Length = 343

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 25/69 (36%), Positives = 38/69 (55%), Gaps = 4/69 (5%)

Query: 28 PEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK---GEHGTATYGINHLS 84
          PE   QF +F   F+K Y + EE  +RF +F+ NL  IE+LN           +G+N  +
Sbjct: 23 PEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFA 81

Query: 85 DLTREEMKS 93
          DL+ +E K+
Sbjct: 82 DLSSDEFKN 90


>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 20/59 (33%), Positives = 35/59 (59%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E+++  ++K Y   +E  +RF +F++N+  IE  N   +   T GIN  +DLT EE
Sbjct: 37 ERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEE 95


>gi|282158089|ref|NP_001164088.1| cathepsin L precursor [Tribolium castaneum]
          Length = 552

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 23  LKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
           ++ E   H+  +F KF R   K+Y  K E   R  +F  N++ I  +N+   G  T  +N
Sbjct: 237 IRPEKSGHVDFEFGKFTRKHGKNYQNKTETLMRKDIFRQNVRFIHSMNRQNRGF-TLTVN 295

Query: 82  HLSDLTREEMKSRLGLNLS 100
           HL+D T  E+K+  G   S
Sbjct: 296 HLADKTPTELKALRGRTYS 314


>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
          Length = 466

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 22/60 (36%), Positives = 35/60 (58%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           +E ++ +  KSY    E  KRF +F+DNLK I++ N   + +   G+   +DLT EE +S
Sbjct: 49  YESWLIEHGKSYNALGEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRS 108


>gi|730035|sp|P16311.2|PEPT1_DERFA RecName: Full=Peptidase 1; AltName: Full=Allergen Der f I;
          AltName: Full=Major mite fecal allergen Der f 1;
          AltName: Allergen=Der f 1; Flags: Precursor
          Length = 321

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)

Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          P  +K FE+F + F+K+Y T  +EEVA++   F ++LK +E  NKG        INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 69

Query: 86 LTREEMKSR 94
          L+ +E K+R
Sbjct: 70 LSLDEFKNR 78


>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
          Length = 467

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 26/89 (29%), Positives = 48/89 (53%), Gaps = 1/89 (1%)

Query: 5   ASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKL 64
           ++++ ++  + Q  +       + E +  +E+++    K Y    E  KRF VF+DNL+ 
Sbjct: 23  SASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGKVYNALGEREKRFQVFKDNLRF 82

Query: 65  IEDLNKGEHGTATYGINHLSDLTREEMKS 93
           I++ N  E+ T   G+N  +DLT EE +S
Sbjct: 83  IDEHNS-ENRTYKLGLNGFADLTNEEYRS 110


>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 360

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 22/54 (40%), Positives = 32/54 (59%)

Query: 42  SKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRL 95
           ++SY + EE  +RF V+ DN++ IE  N+    T   G N  +DLTREE  +R 
Sbjct: 50  NQSYRSAEERLRRFQVYRDNVEYIETTNRRGDLTYQLGENQFADLTREEFIARF 103


>gi|195590156|ref|XP_002084812.1| GD14469 [Drosophila simulans]
 gi|194196821|gb|EDX10397.1| GD14469 [Drosophila simulans]
          Length = 549

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 5/90 (5%)

Query: 12  ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           A F  M+   E  +   EH+ K F  F R    +Y +  E   R  +F  NL+ I   N+
Sbjct: 225 ATFNPMQ---EFISGTDEHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNR 281

Query: 71  GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
            +  T T  +NHL+D T EE+K+R G   S
Sbjct: 282 AKL-TYTLAVNHLADKTEEELKARRGYKSS 310


>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
          Length = 371

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 25/91 (27%), Positives = 51/91 (56%), Gaps = 1/91 (1%)

Query: 5   ASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKL 64
           A+  +TL L     S++  ++++ E +  ++ ++    K+Y    E  KRF +F+DNL+ 
Sbjct: 17  ATYISTLTLNQNHPSSSSWRSDD-EVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRF 75

Query: 65  IEDLNKGEHGTATYGINHLSDLTREEMKSRL 95
           I++ N   + T   G+N  +DLT +E +++ 
Sbjct: 76  IDEHNSNNNTTYKLGLNKFADLTNQEYRAKF 106


>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 376

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 25/69 (36%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
           +E+++ +  K+Y    E  +RF +F+DNLK IE+ N   + +   G+N  SDLT +E + 
Sbjct: 41  YERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSYDRGLNQFSDLTVDEFQA 100

Query: 93  SRLGLNLSK 101
           S LG  + K
Sbjct: 101 SYLGGKIEK 109


>gi|1323748|gb|AAC49287.1| thiol protease [Triticum aestivum]
          Length = 374

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 34/60 (56%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +++F  ++    KSY   EE  +RF +F  N++ IE  N+    + T G+N  +DLT EE
Sbjct: 47  MERFHGWMAKHGKSYAGVEEKLRRFDIFRRNVEFIEAANRDGRLSYTLGVNQFADLTHEE 106


>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
          Length = 351

 Score = 45.8 bits (107), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 41/67 (61%), Gaps = 2/67 (2%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE ++    KSY + EE   RF VF+DNLK I++ NK +  +   G+N  +DL+ EE K 
Sbjct: 48  FESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNK-KVSSYWLGLNEFADLSHEEFKR 106

Query: 94  R-LGLNL 99
           + LGL +
Sbjct: 107 KYLGLKI 113


>gi|195327474|ref|XP_002030443.1| GM25442 [Drosophila sechellia]
 gi|194119386|gb|EDW41429.1| GM25442 [Drosophila sechellia]
          Length = 549

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 31/90 (34%), Positives = 44/90 (48%), Gaps = 5/90 (5%)

Query: 12  ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           A F  M+   E  +   EH+ K F  F R    +Y +  E   R  +F  NL+ I   N+
Sbjct: 225 ATFNPMQ---EFISGTDEHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNR 281

Query: 71  GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
            +  T T  +NHL+D T EE+K+R G   S
Sbjct: 282 AKL-TYTLAVNHLADKTEEELKARRGYKSS 310


>gi|440291172|gb|ELP84441.1| cysteine protease, putative, partial [Entamoeba invadens IP1]
          Length = 472

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 24/60 (40%), Positives = 36/60 (60%), Gaps = 2/60 (3%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGINHLSDLTREEM 91
          F++F   +SK Y T  +   + A+F D+LK I +LN  +     A +GIN+ SDLT +EM
Sbjct: 25 FKEFELKYSKKYETPAQRLSKLALFRDSLKKIRELNSQRTRKSDAIFGINYYSDLTPKEM 84


>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
          Length = 291

 Score = 45.8 bits (107), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 20/71 (28%), Positives = 41/71 (57%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +K+FE+++ ++ + Y   +E  +RF +F++N+  IE  N     + T GIN  +D+T  E
Sbjct: 34  MKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNE 93

Query: 91  MKSRLGLNLSK 101
             ++    +S+
Sbjct: 94  FVAQYTGGISR 104


>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
 gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
          Length = 341

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 40/68 (58%), Gaps = 5/68 (7%)

Query: 35  EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY--GINHLSDLTREEMK 92
           E+++  + + Y  + E  KRF +F++N++ IE  NK   GT  Y  GIN  +DLT +E K
Sbjct: 40  EQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKA--GTKPYKLGINAFADLTNQEFK 97

Query: 93  -SRLGLNL 99
            SR G  L
Sbjct: 98  ASRNGYKL 105


>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
 gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 343

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 20/59 (33%), Positives = 35/59 (59%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E+++  ++K Y   +E  +RF +F++N+  IE  N   +   T GIN  +DLT EE
Sbjct: 37 ERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEE 95


>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
 gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
 gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
 gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
 gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
          Length = 422

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 24/70 (34%), Positives = 43/70 (61%), Gaps = 2/70 (2%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  F   ++KSY T+EE  +R+A+F++NL  I   N+  + + +  +NH  DL+R+E + 
Sbjct: 117 FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRR 175

Query: 94  R-LGLNLSKH 102
           + LG   S++
Sbjct: 176 KYLGFKKSRN 185


>gi|388519111|gb|AFK47617.1| unknown [Medicago truncatula]
          Length = 241

 Score = 45.8 bits (107), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/90 (41%), Positives = 46/90 (51%), Gaps = 10/90 (11%)

Query: 13  LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNL---KLIEDLN 69
           L  Q+    E    N EH   F  F   FSK+Y TKEE   RF VF+ NL   KL + L+
Sbjct: 32  LIRQVVDTAEDHILNAEH--HFTSFKSKFSKNYATKEEHDYRFGVFKSNLIKAKLHQKLD 89

Query: 70  KGEHGTATYGINHLSDLTREEMKSR-LGLN 98
                +A +GI   SDLT  E + + LGLN
Sbjct: 90  P----SAQHGITKFSDLTASEFRRQFLGLN 115


>gi|298713906|emb|CBJ33775.1| Cathepsin-like proteinase [Ectocarpus siliculosus]
          Length = 462

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 29/72 (40%), Positives = 39/72 (54%), Gaps = 3/72 (4%)

Query: 21  NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
           +EL  +  E L  F++F   F KSY   +E A RF VF+ NLK I++ N    G   Y +
Sbjct: 115 SELSDQELESL--FQEFGIKFEKSYENDDEKAMRFEVFKRNLKRIDERNSKSLGV-KYDV 171

Query: 81  NHLSDLTREEMK 92
              +DLT EE K
Sbjct: 172 TMWTDLTHEEFK 183


>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
 gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
          Length = 364

 Score = 45.8 bits (107), Expect = 0.004,   Method: Composition-based stats.
 Identities = 25/69 (36%), Positives = 42/69 (60%), Gaps = 2/69 (2%)

Query: 25 TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
          +EN E +  +E+++    K Y   +E  KRF VF+DNL  I+D N  ++ T T G+N  +
Sbjct: 28 SEN-EVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQDHN-AQNNTYTLGLNKFA 85

Query: 85 DLTREEMKS 93
          D+T +E ++
Sbjct: 86 DITNKEYRA 94


>gi|413917779|gb|AFW57711.1| hypothetical protein ZEAMMB73_361217 [Zea mays]
          Length = 390

 Score = 45.8 bits (107), Expect = 0.004,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 33/58 (56%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +F  ++    +SYPT EE  +RF ++  N++LIE  N+    T T G N  +DL+  E
Sbjct: 58  RFHAWMAAHGRSYPTAEEKLRRFHIYRANVELIEATNRDTSKTFTCGENQFTDLSHHE 115


>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
          max]
 gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
          max]
          Length = 343

 Score = 45.8 bits (107), Expect = 0.004,   Method: Composition-based stats.
 Identities = 22/59 (37%), Positives = 33/59 (55%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E+++  + K Y   EE  KRF VF++N+  IE  N   +     GIN  +DLT EE
Sbjct: 37 ERHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLGINQFADLTSEE 95


>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella
          moellendorffii]
 gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella
          moellendorffii]
          Length = 345

 Score = 45.8 bits (107), Expect = 0.004,   Method: Composition-based stats.
 Identities = 18/59 (30%), Positives = 35/59 (59%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          ++K+I++  K+Y +  E  KRF +F++N+  I   N   + + + G+N  +DLT  E +
Sbjct: 38 YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEFR 96


>gi|403376395|gb|EJY88173.1| Cysteine protease-5 [Oxytricha trifallax]
          Length = 401

 Score = 45.8 bits (107), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 36/65 (55%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E  + F +F+ ++ K+Y TK  +  RF +F  N ++I+  N+ E      GIN  SD+
Sbjct: 65  NHETQQAFIQFVAEYGKTYATKNHLNSRFDIFAKNFEMIKSHNENEEKHYEMGINKFSDM 124

Query: 87  TREEM 91
           T EE 
Sbjct: 125 THEEF 129


>gi|307169691|gb|EFN62267.1| Cathepsin O [Camponotus floridanus]
          Length = 358

 Score = 45.8 bits (107), Expect = 0.004,   Method: Composition-based stats.
 Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 3/68 (4%)

Query: 26 ENPEHLKQFEKFIRDFSKSYPTKE-EVAKRFAVFEDNLKLIEDLN--KGEHGTATYGINH 82
          +N E  K FE +I  ++KSY     E  KRF  F+ +L+ IE +N  +    +A YG+  
Sbjct: 28 KNVEDAKLFENYIVQYNKSYRNDSTEYKKRFECFQKSLRHIEKMNSFQSSQESAYYGLTK 87

Query: 83 LSDLTREE 90
           SDL+ +E
Sbjct: 88 FSDLSEDE 95


>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
          Length = 421

 Score = 45.8 bits (107), Expect = 0.004,   Method: Composition-based stats.
 Identities = 24/70 (34%), Positives = 43/70 (61%), Gaps = 2/70 (2%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  F   ++KSY T+EE  +R+A+F++NL  I   N+  + + +  +NH  DL+R+E + 
Sbjct: 116 FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRR 174

Query: 94  R-LGLNLSKH 102
           + LG   S++
Sbjct: 175 KYLGFKKSRN 184


>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 45.8 bits (107), Expect = 0.004,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 33/59 (55%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E+++  ++K Y   EE  KRF +F++N+  IE  N         GIN  +DLT EE
Sbjct: 37 ERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLGINQFADLTNEE 95


>gi|42564149|gb|AAS20588.1| digestive cysteine proteinase intestain [Leptinotarsa
          decemlineata]
          Length = 322

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 3/68 (4%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
          Q+  F +   K+Y +  E   RF +F++NL+ IE  N K E G  TY   +   +D+TR+
Sbjct: 22 QWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEKHNAKYEEGKVTYYMAVTQFADMTRD 81

Query: 90 EMKSRLGL 97
          E + +LGL
Sbjct: 82 EFRKKLGL 89


>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
 gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
          Length = 343

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 22/71 (30%), Positives = 39/71 (54%), Gaps = 1/71 (1%)

Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA-TYGINH 82
          +T   +  ++  +++  + K Y   +E  KRF +F +N+  IE  NKG++    T G+N 
Sbjct: 28 RTLQDDMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQ 87

Query: 83 LSDLTREEMKS 93
           +DLT +E  S
Sbjct: 88 FADLTNDEFTS 98


>gi|270011045|gb|EFA07493.1| cathepsin L precursor [Tribolium castaneum]
          Length = 429

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 23  LKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
           ++ E   H+  +F KF R   K+Y  K E   R  +F  N++ I  +N+   G  T  +N
Sbjct: 235 IRPEKSGHVDFEFGKFTRKHGKNYQNKTETLMRKDIFRQNVRFIHSMNRQNRGF-TLTVN 293

Query: 82  HLSDLTREEMKSRLGLNLS 100
           HL+D T  E+K+  G   S
Sbjct: 294 HLADKTPTELKALRGRTYS 312


>gi|149725427|ref|XP_001494683.1| PREDICTED: cathepsin W-like [Equus caballus]
          Length = 373

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F  F   +++SY +  E A R  +F  NL   + L + + GTA +G++  SDL
Sbjct: 35  PLELKEVFTLFQIQYNRSYSSPAEYAHRLDIFARNLAQAQRLQEDDLGTAEFGVSPFSDL 94

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 95  TEEEFGQLYG 104


>gi|42564153|gb|AAS20589.1| digestive cysteine proteinase intestain [Leptinotarsa
          decemlineata]
          Length = 322

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 3/68 (4%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
          Q+  F +   K+Y +  E   RF +F++NL+ IE  N K E G  TY   +   +D+TR+
Sbjct: 22 QWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEKHNAKYEEGKVTYYMAVTQFADMTRD 81

Query: 90 EMKSRLGL 97
          E + +LGL
Sbjct: 82 EFRKKLGL 89


>gi|33590494|gb|AAQ22984.1| cathepsin L-like cysteine proteinase precursor [Acanthoscelides
          obtectus]
          Length = 321

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 25/69 (36%), Positives = 41/69 (59%), Gaps = 3/69 (4%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEH-GTATY--GINHLSDLTR 88
          +++++F     ++Y T  E  +RF +F+ NL+ IE+ N+  H G  T+  GIN   D+T+
Sbjct: 21 EKWQQFKIQHGRTYRTLLEEKRRFEIFKFNLRTIEEHNERYHNGEETFEMGINQFGDMTQ 80

Query: 89 EEMKSRLGL 97
          EE K  L L
Sbjct: 81 EEFKRMLAL 89


>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
          Length = 367

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 28/81 (34%), Positives = 38/81 (46%), Gaps = 3/81 (3%)

Query: 15  GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
           G+     E    N EH   F  F   F K Y TKEE  +RF VF+ NL+    L+     
Sbjct: 36  GEAAEKEEDHLLNAEH--HFASFKAKFGKKYATKEEHDRRFGVFKSNLRRAR-LHAKLDP 92

Query: 75  TATYGINHLSDLTREEMKSRL 95
           +A +G+   SDLT  E + + 
Sbjct: 93  SAVHGVTKFSDLTPAEFRRQF 113


>gi|170579222|ref|XP_001894733.1| cathepsin F-like cysteine proteinase [Brugia malayi]
 gi|158598547|gb|EDP36418.1| cathepsin F-like cysteine proteinase, putative [Brugia malayi]
          Length = 284

 Score = 45.4 bits (106), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 39/82 (47%), Gaps = 4/82 (4%)

Query: 11  LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           LA+  Q   N E KT        F  FI+ F + Y + EE   RF ++  N+   + L  
Sbjct: 156 LAMNSQEWQNEEKKT----LWSDFMTFIKKFKREYSSIEEQLDRFRIYLQNMNFAKKLQF 211

Query: 71  GEHGTATYGINHLSDLTREEMK 92
            E GTA YG    SD+T EE +
Sbjct: 212 EEKGTAIYGATKFSDMTAEEFQ 233


>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 304

 Score = 45.4 bits (106), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++ E+++  F++ Y    E   RF +F+ NLK +E  N   + T    +N  SDLT EE
Sbjct: 15 IEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFSDLTDEE 74

Query: 91 MKSR-LGL 97
           ++R +GL
Sbjct: 75 FQARYMGL 82


>gi|294885991|ref|XP_002771503.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
 gi|239875207|gb|EER03319.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
          Length = 337

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 24/60 (40%), Positives = 35/60 (58%), Gaps = 1/60 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F +   KSY  K+E  KR A+F DNL  IE++N  ++ +   G+N  +DLT EE  +
Sbjct: 27 FIGFQKKHGKSYDNKDEEMKRAAIFHDNLNYIEEVN-AQNLSYKLGVNEYTDLTLEEFAA 85


>gi|432091112|gb|ELK24324.1| Cathepsin W [Myotis davidii]
          Length = 370

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 26/81 (32%), Positives = 39/81 (48%), Gaps = 5/81 (6%)

Query: 21  NELKTENP-----EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           + L+ +NP     E  + F  F   +++SY    E A R  +F  NL   + L + + GT
Sbjct: 24  DSLRVQNPGAGPLELKEVFTLFQIQYNRSYSNPAEYAHRLDIFARNLAHAQRLQEEDLGT 83

Query: 76  ATYGINHLSDLTREEMKSRLG 96
           A +G+   SDLT EE     G
Sbjct: 84  AEFGVTAFSDLTEEEFDQLYG 104


>gi|1136312|gb|AAB41118.1| cruzipain [Trypanosoma cruzi]
          Length = 383

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF  NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRANL-FLARLHAAANPHATFGVTAFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
          Length = 370

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 29/72 (40%), Positives = 39/72 (54%), Gaps = 4/72 (5%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N EH   F  F   F+K+Y TKEE   RF VF+ NL+    L+     +A +G+   SDL
Sbjct: 51  NAEH--HFASFKAKFAKTYATKEEHDHRFGVFKSNLRRAR-LHAKLDPSAVHGVTKFSDL 107

Query: 87  TREEMKSR-LGL 97
           T  E + + LGL
Sbjct: 108 TPAEFRRQFLGL 119


>gi|260819200|ref|XP_002604925.1| hypothetical protein BRAFLDRAFT_77225 [Branchiostoma floridae]
 gi|229290254|gb|EEN60935.1| hypothetical protein BRAFLDRAFT_77225 [Branchiostoma floridae]
          Length = 520

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 20/56 (35%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 40  DFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRL 95
           + ++ Y T +E   RFA F+DNL  IE LN  E+    +  N  +D++ EE +S++
Sbjct: 181 EHNRRYKTADEEKARFATFQDNLLKIEKLN-AEYSGTEFATNQFADMSEEEFRSKI 235


>gi|118489556|gb|ABK96580.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 367

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 31/83 (37%), Positives = 45/83 (54%), Gaps = 4/83 (4%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           Q+ S+ E    N EH   F  F   F K+Y T+EE   RF VF+ NL+  +  ++    T
Sbjct: 35  QVVSDGEDDLLNAEH--HFTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKK-HQMIDPT 91

Query: 76  ATYGINHLSDLTREEMKSR-LGL 97
           A +G+   SDLT +E + + LGL
Sbjct: 92  AAHGVTKFSDLTPKEFRRQFLGL 114


>gi|148927396|gb|ABR19829.1| cysteine proteinase [Elaeis guineensis]
          Length = 358

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 5/74 (6%)

Query: 24  KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
           +T N  H   F +F   + K Y + EE+  RFA+F +NL+LI   N+        GIN  
Sbjct: 51  QTRNALH---FARFAHRYGKRYQSVEEMKLRFAIFMENLELIRSTNR-RGLPYKLGINRY 106

Query: 84  SDLTREEMK-SRLG 96
           +D++ EE + SRLG
Sbjct: 107 ADMSWEEFRASRLG 120


>gi|302771610|ref|XP_002969223.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
 gi|300162699|gb|EFJ29311.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
          Length = 367

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 28/68 (41%), Positives = 36/68 (52%), Gaps = 2/68 (2%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
            F+ FI  F K+Y T E  A R  VFE NL      ++    +A +GI   SDLT EE K
Sbjct: 57  HFKSFIARFGKAYATAEAYAHRLKVFEANLVRAVS-HQALDPSAVHGITQFSDLTEEEFK 115

Query: 93  SR-LGLNL 99
            + LGL +
Sbjct: 116 QQFLGLRV 123


>gi|71666438|ref|XP_820178.1| cruzipain precursor [Trypanosoma cruzi strain CL Brener]
 gi|70885512|gb|EAN98327.1| cruzipain precursor, putative, partial [Trypanosoma cruzi]
          Length = 174

 Score = 45.4 bits (106), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE 
Sbjct: 36 SQFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEF 94

Query: 92 KSR 94
          +SR
Sbjct: 95 RSR 97


>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
 gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score = 45.4 bits (106), Expect = 0.005,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 36/62 (58%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          ++ E+++  F + Y   +E   R+ +F++N++ IE  NK    +   GIN  +DLT EE 
Sbjct: 37 EKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRIESFNKASEKSYKLGINQFADLTNEEF 96

Query: 92 KS 93
          K+
Sbjct: 97 KT 98


>gi|195624522|gb|ACG34091.1| thiol protease aleurain precursor [Zea mays]
          Length = 360

 Score = 45.4 bits (106), Expect = 0.005,   Method: Composition-based stats.
 Identities = 23/65 (35%), Positives = 40/65 (61%), Gaps = 2/65 (3%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F +F   + KSY +  EV KRF +F ++L+L+   N+ +  +   GIN  +D++ EE +
Sbjct: 58  RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNR-KGLSYRLGINRFADMSWEEFR 116

Query: 93  -SRLG 96
            +RLG
Sbjct: 117 ATRLG 121


>gi|194689248|gb|ACF78708.1| unknown [Zea mays]
 gi|414885653|tpg|DAA61667.1| TPA: cysteine protease2 [Zea mays]
          Length = 360

 Score = 45.4 bits (106), Expect = 0.005,   Method: Composition-based stats.
 Identities = 23/65 (35%), Positives = 40/65 (61%), Gaps = 2/65 (3%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F +F   + KSY +  EV KRF +F ++L+L+   N+ +  +   GIN  +D++ EE +
Sbjct: 58  RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNR-KGLSYRLGINRFADMSWEEFR 116

Query: 93  -SRLG 96
            +RLG
Sbjct: 117 ATRLG 121


>gi|1706261|sp|Q10717.1|CYSP2_MAIZE RecName: Full=Cysteine proteinase 2; Flags: Precursor
 gi|644490|dbj|BAA08245.1| cysteine proteinase [Zea mays]
          Length = 360

 Score = 45.4 bits (106), Expect = 0.005,   Method: Composition-based stats.
 Identities = 23/65 (35%), Positives = 40/65 (61%), Gaps = 2/65 (3%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F +F   + KSY +  EV KRF +F ++L+L+   N+ +  +   GIN  +D++ EE +
Sbjct: 58  RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNR-KGLSYRLGINRFADMSWEEFR 116

Query: 93  -SRLG 96
            +RLG
Sbjct: 117 ATRLG 121


>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
 gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
 gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
 gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
 gi|1096153|prf||2111244A Cys protease
          Length = 380

 Score = 45.4 bits (106), Expect = 0.005,   Method: Composition-based stats.
 Identities = 25/81 (30%), Positives = 43/81 (53%), Gaps = 14/81 (17%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG- 74
           ++  N  L+TE     K+F+ F+ ++ +SY T+EE  +R  +F  N+     +   EH  
Sbjct: 41  KLGDNELLRTE-----KKFKVFMENYGRSYSTEEEYLRRLGIFAQNM-----VRAAEHQA 90

Query: 75  ---TATYGINHLSDLTREEMK 92
              TA +G+   SDLT +E +
Sbjct: 91  LDPTAVHGVTQFSDLTEDEFE 111


>gi|7242888|dbj|BAA92495.1| cysteine protease [Vigna mungo]
          Length = 364

 Score = 45.4 bits (106), Expect = 0.005,   Method: Composition-based stats.
 Identities = 29/72 (40%), Positives = 38/72 (52%), Gaps = 4/72 (5%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N EH   F  F   F K+Y TKEE   RF VF+ NL+    L+     +A +G+   SDL
Sbjct: 45  NAEH--HFSNFKAKFGKTYATKEEHDHRFGVFKSNLRRAR-LHAQLDPSAVHGVTKFSDL 101

Query: 87  TREEMKSR-LGL 97
           T  E + + LGL
Sbjct: 102 TAAEFQRQFLGL 113


>gi|26245865|gb|AAN77408.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
          Length = 173

 Score = 45.4 bits (106), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 3/68 (4%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
          Q+  F +   K+Y +  E   RF +F++NL+ IE  N K E G  TY   +   +D+TR+
Sbjct: 22 QWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEKHNAKYEEGKVTYYMAVTQFADMTRD 81

Query: 90 EMKSRLGL 97
          E + +LGL
Sbjct: 82 EFRKKLGL 89


>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
          Length = 464

 Score = 45.4 bits (106), Expect = 0.005,   Method: Composition-based stats.
 Identities = 23/69 (33%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N + L  +E+++    K+Y    E  KRF +F+DNL  I++ N  ++ +   G+N  +DL
Sbjct: 40  NDQVLTMYEEWLVKHGKNYNALGEKEKRFEIFKDNLGFIDEHNS-KNLSFRLGLNRFADL 98

Query: 87  TREEMKSRL 95
           T EE ++R 
Sbjct: 99  TNEEYRTRF 107


>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
 gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
 gi|228243|prf||1801240A Cys protease 1
          Length = 322

 Score = 45.4 bits (106), Expect = 0.005,   Method: Composition-based stats.
 Identities = 29/74 (39%), Positives = 37/74 (50%), Gaps = 7/74 (9%)

Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--G 79
          L   NP     +E+F   F + Y   EE   R  VF DNL+ IE+ NK  E G  TY   
Sbjct: 13 LAAANP----SWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLA 68

Query: 80 INHLSDLTREEMKS 93
          IN  SD+T E+  +
Sbjct: 69 INQFSDMTNEKFNA 82


>gi|162460343|ref|NP_001105479.1| cysteine protease2 precursor [Zea mays]
 gi|1491774|emb|CAA68192.1| cysteine protease [Zea mays]
          Length = 360

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 23/65 (35%), Positives = 40/65 (61%), Gaps = 2/65 (3%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F +F   + KSY +  EV KRF +F ++L+L+   N+ +  +   GIN  +D++ EE +
Sbjct: 58  RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNR-KGLSYRLGINRFADMSWEEFR 116

Query: 93  -SRLG 96
            +RLG
Sbjct: 117 ATRLG 121


>gi|310656787|gb|ADP02216.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 195

 Score = 45.1 bits (105), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 1/63 (1%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++ E+++  F++ Y    E A+RF VF+ N+  IE  N G H     G+N  +DLT +E
Sbjct: 2  VERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAFIESFNAGNH-KFWLGVNQFTDLTNDE 60

Query: 91 MKS 93
           K+
Sbjct: 61 FKA 63


>gi|167833701|gb|ACA02577.1| cathepsin [Spodoptera frugiperda MNPV]
          Length = 340

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FEKFI  ++K Y +++E   R+ +F  N++ I   N   + +A Y IN  +D+T+ E+  
Sbjct: 43  FEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNS-RNDSAVYKINRFADMTKNEIVI 101

Query: 94  R 94
           R
Sbjct: 102 R 102


>gi|125860143|ref|YP_001036312.1| viral cathepsin [Spodoptera frugiperda MNPV]
 gi|120969288|gb|ABM45731.1| viral cathepsin [Spodoptera frugiperda MNPV]
 gi|319997353|gb|ADV91251.1| V-CATH [Spodoptera frugiperda MNPV]
 gi|384087478|gb|AFH58958.1| v-cath [Spodoptera frugiperda MNPV]
          Length = 339

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FEKFI  ++K Y +++E   R+ +F  N++ I   N   + +A Y IN  +D+T+ E+  
Sbjct: 42  FEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQKNS-RNDSAVYKINRFADMTKNEIVI 100

Query: 94  R 94
           R
Sbjct: 101 R 101


>gi|4678299|emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
          Length = 363

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 23/64 (35%), Positives = 33/64 (51%), Gaps = 9/64 (14%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG----TATYGINHLSDLTR 88
           +F  F+ D+ K+Y T+EE   R  +F  N+     L   EH     +A +G+   SDLT 
Sbjct: 50  KFRLFMSDYGKNYSTREEYIHRLGIFAKNV-----LKAAEHQMMDPSAVHGVTQFSDLTE 104

Query: 89  EEMK 92
           EE K
Sbjct: 105 EEFK 108


>gi|240255643|ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
 gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 367

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 23/64 (35%), Positives = 33/64 (51%), Gaps = 9/64 (14%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG----TATYGINHLSDLTR 88
           +F  F+ D+ K+Y T+EE   R  +F  N+     L   EH     +A +G+   SDLT 
Sbjct: 50  KFRLFMSDYGKNYSTREEYIHRLGIFAKNV-----LKAAEHQMMDPSAVHGVTQFSDLTE 104

Query: 89  EEMK 92
           EE K
Sbjct: 105 EEFK 108


>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 466

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 20/60 (33%), Positives = 34/60 (56%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           +E ++    K+Y    E  +RF +F+DNL+ IE+ N     +   G+N  +DLT EE ++
Sbjct: 48  YEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADLTNEEYRA 107


>gi|33622213|ref|NP_891858.1| cathepsin [Cryptophlebia leucotreta granulovirus]
 gi|33569322|gb|AAQ21608.1| cathepsin [Cryptophlebia leucotreta granulovirus]
          Length = 332

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 39/60 (65%), Gaps = 1/60 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          K F+ F++ ++K+Y T+EE   +F  F++NL++I + N+G    A + IN  SDL + ++
Sbjct: 28 KLFDSFVKQYNKTYLTEEERMIKFDNFKNNLRIINEKNRGSK-HAVFDINKYSDLNKNDL 86


>gi|577617|gb|AAC37213.1| cysteine proteinase [Trypanosoma cruzi]
          Length = 467

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF  NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRANL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|407838603|gb|EKG00105.1| cysteine peptidase, putative,cysteine peptidase, clan CA, family
           C1, cathepsin L-like, putative, partial [Trypanosoma
           cruzi]
          Length = 326

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           QF +F +   + Y +  E A R +VF  NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 70  QFAEFKQKHGRVYGSAAEEAFRLSVFRANL-FLARLHAAANPHATFGVTPFSDLTREEFR 128

Query: 93  SR 94
           SR
Sbjct: 129 SR 130


>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 419

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 20/63 (31%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++ E+++  F++ Y    E A+RF  F+ N+  IE  N G H     G+N  +DLT +E
Sbjct: 34 VEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAFIESFNTGNH-KFWLGVNQFTDLTNDE 92

Query: 91 MKS 93
           ++
Sbjct: 93 FRA 95


>gi|223996996|ref|XP_002288171.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975279|gb|EED93607.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 413

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 25/77 (32%), Positives = 41/77 (53%), Gaps = 10/77 (12%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT--------YGINHLSD 85
           FE+++  F KSY   +E  +R  +F +NL++I + NKG    ++         G+N  +D
Sbjct: 35  FEQYLAHFDKSYSNPDESIRRSRIFNNNLQIILNHNKGRDMDSSGRVQEGFVMGVNQFTD 94

Query: 86  LTREEMKSRLGLNLSKH 102
           + R E+   LG N S H
Sbjct: 95  VERSELP--LGYNKSLH 109


>gi|71400414|ref|XP_803044.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70865609|gb|EAN81598.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 467

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF  NL  +  L+   +  AT+G+   SDLTREE +
Sbjct: 37 QFAEFKQKHGRVYGSAAEEAFRLSVFRANL-FLARLHAAANPHATFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
 gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
          Length = 343

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 23/60 (38%), Positives = 31/60 (51%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE +     KSY +  E A+R  +F D L  IE  N   + T T G+N  SDLT  E ++
Sbjct: 41  FEDWAAKHGKSYSSDLEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 100


>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
          Length = 442

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 24/72 (33%), Positives = 43/72 (59%), Gaps = 7/72 (9%)

Query: 35  EKFIRDF----SKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           E   RDF    +++Y + +E  KRF +F  N+K   +LN+ ++  AT+G N  +D++ EE
Sbjct: 22  EVLFRDFKTTHARNYASADEERKRFEIFAANMKKAAELNR-KNPMATFGPNEFADMSSEE 80

Query: 91  MKSRLGLNLSKH 102
            ++R   N ++H
Sbjct: 81  FQTR--HNAARH 90


>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
 gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
 gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
          Length = 466

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 35/60 (58%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           +E ++ +  KSY    E  KRF +F+DNL+ I++ N   + +   G+   +DLT EE +S
Sbjct: 49  YESWLIEHGKSYNALGEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRS 108


>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
          Length = 318

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 3/68 (4%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSDLTREE 90
          F+ F    SKSY  + E AKR A+F +NL+ IE+ N     G  +Y   +N  +DLT +E
Sbjct: 25 FQSFKLKHSKSYSNQVEEAKRLAIFTENLRDIEEHNALYAAGLVSYNKSVNQFTDLTIDE 84

Query: 91 MKSRLGLN 98
           K+ L L+
Sbjct: 85 FKAYLTLH 92


>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
 gi|255645733|gb|ACU23360.1| unknown [Glycine max]
          Length = 362

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 29/91 (31%), Positives = 49/91 (53%), Gaps = 5/91 (5%)

Query: 10  TLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDL 68
           T +L   M SN   +  + E + Q F+ + ++  + Y  +EE AKRF +F+ NL+ I ++
Sbjct: 20  TCSLSLAMSSNQLEQFASEEEVFQLFQAWQKEHKREYGNQEEKAKRFQIFQSNLRYINEM 79

Query: 69  NKGEHGTAT---YGINHLSDLTREE-MKSRL 95
           N       T    G+N  +D++ EE MK+ L
Sbjct: 80  NAKRKSPTTQHRLGLNKFADMSPEEFMKTYL 110


>gi|118485910|gb|ABK94801.1| unknown [Populus trichocarpa]
          Length = 367

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 31/83 (37%), Positives = 45/83 (54%), Gaps = 4/83 (4%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           Q+ S+ E    N EH   F  F   F K+Y T+EE   RF VF+ NL+  +  ++    T
Sbjct: 35  QVVSDGEDDLLNAEH--HFTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKK-HQMIDPT 91

Query: 76  ATYGINHLSDLTREEMKSR-LGL 97
           A +G+   SDLT +E + + LGL
Sbjct: 92  AAHGVTKFSDLTPKEFRRQFLGL 114


>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
          Length = 350

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 20/59 (33%), Positives = 33/59 (55%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E++++ + K Y    E  KR  +F+DN++ IE  N   +      INHL+D T EE
Sbjct: 36 ERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLSINHLADQTNEE 94


>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
          Length = 340

 Score = 45.1 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 2/82 (2%)

Query: 12 ALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG 71
          AL  Q+ +   L  ++    ++ E+++  + + Y    E  KR+ +FE+N+ LIE  NK 
Sbjct: 18 ALASQLAAARSL--QDASMRERHEEWMASYGRVYKDINEKQKRYKIFEENVALIESSNKD 75

Query: 72 EHGTATYGINHLSDLTREEMKS 93
           +      +N  +DLT EE K+
Sbjct: 76 ANKPYKLSVNQFADLTNEEFKA 97


>gi|405953314|gb|EKC21001.1| Cathepsin F [Crassostrea gigas]
          Length = 397

 Score = 45.1 bits (105), Expect = 0.006,   Method: Composition-based stats.
 Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 1/67 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           L  F+K+  + +K Y        +F VF +NLK+I +LN    G  T+G+N L+DL+++E
Sbjct: 51  LPLFQKWKSEHNKIYRNHMIERSKFKVFLENLKVINELNGQFQGKTTFGLNQLADLSQKE 110

Query: 91  MKSRLGL 97
             SR+ L
Sbjct: 111 F-SRIVL 116


>gi|357130486|ref|XP_003566879.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 354

 Score = 45.1 bits (105), Expect = 0.006,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 38/63 (60%), Gaps = 1/63 (1%)

Query: 35  EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE-MKS 93
           E+++  F ++Y   +E A+R  VF  N + ++ +N+  + T T G+NH SDLT  E ++ 
Sbjct: 39  ERWMARFGRAYKDADEKARRQEVFGANARHVDAVNRSGNRTYTLGLNHFSDLTDHEFLQQ 98

Query: 94  RLG 96
            LG
Sbjct: 99  HLG 101


>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
          sativus]
 gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
          sativus]
          Length = 365

 Score = 45.1 bits (105), Expect = 0.006,   Method: Composition-based stats.
 Identities = 29/94 (30%), Positives = 52/94 (55%), Gaps = 2/94 (2%)

Query: 1  MAEDASAEATLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFE 59
          MA   ++ A L+ F    S + L   +   +++ ++ ++    K+Y   +E  KRF +F+
Sbjct: 1  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFK 60

Query: 60 DNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +NLK I+D N  E+ T   G+N  +DLT EE ++
Sbjct: 61 ENLKFIDDHNS-ENRTYKVGLNMFADLTNEEYRA 93


>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
          [Cucumis sativus]
          Length = 314

 Score = 45.1 bits (105), Expect = 0.006,   Method: Composition-based stats.
 Identities = 18/61 (29%), Positives = 39/61 (63%), Gaps = 1/61 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          +++K++  + + Y ++EE  +RF +++ N++ I++ N   H + T   N+ +DLT EE K
Sbjct: 18 RYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNH-SHTLAENNFADLTNEEFK 76

Query: 93 S 93
          +
Sbjct: 77 A 77


>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
          sativus]
          Length = 317

 Score = 45.1 bits (105), Expect = 0.006,   Method: Composition-based stats.
 Identities = 18/61 (29%), Positives = 39/61 (63%), Gaps = 1/61 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          +++K++  + + Y ++EE  +RF +++ N++ I++ N   H + T   N+ +DLT EE K
Sbjct: 18 RYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNH-SHTLAENNFADLTNEEFK 76

Query: 93 S 93
          +
Sbjct: 77 A 77


>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
 gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score = 45.1 bits (105), Expect = 0.006,   Method: Composition-based stats.
 Identities = 21/68 (30%), Positives = 37/68 (54%)

Query: 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          E    LK+ E+++    + Y   +E  KR+ +F++N++ IE  N G       G+N  +D
Sbjct: 32 EQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFAD 91

Query: 86 LTREEMKS 93
          LT EE ++
Sbjct: 92 LTNEEFRA 99


>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 479

 Score = 45.1 bits (105), Expect = 0.006,   Method: Composition-based stats.
 Identities = 24/69 (34%), Positives = 40/69 (57%), Gaps = 2/69 (2%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           +E ++    K+Y    E  +RF +F+DNL+ I++ N+ E  T   G+   +DLT EE ++
Sbjct: 62  YESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNR-ESRTYKVGLTRFADLTNEEYRA 120

Query: 94  R-LGLNLSK 101
           R LG   S+
Sbjct: 121 RFLGGRFSR 129


>gi|302763127|ref|XP_002964985.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
 gi|300167218|gb|EFJ33823.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
          Length = 320

 Score = 45.1 bits (105), Expect = 0.006,   Method: Composition-based stats.
 Identities = 23/60 (38%), Positives = 31/60 (51%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE +     KSY +  E A+R  +F D L  IE  N   + T T G+N  SDLT  E ++
Sbjct: 41  FEDWAAKHGKSYSSDWEKARRMTIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 100


>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella
          moellendorffii]
 gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella
          moellendorffii]
          Length = 337

 Score = 45.1 bits (105), Expect = 0.006,   Method: Composition-based stats.
 Identities = 23/60 (38%), Positives = 31/60 (51%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE +     KSY +  E A+R  +F D L  IE  N   + T T G+N  SDLT  E ++
Sbjct: 37 FEDWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 96


>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
 gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
           Precursor
 gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
 gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
 gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
 gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
          Length = 368

 Score = 45.1 bits (105), Expect = 0.006,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
            F  F R F K Y + EE   RF+VF+ NL+      K +  +AT+G+   SDLTR E +
Sbjct: 50  HFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLD-PSATHGVTQFSDLTRSEFR 108

Query: 93  SR 94
            +
Sbjct: 109 KK 110


>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
 gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
          Length = 343

 Score = 45.1 bits (105), Expect = 0.006,   Method: Composition-based stats.
 Identities = 20/59 (33%), Positives = 35/59 (59%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E+++    ++Y    E  +RF +F++NL  IE+ NK  + T   G+N  SDL+ EE
Sbjct: 38 EKHEQWMARHGRTYHDNAEKERRFQIFKNNLDYIENFNKAFNKTYKLGLNKFSDLSEEE 96


>gi|8468607|gb|AAF75547.1| cruzipain [Trypanosoma cruzi]
          Length = 467

 Score = 44.7 bits (104), Expect = 0.006,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF +F +   + Y +  E A R +VF +NL  +  L+   +  AT+G+   SDLTREE  
Sbjct: 37 QFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREEFW 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|42407296|dbj|BAD10859.1| cysteine protease [Aster tripolium]
          Length = 363

 Score = 44.7 bits (104), Expect = 0.006,   Method: Composition-based stats.
 Identities = 32/98 (32%), Positives = 53/98 (54%), Gaps = 9/98 (9%)

Query: 6   SAEATLALFGQMKSNNELKTE-----NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFED 60
           +A+++  L  Q+  N+E + E     +PEH   F+ F   F ++Y T+EE   R  VF+ 
Sbjct: 19  TADSSDPLIRQVVQNDETEIESDPLLDPEH--HFKLFKNKFGRTYDTEEEHEYRLTVFKS 76

Query: 61  NLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGL 97
           NL+  +  ++    TA +G+   SDLT  E + + LGL
Sbjct: 77  NLRRAKR-HQVLDPTAKHGVTKFSDLTPSEFRKKYLGL 113


>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
          Length = 318

 Score = 44.7 bits (104), Expect = 0.006,   Method: Composition-based stats.
 Identities = 22/70 (31%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 25  TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
           T   + +  F+ ++ ++ K Y   +E   RF +F+DNLK I++ NK ++ T   G+   +
Sbjct: 39  TSTEKLINLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNK-KNNTYWLGLTSFT 97

Query: 85  DLTREEMKSR 94
           DLT +E K +
Sbjct: 98  DLTNDEFKEK 107


>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
          Length = 318

 Score = 44.7 bits (104), Expect = 0.006,   Method: Composition-based stats.
 Identities = 22/70 (31%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 25  TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
           T   + +  F+ ++ ++ K Y   +E   RF +F+DNLK I++ NK ++ T   G+   +
Sbjct: 39  TSTEKLINLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNK-KNNTYWLGLTSFT 97

Query: 85  DLTREEMKSR 94
           DLT +E K +
Sbjct: 98  DLTNDEFKEK 107


>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
          Length = 368

 Score = 44.7 bits (104), Expect = 0.006,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
            F  F R F K Y + EE   RF+VF+ NL+      K +  +AT+G+   SDLTR E +
Sbjct: 50  HFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLD-PSATHGVTQFSDLTRSEFR 108

Query: 93  SR 94
            +
Sbjct: 109 KK 110


>gi|356576257|ref|XP_003556249.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
           [Glycine max]
          Length = 374

 Score = 44.7 bits (104), Expect = 0.006,   Method: Composition-based stats.
 Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 14/81 (17%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG- 74
           ++  N  L+TE     K+F+ F+ ++ +SY T+EE  +R  +F  N+     L   EH  
Sbjct: 41  KVGDNELLRTE-----KKFKVFMENYGRSYSTREEYLRRLGIFSQNM-----LRAAEHQA 90

Query: 75  ---TATYGINHLSDLTREEMK 92
              TA +G+   SDLT  E +
Sbjct: 91  LDPTAVHGVTQFSDLTEVEFE 111


>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
           [Arabidopsis thaliana]
          Length = 416

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
           F+ + +   K+Y ++EE  +R  +F+DN   +   N   + T +  +N  +DLT  E K 
Sbjct: 30  FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 89

Query: 93  SRLGLNLS 100
           SRLGL++S
Sbjct: 90  SRLGLSVS 97


>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
 gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
 gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
 gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
          Length = 437

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
           F+ + +   K+Y ++EE  +R  +F+DN   +   N   + T +  +N  +DLT  E K 
Sbjct: 32  FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91

Query: 93  SRLGLNLS 100
           SRLGL++S
Sbjct: 92  SRLGLSVS 99


>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
           F+ + +   K+Y ++EE  +R  +F+DN   +   N   + T +  +N  +DLT  E K 
Sbjct: 32  FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91

Query: 93  SRLGLNLS 100
           SRLGL++S
Sbjct: 92  SRLGLSVS 99


>gi|115533516|ref|NP_001041281.1| Protein R07E3.1, isoform b [Caenorhabditis elegans]
 gi|85539716|emb|CAJ58500.1| Protein R07E3.1, isoform b [Caenorhabditis elegans]
          Length = 348

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 1/65 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATYGINHLSDLTREE 90
          K++  +   F KSY T +E  KR   + +  + I + N + EHG+A YG N +SD T EE
Sbjct: 34 KEYIAYTEKFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEYGHNDMSDWTDEE 93

Query: 91 MKSRL 95
           +  L
Sbjct: 94 FEKTL 98


>gi|118156|sp|P14658.1|CYSP_TRYBB RecName: Full=Cysteine proteinase; Flags: Precursor
 gi|10393|emb|CAA34485.1| unnamed protein product [Trypanosoma brucei]
          Length = 450

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F  F + + K Y   +E A RF  FE+N++  + +    +  AT+G+   SD+TREE +
Sbjct: 40  RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98

Query: 93  SR 94
           +R
Sbjct: 99  AR 100


>gi|345488505|ref|XP_001599980.2| PREDICTED: crustapain-like [Nasonia vitripennis]
          Length = 111

 Score = 44.7 bits (104), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 42/70 (60%), Gaps = 3/70 (4%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
          ++E++   F+K Y   EE  +R+ ++ D  K +E+ N K  +G  ++  GINH +D T E
Sbjct: 22 EWEQYKIKFNKKYANPEEEQRRYKIYLDTKKKVEEHNVKYNNGEVSFSLGINHFADRTPE 81

Query: 90 EMKSRLGLNL 99
          E+KS  GL L
Sbjct: 82 ELKSMHGLRL 91


>gi|261328617|emb|CBH11595.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
 gi|261328620|emb|CBH11598.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
          Length = 450

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F  F + + K Y   +E A RF  FE+N++  + +    +  AT+G+   SD+TREE +
Sbjct: 40  RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98

Query: 93  SR 94
           +R
Sbjct: 99  AR 100


>gi|261328615|emb|CBH11593.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
          Length = 451

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F  F + + K Y   +E A RF  FE+N++  + +    +  AT+G+   SD+TREE +
Sbjct: 40  RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98

Query: 93  SR 94
           +R
Sbjct: 99  AR 100


>gi|60679562|gb|AAX34043.1| Sui m 1 allergen [Suidasia medanensis]
          Length = 336

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 27/60 (45%), Positives = 38/60 (63%), Gaps = 3/60 (5%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F   + K Y T EE  +R A+FE+NL+ I++ N G+HG A   +N  +DLT EE  S
Sbjct: 28 FEQFKELYGKQY-TAEEEPQRRAIFEENLRWIQE-NHGKHG-AGLEVNEHADLTAEEFSS 84


>gi|72389847|ref|XP_845218.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389849|ref|XP_845219.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389851|ref|XP_845220.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389857|ref|XP_845223.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359926|gb|AAX80351.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359927|gb|AAX80352.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359928|gb|AAX80353.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359931|gb|AAX80356.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801753|gb|AAZ11659.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801754|gb|AAZ11660.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801755|gb|AAZ11661.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801758|gb|AAZ11664.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 450

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F  F + + K Y   +E A RF  FE+N++  + +    +  AT+G+   SD+TREE +
Sbjct: 40  RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98

Query: 93  SR 94
           +R
Sbjct: 99  AR 100


>gi|72389853|ref|XP_845221.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359929|gb|AAX80354.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801756|gb|AAZ11662.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F  F + + K Y   +E A RF  FE+N++  + +    +  AT+G+   SD+TREE +
Sbjct: 40  RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98

Query: 93  SR 94
           +R
Sbjct: 99  AR 100


>gi|72389861|ref|XP_845225.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389863|ref|XP_845226.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359933|gb|AAX80358.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359934|gb|AAX80359.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801760|gb|AAZ11666.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801761|gb|AAZ11667.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F  F + + K Y   +E A RF  FE+N++  + +    +  AT+G+   SD+TREE +
Sbjct: 40  RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98

Query: 93  SR 94
           +R
Sbjct: 99  AR 100


>gi|72389855|ref|XP_845222.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389865|ref|XP_845227.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389867|ref|XP_845228.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359930|gb|AAX80355.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359935|gb|AAX80360.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359936|gb|AAX80361.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801757|gb|AAZ11663.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801762|gb|AAZ11668.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801763|gb|AAZ11669.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F  F + + K Y   +E A RF  FE+N++  + +    +  AT+G+   SD+TREE +
Sbjct: 40  RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98

Query: 93  SR 94
           +R
Sbjct: 99  AR 100


>gi|72389859|ref|XP_845224.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359932|gb|AAX80357.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801759|gb|AAZ11665.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 450

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F  F + + K Y   +E A RF  FE+N++  + +    +  AT+G+   SD+TREE +
Sbjct: 40  RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98

Query: 93  SR 94
           +R
Sbjct: 99  AR 100


>gi|10391|emb|CAA38238.1| unnamed protein product [Trypanosoma brucei]
          Length = 450

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F  F + + K Y   +E A RF  FE+N++  + +    +  AT+G+   SD+TREE +
Sbjct: 40  RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98

Query: 93  SR 94
           +R
Sbjct: 99  AR 100


>gi|15485586|emb|CAC67416.1| cysteine protease [Trypanosoma brucei rhodesiense]
          Length = 450

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F  F + + K Y   +E A RF  FE+N++  + +    +  AT+G+   SD+TREE +
Sbjct: 40  RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98

Query: 93  SR 94
           +R
Sbjct: 99  AR 100


>gi|313224805|emb|CBY20597.1| unnamed protein product [Oikopleura dioica]
          Length = 343

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 35/64 (54%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           L+ F ++  +FSK Y T EE   R   F  N ++I   N+ E  T T G+N  +DLT  E
Sbjct: 39  LRAFRQYEVEFSKMYETAEERRIRAQTFSKNFEMITSHNQREDVTWTMGLNFDADLTFSE 98

Query: 91  MKSR 94
            +SR
Sbjct: 99  FQSR 102


>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
          Length = 336

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 2/61 (3%)

Query: 38 IRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR-LG 96
          I  + K+Y + EE  +RF VF+DNL  I+D+NK +  +   G+N  +DLT +E K+  LG
Sbjct: 33 IVGYRKAYASFEEKVRRFEVFKDNLNHIDDINK-KVTSYWLGLNEFADLTHDEFKATYLG 91

Query: 97 L 97
          L
Sbjct: 92 L 92


>gi|156553312|ref|XP_001599758.1| PREDICTED: cathepsin O-like [Nasonia vitripennis]
          Length = 345

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 25/69 (36%), Positives = 39/69 (56%), Gaps = 6/69 (8%)

Query: 34 FEKFIRDFSKSYPTK-EEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGINHLSDLTREE 90
          FE +++D+ K Y    +E  +RF  F+ +L+ IE LN  +    +A YG+   SD+T +E
Sbjct: 26 FEAYVQDYKKPYKNDPDEYERRFGRFQQSLRKIESLNRLRSSADSARYGLTDYSDMTEQE 85

Query: 91 MKSRLGLNL 99
              L LNL
Sbjct: 86 F---LALNL 91


>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
          Length = 352

 Score = 44.7 bits (104), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 28/65 (43%), Positives = 39/65 (60%), Gaps = 2/65 (3%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE ++   SK Y + +E   RF +F DNLK I+D NK +      G+N  +DLT EE K+
Sbjct: 49  FESWLAKHSKIYESLDEKLHRFEIFMDNLKHIDDTNK-KVSNYWLGLNEFADLTHEEFKN 107

Query: 94  R-LGL 97
           + LGL
Sbjct: 108 KFLGL 112


>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
 gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 25/86 (29%), Positives = 45/86 (52%), Gaps = 1/86 (1%)

Query: 9  ATLALFGQMKSNNELKT-ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIED 67
          A + L G + S    +T ++    ++ E+++  F + Y    E   R+ +F++N++ IE 
Sbjct: 13 ALIFLLGALVSQAMARTLQDASMHEKHEEWMSRFGRVYNDGNEKEIRYKIFKENVQRIES 72

Query: 68 LNKGEHGTATYGINHLSDLTREEMKS 93
           NK    +   GIN  +DLT EE K+
Sbjct: 73 FNKASGKSYKLGINQFADLTNEEFKT 98


>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
 gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
 gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
 gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
          Length = 307

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 20/63 (31%), Positives = 36/63 (57%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          LK+ E+++    + Y   +E  KR+ +F++N++ IE  N G       G+N  +DLT EE
Sbjct: 2  LKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEE 61

Query: 91 MKS 93
           ++
Sbjct: 62 FRA 64


>gi|125552771|gb|EAY98480.1| hypothetical protein OsI_20393 [Oryza sativa Indica Group]
          Length = 296

 Score = 44.7 bits (104), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 39/72 (54%)

Query: 21  NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
            E+K +      +F   ++++ +SY T+EE A+R+ VF++     + +N  E  T  YG 
Sbjct: 178 QEVKVDEATMKARFHDLMKEYGRSYSTEEEKARRYEVFKEATLWADKVNALEPRTIPYGP 237

Query: 81  NHLSDLTREEMK 92
           N  +D T EE K
Sbjct: 238 NGYADFTDEEFK 249


>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
          Length = 362

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 7/98 (7%)

Query: 3   EDASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNL 62
           + A  E    L  Q+K++  L    P H + +++F   F K Y T EE  KRF +F D L
Sbjct: 27  QPAKVEHASNLKLQVKASTRL---GPYH-ETWKEFKTLFGKVYDTVEEEIKRFDIFRDTL 82

Query: 63  KLIEDLNKGEH-GTATY--GINHLSDLTREEMKSRLGL 97
           + IE+ N+  H G  +Y  G+N  SD++ +E     GL
Sbjct: 83  ERIEEHNRKYHMGQKSYYMGVNQFSDMSHDEYLRHNGL 120


>gi|66730453|ref|NP_001019413.1| cathepsin W precursor [Rattus norvegicus]
 gi|62531092|gb|AAH93401.1| Cathepsin W [Rattus norvegicus]
 gi|149062072|gb|EDM12495.1| cathepsin W [Rattus norvegicus]
          Length = 371

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 25/70 (35%), Positives = 35/70 (50%), Gaps = 1/70 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F+ F   F++SY    E  +R  +F  NL   + L + + GTA +G    SDL
Sbjct: 33  PLELKEVFKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTAEFGQTPFSDL 92

Query: 87  TREEMKSRLG 96
           T EE     G
Sbjct: 93  TEEEFGQLYG 102


>gi|444724527|gb|ELW65130.1| Cathepsin W [Tupaia chinensis]
          Length = 491

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 28  PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           P  LK+ F  F   +++SY +  E A+R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 161 PLELKEVFALFQIQYNRSYSSPAEHARRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDL 220

Query: 87  TREE 90
           T EE
Sbjct: 221 TDEE 224


>gi|356569685|ref|XP_003553027.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 3-like [Glycine
           max]
          Length = 428

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 24/61 (39%), Positives = 31/61 (50%), Gaps = 1/61 (1%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H   F +F     K Y +  E+   F +F DNLKLI   N+    T T G+NH +D T E
Sbjct: 50  HALSFARFACRHDKRYHSVGEIRNDFQIFSDNLKLIRSTNR-RSLTYTLGVNHFADWTWE 108

Query: 90  E 90
           E
Sbjct: 109 E 109


>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
 gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
          Length = 331

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 2/61 (3%)

Query: 43  KSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-KSRLGLNLSK 101
           K YP K E   R  ++++NLK I   N+G+H +    +NHL D+T  E+ ++ LGL L K
Sbjct: 38  KEYPNKNEETMRNFIWQNNLKKIVTHNEGKH-SFKLAMNHLGDMTSLEISQTLLGLKLKK 96

Query: 102 H 102
           H
Sbjct: 97  H 97


>gi|357613024|gb|EHJ68277.1| BCP inhibitor [Danaus plexippus]
          Length = 90

 Score = 44.7 bits (104), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 35/68 (51%), Gaps = 1/68 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FEKFI+DF K+Y   E+    +  F  +LK I  LN  E    TY IN  +D T  + + 
Sbjct: 22  FEKFIKDFDKTYKDAEDREIHYQAFVQSLKDINRLN-SEQPDTTYDINQFADYTEADQQG 80

Query: 94  RLGLNLSK 101
             GL L +
Sbjct: 81  MRGLILPE 88


>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 19/62 (30%), Positives = 35/62 (56%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          ++ E ++  + + Y   +E +KR+ +F+DN+  IE  NK    +    IN  +DLT EE 
Sbjct: 37 ERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEF 96

Query: 92 KS 93
          ++
Sbjct: 97 RA 98


>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 20/64 (31%), Positives = 35/64 (54%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +  +E ++    KSY    E  +RF +F+DN   I++ N  +  +   G+N  +DLT EE
Sbjct: 41  MAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNEE 100

Query: 91  MKSR 94
            +S+
Sbjct: 101 YRSK 104


>gi|402584107|gb|EJW78049.1| hypothetical protein WUBG_11042, partial [Wuchereria bancrofti]
          Length = 213

 Score = 44.7 bits (104), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 34/52 (65%)

Query: 41 FSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          +++ Y +K+E  KRF +++ NL+L + +   E GTA YG    SD+T+EE +
Sbjct: 1  YNRKYRSKKEFLKRFRIYKRNLRLAKLIQNKEEGTAIYGETPYSDMTQEEFR 52


>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
 gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 24/66 (36%), Positives = 35/66 (53%), Gaps = 1/66 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
          FE +  +  KSY + EE   R  VF DN + +   N  ++ + T  +N  +DLT  E K 
Sbjct: 29 FEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTHHEFKV 88

Query: 93 SRLGLN 98
          SRLG +
Sbjct: 89 SRLGFS 94


>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 385

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 23/70 (32%), Positives = 36/70 (51%)

Query: 25  TENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLS 84
           + NP   + +E F  + +K Y +  E   R  +FE+N + IED N  +      G+NH  
Sbjct: 72  SPNPNLNQHWENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKEFDFYLGMNHFG 131

Query: 85  DLTREEMKSR 94
           DLT +E + R
Sbjct: 132 DLTNKEYRER 141


>gi|1019667|gb|AAA79287.1| rangelipain, partial [Trypanosoma rangeli]
          Length = 263

 Score = 44.7 bits (104), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 35/63 (55%), Gaps = 1/63 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           QF  F +   K Y +  E A R  VF++NL L   L+   +  A++G+   SDLTREE 
Sbjct: 36 SQFAAFKQRHGKVYGSAAEEAFRLGVFKENL-LFARLHAAANPHASFGVTPFSDLTREEF 94

Query: 92 KSR 94
          +SR
Sbjct: 95 RSR 97


>gi|357631370|gb|EHJ78915.1| cathepsin [Danaus plexippus]
          Length = 327

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 24/67 (35%), Positives = 42/67 (62%), Gaps = 3/67 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-K 92
           F+K++ ++ K Y  +EE    + +F+DNL+ I +LNK  + T  Y IN  +DL  EE+  
Sbjct: 37  FQKYVIEYDKHY-NEEEYWAHYEIFKDNLEKINELNKNSNST-VYDINQFTDLKFEEVAN 94

Query: 93  SRLGLNL 99
           + +G++L
Sbjct: 95  TYMGMSL 101


>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 27/86 (31%), Positives = 42/86 (48%), Gaps = 1/86 (1%)

Query: 9  ATLALFGQMK-SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIED 67
          A L LFG    S N    E+    ++ E+++    K Y    E   R+ +F+ N+K IE 
Sbjct: 13 ALLLLFGFWAFSANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIEG 72

Query: 68 LNKGEHGTATYGINHLSDLTREEMKS 93
           N   + +   G+N  +DLT EE K+
Sbjct: 73 FNNAGNKSHKLGVNQFADLTEEEFKA 98


>gi|343417244|emb|CCD20093.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 454

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F + + +SY T  E A R  VFEDN++    +    +  AT+G+   SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92

Query: 94 R 94
          R
Sbjct: 93 R 93


>gi|343412462|emb|CCD21670.1| cysteine peptidase (CP), putative [Trypanosoma vivax Y486]
          Length = 367

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F + + +SY T  E A R  VFEDN++    +    +  AT+G+   SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92

Query: 94 R 94
          R
Sbjct: 93 R 93


>gi|340053971|emb|CCC48265.1| cysteine peptidase precursor, fragment, partial [Trypanosoma
          vivax Y486]
          Length = 389

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F + + +SY T  E A R  VFEDN++    +    +  AT+G+   SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92

Query: 94 R 94
          R
Sbjct: 93 R 93


>gi|340053968|emb|CCC48262.1| cysteine peptidase, Clan CA, family C1,Cathepsin L-like,
          fragment, partial [Trypanosoma vivax Y486]
          Length = 323

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F + + +SY T  E A R  VFEDN++    +    +  AT+G+   SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92

Query: 94 R 94
          R
Sbjct: 93 R 93


>gi|340053966|emb|CCC48259.1| cysteine peptidase precursor, fragment, partial [Trypanosoma
          vivax Y486]
          Length = 447

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F + + +SY T  E A R  VFEDN++    +    +  AT+G+   SDLT EE ++
Sbjct: 26 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 84

Query: 94 R 94
          R
Sbjct: 85 R 85


>gi|340053965|emb|CCC48258.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 441

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F + + +SY T  E A R  VFEDN++    +    +  AT+G+   SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92

Query: 94 R 94
          R
Sbjct: 93 R 93


>gi|340053963|emb|CCC48256.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 452

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F + + +SY T  E A R  VFEDN++    +    +  AT+G+   SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92

Query: 94 R 94
          R
Sbjct: 93 R 93


>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 19/62 (30%), Positives = 35/62 (56%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          ++ E ++  + + Y   +E +KR+ +F+DN+  IE  NK    +    IN  +DLT EE 
Sbjct: 37 ERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEF 96

Query: 92 KS 93
          ++
Sbjct: 97 RA 98


>gi|115533514|ref|NP_001041280.1| Protein R07E3.1, isoform a [Caenorhabditis elegans]
 gi|3878958|emb|CAA89070.1| Protein R07E3.1, isoform a [Caenorhabditis elegans]
          Length = 402

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 1/65 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATYGINHLSDLTREE 90
           K++  +   F KSY T +E  KR   + +  + I + N + EHG+A YG N +SD T EE
Sbjct: 88  KEYIAYTEKFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEYGHNDMSDWTDEE 147

Query: 91  MKSRL 95
            +  L
Sbjct: 148 FEKTL 152


>gi|209170907|ref|YP_002268053.1| agip23 [Agrotis ipsilon multiple nucleopolyhedrovirus]
 gi|208436498|gb|ACI28725.1| viral cathepsin [Agrotis ipsilon multiple nucleopolyhedrovirus]
          Length = 364

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FEKFI  ++K Y  ++E   R+ +F  N++ I   N   + +A Y IN  +D+T+ E+  
Sbjct: 67  FEKFISQYNKHYKNEDEKKYRYNIFRHNIESINHKNS-RNDSAVYKINRFADMTKNEVVI 125

Query: 94  R 94
           R
Sbjct: 126 R 126


>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score = 44.7 bits (104), Expect = 0.008,   Method: Composition-based stats.
 Identities = 19/62 (30%), Positives = 35/62 (56%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          ++ E ++  + + Y   +E +KR+ +F+DN+  IE  NK    +    IN  +DLT EE 
Sbjct: 37 ERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEF 96

Query: 92 KS 93
          ++
Sbjct: 97 RA 98


>gi|313221004|emb|CBY31836.1| unnamed protein product [Oikopleura dioica]
          Length = 323

 Score = 44.3 bits (103), Expect = 0.008,   Method: Composition-based stats.
 Identities = 26/59 (44%), Positives = 36/59 (61%), Gaps = 1/59 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          FE +  +  KSY T EE   R  VFE+N+  IE +NK E+ + T G+N  SDLT +E +
Sbjct: 18 FEDWTAEHWKSYETAEEEKFRKGVFEENVAKIEQINK-ENRSWTAGLNKFSDLTWDEFQ 75


>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
 gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|223946183|gb|ACN27175.1| unknown [Zea mays]
 gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 385

 Score = 44.3 bits (103), Expect = 0.008,   Method: Composition-based stats.
 Identities = 25/68 (36%), Positives = 42/68 (61%), Gaps = 2/68 (2%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE+++    ++Y + EE  +RF VF+DNL  I++ N+ +  +   G+N  +DLT +E K+
Sbjct: 59  FERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNR-KVSSYWLGLNEFADLTHDEFKA 117

Query: 94  R-LGLNLS 100
             LGL  S
Sbjct: 118 TYLGLRSS 125


>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 381

 Score = 44.3 bits (103), Expect = 0.008,   Method: Composition-based stats.
 Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 5/84 (5%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           + NEL+     H   F  F+R F KSY   +E   R +VF  NL+      + +  +A +
Sbjct: 46  AENELELNAEAH---FASFVRRFGKSYRDADEHEHRLSVFRANLRRARRHQRLD-PSAVH 101

Query: 79  GINHLSDLTREEMKSR-LGLNLSK 101
           GI   SDLT +E + R LGL  S+
Sbjct: 102 GITKFSDLTPDEFRERFLGLRKSR 125


>gi|413947586|gb|AFW80235.1| hypothetical protein ZEAMMB73_542371 [Zea mays]
          Length = 264

 Score = 44.3 bits (103), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 19/63 (30%), Positives = 33/63 (52%)

Query: 35  EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
           E+++ ++ + Y    + A+RF VF+DN   +E  N  +      G+N  +DLT E  K+ 
Sbjct: 42  ERWMAEYGRVYKDAADKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTEAFKAN 101

Query: 95  LGL 97
            G 
Sbjct: 102 KGF 104


>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score = 44.3 bits (103), Expect = 0.008,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK- 92
           F+ + +   K+Y ++EE  +R  +F+DN   +   N   + T +  +N  +DLT  E K 
Sbjct: 32  FDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91

Query: 93  SRLGLNLS 100
           SRLGL++S
Sbjct: 92  SRLGLSVS 99


>gi|311247276|ref|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa]
          Length = 367

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 1/64 (1%)

Query: 28 PEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
          P  LK+ F  F   +++SY    E A+R  +F  NL   + L + + GTA +G+   SDL
Sbjct: 35 PMGLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDL 94

Query: 87 TREE 90
          T EE
Sbjct: 95 TEEE 98


>gi|332376813|gb|AEE63546.1| unknown [Dendroctonus ponderosae]
          Length = 312

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 27/67 (40%), Positives = 42/67 (62%), Gaps = 3/67 (4%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSDLTR 88
          +++ KF    +K Y T  E   RFA+F++N+++IE+ N+  E G ATY   +N  +DL+R
Sbjct: 23 EKWLKFKNQHNKVYETVYEEKLRFAIFQENVQIIEEQNRLYEAGEATYRMAVNKFADLSR 82

Query: 89 EEMKSRL 95
          EE  S L
Sbjct: 83 EEYLSIL 89


>gi|195379510|ref|XP_002048521.1| GJ11312 [Drosophila virilis]
 gi|194155679|gb|EDW70863.1| GJ11312 [Drosophila virilis]
          Length = 549

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 2/88 (2%)

Query: 14  FGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
           F       E  + + EH+ K F  F R     Y  ++E   R  +F  NL+ I   N+ +
Sbjct: 224 FATFNPMQEFISGSDEHVDKAFHHFKRKHGVDYRNEKEHEHRKNIFRQNLRYIHSKNRAK 283

Query: 73  HGTATYGINHLSDLTREEMKSRLGLNLS 100
             T    +NHL+D T EE+K+R G   S
Sbjct: 284 L-TYKLAVNHLADKTEEELKARRGYKSS 310


>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
          Length = 339

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 1/58 (1%)

Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          E+++  + + Y    E A+RF VF+ N+  IE  N G H     G+N  +DLT +E +
Sbjct: 38 ERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNH-NFWLGVNQFADLTNDEFR 94


>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
 gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
          Length = 339

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 1/58 (1%)

Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          E+++  + + Y    E A+RF VF+ N+  IE  N G H     G+N  +DLT +E +
Sbjct: 38 ERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNH-NFWLGVNQFADLTNDEFR 94


>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
 gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
          Length = 338

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 19/67 (28%), Positives = 35/67 (52%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++ E ++ ++ + Y    E A+RF  F+ N+  +E  N  +      G+N  +DLT EE
Sbjct: 33 VERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFADLTTEE 92

Query: 91 MKSRLGL 97
           K+  G 
Sbjct: 93 FKANKGF 99


>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like
           [Strongylocentrotus purpuratus]
          Length = 453

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 26/93 (27%), Positives = 47/93 (50%), Gaps = 6/93 (6%)

Query: 6   SAEATLALFGQ---MKSNNELKTENPEHLKQFEKFIRDFSKSYPTKE---EVAKRFAVFE 59
           ++E++L+L  Q   +  + +      E+   F+KF+  F + Y   +   E   R++VF 
Sbjct: 125 NSESSLSLKAQDFSITKDCQASDIKDEYRDLFDKFLMTFKREYRQNDGTNEYEYRYSVFV 184

Query: 60  DNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
            N+  +E  N+ E GTA YG    +D+T  E +
Sbjct: 185 QNMLTVEMFNQFEQGTAKYGPTKFADMTEAEFR 217


>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella
          moellendorffii]
 gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella
          moellendorffii]
          Length = 358

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 20/60 (33%), Positives = 36/60 (60%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +EK++ D  + Y    E  +RF +F DN + IE+ N+  + T   G+N+ +D+T +E K+
Sbjct: 34 YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFKA 93


>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
          Length = 368

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 22/72 (30%), Positives = 38/72 (52%)

Query: 22 ELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
          + K  N E    +E ++    KSY +  E  +RF +F++ L+ I++ N     +   G+N
Sbjct: 26 DAKRTNDEVKAMYESWLIKHGKSYNSLGERERRFEIFKETLRFIDEHNADTSRSYKVGLN 85

Query: 82 HLSDLTREEMKS 93
            +DLT EE +S
Sbjct: 86 QFADLTNEEFRS 97


>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 20/59 (33%), Positives = 34/59 (57%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E+++  + K Y   +E  KRF VF++N+  IE  N   + +   GIN  +DLT +E
Sbjct: 37 ERHEQWMTRYGKVYKDPQEREKRFRVFKENVNYIEAFNNAANKSYKLGINQFADLTNKE 95


>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella
          moellendorffii]
 gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella
          moellendorffii]
          Length = 358

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 20/60 (33%), Positives = 36/60 (60%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +EK++ D  + Y    E  +RF +F DN + IE+ N+  + T   G+N+ +D+T +E K+
Sbjct: 34 YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFKA 93


>gi|326523323|dbj|BAJ88702.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 161

 Score = 44.3 bits (103), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 38/64 (59%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E  + F +++ ++ K Y +  E  +R+A+F+D L+ ++ LN        YGIN LSD+T 
Sbjct: 64  ETRRVFAEWMVEYGKKYSSAGEEDRRYALFKDELRRVDLLNAAFGPNPIYGINFLSDITD 123

Query: 89  EEMK 92
           +E +
Sbjct: 124 KEWR 127


>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
 gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
          Length = 471

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 37/62 (59%), Gaps = 1/62 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           + +E ++ +  K+Y    E  KRF +F+DNL+ I++ N  +  +   G+N  +DLT EE 
Sbjct: 49  RMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSVDR-SYKVGLNRFADLTNEEY 107

Query: 92  KS 93
           K+
Sbjct: 108 KA 109


>gi|125979159|ref|XP_001353612.1| GA21427 [Drosophila pseudoobscura pseudoobscura]
 gi|54642377|gb|EAL31126.1| GA21427 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 29/90 (32%), Positives = 44/90 (48%), Gaps = 5/90 (5%)

Query: 12  ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           A F  M+   E  +   EH+ + F  F      +Y  ++E   R  +F  NL+ I   N+
Sbjct: 225 ATFNPMQ---EFISHTDEHVDRAFHHFKHKHGMAYRNEQEHEHRKNIFRQNLRYIHSKNR 281

Query: 71  GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
            +  T T  +NHL+D T EE+K+R G   S
Sbjct: 282 AKL-TYTLAVNHLADKTEEELKARRGYKSS 310


>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
 gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
          Length = 337

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 20/59 (33%), Positives = 33/59 (55%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E++++ + K Y    E  KR  +F+DN++ IE  N   +      INHL+D T EE
Sbjct: 36 ERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNRPYKLSINHLADQTNEE 94


>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
          Length = 341

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 19/62 (30%), Positives = 35/62 (56%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          ++ E ++  + + Y   +E +KR+ +F+DN+  IE  NK    +    IN  +DLT EE 
Sbjct: 37 ERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEF 96

Query: 92 KS 93
          ++
Sbjct: 97 RA 98


>gi|215261456|pdb|3F75|P Chain P, Activated Toxoplasma Gondii Cathepsin L (Tgcpl) In Complex
           With Its Propeptide
          Length = 106

 Score = 44.3 bits (103), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 42/69 (60%), Gaps = 2/69 (2%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  F   ++KSY T+EE  +R+A+F++NL  I   N+  + + +  +NH  DL+R+E + 
Sbjct: 25  FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRR 83

Query: 94  R-LGLNLSK 101
           + LG   S+
Sbjct: 84  KYLGFKKSR 92


>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis
          vinifera]
          Length = 341

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 19/62 (30%), Positives = 35/62 (56%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          ++ E ++  + + Y   +E +KR+ +F+DN+  IE  NK    +    IN  +DLT EE 
Sbjct: 37 ERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEF 96

Query: 92 KS 93
          ++
Sbjct: 97 RA 98


>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 19/62 (30%), Positives = 35/62 (56%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          ++ E ++  + + Y   +E +KR+ +F+DN+  IE  NK    +    IN  +DLT EE 
Sbjct: 37 ERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEF 96

Query: 92 KS 93
          ++
Sbjct: 97 RA 98


>gi|297804580|ref|XP_002870174.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316010|gb|EFH46433.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 373

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 6/77 (7%)

Query: 26  ENPEHL----KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
           EN EHL      F  F   + K+Y T+EE   RF VF+ NL+     N+    +A +G+ 
Sbjct: 43  ENDEHLLNAEHHFSLFKSKYEKTYATQEEHDHRFRVFKANLRRARR-NQLLDPSAVHGVT 101

Query: 82  HLSDLTREEMKSR-LGL 97
             SDLT +E + + LGL
Sbjct: 102 QFSDLTPKEFRRKFLGL 118


>gi|91992512|gb|ABE72972.1| cathepsin L [Aedes aegypti]
          Length = 548

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 4/90 (4%)

Query: 12  ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           A F  M+     ++E  EHL  +F +F     KSY  ++E   R  +F  NL+ I   N+
Sbjct: 223 ATFNPMQEFIHPRSE--EHLDNEFTRFRYKHGKSYHNEKEHDLRRDIFRQNLRFIHSHNR 280

Query: 71  GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
              G  T  +NHL+D T EE+K+  G   S
Sbjct: 281 AGKGF-TVAVNHLADRTDEELKALRGFKSS 309


>gi|33242865|gb|AAQ01137.1| cathepsin [Branchiostoma lanceolatum]
          Length = 328

 Score = 44.3 bits (103), Expect = 0.009,   Method: Composition-based stats.
 Identities = 26/72 (36%), Positives = 43/72 (59%), Gaps = 6/72 (8%)

Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI----EDLNKGEHGTATY 78
          + T +P    Q+E F + +++ Y  +EE A+R  +FEDNLK I    E+ ++G H T   
Sbjct: 12 MATASPLMNPQWEVFKKAYNRVYAAEEEFARRL-IFEDNLKTIQMHNEEADRGLH-TFRL 69

Query: 79 GINHLSDLTREE 90
          G+N  +D+T +E
Sbjct: 70 GVNQYADMTHKE 81


>gi|1581747|prf||2117247C Cys protease:ISOTYPE=3
          Length = 469

 Score = 44.3 bits (103), Expect = 0.010,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF  F +   K Y +  E   R  VF++NL L   L+   +  A++G+   SDLTREE +
Sbjct: 37 QFAAFKQRHGKVYGSAAEETFRLGVFKENL-LFARLHAAANPHASFGVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|223998002|ref|XP_002288674.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975782|gb|EED94110.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 415

 Score = 44.3 bits (103), Expect = 0.010,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 10/77 (12%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT--------YGINHLSD 85
           FE+++ +F KSY   +E  +R  +F +NL++I + NKG    ++         G+N  +D
Sbjct: 40  FEQYLANFDKSYSNPDEFTRRSRIFNNNLQIILNHNKGRDMDSSGRVKEGFVMGVNQFTD 99

Query: 86  LTREEMKSRLGLNLSKH 102
           + R E+   +G N   H
Sbjct: 100 VERSELP--MGYNKGLH 114


>gi|29567137|ref|NP_818699.1| cathepsin [Adoxophyes honmai NPV]
 gi|37076951|sp|Q80LP4.1|CATV_NPVAH RecName: Full=Viral cathepsin; Short=V-cath; AltName:
          Full=Cysteine proteinase; Short=CP; Flags: Precursor
 gi|29467913|dbj|BAC67303.1| cathepsin [Adoxophyes honmai NPV]
          Length = 337

 Score = 44.3 bits (103), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE FI +++K YP  +    RF +F+ NL+ I + NK  + +A Y IN  SDL++ E+ +
Sbjct: 32 FETFIINYNKQYPDTKTKNYRFKIFKQNLEDINEKNK-LNDSAIYNINKFSDLSKNELLT 90

Query: 94 R 94
          +
Sbjct: 91 K 91


>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
          Length = 372

 Score = 44.3 bits (103), Expect = 0.010,   Method: Composition-based stats.
 Identities = 20/67 (29%), Positives = 38/67 (56%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E +  ++ ++    K+Y    E  KRF +F+DNL+ I++ N   + T   G+N  +DLT 
Sbjct: 41  EVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTN 100

Query: 89  EEMKSRL 95
           +E +++ 
Sbjct: 101 QEYRAKF 107


>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
 gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score = 44.3 bits (103), Expect = 0.010,   Method: Composition-based stats.
 Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 2/83 (2%)

Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
          LA+     ++ EL     E   + EK++    K Y   +E  +RF +F+ N+  IE  N 
Sbjct: 18 LAMCADQAASREL--HELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIFKSNVVFIESFNT 75

Query: 71 GEHGTATYGINHLSDLTREEMKS 93
            + +   GIN  +DLT EE ++
Sbjct: 76 AGNKSYMLGINKFADLTNEEFRA 98


>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
          Length = 339

 Score = 44.3 bits (103), Expect = 0.010,   Method: Composition-based stats.
 Identities = 19/70 (27%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
          ++++   + + E+++  + + Y    E A+RF +F+ N+  IE  N G H     G+N  
Sbjct: 27 QSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNH-KFWLGVNQF 85

Query: 84 SDLTREEMKS 93
          +DLT  E ++
Sbjct: 86 ADLTNYEFRA 95


>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
 gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
          Length = 339

 Score = 44.3 bits (103), Expect = 0.010,   Method: Composition-based stats.
 Identities = 19/70 (27%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 24 KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
          ++++   + + E+++  + + Y    E A+RF +F+ N+  IE  N G H     G+N  
Sbjct: 27 QSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNH-KFWLGVNQF 85

Query: 84 SDLTREEMKS 93
          +DLT  E ++
Sbjct: 86 ADLTNYEFRA 95


>gi|324518532|gb|ADY47133.1| Cysteine proteinase [Ascaris suum]
          Length = 334

 Score = 44.3 bits (103), Expect = 0.011,   Method: Composition-based stats.
 Identities = 21/74 (28%), Positives = 41/74 (55%), Gaps = 2/74 (2%)

Query: 18 KSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA 76
          +   E    + E ++  + +F+ D+ ++  T++E   RFA+F+ N+ LI++LN   + + 
Sbjct: 17 RGEAEYSKNDTEQMRTLYNQFLHDYRRTNITEDEYKFRFAIFQKNMLLIDELNS-RNDSI 75

Query: 77 TYGINHLSDLTREE 90
           YGI   +D T  E
Sbjct: 76 VYGITQFADWTDSE 89


>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
 gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
          Length = 337

 Score = 44.3 bits (103), Expect = 0.011,   Method: Composition-based stats.
 Identities = 19/67 (28%), Positives = 35/67 (52%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++ E ++ ++ + Y    E A+RF  F+ N+  +E  N  +      G+N  +DLT EE
Sbjct: 33 VERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFADLTTEE 92

Query: 91 MKSRLGL 97
           K+  G 
Sbjct: 93 FKANKGF 99


>gi|33667928|gb|AAQ24541.1| Blo t 1 allergen [Blomia tropicalis]
          Length = 333

 Score = 43.9 bits (102), Expect = 0.011,   Method: Composition-based stats.
 Identities = 29/75 (38%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E +K FE+F + F K Y   EE A+R   F++ LK +E+ N G  G   Y IN  SD++ 
Sbjct: 23  EEIKTFEQFKKVFGKVYRNAEEEARREHHFKEQLKWVEEHN-GIDGV-EYAINEYSDMSE 80

Query: 89  EEMKSRL---GLNLS 100
           +E    L   GLN +
Sbjct: 81  QEFSFHLSGGGLNFT 95


>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score = 43.9 bits (102), Expect = 0.011,   Method: Composition-based stats.
 Identities = 27/73 (36%), Positives = 40/73 (54%), Gaps = 6/73 (8%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI-----EDLNKGEHGTATYGINHLSDLTR 88
           FEK+ ++ SK+Y ++EE   R  VFEDN   +        N   + + T  +N  +DLT 
Sbjct: 33  FEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADLTH 92

Query: 89  EEMK-SRLGLNLS 100
            E K +RLGL L+
Sbjct: 93  HEFKTTRLGLPLT 105


>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
 gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
          Length = 350

 Score = 43.9 bits (102), Expect = 0.011,   Method: Composition-based stats.
 Identities = 21/67 (31%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 35  EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE-MKS 93
           ++++  + ++Y    E+ KR  +F++NL+ IE+ N   + +   G+N  SDLT EE + S
Sbjct: 34  QQWMMKYERTYTNSSEMEKRKKIFKENLEYIENFNNVGNKSYKLGLNRYSDLTSEEFIAS 93

Query: 94  RLGLNLS 100
             G  +S
Sbjct: 94  HTGFKVS 100


>gi|195428245|ref|XP_002062184.1| GK16790 [Drosophila willistoni]
 gi|194158269|gb|EDW73170.1| GK16790 [Drosophila willistoni]
          Length = 549

 Score = 43.9 bits (102), Expect = 0.011,   Method: Composition-based stats.
 Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 1/67 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           F  F R    +Y + +E   R  +F  NL+ I   N+ +  T T  +NHL+D T EE+K+
Sbjct: 245 FHHFKRKHGVAYRSDKEHEHRKNIFRQNLRYIHSKNRAKL-TYTLAVNHLADKTEEELKA 303

Query: 94  RLGLNLS 100
           R G   S
Sbjct: 304 RRGYKSS 310


>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
 gi|255640677|gb|ACU20623.1| unknown [Glycine max]
          Length = 366

 Score = 43.9 bits (102), Expect = 0.011,   Method: Composition-based stats.
 Identities = 22/77 (28%), Positives = 42/77 (54%)

Query: 17 MKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTA 76
          +K++  +   + E +  +E+++    K Y    +  KRF VF+DNL  I++ N   + T 
Sbjct: 21 IKTSTIINYTDNEVMAMYEEWLVRHQKGYNELGKKDKRFQVFKDNLGFIQEHNNNLNNTY 80

Query: 77 TYGINHLSDLTREEMKS 93
            G+N  +D+T EE ++
Sbjct: 81 KLGLNKFADMTNEEYRA 97


>gi|20151497|gb|AAM11108.1| GM07827p [Drosophila melanogaster]
          Length = 219

 Score = 43.9 bits (102), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 3/79 (3%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTAT 77
           S +E+  +N      +EKF+ DF  SY    E  KR  VF DN K I   N + + G  +
Sbjct: 58  STSEIDNDNIICQPAWEKFLIDFKPSYQDDTETEKRRNVFCDNFKSIHKHNVQFDLGNIS 117

Query: 78  Y--GINHLSDLTREEMKSR 94
           +  GIN  SDLT EE K++
Sbjct: 118 FKKGINQWSDLTVEEWKNK 136


>gi|2499879|sp|Q40143.1|CYSP3_SOLLC RecName: Full=Cysteine proteinase 3; Flags: Precursor
 gi|1235545|emb|CAA88629.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
          Length = 356

 Score = 43.9 bits (102), Expect = 0.011,   Method: Composition-based stats.
 Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 2/64 (3%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-K 92
           F +F     K Y + EE+ +RF +F DNLK+I   N+ +  +   GIN  +DLT +E  K
Sbjct: 57  FARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNR-KGLSYKLGINEFTDLTWDEFRK 115

Query: 93  SRLG 96
            +LG
Sbjct: 116 HKLG 119


>gi|194883258|ref|XP_001975720.1| GG20406 [Drosophila erecta]
 gi|190658907|gb|EDV56120.1| GG20406 [Drosophila erecta]
          Length = 345

 Score = 43.9 bits (102), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 37/70 (52%), Gaps = 3/70 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
           + KF+ DF   Y    E  KR  +F DN   I+  N + + G  ++  GIN  SDLT EE
Sbjct: 271 WNKFLIDFGPKYSDDTETKKRRNIFCDNWNSIQKHNVQYDLGNISFKKGINQWSDLTVEE 330

Query: 91  MKSRLGLNLS 100
            KS+   NLS
Sbjct: 331 WKSKQQPNLS 340



 Score = 40.0 bits (92), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 3/64 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
           +EKF+ DF + Y    E  +R  +F DN   I+  N + + G  ++  GIN  SDLT EE
Sbjct: 179 WEKFMIDFKRKYEDDNETKQRRNIFCDNWNSIQKHNVQYDLGNISFRKGINQWSDLTVEE 238

Query: 91  MKSR 94
            K +
Sbjct: 239 WKKK 242


>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
           AltName: Allergen=Car p 1; Flags: Precursor
 gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
 gi|387885|gb|AAA72774.1| papain [synthetic construct]
 gi|225437|prf||1303270A papain
          Length = 345

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 24/76 (31%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           S N+L T     ++ FE ++   +K Y   +E   RF +F+DNLK I++ NK ++ +   
Sbjct: 34  SQNDL-TSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNK-KNNSYWL 91

Query: 79  GINHLSDLTREEMKSR 94
           G+N  +D++ +E K +
Sbjct: 92  GLNVFADMSNDEFKEK 107


>gi|113195461|ref|YP_717598.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
 gi|66968272|gb|AAY59557.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
          Length = 325

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 24/67 (35%), Positives = 41/67 (61%), Gaps = 2/67 (2%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE F+ +++K Y   +E A R+ +F+ NL+ I   N+ E   A + IN  SD+++ E+ S
Sbjct: 27 FESFVANYNKMYNDTQEKAYRYKIFKHNLEEINIKNQVED-HAVFSINKFSDMSKSEIIS 85

Query: 94 RL-GLNL 99
          +  GL+L
Sbjct: 86 KYTGLSL 92


>gi|46309423|ref|YP_006313.1| ORF31 [Agrotis segetum granulovirus]
 gi|46200640|gb|AAS82707.1| ORF31 [Agrotis segetum granulovirus]
          Length = 327

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 45/74 (60%), Gaps = 4/74 (5%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           K FE F++ ++KSY ++EE   +F  F++N++ I + N   + +A Y IN  SD+ + E+
Sbjct: 23  KLFEDFVQKYNKSYSSEEERQIKFDNFKNNIRSINEKNSLSN-SAVYDINFYSDMNKNEL 81

Query: 92  ---KSRLGLNLSKH 102
              ++   +NL K+
Sbjct: 82  LRKQTGFKINLKKN 95


>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
 gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 21/45 (46%), Positives = 30/45 (66%), Gaps = 1/45 (2%)

Query: 50  EVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
           E  KRF +F+DNLK I++ N  E+ T   G+N  +DL+ EE +SR
Sbjct: 71  EKDKRFEIFKDNLKFIDEHN-AENRTYKVGLNRFADLSNEEYRSR 114


>gi|255550445|ref|XP_002516273.1| cysteine protease, putative [Ricinus communis]
 gi|223544759|gb|EEF46275.1| cysteine protease, putative [Ricinus communis]
          Length = 358

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 23/64 (35%), Positives = 37/64 (57%), Gaps = 2/64 (3%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-K 92
           F +F+    K Y +++E+  RFA+F +NL  I   N+ +  + T  +N  +DLT +E  K
Sbjct: 59  FSRFVYRHGKRYQSEDEMKMRFAIFSENLDFIRSTNR-KGLSYTLAVNDFADLTWQEFQK 117

Query: 93  SRLG 96
            RLG
Sbjct: 118 HRLG 121


>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
 gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
 gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
 gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
 gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
          Length = 366

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 28/70 (40%), Positives = 36/70 (51%), Gaps = 10/70 (14%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT----ATYGINHLSDLTR 88
            F  FIR + K Y   EE   RF VF+ NL     L   EH      A++G+   SDLT+
Sbjct: 56  HFRHFIRRYGKKYSGPEEHEHRFGVFKSNL-----LRALEHQKLDPRASHGVTKFSDLTQ 110

Query: 89  EEMKSR-LGL 97
           EE + + LGL
Sbjct: 111 EEFRHQYLGL 120


>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
          Length = 471

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 22/72 (30%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +K +E ++    K+Y    E  +RF +F+DNL+ +++ N     T   G+   +DLT EE
Sbjct: 49  MKMYEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEE 108

Query: 91  MKSR-LGLNLSK 101
            ++  LG  + K
Sbjct: 109 YRAMYLGAKMEK 120


>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
          Length = 350

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 23/63 (36%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 35  EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-KS 93
           E+++  F + Y    E A+R AVF  N + ++ +N+  + T T G+N  SDLT  E  K+
Sbjct: 41  EQWMAKFGRVYTDANEKARRQAVFGANARYVDAVNRAGNRTYTLGLNEFSDLTDNEFAKT 100

Query: 94  RLG 96
            LG
Sbjct: 101 HLG 103


>gi|294883332|ref|XP_002770713.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239873998|gb|EER02718.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 332

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 24/60 (40%), Positives = 35/60 (58%), Gaps = 1/60 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F   F K+Y +KEE  KR A+F+ NL+ IE +N  +  +   G+N  +DLT EE  +
Sbjct: 28 FMGFKHKFGKNYESKEEEVKRNAIFQANLQHIEQVNAKDL-SYKLGVNEHADLTHEEFAA 86


>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
 gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
 gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
          Length = 366

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 28/70 (40%), Positives = 36/70 (51%), Gaps = 10/70 (14%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT----ATYGINHLSDLTR 88
            F  FIR + K Y   EE   RF VF+ NL     L   EH      A++G+   SDLT+
Sbjct: 56  HFRHFIRRYGKKYSGPEEHEHRFGVFKSNL-----LRALEHQKLDPRASHGVTKFSDLTQ 110

Query: 89  EEMKSR-LGL 97
           EE + + LGL
Sbjct: 111 EEFRHQYLGL 120


>gi|2582055|gb|AAB82455.1| lymphopain [Mus musculus]
          Length = 371

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 4/90 (4%)

Query: 11  LALFGQMKSNNELKTE---NPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
           L L GQ  S++ L  +    P  LK+ F+ F   F++SY    E  +R ++F  NL   +
Sbjct: 13  LLLAGQGLSDSLLTKDAGPRPLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQ 72

Query: 67  DLNKGEHGTATYGINHLSDLTREEMKSRLG 96
            L + + GTA +G    SDLT EE     G
Sbjct: 73  RLQQEDLGTAEFGETPFSDLTEEEFGQLYG 102


>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 339

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 20/59 (33%), Positives = 32/59 (54%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E++ + + K Y    E  KR  +F+DN++ IE  N   +      INHL+D T EE
Sbjct: 38 ERHEQWTKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLSINHLTDQTNEE 96


>gi|353441042|gb|AEQ94105.1| putative drought-inducible cysteine proteinase [Elaeis guineensis]
          Length = 187

 Score = 43.9 bits (102), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 25/60 (41%), Positives = 34/60 (56%), Gaps = 1/60 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
            F  F+R F KSY  ++E A RF+VF+ NL+      K +  TA +GI   SDLT  E +
Sbjct: 54  HFSSFLRRFGKSYADEKEHAYRFSVFKANLRRARRHQKMD-PTAVHGITKFSDLTPAEFR 112


>gi|313213098|emb|CBY36961.1| unnamed protein product [Oikopleura dioica]
          Length = 326

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 25/59 (42%), Positives = 36/59 (61%), Gaps = 1/59 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          FE +  +  KSY T E+   R  VFE+N+  IE +NK E+ + T G+N  SDLT +E +
Sbjct: 18 FEDWTSEHWKSYETAEDEKFRKGVFEENIAKIEQINK-ENRSWTAGLNKFSDLTWDEFQ 75


>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
 gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
          Length = 452

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 23/75 (30%), Positives = 45/75 (60%), Gaps = 2/75 (2%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           S+ +L+ E+   ++ +E ++ +  ++Y   +E  KRF+VF+DN   I + N+G   +   
Sbjct: 28  SSKDLR-EDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHNQGNR-SYKL 85

Query: 79  GINHLSDLTREEMKS 93
           G+N  +DL+ EE K+
Sbjct: 86  GLNQFADLSHEEFKA 100


>gi|157113282|ref|XP_001657758.1| cathepsin l [Aedes aegypti]
 gi|108877803|gb|EAT42028.1| AAEL006389-PA, partial [Aedes aegypti]
          Length = 538

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 4/90 (4%)

Query: 12  ALFGQMKSNNELKTENPEHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           A F  M+     ++E  EHL  +F +F     KSY  ++E   R  +F  NL+ I   N+
Sbjct: 213 ATFNPMQEFIHPRSE--EHLDNEFTRFRYKHGKSYHNEKEHDLRRDIFRQNLRFIHSHNR 270

Query: 71  GEHGTATYGINHLSDLTREEMKSRLGLNLS 100
              G  T  +NHL+D T EE+K+  G   S
Sbjct: 271 AGKGF-TVAVNHLADRTDEELKALRGFKSS 299


>gi|31981819|ref|NP_034115.2| cathepsin W preproprotein [Mus musculus]
 gi|341940311|sp|P56203.2|CATW_MOUSE RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
           Precursor
 gi|26353368|dbj|BAC40314.1| unnamed protein product [Mus musculus]
 gi|44890089|gb|AAS48498.1| cathepsin W precursor [Mus musculus]
 gi|148701190|gb|EDL33137.1| cathepsin W, isoform CRA_b [Mus musculus]
 gi|162317774|gb|AAI56226.1| Cathepsin W [synthetic construct]
 gi|162318342|gb|AAI56999.1| Cathepsin W [synthetic construct]
          Length = 371

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 4/90 (4%)

Query: 11  LALFGQMKSNNELKTE---NPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
           L L GQ  S++ L  +    P  LK+ F+ F   F++SY    E  +R ++F  NL   +
Sbjct: 13  LLLAGQGLSDSLLTKDAGPRPLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQ 72

Query: 67  DLNKGEHGTATYGINHLSDLTREEMKSRLG 96
            L + + GTA +G    SDLT EE     G
Sbjct: 73  RLQQEDLGTAEFGETPFSDLTEEEFGQLYG 102


>gi|255626679|gb|ACU13684.1| unknown [Glycine max]
          Length = 229

 Score = 43.9 bits (102), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 36/64 (56%)

Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
          E +  +E+++    K Y    E  KRF VF+DNL  I++ N  ++ T   G+N  +D+T 
Sbjct: 35 EVMTMYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFADMTN 94

Query: 89 EEMK 92
          EE +
Sbjct: 95 EEYR 98


>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
            castaneum]
          Length = 1726

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 23/44 (52%), Positives = 27/44 (61%)

Query: 54   RFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRLGL 97
            RF VF  NL  I  LN  E GTATYGI   +D+T++E    LGL
Sbjct: 1442 RFNVFVQNLMQIRVLNTFEQGTATYGITRFADMTQKEFSRSLGL 1485


>gi|67605684|ref|XP_666697.1| cryptopain precursor [Cryptosporidium hominis TU502]
 gi|54657738|gb|EAL36466.1| cryptopain precursor [Cryptosporidium hominis]
          Length = 401

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 20/67 (29%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E+ K FE+F + ++K+Y + EE  +RF +++ N+  I+  N  +  +    +N   DL++
Sbjct: 81  EYRKSFEEFKKKYNKTYSSMEEENQRFEIYKQNMNFIKTTNS-QGFSYVLEMNEFGDLSK 139

Query: 89  EEMKSRL 95
           EE  +R 
Sbjct: 140 EEFMARF 146


>gi|1594287|gb|AAC48340.1| cathepsin L-like cysteine proteinase [Toxocara canis]
          Length = 360

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT-YGINHLSDLTRE 89
           L +FE+FIR + K Y + EE A+RF ++ +N+   + LN+      T YG N  +D    
Sbjct: 47  LDRFEEFIRKYDKVYDSNEEFAERFRIYVNNMLEAQKLNQRNRDYGTIYGENEFADWNVN 106

Query: 90  EMKSRL 95
           E +  L
Sbjct: 107 EFREIL 112


>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
          Length = 1761

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 23/44 (52%), Positives = 27/44 (61%)

Query: 54   RFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRLGL 97
            RF VF  NL  I  LN  E GTATYGI   +D+T++E    LGL
Sbjct: 1477 RFNVFVQNLMQIRVLNTFEQGTATYGITRFADMTQKEFSRSLGL 1520


>gi|6967097|emb|CAB72480.1| cysteine protease-like protein [Arabidopsis thaliana]
          Length = 377

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y + EE+  RF+VF++NL LI   NK +  +    +N  +DLT +
Sbjct: 55  HVLSFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNK-KGLSYKLSLNQFADLTWQ 113

Query: 90  EMK 92
           E +
Sbjct: 114 EFQ 116


>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
 gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 28/69 (40%), Positives = 40/69 (57%), Gaps = 6/69 (8%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY--GINHLSDLTREEM 91
          FE +I    K Y + EE   RF +F+DNL  I++ NK       Y  G+N  +DL+ EE 
Sbjct: 33 FESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNK---KVVNYWLGLNEFADLSHEEF 89

Query: 92 KSR-LGLNL 99
          K++ LGLN+
Sbjct: 90 KNKYLGLNV 98


>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 365

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 33/95 (34%), Positives = 46/95 (48%), Gaps = 6/95 (6%)

Query: 6   SAEATLALFGQMKSNNELKTE--NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLK 63
           S +A   L  Q+    E++    N EH   F  F   F K+Y TKEE   RF VF+ N++
Sbjct: 23  STDADDILIRQVVPEGEVEDHLLNAEH--HFSTFKSKFGKTYATKEEHDHRFGVFKSNMR 80

Query: 64  LIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGL 97
               L+     +A +G+   SDLT  E   + LGL
Sbjct: 81  RAR-LHAQLDPSAVHGVTKFSDLTPAEFHRKFLGL 114


>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
          Length = 460

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 22/61 (36%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG-TATYGINHLSDLTREEMK 92
          +E ++    K+Y    E  +RF +F DNL+ I+D N+ E+  + T G+   +DLT EE +
Sbjct: 38 YEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYTLGLTRFADLTNEEYR 97

Query: 93 S 93
          S
Sbjct: 98 S 98


>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
 gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
          Length = 350

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 20/61 (32%), Positives = 31/61 (50%)

Query: 35  EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
           E+++    + Y    E A+R  VF+ N+  IE  N G       G+N  +DLT EE K+ 
Sbjct: 45  ERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKAT 104

Query: 95  L 95
           +
Sbjct: 105 M 105


>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
          Length = 343

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 21/72 (29%), Positives = 42/72 (58%), Gaps = 1/72 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   GIN  +D+T EE 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGINEFADITSEEF 96

Query: 92  KSRL-GLNLSKH 102
            ++  G+N+  +
Sbjct: 97  LTKFTGINIPSY 108


>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
          Length = 350

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 20/61 (32%), Positives = 31/61 (50%)

Query: 35  EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
           E+++    + Y    E A+R  VF+ N+  IE  N G       G+N  +DLT EE K+ 
Sbjct: 45  ERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKAT 104

Query: 95  L 95
           +
Sbjct: 105 M 105


>gi|18407961|ref|NP_566880.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
 gi|73622182|sp|Q8RWQ9.1|ALEUL_ARATH RecName: Full=Thiol protease aleurain-like; Flags: Precursor
 gi|20147207|gb|AAM10319.1| AT3g45310/F18N11_70 [Arabidopsis thaliana]
 gi|332644500|gb|AEE78021.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
          Length = 358

 Score = 43.9 bits (102), Expect = 0.014,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y + EE+  RF+VF++NL LI   NK +  +    +N  +DLT +
Sbjct: 55  HVLSFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNK-KGLSYKLSLNQFADLTWQ 113

Query: 90  EMK 92
           E +
Sbjct: 114 EFQ 116


>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
 gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score = 43.9 bits (102), Expect = 0.014,   Method: Composition-based stats.
 Identities = 20/67 (29%), Positives = 38/67 (56%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E +  +  ++   SK+Y    E  KRF +F++NL+ I++ N  ++ T   G+   +DLT 
Sbjct: 43  EVISMYNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTN 102

Query: 89  EEMKSRL 95
           EE +++ 
Sbjct: 103 EEYRAKF 109


>gi|10441624|gb|AAG17127.1|AF190653_1 cathepsin L-like cysteine proteinase CAL1 [Diabrotica virgifera
          virgifera]
          Length = 322

 Score = 43.9 bits (102), Expect = 0.014,   Method: Composition-based stats.
 Identities = 26/69 (37%), Positives = 37/69 (53%), Gaps = 3/69 (4%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTR 88
          + +E F     K Y    E   RF+VF+ NLK I + N K E G   Y   +N  +D+T 
Sbjct: 19 QHWESFKVQHGKVYKNPIEERVRFSVFQANLKTINEHNAKYEQGLVGYTMAVNQFADMTP 78

Query: 89 EEMKSRLGL 97
          EE K++LG+
Sbjct: 79 EEFKAKLGM 87


>gi|13124011|sp|Q9YWK4.1|CATV_NPVBS RecName: Full=Viral cathepsin; Short=V-cath; AltName:
          Full=Cysteine proteinase; Short=CP; Flags: Precursor
 gi|3882976|gb|AAC77812.1| cathepsin [Buzura suppressaria NPV]
          Length = 331

 Score = 43.9 bits (102), Expect = 0.014,   Method: Composition-based stats.
 Identities = 23/67 (34%), Positives = 41/67 (61%), Gaps = 2/67 (2%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE F+ +++K Y    E  +RF++F+  L+ I   N+  + +A Y IN  +DL++ E+ S
Sbjct: 31 FETFLANYNKMYNDTSEKERRFSIFQQTLEEINYKNR-LNDSAVYQINKFADLSKNEIIS 89

Query: 94 RL-GLNL 99
          +  GLN+
Sbjct: 90 KYTGLNM 96


>gi|225718616|gb|ACO15154.1| Cathepsin K precursor [Caligus clemensi]
          Length = 377

 Score = 43.9 bits (102), Expect = 0.014,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 35/69 (50%)

Query: 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
          +P  +++FE+F + F K Y  +   +KR  +F  NL++I   N     +    +N  +DL
Sbjct: 24 SPYEIQRFEEFQKTFGKVYDDRMTYSKRLRIFIHNLRVINAHNANPGRSYDLAVNKFTDL 83

Query: 87 TREEMKSRL 95
          T +E   R 
Sbjct: 84 TEKEFTQRF 92


>gi|403334193|gb|EJY66252.1| Cysteine protease [Oxytricha trifallax]
          Length = 397

 Score = 43.5 bits (101), Expect = 0.014,   Method: Composition-based stats.
 Identities = 22/70 (31%), Positives = 35/70 (50%), Gaps = 3/70 (4%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT---YGINHL 83
           +PE  + F  F+    +S+ TKEE   R + F DN + I+  N+G          G+N  
Sbjct: 70  DPETQQAFSDFVAKHQRSFLTKEEYKARLSNFRDNYQTIKSHNEGRRKNGVSFKMGVNQF 129

Query: 84  SDLTREEMKS 93
           SD ++ E+ S
Sbjct: 130 SDWSKAELNS 139


>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
          Length = 365

 Score = 43.5 bits (101), Expect = 0.014,   Method: Composition-based stats.
 Identities = 28/72 (38%), Positives = 37/72 (51%), Gaps = 4/72 (5%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N EH   F  F   F K+Y TKEE   RF VF+ N++    L+     +A +G+   SDL
Sbjct: 46  NAEH--HFSTFKAKFGKTYATKEEHDHRFGVFKSNMRRAR-LHAQLDPSAVHGVTKFSDL 102

Query: 87  TREEMKSR-LGL 97
           T  E   + LGL
Sbjct: 103 TPAEFHRKFLGL 114


>gi|1581745|prf||2117247A Cys protease:ISOTYPE=1
          Length = 467

 Score = 43.5 bits (101), Expect = 0.014,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF  F +   K Y +  E A R  VF++NL L   L+   +  A++ +   SDLTREE +
Sbjct: 37 QFAAFKQRHGKVYGSAAEEAFRLGVFKENL-LFARLHAAANPHASFAVTPFSDLTREEFR 95

Query: 93 SR 94
          SR
Sbjct: 96 SR 97


>gi|79314271|ref|NP_001030812.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
 gi|332644501|gb|AEE78022.1| thiol protease aleurain-like protein [Arabidopsis thaliana]
          Length = 357

 Score = 43.5 bits (101), Expect = 0.015,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y + EE+  RF+VF++NL LI   NK +  +    +N  +DLT +
Sbjct: 55  HVLSFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNK-KGLSYKLSLNQFADLTWQ 113

Query: 90  EMK 92
           E +
Sbjct: 114 EFQ 116


>gi|194352766|emb|CAQ00111.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 384

 Score = 43.5 bits (101), Expect = 0.015,   Method: Composition-based stats.
 Identities = 21/68 (30%), Positives = 35/68 (51%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           L +F  ++    +SYPT EE  +RF V+  N++ IE  N+    + + G    +DLT +E
Sbjct: 49  LGRFHGWMAAHGRSYPTVEEKLRRFEVYRSNMEFIEAANRDSRMSYSLGETPFTDLTHDE 108

Query: 91  MKSRLGLN 98
             +    N
Sbjct: 109 FMAMYSSN 116


>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
          Length = 462

 Score = 43.5 bits (101), Expect = 0.015,   Method: Composition-based stats.
 Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 23  LKTENPEHLKQFEKFIRDFSKSYP-TKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
           L   + E +  +E ++ +  KSY     E  KRF +F+DNL+ I++ N     +   G+N
Sbjct: 38  LSRSDEEVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLN 97

Query: 82  HLSDLTREEMKS 93
             +DLT EE +S
Sbjct: 98  RFADLTNEEYRS 109


>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
          Length = 348

 Score = 43.5 bits (101), Expect = 0.015,   Method: Composition-based stats.
 Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 1/58 (1%)

Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          E+++  + + Y    E A+RF VF+ N   IE  N G H     G+N  +DLT +E +
Sbjct: 38 ERWMAQYGRMYKDDAEKARRFEVFKANAAFIESFNAGNH-KFWLGVNQFADLTNDEFR 94


>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 339

 Score = 43.5 bits (101), Expect = 0.015,   Method: Composition-based stats.
 Identities = 23/70 (32%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ EK++  + K Y    E  KRF +F++N++ IE  N          IN  +DL  EE 
Sbjct: 35  ERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEF 94

Query: 92  KSRLGLNLSK 101
           K+ L +N+ K
Sbjct: 95  KASL-INVQK 103


>gi|340053967|emb|CCC48260.1| cysteine peptidase precursor, fragment, partial [Trypanosoma
          vivax Y486]
          Length = 182

 Score = 43.5 bits (101), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F + + +SY T  E A R  VFEDN++    +    +  AT+G+   SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92

Query: 94 R 94
          R
Sbjct: 93 R 93


>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
          Length = 365

 Score = 43.5 bits (101), Expect = 0.015,   Method: Composition-based stats.
 Identities = 18/59 (30%), Positives = 33/59 (55%)

Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          ++++  + + Y T  E  +R  +F++NLK I+  NK  +     G+N  +DLT EE  +
Sbjct: 40 DQWMARYGRVYKTANEKNRRSTIFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFTT 98


>gi|91992516|gb|ABE72974.1| cathepsin L [Ochlerotatus atropalpus]
          Length = 313

 Score = 43.5 bits (101), Expect = 0.015,   Method: Composition-based stats.
 Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 2/73 (2%)

Query: 29  EHL-KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
           EHL  +F +F     K+Y   +E  +R  +F  NL+ I   N+   G  T  +NHL+D T
Sbjct: 3   EHLDNEFSRFKNKHGKNYHNDKEHDRRRDIFRQNLRFIHSHNRAGKGF-TVAVNHLADRT 61

Query: 88  REEMKSRLGLNLS 100
            EE+K+  G   S
Sbjct: 62  DEELKALRGFKSS 74


>gi|357439381|ref|XP_003589967.1| Cysteine proteinase [Medicago truncatula]
 gi|357439401|ref|XP_003589977.1| Cysteine proteinase [Medicago truncatula]
 gi|357439405|ref|XP_003589979.1| Cysteine proteinase [Medicago truncatula]
 gi|355479015|gb|AES60218.1| Cysteine proteinase [Medicago truncatula]
 gi|355479025|gb|AES60228.1| Cysteine proteinase [Medicago truncatula]
 gi|355479027|gb|AES60230.1| Cysteine proteinase [Medicago truncatula]
          Length = 127

 Score = 43.5 bits (101), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 3/71 (4%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           K F+++I ++ ++Y    E+ KR  +F++ LK ++  NK      T G+N  SD T EE 
Sbjct: 35  KAFQQWIHEYGRTYSNTTEMNKRRVIFKEELKYVKKFNKAGDEGYTIGLNQYSDWTDEEY 94

Query: 92  KSRLGLNLSKH 102
               G  L K+
Sbjct: 95  ---FGSQLPKY 102


>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 471

 Score = 43.5 bits (101), Expect = 0.016,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           L  F +++   S+ Y +  E  +RF +F+DNL  I + NK E  +   G+N  SDLT +E
Sbjct: 49  LDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQEK-SYWLGLNKFSDLTHDE 107

Query: 91  MKS 93
            ++
Sbjct: 108 FRA 110


>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score = 43.5 bits (101), Expect = 0.016,   Method: Composition-based stats.
 Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 1/90 (1%)

Query: 6  SAEATLALFGQMKSNNELKTENPEHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKL 64
          S+ A L +FG +      +T     LK+  E+++  + K Y    E   R  +F++N++ 
Sbjct: 10 SSLALLLVFGFLAFEANARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKENVQR 69

Query: 65 IEDLNKGEHGTATYGINHLSDLTREEMKSR 94
          IE  N   +     GIN  +DLT EE K+R
Sbjct: 70 IEAFNNAGNKPYKLGINQFADLTNEEFKAR 99


>gi|224035611|gb|ACN36881.1| unknown [Zea mays]
          Length = 327

 Score = 43.5 bits (101), Expect = 0.016,   Method: Composition-based stats.
 Identities = 25/68 (36%), Positives = 42/68 (61%), Gaps = 2/68 (2%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE+++    ++Y + EE  +RF VF+DNL  I++ N+ +  +   G+N  +DLT +E K+
Sbjct: 59  FERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNR-KVSSYWLGLNEFADLTHDEFKA 117

Query: 94  R-LGLNLS 100
             LGL  S
Sbjct: 118 TYLGLRSS 125


>gi|4733887|gb|AAD02173.3| cysteine proteinase [Acanthamoeba culbertsoni]
          Length = 482

 Score = 43.5 bits (101), Expect = 0.016,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 35/58 (60%), Gaps = 2/58 (3%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           QF  ++R  ++SY + +E  +R+  + +N+  IE+ N+G H T T  +N   DLT EE
Sbjct: 63  QFNSWMRRHARSY-SNDEFLERYNTWRENMDFIEEFNRGNH-TFTVAMNEHGDLTPEE 118


>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 339

 Score = 43.5 bits (101), Expect = 0.016,   Method: Composition-based stats.
 Identities = 23/70 (32%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ EK++  + K Y    E  KRF +F++N++ IE  N          IN  +DL  EE 
Sbjct: 35  ERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEF 94

Query: 92  KSRLGLNLSK 101
           K+ L +N+ K
Sbjct: 95  KASL-INVQK 103


>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
          Length = 331

 Score = 43.5 bits (101), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           + +FE ++    K Y + EE   RF VF +NL  I++ NK E  +   G+N  +DL+ EE
Sbjct: 46  IARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNK-EVSSYWLGLNEFADLSHEE 104

Query: 91  MKSR 94
            KS+
Sbjct: 105 FKSK 108


>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
           endopeptidase; AltName: Full=Papaya peptidase B;
           AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
           Precursor
 gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
          Length = 348

 Score = 43.5 bits (101), Expect = 0.016,   Method: Composition-based stats.
 Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 13/95 (13%)

Query: 11  LALFGQMK-----------SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFE 59
           + LFG M            S ++L T     ++ F  ++   +K+Y   +E   RF +F+
Sbjct: 15  ICLFGHMSLSYCDFSIVGYSQDDL-TSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFK 73

Query: 60  DNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
           DNLK I++ NK  +G    G+N  SDL+ +E K +
Sbjct: 74  DNLKYIDERNKMINGY-WLGLNEFSDLSNDEFKEK 107


>gi|440797510|gb|ELR18596.1| Cathepsin L precursor (Cysteine proteinase 1), putative
          [Acanthamoeba castellanii str. Neff]
          Length = 340

 Score = 43.5 bits (101), Expect = 0.016,   Method: Composition-based stats.
 Identities = 28/84 (33%), Positives = 46/84 (54%), Gaps = 3/84 (3%)

Query: 9  ATLALFGQMKSNNELKTENPEHLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
          A LALF  +   + L + +    ++  F +++R  +KSY T +E + R+AV+ DN + IE
Sbjct: 5  AILALFAVVFVVSALASGHSTSAEEQIFAQWMRAHAKSYAT-QEFSHRWAVWRDNHRFIE 63

Query: 67 DLNKGEHGTATYGINHLSDLTREE 90
            N+  + T T  +N   DLT  E
Sbjct: 64 AHNRQPNKTFTLAMNQFGDLTDHE 87


>gi|328876826|gb|EGG25189.1| hypothetical protein DFA_03437 [Dictyostelium fasciculatum]
          Length = 341

 Score = 43.5 bits (101), Expect = 0.016,   Method: Composition-based stats.
 Identities = 24/73 (32%), Positives = 42/73 (57%), Gaps = 1/73 (1%)

Query: 20 NNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
          N  L T + ++  +F+ ++ + +K Y  +EE   R + F  N+  IE +N+    TAT+G
Sbjct: 19 NVRLSTAD-DYTTRFKTWMVEHNKMYHEEEEFYLRLSNFIRNIHSIEKMNRQYGRTATFG 77

Query: 80 INHLSDLTREEMK 92
          +N  SDL+ +E K
Sbjct: 78 LNKFSDLSLDEFK 90


>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score = 43.5 bits (101), Expect = 0.016,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 35  EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM-KS 93
           E+++  + + Y    E  +R  VF  N + I+ +N+  + T T G+NH SDLT EE  ++
Sbjct: 42  ERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVNRAGNRTYTLGLNHFSDLTNEEFAQT 101

Query: 94  RLG 96
            LG
Sbjct: 102 HLG 104


>gi|260830531|ref|XP_002610214.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
 gi|229295578|gb|EEN66224.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
          Length = 274

 Score = 43.5 bits (101), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 20/39 (51%), Positives = 24/39 (61%)

Query: 54 RFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          R+ VF+DNLK  E L   E GTA YG+    DLT EE +
Sbjct: 1  RYFVFQDNLKKAETLQDSERGTAKYGVTKFMDLTEEEFR 39


>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella
          moellendorffii]
 gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella
          moellendorffii]
          Length = 299

 Score = 43.5 bits (101), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 31/60 (51%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE +     KSY +  E A+R  +F D L  IE  N   + T T G+N  SDLT  E ++
Sbjct: 2  FEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61


>gi|118359377|ref|XP_001012928.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila]
 gi|89294695|gb|EAR92683.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila SB210]
          Length = 377

 Score = 43.5 bits (101), Expect = 0.017,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 37/62 (59%), Gaps = 1/62 (1%)

Query: 34 FEKFIRDFSKSYP-TKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          FE+++++F+K+Y    E+   R ++FE NL  I D N   + +   G+N  +D T+ E+K
Sbjct: 29 FEQYVKEFNKNYGFNSEDYQLRKSIFERNLAEIIDFNNDPNHSYKKGVNQFTDQTQNELK 88

Query: 93 SR 94
           +
Sbjct: 89 EK 90


>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 368

 Score = 43.5 bits (101), Expect = 0.017,   Method: Composition-based stats.
 Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
            F  F + F K Y ++EE   RF+VF+ NL+      K +  +A +G+   SDLTR E K
Sbjct: 50  HFSLFKKKFGKVYASREEHDYRFSVFKSNLRRARRHQKLD-PSARHGVTQFSDLTRSEFK 108

Query: 93  SR 94
            +
Sbjct: 109 RK 110


>gi|353441136|gb|AEQ94152.1| drought-inducible cysteine proteinase [Elaeis guineensis]
          Length = 252

 Score = 43.5 bits (101), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 25/59 (42%), Positives = 34/59 (57%), Gaps = 1/59 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           F  F+R F KSY  ++E A RF+VF+ NL+      K +  TA +GI   SDLT  E +
Sbjct: 55  FSSFLRRFGKSYADEKEHAYRFSVFKANLRRARRHQKMDP-TAVHGITKFSDLTPAEFR 112


>gi|294885122|ref|XP_002771197.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239874644|gb|EER03013.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 111

 Score = 43.5 bits (101), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 39/65 (60%), Gaps = 2/65 (3%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE-MK 92
          F  F   F K Y +KEE  KR A+F+ NL  IE +N  ++ + T G+N  +DLT EE + 
Sbjct: 28 FTDFQHKFGKKYESKEEEMKRNAIFQANLHHIEQVN-AQNLSYTLGVNEYADLTHEEFVA 86

Query: 93 SRLGL 97
           ++G+
Sbjct: 87 QKVGI 91


>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
          Length = 461

 Score = 43.5 bits (101), Expect = 0.017,   Method: Composition-based stats.
 Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 2/73 (2%)

Query: 23  LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGI 80
           L+    E    ++ ++ +  +SY    E  +RF VF DNLK ++  N    EHG    G+
Sbjct: 38  LERTEAEARAAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGM 97

Query: 81  NHLSDLTREEMKS 93
           N  +DLT +E +S
Sbjct: 98  NRFADLTNDEFRS 110


>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
          Length = 340

 Score = 43.5 bits (101), Expect = 0.017,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 3/68 (4%)

Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIED---LNKGEHGTATYGINHLSD 85
          E +  +E+++    K Y +  E  KRF +F+DNL+ I+     NK  H   T G+N  +D
Sbjct: 29 EVIALYEEWLVKHQKLYSSLGEKIKRFEIFKDNLRYIDQQNHYNKVNHMNFTLGLNQFAD 88

Query: 86 LTREEMKS 93
          LT +E  S
Sbjct: 89 LTLDEFSS 96


>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
          Length = 345

 Score = 43.5 bits (101), Expect = 0.017,   Method: Composition-based stats.
 Identities = 21/69 (30%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T EE 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
          Length = 366

 Score = 43.5 bits (101), Expect = 0.017,   Method: Composition-based stats.
 Identities = 27/72 (37%), Positives = 40/72 (55%), Gaps = 4/72 (5%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N EH   F  F   F+K+Y T+EE   RF +F++NL   +   K +  +A +G+   SDL
Sbjct: 46  NAEH--HFSAFKTKFAKTYATQEEHDHRFRIFKNNLLRAKSHQKLD-PSAVHGVTRFSDL 102

Query: 87  TREEMKSR-LGL 97
           T  E + + LGL
Sbjct: 103 TPSEFRGQFLGL 114


>gi|6448469|dbj|BAA86911.1| homologue of Sarcophaga 26,29kDa proteinase [Periplaneta americana]
          Length = 552

 Score = 43.5 bits (101), Expect = 0.017,   Method: Composition-based stats.
 Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 2/69 (2%)

Query: 29  EHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
           EH++  F+ F +  SK Y +  E  KR  +F  NL+ I   N+   G  T  +NHL+D T
Sbjct: 242 EHVETAFDHFRKRHSKDYASNLEHTKRKEIFRQNLRFIHSKNRARLGF-TLDVNHLADRT 300

Query: 88  REEMKSRLG 96
             E+K+  G
Sbjct: 301 ELELKALRG 309


>gi|389611850|dbj|BAM19484.1| cathepsin L [Papilio xuthus]
          Length = 342

 Score = 43.5 bits (101), Expect = 0.017,   Method: Composition-based stats.
 Identities = 22/68 (32%), Positives = 36/68 (52%), Gaps = 1/68 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F++F    +K Y ++ E AKR  +F  NL+ I   N+   G  T  +NHL+D   +E+ 
Sbjct: 35  EFDRFKAKHNKKYASEIEHAKRLNIFRQNLRYIHSNNRARRGF-TLAVNHLADWAEDELA 93

Query: 93  SRLGLNLS 100
           +  G   S
Sbjct: 94  ALRGRRYS 101


>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 341

 Score = 43.5 bits (101), Expect = 0.017,   Method: Composition-based stats.
 Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 4/73 (5%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           +++ E+++  F++ Y    E A+RF VF+ N+  IE  N  E+     G+N  +DLT +E
Sbjct: 34  VERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAFIESFN-AENRKFWLGVNQFTDLTNDE 92

Query: 91  M---KSRLGLNLS 100
               K+  GL +S
Sbjct: 93  FRATKTNKGLKMS 105


>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
          Length = 322

 Score = 43.5 bits (101), Expect = 0.017,   Method: Composition-based stats.
 Identities = 19/59 (32%), Positives = 33/59 (55%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E ++  + + Y   +E +KR+ +F+DN+  IE  NK    +    IN  +DLT EE
Sbjct: 37 ERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEE 95


>gi|76097507|gb|ABA39436.1| Der f 1 allergen precursor [Dermatophagoides farinae]
          Length = 276

 Score = 43.5 bits (101), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)

Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          P  +K FE+F + F+K+Y T  +EEVA++   F ++LK +E  NKG        INHLSD
Sbjct: 2  PASIKTFEEFKKAFNKNYATVEEEEVARKN--FLESLKYVE-ANKG-------AINHLSD 51

Query: 86 LTREEMKSR 94
          L+ +E K+R
Sbjct: 52 LSLDEFKNR 60


>gi|354494740|ref|XP_003509493.1| PREDICTED: cathepsin W-like [Cricetulus griseus]
 gi|344243260|gb|EGV99363.1| Cathepsin W [Cricetulus griseus]
          Length = 376

 Score = 43.5 bits (101), Expect = 0.018,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 35/68 (51%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E ++ F+ F   +++SY    E A+R  +F  NL   + L + + GTA +G    SDLT 
Sbjct: 35  ELIEVFKLFQIKYNRSYANPAEYARRLNIFAHNLAQAQRLQEEDLGTAEFGETPFSDLTE 94

Query: 89  EEMKSRLG 96
           EE     G
Sbjct: 95  EEFGQLYG 102


>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
          Length = 341

 Score = 43.5 bits (101), Expect = 0.018,   Method: Composition-based stats.
 Identities = 19/62 (30%), Positives = 35/62 (56%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          ++ E ++  + + Y    E +KR+ +F+DN+  IE  NK  + +    IN  +DLT EE 
Sbjct: 37 ERHEDWMAQYGRVYKDAGEKSKRYKIFKDNVARIESFNKAMNKSYKLSINEFADLTNEEF 96

Query: 92 KS 93
          ++
Sbjct: 97 RA 98


>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
          Length = 366

 Score = 43.5 bits (101), Expect = 0.018,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 36/57 (63%), Gaps = 2/57 (3%)

Query: 42  SKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK-SRLGL 97
           SK Y + +E  KR+ +F+ NL+ I + N+  +G+   G+NH +D+  EE K S LGL
Sbjct: 63  SKIYASPKEKVKRYEIFKRNLRHIVETNR-RNGSYWLGLNHFADIAHEEFKASYLGL 118


>gi|403376023|gb|EJY87990.1| Cathepsin L [Oxytricha trifallax]
          Length = 343

 Score = 43.5 bits (101), Expect = 0.018,   Method: Composition-based stats.
 Identities = 29/91 (31%), Positives = 41/91 (45%), Gaps = 3/91 (3%)

Query: 9   ATLALFGQMK---SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI 65
           AT+ LF   +   S N    E  +    F  ++  + KSY TKEE   R+  ++ N+  +
Sbjct: 15  ATVGLFAISEAPASTNLFAIEVTQDNVAFANYLAKYGKSYGTKEEFQFRYEQYQKNMAKV 74

Query: 66  EDLNKGEHGTATYGINHLSDLTREEMKSRLG 96
              N     T   GIN  +D T EE K  LG
Sbjct: 75  AQYNGQNGNTFRLGINKFTDYTPEEYKVLLG 105


>gi|357162946|ref|XP_003579573.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
          Length = 376

 Score = 43.1 bits (100), Expect = 0.018,   Method: Composition-based stats.
 Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 5/78 (6%)

Query: 21  NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
           NEL+     H   F  F++ F+KSY   +E A R +VF  NL+      + +  +A +G+
Sbjct: 42  NELELNAEAH---FASFVQRFNKSYRDADEHAHRLSVFTANLRRARRHQRLD-PSAVHGV 97

Query: 81  NHLSDLTREEMKSR-LGL 97
              SDLT +E + R LGL
Sbjct: 98  TKFSDLTPDEFRDRFLGL 115


>gi|9634237|ref|NP_037776.1| ORF16 cathepsin [Spodoptera exigua MNPV]
 gi|37077857|sp|Q9J8B9.1|CATV_NPVSE RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|6960476|gb|AAF33546.1|AF169823_16 ORF16 cathepsin [Spodoptera exigua MNPV]
          Length = 337

 Score = 43.1 bits (100), Expect = 0.018,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 39/74 (52%), Gaps = 9/74 (12%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FEKFI  ++K Y +++E   R+ +F  N++ I   N   + +A Y IN  +D+ + E+  
Sbjct: 40  FEKFITQYNKQYKSEDEKKYRYNIFRHNIESINQKNS-RNDSAVYKINRFADMPKNEIVI 98

Query: 94  R--------LGLNL 99
           R        LGLN 
Sbjct: 99  RHTGLASGELGLNF 112


>gi|313220237|emb|CBY31096.1| unnamed protein product [Oikopleura dioica]
          Length = 371

 Score = 43.1 bits (100), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 2/67 (2%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           KQFE F+ +  K Y +++E   RF  F +NLK I+  N  E G+A YG+   +DL+  E 
Sbjct: 48  KQFENFLLEHPKMY-SEQESHSRFQTFWENLKRIKFHNHIEQGSAKYGVTEFTDLSDFEF 106

Query: 92  KSR-LGL 97
           +   LGL
Sbjct: 107 RRHYLGL 113


>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella
          moellendorffii]
 gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella
          moellendorffii]
 gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella
          moellendorffii]
 gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella
          moellendorffii]
          Length = 300

 Score = 43.1 bits (100), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 31/60 (51%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE +     KSY +  E A+R  VF D L  IE  N   + T T G+N  SDLT  E ++
Sbjct: 2  FEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61


>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
 gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
 gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
          Length = 350

 Score = 43.1 bits (100), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 43/72 (59%), Gaps = 6/72 (8%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY--GINHLSDLTR 88
           ++ FE ++    K Y T EE   RF VF+DNLK I+D NK     + Y  G+N  +DL+ 
Sbjct: 44  IELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNK---VVSNYWLGLNEFADLSH 100

Query: 89  EEMKSR-LGLNL 99
           +E K++ LGL +
Sbjct: 101 QEFKNKYLGLKV 112


>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
          Length = 357

 Score = 43.1 bits (100), Expect = 0.018,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 36/57 (63%), Gaps = 2/57 (3%)

Query: 42  SKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK-SRLGL 97
           SK Y + +E  KR+ +F+ NL+ I + N+  +G+   G+NH +D+  EE K S LGL
Sbjct: 54  SKIYASPKEKVKRYEIFKRNLRHIVETNR-RNGSYWLGLNHFADIAHEEFKASYLGL 109


>gi|37958161|gb|AAP35075.1| Der f 1 allergen [Dermatophagoides farinae]
          Length = 263

 Score = 43.1 bits (100), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)

Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          P  +K FE+F + F+K+Y T  +EEVA++   F ++LK +E  NKG        INHLSD
Sbjct: 20 PASIKTFEEFKKAFNKNYATVEEEEVARKN--FLESLKYVE-ANKG-------AINHLSD 69

Query: 86 LTREEMKSR 94
          L+ +E K+R
Sbjct: 70 LSLDEFKNR 78


>gi|440800456|gb|ELR21495.1| cathepsin Llike proteinase [Acanthamoeba castellanii str. Neff]
          Length = 557

 Score = 43.1 bits (100), Expect = 0.019,   Method: Composition-based stats.
 Identities = 24/81 (29%), Positives = 37/81 (45%), Gaps = 1/81 (1%)

Query: 11  LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
           L++  +M   + +   N E   QF      F K Y    E + R  +F  NL+ IE  NK
Sbjct: 211 LSVLRKMFQASPVPDHNDEVAAQFAAHAHKFGKVYADHSEYSMRLNIFRKNLEYIEQYNK 270

Query: 71  GEHGTATYGINHLSDLTREEM 91
            + G     +NH  D+T +E+
Sbjct: 271 KDTGM-KLAMNHFGDMTYDEI 290


>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
          Length = 365

 Score = 43.1 bits (100), Expect = 0.019,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 33/60 (55%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          +E ++    K+Y    E   RF +F DNLK I++ N   + +   G+N  +DLT EE +S
Sbjct: 36 YELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVGLNQFADLTNEEYRS 95


>gi|111073719|dbj|BAF02548.1| triticain gamma [Triticum aestivum]
          Length = 365

 Score = 43.1 bits (100), Expect = 0.019,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H  +F +F   + KSY +  EV +RF +F ++L+ +   N+ +  +   GIN  SD++ E
Sbjct: 60  HALRFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNR-KGLSYRLGINRFSDMSWE 118

Query: 90  EMK-SRLG 96
           E + +RLG
Sbjct: 119 EFQATRLG 126


>gi|224113123|ref|XP_002316398.1| predicted protein [Populus trichocarpa]
 gi|222865438|gb|EEF02569.1| predicted protein [Populus trichocarpa]
          Length = 327

 Score = 43.1 bits (100), Expect = 0.019,   Method: Composition-based stats.
 Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 3/62 (4%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNL-KLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++F+ FI++ +K Y T+EE   RF +F  NL + +E  ++    TA +G+    DLT EE
Sbjct: 12 EKFKMFIKEHNKEYATREEYVHRFGIFGKNLIRAVE--HQALDPTAIHGVTPFMDLTEEE 69

Query: 91 MK 92
           +
Sbjct: 70 FE 71


>gi|407844577|gb|EKG02025.1| cysteine peptidase, putative,cysteine peptidase, clan CA, family
           C1, cathepsin L-like, putative, partial [Trypanosoma
           cruzi]
          Length = 308

 Score = 43.1 bits (100), Expect = 0.019,   Method: Composition-based stats.
 Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           QF +F +   + Y +  E A R +VF  NL  +  L+   +  A +G+   SDLTREE +
Sbjct: 65  QFAEFKQKHGRVYGSAAEEAFRLSVFRANL-FLARLHAAANPHANFGVTPFSDLTREEFR 123

Query: 93  SR 94
           SR
Sbjct: 124 SR 125


>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
          Length = 344

 Score = 43.1 bits (100), Expect = 0.019,   Method: Composition-based stats.
 Identities = 21/69 (30%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T EE 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|326515410|dbj|BAK03618.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 202

 Score = 43.1 bits (100), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 36/61 (59%), Gaps = 3/61 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHG--TATYGINHLSDLTREE 90
           F  +   F K+Y +  E  +R+AVF++ L+L++  N  GE G   A  GIN L+D+T EE
Sbjct: 50  FGAWKAKFGKTYSSVGEEERRYAVFKETLRLVDQHNAAGEAGVPVARMGINGLADMTTEE 109

Query: 91  M 91
            
Sbjct: 110 W 110


>gi|357630541|gb|EHJ78589.1| hypothetical protein KGM_15348 [Danaus plexippus]
          Length = 98

 Score = 43.1 bits (100), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 36/66 (54%), Gaps = 5/66 (7%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT--ATYGINHLSDLTRE 89
          K FEKF+ D+ K Y  + + A  +  F  +L  I   NKG   +   TY INHL+D T E
Sbjct: 30 KLFEKFMADYDKHYKDQIDTANHYNAFLASLVTI---NKGNRDSPLTTYDINHLADYTPE 86

Query: 90 EMKSRL 95
          E+ S L
Sbjct: 87 EIDSTL 92


>gi|348513249|ref|XP_003444155.1| PREDICTED: cathepsin K-like [Oreochromis niloticus]
          Length = 330

 Score = 43.1 bits (100), Expect = 0.019,   Method: Composition-based stats.
 Identities = 27/80 (33%), Positives = 47/80 (58%), Gaps = 6/80 (7%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI----EDLNKGEHGTATYGINH 82
           +P   K +E++     K Y  + E+  R AV+E N+ L+    ++ + G+H + T G+NH
Sbjct: 19  SPAVNKLWEEWKTKHGKVYDNQTEIDFRRAVWEKNVHLVLRHNQEASAGKH-SFTLGLNH 77

Query: 83  LSDLTREEMKSRL-GLNLSK 101
           L+D+T EE+  +L GL L +
Sbjct: 78  LADMTAEEINEKLNGLKLEE 97


>gi|357473651|ref|XP_003607110.1| Cysteine proteinase [Medicago truncatula]
 gi|355508165|gb|AES89307.1| Cysteine proteinase [Medicago truncatula]
          Length = 331

 Score = 43.1 bits (100), Expect = 0.020,   Method: Composition-based stats.
 Identities = 27/78 (34%), Positives = 40/78 (51%), Gaps = 13/78 (16%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG----TATYGINHLSDLTR 88
           QF +F + F K Y +K+E   RF VF+ NL          HG    +AT+G+   SDLT 
Sbjct: 47  QFNEFKQRFGKVYSSKDEHDYRFNVFKSNLH-----RAKRHGIMDPSATHGVTRFSDLTP 101

Query: 89  EEMKSRL----GLNLSKH 102
            E ++ +    G+ L +H
Sbjct: 102 REFRNSILGLKGVGLPRH 119


>gi|345316917|ref|XP_001511419.2| PREDICTED: cathepsin W-like, partial [Ornithorhynchus anatinus]
          Length = 252

 Score = 43.1 bits (100), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 39/73 (53%)

Query: 15  GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
           GQ +    L     E + +F++F   ++KSY  + E A+RF +F  NL     L + + G
Sbjct: 28  GQNQHPQPLPDTTLELMDKFKEFQIRYNKSYEDQAEHARRFEIFVQNLARARKLQEEDQG 87

Query: 75  TATYGINHLSDLT 87
           TA +G+   SDL+
Sbjct: 88  TAEFGVTPFSDLS 100


>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
 gi|255636729|gb|ACU18700.1| unknown [Glycine max]
          Length = 341

 Score = 43.1 bits (100), Expect = 0.020,   Method: Composition-based stats.
 Identities = 22/64 (34%), Positives = 33/64 (51%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          ++ EK++  + K Y    E  KRF VF++N++ IE  N          IN  +DL  EE 
Sbjct: 33 ERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEF 92

Query: 92 KSRL 95
          K+ L
Sbjct: 93 KALL 96


>gi|38423491|emb|CAD80247.1| salarin [Salvelinus alpinus]
          Length = 342

 Score = 43.1 bits (100), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 3/67 (4%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHG--TATYGINHLSDLTR 88
           K+FE +   + K+YP+ EE AKR  ++    K++ + NK  E+G  + T  +NH +DLT 
Sbjct: 272 KEFETWKVKYGKTYPSTEEEAKRKEIWLATRKMVTEHNKRAENGQESFTMAVNHFADLTT 331

Query: 89  EEMKSRL 95
           EE+   L
Sbjct: 332 EEVPKGL 338



 Score = 38.9 bits (89), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 38/67 (56%), Gaps = 3/67 (4%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTR 88
          K+FE +   + KSYP+ EE AKR  ++    K + + N +  +G  +Y   +NH +DLT 
Sbjct: 32 KEFETWKVKYGKSYPSTEEEAKRKEMWLATRKRVMEHNTRAGNGLESYTMAVNHFADLTT 91

Query: 89 EEMKSRL 95
          EE+   L
Sbjct: 92 EEVPKGL 98


>gi|387765908|gb|AFJ95133.1| cathepsin-L [Toxocara canis]
          Length = 360

 Score = 43.1 bits (100), Expect = 0.020,   Method: Composition-based stats.
 Identities = 23/66 (34%), Positives = 35/66 (53%), Gaps = 1/66 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT-YGINHLSDLTRE 89
           L +FE FIR + K Y + EE A+RF ++ +N+   + LN+      T YG N  +D    
Sbjct: 47  LDRFEDFIRKYDKVYDSNEEFAERFRIYVNNMLEAQKLNQRNRDYGTIYGENEFADWNVN 106

Query: 90  EMKSRL 95
           E +  L
Sbjct: 107 EFREIL 112


>gi|258618831|gb|ACV84238.1| cysteine proteinase L [Anisakis simplex]
          Length = 411

 Score = 43.1 bits (100), Expect = 0.020,   Method: Composition-based stats.
 Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
           E+  ++ QF  F+  + + Y    E  +RF  F +N+K I+ + +G+     +GI   +D
Sbjct: 96  EDFAYIDQFIDFMNVYGRKYHGYHETRERFQNFVNNMKYIKKIQQGKQ-NVQFGITRFAD 154

Query: 86  LTREEMKS 93
            + EEMKS
Sbjct: 155 WSEEEMKS 162


>gi|198435380|ref|XP_002128293.1| PREDICTED: similar to cathepsin H [Ciona intestinalis]
          Length = 438

 Score = 43.1 bits (100), Expect = 0.020,   Method: Composition-based stats.
 Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 3/74 (4%)

Query: 20  NNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYG 79
           NN+++ E     K ++    +  K Y  +EE  KRF +F  +LK I++ N     T   G
Sbjct: 124 NNDVEVEERNLFKGWQI---EHGKQYINQEEAEKRFQIFSKSLKTIKEFNNRVDRTWEMG 180

Query: 80  INHLSDLTREEMKS 93
           +N  SD T EE  S
Sbjct: 181 LNEFSDRTFEEFAS 194


>gi|313229615|emb|CBY18430.1| unnamed protein product [Oikopleura dioica]
          Length = 326

 Score = 43.1 bits (100), Expect = 0.020,   Method: Composition-based stats.
 Identities = 25/59 (42%), Positives = 36/59 (61%), Gaps = 1/59 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          FE +  +  KSY T E+   R  VFE+N+  IE +NK E+ + T G+N  SDLT +E +
Sbjct: 18 FEDWTAEHWKSYETAEDEKFRKGVFEENVAKIEKINK-ENRSWTAGLNKFSDLTWDEFQ 75


>gi|12597541|ref|NP_075125.1| cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
 gi|15426394|ref|NP_203611.1| cathepsin [Helicoverpa armigera NPV]
 gi|12483807|gb|AAG53799.1|AF271059_56 cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
 gi|15384470|gb|AAK96381.1|AF303045_123 cathepsin [Helicoverpa armigera NPV]
 gi|18027090|gb|AAL55725.1|AF268612_1 cathepsin [Helicoverpa armigera NPV]
          Length = 365

 Score = 43.1 bits (100), Expect = 0.021,   Method: Composition-based stats.
 Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 14/83 (16%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE-----------HGTATYGINH 82
           F+ F++ ++KSY   +E   R+ VF+DNL  I   N+               +A +G+N 
Sbjct: 55  FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 114

Query: 83  LSDLTREE-MKSRLG--LNLSKH 102
            SD T +E + S  G  LNLS+H
Sbjct: 115 FSDKTPDEVLHSNTGFFLNLSQH 137


>gi|313235882|emb|CBY11269.1| unnamed protein product [Oikopleura dioica]
          Length = 371

 Score = 43.1 bits (100), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 2/67 (2%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           KQFE F+ +  K Y +++E   RF  F +NLK I+  N  E G+A YG+   +DL+  E 
Sbjct: 48  KQFENFLLEHPKMY-SEQESHSRFQTFWENLKRIKFHNHIEQGSAKYGVTEFADLSDFEF 106

Query: 92  KSR-LGL 97
           +   LGL
Sbjct: 107 RRHYLGL 113


>gi|449461649|ref|XP_004148554.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD19a-like
           [Cucumis sativus]
          Length = 381

 Score = 43.1 bits (100), Expect = 0.021,   Method: Composition-based stats.
 Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 4/70 (5%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           EH   F  F R F KSY T+EE  +RF +F+ N++  E  ++    +A +G+   SDLT 
Sbjct: 56  EH--HFSLFKRRFGKSYATEEEHDRRFKIFKANMRRAER-HQSFDPSAIHGVTQFSDLTP 112

Query: 89  EEM-KSRLGL 97
            E  K+ LGL
Sbjct: 113 FEFRKAFLGL 122


>gi|404312774|pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 2.6 Angstroem Resolution
 gi|404312775|pdb|3TNX|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 2.6 Angstroem Resolution
 gi|428698029|pdb|3USV|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
 gi|428698030|pdb|3USV|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
          Length = 363

 Score = 43.1 bits (100), Expect = 0.021,   Method: Composition-based stats.
 Identities = 24/76 (31%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           S N+L T     ++ FE ++   +K Y   +E   RF +F+DNLK I++ NK ++ +   
Sbjct: 52  SQNDL-TSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNK-KNNSYWL 109

Query: 79  GINHLSDLTREEMKSR 94
           G+N  +D++ +E K +
Sbjct: 110 GLNVFADMSNDEFKEK 125


>gi|344310882|gb|AEN03980.1| cathepsin-like cysteine proteinase [Helicoverpa armigera NPV strain
           Australia]
          Length = 367

 Score = 43.1 bits (100), Expect = 0.021,   Method: Composition-based stats.
 Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 14/83 (16%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE-----------HGTATYGINH 82
           F+ F++ ++KSY   +E   R+ VF+DNL  I   N+               +A +G+N 
Sbjct: 57  FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116

Query: 83  LSDLTREE-MKSRLG--LNLSKH 102
            SD T +E + S  G  LNLS+H
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQH 139


>gi|18138384|ref|NP_542680.1| cathepsin [Helicoverpa zea SNPV]
 gi|209401110|ref|YP_002273979.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
 gi|37077430|sp|Q8V5U0.1|CATV_NPVHZ RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|18028766|gb|AAL56202.1|AF334030_127 ORF57 [Helicoverpa zea SNPV]
 gi|209364362|dbj|BAG74621.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
          Length = 367

 Score = 43.1 bits (100), Expect = 0.021,   Method: Composition-based stats.
 Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 14/83 (16%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE-----------HGTATYGINH 82
           F+ F++ ++KSY   +E   R+ VF+DNL  I   N+               +A +G+N 
Sbjct: 57  FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116

Query: 83  LSDLTREE-MKSRLG--LNLSKH 102
            SD T +E + S  G  LNLS+H
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQH 139


>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine
          endopeptidase CEP1 [Vitis vinifera]
          Length = 341

 Score = 43.1 bits (100), Expect = 0.021,   Method: Composition-based stats.
 Identities = 19/59 (32%), Positives = 33/59 (55%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E ++  + + Y   +E +KR+ +F+DN+  IE  NK    +    IN  +DLT EE
Sbjct: 37 ERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEE 95


>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score = 43.1 bits (100), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 29/69 (42%), Positives = 41/69 (59%), Gaps = 6/69 (8%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY--GINHLSDLTREEM 91
           FE ++    K Y T EE   RF VF+DNLK I+D NK     + Y  G+N  +DL+ +E 
Sbjct: 47  FESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNK---IVSNYWLGLNEFADLSHQEF 103

Query: 92  KSR-LGLNL 99
           K++ LGL +
Sbjct: 104 KNKYLGLKV 112


>gi|2695929|emb|CAA10983.1| putative thiol protease [Hordeum vulgare subsp. vulgare]
          Length = 111

 Score = 43.1 bits (100), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 33/60 (55%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           L +F  ++    +SYPT EE  +RF V+  N++ IE  N+    + + G    +DLT EE
Sbjct: 46  LGRFHGWMAAHGRSYPTVEEKLRRFEVYRSNMEFIEAANRDSRMSYSLGETPFTDLTHEE 105


>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 387

 Score = 43.1 bits (100), Expect = 0.021,   Method: Composition-based stats.
 Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 4/70 (5%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           EH   F  F R F KSY T+EE  +RF +F+ N++  E  ++    +A +G+   SDLT 
Sbjct: 56  EH--HFSLFKRRFGKSYATEEEHDRRFKIFKANMRRAER-HQSFDPSAIHGVTQFSDLTP 112

Query: 89  EEM-KSRLGL 97
            E  K+ LGL
Sbjct: 113 FEFRKAFLGL 122


>gi|297819034|ref|XP_002877400.1| hypothetical protein ARALYDRAFT_323209 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323238|gb|EFH53659.1| hypothetical protein ARALYDRAFT_323209 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 317

 Score = 43.1 bits (100), Expect = 0.021,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H+  F +F   + K Y + EE+  RF+VF++NL LI   NK +  +    +N  +DLT +
Sbjct: 55  HVLSFSRFAHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNK-KGLSYKLSLNQFADLTWQ 113

Query: 90  EMK 92
           E +
Sbjct: 114 EFQ 116


>gi|297613009|ref|NP_001066557.2| Os12g0273800 [Oryza sativa Japonica Group]
 gi|255670224|dbj|BAF29576.2| Os12g0273800 [Oryza sativa Japonica Group]
          Length = 210

 Score = 43.1 bits (100), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 31/61 (50%)

Query: 35  EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
           E+++    + Y    E A+R  VF+ N+  IE  N G       G+N  +DLT EE K+ 
Sbjct: 45  ERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKAT 104

Query: 95  L 95
           +
Sbjct: 105 M 105


>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
 gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
          Length = 327

 Score = 43.1 bits (100), Expect = 0.021,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 30/62 (48%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          + FE+++  F K YP   E   RF VF DN++ I          +   +N  +DLT +E 
Sbjct: 17 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 76

Query: 92 KS 93
           S
Sbjct: 77 VS 78


>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 349

 Score = 43.1 bits (100), Expect = 0.021,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 30/62 (48%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           + FE+++  F K YP   E   RF VF DN++ I          +   +N  +DLT +E 
Sbjct: 39  QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 98

Query: 92  KS 93
            S
Sbjct: 99  VS 100


>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
          Length = 466

 Score = 43.1 bits (100), Expect = 0.022,   Method: Composition-based stats.
 Identities = 23/78 (29%), Positives = 45/78 (57%), Gaps = 4/78 (5%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
           E+P   + F+ +++   ++Y + EE  +RF V+ DNL+ + + N G H +    +   +D
Sbjct: 34  ESPR--EAFDFWVQTLKRAYASAEEYERRFDVWLDNLRFVHEYNAG-HTSHWLSMGVYAD 90

Query: 86  LTREEMKSR-LGLNLSKH 102
           L+++E +S+ LG N   H
Sbjct: 91  LSQDEYRSKALGYNADLH 108


>gi|297845822|ref|XP_002890792.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336634|gb|EFH67051.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 322

 Score = 43.1 bits (100), Expect = 0.022,   Method: Composition-based stats.
 Identities = 22/67 (32%), Positives = 38/67 (56%), Gaps = 1/67 (1%)

Query: 35  EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE-MKS 93
           ++++  FS+ Y  + E   R  VF+ NLK IE+ N   + + T G+N  +D T EE + +
Sbjct: 39  QQWMTQFSRVYQDESEKEMRLQVFKKNLKFIENFNNMGNQSYTVGVNEFTDWTIEEFLAT 98

Query: 94  RLGLNLS 100
             GL ++
Sbjct: 99  HTGLRVN 105


>gi|42564157|gb|AAS20590.1| digestive cysteine proteinase intestain [Leptinotarsa
          decemlineata]
          Length = 322

 Score = 43.1 bits (100), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 3/68 (4%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTRE 89
          Q+  F +   K+Y +  E   RF +F++NL+ IE  N + E G  TY   +   +D+TR+
Sbjct: 22 QWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEKHNAEYEEGKVTYYMAVTQFADMTRD 81

Query: 90 EMKSRLGL 97
          E + +LGL
Sbjct: 82 EFRKKLGL 89


>gi|340053969|emb|CCC48263.1| cysteine peptidase precursor, fragment, partial [Trypanosoma
          vivax Y486]
          Length = 259

 Score = 43.1 bits (100), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F + + +SY T  E A R  VFEDN++    +    +  AT+G+   SDLT EE ++
Sbjct: 34 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 92

Query: 94 R 94
          R
Sbjct: 93 R 93


>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
          Length = 363

 Score = 43.1 bits (100), Expect = 0.023,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H  +F +F   + KSY +  EV +RF +F ++L+ +   N+ +  +   GIN  SD++ E
Sbjct: 58  HALRFARFAVRYGKSYESAAEVQRRFRIFSESLEEVRSTNQ-KGLSYRLGINRYSDMSWE 116

Query: 90  EMK-SRLG 96
           E + SRLG
Sbjct: 117 EFQASRLG 124


>gi|403348594|gb|EJY73736.1| Cysteine protease [Oxytricha trifallax]
          Length = 362

 Score = 43.1 bits (100), Expect = 0.023,   Method: Composition-based stats.
 Identities = 26/92 (28%), Positives = 44/92 (47%), Gaps = 8/92 (8%)

Query: 9   ATLALFGQMKSN--------NELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFED 60
           A LAL G +  N        N L   +PE    F  F+    +S+ T+EE   R A+F D
Sbjct: 12  AALALIGVLNLNESSLENNSNLLLKVSPEVQSAFNNFVSRQQRSFLTQEEFKARLAIFRD 71

Query: 61  NLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           N + ++  N  +  +    IN  +D +++E++
Sbjct: 72  NYERVQLHNSQKDVSFKLAINKFADWSKQELQ 103


>gi|300175245|emb|CBK20556.2| unnamed protein product [Blastocystis hominis]
          Length = 325

 Score = 43.1 bits (100), Expect = 0.023,   Method: Composition-based stats.
 Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 1/66 (1%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          QF  F + F K+Y  +EE   R +VF +NLK+++  N  +  +   GI    DL+ +E +
Sbjct: 23 QFAAFEKKFGKTYVGEEERRFRMSVFSNNLKIVDYYNS-KQSSFVLGITPFIDLSNDEFR 81

Query: 93 SRLGLN 98
           R   N
Sbjct: 82 ERFASN 87


>gi|312281839|dbj|BAJ33785.1| unnamed protein product [Thellungiella halophila]
          Length = 373

 Score = 43.1 bits (100), Expect = 0.024,   Method: Composition-based stats.
 Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 3/94 (3%)

Query: 1   MAEDASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFED 60
           +AE +S++    +  Q+    E K  + E    F  F R F K Y + EE   R +VF+ 
Sbjct: 25  VAETSSSDGDDLVIRQVVDGAEPKVLSSE--DHFSLFKRKFGKVYASSEEHDYRLSVFKA 82

Query: 61  NLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
           NL+      K +  +A +G+   SDLTR E + +
Sbjct: 83  NLRRARRHQKLD-PSARHGVTQFSDLTRSEFRKK 115


>gi|118397743|ref|XP_001031203.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila]
 gi|89285527|gb|EAR83540.1| Papain family cysteine protease containing protein [Tetrahymena
          thermophila SB210]
          Length = 358

 Score = 42.7 bits (99), Expect = 0.024,   Method: Composition-based stats.
 Identities = 25/54 (46%), Positives = 32/54 (59%), Gaps = 1/54 (1%)

Query: 37 FIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          F   +SK Y +KE    RFA F +NLK I+ LN  E  TA + I+  SD T+EE
Sbjct: 42 FKNTYSKVYESKEVEQFRFATFVENLKEIDRLN-AEVTTAQFDISFFSDFTKEE 94


>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
          Length = 443

 Score = 42.7 bits (99), Expect = 0.024,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F    +++Y + +E  KRF +F  N+K    LN+ ++  AT+G N  +D+T EE ++
Sbjct: 25 FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNR-KNPMATFGPNEFADMTSEEFQT 83

Query: 94 R 94
          R
Sbjct: 84 R 84


>gi|332374780|gb|AEE62531.1| unknown [Dendroctonus ponderosae]
          Length = 544

 Score = 42.7 bits (99), Expect = 0.024,   Method: Composition-based stats.
 Identities = 25/80 (31%), Positives = 40/80 (50%), Gaps = 2/80 (2%)

Query: 23  LKTENPEHLK-QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
           +K E   HL+ +F KF +   + Y  + E   R  +F  N++ I   N+   G  +  +N
Sbjct: 228 IKPETTGHLEFEFNKFTKKHRRIYTNQNERLLRMEIFRQNVRFIHSHNRKNVGF-SLSVN 286

Query: 82  HLSDLTREEMKSRLGLNLSK 101
           HL+D T  E+K+  G   SK
Sbjct: 287 HLADKTETELKALRGKTYSK 306


>gi|343412631|emb|CCD21595.1| hypothetical protein, conserved in T. vivax [Trypanosoma vivax
          Y486]
          Length = 257

 Score = 42.7 bits (99), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F + + +SY T  E A R  VFEDN++    +    +  AT+G+   SDLT EE ++
Sbjct: 14 FAAFKQKYGRSYGTAAEEAFRLRVFEDNMRR-SRMYAAANPHATFGVTPFSDLTPEEFRT 72

Query: 94 R 94
          R
Sbjct: 73 R 73


>gi|294883322|ref|XP_002770704.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
 gi|239873993|gb|EER02713.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
          Length = 333

 Score = 42.7 bits (99), Expect = 0.024,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 37/64 (57%), Gaps = 2/64 (3%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F   F K+Y +KEE  KR A+F+ NL  IE +N  +  +   G+N  +DLT EE  +
Sbjct: 28 FMGFQHKFGKNYESKEEEVKRNAIFQANLHHIEQVNAKDL-SYKLGVNEHADLTHEEFAA 86

Query: 94 -RLG 96
           +LG
Sbjct: 87 LKLG 90


>gi|145547990|ref|XP_001459676.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124427502|emb|CAK92279.1| unnamed protein product [Paramecium tetraurelia]
          Length = 329

 Score = 42.7 bits (99), Expect = 0.024,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 36/60 (60%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          L QF+++  +F+K+Y +K E   RF ++  NL++I+  N   + + T G N   DLT +E
Sbjct: 22 LNQFQEWKTEFNKNYQSKYEEIYRFQIYIANLEIIQTHNSNNNYSYTLGENQFMDLTNDE 81


>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
          Length = 428

 Score = 42.7 bits (99), Expect = 0.024,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F    +++Y + +E  KRF +F  N+K    LN+ ++  AT+G N  +D+T EE ++
Sbjct: 10 FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNR-KNPMATFGPNEFADMTSEEFQT 68

Query: 94 R 94
          R
Sbjct: 69 R 69


>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
 gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 452

 Score = 42.7 bits (99), Expect = 0.024,   Method: Composition-based stats.
 Identities = 20/72 (27%), Positives = 39/72 (54%)

Query: 22  ELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGIN 81
           E      E  + +E+++ +  K+Y    E  +RF +F+DNLK +E+ +   + T   G+ 
Sbjct: 31  ETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLT 90

Query: 82  HLSDLTREEMKS 93
             +DLT +E ++
Sbjct: 91  RFADLTNDEFRA 102


>gi|294869083|ref|XP_002765753.1| Cysteine proteinase 3 precursor, putative [Perkinsus marinus ATCC
          50983]
 gi|239865917|gb|EEQ98470.1| Cysteine proteinase 3 precursor, putative [Perkinsus marinus ATCC
          50983]
          Length = 174

 Score = 42.7 bits (99), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 1/60 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F +   KSY  KEE  KR A+F DNL  IE++N  ++ +   G+N  +DLT EE  +
Sbjct: 27 FIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVNA-QNLSYKLGVNEYTDLTLEEFAA 85


>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 461

 Score = 42.7 bits (99), Expect = 0.024,   Method: Composition-based stats.
 Identities = 27/90 (30%), Positives = 45/90 (50%), Gaps = 9/90 (10%)

Query: 3   EDASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNL 62
           ED ++E+ L +   ++  N L       L+QF  +     K+Y   E+   RFAV++DNL
Sbjct: 30  EDGTSESFLHMTTDLEHENLL-------LEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNL 82

Query: 63  KLIEDLNKGEHGTATYGINHLSDLTREEMK 92
             I   +   + T + G+   +DLT EE +
Sbjct: 83  AYIR--HSETNRTYSLGLTKFADLTNEEFR 110


>gi|66803148|ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
 gi|166201987|sp|P04988.2|CYSP1_DICDI RecName: Full=Cysteine proteinase 1; Flags: Precursor
 gi|60463731|gb|EAL61909.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
          Length = 343

 Score = 42.7 bits (99), Expect = 0.024,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 4/68 (5%)

Query: 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK---GEHGTATYGINHLSD 85
          E   QF +F   F+K Y + EE  +RF +F+ NL  IE+LN           +G+N  +D
Sbjct: 24 EEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFAD 82

Query: 86 LTREEMKS 93
          L+ +E K+
Sbjct: 83 LSSDEFKN 90


>gi|389615359|dbj|BAM20657.1| cathepsin L, partial [Papilio polytes]
          Length = 377

 Score = 42.7 bits (99), Expect = 0.024,   Method: Composition-based stats.
 Identities = 22/68 (32%), Positives = 36/68 (52%), Gaps = 1/68 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F++F    +K Y ++ E AKR  +F  NL+ I   N+   G  T  +NHL+D   +E+ 
Sbjct: 246 EFDRFKMKHNKKYASEIEHAKRLNIFRQNLRYIHSNNRARRGY-TLAVNHLADWAEDELA 304

Query: 93  SRLGLNLS 100
           +  G   S
Sbjct: 305 ALRGRRYS 312


>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 347

 Score = 42.7 bits (99), Expect = 0.024,   Method: Composition-based stats.
 Identities = 18/63 (28%), Positives = 37/63 (58%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++ E+++  F++ Y  + E   RF +F+ NL+ ++  N  ++ T    +N  SDLT EE
Sbjct: 32 IEKHEQWMARFNRVYSDESEKRNRFNIFKKNLEFVQSFNMNKNITYKLDVNEFSDLTDEE 91

Query: 91 MKS 93
           ++
Sbjct: 92 FRA 94


>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
          Length = 341

 Score = 42.7 bits (99), Expect = 0.024,   Method: Composition-based stats.
 Identities = 22/64 (34%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           ++ FE ++    K Y T +E   RF  F+DNL  I++ NK ++ +   G+N  +DLT +E
Sbjct: 45  IRLFESWMLKHDKVYKTIDEKIYRFETFKDNLMYIDETNK-KNNSYWLGLNEFADLTHDE 103

Query: 91  MKSR 94
            K +
Sbjct: 104 FKEK 107


>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
           vulgaris gb|U52970 and is a member of the papain
           cysteine protease family PF|00112 [Arabidopsis thaliana]
 gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 343

 Score = 42.7 bits (99), Expect = 0.025,   Method: Composition-based stats.
 Identities = 27/72 (37%), Positives = 42/72 (58%), Gaps = 3/72 (4%)

Query: 31  LKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           LKQ FEK+++  SK Y  ++E   RF +++ N++LI+ +N   H       N  +D+T  
Sbjct: 39  LKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINS-LHLPFKLTDNRFADMTNS 97

Query: 90  EMKSR-LGLNLS 100
           E K+  LGLN S
Sbjct: 98  EFKAHFLGLNTS 109


>gi|323447420|gb|EGB03341.1| hypothetical protein AURANDRAFT_15921 [Aureococcus
          anophagefferens]
          Length = 124

 Score = 42.7 bits (99), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 1/57 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          F+ +  DFS++Y T +E A+R+A F+ NL  ++ LN G H  A +G+   +D +  E
Sbjct: 8  FDAWAADFSRAYATADERAERYAHFKKNLAEVDRLN-GAHPYALFGLTRFADRSDAE 63


>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 343

 Score = 42.7 bits (99), Expect = 0.025,   Method: Composition-based stats.
 Identities = 27/72 (37%), Positives = 42/72 (58%), Gaps = 3/72 (4%)

Query: 31  LKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           LKQ FEK+++  SK Y  ++E   RF +++ N++LI+ +N   H       N  +D+T  
Sbjct: 39  LKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINS-LHLPFKLTDNRFADMTNS 97

Query: 90  EMKSR-LGLNLS 100
           E K+  LGLN S
Sbjct: 98  EFKAHFLGLNTS 109


>gi|294901125|ref|XP_002777247.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239884778|gb|EER09063.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 214

 Score = 42.7 bits (99), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 1/60 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F +   KSY  KEE  KR A+F DNL  IE++N  ++ +   G+N  +DLT EE  +
Sbjct: 27 FIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVNA-QNLSYKLGVNEYTDLTLEEFAA 85


>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score = 42.7 bits (99), Expect = 0.025,   Method: Composition-based stats.
 Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 1/85 (1%)

Query: 11 LALFGQMKSNNELKT-ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN 69
          L +FG +      +T E+    ++ E+++  + K Y    E   R  +F++N++ IE  N
Sbjct: 15 LLVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFKENVQRIEAFN 74

Query: 70 KGEHGTATYGINHLSDLTREEMKSR 94
             + +   GIN  +DLT EE K+R
Sbjct: 75 NAGNKSYKLGINQFADLTNEEFKAR 99


>gi|389583697|dbj|GAB66431.1| vivapain [Plasmodium cynomolgi strain B]
          Length = 487

 Score = 42.7 bits (99), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 37/69 (53%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  F  F+++F K Y T +E+ +R+  F +NL  I+  N  E+     G+N   DL
Sbjct: 160 NLESVNSFYLFVKEFGKKYKTADEMQQRYQSFVENLAKIKAHNSKENVLYRKGMNQFGDL 219

Query: 87  TREEMKSRL 95
           + EE K + 
Sbjct: 220 SFEEFKKKF 228


>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
          Length = 501

 Score = 42.7 bits (99), Expect = 0.025,   Method: Composition-based stats.
 Identities = 20/52 (38%), Positives = 31/52 (59%), Gaps = 1/52 (1%)

Query: 43  KSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
           + Y   EE AKRF +F++NLK + + N   H   T G+N  +D++ EE K +
Sbjct: 55  RVYKHAEETAKRFEIFKENLKYVIERNSKGH-RHTLGMNKFADMSNEEFKEK 105


>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score = 42.7 bits (99), Expect = 0.025,   Method: Composition-based stats.
 Identities = 30/84 (35%), Positives = 51/84 (60%), Gaps = 7/84 (8%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           S+ +LK+ + + ++ FE +I    K Y + EE   RF +F+DNLK I++ NK     + Y
Sbjct: 34  SSEDLKSMD-KLIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNK---VVSNY 89

Query: 79  --GINHLSDLTREEMKSR-LGLNL 99
             G+N  +DL+ +E K++ LGL +
Sbjct: 90  WLGLNEFADLSHQEFKNKYLGLKV 113


>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
 gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score = 42.7 bits (99), Expect = 0.025,   Method: Composition-based stats.
 Identities = 22/79 (27%), Positives = 42/79 (53%), Gaps = 5/79 (6%)

Query: 15 GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
          GQ+    E +T      + +E ++    ++Y    E  +RF +F+DNLK I++ N   + 
Sbjct: 11 GQVPERTEAETR-----RIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNP 65

Query: 75 TATYGINHLSDLTREEMKS 93
          +   G+N  +DL+ +E +S
Sbjct: 66 SYKLGLNKFADLSNDEYRS 84


>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
          Length = 361

 Score = 42.7 bits (99), Expect = 0.025,   Method: Composition-based stats.
 Identities = 21/71 (29%), Positives = 39/71 (54%), Gaps = 1/71 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
            ++ E+++  + + Y  + E + RF +F DN+K IE+ NK    +    +N  +D T EE
Sbjct: 54  FERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKFIEEFNKDGRQSYKLAVNEFADQTNEE 113

Query: 91  MK-SRLGLNLS 100
            + SR G  ++
Sbjct: 114 FQASRNGYKMA 124


>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score = 42.7 bits (99), Expect = 0.025,   Method: Composition-based stats.
 Identities = 21/65 (32%), Positives = 36/65 (55%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E  + +E+++ +  K+Y    E   RF +F DNLK IE+ N   + T   G+   +DLT 
Sbjct: 38  EARRMYEQWLVENRKNYNGLGEKETRFEIFTDNLKYIEEHNSVPNQTFEVGLTRFADLTN 97

Query: 89  EEMKS 93
           +E ++
Sbjct: 98  DEFRA 102


>gi|47606562|gb|AAT36265.1| vivapain-3 [Plasmodium vivax]
 gi|47606564|gb|AAT36266.1| vivapain-3 [Plasmodium vivax]
 gi|47606566|gb|AAT36267.1| vivapain-3 [Plasmodium vivax]
 gi|47606568|gb|AAT36268.1| vivapain-3 [Plasmodium vivax]
 gi|47606570|gb|AAT36269.1| vivapain-3 [Plasmodium vivax]
 gi|47606572|gb|AAT36270.1| vivapain-3 [Plasmodium vivax]
 gi|47606574|gb|AAT36271.1| vivapain-3 [Plasmodium vivax]
 gi|47606588|gb|AAT36278.1| vivapain-3 [Plasmodium vivax]
 gi|47606590|gb|AAT36279.1| vivapain-3 [Plasmodium vivax]
 gi|47606592|gb|AAT36280.1| vivapain-3 [Plasmodium vivax]
 gi|47606594|gb|AAT36281.1| vivapain-3 [Plasmodium vivax]
 gi|47606596|gb|AAT36282.1| vivapain-3 [Plasmodium vivax]
 gi|47606598|gb|AAT36283.1| vivapain-3 [Plasmodium vivax]
 gi|47606600|gb|AAT36284.1| vivapain-3 [Plasmodium vivax]
          Length = 495

 Score = 42.7 bits (99), Expect = 0.025,   Method: Composition-based stats.
 Identities = 22/68 (32%), Positives = 37/68 (54%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  F  F+++  K Y T +E+ +R+  F +NL  I+  N  E+     G+N   DL
Sbjct: 166 NLETVNSFYLFMKEHGKEYSTADEMQQRYLSFAENLAKIKAHNSRENVLYRKGMNRFGDL 225

Query: 87  TREEMKSR 94
           + EE+K +
Sbjct: 226 SFEEIKKK 233


>gi|432910512|ref|XP_004078392.1| PREDICTED: cathepsin K-like [Oryzias latipes]
          Length = 331

 Score = 42.7 bits (99), Expect = 0.026,   Method: Composition-based stats.
 Identities = 27/72 (37%), Positives = 42/72 (58%), Gaps = 6/72 (8%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK----GEHGTATYGINHLSDLTR 88
           +E++    +K Y T EE   R A++E NL++IE  N+    G H T T G+N   D+T+
Sbjct: 27 HWEEWKMTHTKEYITVEEEGIRRAIWEKNLRMIEAHNQEAALGMH-TYTLGMNQFGDMTQ 85

Query: 89 EEMKSRL-GLNL 99
          EE+  R+ GL +
Sbjct: 86 EEVVERMTGLQM 97


>gi|387178006|gb|AFJ68066.1| Der f 1 variant, partial [Dermatophagoides farinae]
          Length = 305

 Score = 42.7 bits (99), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 30/69 (43%), Positives = 44/69 (63%), Gaps = 12/69 (17%)

Query: 28 PEHLKQFEKFIRDFSKSYPT--KEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
          P  +K FE+F + F+K+Y T  +EEVA++   F ++LK +E  NKG        INHLSD
Sbjct: 4  PASIKIFEEFKKAFNKNYATVEEEEVARK--NFLESLKYVE-ANKG-------AINHLSD 53

Query: 86 LTREEMKSR 94
          L+ +E K+R
Sbjct: 54 LSLDEFKNR 62


>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 348

 Score = 42.7 bits (99), Expect = 0.026,   Method: Composition-based stats.
 Identities = 19/63 (30%), Positives = 36/63 (57%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++ E+++  F++ Y  + E   RF +F+ NL+ +++ N     T    IN  SDLT EE
Sbjct: 32 IEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEE 91

Query: 91 MKS 93
           ++
Sbjct: 92 FRA 94


>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score = 42.7 bits (99), Expect = 0.026,   Method: Composition-based stats.
 Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ EK++  + + Y    E  KRF VF++N+  IE  N          IN  +DL  EE 
Sbjct: 35  ERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEF 94

Query: 92  KSRLGLNLSK 101
           K+ L +N+ K
Sbjct: 95  KALL-INVQK 103


>gi|47606576|gb|AAT36272.1| vivapain-3 [Plasmodium vivax]
 gi|47606584|gb|AAT36276.1| vivapain-3 [Plasmodium vivax]
 gi|47606586|gb|AAT36277.1| vivapain-3 [Plasmodium vivax]
          Length = 495

 Score = 42.7 bits (99), Expect = 0.026,   Method: Composition-based stats.
 Identities = 22/68 (32%), Positives = 37/68 (54%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  F  F+++  K Y T +E+ +R+  F +NL  I+  N  E+     G+N   DL
Sbjct: 166 NLETVNSFYLFMKEHGKEYSTADEMQQRYLSFAENLAKIKAHNSRENVLYRKGMNRFGDL 225

Query: 87  TREEMKSR 94
           + EE+K +
Sbjct: 226 SFEEIKKK 233


>gi|156098482|ref|XP_001615273.1| vivapain-3 [Plasmodium vivax Sal-1]
 gi|32395685|gb|AAP04594.1| vivapain-3 [Plasmodium vivax]
 gi|47606602|gb|AAT36285.1| vivapain-3 [Plasmodium vivax]
 gi|47606604|gb|AAT36286.1| vivapain-3 [Plasmodium vivax]
 gi|148804147|gb|EDL45546.1| vivapain-3 [Plasmodium vivax]
          Length = 495

 Score = 42.7 bits (99), Expect = 0.026,   Method: Composition-based stats.
 Identities = 22/68 (32%), Positives = 37/68 (54%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  F  F+++  K Y T +E+ +R+  F +NL  I+  N  E+     G+N   DL
Sbjct: 166 NLETVNSFYLFMKEHGKEYSTADEMQQRYLSFAENLAKIKAHNSRENVLYRKGMNRFGDL 225

Query: 87  TREEMKSR 94
           + EE+K +
Sbjct: 226 SFEEIKKK 233


>gi|225458119|ref|XP_002279862.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
 gi|302142581|emb|CBI19784.3| unnamed protein product [Vitis vinifera]
          Length = 368

 Score = 42.7 bits (99), Expect = 0.026,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           + FEKF   F K+Y T EE   RF VF+ NL+  +  ++    +A +G+   SDLT  E 
Sbjct: 50  RHFEKFKARFQKTYATPEEHDYRFNVFKANLRRAKR-HQLLDPSAVHGVTQFSDLTPAEF 108

Query: 92  -KSRLGLN 98
            +  LGLN
Sbjct: 109 RRDYLGLN 116


>gi|326516056|dbj|BAJ88051.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 362

 Score = 42.7 bits (99), Expect = 0.027,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H  +F +F   + KSY +  EV +RF +F ++L+ +   N+ +      GIN  SD++ E
Sbjct: 57  HALRFARFAVGYGKSYESAAEVRRRFRIFSESLEEVRSTNR-KGLPYRLGINRFSDMSWE 115

Query: 90  EMK-SRLG 96
           E + +RLG
Sbjct: 116 EFQATRLG 123


>gi|47606578|gb|AAT36273.1| vivapain-3 [Plasmodium vivax]
 gi|47606580|gb|AAT36274.1| vivapain-3 [Plasmodium vivax]
 gi|47606582|gb|AAT36275.1| vivapain-3 [Plasmodium vivax]
          Length = 495

 Score = 42.7 bits (99), Expect = 0.027,   Method: Composition-based stats.
 Identities = 22/68 (32%), Positives = 37/68 (54%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E +  F  F+++  K Y T +E+ +R+  F +NL  I+  N  E+     G+N   DL
Sbjct: 166 NLETVNSFYLFMKEHGKEYSTADEMQQRYLSFAENLAKIKAHNSRENVLYRKGMNRFGDL 225

Query: 87  TREEMKSR 94
           + EE+K +
Sbjct: 226 SFEEIKKK 233


>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
 gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 345

 Score = 42.7 bits (99), Expect = 0.027,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 34/60 (56%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          + + E+++  FS+ Y  + E   R  VF+ NLK IE+ NK  + +   G+N  +D T EE
Sbjct: 36 VDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEE 95


>gi|351629613|gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
          Length = 397

 Score = 42.7 bits (99), Expect = 0.027,   Method: Composition-based stats.
 Identities = 25/83 (30%), Positives = 40/83 (48%), Gaps = 11/83 (13%)

Query: 15  GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHG 74
           G+  +N+ L     E    F+ F+ ++ K+Y T EE   R  +F  NL     +   EH 
Sbjct: 57  GRSSANHRLLGTTTE--VHFKSFVEEYEKTYSTHEEYVHRLGIFAKNL-----IKAAEHQ 109

Query: 75  ----TATYGINHLSDLTREEMKS 93
               +A +G+   SDLT EE ++
Sbjct: 110 AMDPSAIHGVTQFSDLTEEEFEA 132


>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 345

 Score = 42.7 bits (99), Expect = 0.028,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 34/60 (56%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          + + E+++  FS+ Y  + E   R  VF+ NLK IE+ NK  + +   G+N  +D T EE
Sbjct: 36 VDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEE 95


>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1-like [Glycine max]
          Length = 400

 Score = 42.7 bits (99), Expect = 0.028,   Method: Composition-based stats.
 Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 3/80 (3%)

Query: 16  QMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           Q +S + L+    E   + EK++  + K Y    E+ KRF +F++N++ IE  N      
Sbjct: 100 QCRSKSRLEACTSE---RHEKWMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFNVAGDKP 156

Query: 76  ATYGINHLSDLTREEMKSRL 95
               IN   DL  EE K+ L
Sbjct: 157 FNIRINQFPDLHDEEFKALL 176


>gi|195484884|ref|XP_002090861.1| GE12567 [Drosophila yakuba]
 gi|194176962|gb|EDW90573.1| GE12567 [Drosophila yakuba]
          Length = 299

 Score = 42.7 bits (99), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 3/64 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
           +EKF+ DF   Y    E  KR  +F DN K I++ N + + G  ++  G+N  SDLT EE
Sbjct: 227 WEKFLVDFKVKYQDDTETEKRRNIFCDNWKAIQEHNVQFDLGVESFKKGVNQWSDLTVEE 286

Query: 91  MKSR 94
            K++
Sbjct: 287 WKNK 290



 Score = 40.0 bits (92), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 3/64 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
           ++KF+ DF   Y    E+ +R  VF  N + + D N K + G  ++  GIN  SDLT EE
Sbjct: 71  WKKFLVDFDVHYDNYSELQRRRKVFCGNWQKVSDHNLKYDSGVVSFRKGINQFSDLTFEE 130

Query: 91  MKSR 94
            K +
Sbjct: 131 WKEK 134


>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 345

 Score = 42.7 bits (99), Expect = 0.028,   Method: Composition-based stats.
 Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 1/82 (1%)

Query: 18  KSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTAT 77
           ++ + +    P      +K++ +FS+ Y  + E   R  VF +NLK IE+ N     +  
Sbjct: 22  EATSRVALHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSYK 81

Query: 78  YGINHLSDLTREE-MKSRLGLN 98
            G+N  +D T+EE + +  GL+
Sbjct: 82  LGVNKFTDWTKEEFLATHTGLS 103


>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 473

 Score = 42.7 bits (99), Expect = 0.029,   Method: Composition-based stats.
 Identities = 25/57 (43%), Positives = 35/57 (61%), Gaps = 2/57 (3%)

Query: 42  SKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGL 97
           SK Y + EE  KR+ VF+ NLK I + N+  +G+   G+N  +D+  EE KS  LGL
Sbjct: 56  SKIYVSPEEKVKRYEVFKQNLKHIVETNR-RNGSYWLGLNQFADVAHEEFKSTYLGL 111


>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score = 42.7 bits (99), Expect = 0.029,   Method: Composition-based stats.
 Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ EK++  + + Y    E  KRF VF++N+  IE  N          IN  +DL  EE 
Sbjct: 35  ERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEF 94

Query: 92  KSRLGLNLSK 101
           K+ L +N+ K
Sbjct: 95  KALL-INVQK 103


>gi|340375899|ref|XP_003386471.1| PREDICTED: probable cysteine proteinase A494-like [Amphimedon
          queenslandica]
          Length = 373

 Score = 42.7 bits (99), Expect = 0.030,   Method: Composition-based stats.
 Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 1/87 (1%)

Query: 7  AEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
          AE+   LF  +  + +L   + E    F  + +  SKSY T  E  +R +V++ N  L++
Sbjct: 6  AESISFLFIFLLCSFQLAVSSNEPQLSFTDWCKLHSKSYRTITEAKERESVYKSNADLVQ 65

Query: 67 DLN-KGEHGTATYGINHLSDLTREEMK 92
           LN +      T+ +NH +DL+ EE K
Sbjct: 66 QLNNEYRERNVTFSLNHFADLSIEEFK 92


>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score = 42.7 bits (99), Expect = 0.030,   Method: Composition-based stats.
 Identities = 33/98 (33%), Positives = 57/98 (58%), Gaps = 11/98 (11%)

Query: 9   ATLALFGQMK----SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKL 64
           A+LA+ G       S+ +LK+ + + ++ FE ++    K Y + EE   RF +F+DNLK 
Sbjct: 19  ASLAVAGDFSIVGYSSEDLKSMD-KLIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKH 77

Query: 65  IEDLNKGEHGTATY--GINHLSDLTREEMKSR-LGLNL 99
           I++ NK     + Y  G+N  +DL+ +E K++ LGL +
Sbjct: 78  IDERNK---VVSNYWLGLNEFADLSHQEFKNKYLGLKV 112


>gi|54300680|gb|AAV32963.1| cysteine proteinase inhibitor [Oncorhynchus mykiss]
          Length = 131

 Score = 42.7 bits (99), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 39/63 (61%), Gaps = 3/63 (4%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSDLTR 88
           K+F+ +   + K+YP+ EE AKR  ++    K++ + NK  E+G  +Y   +NH +DLT 
Sbjct: 61  KEFQTWKVKYGKTYPSPEEEAKRKEIWLATRKMVTEHNKRAENGLESYTLAVNHFADLTT 120

Query: 89  EEM 91
           +E+
Sbjct: 121 QEV 123


>gi|113603|sp|P05167.1|ALEU_HORVU RecName: Full=Thiol protease aleurain; Flags: Precursor
 gi|19021|emb|CAA28804.1| aleurain [Hordeum vulgare]
          Length = 362

 Score = 42.7 bits (99), Expect = 0.030,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 39/68 (57%), Gaps = 2/68 (2%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
           H  +F +F   + KSY +  EV +RF +F ++L+ +   N+ +      GIN  SD++ E
Sbjct: 57  HALRFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNR-KGLPYRLGINRFSDMSWE 115

Query: 90  EMK-SRLG 96
           E + +RLG
Sbjct: 116 EFQATRLG 123


>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
          Length = 361

 Score = 42.7 bits (99), Expect = 0.031,   Method: Composition-based stats.
 Identities = 27/69 (39%), Positives = 36/69 (52%), Gaps = 4/69 (5%)

Query: 30  HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATYGINHLSDLTR 88
           H   F +F R + K Y + EE+  RFA F  NL LI   N KG   +   G+N  +D + 
Sbjct: 58  HALSFARFARRYGKIYESVEEMKLRFATFSKNLDLIRSTNCKGL--SYRLGLNKFADWSW 115

Query: 89  EEM-KSRLG 96
           EE  + RLG
Sbjct: 116 EEFQRHRLG 124


>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score = 42.7 bits (99), Expect = 0.031,   Method: Composition-based stats.
 Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 2/83 (2%)

Query: 11 LALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
          L L G + +  EL  ++   + + E ++  + + Y    E A++F VF+ N + I   N 
Sbjct: 15 LCLCGSVLAAREL-NDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFKANAEFINSFNA 73

Query: 71 GEHGTATYGINHLSDLTREEMKS 93
          G H     GIN  +D+T EE K+
Sbjct: 74 GNH-KFWLGINQFADITNEEFKA 95


>gi|1019670|gb|AAA79289.1| rangelipain, partial [Trypanosoma rangeli]
          Length = 265

 Score = 42.4 bits (98), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 34/63 (53%), Gaps = 1/63 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           QF  F +   K Y +  E   R  VF++NL L   L+   +  A++G+   SDLTREE 
Sbjct: 36 SQFAAFKQRHGKVYGSAAEETFRLGVFKENL-LFARLHAAANPHASFGVTPFSDLTREEF 94

Query: 92 KSR 94
          +SR
Sbjct: 95 RSR 97


>gi|302143414|emb|CBI21975.3| unnamed protein product [Vitis vinifera]
          Length = 286

 Score = 42.4 bits (98), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 33/59 (55%)

Query: 35 EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          E ++  + + Y   +E +KR+ +F+DN+  IE  NK    +    IN  +DLT EE ++
Sbjct: 40 EDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRA 98


>gi|261289781|ref|XP_002611752.1| hypothetical protein BRAFLDRAFT_284342 [Branchiostoma floridae]
 gi|229297124|gb|EEN67762.1| hypothetical protein BRAFLDRAFT_284342 [Branchiostoma floridae]
          Length = 327

 Score = 42.4 bits (98), Expect = 0.032,   Method: Composition-based stats.
 Identities = 25/72 (34%), Positives = 43/72 (59%), Gaps = 6/72 (8%)

Query: 23 LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI----EDLNKGEHGTATY 78
          + T +P    ++E F + +++ Y  +EE A+R  +FEDNLK I    E+ ++G H T   
Sbjct: 12 MATASPLMNPEWEVFKKAYNRVYAAEEEYARRL-IFEDNLKTIQMHNEEADRGLH-TFRL 69

Query: 79 GINHLSDLTREE 90
          G+N  +D+T +E
Sbjct: 70 GVNQYADMTHKE 81


>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
          Length = 443

 Score = 42.4 bits (98), Expect = 0.032,   Method: Composition-based stats.
 Identities = 24/82 (29%), Positives = 44/82 (53%), Gaps = 2/82 (2%)

Query: 12 ALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG 71
          AL G + +  +L  ++   + + E+++  + + Y    E A+RF VF+ N+ LIE +N G
Sbjct: 20 ALSGSLAAR-DLADQDQAMVARHEEWMAKYDRVYSDAAEKARRFEVFKANMALIESVNAG 78

Query: 72 EHGTATYGINHLSDLTREEMKS 93
           H       N  +DLT +E ++
Sbjct: 79 NHKFWLEA-NRFADLTDDEFRA 99


>gi|403340410|gb|EJY69490.1| Cysteine protease [Oxytricha trifallax]
          Length = 355

 Score = 42.4 bits (98), Expect = 0.032,   Method: Composition-based stats.
 Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 1/72 (1%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSD 85
           ++ + + +F  FI    K+Y TKEE   R  +F+ N   I+  N  E+      +N  +D
Sbjct: 40  QDQQVMLKFNDFISKHQKNYLTKEEYKARLGLFKQNFDYIQKSN-AENKDYVLDLNAFAD 98

Query: 86  LTREEMKSRLGL 97
           ++ EE   RLG 
Sbjct: 99  MSDEEYNKRLGF 110


>gi|90592736|ref|YP_529689.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
 gi|71559186|gb|AAZ38185.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
          Length = 343

 Score = 42.4 bits (98), Expect = 0.032,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE+FI  ++K Y  + E   RF +F  N++ I   N   + +A Y IN  +D+T+ E+  
Sbjct: 45  FEQFISQYNKQYKNEAEKRHRFNIFMHNIEEINQKNS-RNDSAVYKINRFADMTKNEVVI 103

Query: 94  R 94
           R
Sbjct: 104 R 104


>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
          Length = 342

 Score = 42.4 bits (98), Expect = 0.032,   Method: Composition-based stats.
 Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ EK++  + + Y    E  KRF VF++N+  IE  N          IN  +DL  EE 
Sbjct: 35  ERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEF 94

Query: 92  KSRLGLNLSK 101
           K+ L +N+ K
Sbjct: 95  KALL-INVQK 103


>gi|328722454|ref|XP_001951172.2| PREDICTED: counting factor associated protein D-like [Acyrthosiphon
           pisum]
          Length = 558

 Score = 42.4 bits (98), Expect = 0.032,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 2/64 (3%)

Query: 34  FEKFIRDFSKSYPTKEEVA-KRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           FE+F R  +K+YP    +   R  +F  NL+ I   N+   G  T  +NHL+D +  E+K
Sbjct: 252 FEEFKRKHNKNYPNDTIIHFDRKNIFRQNLRYIRSKNRANVGY-TLAVNHLADYSSTELK 310

Query: 93  SRLG 96
           S LG
Sbjct: 311 SMLG 314


>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
           [Vitis vinifera]
          Length = 374

 Score = 42.4 bits (98), Expect = 0.032,   Method: Composition-based stats.
 Identities = 22/65 (33%), Positives = 38/65 (58%), Gaps = 1/65 (1%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E +  ++ ++    K+Y    E  KRF +F+DNLK I++ N  ++ T   G+N  +DLT 
Sbjct: 41  EVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHN-AQNRTYKVGLNRFADLTN 99

Query: 89  EEMKS 93
           EE ++
Sbjct: 100 EEYRA 104


>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
          Length = 343

 Score = 42.4 bits (98), Expect = 0.032,   Method: Composition-based stats.
 Identities = 25/69 (36%), Positives = 38/69 (55%), Gaps = 4/69 (5%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY--GINHLSDLTRE 89
           K  ++++  + +SY    E+ KRF +F +NL+ IE  N    G  +Y   +N  SDLT E
Sbjct: 36  KTHQQWMLQYGRSYTNDAEMEKRFKIFMENLEYIEKFNNAP-GNKSYKLDLNQFSDLTNE 94

Query: 90  E-MKSRLGL 97
           E + S  GL
Sbjct: 95  EFIASHTGL 103


>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
          max]
          Length = 343

 Score = 42.4 bits (98), Expect = 0.033,   Method: Composition-based stats.
 Identities = 18/55 (32%), Positives = 32/55 (58%)

Query: 36 KFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          +++  ++K Y   +E  KRF +F++N+  IE  N  ++ +    IN  +DLT EE
Sbjct: 41 QWMARYAKVYKDPQEREKRFRIFKENVNYIETFNSADNKSYKLDINQFADLTNEE 95


>gi|403342666|gb|EJY70658.1| Cysteine protease [Oxytricha trifallax]
          Length = 367

 Score = 42.4 bits (98), Expect = 0.034,   Method: Composition-based stats.
 Identities = 25/90 (27%), Positives = 43/90 (47%), Gaps = 1/90 (1%)

Query: 10  TLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN 69
           T ++ G   +N +LK + P     F  F+    +S+ T+EE   R A+F D  + ++  N
Sbjct: 34  TESVSGNAATNLKLKVD-PSIQTAFNNFVSRHQRSFLTQEEYKARLAIFRDTFEAVQLHN 92

Query: 70  KGEHGTATYGINHLSDLTREEMKSRLGLNL 99
             E  +    IN  SD++++E      L L
Sbjct: 93  SLESKSYKLAINKFSDMSKDEFSKFSSLQL 122


>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
          Length = 465

 Score = 42.4 bits (98), Expect = 0.034,   Method: Composition-based stats.
 Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 3/73 (4%)

Query: 23  LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK--GEHGTATYGI 80
           L+   PE    +E ++ +  ++Y    E  +RF VF DNL+ ++  N+   EHG    G+
Sbjct: 41  LERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGF-RLGM 99

Query: 81  NHLSDLTREEMKS 93
           N  +DLT +E ++
Sbjct: 100 NQFADLTNDEFRA 112


>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
 gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
          Length = 462

 Score = 42.4 bits (98), Expect = 0.034,   Method: Composition-based stats.
 Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 3/73 (4%)

Query: 23  LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK--GEHGTATYGI 80
           L+   PE    +E ++ +  ++Y    E  +RF VF DNL+ ++  N+   EHG    G+
Sbjct: 38  LERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGF-RLGM 96

Query: 81  NHLSDLTREEMKS 93
           N  +DLT +E ++
Sbjct: 97  NQFADLTNDEFRA 109


>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
 gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
          Length = 330

 Score = 42.4 bits (98), Expect = 0.034,   Method: Composition-based stats.
 Identities = 25/61 (40%), Positives = 35/61 (57%), Gaps = 4/61 (6%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHGTATY--GINHLSDLTRE 89
          Q+E F     K Y  KEE A+R  +F+DNLK IE  N+  + G  +Y  G+N  +D+T  
Sbjct: 23 QWEAFKIKHDKVYSEKEEYARRL-IFQDNLKTIESHNQEADTGKHSYWLGVNQFADMTHA 81

Query: 90 E 90
          E
Sbjct: 82 E 82


>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
 gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
          Length = 368

 Score = 42.4 bits (98), Expect = 0.035,   Method: Composition-based stats.
 Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 11/87 (12%)

Query: 15  GQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI---EDLNKG 71
           GQ  S++ L +    H   F  F   F KSY ++EE   RF+VF+ NL+     ++L+  
Sbjct: 37  GQDASSSNLLSAEQHH---FSLFKSKFKKSYGSQEEHDYRFSVFKANLRRAARHQELDP- 92

Query: 72  EHGTATYGINHLSDLTREEMKSR-LGL 97
              TA++G+   SDLT  E + + LGL
Sbjct: 93  ---TASHGVTQFSDLTPAEFRKQVLGL 116


>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
           Precursor
 gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 362

 Score = 42.4 bits (98), Expect = 0.036,   Method: Composition-based stats.
 Identities = 21/75 (28%), Positives = 40/75 (53%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           +  E++    E    +E+++ +  K+Y    E  +RF +F+DNLK +++ N     T   
Sbjct: 29  TETEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEV 88

Query: 79  GINHLSDLTREEMKS 93
           G+   +DLT EE ++
Sbjct: 89  GLTRFADLTNEEFRA 103


>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
          Length = 362

 Score = 42.4 bits (98), Expect = 0.036,   Method: Composition-based stats.
 Identities = 21/75 (28%), Positives = 40/75 (53%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           +  E++    E    +E+++ +  K+Y    E  +RF +F+DNLK +++ N     T   
Sbjct: 29  TETEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEV 88

Query: 79  GINHLSDLTREEMKS 93
           G+   +DLT EE ++
Sbjct: 89  GLTRFADLTNEEFRA 103


>gi|58201366|gb|AAW66804.1| cysteine protease [Pinus taeda]
 gi|58201368|gb|AAW66805.1| cysteine protease [Pinus taeda]
 gi|58201392|gb|AAW66817.1| cysteine protease [Pinus taeda]
 gi|58201394|gb|AAW66818.1| cysteine protease [Pinus taeda]
 gi|58201398|gb|AAW66820.1| cysteine protease [Pinus taeda]
          Length = 193

 Score = 42.4 bits (98), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 44/75 (58%), Gaps = 2/75 (2%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATY 78
           SN +L+ E+   ++ +E ++ +  K+Y   +E  KRF VF+DN   I + N+G   +   
Sbjct: 28  SNKDLR-EDDAIMELYELWVAEHKKAYNGLDEKQKRFTVFKDNFLYIHEHNQGNR-SYKL 85

Query: 79  GINHLSDLTREEMKS 93
           G+N  +DL+ EE K+
Sbjct: 86  GLNQFADLSHEEFKA 100


>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 358

 Score = 42.4 bits (98), Expect = 0.036,   Method: Composition-based stats.
 Identities = 19/63 (30%), Positives = 35/63 (55%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          + +F  F   ++++Y + EE  +RF V+  N+  IE +N+    T   G N  +DLT +E
Sbjct: 37 MDRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADLTVQE 96

Query: 91 MKS 93
           ++
Sbjct: 97 FRA 99


>gi|118364806|ref|XP_001015624.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297391|gb|EAR95379.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 375

 Score = 42.4 bits (98), Expect = 0.036,   Method: Composition-based stats.
 Identities = 26/68 (38%), Positives = 36/68 (52%), Gaps = 2/68 (2%)

Query: 34  FEKFIRDFSKSYPTKE-EVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           FE++I DF K Y     E  +R   FE NL  I   N  +H +   G+N  +DLT +E +
Sbjct: 29  FEQYIVDFEKEYEVDSVEYNQRKQTFEKNLVEIIAFNNKDH-SYKKGVNRNTDLTTKEFQ 87

Query: 93  SRLGLNLS 100
            +LGL  S
Sbjct: 88  VQLGLKKS 95


>gi|356570072|ref|XP_003553215.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 3-like,
          partial [Glycine max]
          Length = 301

 Score = 42.4 bits (98), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 34/68 (50%), Gaps = 2/68 (2%)

Query: 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTRE 89
          H   F  F     K Y + +E+   F +F DNLKLI   N+    T   G+NH +D T E
Sbjct: 29 HALSFACFACHHDKRYHSIDEIRNGFQIFSDNLKLIRSTNR-RSLTYMLGVNHFADWTWE 87

Query: 90 EM-KSRLG 96
          E  + +LG
Sbjct: 88 EFTRHKLG 95


>gi|48374352|gb|AAT09103.1| digestive cysteine proteinase [Bigelowiella natans]
          Length = 360

 Score = 42.4 bits (98), Expect = 0.037,   Method: Composition-based stats.
 Identities = 20/65 (30%), Positives = 38/65 (58%)

Query: 31 LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          + +FE + ++F KSY    +  K    F +N ++I+ LN+ E G+A YG    SD++ E+
Sbjct: 21 IPKFEAWKKEFGKSYEEAGKEDKARLNFVENERIIQGLNENELGSAVYGHTRFSDMSPEQ 80

Query: 91 MKSRL 95
           ++ +
Sbjct: 81 FRAMM 85


>gi|390344145|ref|XP_798313.2| PREDICTED: cathepsin O-like [Strongylocentrotus purpuratus]
          Length = 361

 Score = 42.4 bits (98), Expect = 0.037,   Method: Composition-based stats.
 Identities = 25/62 (40%), Positives = 36/62 (58%), Gaps = 3/62 (4%)

Query: 34  FEKFIRDFSKSYPT-KEEVAKRFAVFEDNLKLIEDLNK--GEHGTATYGINHLSDLTREE 90
           F+ FI+ F+K+Y    +E  KR+ +F+++L   E LN        ATYGI   SDLT EE
Sbjct: 54  FQIFIQKFNKTYTRGSQEYFKRYRIFKESLLKHEMLNAIATHRDHATYGITKFSDLTSEE 113

Query: 91  MK 92
            +
Sbjct: 114 FQ 115


>gi|326428462|gb|EGD74032.1| hypothetical protein PTSG_05727 [Salpingoeca sp. ATCC 50818]
          Length = 398

 Score = 42.4 bits (98), Expect = 0.037,   Method: Composition-based stats.
 Identities = 25/65 (38%), Positives = 32/65 (49%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE F  ++ K Y + EE   R  VFE  L  ++  N     T   GINH+SD T  E K 
Sbjct: 57  FEHFKAEYGKRYLSSEEHDFRRQVFERTLASVKAHNSDPTKTWKQGINHMSDWTDGEFKR 116

Query: 94  RLGLN 98
            LG +
Sbjct: 117 LLGYD 121


>gi|224082940|ref|XP_002306900.1| predicted protein [Populus trichocarpa]
 gi|118481986|gb|ABK92924.1| unknown [Populus trichocarpa]
 gi|222856349|gb|EEE93896.1| predicted protein [Populus trichocarpa]
          Length = 367

 Score = 42.4 bits (98), Expect = 0.037,   Method: Composition-based stats.
 Identities = 32/90 (35%), Positives = 44/90 (48%), Gaps = 11/90 (12%)

Query: 5   ASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKL 64
           +S++    L  Q+ S  E    N EH   F  F   F K+Y T+EE   RF+VF+ NL  
Sbjct: 24  SSSDLDDPLIRQVVSEGEDHLLNAEH--HFTTFKSKFGKNYATQEEHDYRFSVFKANL-- 79

Query: 65  IEDLNKGEHG----TATYGINHLSDLTREE 90
              L   +H     TA +G+   SDLT +E
Sbjct: 80  ---LRAKKHQIMDPTAAHGVTKFSDLTPKE 106


>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 361

 Score = 42.4 bits (98), Expect = 0.037,   Method: Composition-based stats.
 Identities = 19/59 (32%), Positives = 32/59 (54%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           ++ E+++  + K Y   +E  KRF +F++N+  IE  N   +      IN  +DLT EE
Sbjct: 55  ERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEE 113


>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella
          moellendorffii]
 gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella
          moellendorffii]
          Length = 300

 Score = 42.4 bits (98), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 31/60 (51%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE +     KSY +  E A+R  +F D L  IE  N   + T T G+N  SDLT  E ++
Sbjct: 2  FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61


>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella
          moellendorffii]
 gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella
          moellendorffii]
 gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella
          moellendorffii]
 gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella
          moellendorffii]
          Length = 300

 Score = 42.4 bits (98), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 31/60 (51%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE +     KSY +  E A+R  +F D L  IE  N   + T T G+N  SDLT  E ++
Sbjct: 2  FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61


>gi|225427714|ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
          Length = 377

 Score = 42.4 bits (98), Expect = 0.038,   Method: Composition-based stats.
 Identities = 29/85 (34%), Positives = 44/85 (51%), Gaps = 3/85 (3%)

Query: 14  FGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEH 73
            G ++ + E      +H   F  F R F KSY ++EE   RF VF+ NL+      + + 
Sbjct: 43  LGDVEGSEEENLLTADH-HHFSIFKRRFGKSYASQEEHDYRFKVFKANLRRARRHQQLD- 100

Query: 74  GTATYGINHLSDLTREEMK-SRLGL 97
            +AT+G+   SDLT  E + + LGL
Sbjct: 101 PSATHGVTQFSDLTPAEFRGTYLGL 125


>gi|440293210|gb|ELP86353.1| cysteine protease, putative [Entamoeba invadens IP1]
          Length = 453

 Score = 42.4 bits (98), Expect = 0.038,   Method: Composition-based stats.
 Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 2/59 (3%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGINHLSDLTREE 90
          F  +   F KSY T+ E  +R A+F D  + I + N  +     A  G+N+LSDLT +E
Sbjct: 36 FASYKMLFQKSYNTQSEELRRLAIFADKSRFIAEFNTQRKSSNDALLGLNNLSDLTTDE 94


>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
          Length = 493

 Score = 42.4 bits (98), Expect = 0.038,   Method: Composition-based stats.
 Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 2/73 (2%)

Query: 23  LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGI 80
           L+    E    ++ ++ +  +SY    E  +RF VF DNLK ++  N    EHG    G+
Sbjct: 38  LERTEAEARAAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGM 97

Query: 81  NHLSDLTREEMKS 93
           N  +DLT +E ++
Sbjct: 98  NRFADLTNDEFRA 110


>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
          Length = 337

 Score = 42.4 bits (98), Expect = 0.039,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
          Length = 337

 Score = 42.4 bits (98), Expect = 0.039,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.4 bits (98), Expect = 0.039,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.4 bits (98), Expect = 0.040,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|113120273|gb|ABI30276.1| VXH-C [Vasconcellea x heilbornii]
          Length = 282

 Score = 42.4 bits (98), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 38/64 (59%), Gaps = 1/64 (1%)

Query: 31  LKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
           ++ FE ++    K Y + EE   RF +F+DNL  I++ NK ++ +   G+N  +DLT +E
Sbjct: 45  IRLFESWMLKHDKVYKSMEEKINRFEIFKDNLMYIDETNK-KNNSYWLGLNEFADLTHDE 103

Query: 91  MKSR 94
            K +
Sbjct: 104 FKKK 107


>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.4 bits (98), Expect = 0.040,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.4 bits (98), Expect = 0.040,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|401430350|ref|XP_003886559.1| unnamed protein product, partial [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|356491516|emb|CBZ40966.1| unnamed protein product, partial [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 503

 Score = 42.0 bits (97), Expect = 0.041,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE+F R + ++Y T  E  +R A FE NL+L+ + ++  +  A +GI    DL+  E  +
Sbjct: 98  FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 156

Query: 94  R 94
           R
Sbjct: 157 R 157


>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 533

 Score = 42.0 bits (97), Expect = 0.041,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
           FE+F R + ++Y T  E  +R A FE NL+L+ + ++  +  A +GI    DL+  E  +
Sbjct: 128 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 186

Query: 94  R 94
           R
Sbjct: 187 R 187


>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
          MHOM/GT/2001/U1103]
 gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
          MHOM/GT/2001/U1103]
          Length = 443

 Score = 42.0 bits (97), Expect = 0.041,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F R + ++Y T  E  +R A FE NL+L+ + ++  +  A +GI    DL+  E  +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96

Query: 94 R 94
          R
Sbjct: 97 R 97


>gi|2780176|emb|CAA71085.1| cystein proteinase [Leishmania mexicana]
          Length = 443

 Score = 42.0 bits (97), Expect = 0.041,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F R + ++Y T  E  +R A FE NL+L+ + ++  +  A +GI    DL+  E  +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96

Query: 94 R 94
          R
Sbjct: 97 R 97


>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
          Length = 443

 Score = 42.0 bits (97), Expect = 0.041,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F R + ++Y T  E  +R A FE NL+L+ + ++  +  A +GI    DL+  E  +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96

Query: 94 R 94
          R
Sbjct: 97 R 97


>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
          cysteine proteinase A-2; Flags: Precursor
 gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
          Length = 444

 Score = 42.0 bits (97), Expect = 0.041,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F R + ++Y T  E  +R A FE NL+L+ + ++  +  A +GI    DL+  E  +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96

Query: 94 R 94
          R
Sbjct: 97 R 97


>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
 gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
          Length = 443

 Score = 42.0 bits (97), Expect = 0.041,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F R + ++Y T  E  +R A FE NL+L+ + ++  +  A +GI    DL+  E  +
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-HQARNPHAQFGITKFFDLSEAEFAA 96

Query: 94 R 94
          R
Sbjct: 97 R 97


>gi|383852175|ref|XP_003701604.1| PREDICTED: cathepsin O-like [Megachile rotundata]
          Length = 370

 Score = 42.0 bits (97), Expect = 0.041,   Method: Composition-based stats.
 Identities = 23/71 (32%), Positives = 43/71 (60%), Gaps = 7/71 (9%)

Query: 29  EHLKQFEKFIRDFSKSY---PTKEEVAKRFAVFEDNLKLIEDLN--KGEHGTATYGINHL 83
           E +K F+ ++  ++K+Y   PT+ E  +RF  F+ +L+ IE +N  +    +A YG+   
Sbjct: 47  EDIKLFKNYVTRYNKTYRNDPTEYE--ERFQRFQRSLRHIETMNSLRSSPESAFYGLTEF 104

Query: 84  SDLTREEMKSR 94
           SD+T +E +S+
Sbjct: 105 SDMTEDEFRSQ 115


>gi|222637029|gb|EEE67161.1| hypothetical protein OsJ_24244 [Oryza sativa Japonica Group]
          Length = 309

 Score = 42.0 bits (97), Expect = 0.041,   Method: Composition-based stats.
 Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 1/63 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           QF  F+R   + Y   EE A+R  VF  NL      ++    TA +G+   SDLTREE +
Sbjct: 47  QFAAFVRRHGREYSGPEEYARRLRVFAANLAR-AAAHQALDPTARHGVTPFSDLTREEFE 105

Query: 93  SRL 95
           +RL
Sbjct: 106 ARL 108


>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score = 42.0 bits (97), Expect = 0.041,   Method: Composition-based stats.
 Identities = 19/59 (32%), Positives = 34/59 (57%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E+++  + K Y   +E  KRF +F++N+K IE  N   +     G+N  +DLT +E
Sbjct: 37 ERHEQWMARYGKVYKDLQEKEKRFNIFQENVKYIEASNNAGNKPYKLGVNQFTDLTNKE 95


>gi|213514356|ref|NP_001134251.1| Cathepsin M precursor [Salmo salar]
 gi|38423489|emb|CAD80246.1| cystein proteinase inhibitor protein [Salmo salar]
 gi|209731860|gb|ACI66799.1| Cathepsin M precursor [Salmo salar]
          Length = 342

 Score = 42.0 bits (97), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 3/67 (4%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK-GEHG--TATYGINHLSDLTR 88
           K+FE +   + K+YP+  E AKR  ++    K++ + NK  E+G  + T G+NH +DLT 
Sbjct: 272 KEFETWKVKYGKTYPSTVEEAKRKEIWLATRKMVMEHNKRAENGLESFTMGVNHFADLTA 331

Query: 89  EEMKSRL 95
           EE+   L
Sbjct: 332 EEVPRGL 338



 Score = 40.8 bits (94), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 39/67 (58%), Gaps = 3/67 (4%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVF-EDNLKLIEDLNKGEHGTATY--GINHLSDLTR 88
          K+FE +   + KSYP+ EE AKR  ++     K++E   +  +G  +Y   +NHL+DLT 
Sbjct: 32 KEFETWKVKYGKSYPSTEEEAKRKEMWLATRKKVMEHNTRAGNGLESYTMAVNHLADLTT 91

Query: 89 EEMKSRL 95
          EE+   L
Sbjct: 92 EEVPKGL 98



 Score = 35.4 bits (80), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 37/66 (56%), Gaps = 3/66 (4%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVF-EDNLKLIEDLNKGEHGTATY--GINHLSDLTR 88
           K+FE +     K+Y + EE AKR  ++     +++E   + E G+ ++  G+NHLSD T 
Sbjct: 195 KEFETWKVQHGKNYGSTEEEAKRKGIWLATRTRVMEHNKRAETGSESFTMGMNHLSDKTT 254

Query: 89  EEMKSR 94
            E+  R
Sbjct: 255 AEVTGR 260


>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 334

 Score = 42.0 bits (97), Expect = 0.042,   Method: Composition-based stats.
 Identities = 24/60 (40%), Positives = 32/60 (53%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
          +FE + R F KSY    E   R AV+E N  L++  N     + T G+N  +DLT EE K
Sbjct: 29 EFEAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFK 88


>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
 gi|255639509|gb|ACU20049.1| unknown [Glycine max]
          Length = 366

 Score = 42.0 bits (97), Expect = 0.042,   Method: Composition-based stats.
 Identities = 27/72 (37%), Positives = 39/72 (54%), Gaps = 4/72 (5%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N EH   F  F   F K+Y T+EE   RF +F++NL   +   K +  +A +G+   SDL
Sbjct: 46  NAEH--HFSAFKTKFGKTYATQEEHDHRFRIFKNNLLRAKSHQKLD-PSAVHGVTRFSDL 102

Query: 87  TREEMKSR-LGL 97
           T  E + + LGL
Sbjct: 103 TPAEFRRQFLGL 114


>gi|145520919|ref|XP_001446315.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413792|emb|CAK78918.1| unnamed protein product [Paramecium tetraurelia]
          Length = 317

 Score = 42.0 bits (97), Expect = 0.042,   Method: Composition-based stats.
 Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 4/90 (4%)

Query: 1  MAEDASAEATLALFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFED 60
          M++   A  T+AL G +   N+   ++ +++ +FE F + F K Y + EE A R AV+  
Sbjct: 1  MSKTILALGTIALIGALLMANQ--PQSVDYVSKFEAFKQRFGKRYGSTEE-AYRLAVYTQ 57

Query: 61 NLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          NL   E  N  + G   +G     DLT+EE
Sbjct: 58 NLLFAEAHNL-QKGKRVFGETIFFDLTQEE 86


>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
 gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
 gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
 gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
 gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.042,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.042,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|218199600|gb|EEC82027.1| hypothetical protein OsI_25996 [Oryza sativa Indica Group]
          Length = 709

 Score = 42.0 bits (97), Expect = 0.042,   Method: Composition-based stats.
 Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 1/63 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           QF  F+R   + Y   EE A+R  VF  NL      ++    TA +G+   SDLTREE +
Sbjct: 47  QFAAFVRRHGREYSGPEEYARRLRVFAANLAR-AAAHQALDPTARHGVTPFSDLTREEFE 105

Query: 93  SRL 95
           +RL
Sbjct: 106 ARL 108


>gi|443732032|gb|ELU16924.1| hypothetical protein CAPTEDRAFT_222012 [Capitella teleta]
          Length = 342

 Score = 42.0 bits (97), Expect = 0.043,   Method: Composition-based stats.
 Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 3/81 (3%)

Query: 23  LKTENPEHLKQFEKFIRDFSKSYPTKE-EVAKRFAVFEDNLKLIEDLN--KGEHGTATYG 79
           L+  N E    F KF   + K+Y     E   R  +F DN K    LN  +  + +A YG
Sbjct: 21  LRVSNEEIDDLFVKFTEKYHKTYLIGSLEYMHRRGIFRDNFKKHVALNSLRTNNASAWYG 80

Query: 80  INHLSDLTREEMKSRLGLNLS 100
           +   SDLT+EE  +R   N +
Sbjct: 81  VTQFSDLTQEEFTNRFLSNFT 101


>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.043,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.043,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.043,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|146335578|gb|ABQ23398.1| cathepsin L isotype 1 [Trypanoplasma borreli]
          Length = 443

 Score = 42.0 bits (97), Expect = 0.043,   Method: Composition-based stats.
 Identities = 20/61 (32%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F    +++Y +  E  KRF +F  N+K   +LN+ ++  AT+G N  +D++ EE ++
Sbjct: 25 FSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNR-KNPMATFGPNEFADMSSEEFQT 83

Query: 94 R 94
          R
Sbjct: 84 R 84


>gi|146335580|gb|ABQ23399.1| cathepsin L isotype 2 [Trypanoplasma borreli]
          Length = 443

 Score = 42.0 bits (97), Expect = 0.043,   Method: Composition-based stats.
 Identities = 20/61 (32%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          F  F    +++Y +  E  KRF +F  N+K   +LN+ ++  AT+G N  +D++ EE ++
Sbjct: 25 FSDFKATHARNYVSPGEERKRFEIFAANMKKAAELNR-KNPMATFGPNEFADMSSEEFQT 83

Query: 94 R 94
          R
Sbjct: 84 R 84


>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
 gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
          Length = 345

 Score = 42.0 bits (97), Expect = 0.043,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
 gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
 gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
 gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score = 42.0 bits (97), Expect = 0.043,   Method: Composition-based stats.
 Identities = 23/87 (26%), Positives = 46/87 (52%), Gaps = 3/87 (3%)

Query: 9  ATLALFGQMKSNNELKT--ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
          A L + G   S +  +T  + P + ++ E+++  + + Y    E A R+++F++N+  I+
Sbjct: 13 ALLFILGAWPSKSTARTLLDAPMY-ERHEQWMTQYGRVYKDDNERATRYSIFKENVARID 71

Query: 67 DLNKGEHGTATYGINHLSDLTREEMKS 93
            N     +   G+N  +DLT EE K+
Sbjct: 72 AFNSQTGKSYKLGVNQFADLTNEEFKA 98


>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
          Length = 365

 Score = 42.0 bits (97), Expect = 0.043,   Method: Composition-based stats.
 Identities = 24/66 (36%), Positives = 37/66 (56%), Gaps = 2/66 (3%)

Query: 29  EHLKQ-FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLT 87
           E +K+ +E ++    K Y    E  KRF +F+DNLK I++ N   H T   G+   +DLT
Sbjct: 39  EEVKEIYELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDEHNSENH-TYKMGLTPYTDLT 97

Query: 88  REEMKS 93
            EE ++
Sbjct: 98  NEEFQA 103


>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.043,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|146215998|gb|ABQ10201.1| cysteine protease Cp3 [Actinidia deliciosa]
          Length = 365

 Score = 42.0 bits (97), Expect = 0.043,   Method: Composition-based stats.
 Identities = 25/66 (37%), Positives = 37/66 (56%), Gaps = 2/66 (3%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM- 91
            F  F R F KSY T+E+   RF+VF+ NL+      + +  +A +G+   SDLT  E  
Sbjct: 49  HFRLFKRRFGKSYATQEDHDYRFSVFKTNLRRARHHQRLD-PSAVHGVTQFSDLTPAEFR 107

Query: 92  KSRLGL 97
           ++ LGL
Sbjct: 108 RNHLGL 113


>gi|4581057|gb|AAD24589.1|AF139913_1 cysteine protease [Trypanosoma congolense]
          Length = 440

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          +QF  F + +S+SY    E A RF VF+ N++  ++     +  AT+G+   SD++ EE 
Sbjct: 39 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 97

Query: 92 KS 93
          ++
Sbjct: 98 RA 99


>gi|408009|gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense]
          Length = 444

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          +QF  F + +S+SY    E A RF VF+ N++  ++     +  AT+G+   SD++ EE 
Sbjct: 39 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 97

Query: 92 KS 93
          ++
Sbjct: 98 RA 99


>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
 gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
 gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
 gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
 gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 447

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          +QF  F + +S+SY    E A RF VF+ N++  ++     +  AT+G+   SD++ EE 
Sbjct: 39 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 97

Query: 92 KS 93
          ++
Sbjct: 98 RA 99


>gi|343473370|emb|CCD14732.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          +QF  F + +S+SY    E A RF VF+ N++  ++     +  AT+G+   SD++ EE 
Sbjct: 39 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 97

Query: 92 KS 93
          ++
Sbjct: 98 RA 99


>gi|343472324|emb|CCD15484.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          +QF  F + +S+SY    E A RF VF+ N++  ++     +  AT+G+   SD++ EE 
Sbjct: 39 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 97

Query: 92 KS 93
          ++
Sbjct: 98 RA 99


>gi|343477225|emb|CCD11889.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 447

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          +QF  F + +S+SY    E A RF VF+ N++  ++     +  AT+G+   SD++ EE 
Sbjct: 39 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 97

Query: 92 KS 93
          ++
Sbjct: 98 RA 99


>gi|194352770|emb|CAQ00113.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 310

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 21/52 (40%), Positives = 29/52 (55%)

Query: 43 KSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR 94
          KSYP  +E  +RF V+  N++ IE  N+      T G N  +DLT EE  +R
Sbjct: 4  KSYPAVDEELRRFEVYRRNVERIEATNRDGGRGYTLGENQFTDLTSEEFLAR 55


>gi|1163075|emb|CAA81061.1| cysteine proteinase [Trypanosoma congolense]
          Length = 442

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          +QF  F + +S+SY    E A RF VF+ N++  ++     +  AT+G+   SD++ EE 
Sbjct: 34 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKE-EAAANPYATFGVTRFSDMSPEEF 92

Query: 92 KS 93
          ++
Sbjct: 93 RA 94


>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
 gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
 gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera]
          Length = 377

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 29/85 (34%), Positives = 43/85 (50%), Gaps = 3/85 (3%)

Query: 14  FGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEH 73
            G ++   E      +H   F  F R F KSY ++EE   RF VF+ NL+      + + 
Sbjct: 43  LGDVEGGEEENLLTADH-HHFSIFKRRFGKSYASQEEHDYRFKVFKANLRRARRHQQLD- 100

Query: 74  GTATYGINHLSDLTREEMK-SRLGL 97
            +AT+G+   SDLT  E + + LGL
Sbjct: 101 PSATHGVTQFSDLTPAEFRGTYLGL 125


>gi|326515420|dbj|BAK03623.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326522532|dbj|BAK07728.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 205

 Score = 42.0 bits (97), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 35/66 (53%), Gaps = 7/66 (10%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLI---EDLNKGEHGTATYGINHLSDLTRE 89
           QF  F+R   K Y   EE A+R  VF  N+      + L+ G    A +G+   SDLTRE
Sbjct: 49  QFAAFVRRHGKEYSGPEEYARRLRVFAANVARAAAHQALDPG----ARHGVTPFSDLTRE 104

Query: 90  EMKSRL 95
           E ++RL
Sbjct: 105 EFEARL 110


>gi|195583147|ref|XP_002081385.1| GD10988 [Drosophila simulans]
 gi|194193394|gb|EDX06970.1| GD10988 [Drosophila simulans]
          Length = 349

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 28/70 (40%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
           ++KF+ DF   Y  + E  KR  +F DN K I++ N + E G  ++  GIN  SDLT EE
Sbjct: 257 WKKFLIDFGAKYQDETETEKRRTIFCDNWKAIQEHNVQFELGVQSFKKGINQWSDLTVEE 316

Query: 91  MKSRLGLNLS 100
            K++   NL+
Sbjct: 317 WKTKQRPNLA 326



 Score = 39.3 bits (90), Expect = 0.28,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 42/76 (55%), Gaps = 5/76 (6%)

Query: 24  KTENPEHLKQ--FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY-- 78
           K EN + + Q  +EKF+ DF  +Y    E  KR  VF DN K I   N + + G  ++  
Sbjct: 150 KIENYDIICQAAWEKFLIDFKPTYQDDTETEKRRNVFCDNFKSIHKHNVQYDLGNISFKK 209

Query: 79  GINHLSDLTREEMKSR 94
           GIN  SDLT EE K++
Sbjct: 210 GINQWSDLTVEEWKNK 225


>gi|66475996|ref|XP_627814.1| cryptopain - cysteine proteinase secreted, possible transmembrane
           domain near N-terminus [Cryptosporidium parvum Iowa II]
 gi|32399065|emb|CAD98305.1| cryptopain precursor [Cryptosporidium parvum]
 gi|46229218|gb|EAK90067.1| cryptopain - cysteine proteinase secreted, possible transmembrane
           domain near N-terminus [Cryptosporidium parvum Iowa II]
 gi|76160841|gb|ABA40395.1| cryptopain-1 [Cryptosporidium parvum]
          Length = 401

 Score = 42.0 bits (97), Expect = 0.044,   Method: Composition-based stats.
 Identities = 20/67 (29%), Positives = 37/67 (55%), Gaps = 1/67 (1%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTR 88
           E+ K FE+F + + K Y + EE  +RF +++ N+  I+  N  +  +    +N   DL++
Sbjct: 81  EYRKSFEEFKKKYHKVYSSMEEENQRFEIYKQNMNFIKTTNS-QGFSYVLEMNEFGDLSK 139

Query: 89  EEMKSRL 95
           EE  +R 
Sbjct: 140 EEFMARF 146


>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.045,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.045,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score = 42.0 bits (97), Expect = 0.045,   Method: Composition-based stats.
 Identities = 19/59 (32%), Positives = 32/59 (54%)

Query: 32 KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          ++ E+++  + K Y   +E  KRF +F++N+  IE  N   +      IN  +DLT EE
Sbjct: 37 ERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEE 95


>gi|42564163|gb|AAS20593.1| digestive cysteine proteinase intestain [Leptinotarsa
          decemlineata]
          Length = 324

 Score = 42.0 bits (97), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 37/66 (56%), Gaps = 3/66 (4%)

Query: 33 QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--GINHLSDLTRE 89
          +++ F  +FSKSY    E  +RF +F  NL  IE+ N+    G +TY  G+N  +DLT E
Sbjct: 22 KWQNFKINFSKSYQNVVEEKRRFNIFLSNLLRIEEHNQNFSRGLSTYEMGVNKFADLTPE 81

Query: 90 EMKSRL 95
          E   R 
Sbjct: 82 EFMERF 87


>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
 gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
 gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.045,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.045,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
          Length = 345

 Score = 42.0 bits (97), Expect = 0.045,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.045,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.045,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.045,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 356

 Score = 42.0 bits (97), Expect = 0.045,   Method: Composition-based stats.
 Identities = 21/63 (33%), Positives = 38/63 (60%), Gaps = 1/63 (1%)

Query: 35  EKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE-MKS 93
           E+++  F + Y   +E A+R  VF  N + ++ +N+  + T T G+N  SDLT +E +++
Sbjct: 40  EEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFVQT 99

Query: 94  RLG 96
            LG
Sbjct: 100 HLG 102


>gi|21430502|gb|AAM50929.1| LP08365p [Drosophila melanogaster]
          Length = 432

 Score = 42.0 bits (97), Expect = 0.045,   Method: Composition-based stats.
 Identities = 28/70 (40%), Positives = 42/70 (60%), Gaps = 3/70 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKG-EHGTATY--GINHLSDLTREE 90
           ++KF+ DF   Y  ++E  KR  +F DN K I++ N+  E G  ++  GIN  SDLT EE
Sbjct: 347 WKKFLIDFGAKYQDEKETEKRRTIFCDNWKAIQEHNEQFELGVESFKKGINQWSDLTVEE 406

Query: 91  MKSRLGLNLS 100
            K++   NL+
Sbjct: 407 WKTKQRPNLA 416



 Score = 40.8 bits (94), Expect = 0.094,   Method: Composition-based stats.
 Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 3/79 (3%)

Query: 19  SNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTAT 77
           S +E+  +N      +EKF+ DF  SY    E  KR  VF DN K I   N + + G  +
Sbjct: 237 STSEIDNDNIICQPAWEKFLIDFKPSYQDDTETEKRRNVFCDNFKSIHKHNVQFDLGNIS 296

Query: 78  Y--GINHLSDLTREEMKSR 94
           +  GIN  SDLT EE K++
Sbjct: 297 FKKGINQWSDLTVEEWKNK 315


>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.046,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score = 42.0 bits (97), Expect = 0.046,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 32  KQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
           ++ E ++    + Y  + E  +RF +F++N+K IE +NK  + +   G+N  +D+T +E 
Sbjct: 37  ERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEF 96

Query: 92  KSRL-GLNL 99
            ++  GLN+
Sbjct: 97  LAKFTGLNI 105


>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
          Length = 300

 Score = 42.0 bits (97), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 37/59 (62%), Gaps = 2/59 (3%)

Query: 42 SKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSR-LGLNL 99
           KSY + EE   RF VF+DNLK I++ NK +  +   G+N  +DL+ EE K + LGL +
Sbjct: 5  GKSYRSFEEKLHRFEVFQDNLKHIDETNK-KVSSYWLGLNEFADLSHEEFKRKYLGLKI 62


>gi|195334170|ref|XP_002033757.1| GM21494 [Drosophila sechellia]
 gi|194125727|gb|EDW47770.1| GM21494 [Drosophila sechellia]
          Length = 427

 Score = 42.0 bits (97), Expect = 0.046,   Method: Composition-based stats.
 Identities = 31/80 (38%), Positives = 45/80 (56%), Gaps = 4/80 (5%)

Query: 24  KTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GI 80
           K +NP     ++KF+ DF   Y  + E  KR  +F DN K I++ N + E G  ++  GI
Sbjct: 338 KDDNPCQ-AAWKKFLVDFGVKYQDETETEKRRTIFCDNWKAIQEHNVQFELGVESFKKGI 396

Query: 81  NHLSDLTREEMKSRLGLNLS 100
           N  SDLT EE K++   NL+
Sbjct: 397 NQWSDLTVEEWKTKQRPNLA 416



 Score = 38.9 bits (89), Expect = 0.38,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 36/64 (56%), Gaps = 3/64 (4%)

Query: 34  FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN-KGEHGTATY--GINHLSDLTREE 90
           +EKF+ DF  +Y    E  KR  VF DN K I   N + + G  ++  GIN  SDLT EE
Sbjct: 252 WEKFLIDFKPTYQDHTETEKRRNVFCDNFKSIHKHNVEFDLGNISFKKGINQWSDLTVEE 311

Query: 91  MKSR 94
            K++
Sbjct: 312 WKNK 315


>gi|261328619|emb|CBH11597.1| cysteine peptidase precursor, (fragment) [Trypanosoma brucei
           gambiense DAL972]
          Length = 201

 Score = 42.0 bits (97), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 33  QFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           +F  F + + K Y   +E A RF  FE+N++  + +    +  AT+G+   SD+TREE +
Sbjct: 40  RFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAK-IQAAANPYATFGVTPFSDMTREEFR 98

Query: 93  SR 94
           +R
Sbjct: 99  AR 100


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.311    0.128    0.347 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,444,504,571
Number of Sequences: 23463169
Number of extensions: 51466450
Number of successful extensions: 157379
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 895
Number of HSP's successfully gapped in prelim test: 1195
Number of HSP's that attempted gapping in prelim test: 155848
Number of HSP's gapped (non-prelim): 2141
length of query: 102
length of database: 8,064,228,071
effective HSP length: 71
effective length of query: 31
effective length of database: 6,398,343,072
effective search space: 198348635232
effective search space used: 198348635232
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 69 (31.2 bits)